Latest Web Technology News and Web Technologies June 17 2003, the latest breaking New York Web design news brought to you by,
Web Designs Now,Website Designs Now,New York Web Design Homepage,Web Design Services for New York, Connecticut, Long Island,New York Web Design Client Testimonials,Website Portfolio of New York Web Design, About this New York Web Design Firm,Contact this New York Web Design Firm

XML & Unicode: Mix with Care
Latest Web Technology & Web Design News, June 17, 2003

XML & Unicode: Mix with Care
Yahoo! & BT Team
Internet 6000X Faster?
Bugbear.B Continues Rampage
Experts Warn of Worm Variant

More Web Design News:
2011 Latest Web Technology News
2011 April
2011 March
2010 December
2009 April
2008 November
2008 October
2008 July
2008 June
2007 June
2007 May
2007 March
2006 November
2006 September
2006 August
2006 July
2006 June
2006 May
2006 April
2006 March
2006 February
2006 January
2005 December
2005 November
2005 October
2005 September
2005 August
2005 July
2005 June
2005 May
2005 April
2005 March
2005 February
2004 March
2004 February
2004 January
2003 December
2003 November
2003 October
2003 September
2003 August
2003 July
2003 June
2003 March - May



June 17, 2003

The character set that lets computers write in every language from Czech to Chinese could make Web browsers tongue-tied, two standards groups warned on Friday.
By Paul Festa

Published by the Unicode Consortium, Unicode is a standard character set for computers that aims to assign a number for every character in every written language. XML (Extensible Markup Language), a World Wide Web Consortium (W3C) recommendation for marking up digital documents and creating new markup languages for specific tasks or industries, relies on Unicode and closely tracks its revisions.

But a technical report released by the Unicode Consortium--and simultaneously published as a note by the W3C's internationalization activity--warns document authors that some Unicode features are going to cause XML applications, HTML browsers, and other programs to choke.

Conflict arises between Unicode and Web markup languages from the fundamentally different philosophies that underlie the character set and Web standards. While Unicode produces a one-for-one, linear correspondence for every character on the page, XML and its Web-based relatives are more flexible in that they let authors assign different style and functional attributes to a single character, word or page.

For example, Unicode provides what's called "compatibility characters," separate numbers to designate superscript or subscript numerals or letters. With HTML or XML, by contrast, the author would use the basic character and then style it as superscript or subscript.

All things being equal, the W3C advises authors to use the markup alternatives.

Compatibility characters are "just not the long-term, sound way to do things," said Martin Duerst, the W3C's internationalization activity lead and a visiting scientist at the Massachusetts Institute of Technology's Laboratory for Computer Science. "We're urging authors to use Unicode in a responsible and adequate way when it's used with XML."

Many times, authors know that their Unicode is destined to be read by Web browsers and other XML applications. But some of the conflicts crop up as a surprise when XML applications are fed information from older databases and information repositories.

That's when applications that are designed for markup languages start stuttering on characters that designate things like vertical tabulators, tab feeds and other controls.

"In the report we go through a lot of different kinds of characters that, in one way or another, may make sense in a legacy system or in plain text, but once you have markup at your disposition, you can use structure," Duerst said. "You want to use structure instead of a character, a number. If you're using XML, use what XML makes available. Control character stuff really doesn't work."

The fourth version of Unicode will be out in book form later this year. Prepublication versions of Unicode 4.0 are available online now.

Web Designs Now
Back to the Top


 © Copyright 2011, All rights reserved  |  Privacy Web Design Forums  |  Web Design News  |  Advertise  |  About Us  |  Contact Us  |  W3C HTML 
 Related Websites: New-York-WebDesign.com