EncodingUtf8 (TWiki Codev web, 2003-09-06, RichardDonkin)
What about supporting UTF-8 encoding by default? This should include:

  • regexp/search ability
  • topic names (wiki words)
  • RCS/diff

This seems to be the only way to complete the Internationalization of TWiki.

-- MaximRomashchenko - 15 May 2003

Agreed, this is definitely the next step. See InternationalisationIssues for one small step (EncodeURLsWithUTF8, i.e. encoding URIs so that people can leave browsers in UTF-8 URL encoding mode), and InternationalisationUTF8 for some discussion. I'm a bit short of time for TWiki coding at the moment, so contributions are welcome as always.
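To make the URL encoding step concrete, here is a minimal sketch in Python (not TWiki's own Perl code) of how a non-ASCII topic name could be percent-encoded as UTF-8 for use in a URI and decoded again. The helper names and the sample topic name are hypothetical, chosen only for illustration.

```python
from urllib.parse import quote, unquote


def encode_topic_for_url(topic: str) -> str:
    """Percent-encode a topic name using its UTF-8 bytes (hypothetical helper)."""
    # safe="" means every reserved character is escaped as well.
    return quote(topic, safe="", encoding="utf-8")


def decode_topic_from_url(escaped: str) -> str:
    """Reverse the encoding; raises UnicodeDecodeError if the bytes are not valid UTF-8."""
    return unquote(escaped, encoding="utf-8", errors="strict")


if __name__ == "__main__":
    escaped = encode_topic_for_url("Тема")  # hypothetical Cyrillic topic name
    print(escaped)
    print(decode_topic_from_url(escaped))
```

Decoding with errors="strict" is one way a server could notice that a browser sent a URL in some other 8-bit encoding rather than UTF-8, although how TWiki should react in that case is left open here.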

Searching and topic names would be an extension of the current 8-bit support. RCS and diff should 'just work' since UTF-8 is a byte-oriented encoding that 8-bit-clean tools can store and compare unchanged, but if not we might need to require specific versions. RCS and diff don't have any problem with filenames having the 8th bit set as long as you use the versions in TWikiSystemRequirements - see CyrillicSupport for some examples.
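The reasoning behind 'just work' can be checked with a small sketch, again in Python rather than Perl and with hypothetical function names. It verifies two properties of UTF-8: every byte of a non-ASCII character has the 8th bit set, and the newline byte never occurs inside a multi-byte sequence, so a line-oriented, 8-bit-clean tool sees the same line boundaries as the text itself. It says nothing about how any particular RCS or diff version actually behaves.

```python
from typing import List


def high_bit_set_for_non_ascii(text: str) -> bool:
    """True if every UTF-8 byte of each non-ASCII character is >= 0x80."""
    for ch in text:
        if ord(ch) > 0x7F:
            if any(b < 0x80 for b in ch.encode("utf-8")):
                return False
    return True


def split_lines_bytewise(text: str) -> List[bytes]:
    """Split the UTF-8 bytes on the newline byte, as a byte-oriented tool would."""
    return text.encode("utf-8").split(b"\n")


if __name__ == "__main__":
    sample = "Привет\nмир\nplain ascii"  # illustrative sample text
    print(high_bit_set_for_non_ascii(sample))
    # Splitting the bytes gives the same lines as splitting the text first.
    print(split_lines_bytewise(sample) == [l.encode("utf-8") for l in sample.split("\n")])
```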

Perl 5.8 has some UTF-8 issues still, and making UTF-8 work on this and Perl 5.6.1 would be quite a challenge. The existing 8-bit character support should be preserved since a lot of people are still running Perl 5.005 on older Linux boxes, Solaris 8, etc.

-- RichardDonkin - 15 May 2003  

Topic revision: r3 - 2003-09-06 - 09:01:42 - RichardDonkin
 
