What about supporting UTF-8 encoding by default?
This should include:
- regexp/search ability
- topic names (wiki words)
- RCS/diff
This seems to be the only way to complete the Internationalization of TWiki.
--
MaximRomashchenko - 15 May 2003
Agreed, this is definitely the next step. See
InternationalisationIssues for one small step (
EncodeURLsWithUTF8, i.e. encoding URIs so that people can leave browers in UTF-8 URL encoding mode), and
InternationalisationUTF8 for some discussion. I'm a bit short of time for TWiki coding at the moment, so contributions are welcome as always.
Searching and topic names would be an extension of the current 8-bit support.
RCS and diff should 'just work' since UTF-8 is an 8-bit format, but if not we might need to require specific versions. They don't have any problem with filenames having the 8th bit set as long as you use the versions in
TWikiSystemRequirements - see
CyrillicSupport for some examples.
Perl 5.8 has some UTF-8 issues still, and making UTF-8 work on this and Perl 5.6.1 would be quite a challenge. The existing 8-bit character support should be preserved since a lot of people are still running Perl 5.005 on older Linux boxes, Solaris 8, etc.
--
RichardDonkin - 15 May 2003