The attached perl script should be added to your CGI bin directory and run to test character encodings. You can use the script to generate a range of characters in your chosen encoding. A textarea then shows you exactly what those characters will look like in a TWiki "raw" edit. You can then run tests to push the characters through the same transforms that the TinyMCEPlugin and WysiwygPlugin perform, and check that the results are identical to the input.
If you have even the most passing of interests in international characters, you should run the test script and satisfy yourself that your chosen characters are handled correctly.
Caveat emptor.
When you have run a test, please add the details below:
Encoding |
Code points |
FF2 |
IE7 |
IE6 |
Saf |
Comments |
iso-8859-1 |
33..126 |
Pass |
|
|
Pass |
Safari does not get "gt" for the curl on 62 in test 2 |
iso-8859-1 |
126..255 |
Pass |
Pass |
Pass |
Pass |
IE6 is incredibly slow, but gets there eventually |
iso-8859-15 |
33..126 |
Pass |
|
|
Pass |
Safari does not get "gt" for the curl on 62 in test 2 |
iso-8859-15 |
160..255 |
Pass |
Pass |
Pass |
Pass |
|
utf8 |
33..126 |
Pass |
|
|
Pass |
Safari does not get "gt" for the curl on 62 in test 2 |
utf8 |
126..512 |
Pass |
Fail |
Fail |
Pass |
Garbled characters in IE |
koi8-r |
1040..1060 |
Pass |
Pass |
Pass |
Pass |
|
utf-8 |
126..512 |
Pass |
Pass |
Pass |
Pass |
|
utf-8 |
13056-14079 |
Pass |
Pass |
|
Pass |
CJK Compatibility |
utf-8 |
19968-20991 |
Pass |
|
|
|
CJK Unified Ideographs |
utf-8 |
63744-64767 |
Pass |
|
|
|
CJK Compatibility Ideographs |
UTF-16 |
33..126 |
Fail |
|
|
Pass |
FF gets wrong input from the start. Safari does not get "gt" for the curl on 62 in test 2 |