Multilanguage support

Jun 14, 2010 at 10:50 PM

Where can I find what languages are supported by AntiXss module and by the library?

Coordinator
Jun 14, 2010 at 10:55 PM

If by support you mean does not encode to unicode points then right now your best bet is the code; in the AntiXSS source you'll see exclusions for various known languages. This will be expanded in the next release.

                    (i >= 256 && i <= 591) ||       // Latin,Extended-A,Latin Extended-B
                    (i >= 880 && i <= 2047) || // Greek and Coptic,Cyrillic,Cyrillic Supplement,Armenian,Hebrew,Arabic,Syriac,Arabic,Supplement,Thaana,NKo
                    (i >= 2304 && i <= 6319) ||     // Devanagari,Bengali,Gurmukhi,Gujarati,Oriya,Tamil,Telugu,Kannada,Malayalam,Sinhala,Thai,Lao,Tibetan,Myanmar,eorgian,Hangul Jamo,Ethiopic,Ethiopic Supplement,Cherokee,Unified Canadian Aboriginal Syllabics,Ogham,Runic,Tagalog,Hanunoo,Buhid,Tagbanwa,Khmer,Mongolian  
                    (i >= 6400 && i <= 6687) ||     // Limbu, Tai Le, New Tai Lue, Khmer, Symbols, Buginese
                    (i >= 6912 && i <= 7039) ||     // Balinese        
                    (i >= 7680 && i <= 8191) ||     // Latin Extended Additional, Greek Extended       
                    (i >= 11264 && i <= 11743) ||   // Glagolitic, Latin Extended-C, Coptic, Georgian Supplement, Tifinagh, Ethiopic Extended   
                    (i >= 12352 && i <= 12591) ||   // Hiragana, Katakana, Bopomofo      
                    (i >= 12688 && i <= 12735) ||   // Kanbun, Bopomofo Extended       
                    (i >= 12784 && i <= 12799) ||   // Katakana, Phonetic Extensions        
                    (i >= 40960 && i <= 42191) ||   // Yi Syllables, Yi Radicals       
                    (i >= 42784 && i <= 43055) ||   // Latin Extended-D, Syloti, Nagri       
                    (i >= 43072 && i <= 43135) ||   // Phags-pa        
                    (i >= 44032 && i <= 55215) ||   // Hangul Syllables
                    (i >= 19968 && i <= 40899)      // Mixed japanese/chinese/korean

Note that it's not languages, but their unicode code pages.