2012-07-23 05:17:59 +02:00
|
|
|
ක
|
|
|
|
කං
|
|
|
|
කඃ
|
|
|
|
කා
|
|
|
|
කැ
|
|
|
|
කෑ
|
|
|
|
කි
|
|
|
|
කී
|
|
|
|
කු
|
|
|
|
කූ
|
|
|
|
කෘ
|
|
|
|
කෲ
|
|
|
|
කෟ
|
|
|
|
කෳ
|
|
|
|
කෙ
|
2012-01-21 02:48:14 +01:00
|
|
|
කො
|
|
|
|
කෞ
|
2012-07-23 05:17:59 +02:00
|
|
|
කේ
|
2012-01-21 02:48:14 +01:00
|
|
|
කේ
|
2012-07-23 05:17:59 +02:00
|
|
|
කෛ
|
|
|
|
කො
|
2012-01-21 02:48:14 +01:00
|
|
|
කෝ
|
|
|
|
කෝ
|
2012-07-23 05:17:59 +02:00
|
|
|
කෞ
|
|
|
|
ක්
|
|
|
|
ක්ය
|
|
|
|
ක්ර
|
2012-07-19 05:25:58 +02:00
|
|
|
ක්රම
|
2012-07-23 05:17:59 +02:00
|
|
|
ර්ම
|
2012-07-19 05:25:58 +02:00
|
|
|
ශී්ර
|
2012-07-23 05:17:59 +02:00
|
|
|
ස්ට්රේ
|
2012-07-24 02:07:50 +02:00
|
|
|
ග්යෙ
|
2012-07-24 05:51:29 +02:00
|
|
|
ර්ය්ය
|
2012-07-24 06:09:12 +02:00
|
|
|
එඬේ
|
[Indic] Further adjust base algorithm for Sinhala
Apparently if there is C,V,ZWJ,C, the first C will be base, but if
it's C,ZWJ,V,C, the second one will be.
Note that Uniscribe implements this differently, by breaking syllable in
the case of C,ZWJ,V,C and putting the first consonant in one syllable
and the rest in the next syllable.
Sinhala failures down from 208 to 158 (0.0581209%). No changes to
Khmer.
2012-07-24 06:21:16 +02:00
|
|
|
න්ගේ
|
|
|
|
න්ගේ
|
|
|
|
න්ගේ
|
2012-07-24 06:26:43 +02:00
|
|
|
ර්
|
2012-09-07 20:55:07 +02:00
|
|
|
ක්රා
|
2012-11-14 19:56:02 +01:00
|
|
|
කේ
|
[indic] Don't apply presentation features across syllables
More like Uniscribe... We still allow user-defined features to
work across syllables, but not pres,blws,abs,psts,etc.
This "regressed" Sinhala numbers by 11. These are cases were
there's Consonant followed by Ra,Halant,ZWJ at the of text.
The Ra,Halant,ZWJ ends up forming reph, which is wrong...
But before we were also ligating that reph with the previous
consonant. That's even more wrong. That's also what Uniscribe
does.
Current numbers:
BENGALI: 353732 out of 354188 tests passed. 456 failed (0.128745%)
DEVANAGARI: 707307 out of 707394 tests passed. 87 failed (0.0122987%)
GUJARATI: 366349 out of 366457 tests passed. 108 failed (0.0294714%)
GURMUKHI: 60732 out of 60747 tests passed. 15 failed (0.0246926%)
KANNADA: 951030 out of 951913 tests passed. 883 failed (0.0927606%)
KHMER: 299070 out of 299124 tests passed. 54 failed (0.0180527%)
MALAYALAM: 1048140 out of 1048334 tests passed. 194 failed (0.0185056%)
ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%)
SINHALA: 271655 out of 271847 tests passed. 192 failed (0.070628%)
TAMIL: 1091753 out of 1091754 tests passed. 1 failed (9.15957e-05%)
TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%)
2013-10-15 13:47:27 +02:00
|
|
|
ගර්
|