harfbuzz

Commit Graph

Author	SHA1	Message	Date
Behdad Esfahbod	63db692fa9	[indic] Fix shaping of U+0AFB GUJARATI SIGN SHADDA Fixes https://github.com/behdad/harfbuzz/issues/552	2017-10-03 16:00:18 +02:00
Behdad Esfahbod	f559c63307	[indic] Implement Consonant_With_Stacker Fixes https://github.com/behdad/harfbuzz/issues/528	2017-10-03 15:20:07 +02:00
Behdad Esfahbod	06cb162cd7	[indic] Treat Consonant_With_Stacker as consonant Fixes https://github.com/behdad/harfbuzz/issues/528 "Kannada JIHVAMULIYA and UPADHMANIYA insert dotted circles"	2017-09-01 10:34:56 -07:00
Behdad Esfahbod	3cc84f45b9	[indic] Fix https://github.com/behdad/harfbuzz/issues/478	2017-07-14 15:50:22 +01:00
Behdad Esfahbod	8b5d6e755b	[indic] Remove unused Javanese bits	2016-05-06 15:59:27 +01:00
Behdad Esfahbod	2813e3049a	[indic] Update data tables to Unicode 8.0. Test stats remain unchanged, except for Malayalam, which we investigate: BENGALI: 353725 out of 354188 tests passed, 463 failed (0.130722%); DEVANAGARI: 707307 out of 707394 tests passed, 87 failed (0.0122987%); GUJARATI: 366349 out of 366457 tests passed, 108 failed (0.0294714%); GURMUKHI: 60732 out of 60747 tests passed, 15 failed (0.0246926%); KANNADA: 951190 out of 951913 tests passed, 723 failed (0.0759523%); KHMER: 299070 out of 299124 tests passed, 54 failed (0.0180527%); MALAYALAM: 1047584 out of 1048334 tests passed, 750 failed (0.0715421%); ORIYA: 42320 out of 42329 tests passed, 9 failed (0.021262%); SINHALA: 271662 out of 271847 tests passed, 185 failed (0.068053%); TAMIL: 1091753 out of 1091754 tests passed, 1 failed (9.15957e-05%); TELUGU: 970555 out of 970573 tests passed, 18 failed (0.00185457%). Myanmar, compared to Windows 10 mmrtext.ttf: MYANMAR: 1123865 out of 1123883 tests passed, 18 failed (0.00160159%).	2015-12-18 11:05:11 +00:00
Behdad Esfahbod	d6adca9fbb	Remove unused macro ASSERT_STATIC_EXPR()	2015-07-21 15:17:27 +01:00
Behdad Esfahbod	d743ce78e1	[indic-table] Update to Unicode 7.0 data Touch code just enough to preserve previous syllable structure and functionality as closely as possible. Many further cleanups coming later.	2014-06-30 15:24:45 -04:00
Behdad Esfahbod	5c4e3e9a57	Whitespace	2014-06-30 14:25:18 -04:00
Behdad Esfahbod	cf78dd483c	[indic/myanmar] Rename OT_NBSP to OT_PLACEHOLDER	2014-05-27 17:53:37 -04:00
Behdad Esfahbod	cf71d28c38	[indic/myanmar] Refactor a few macros	2014-05-27 17:47:43 -04:00
Behdad Esfahbod	9f9bd9bf31	[indic] Rename avagraha cluster to symbol cluster In anticipation of adding more characters to that class of clusters.	2014-05-23 15:35:38 -04:00
Behdad Esfahbod	6e613f3365	Fix "shift count >= width of type" issue	2013-10-23 23:34:13 +02:00
Behdad Esfahbod	c16012e901	[indic] Add Javanese support! Seems to be working just fine!	2013-10-18 18:17:29 +02:00
Behdad Esfahbod	3756efaf4e	[indic] Misc harmless fixes! First, we were abusing OT_VD instead of OT_A. Fix that by moving OT_A in the grammar where it belongs (which is different from what the spec says). Also, only allow medial consonants after all other consonants. This doesn't affect any current character. Finally, fix Halant attachment in presence of medial consonants. Again, this currently doesn't affect any sequence. I lied. There's Gurmukhi U+0A75 which is Consonant_Medial. Uniscribe allows one of those in each of these positions: before matras, after matras and before syllable modifiers, and after syllable modifiers! We currently just allow unlimited numbers of it, before matras.	2013-10-16 19:06:29 +02:00
Behdad Esfahbod	3c7b3641cf	[indic] Handle Avagraha It can come either at the end(ish!) of the syllable, or independently. When independent, it accepts a few bits and pieces.	2013-10-15 13:14:31 +02:00
Behdad Esfahbod	9e1f80ab3e	[SEA] Treat Consonant_Final like Consonant_Medial	2013-02-12 15:28:21 -05:00
Behdad Esfahbod	3a83d33ec0	Add South-East Asian shaper Handles Tai Tham, Cham, and New Tai Lue for now.	2013-02-12 12:14:10 -05:00
Behdad Esfahbod	e67072bb17	[Indic] Handle overstruck matra position	2012-11-14 15:00:53 -08:00
Behdad Esfahbod	9cac1338c4	[Indic] Allow Consonant_Medial's after Consonant's Mostly affects Myanmar, but also Tai Tham, Javanese, and Cham. The latter three are untested (no fonts!).	2012-11-12 18:41:22 -08:00
Behdad Esfahbod	d187099cba	[Indic] Categorize Myanmar "tone marks" as nuktas	2012-11-12 18:38:06 -08:00
Behdad Esfahbod	dba186711e	[Indic] Make more room in the table To be used in upcoming commits.	2012-11-12 14:48:33 -08:00
Behdad Esfahbod	b85800f9de	[Indic] Implement dotted-circle insertion for broken clusters No panic, we really insert dotted circle when it's absolutely broken. Fixes most of the dotted-circle cases against Uniscribe. (for Devanagari fixes 80% of them, for Khmer 70%; the rest look like Uniscribe being really bogus...) I had to make a decision. Apparently Uniscribe adds one dotted circle to each broken character. I tried that, but that goes wrong easily with split matras. So I made it add only one dotted circle to an entire broken syllable tail. As in: "if there was a dotted circle here, this would have formed a correct cluster." That works better for split stuff, and I like it more.	2012-08-31 19:18:20 -04:00
Behdad Esfahbod	cd0c6e148f	Shuffle buffer variable allocations around To make room for more allocations, coming.	2012-08-09 21:48:55 -04:00
Behdad Esfahbod	11b0e20ba4	[Indic] Add per-script configuration tables This concludes the Indic shape_plan work. May do for Arabic also...	2012-08-02 14:21:40 -04:00
Behdad Esfahbod	3eb6f81fd3	[Indic] Refactor Move all the logic that needs to eventually move into the indic table into hb-ot-shape-complex-indic-private.hh.	2012-08-02 07:38:39 -04:00
Behdad Esfahbod	1d002048d5	[Indic] Minor	2012-08-02 05:02:53 -04:00
Behdad Esfahbod	2ec934c6c2	[Indic] Change "unknown" position to end of syllable	2012-07-23 23:49:04 -04:00
Behdad Esfahbod	81202bd860	[Indic] Don't attach SM/VD to other characters	2012-07-20 15:14:51 -04:00
Behdad Esfahbod	f31d97e44e	[Indic] Form Telugu Reph out of Ra,Virama,ZWJ Apparently this was approved in Feb 2012. No font yet.	2012-07-20 14:13:35 -04:00
Behdad Esfahbod	f055442716	[Indic] Lookup consonant position in the font Fixes most failures of Oriya, and improves others a bit.	2012-07-19 16:20:21 -04:00
Behdad Esfahbod	8c973ebf0f	[Indic] Implement per-script matra positioning Following what the spec says. Brings down Telugu failures from 40% to 3.75%, and Kannada failures from 44% to 10%. Does NOT affect other scripts' test results.	2012-07-19 13:25:08 -04:00
Behdad Esfahbod	8bb32458f9	[Indic] More refactoring	2012-07-19 13:04:44 -04:00
Behdad Esfahbod	f83aaa3133	[Indic] Minor	2012-07-19 12:23:23 -04:00
Behdad Esfahbod	be8b9f5f71	[Indic] Start refactoring different matra positions per script	2012-07-19 12:11:12 -04:00
Behdad Esfahbod	3285e107c9	[Indic] Implement Sinhala "Al Lakuna" Reph behavior In Sinhala, Reph is formed only explicitly, by the presence of a ZWJ.	2012-07-18 17:22:14 -04:00
Behdad Esfahbod	e092c556fb	[Indic] Minor	2012-07-18 14:09:25 -04:00
Behdad Esfahbod	db8981f1e0	[Indic] Position Khmer Robat It's a visual Repha. Still not positioning logical Repha as occurs in Malayalam. Another 200 Khmer failures fixed. 547 to go. That's better than Devanagari!	2012-07-17 23:42:04 -04:00
Behdad Esfahbod	25bc489498	[Indic] Better categorize Register Shifters and Khmer Various signs Down another 500 or so Khmer failures!	2012-07-17 17:53:03 -04:00
Behdad Esfahbod	55f70ebfb9	[Indic] Position final subjoined consonants (and vowels) after matras In Khmer, a final subjoined consonant or independent vowel can occur after matras. This final subjoined thing should NOT be reordered to before the matra even though it's subjoined. Fixes another 1k of the Khmer failures. Not much left really.	2012-07-17 12:50:13 -04:00
Behdad Esfahbod	deb521dee4	[Indic] Add a separate Coeng class No characters recategorized yet. No semantic change.	2012-07-17 11:37:32 -04:00
Behdad Esfahbod	7d09c98a1f	[Indic] Recognize Register Shifter marks Fixes another 6% of the Khmer failures.	2012-07-16 16:45:22 -04:00
Behdad Esfahbod	fcdc5f1c88	[Indic] Categorize Khmer Ro Khmer failures down from 58% to 36%.	2012-07-16 15:52:54 -04:00
Behdad Esfahbod	8aa801a6fd	[Indic] Adjust position for split matras We are going to split matras without a Unicode decomposition in a way that the second half takes the codepoint of the whole matra. So, position them where the second half is supposed to end up.	2012-07-16 13:24:26 -04:00
Behdad Esfahbod	f7e8dcfd4f	[Indic] Unbreak Devanagari And this, concludes the HarfBuzz Massala Hackfest. I like to specially thank Jonathan Kew for doing all the decription and letting me get commit points.	2012-05-11 22:01:33 +02:00
Behdad Esfahbod	6a091df9b4	[Indic] Disambiguate sub vs post vs above matras Bengali is at just above 5% now.	2012-05-11 21:42:27 +02:00
Behdad Esfahbod	18c06e189b	[Indic] Add Uniscribe bug feature for dotted circle For dotted-circle independent clusters, Uniscribe does no Reph shaping for the exact sequence Ra+Halant+25CC. Which also is the only possible sequence with 25CC at the end.	2012-05-11 20:02:14 +02:00
Behdad Esfahbod	cee7187447	[Indic] Move syllable tracking from Indic to generic layer This is to incorporate it into GSUB/GPOS processing.	2012-05-11 11:41:39 +02:00
Behdad Esfahbod	74e54cf446	[Indic] Add Ra back for scripts without Reph We now check that the 'rphf' feature exists before forming Reph, so we don't need to comment out Ra for those scripts.	2012-05-10 21:22:58 +02:00
Behdad Esfahbod	dbb105883c	[Indic] Do Reph repositioning in final reordering like the spec says This introduced a failure, which we tracked down to a test case like this: U+092E,U+094B,U+094D,U+0930 The final character is a Ra that should be put in a syllable of its own. And we do. But it will interact with the Halant before it. So now we finally are convinced that we have to limit features to syllable boundaries. That's coming after lunch!	2012-05-10 13:45:52 +02:00


58 Commits