harfbuzz

Commit Graph

Author	SHA1	Message	Date
Behdad Esfahbod	e2b878055b	Disable OTL processing for Hebrew if GPOS doesn't have Hebrew subtable New approach to fix this: `69f9fbc420` Previous approach was reverted as it was too broad. See context: https://github.com/behdad/harfbuzz/issues/347#issuecomment-267838368 With U+05E9,U+05B8,U+05C1,U+05DC and Arial Unicode, we now (correctly) disable GDEF and GPOS, so we get results very close to Uniscribe, but slightly different since our fallback position logic is not exactly the same: Before: [gid1166=3+991|gid1142=0+737|gid5798=0+1434] After: [gid1166=3+991|gid1142=0@402,-26+0|gid5798=0+1434] Uniscribe: [gid1166=3+991|gid1142=0@348,0+0|gid5798=0+1434]	2016-12-22 14:43:23 -06:00
Behdad Esfahbod	a2b03de5b3	[myanmar] Handle U+AA74..U+AA76 Fixes https://github.com/behdad/harfbuzz/issues/218	2016-05-06 17:56:07 +01:00
Behdad Esfahbod	8b5bc141cd	Add get_nominal_glyph() and get_variation_glyph() instead of get_glyph() New API: - hb_font_get_nominal_glyph_func_t - hb_font_get_variation_glyph_func_t - hb_font_funcs_set_nominal_glyph_func() - hb_font_funcs_set_variation_glyph_func() - hb_font_get_nominal_glyph() - hb_font_get_variation_glyph() Deprecated API: - hb_font_get_glyph_func_t - hb_font_funcs_set_glyph_func() Clients that implement their own font-funcs are encouraged to replace their get_glyph() implementation with a get_nominal_glyph() and get_variation_glyph() pair. The variation version can assume that variation_selector argument is not zero.	2016-02-24 19:05:23 +09:00
Behdad Esfahbod	b894a85ad1	Fix more hangs in case of buffer allocation errors Hopefully fixes https://github.com/behdad/harfbuzz/issues/214	2016-02-02 16:39:19 +08:00
Behdad Esfahbod	2813e3049a	[indic] Update data tables to Unicode 8.0 Test stats remain unchanged, except for Malayalam, which we investigate: BENGALI: 353725 out of 354188 tests passed. 463 failed (0.130722%) DEVANAGARI: 707307 out of 707394 tests passed. 87 failed (0.0122987%) GUJARATI: 366349 out of 366457 tests passed. 108 failed (0.0294714%) GURMUKHI: 60732 out of 60747 tests passed. 15 failed (0.0246926%) KANNADA: 951190 out of 951913 tests passed. 723 failed (0.0759523%) KHMER: 299070 out of 299124 tests passed. 54 failed (0.0180527%) MALAYALAM: 1047584 out of 1048334 tests passed. 750 failed (0.0715421%) ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%) SINHALA: 271662 out of 271847 tests passed. 185 failed (0.068053%) TAMIL: 1091753 out of 1091754 tests passed. 1 failed (9.15957e-05%) TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%) Myanmar, compared to Windows 10 mmrtext.ttf: MYANMAR: 1123865 out of 1123883 tests passed. 18 failed (0.00160159%)	2015-12-18 11:05:11 +00:00
Behdad Esfahbod	136863371c	Add new shaper method postprocess_glyphs() Unused currently. To be used for Syriac stretch implementation. https://github.com/behdad/harfbuzz/issues/141	2015-11-05 13:24:15 -08:00
Behdad Esfahbod	6f932bc8f9	Fix a few more -Wshadow-local warnings https://bugzilla.mozilla.org/show_bug.cgi?id=1215894	2015-10-21 11:16:49 -02:00
Behdad Esfahbod	c403d63200	[myanmar] Use buffer->sort() to sort cluster This can possibly produce more granular clusters.	2015-09-01 16:15:25 +01:00
Behdad Esfahbod	85846b3de7	Use insertion-sort instead of bubble-sort Needed for upcoming merge-clusters fix.	2015-09-01 15:07:52 +01:00
Behdad Esfahbod	56f71ff988	Use foreach_syllable in Myanmar shaper	2015-07-22 11:58:11 +01:00
Behdad Esfahbod	f8160a4959	Add FLAG_SAFE() for values known to be small-enough And add check to FLAG()	2015-07-21 15:50:02 +01:00
Behdad Esfahbod	6f2d9ba52a	Add old-Myanmar shaper Looks like Uniscribe responds to the 'mymr' tag by zeroing marks GDEF_LATE instead of generic-shaper UNICODE_LATE. Implement that. Fixes Bug 81775 - Incorrect Rendering with harfbuzz-ng myanmar unicode https://bugs.freedesktop.org/show_bug.cgi?id=81775 Micro-test added based on Padauk.	2014-07-26 19:18:59 -04:00
Behdad Esfahbod	7cd33f2304	Micro optimization	2014-07-17 14:39:07 -04:00
Behdad Esfahbod	7627100f42	Mark unsigned integer literals with the u suffix Simplifies hb_in_range() calls as the type can be inferred. The rest is obsessiveness, I admit.	2014-07-11 16:22:13 -04:00
Behdad Esfahbod	d743ce78e1	[indic-table] Update to Unicode 7.0 data Touch code just enough to preserve previous syllable structure and functionality as closely as possible. Many further cleanups coming later.	2014-06-30 15:24:45 -04:00
Behdad Esfahbod	cf78dd483c	[indic/myanmar] Rename OT_NBSP to OT_PLACEHOLDER	2014-05-27 17:53:37 -04:00
Behdad Esfahbod	186ece94c8	[myanmar] Use OT_NBSP instead of OT_DOTTEDCIRCLE for OT_GB No functional change.	2014-05-27 17:49:45 -04:00
Behdad Esfahbod	cf71d28c38	[indic/myanmar] Refactor a few macros	2014-05-27 17:47:43 -04:00
Behdad Esfahbod	3d6ca0d32e	[ot] Simplify normalization_preference again No shaper has more than one behavior re this, so no need for a callback.	2013-12-31 16:35:37 +08:00
Behdad Esfahbod	9174a9db5c	[myanmar] Allow punctuation clusters The spec and Uniscribe don't allow these, but UTN#11 specifically says the sequence U+104B,U+1038 is valid. As such, allow all "P V" sequences. There's about eight sequences that match that structure, but Roozbeh thinks it's fine to allow all of them. Test case: U+104B, U+1038 https://bugs.freedesktop.org/show_bug.cgi?id=71947	2013-11-25 18:10:38 -05:00
Behdad Esfahbod	096b71e8ef	[myanmar] Mark U+104E MYANMAR SYMBOL AFOREMENTIONED as Consonant The spec and Uniscribe treat it as consonant in the grammar, but it's not in IndicSyllableCategory.txt, so fix up. Test sequence: U+1004,U+103A,U+1039,U+104E https://bugs.freedesktop.org/show_bug.cgi?id=71948	2013-11-25 18:03:34 -05:00
Behdad Esfahbod	71b4c999a5	Revert "Zero marks by GDEF for Tibetan" This reverts commit `d5bd0590ae`. The reasoning behind that logic was flawed and made under a misunderstanding of the original problem, and caused regressions as reported by Jonathan Kew in thread titled "tibetan marks" in Oct 2013. Apparently I had fixed the original problem with this commit: `7e08f1258d` So, revert the faulty commit and everything seems to be in good shape.	2013-10-28 00:43:27 +01:00
Behdad Esfahbod	d5bd0590ae	Zero marks by GDEF for Tibetan See: http://lists.freedesktop.org/archives/harfbuzz/2013-April/003101.html	2013-10-18 18:17:29 +02:00
Behdad Esfahbod	a1f7b28561	[otlayout] Switch over from old is_a_ligature() to IS_LIGATED Impact should be minimal and positive.	2013-10-18 11:25:24 +02:00
Behdad Esfahbod	3ddf892b53	[otlayout] Renaming	2013-10-18 11:21:15 +02:00
Behdad Esfahbod	5e7432b817	[myanmar] Apply abvm/blwm	2013-10-15 12:33:18 +02:00
Behdad Esfahbod	6fadd9dd7c	Apply 'mark' to Myanmar According to Andrew Glass: "The issue with Myanmar <mark> feature was fixed via a servicing patch as soon as Windows 8 became available."	2013-07-26 10:33:06 -04:00
Behdad Esfahbod	127daf15e0	Arabic mark width-zeroing regression Mozilla Bug 873902 - Display Arabic text with diacritics is bad https://bugzilla.mozilla.org/show_bug.cgi?id=873902	2013-05-20 09:11:35 -04:00
Behdad Esfahbod	a8cf7b43fa	[Indic] Further adjust ZWJ handling in Indic-like shapers After the Ngapi hackfest work, we were assuming that fonts won't use presentation features to choose specific forms (eg. conjuncts). As such, we were using auto-joiner behavior for such features. It proved to be troublesome as many fonts used presentation forms ('pres') for example to form conjuncts, which need to be disabled when a ZWJ is inserted. Two examples: U+0D2F,U+200D,U+0D4D,U+0D2F with kartika.ttf U+0995,U+09CD,U+200D,U+09B7 with vrinda.ttf What we do now is to never do magic to ZWJ during GSUB's main input match for Indic-style shapers. Note that backtrack/lookahead are still matched liberally, as is GPOS. This seems to be an acceptable compromise. As to the bug that initially started this work, that one needs to be fixed differently: Bug 58714 - Kannada u+0cb0 u+200d u+0ccd u+0c95 u+0cbe does not provide same results as Windows8 https://bugs.freedesktop.org/show_bug.cgi?id=58714 New numbers: BENGALI: 353689 out of 354188 tests passed. 499 failed (0.140886%) DEVANAGARI: 707305 out of 707394 tests passed. 89 failed (0.0125814%) GUJARATI: 366349 out of 366457 tests passed. 108 failed (0.0294714%) GURMUKHI: 60706 out of 60747 tests passed. 41 failed (0.067493%) KANNADA: 951030 out of 951913 tests passed. 883 failed (0.0927606%) KHMER: 299070 out of 299124 tests passed. 54 failed (0.0180527%) LAO: 53611 out of 53644 tests passed. 33 failed (0.0615167%) MALAYALAM: 1048102 out of 1048334 tests passed. 232 failed (0.0221304%) ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%) SINHALA: 271666 out of 271847 tests passed. 181 failed (0.0665816%) TAMIL: 1091753 out of 1091754 tests passed. 1 failed (9.15957e-05%) TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%) TIBETAN: 208469 out of 208469 tests passed. 0 failed (0%)	2013-03-19 06:22:06 -04:00
Behdad Esfahbod	ee9c3a17d0	Minor refactoring	2013-02-15 06:22:52 -05:00
Behdad Esfahbod	cfc507c543	[Indic-like] Disable automatic joiner handling for basic shaping features Not for Arabic, but for Indic-like scripts. ZWJ/ZWNJ have special meanings in those scripts, so let font lookups take full control. This undoes the regression caused by automatic-joiners handling introduced two commits ago. We only disable automatic joiner handling for the "basic shaping features" of Indic, Myanmar, and SEAsian shapers. The "presentation forms" and other features are still applied with automatic-joiner handling. This change also changes the test suite failure statistics, such that a few scripts show more "failures". The most affected is Kannada. However, upon inspection, we believe that in most, if not all, of the new failures, we are producing results superior to Uniscribe. Hard to count those! Here's an example of what is fixed by the recent joiner-handling changes: https://bugs.freedesktop.org/show_bug.cgi?id=58714 New numbers, for future reference: BENGALI: 353892 out of 354188 tests passed. 296 failed (0.0835714%) DEVANAGARI: 707336 out of 707394 tests passed. 58 failed (0.00819911%) GUJARATI: 366262 out of 366457 tests passed. 195 failed (0.0532122%) GURMUKHI: 60706 out of 60747 tests passed. 41 failed (0.067493%) KANNADA: 950680 out of 951913 tests passed. 1233 failed (0.129529%) KHMER: 299074 out of 299124 tests passed. 50 failed (0.0167155%) LAO: 53611 out of 53644 tests passed. 33 failed (0.0615167%) MALAYALAM: 1047983 out of 1048334 tests passed. 351 failed (0.0334817%) ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%) SINHALA: 271539 out of 271847 tests passed. 308 failed (0.113299%) TAMIL: 1091753 out of 1091754 tests passed. 1 failed (9.15957e-05%) TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%) TIBETAN: 208469 out of 208469 tests passed. 0 failed (0%)	2013-02-14 13:10:54 -05:00
Behdad Esfahbod	ec5448667b	Add hb_ot_map_feature_flags_t Code cleanup. No (intended) functional change.	2013-02-14 12:53:57 -05:00
Behdad Esfahbod	e7ffcfafb1	Clean-up add_bool_feature	2013-02-14 11:58:13 -05:00
Behdad Esfahbod	f9b660534c	[Myanmar] Use master Indic table for syllable data	2013-02-12 16:13:56 -05:00
Behdad Esfahbod	bab02d339f	Rename HB_OT_INDIC_OPTIONS env var to HB_OPTIONS The Myanmar shaper now respects the uniscribe-bug-compatibility option too.	2013-02-12 15:26:45 -05:00
Behdad Esfahbod	3a83d33ec0	Add South-East Asian shaper Handles Tai Tham, Cham, and New Tai Lue for now.	2013-02-12 12:14:10 -05:00
Behdad Esfahbod	568000274c	Adjust mark advance-width zeroing logic for Myanmar Before, we were zeroing advance width of attached marks for non-Indic scripts, and not doing it for Indic. We have now three different behaviors, which seem to better reflect what Uniscribe is doing: - For Indic, no explicit zeroing happens whatsoever, which is the same as before, - For Myanmar, zero advance width of glyphs marked as marks in GDEF, and do that before applying GPOS. This seems to be what the new Win8 Myanmar shaper does, - For everything else, zero advance width of glyphs that are from General_Category=Mn Unicode characters, and do so before applying GPOS. This seems to be what Uniscribe does for Latin at least. With these changes, positioning of all tests matches for Myanmar, except for the glitch in Uniscribe not applying 'mark'. See previous commit.	2013-02-12 09:44:57 -05:00
Behdad Esfahbod	99749ca8e0	[Myanmar] Add note re Uniscribe NOT applying 'mark'	2013-02-12 09:44:35 -05:00
Behdad Esfahbod	419c933ed1	[Myanmar] Fix handling of Punctuation and Symbol types Testing with "clusters" now on par with testing without them. 15 failures both.	2013-02-11 16:16:16 -05:00
Behdad Esfahbod	0572c1410a	[Myanmar] Fixup handling of joiners and GB characters	2013-02-11 16:16:07 -05:00
Behdad Esfahbod	98628cac9f	Add Win8-style Myanmar shaper Myanmar failures down from 51% to 0.00204648%! MYANMAR: 1123860 out of 1123883 tests passed. 23 failed (0.00204648%)	2013-02-11 14:20:08 -05:00
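
Several commit messages above quote test statistics in the form "N out of M tests passed. K failed (P%)". As a rough illustration of how those percentages relate to the counts, here is a minimal Python sketch. The helper name `failure_line` is hypothetical and is not part of the HarfBuzz test tooling; the `%g` formatting is an assumption chosen because it reproduces the six-significant-digit style seen in the messages.

```python
def failure_line(script: str, passed: int, total: int) -> str:
    """Format one statistics line in the style quoted in the commit messages.

    Hypothetical helper for illustration only; not taken from HarfBuzz.
    """
    failed = total - passed
    # Failure rate as a percentage of all tests for this script.
    percent = 100.0 * failed / total
    # "%g" keeps six significant digits (an assumed formatting choice).
    return "%s: %d out of %d tests passed. %d failed (%g%%)" % (
        script, passed, total, failed, percent)


if __name__ == "__main__":
    # Counts copied from the commit messages listed above.
    print(failure_line("MYANMAR", 1123860, 1123883))
    print(failure_line("BENGALI", 353725, 354188))
```

The counts passed in the usage lines are taken from commits 98628cac9f and 2813e3049a; everything else in the sketch is an assumption.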

41 Commits
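
Commit 568000274c describes three mark advance-width zeroing behaviors: none for Indic, GDEF-based for Myanmar, and General_Category=Mn for everything else, each applied before GPOS. The following Python sketch only illustrates that three-way decision. The `Glyph` class, its `gdef_is_mark` field, the shaper name strings, and the function `zero_mark_advances` are all hypothetical and do not correspond to HarfBuzz's internal C++ code or API.

```python
import unicodedata
from dataclasses import dataclass
from typing import List


@dataclass
class Glyph:
    # Hypothetical record for illustration; not a HarfBuzz structure.
    codepoint: int       # Unicode character the glyph came from
    advance: int         # horizontal advance width in font units
    gdef_is_mark: bool   # assumed: glyph class is "mark" in the font's GDEF


def zero_mark_advances(glyphs: List[Glyph], shaper: str) -> List[Glyph]:
    """Zero mark advances before positioning, per the three cases in the commit."""
    if shaper == "indic":
        # Indic: no explicit zeroing at all.
        return glyphs
    for g in glyphs:
        if shaper == "myanmar":
            # Myanmar: trust the font's GDEF mark classification.
            is_mark = g.gdef_is_mark
        else:
            # Everything else: use Unicode General_Category=Mn.
            is_mark = unicodedata.category(chr(g.codepoint)) == "Mn"
        if is_mark:
            g.advance = 0
    return glyphs


if __name__ == "__main__":
    # U+1031 MYANMAR VOWEL SIGN E is General_Category=Mc, so only the
    # GDEF-based Myanmar rule zeroes it in this example.
    run = [Glyph(0x1000, 1200, False), Glyph(0x1031, 600, True)]
    print(zero_mark_advances(run, "myanmar"))
```

In this sketch the function is meant to run once per glyph run, before any positioning step, which mirrors the "before applying GPOS" ordering stated in the commit message.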