harfbuzz

Commit Graph

Author	SHA1	Message	Date
Behdad Esfahbod	cb90b1bbe6	[OTLayout] Respect syllable boundaries for backtrack/lookahead matching Originally we meant to match backtrack/lookahead across syllable boundaries. But a bug in the code meant that this was NOT done for backtrack. We "fixed" that in `2c7d0b6b80`, but that broke Myanmar shaping. We now believe that for Indic-like shapers (which is where syllables are used), all basic shaping forms should be fully contained within their syllables, so now we limit backtrack/lookahead matching to the syllable too. Unbreaks Myanmar.	2013-02-15 07:02:08 -05:00
Behdad Esfahbod	ee9c3a17d0	Minor refactoring	2013-02-15 06:22:52 -05:00
Behdad Esfahbod	cfc507c543	[Indic-like] Disable automatic joiner handling for basic shaping features Not for Arabic, but for Indic-like scripts. ZWJ/ZWNJ have special meanings in those scripts, so let font lookups take full control. This undoes the regression caused by automatic-joiners handling introduced two commits ago. We only disable automatic joiner handling for the "basic shaping features" of Indic, Myanmar, and SEAsian shapers. The "presentation forms" and other features are still applied with automatic-joiner handling. This change also changes the test suite failure statistics, such that a few scripts show more "failures". The most affected is Kannada. However, upon inspection, we believe that in most, if not all, of the new failures, we are producing results superior to Uniscribe. Hard to count those! Here's an example of what is fixed by the recent joiner-handling changes: https://bugs.freedesktop.org/show_bug.cgi?id=58714 New numbers, for future reference: BENGALI: 353892 out of 354188 tests passed. 296 failed (0.0835714%) DEVANAGARI: 707336 out of 707394 tests passed. 58 failed (0.00819911%) GUJARATI: 366262 out of 366457 tests passed. 195 failed (0.0532122%) GURMUKHI: 60706 out of 60747 tests passed. 41 failed (0.067493%) KANNADA: 950680 out of 951913 tests passed. 1233 failed (0.129529%) KHMER: 299074 out of 299124 tests passed. 50 failed (0.0167155%) LAO: 53611 out of 53644 tests passed. 33 failed (0.0615167%) MALAYALAM: 1047983 out of 1048334 tests passed. 351 failed (0.0334817%) ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%) SINHALA: 271539 out of 271847 tests passed. 308 failed (0.113299%) TAMIL: 1091753 out of 1091754 tests passed. 1 failed (9.15957e-05%) TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%) TIBETAN: 208469 out of 208469 tests passed. 0 failed (0%)	2013-02-14 13:10:54 -05:00
Behdad Esfahbod	0b45479198	[OTLayout] Add fine-grained control over ZWJ matching Not used yet. Next commit...	2013-02-14 13:02:13 -05:00
Behdad Esfahbod	607feb7cff	[OTLayout] Ignore default-ignorables when matching GSUB/GPOS When matching lookups, be smart about default-ignorable characters. In particular: Do nothing specific about ZWNJ, but for the other default-ignorables: If the lookup in question uses the ignorable character in a sequence, then match it as we used to do. However, if the sequence match will fail because the default-ignorable blocked it, try skipping the ignorable character and continue. The most immediate thing it means is that if Lam-Alef forms a ligature, then Lam-ZWJ-Alef will do to. Finally! One exception: when matching for GPOS, or for backtrack/lookahead of GSUB, we ignore ZWNJ too. That's the right thing to do. It certainly is possible to build fonts that this feature will result in undesirable glyphs, but it's hard to think of a real-world case that that would happen. This does break Indic shaping right now, since Indic Unicode has specific rules for what ZWJ/ZWNJ mean, and skipping ZWJ is breaking those rules. That will be fixed in upcoming commits.	2013-02-14 12:57:50 -05:00
Behdad Esfahbod	ec5448667b	Add hb_ot_map_feature_flags_t Code cleanup. No (intended) functional change.	2013-02-14 12:53:57 -05:00
Behdad Esfahbod	e7ffcfafb1	Clean-up add_bool_feature	2013-02-14 11:58:13 -05:00
Behdad Esfahbod	e7562f53fe	Fix compile warnings for ragel-generated machines	2013-02-14 11:58:13 -05:00
Behdad Esfahbod	4e51df73a3	[OTLayout] Remove unused function	2013-02-14 07:42:42 -05:00
Behdad Esfahbod	8820bb235b	[OTLayout] Port apply_lookup to skippy_iter	2013-02-14 07:41:23 -05:00
Behdad Esfahbod	dfca269f06	[OTLayout] Port ligate_input to skippy_iter	2013-02-14 07:41:23 -05:00
Behdad Esfahbod	7e53415c2d	[OTLayout] Minor fix for apply_lookup() Should NOT change behavior, since first glyph is a match.	2013-02-14 06:24:30 -05:00
Behdad Esfahbod	6880f7e19d	[OTLayout] Make table type known to apply context	2013-02-13 12:17:25 -05:00
Behdad Esfahbod	2c7d0b6b80	[OTLayou] Unbreak backtrack matching Was introduced by `28b9d502bb`.	2013-02-13 12:10:08 -05:00
Behdad Esfahbod	c074ebc466	[OTLayout] Minor refactoring	2013-02-13 11:22:42 -05:00
Behdad Esfahbod	407fc12466	[OTLayout] Remove bogus caching of glyph property	2013-02-13 11:13:06 -05:00
Behdad Esfahbod	6b1e3502e2	Remember ZWNJ To be used in upcoming changes.	2013-02-13 11:02:54 -05:00
Behdad Esfahbod	1f91c39677	Indent	2013-02-13 09:38:40 -05:00
Behdad Esfahbod	a0cb9f33ee	[Indic] Improve base finding in final_reordering Fixes 5 Malayalam failures! MALAYALAM: 1048016 out of 1048334 tests passed. 318 failed (0.0303338%)	2013-02-13 09:26:55 -05:00
Behdad Esfahbod	126f39cd16	Add more dot-reph tests	2013-02-13 08:29:21 -05:00
Behdad Esfahbod	f22b7e7778	[Indic] Track base position when reordering things Ouch, how did things ever work without this?! The added test that has a dot-reph as well as a pre-base reordering Ra perfectly demonstrates the bug (tested with Nirmala font from Win8 for example). Testing suggests that Win8 shaper has the exact same bug / behavior that we used to have. Odd.	2013-02-13 07:32:46 -05:00
Behdad Esfahbod	bc11de144c	[SEA] Don't zero any mark advances Keep the logic simple, easier to explain to font developers.	2013-02-13 05:59:06 -05:00
Behdad Esfahbod	0291a65286	Further adjust mark advance zeroing This is a followup to `568000274c`. Looks like in the Latin shaper, Uniscribe zeroes all Unicode NSM advances after GPOS, not before. Match that. Can be tested using DejaVu Sans Mono, since that font has GPOS rules to zero the mark advances on its own.	2013-02-13 05:57:24 -05:00
Behdad Esfahbod	85c51ec2e1	[Indic] Fix Eyelash Ra with old Devanagari spec	2013-02-12 18:17:39 -05:00
Behdad Esfahbod	cc5f24cde0	[tests] Add tests for Devanagary Eyelash Ra Currently broken with Sanskrit 2003 font.	2013-02-12 18:17:12 -05:00
Behdad Esfahbod	63e48bc33b	[Indic] Apply 'blwf' before 'half' This reverts `167b625d98`. It didn't matter before, but that's going to change with next commit.	2013-02-12 18:02:07 -05:00
Behdad Esfahbod	70d6565711	[Indic] Apply 'vatu' before 'cjct' This essentially reverts `1d6846db9e`, but that commit is from way back when. We should be better following the spec order now again.	2013-02-12 18:02:07 -05:00
Behdad Esfahbod	64bb2ae857	Didn't mean to push this out Ouch!	2013-02-12 16:29:25 -05:00
Behdad Esfahbod	f9b660534c	[Myanmar] Use master Indic table for syllable data	2013-02-12 16:13:56 -05:00
Behdad Esfahbod	f60793e854	[tests] Add Cham sample	2013-02-12 15:45:59 -05:00
Behdad Esfahbod	e2aab4b5db	Improve checks for setmode() As reported by Jonathan, OS X has setmode() that is something other than what setmode() is on Win32. So, limit invocation to Windows platforms only.	2013-02-12 15:35:32 -05:00
Behdad Esfahbod	a6c1e040e5	Improve check for Windows platforms Instead of checking for compiler, check for platform.	2013-02-12 15:31:58 -05:00
Behdad Esfahbod	9e1f80ab3e	[SEA] Treat Consonant_Final like Consonant_Medial	2013-02-12 15:28:21 -05:00
Behdad Esfahbod	bab02d339f	Rename HB_OT_INDIC_OPTIONS env var to HB_OPTIONS The Myanmar shaper now respects the uniscribe-bug-compatibility option too.	2013-02-12 15:26:45 -05:00
Behdad Esfahbod	3a83d33ec0	Add South-East Asian shaper Handles Tai Tham, Cham, and New Tai Lue for now.	2013-02-12 12:14:10 -05:00
Behdad Esfahbod	fb96021206	Minor test reshufflings	2013-02-12 10:33:58 -05:00
Behdad Esfahbod	5676d5d527	[Indic] Make sure New Tai Lue works!	2013-02-12 10:31:14 -05:00
Behdad Esfahbod	568000274c	Adjust mark advance-width zeroing logic for Myanmar Before, we were zeroing advance width of attached marks for non-Indic scripts, and not doing it for Indic. We have now three different behaviors, which seem to better reflect what Uniscribe is doing: - For Indic, no explicit zeroing happens whatsoever, which is the same as before, - For Myanmar, zero advance width of glyphs marked as marks in GDEF, and do that before applying GPOS. This seems to be what the new Win8 Myanmar shaper does, - For everything else, zero advance width of glyphs that are from General_Category=Mn Unicode characters, and do so before applying GPOS. This seems to be what Uniscribe does for Latin at least. With these changes, positioning of all tests matches for Myanmar, except for the glitch in Uniscribe not applying 'mark'. See preivous commit.	2013-02-12 09:44:57 -05:00
Behdad Esfahbod	99749ca8e0	[Myanmar] Add note re Uniscribe NOT applying 'mark'	2013-02-12 09:44:35 -05:00
Behdad Esfahbod	b842780138	Minor	2013-02-11 17:02:17 -05:00
Behdad Esfahbod	419c933ed1	[Myanmar] Fix handling of Punctuation and Symbol types Testing with "clusters" now on par with testing without them. 15 failures both.	2013-02-11 16:16:16 -05:00
Behdad Esfahbod	0572c1410a	[Myanmar] Fixup handling of joiners and GB characters	2013-02-11 16:16:07 -05:00
Behdad Esfahbod	1c8654ead4	[Myanmar] Prevent reordering between Asat and Dot below Implemented as a hack for now. Myanmar failures down from 23 to 15. MYANMAR: 1123868 out of 1123883 tests passed. 15 failed (0.00133466%) The remaining 15 cases are all where the syllable is wrong according to the OpenType spec. We insert dottedcircle. Uniscribe fails to do that, but it also fails to reorder the prebase-reordering medial-Ra. So it gets it wrong.	2013-02-11 14:28:59 -05:00
Behdad Esfahbod	bed687f886	Shuffle test data around	2013-02-11 14:24:03 -05:00
Behdad Esfahbod	98628cac9f	Add Win8-style Myanmar shaper Myanmar failures down from 51% to 0.00204648%! MYANMAR: 1123860 out of 1123883 tests passed. 23 failed (0.00204648%)	2013-02-11 14:20:08 -05:00
Behdad Esfahbod	1df5644958	Minor	2013-02-11 14:18:09 -05:00
Behdad Esfahbod	54f7b4d9ec	[OTLayout] Respect lookup-flags skipping over non-mark glyphs Before, when matching ligatures, we never skipping over base / liga glyphs even if that was what the LookupFlags asked for. Fixed now. We carefully reviewed all instances of this, and tested with Amiri as well as some Indic scripts, and are confident that this should NOT break anyone's fonts. It's also how Uniscribe does it, from what we can tell.	2013-02-11 13:27:17 -05:00
Behdad Esfahbod	9082efc4aa	[OTLayout] s/mark_skipping/skipping/ In aticipation of upcoming changes.	2013-02-11 13:14:56 -05:00
Behdad Esfahbod	9621e0ba29	[Indic] Fix bug introduced in `8b217f5ac5` Was breaking reph formation logic when the Ra is the only consonant. Devanagari regression fixed. Down to 57 failures again. Ouch.	2013-02-11 12:59:36 -05:00
Behdad Esfahbod	6e74c64211	Improve normalization heuristic Before, for most scripts, we were not trying to recompose two characters if the second one had ccc=0. That fails for Myanmar where U+1026 decomposes to U+1025,U+102E, both of which have ccc=0. However, we do want to try to recompose those. We now check whether the second is a mark, using general category instead. At the same time, remove optimization that was conflicting with this. [Let the Ngapi hackfest begin!]	2013-02-11 12:59:00 -05:00

1 2 3 4 5 ...

2952 Commits All Branches Search

2952 Commits

All Branches