harfbuzz

Commit Graph

Author	SHA1	Message	Date
Behdad Esfahbod	a8cf7b43fa	[Indic] Futher adjust ZWJ handling in Indic-like shapers After the Ngapi hackfest work, we were assuming that fonts won't use presentation features to choose specific forms (eg. conjuncts). As such, we were using auto-joiner behavior for such features. It proved to be troublesome as many fonts used presentation forms ('pres') for example to form conjuncts, which need to be disabled when a ZWJ is inserted. Two examples: U+0D2F,U+200D,U+0D4D,U+0D2F with kartika.ttf U+0995,U+09CD,U+200D,U+09B7 with vrinda.ttf What we do now is to never do magic to ZWJ during GSUB's main input match for Indic-style shapers. Note that backtrack/lookahead are still matched liberally, as is GPOS. This seems to be an acceptable compromise. As to the bug that initially started this work, that one needs to be fixed differently: Bug 58714 - Kannada u+0cb0 u+200d u+0ccd u+0c95 u+0cbe does not provide same results as Windows8 https://bugs.freedesktop.org/show_bug.cgi?id=58714 New numbers: BENGALI: 353689 out of 354188 tests passed. 499 failed (0.140886%) DEVANAGARI: 707305 out of 707394 tests passed. 89 failed (0.0125814%) GUJARATI: 366349 out of 366457 tests passed. 108 failed (0.0294714%) GURMUKHI: 60706 out of 60747 tests passed. 41 failed (0.067493%) KANNADA: 951030 out of 951913 tests passed. 883 failed (0.0927606%) KHMER: 299070 out of 299124 tests passed. 54 failed (0.0180527%) LAO: 53611 out of 53644 tests passed. 33 failed (0.0615167%) MALAYALAM: 1048102 out of 1048334 tests passed. 232 failed (0.0221304%) ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%) SINHALA: 271666 out of 271847 tests passed. 181 failed (0.0665816%) TAMIL: 1091753 out of 1091754 tests passed. 1 failed (9.15957e-05%) TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%) TIBETAN: 208469 out of 208469 tests passed. 0 failed (0%)	2013-03-19 06:22:06 -04:00
Behdad Esfahbod	57542d7f41	Minor	2013-02-21 15:55:49 -05:00
Behdad Esfahbod	cfc507c543	[Indic-like] Disable automatic joiner handling for basic shaping features Not for Arabic, but for Indic-like scripts. ZWJ/ZWNJ have special meanings in those scripts, so let font lookups take full control. This undoes the regression caused by automatic-joiners handling introduced two commits ago. We only disable automatic joiner handling for the "basic shaping features" of Indic, Myanmar, and SEAsian shapers. The "presentation forms" and other features are still applied with automatic-joiner handling. This change also changes the test suite failure statistics, such that a few scripts show more "failures". The most affected is Kannada. However, upon inspection, we believe that in most, if not all, of the new failures, we are producing results superior to Uniscribe. Hard to count those! Here's an example of what is fixed by the recent joiner-handling changes: https://bugs.freedesktop.org/show_bug.cgi?id=58714 New numbers, for future reference: BENGALI: 353892 out of 354188 tests passed. 296 failed (0.0835714%) DEVANAGARI: 707336 out of 707394 tests passed. 58 failed (0.00819911%) GUJARATI: 366262 out of 366457 tests passed. 195 failed (0.0532122%) GURMUKHI: 60706 out of 60747 tests passed. 41 failed (0.067493%) KANNADA: 950680 out of 951913 tests passed. 1233 failed (0.129529%) KHMER: 299074 out of 299124 tests passed. 50 failed (0.0167155%) LAO: 53611 out of 53644 tests passed. 33 failed (0.0615167%) MALAYALAM: 1047983 out of 1048334 tests passed. 351 failed (0.0334817%) ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%) SINHALA: 271539 out of 271847 tests passed. 308 failed (0.113299%) TAMIL: 1091753 out of 1091754 tests passed. 1 failed (9.15957e-05%) TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%) TIBETAN: 208469 out of 208469 tests passed. 0 failed (0%)	2013-02-14 13:10:54 -05:00
Behdad Esfahbod	0b45479198	[OTLayout] Add fine-grained control over ZWJ matching Not used yet. Next commit...	2013-02-14 13:02:13 -05:00
Behdad Esfahbod	607feb7cff	[OTLayout] Ignore default-ignorables when matching GSUB/GPOS When matching lookups, be smart about default-ignorable characters. In particular: Do nothing specific about ZWNJ, but for the other default-ignorables: If the lookup in question uses the ignorable character in a sequence, then match it as we used to do. However, if the sequence match will fail because the default-ignorable blocked it, try skipping the ignorable character and continue. The most immediate thing it means is that if Lam-Alef forms a ligature, then Lam-ZWJ-Alef will do to. Finally! One exception: when matching for GPOS, or for backtrack/lookahead of GSUB, we ignore ZWNJ too. That's the right thing to do. It certainly is possible to build fonts that this feature will result in undesirable glyphs, but it's hard to think of a real-world case that that would happen. This does break Indic shaping right now, since Indic Unicode has specific rules for what ZWJ/ZWNJ mean, and skipping ZWJ is breaking those rules. That will be fixed in upcoming commits.	2013-02-14 12:57:50 -05:00
Behdad Esfahbod	568000274c	Adjust mark advance-width zeroing logic for Myanmar Before, we were zeroing advance width of attached marks for non-Indic scripts, and not doing it for Indic. We have now three different behaviors, which seem to better reflect what Uniscribe is doing: - For Indic, no explicit zeroing happens whatsoever, which is the same as before, - For Myanmar, zero advance width of glyphs marked as marks in GDEF, and do that before applying GPOS. This seems to be what the new Win8 Myanmar shaper does, - For everything else, zero advance width of glyphs that are from General_Category=Mn Unicode characters, and do so before applying GPOS. This seems to be what Uniscribe does for Latin at least. With these changes, positioning of all tests matches for Myanmar, except for the glitch in Uniscribe not applying 'mark'. See preivous commit.	2013-02-12 09:44:57 -05:00
Behdad Esfahbod	5a08ecf920	Implement hb_ot_layout_get_glyph_class()	2012-11-16 13:34:29 -08:00
Behdad Esfahbod	f30641038b	Bunch of independent changes (ouch) API additions: hb_segment_properties_t HB_SEGMENT_PROPERTIES_DEFAULT hb_segment_properties_equal() hb_segment_properties_hash() hb_buffer_set_segment_properties() hb_buffer_get_segment_properties() hb_ot_layout_glyph_class_t hb_shape_plan_t hb_shape_plan_create() hb_shape_plan_create_cached() hb_shape_plan_get_empty() hb_shape_plan_reference() hb_shape_plan_destroy() hb_shape_plan_set_user_data() hb_shape_plan_get_user_data() hb_shape_plan_execute() hb_ot_shape_plan_collect_lookups() API changes: Rename hb_ot_layout_feature_get_lookup_indexes() to hb_ot_layout_feature_get_lookups(). New header file: hb-shape-plan.h And a bunch of prototyped but not implemented stuff. Coming soon. (Tests fail because of the prototypes right now.)	2012-11-15 18:48:10 -08:00
Behdad Esfahbod	362a990b22	Rename hb_ot_layout_would_substitute_lookup() and hb_ot_layout_substitute_closure_lookup() To match upcoming API.	2012-11-15 14:57:31 -08:00
Behdad Esfahbod	7c8e844d92	Use namespace for OpenType tables Avoids USHORT, SHORT, ULONG, LONG clashes with Windows API.	2012-08-28 17:57:49 -04:00
Behdad Esfahbod	d9b204d3d2	[GSUB] Allow non-zero-context matching in would_apply() To be used in the next patch.	2012-08-23 16:22:28 -04:00
Behdad Esfahbod	cd0c6e148f	Shuffle buffer variable allocations around To room for more allocations, coming.	2012-08-09 21:48:55 -04:00
Behdad Esfahbod	afbcc24be0	[GSUB] Wire the font, not just the face, down to substitute() We need the font for glyph lookup during GSUB pauses in Indic shaper. Could perhaps be avoided, but at this point, we don't mean to support separate substitute()/position() entry points (anymore), so there is no point in not providing the font to GSUB.	2012-08-02 08:36:40 -04:00
Behdad Esfahbod	b0e6a26a10	[OT] Hide some API It was impossible to meaningfully use them from the outside these days.	2012-08-02 08:11:14 -04:00
Behdad Esfahbod	6f76113755	[GSUB/GPOS] Check array size before accessing digests	2012-08-02 04:00:31 -04:00
Behdad Esfahbod	1336ecdf8e	[GSUB/GPOS] Use Coverage digests as gatekeeper Gives me a good 10% speedup for the Devanagari test case. Less so for less lookup-intensive tests. For the Devanagari test case, the false positive rate of the GSUB digest is 4%.	2012-08-01 21:46:36 -04:00
Behdad Esfahbod	c2e42c3db6	Minor	2012-07-30 19:54:50 -04:00
Behdad Esfahbod	05bd1b6342	[GSUB/GPOS] Move glyph props matching around	2012-07-30 19:30:01 -04:00
Behdad Esfahbod	f860366456	[OT] Gain back some lost speed	2012-07-30 03:16:38 -04:00
Behdad Esfahbod	578e42182b	Minor	2012-07-30 02:35:07 -04:00
Behdad Esfahbod	4751dec8be	Minor	2012-07-30 00:42:07 -04:00
Behdad Esfahbod	fe20c0f84f	[GSUB] Fix mark component stuff when ligatures form ligatures! See comments. Fixes https://bugzilla.gnome.org/show_bug.cgi?id=437633	2012-07-30 00:00:59 -04:00
Behdad Esfahbod	2ec3ba46a3	[GSUB/GPOS] Minor Start squeezing more out of lig_id/lig_comp.	2012-07-29 22:16:15 -04:00
Behdad Esfahbod	5d874d566f	[GPOS] Fix mark-to-mark positioning when one of the marks is a ligature This commit: `a3313e5400` broke MarkMarkPos when one of the marks itself is a ligature. That regressed 26 Tibetan tests (up from zero!). Fix that. Tibetan back to zero.	2012-07-28 21:05:25 -04:00
Behdad Esfahbod	ea278d3895	Partially switch ot shaper to shape_plan	2012-07-27 02:12:28 -04:00
Behdad Esfahbod	a3313e5400	[GPOS] Fix MarkMarkPos applied to results of MultipleSubst This was broken as a result of `7b84c536c1`. As Khaled reported, MarkMark positioning was broken with glyphs resulting from a MultipleSubst. Fixed. Test with the ALLAH character in Amiri.	2012-07-25 18:37:51 -04:00
Behdad Esfahbod	49c5ec5144	Minor refactoring	2012-07-23 20:14:13 -04:00
Behdad Esfahbod	0f8fea71a6	Minor. Hide _hb_ot_layout_get_glyph_property()	2012-06-09 00:24:38 -04:00
Behdad Esfahbod	cee7187447	[Indic] Move syllable tracking from Indic to generic layer This is to incorporate it into GSUB/GPOS processing.	2012-05-11 11:41:39 +02:00
Behdad Esfahbod	0b08adb353	Add hb_set_t	2012-04-23 22:44:59 -04:00
Behdad Esfahbod	6a9be5bd35	Rename hb_glyph_map_t to hb_set_t	2012-04-23 22:23:17 -04:00
Behdad Esfahbod	31081f7390	Implement closure() for Context and ChainContext lookups	2012-04-23 16:54:58 -04:00
Behdad Esfahbod	c64ddab3c3	Flesh out closure() for GSUB The GSUBGPOS part still missing.	2012-04-23 15:28:35 -04:00
Behdad Esfahbod	5e88aa6682	Remove public enum names again As was reported to me, glib-mkenum does not understand named enums, so remove for now.	2012-04-14 18:51:50 -04:00
Behdad Esfahbod	c6035cf802	Add names to enums gdb was showing <anonymous enum> instead of useful stuff, so name all our enums.	2012-04-12 13:23:59 -04:00
Behdad Esfahbod	3897335c76	[API] Sort out get_blob API hb_face_get_blob() renamed to hb_face_reference_blob(), returns a reference now. hb_face_[sg]et_index() added. hb_face_set_upem() added.	2011-08-08 23:37:41 +02:00
Behdad Esfahbod	c605bbbb6d	Remove C++ guards from source files Where causing issues for people with MSVC.	2011-08-04 20:00:53 -04:00
Behdad Esfahbod	abcfe9b59b	Remove hb_ot_layout_context_t, simplify code	2011-05-11 00:02:02 -04:00
Behdad Esfahbod	266b34418c	Refactor to keep hb-object-private.h and hb-open-type.h separate Needed to be able to include <Windows.h> from hb-object-private.h.	2011-05-03 00:35:53 -04:00
Behdad Esfahbod	2409d5f8d7	Update Copyright headers	2011-04-21 17:14:28 -04:00
Behdad Esfahbod	da97541988	[API] Allow negative font x_scale/y_scale I was reconsidering whether y should grow down, since all three/four times I've used this API I was tricked and got that wrong in my use. So I was very inclined to make y grow down instead of up. However, considering that the font space has y up and it would be very confusing for callbacks to work against that, I decided that what I really want is for the user to be able to set y_scale to a negative number to imply that user-space y grows down. Changing x_scale/y_scale from unsigned int to int allows that, and I've made pango to use that instead of negating glyph y_offset later. hb-ft however still has y group up. I guess that's how FreeType works? I'm not sure, FreeType docs don't make this clear... I'm happy with the resolution :-).	2011-04-21 15:15:02 -04:00
Behdad Esfahbod	c57d454acc	Rename all private sources and headers to C++ files So we can liberally use the simple features of C++ that parts of the codebase is already using.	2011-04-20 18:50:27 -04:00
Behdad Esfahbod	3a852ae7fe	Save general category and combining class in the buffer for reuse	2010-11-03 16:37:24 -04:00
Behdad Esfahbod	4a2d844c2f	Minor code shuffling	2010-11-03 15:28:56 -04:00
Behdad Esfahbod	c2709119c8	Move things around some more	2010-11-02 19:12:59 -04:00
Behdad Esfahbod	194d4566ec	Move buffer var allocation local	2010-11-02 19:12:59 -04:00
Behdad Esfahbod	bf94b3ad22	Move some more code around	2010-11-02 19:12:59 -04:00
Behdad Esfahbod	8c69e65abe	Rename lookup_flags to lookup_props since it's more than just flags	2010-11-02 19:12:59 -04:00
Behdad Esfahbod	98370e89d1	WIP removing external synthesized GDEF support and implementing it internally	2010-11-02 19:12:58 -04:00
Behdad Esfahbod	1115890b90	More cleanup	2010-11-02 19:12:58 -04:00
Behdad Esfahbod	88474c6fda	Get rid of the OpenType-specific internal buffer representation Add variant integers to buffer item types. More cleanup coming.	2010-11-02 19:12:58 -04:00
Behdad Esfahbod	af5d02a269	Rewrite Cursive joining to act more like other pair lookups Look forward for next character instead of joining to the last character.	2010-10-27 11:54:26 -04:00
Behdad Esfahbod	f0feb084b0	Minor	2010-10-03 19:09:39 -04:00
Behdad Esfahbod	ac0c1663fa	Avoid div-by-zero, validate upem	2010-10-01 19:09:23 -04:00
Behdad Esfahbod	acdba3f90b	Prefer C linkage	2010-07-23 15:39:27 -04:00
Behdad Esfahbod	cc6d52279d	De-C++ where possible Helps with avoiding many "extern C" declarations in source files.	2010-07-23 15:00:13 -04:00
Behdad Esfahbod	0dd200d28f	Header dep cleanup	2010-07-23 14:56:29 -04:00
Behdad Esfahbod	0f0cd9d361	Fix header tags	2010-06-09 06:32:56 -04:00
Behdad Esfahbod	b634beb39e	Fix delta scale, again...	2010-05-20 17:44:52 +01:00
Behdad Esfahbod	0a4399ca22	Fix scale issues hb_font_set_scale() now sets the value to be used to represent a unit pixel. For example, if rendering a 10px font with a 26.6 representation, you would set scale to (10 << 6). For 10px in 16.16 you would set it to (10 << 16). This space should be the same space that the get_glyph_metrics and get_kerning callbacks work in.	2010-05-19 15:45:06 -04:00
Behdad Esfahbod	eee8598d75	Hide internal symbols	2010-05-12 23:22:55 -04:00
Behdad Esfahbod	22da7fd94d	Rename a few files to be C++ sources In anticipation for buffer revamp coming.	2010-05-12 18:23:21 -04:00

1 2 3

112 Commits