2019-04-30 18:23:19 +02:00
|
|
|
<?xml version="1.0"?>
|
|
|
|
<!DOCTYPE book PUBLIC "-//OASIS//DTD DocBook XML V4.3//EN"
|
|
|
|
"http://www.oasis-open.org/docbook/xml/4.3/docbookx.dtd" [
|
|
|
|
<!ENTITY % local.common.attrib "xmlns:xi CDATA #FIXED 'http://www.w3.org/2003/XInclude'">
|
|
|
|
<!ENTITY version SYSTEM "version.xml">
|
|
|
|
]>
|
|
|
|
<chapter id="utilities">
|
|
|
|
<title>Utilities</title>
|
|
|
|
<para>
|
|
|
|
HarfBuzz includes several auxiliary components in addition to the
|
|
|
|
main APIs. These include a set of command-line tools, a set of
|
|
|
|
lower-level APIs for common data types that may be of interest to
|
|
|
|
client programs, and an embedded library for working with
|
|
|
|
Unicode Character Database (UCD) data.
|
|
|
|
</para>
|
|
|
|
|
|
|
|
<section id="utilities-command-line-tools">
|
|
|
|
<title>Command-line tools</title>
|
|
|
|
<para>
|
|
|
|
HarfBuzz include three command-line tools:
|
2020-04-17 16:54:40 +02:00
|
|
|
<command>hb-shape</command>, <command>hb-view</command>, and
|
|
|
|
<command>hb-subset</command>. They can be used to examine
|
2019-04-30 18:23:19 +02:00
|
|
|
HarfBuzz's functionality, debug font binaries, or explore the
|
|
|
|
various shaping models and features from a terminal.
|
|
|
|
</para>
|
|
|
|
|
|
|
|
<section id="utilities-command-line-hbshape">
|
|
|
|
<title>hb-shape</title>
|
|
|
|
<para>
|
2020-04-17 16:54:40 +02:00
|
|
|
<emphasis><command>hb-shape</command></emphasis> allows you to run HarfBuzz's
|
2019-04-30 18:23:19 +02:00
|
|
|
<function>hb_shape()</function> function on an input string and
|
|
|
|
to examine the outcome, in human-readable form, as terminal
|
2020-04-17 16:54:40 +02:00
|
|
|
output. <command>hb-shape</command> does
|
2019-04-30 18:23:19 +02:00
|
|
|
<emphasis>not</emphasis> render the results of the shaping call
|
2020-04-17 16:54:40 +02:00
|
|
|
into rendered text (you can use <command>hb-view</command>, below, for
|
2019-04-30 18:23:19 +02:00
|
|
|
that). Instead, it prints out the final glyph indices and
|
|
|
|
positions, taking all shaping operations into account, as if the
|
|
|
|
input string were a HarfBuzz input buffer.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
You can specify the font to be used for shaping and, with
|
|
|
|
command-line options, you can add various aspects of the
|
|
|
|
internal state to the output that is sent to the terminal. The
|
|
|
|
general format is
|
|
|
|
</para>
|
|
|
|
<programlisting>
|
|
|
|
<command>hb-shape</command> <optional>[OPTIONS]</optional>
|
|
|
|
<parameter>path/to/font/file.ttf</parameter>
|
|
|
|
<parameter>yourinputtext</parameter>
|
|
|
|
</programlisting>
|
|
|
|
<para>
|
|
|
|
The default output format is plain text (although JSON output
|
|
|
|
can be selected instead by specifying the option
|
|
|
|
<optional>--output-format=json</optional>). The default output
|
|
|
|
syntax reports each glyph name (or glyph index if there is no
|
|
|
|
name) followed by its cluster value, its horizontal and vertical
|
|
|
|
position displacement, and its horizontal and vertical advances.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
Output options exist to skip any of these elements in the
|
|
|
|
output, and to include additional data, such as Unicode
|
|
|
|
code-point values, glyph extents, glyph flags, or interim
|
|
|
|
shaping results.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
Output can also be redirected to a file, or input read from a
|
|
|
|
file. Additional options enable you to enable or disable
|
|
|
|
specific font features, to set variation-font axis values, to
|
|
|
|
alter the language, script, direction, and clustering settings
|
|
|
|
used, to enable sanity checks, or to change which shaping engine is used.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
For a complete explanation of the options available, run
|
|
|
|
</para>
|
|
|
|
<programlisting>
|
|
|
|
<command>hb-shape</command> <parameter>--help</parameter>
|
|
|
|
</programlisting>
|
|
|
|
</section>
|
|
|
|
|
|
|
|
<section id="utilities-command-line-hbview">
|
|
|
|
<title>hb-view</title>
|
|
|
|
<para>
|
2020-04-17 16:54:40 +02:00
|
|
|
<emphasis><command>hb-view</command></emphasis> allows you to
|
2019-04-30 18:23:19 +02:00
|
|
|
see the shaped output of an input string in rendered
|
2020-04-17 16:54:40 +02:00
|
|
|
form. Like <command>hb-shape</command>,
|
|
|
|
<command>hb-view</command> takes a font file and a text string
|
2019-04-30 18:23:19 +02:00
|
|
|
as its arguments:
|
|
|
|
</para>
|
|
|
|
<programlisting>
|
|
|
|
<command>hb-view</command> <optional>[OPTIONS]</optional>
|
|
|
|
<parameter>path/to/font/file.ttf</parameter>
|
|
|
|
<parameter>yourinputtext</parameter>
|
|
|
|
</programlisting>
|
|
|
|
<para>
|
2020-04-17 16:54:40 +02:00
|
|
|
By default, <command>hb-view</command> renders the shaped
|
2019-04-30 18:23:19 +02:00
|
|
|
text in ASCII block-character images as terminal output. By
|
|
|
|
appending the
|
|
|
|
<command>--output-file=<optional>filename</optional></command>
|
|
|
|
switch, you can write the output to a PNG, SVG, or PDF file
|
|
|
|
(among other formats).
|
|
|
|
</para>
|
|
|
|
<para>
|
2020-04-17 16:54:40 +02:00
|
|
|
As with <command>hb-shape</command>, a lengthy set of options
|
2019-04-30 18:23:19 +02:00
|
|
|
is available, with which you can enable or disable
|
|
|
|
specific font features, set variation-font axis values,
|
|
|
|
alter the language, script, direction, and clustering settings
|
|
|
|
used, enable sanity checks, or change which shaping engine is
|
|
|
|
used.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
You can also set the foreground and background colors used for
|
|
|
|
the output, independently control the width of all four
|
|
|
|
margins, alter the line spacing, and annotate the output image
|
|
|
|
with
|
|
|
|
</para>
|
|
|
|
<para>
|
2020-04-17 16:54:40 +02:00
|
|
|
In general, <command>hb-view</command> is a quick way to
|
2019-04-30 18:23:19 +02:00
|
|
|
verify that the output of HarfBuzz's shaping operation looks
|
|
|
|
correct for a given text-and-font combination, but you may
|
2020-04-17 16:54:40 +02:00
|
|
|
want to use <command>hb-shape</command> to figure out exactly
|
2019-04-30 18:23:19 +02:00
|
|
|
why something does not appear as expected.
|
|
|
|
</para>
|
|
|
|
</section>
|
|
|
|
|
|
|
|
<section id="utilities-command-line-hbsubset">
|
|
|
|
<title>hb-subset</title>
|
|
|
|
<para>
|
2020-04-17 16:54:40 +02:00
|
|
|
<emphasis><command>hb-subset</command></emphasis> allows you
|
2019-04-30 18:23:19 +02:00
|
|
|
to generate a subset of a given font, with a limited set of
|
|
|
|
supported characters, features, and variation settings.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
By default, you provide an input font and an input text string
|
2020-04-17 16:54:40 +02:00
|
|
|
as the arguments to <command>hb-subset</command>, and it will
|
2019-04-30 18:23:19 +02:00
|
|
|
generate a font that covers the input text exactly like the
|
|
|
|
input font does, but includes no other characters or features.
|
|
|
|
</para>
|
|
|
|
<programlisting>
|
|
|
|
<command>hb-subset</command> <optional>[OPTIONS]</optional>
|
|
|
|
<parameter>path/to/font/file.ttf</parameter>
|
|
|
|
<parameter>yourinputtext</parameter>
|
|
|
|
</programlisting>
|
|
|
|
<para>
|
|
|
|
For example, to create a subset of Noto Serif that just includes the
|
|
|
|
numerals and the lowercase Latin alphabet, you could run
|
|
|
|
</para>
|
|
|
|
<programlisting>
|
|
|
|
<command>hb-subset</command> <optional>[OPTIONS]</optional>
|
|
|
|
<parameter>NotoSerif-Regular.ttf</parameter>
|
|
|
|
<parameter>0123456789abcdefghijklmnopqrstuvwxyz</parameter>
|
|
|
|
</programlisting>
|
|
|
|
<para>
|
|
|
|
There are options available to remove hinting from the
|
|
|
|
subsetted font and to specify a list of variation-axis settings.
|
|
|
|
</para>
|
|
|
|
</section>
|
|
|
|
|
|
|
|
</section>
|
|
|
|
|
|
|
|
<section id="utilities-common-types-apis">
|
|
|
|
<title>Common data types and APIs</title>
|
|
|
|
<para>
|
|
|
|
HarfBuzz includes several APIs for working with general-purpose
|
|
|
|
data that you may find convenient to leverage in your own
|
|
|
|
software. They include set operations and integer-to-integer
|
|
|
|
mapping operations.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
HarfBuzz uses set operations for internal bookkeeping, such as
|
|
|
|
when it collects all of the glyph IDs covered by a particular
|
|
|
|
font feature. You can also use the set API to build sets, add
|
|
|
|
and remove elements, test whether or not sets contain particular
|
|
|
|
elements, or compute the unions, intersections, or differences
|
|
|
|
between sets.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
All set elements are integers (specifically,
|
|
|
|
<type>hb_codepoint_t</type> 32-bit unsigned ints), and there are
|
|
|
|
functions for fetching the minimum and maximum element from a
|
|
|
|
set. The set API also includes some functions that might not
|
|
|
|
be part of a generic set facility, such as the ability to add a
|
|
|
|
contiguous range of integer elements to a set in bulk, and the
|
|
|
|
ability to fetch the next-smallest or next-largest element.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
The HarfBuzz set API includes some conveniences as well. All
|
|
|
|
sets are lifecycle-managed, just like other HarfBuzz
|
|
|
|
objects. You increase the reference count on a set with
|
|
|
|
<function>hb_set_reference()</function> and decrease it with
|
|
|
|
<function>hb_set_destroy()</function>. You can also attach
|
|
|
|
user data to a set, just like you can to blobs, buffers, faces,
|
|
|
|
fonts, and other objects, and set destroy callbacks.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
HarfBuzz also provides an API for keeping track of
|
|
|
|
integer-to-integer mappings. As with the set API, each integer is
|
|
|
|
stored as an unsigned 32-bit <type>hb_codepoint_t</type>
|
|
|
|
element. Maps, like other objects, are reference counted with
|
|
|
|
reference and destroy functions, and you can attach user data to
|
|
|
|
them. The mapping operations include adding and deleting
|
|
|
|
integer-to-integer key:value pairs to the map, testing for the
|
|
|
|
presence of a key, fetching the population of the map, and so on.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
There are several other internal HarfBuzz facilities that are
|
|
|
|
exposed publicly and which you may want to take advantage of
|
|
|
|
while processing text. HarfBuzz uses a common
|
|
|
|
<type>hb_tag_t</type> for a variety of OpenType tag identifiers (for
|
|
|
|
scripts, languages, font features, table names, variation-axis
|
|
|
|
names, and more), and provides functions for converting strings
|
|
|
|
to tags and vice-versa.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
Finally, HarfBuzz also includes data type for Booleans, bit
|
|
|
|
masks, and other simple types.
|
|
|
|
</para>
|
|
|
|
</section>
|
|
|
|
|
|
|
|
<section id="utilities-ucdn">
|
|
|
|
<title>UCDN</title>
|
|
|
|
<para>
|
|
|
|
HarfBuzz includes a copy of the <ulink
|
|
|
|
url="https://github.com/grigorig/ucdn">UCDN</ulink> (Unicode
|
|
|
|
Database and Normalization) library, which provides functions
|
|
|
|
for accessing basic Unicode character properties, performing
|
|
|
|
canonical composition, and performing both canonical and
|
|
|
|
compatibility decomposition.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
Currently, UCDN supports direct queries for several more character
|
|
|
|
properties than HarfBuzz's built-in set of Unicode functions
|
|
|
|
does, such as the BiDirectional Class, East Asian Width, Paired
|
|
|
|
Bracket and Resolved Linebreak properties. If you need to access
|
|
|
|
more properties than HarfBuzz's internal implementation
|
|
|
|
provides, using the built-in UCDN functions may be a useful solution.
|
|
|
|
</para>
|
|
|
|
<para>
|
|
|
|
The built-in UCDN functions are compiled by default when
|
|
|
|
building HarfBuzz from source, but this can be disabled with a
|
|
|
|
compile-time switch.
|
|
|
|
</para>
|
|
|
|
</section>
|
|
|
|
|
|
|
|
</chapter>
|