Incremental Font Transfer

1. Introduction

This section is not normative.

Incremental Font Transfer (IFT) is a technology to improve the latency of remote fonts (or "web fonts") on the web. Without this technology, a browser needs to download every last byte of a font before it can render any characters using that font. IFT allows the browser to download only some of the bytes in the file, thereby decreasing the perceived latency between the time when a browser realizes it needs a font and when the necessary text can be rendered with that font.

The success of WebFonts is unevenly distributed. This specification allows WebFonts to be used where slow networks, very large fonts, or complex subsetting requirements currently preclude their use. For example, even using WOFF 2 [WOFF2], fonts for CJK languages can be too large to be practical.

1.1. Technical Motivation: Evaluation Report

See the Progressive Font Enrichment: Evaluation Report [PFE-report] for the investigation which led to this specification.

1.2. Overview

An IFT font is a regular OpenType font that is reformated to include incremental functionality, partly in virtue of of two additional tables. Using these new tables the font can be augmented (eg. to cover more codepoints) by loading and applying patches to it.

The IFT technology has four main pieces:

§ 4 Extensions to the Font Format: defines the new tables which contain a list of patches that are available to be applied to a font.
§ 5 Font Patch Formats: defines three different types of patches that can be used. Two are "generic" binary patches, one is specific to the font’s format for storing glyph data.
§ 6 Extending a Font Subset: provides the algorithm that is used by a client to select and apply patches.
§ 7 Encoder: creates the font and associated patches that form an incremental font.

At a high level an IFT font is used like this:

The client downloads an initial font file, which contains some initial subset of data from the full version of the font along with embedded data describing the set of patches which can be used to extend the font.
Based on the content to be rendered, the client selects, downloads, and applies patches to extend the font to cover additional characters. This step is repeated each time there is new content.

1.3. Creating an Incremental Font

It is expected that the most common way to produce an incremental font will be to convert an existing font to use the incremental encoding defined in this specification. At a high level converting an existing font to be incremental will look like this:

Choose the content of the initial subset, this will be included in the initial font file the client loads and usually consists of any data from the original font that is expected to always be needed.
Choose a segmentation of the font. Individual segments will be added to the base subset by the client using patches. Choosing an appropriate segmentation is one of the most important parts of producing an efficient encoding.
Based on these choices, generate a set of patches, where each patch adds the data for a particular segment relative to either the initial font file or a previous patch.
Generate the initial font file including the initial subset and a patch mapping. This mapping lists all of the available patch files, the url they reside at, and information on what data the patch will add to the font.

Note: this is a highly simplified description of creating an incremental font, a more in-depth discussion of generating an encoding and requirements on the encoding can be found in the § 7 Encoder section.

1.4. Performance Considerations and the use of Incremental Font Transfer

Using incremental transfer may not always be beneficial, depending on the characteristics of the font, the network, and the content being rendered. This section provides non-normative guidance to help decide when incremental transfer should be utilized.

Incrementally loading a font has a fundamental performance trade off versus loading the whole font. Simplistically, under incremental transfer less bytes may be transferred at the potential cost of increasing the total number of network requests being made, and/or increased request processing latency. In general incremental font transfer will be beneficial where the reduction in latency from sending less bytes outweighs additional latency introduced by the incremental transfer method.

The first factor to consider is the language of the content being rendered. The evaluation report contains the results of simulating incremental font transfer across three categories of languages (Progressive Font Enrichment: Evaluation Report § langtype). See it’s conclusions Progressive Font Enrichment: Evaluation Report § conclusions for a discussion of the anticipated performance of incremental font transfer across the language categories.

Next, how much of the font is expected to be needed? If it’s expected that most of the font will be needed to render the content, then incremental font transfer is unlikely to be beneficial. In many cases however only part of a font is expected to be needed. For example:

If the font contains support for several languages but a user is expected to only render content in a subset of those languages.
If the content being rendered uses a small subset of the total characters in a font. This is often the case for Chinese, Japanese, Korean, Emoji, and Icon fonts.
Only a small amount of text is being rendered. For example a font that is only used for a headline.

An alternative to incremental transfer is to break a font into distinct subsets (typically by script) and use the unicode range feature of @font-face to load only the subsets needed. However, this can alter the rendering of some content Progressive Font Enrichment: Evaluation Report § fail-subset if there are layout rules between characters in different subsets. Incremental font transfer does not suffer from this issue as it can emcompass the original font and all of it’s layout rules.

1.4.1. Reducing the Number of Network Requests

As discussed in the previous section the most basic implementation of incremental font transfer will tend to increase the total number of requests made vs traditional font loading. Since each augmentation will typically require at least one round trip time, performance can be negatively impacted if too many requests are made. Depending on which patch types are available and how much information is provided in the patch mapping, a client might preemptively request patches for codepoints that are not currently needed, but expected to be needed in the future. Intelligent use of this feature by an implementation can help reduce the total number of requests being made. The evaluation report explored this by testing the performance of a basic character frequency based codepoint prediction scheme and found it improved overall performance.

2. Opt-In Mechanism

Web pages can choose to opt-in to incremental transfer for a font via the use of a CSS font tech keyword (CSS Fonts 4 § 11.1 Font tech) inside the ''@font-face'' block. The keyword incremental is used to indicate the referenced font contains IFT data and should only be loaded by a user agent which supports incremental font transfer.

@font-face {
    font-family: "MyCoolWebFont";
    src: url("MyCoolWebFont.otf") tech(incremental);
}

@font-face’s that include the incremental tech keyword should also include a unicode-range descriptor. This informs the client which codepoints are available in the font prior to making the first request, which can be used to avoid requesting the font unnecessarily.

Note: Each individual @font-face block may or may not opt-in to IFT. This is due to the variety of ways fonts are used on web pages. Authors have control over which fonts they want to use this technology with, and which they do not.

Note: the IFT tech keyword can be used in conjunction with other font tech specifiers to perform font feature selection. For example a @font-face could include two URIs one with tech(incremental, color-COLRv1) and the other with tech(incremental, color-COLRv0).

2.1. Offline Usage

Special consideration must be taken when saving a page for offline usage that uses an incrementally transferred font since the saved page won’t be able to increment the font if content changes (eg. due to JavaScript execution). In these cases the page saving mechanism should fully expand the incremental font by invoking Fully Expand a Font Subset and replace references to the incremental font with the fully expanded one.

3. Definitions

3.1. Font Subset

A font subset is a modified version of a font file [iso14496-22] that contains only the data needed to render a subset of:

the codepoints,
layout features,
and design-variation space.

supported by the original font. When a subsetted font is used to render text using any combination of the subset codepoints, layout features, or design-variation space it should render identically to the original font. This includes rendering with the use of any optional typographic features that a renderer may choose to use from the original font, such as hinting instructions. Design variation spaces are specified using the user-axis scales (OpenType Specification § otvaroverview#coordinate-scales-and-normalization).

A font subset definition describes the minimum data (codepoints, layout features, variation axis space) that a font subset should support.

Note: For convenience the remainder of this document links to the [open-type] specification which is a copy of [iso14496-22].

3.2. Data Types

Encoded data structures in the remainder of this specification are described in terms of the data types defined in OpenType Specification § otff#data-types. As with the rest of OpenType, all fields use "big-endian" byte ordering.

3.3. URI Templates

URI templates [rfc6570] are used to convert numeric indices into URIs where patch files are located. Given a numeric index value, several variables are defined which are used to produce the expansion of the template:

Variable	Value
id	The numeric index value converted to a string in hexadecimal representation (using the digits 0-9, A-F). No padding with 0’s is used.
d1	The first digit (least significant digit) of the numeric index in hexadecimal.
d2	The second digit of the numeric index in hexadecimal, or 0 if the index does not have a second digit
d3	The third digit of the numeric index in hexadecimal, or 0 if the index does not have a third digit
d4	The fourth digit of the numeric index in hexadecimal, or 0 if the index does not have a fourth digit

Some example inputs and the corresponding expansions:

Template	Input Index	Expansion
//foo.bar/{id}	123	//foo.bar/7B
//foo.bar{/d1,d2,id}	123	//foo.bar/B/7/7B
//foo.bar{/d1,d2,d3,id}	123	//foo.bar/B/7/0/7B

4. Extensions to the Font Format

An incremental font follows the existing OpenType format, but include two new tables identified by the 4-byte tags 'IFT ' and 'IFTX'. These new tables are both patch maps, which encode a collection of mappings from font subset definitions to URIs which host patches that extend the incremental font. All incremental fonts must contain the 'IFT ' table. The 'IFTX' table is optional. When both tables are present, the mapping of the font as a whole is the union of the mappings of the two tables.

Note: allowing the mapping to be split between two distinct tables allows an incremental font to more easily make use of multiple patch types. For example all patches of one type can be specified in the 'IFT ' table, and all patches of a second type in the 'IFTX' table. Those patches can make updates only to one of the mapping tables and avoid making conflicting updates.

4.1. Patch Map Table

A patch map table encodes a list of patch map entries, where each entry has a key and value. The key is a font subset definition and the value is a URI and the § 5 Font Patch Formats used by the data at the URI. A map is encoded in one of two formats:

Format 1: a limited, but more compact encoding. It encodes a one-to-one mapping from glyph id to patch URIs. It does not support font subset definitions with design space or entries with overlapping subset definitions.
Format 2: can encode arbitrary mappings including ones with design space or overlapping subset definitions. However, it is typically less compact than format 1.

4.1.1. Patch Map Table: Format 1

Format 1 Patch Map encoding:

Type	Name	Description
uint8	format	Set to 1, identifies this as format 1.
uint32	reserved	Not used, set to 0.
uint32	id[4]	Unique ID used to identify patches that are compatible with this font.
uint32	entryCount	Number of entries encoded in this table.
uint32	glyphCount	Number of glyphs that mappings are provided for. Must match the numGlyphs field from maxp.
Offset32	glyphMapOffset	Offset to a Glyph Map sub table. Offset is from the start of this table.
Offset32	featureMapOffset	Offset to a Feature Map sub table. Offset is from the start of this table.
uint8	appliedEntriesBitMap[(entryCount + 7)/8]	A bit map which tracks which entries have been applied. If bit `i` is set that indicates the patch for entry `i` has been applied to this font. Bit 0 is the least significant bit of appliedEntriesBitMap[0], while bit 7 is the most significant bit. Bit 8 is the least significant bit of appliedEntriesBitMap[1] and so on.
uint16	uriTemplateLength	Length of the uriTemplate string.
uint8	uriTemplate[uriTemplateLength]	A [UTF-8] encoded string. Contains a § 3.3 URI Templates which is used to produce URIs associated with each entry.
uint8	patchEncoding	Specifies the format of the patches linked to by uriTemplate. Using the ID number from the § 5.2 Formats Summary table.

Glyph Map encoding:

A glyph map table associates each glyph index in the font with an entry index.

Type	Name	Description
uint16	firstMappedGlyph	All glyph indices less than firstMappedGlyph are implicitly mapped to entry index 0.
uint8/uint16	entryIndex[glyphCount - firstMappedGlyph]	The entry index for glyph `i` is stored in entryIndex[`i` - firstMappedGlyph]. Array members are uint8 if entryCount is less than 256, otherwise they are uint16.

Feature Map encoding:

A feature map table associates combinations of feature tags and glyphs with an entry index.

Type	Name	Description
uint16	featureCount	Number of featureRecords.
FeatureRecord	featureRecords[featureCount]	Provides mappings for a specific feature tag. Must be sorted by featureTag with any feature tag occurring at most once.
EntryMapRecord	entryMapRecords[variable]	Provides the key (entry index) for each feature mapping. The entryMapRecords array contains as many entries as the sum of the entryMapCount fields in the featureRecords array, with entryMapRecords[0] corresponding to the first entry of featureRecords[0], entryMapRecords[featureRecord[0].entryMapCount] corresponding to the first entry of featureRecords[1], entryMapRecords[featureRecords[0].entryMapCount + featureRecord[1].entryMapCount]] corresponding to the first entry of featureRecords[2], and so on.

FeatureRecord encoding:

Type	Name	Description
Tag	featureTag	The feature tag this mapping is for.
uint8/uint16	firstEntryIndex	uint8 if entryCount is less than 256, otherwise uint16. The first entry index this record maps too.
uint8/uint16	entryMapCount	uint8 if entryCount is less than 256, otherwise uint16. The number of EntryMapRecords associated with this feature.

EntryMapRecord encoding:

Type	Name	Description
uint8/uint16	firstEntryIndex	uint8 if entryCount is less than 256, otherwise uint16.
uint8/uint16	lastEntryIndex	uint8 if entryCount is less than 256, otherwise uint16. Must be greater than or equal to firstEntryIndex.

An entry map record matches any entry indices that are greater than or equal to firstEntryIndex and less than or equal to lastEntryIndex.

4.1.1.1. Interpreting Format 1

This algorithm is used to convert a format 1 patch map into a list of patch map entries.

Interpret Format 1 Patch Map

The inputs to this algorithm are:

patch map: a § 4.1.1 Patch Map Table: Format 1 encoded patch map.
font subset: the font subset which contains patch map.

The algorithm outputs:

entry list: a list of patch map entries.

The algorithm:

Check that the patch map is valid according to the requirements in § 4.1.1 Patch Map Table: Format 1. If it is not return an error.
For each unique entry index in Glyph Map:
- a. If the bit for entry index in appliedEntriesBitMap is set to 1, skip this entry index.
- b. Collect the set of glyph indices that map to the entry index.
- c. Convert the set of glyph indices to a set of unicode code points using the code point to glyph mapping in the cmap table of font subset. Ignore any glyph indices that are not mapped by cmap.
- d. Convert entry index into a URI by applying uriTemplate following § 3.3 URI Templates.
- e. Add an entry to entry list whose subset definition contains only the unicode code point set and maps to the generated URI and the patch encoding specified by patchEncoding.
For each FeatureRecord and associated EntryMapRecord in featureRecords:
- For each entry index between firstEntryIndex (inclusive) and lastEntryIndex (inclusive), find the set of unicode codepoints associated with that entry index using the same process as in step 2b through 2c.
  - Compute mapped entry index = FeatureRecord::firstEntryIndex + entry index - EntryMapRecord::firstEntryIndex.
  - Convert mapped entry index into a URI by applying uriTemplate following § 3.3 URI Templates.
  - If the bit for mapped entry index in appliedEntriesBitMap is set to 1, skip this entry index.
  - Add an entry to entry list whose subset definition contains the unicode code point set and a feature tag set containing only featureTag. The entry maps to the generated URI and the patch encoding specified by patchEncoding.

4.1.2. Patch Map Table: Format 2

Format 2 Patch Map encoding:

Type	Name	Description
uint8	format	Set to 2, identifies this as format 2.
uint32	reserved	Not used, set to 0.
uint32	id[4]	Unique ID used to identify patches that are compatible with this font.
uint8	defaultPatchEncoding	Specifies the format of the patches linked to by uriTemplate (unless overridden by an entry). Uses the ID numbers from the § 5.2 Formats Summary table.
uint16	entryCount	Number of entries encoded in this table.
Offset32	entries	Offset to a Mapping Entries sub table. Offset is from the start of this table.
uint16	uriTemplateLength	Length of the uriTemplate string.
uint8	uriTemplate[uriTemplateLength]	A [UTF-8] encoded string. Contains a § 3.3 URI Templates which is used to produce URIs associated with each entry.

Mapping Entries encoding:

Type	Name	Description
uint8	entries[variable]	Byte array containing the encoded bytes of entryCount Mapping Entry's. Each entry has a variable length, which is determined following Interpret Format 2 Patch Map Entry.

Mapping Entry encoding:

Type	Name	Description
uint8	formatFlags	A bit field. Bit 0 (least significant bit) through bit 5 indicate the presence of optional fields. If bit 6 is set this entry is ignored. Bit 7 is reserved for future use and set to 0.
uint8	featureCount	Number of feature tags in the featureTags list. Only present if formatFlags bit 0 is set.
Tag	featureTags[featureCount]	List of feature tags in the entry’s font subset definition. Only present if formatFlags bit 0 is set.
uint8	designSpaceCount	Number of elements in the design space list. Only present if formatFlags bit 0 is set.
Design Space Segment	designSpaceSegments[designSpaceCount]	List of design space segments in the entry’s font subset definition. Only present if formatFlags bit 0 is set.
uint8	copyCount	Number of entries in the copyIndices list. Only present if formatFlags bit 1 is set.
uint16	copyIndices[copyCount]	List of indices from the entries array whose font subset definition should be copied into this entry. All values must be less than the index of this Mapping Entry in entries. Only present if formatFlags bit 1 is set.
int16	entryIdDelta	Signed delta which is used to calculate the id for this entry. The id for this entry is the entry id of the previous Mapping Entry + 1 + entryIdDelta. Only present if formatFlags bit 2 is set. If not present delta is assumed to be 0.
uint8	patchEncoding	Specifies the format of the patch linked to by this entry. Uses the ID numbers from the § 5.2 Formats Summary table. Overrides defaultPatchEncoding. Only present if formatFlags bit 3 is set.
uint16/uint24	bias	Bias value which is added to all codepoint values in the codepoints set. If format bit 4 is 0 and bit 5 is 1, then this is present and a uint16. If format bit 4 is 1 and bit 5 is 1, then this is present and a uint24. Otherwise it is not present.
uint8	codepoints[variable]	Set of codepoints for this mapping. Encoded as a § 4.1.2.2 Sparse Bit Set. Only present if formatFlags bit 4 and/or 5 is set. The length is determined by following the decoding procedures in § 4.1.2.2 Sparse Bit Set.

Design Space Segment encoding:

Type	Name	Description
Tag	tag	Axis tag value.
Fixed	start	Start (inclusive) of the segment. This value uses the user axis scale: OpenType Specification § otvaroverview#coordinate-scales-and-normalization.
Fixed	end	End (inclusive) of the segment. Must be greater than or equal to start. This value uses the user axis scale: OpenType Specification § otvaroverview#coordinate-scales-and-normalization.

4.1.2.1. Interpreting Format 2

This algorithm is used to convert a format 2 patch map into a list of patch map entries.

Interpret Format 2 Patch Map

The inputs to this algorithm are:

patch map: a § 4.1.2 Patch Map Table: Format 2 encoded patch map.

The algorithm outputs:

entry list: a list of patch map entries.

The algorithm:

Check that the patch map is valid according to the requirements in § 4.1.2 Patch Map Table: Format 2. If it is not return an error.
Initialize last entry id to 0, and current byte to 0.
Invoke Interpret Format 2 Patch Map Entry, entryCount times. For each invocation:
- pass in the bytes from patch map starting from entries[current byte] to the end of patch map, last entry id, defaultPatchEncoding, and uriTemplate.
- Set last entry id to the returned entry id.
- Add the returned consumed byte count to current byte.
- If the returned value of ignored is false, then add the returned entry to entry list.
Return entry list.

Interpret Format 2 Patch Map Entry

The inputs to this algorithm are:

entry bytes: a byte array that contains an encoded Mapping Entry.
last entry id: the entry id of the entry preceding this one.
default patch encoding: the default patch encoding if one isn’t specified.
uri template: the URI template used to locate patches.

The algorithm outputs:

entry id: the numeric id of this entry.
entry: a single entry.
consumed bytes: the number of bytes used to encode the entry.
ignored: if true, then this entry should be ignored.

The algorithm:

For the all steps whenever data is loaded from entry bytes increment consumed bytes with the number of bytes read.
Set entry id = last entry id + 1.
Set the patch encoding of entry to default patch encoding.
Read formatFlags from entry bytes.
If formatFlags bit 0 is set, then the feature tag and design space lists are present:
- Read the feature tag list specified by featureCount and featureTags from entry bytes and add the loaded tags to entry.
- Read the design space segment list specified by designSpaceCount and designSpaceSegments from entry bytes and add the design space segments to entry.
If formatFlags bit 1 is set, then the copy indices list is present:
- Read the copy indices list specified by copyCount and copyIndices from entry bytes.
- The copy indices refer to previously loaded entries. 0 is the first Mapping Entry in entries, 1 the second and so on. For each index in copyIndices locate the previously loaded entry with a matching index and add all code points, feature tags, and design space segments from it to entry.
If formatFlags bit 2 is set, then an id delta is present. Read the id delta specified by entryIdDelta from entry bytes and add the delta to entry id. If entry id is negative this is an error and the mapping is malformed.
If formatFlags bit 3 is set, then a patch encoding is present. Read the encoding specified by patchEncoding from entry bytes and set the patch encoding of entry to the read value.
If one or both of formatFlags bit 4 and bit 5 are set, then a codepoint list is present:
- If formatFlags bit 4 is 0 and bit 5 is 1, then read the 2 byte (uint16) bias value from entry bytes.
- If formatFlags bit 4 is 1 and bit 5 is 1, then read the 3 byte (uint24) bias value from entry bytes.
- Otherwise the bias is 0.
- Read the sparse bit set codepoints from entry bytes following § 4.1.2.2 Sparse Bit Set. For each codepoint add the bias value to it and then add the result to the codepoint set in entry.
If formatFlags bit 6 is set, then set ignored to true. Otherwise ignored is false.
Convert entry id into a URI by applying uri template following § 3.3 URI Templates. Set the patch uri of entry to the generated URI.
Return entry id, entry, consumed bytes, and ignored.

4.1.2.2. Sparse Bit Set

A sparse bit set is a data structure which compactly stores a set of distinct unsigned integers. The set is represented as a tree where each node has a fixed number of children that recursively sub-divides an interval into equal partitions. A tree of height H with branching factor B can store set membership for integers in the interval [0 to B^H-1] inclusive. The tree is encoded into an array of bytes for transport.

Sparse Bit Set encoding:

Type	Name	Description
uint8	header	Bits 0 (least significant) and 1 encode the trees branch factor B. Bits 2 through 6 are a 5-bit unsigned integer which encodes the value of H. Bit 7 is set to 0 and reserved for future use.
uint8	treeData[variable]	Binary encoding of the tree.

The exact length of treeData is initially unknown, it’s length is determined by executing the decoding algorithm. When using branch factors of 2 or 4 the last node may only partially consume the bits in a byte. In that case all remaining bits are unused and ignored.

Branch Factor Encoding:

Bit 1	Bit 0	Branch Factor (B)
0	0	2
0	1	4
1	0	8
1	1	32

Decoding sparse bit set treeData

The inputs to this algorithm are:

H: the tree height, H, from the header byte.
B: the branch factor, B, from the header byte.
treeData: array of bytes to be decoded.

The algorithm outputs:

S: a set of unsigned integers.

The algorithm, using a FIFO (first in first out) queue Q:

If H is equal to 0, then this is an empty set and no bytes of treeData are consumed. Return an empty set.
Insert the tuple (0, 1) into Q.
Initialize S to an empty set.
treeData is interpreted as a string of bits where the least significant bit of treeData[0] is the first bit in the string, the most significant bit of treeData[0] is the 8th bit, and so on.
If Q is empty return S.
Extract the next tuple t from Q. The first value of in t is start and the second value is depth.
Remove the next B bits from the treeData bit string. The first removed bit is v₁, the second is v₂, and so on until the last removed bit which is v_B. If prior to removal there were less than B bits left in treeData, then treeData is malformed, return an error.
If all bits v₁ through v_B are 0, then insert all integers in the interval [start, start + B^{H - depth + 1}) into S. Go to step 5.
For each v_i which is equal to 1 in v₁ through v_B: If depth is equal to H add integer start + i - 1 to S. Otherwise, insert the tuple (start + (i - 1) * B^{H - depth}, depth + 1) into Q.
Go to step 5.

Note: when encoding sparse bit sets the encoder can use any of the possible branching factors, but it is recommended to use 4 as that has been shown to give the smallest encodings for most unicode codepoint sets typically encountered.

The set {2, 33, 323} in a tree with a branching factor of 8 can be encoded as the bit string:

bit string:
|-- header --|- lvl 0 |---- level 1 ----|------- level 2 -----------|
| B=8 H=3    |   n0   |   n1       n2   |   n3       n4       n5    |
[ 01  11000 0 10000100 10001000 10000000 00100000 01000000 00010000 ]

Which then becomes the byte string:
[
  0b00001110,
  0b00100001,
  0b00010001,
  0b00000001,
  0b00000100,
  0b00000010,
  0b00001000
]

The empty set in a tree with a branching factor of 2 is encoded as the bit string:

bit string:
|-- header -- |
| B=2 H=0     |
[ 00  00000 0 ]

Which then becomes the byte string:
[
  0b00000000,
]

The set {0, 1, 2, ..., 17} can be encoded with a branching factor of 4 as:

bit string:
|-- header --| l0 |- lvl 1 -| l2  |
| B=4 H=3    | n0 | n1 | n2 | n3  |
[ 10  11000 0 1100 0000 1000 1100 ]

byte string:
[
  0b00001101,
  0b00000011,
  0b00110001
]

5. Font Patch Formats

In incremental font transfer font subsets are extended by applying patches. This specification defines three patch formats, each appropriate to its own set of augmentation scenarios. A single encoding can make use of more than one patch format.

5.1. Definitions

A font patch is a file which encodes changes to be made to an IFT-encoded font. Patches are used to extend an existing font subset and provide expanded coverage.

A patch format is a specified encoding of changes to be applied relative to a font subset. A set of changes encoded according to the format is a font patch. Each patch format has an associated patch application algorithm which takes a font subset and a font patch encoded in the patch format as input and outputs an extended font subset. Each patch application algorithm can be categorized as either independent or dependent. Given a set of font patches, P, that are all valid to be applied to a specific font subset, F:

dependent patches are only valid to be applied to font subset F. Applying any patches from P to font subset F will change the subset and thus invalidate any remaining dependent patches in P.
independent patches are only valid to be applied to font subset F and any font subset derived from F by the application of one or more independent patches from P.

5.2. Formats Summary

The following patch formats are defined by this specification:

Name	Type	ID	Description
§ 5.3 Shared Brotli Patch	dependent	1	A shared brotli encoded binary diff that uses the entire font subset as a base.
§ 5.4 Per Table Shared Brotli	dependent	2	A collection of shared brotli encoded binary diffs that use tables from the font subset as bases.
§ 5.5 Glyph Keyed	independent	3	Contains a collection of opaque binary blobs, each associated with a glyph id and table.

More detailed descriptions of each algorithm can be found in the following sections.

5.3. Shared Brotli Patch

In a shared brotli patch the target file is encoded with brotli compression using the source file as a shared LZ77 dictionary. A shared brotli encoded patch consists of a short header followed by brotli encoded data.

Shared brotli patch encoding:

Type	Name	Description
Tag	format	Identifies the encoding as shared brotli, set to 'ifbr'
uint32	id[4]	The id of the font subset which this patch can be applied too.
uint8	brotliStream[variable]	Brotli encoded byte stream.

5.3.1. Applying Shared Brotli Patches

This algorithm is used to apply a shared brotli patch to extend a font subset to cover additional codepoints, features, and/or design-variation space.

Apply shared brotli patch

The inputs to this algorithm are:

base font subset: a font subset which is to be extended.
patch: a shared brotli patch to be applied to base font subset.

The algorithm outputs:

extended font subset: a font subset that has been extended by the patch.

The algorithm:

Check that the format field in patch is equal to 'ifbr', if it is not equal, then patch is not correctly formatted. Patch application has failed, return an error.
Check that the id field in patch is equal to the at least one of the ids found in the 'IFT ' or 'IFTX' table of base font subset. If there is no match, or base font subset does not have either an 'IFT ' or 'IFTX' table, then patch application has failed, return an error.
Decode the brotli encoded data in brotliStream following Brotli Compressed Data Format § section-10 and using the base font subset as a shared LZ77 dictionary. Return the decoded result as the extended font subset

5.4. Per Table Shared Brotli

A per table shared brotli patch contains a collection of patches which are applied to the individual font tables in the input font file. Each table patch is encoded with brotli compression using the corresponding table from the input font file as a shared LZ77 dictionary. A per table shared brotli encoded patch consists of a short header followed by one or more brotli encoded patches. In addition to patching tables, patches may also replace (existing table data is not used) or remove tables in a font subset.

Per table shared brotli patch encoding:

Type	Name	Description
Tag	format	Identifies the encoding as shared brotli, set to 'ifbt'
uint32	id[4]	The id of the font subset which this patch can be applied too.
uint16	patchesCount	The number of entries in the patches array.
Offset32	patches[patchesCount+1]	Each entry is an offset from the start of this table to a TablePatch. Offsets must be sorted in ascending order.

The difference between two consecutive offsets in the patches array gives the size of that TablePatch.

TablePatch encoding:

Type	Name	Description
Tag	tag	The tag that identifies the font table this patch applies too.
uint8	flags	Bit-field. If bit 0 (least significant bit) is set this patch replaces the existing table. If bit 1 is set this table is removed.
uint8	brotliStream[variable]	Brotli encoded byte stream.

5.4.1. Applying Per Table Shared Brotli Patches

This algorithm is used to apply a per table shared brotli patch to extend a font subset to cover additional codepoints, features, and/or design-variation space.

Apply per table shared brotli patch

The inputs to this algorithm are:

base font subset: a font subset which is to be extended.
patch: a per table shared brotli patch to be applied to base font subset.

The algorithm outputs:

extended font subset: a font subset that has been extended by the patch.

The algorithm:

Check that the format field in patch is equal to 'ifbt', if it is not equal then patch is not correctly formatted. Patch application has failed, return an error.
Check that the id field in patch is equal to the at least one of the ids found in the 'IFT ' or 'IFTX' table of base font subset. If there is no match, or base font subset does not have either an 'IFT ' or 'IFTX' table, then patch application has failed, return an error.
For each entry in patches, with index i:
- Find the TablePatch associated with index i. The object starts at the offset patches[i] (inclusive) and ends at the offset patches[i+1] (exclusive). Both offsets are relative to the start of the patch.
- If an entry in patches was previously applied that has the same tag as this entry, then ignore this entry and continue the iteration to the next one. Entries are processed in same order as they are listed in the patches array.
- If bit 0 (least significant bit) of flags is set, then decode brotliStream following Brotli Compressed Data Format § section-10. No shared dictionary is used. Add a table to extended font subset identified by tag with it’s contents set to the decoded brotliStream.
- If bit 1 of flags is set, then do not copy or add a table to extended font subset identified by tag.
- Otherwise, decode brotliStream following Brotli Compressed Data Format § section-10 and using the table identified by tag in base font subset as a shared LZ77 dictionary. Add a table to extended font subset identified by tag with it’s contents set to the decoded brotliStream.
For each table in base font subset which has a tag that was not found in any of the entries processed in step 3, add a copy of that table to extended font subset.

5.5. Glyph Keyed

A glyph keyed patch contains a collection of data chunks that are each associated with a glyph index and a font table. The encoded data replaces any existing data for that glyph index in the referenced table. Glyph keyed patches can encode data for glyf/loca, gvar, CFF, and CFF2 tables.

Glyph keyed patch encoding:

Type	Name	Description
Tag	format	Identifies the encoding as glyph keyed, set to 'ifgk'
uint32	reserved	Reserved for future use, set to 0.
uint32	id[4]	The id of the font subset which this patch can be applied too.
uint32	length	The uncompressed length of brotliStream.
uint8	brotliStream[variable]	Brotli encoded GlyphPatches table.

GlyphPatches encoding:

Type	Name	Description
uint32	glyphCount	The number of glyphs encoded in the patch.
uint8	tableCount	The number of tables the patch has data for.
uint16	glyphIds[glyphCount]	An array of glyph indices included in the patch.
Tag	tables[tableCount]	An array of tables (by tag) included in the patch.
Offset32	glyphDataOffsets[glyphCount * tableCount + 1]	An array of offsets of to glyph data for each table. The first glyphCount offsets corresponding to tables[0], the next glyphCount offsets (if present) corresponding to tables[1], and so on. All offsets are from the start of the GlyphPatches table. Offsets must be sorted in ascending order.
uint8	glyphData[variable]	The actual glyph data picked out by the offsets.

The difference between two consecutive offsets in the glyphDataOffsets array gives the size of that glyph data.

5.5.1. Applying Glyph Keyed Patches

This algorithm is used to apply a glyph keyed patch to extend a font subset to cover additional codepoints, features, and/or design-variation space.

Apply glyph keyed patch

The inputs to this algorithm are:

base font subset: a font subset which is to be extended.
patch: a glyph keyed patch to be applied to base font subset.

The algorithm outputs:

extended font subset: a font subset that has been extended by the patch.

The algorithm:

Check that the format field in patch is equal to 'ifgk', if it is not equal, then patch is not correctly formatted and patch application has failed, return an error.
Check that the id field in patch is equal to the at least one of the ids found in the 'IFT ' or 'IFTX' table of base font subset. If there is no match, or base font subset does not have either an 'IFT ' or 'IFTX' table, then patch application has failed, return an error.
Decode the brotli encoded data in brotliStream following Brotli Compressed Data Format § section-10. The decoded data is a GlyphPatches table.
For each font table listed in tables, with index i:
- Using the corresponding table in base font subset, synthesize a new table where the data for each glyph is replaced with the data corresponding to that glyph index from glyphData if present, otherwise copied from the corresponding table in base font subset for that glyph index.
- The patch glyph data for a glyph index is located by finding glyphIds[j] equal to the glyph index. The offset to the associated glyph data is glyphDataOffsets[i * glyphCount + j]. The length of the associated glyph data is glyphDataOffsets[i * glyphCount + j + 1] minus glyphDataOffsets[i * glyphCount + j].
- The specific process for synthesizing the new table, depends on the specified format of the table. Any non-glyph associated data should be copied from the table in base font subset. Tables of the type glyf/loca, gvar, CFF, or CFF2 are supported. Entries for tables of any other types must be ignored.
- If base font subset does not have a matching table, return an error.
- Insert the synthesized table into extended font subset.
For each table in base font subset which has a tag that was not found in any of the entries processed in step 4, add a copy of that table to extended font subset.
Modify the mappings in the IFT and IFTX table(s) in extended font subset if present to remove any mapping entries that map to this patch.

6. Extending a Font Subset

This sections defines the algorithm that a client uses to extend an incremental font subset to cover additional codepoints, layout features and/or design space.

Extend a Font Subset

The inputs to this algorithm are:

font subset: an incremental font subset.
target subset definition: the font subset definition that the client wants to extend font subset to cover.

The algorithm outputs:

extended font subset: an extended version of font subset. May or may not be an incremental font.

The algorithm:

Set extended font subset to font subset.
Load the 'IFT ' and 'IFTX' (if present) mapping tables from extended font subset. Both tables are formatted as a § 4.1 Patch Map Table. Check that they are valid according to the requirements in § 4.1 Patch Map Table. If either table is not valid, invoke Handle errors. If extended font subset does not have an 'IFT ' table, then it is not an incremental font and cannot be extended, return extended font subset.
For each of tables 'IFT ' and 'IFTX' (if present): convert the table into a list of entries by invoking Interpret Format 1 Patch Map. Concatenate the output entry lists into a single list, entry list.
For each entry in entry list invoke Check entry intersection with entry and target subset definition as inputs, if it returns false remove entry from entry list.
If entry list is empty, then the extension operation is finished, return extended font subset.
If entry list contains one or more patch map entries which have a patch format that is dependent, then select exactly one of the dependent entries in entry list and remove all others. The criteria for selecting the single dependent entry is left up to the implementation to decide.
For each entry in entry list invoke Load patch file with the entries URI. Collect the returned patch files into patch list.
For each patch in patch list use the appropriate application algorithm (matching the patches format) from § 5 Font Patch Formats to apply the patch to extended font subset. The order of patch application is left up to the implementation.
Go to step 2.

Check entry intersection

The inputs to this algorithm are:

mapping entry: a patch map entry.
subset definition: a font subset definition.

The algorithm outputs:

intersects: true if subset definition intersects mapping entry, otherwise false.

The algorithm:

For each set in subset definition (codepoints, feature tags, design space) check if the set intersects the corresponding set from mapping entry. A set intersects when:

	subset definition set is empty	subset definition set is not empty
mapping entry set is empty	true	true
mapping entry set is not empty	false	true if the two sets intersect

If all sets checked in step 1 intersect, then return true for intersects otherwise false.

Load patch file

The inputs to this algorithm are:

URI: a URI identifying the patch file to load.

The algorithm outputs:

patch file: the patch file identified by URI.

The algorithm:

TODO Coming soon.

Handle errors

Coming soon.

6.1. Fully Expanding a Font

This sections defines an algorithm that can be used to transform an incremental font into a fully expanded non-incremental font. This process loads all available data provided by the incremental font and produces a single static font file that contains no further patches to be applied.

Fully Expand a Font Subset

The inputs to this algorithm are:

font subset: an incremental font subset.

The algorithm outputs:

expanded font: an [open-type] font that is not incremental.

The algorithm:

Invoke Extend a Font Subset with font subset. The input target subset definition is a special one which is considered to intersect all entries in the Check entry intersection step. Return the resulting font subset as the expanded font.

7. Encoder

An encoder is a tool which produces an incremental font and set of associated patches. The incremental font and associated patches produced by a compliant encoder:

Must meet all of the requirements in § 4 Extensions to the Font Format and § 5 Font Patch Formats.
Must be consistent, that is: for any possible font subset definition the result of invoking Extend a Font Subset with that subset definition and the incremental font must always be the same regardless of the particular order of dependent patch selection chosen in step 6 of Extend a Font Subset.
Should preserve the functionality of the fully expanded font, that is: given the fully expanded font derived from the incremental font and any content, then the font subset produced by invoking Extend a Font Subset with the incremental font and the minimal subset definition covering that content should render identically to the fully expanded font for that content.
When an encoder is used to transform an existing font into an incremental font the associated fully expanded font should be equal to the existing font.

When an encoder is used to transform an existing font file into and incremental font and a client is implemented according to the other sections of this document, the intent of the IFT specification is that appearance and behavior of the font in the client will be the same as if the entire file were transferred to the client. A primary goal of the IFT specification is that the IFT format and protocol can serve as a neutral medium for font transfer, comparable to WOFF2. If an encoder produces an encoding from a source font which meets all of the above requirements (1. through 4.), then the encoding will preserve all of the functionality of the original font.

This may be important for cases where a foundry or other rights-owner of a font wants be confident that the encoding and transfer of that font using IFT will not change its behavior and therefore the intent of the font’s creators. Licenses or contracts might then include requirements about IFT conformance, and situations in which re-encoding a font in WOFF2 format is de facto permissible due to its content-neutrality might also permit IFT encoding of that font.

However, nothing about these requirements on encoding conformance is meant to rule out or deprecate the possibility and practical use of encodings that do not preserve all of the functionality of a source font. Any encoding meeting the minimum requirements (1. and 2. above) is valid and may have an appropriate use. Under some circumstances it might be desirable for an encoded font to omit support for some functionality/data from all of its patch files even if those were included in the original font file. In other cases a font might be directly encoded into the IFT format from font authoring source files.

7.1. Encoder Considerations

This section is not normative.

The details of the encoding process may differ by encoder and are beyond the scope of this document. However, this section provides guidance that encoder implementations may want to consider.

Utilize an existing font subsetter implementation

Choosing segmentations

One of the most important and complex decisions an encoder needs to make is how to segment the data in the encoded font. To maximize efficiency the encoder should try and group data (eg. codepoints) that are commonly used together into the same segments. This will reduce the amount of unneeded data load by clients when extending the font. The encoder must also decide the size of segments. Smaller segments will produce more patches, and thus incur more overhead by requiring more network requests, but will typically result in less unneeded data in each segment compared to larger segments. When segmenting codepoints, data on codepoint usage frequency can be helpful to guide segmentation.

During segmentation an encoder should also consider how codepoints interact with each other in the font. Keeping interacting codepoints in the same segment can avoid having to duplicate data between patches, which may be necessary to preserve the functionality of the original font when interacting codepoints reside in different segments.

Dependent patches can form patch graphs

Choosing patch formats

An encoder must choose the appropriate patch type to use. Dependent patches allow all types of data in the font to be patched, while independent patches are limited to updating outline and variation delta data. However, using dependent patches can cause the number of patches to increase exponentially as the number of segments increases. Independent patches on the other hand scale linearly with the number of segments.

Dependent patches are most useful when a font has sizable non-outline data and requires only a small number of segments. Independent patches are most useful when a fonts data is primarily in the outline tables and the font would benefit from using a large segment count.

Using per-table brotli patches and two mapping tables it becomes possible to mix both indepedent and dependent patching in the same incremental font. This can be accomplished by:

Keep all dependent patch entries in one mapping table and all independent entries in the other mapping table.
Use per-table shared brotli patches to update all tables except for the tables touched by the independent patches (outline, variation deltas, and the independent patch mapping table). These patches should use a small number of large segments to keep the patch count reasonable.
Lastly, use independent patches to update the remaining tables, here much smaller fine-grained segments can be utilized without required too many patches.

Managing the number of patches

Privacy Considerations

Coming soon.

Security Considerations

Coming soon.

Changes

Since the Working Draft of 30 May 2023 (see commit history):

Complete rewrite of the specification. Separate 'Patch Subset' and 'Range Request' methods have been removed in favour of a single unified approach.

Incremental Font Transfer

Abstract

Status of this document

1. Introduction

1.1. Technical Motivation: Evaluation Report

1.2. Overview

1.3. Creating an Incremental Font

1.4. Performance Considerations and the use of Incremental Font Transfer

1.4.1. Reducing the Number of Network Requests

2. Opt-In Mechanism

2.1. Offline Usage

3. Definitions

3.1. Font Subset

3.2. Data Types

3.3. URI Templates

4. Extensions to the Font Format

4.1. Patch Map Table

4.1.1. Patch Map Table: Format 1

4.1.1.1. Interpreting Format 1

4.1.2. Patch Map Table: Format 2

4.1.2.1. Interpreting Format 2

4.1.2.2. Sparse Bit Set

5. Font Patch Formats

5.1. Definitions

5.2. Formats Summary

5.3. Shared Brotli Patch

5.3.1. Applying Shared Brotli Patches

5.4. Per Table Shared Brotli

5.4.1. Applying Per Table Shared Brotli Patches

5.5. Glyph Keyed

5.5.1. Applying Glyph Keyed Patches

6. Extending a Font Subset

6.1. Fully Expanding a Font

7. Encoder

7.1. Encoder Considerations

Privacy Considerations

Security Considerations

Changes

Conformance

Document conventions

Conformant Algorithms

Index

Terms defined by this specification

References

Normative References

Informative References