# Difference between revisions of "Unicode blocks in ConTeXt"

Line 7: | Line 7: | ||

A '''Unicode block''' is an organisational unit of the Unicode code space. The Unicode code space is the set of all integers from 0 to 0x10FFF. | A '''Unicode block''' is an organisational unit of the Unicode code space. The Unicode code space is the set of all integers from 0 to 0x10FFF. | ||

− | Every block is an interval of code points. Different blocks | + | Every block is an interval of code points. Different blocks are disjoint from each other. In particular, the blocks form a partition of the set of all Unicode code points. Every block has a unique name that describes the common semantic nature of its code points. |

− | A code block starts at a code point that is a multiple of 16. The number of code points in each block is also a multiple of 16. Thus, the first code point in a block is of the form '' | + | A code block starts at a code point that is a multiple of 16. The number of code points in each block is also a multiple of 16. Thus, the first code point in a block is of the form 0x''pqrs''0, and the last code point in it is of the form 0x''tuvw''F. |

## Revision as of 18:59, 24 October 2017

A *Unicode block* is an interval of code points which represent characters that are semantically related to each other. For example, there is a Unicode block for characters from the Devanagari script which is used by several Indian languages. Another Unicode block corresponds to characters which denote mathematical operators, such as those that indicate the union and the intersection of sets.

ConTeXt has special names for all Unicode blocks. These names can be used to specify ranges of code points in the setups of several commands.

## Unicode blocks

A **Unicode block** is an organisational unit of the Unicode code space. The Unicode code space is the set of all integers from 0 to 0x10FFF.

Every block is an interval of code points. Different blocks are disjoint from each other. In particular, the blocks form a partition of the set of all Unicode code points. Every block has a unique name that describes the common semantic nature of its code points.

A code block starts at a code point that is a multiple of 16. The number of code points in each block is also a multiple of 16. Thus, the first code point in a block is of the form 0x*pqrs*0, and the last code point in it is of the form 0x*tuvw*F.