Changes

Jump to navigation Jump to search
no edit summary
== Unicode blocks ==
A '''Unicode block''' is an organisational unit of the Unicode code space. The Unicode code space is the set of all code points, that is, the set of all integers from 0 to 0x10FFFthe integer whose hexadecimal representation is 10FFF. The official list of the blocks is available [ftp://www.unicode.org/Public/UNIDATA/Blocks.txt at the Unicode Web site].
Every block is an interval of code points. Different blocks are disjoint from each other, and every code point belongs to at least one block. In particularThus, the blocks form a partition of the set of all Unicode code points. The number of code points in a block varies. Some have just 16 code points, and some others have thousands of code points.
A code block starts at a code point that is a multiple of 16. The number of code points in each block is also a multiple of 16. Thus, the hexadecimal representation of the first code point in a block is of the form 0x''pqrs''0, and that of the last code point in it is of the form 0x''tuvw''F, where ''p'', ''q'', ''r'', ''s'', ''t'', ''u'', and ''v'', are hexadecimal digits.
The Unicode standard gives every block a unique name that describes the common semantic nature of its code points. These names are case insensitive, and the hyphens, spaces, and underscores, in them are insignificant. For example, one can refer to the block whose Unicode name is <code>Myanmar Extended-A</code> as <code>myanmarextendeda</code>, <code>MyanmarExtendedA</code>, or <code>myanmar_extended_a</code>. ConTeXt chooses the first of these alternative styles for the names of blocks, as described below.
== The list of blocks ==
 
See the article [[List of Unicode blocks]] for a table of Unicode blocks, their ConTeXt names, and links to more information about them.
 
 
The following table lists all the Unicode blocks. Each row of the table describes a block. The first cell in the row is the interval of code points in that block. The second cell is the Unicode name of the block. The third cell is the ConTeXt name of the block. The last cell is a link to the current code chart of the block at the Unicode Web site. This chart contains glyphs for the graphic characters whose code points are in the block, and additional information, such as alternative names of some of the characters.
51

edits

Navigation menu