Changes

Jump to navigation Jump to search
no edit summary
A Unicode block, or, simply, a block, is any of the subsets of the Unicode code space that are listed in the file {{code|Blocks.txt}}<ref name="Blocks">The Unicode Consortium, ''Blocks.txt'', ftp://www.unicode.org/Public/UNIDATA/Blocks.txt, Retrieved 2017-11-03.</ref> of the Unicode Character Database. The Unicode code space is the set of all code points, that is, the set of all integers from 0 to the integer whose hexadecimal representation is 10FFF.
The main properties of blocks are described in the Unicode standard<ref>The Unicode Consortium, ''The Unicode Standard'', Version 10.0.0, The Unicode Consortium, Mountain View, CA, USA, 2016, http://www.unicode.org/versions/Unicode10.0.0/UnicodeStandard-10.0.pdf, Retrieved 2017-11-03.</ref> (Section 3.4, paragraph D10b). Every block is an interval of code points, and different blocks are disjoint from each other. In particular, the blocks form a partition of a subset of the Unicode code space. The number of code points in a block varies. Some have just 16 code points, and some others have thousands of code points. Every assigned code point belongs to some block, but there are blocks which contain unassigned code points too. Lastly, there are some code points, necessarily unassigned, which do not belong to any block.
A code block starts at a code point that is a multiple of 16. The number of code points in each block is also a multiple of 16. Thus, the hexadecimal representation of the first code point in a block is of the form ''pqrs''0, and that of the last code point in it is of the form ''tuvw''F, where ''p'', ''q'', ''r'', ''s'', ''t'', ''u'', ''v'', and ''w'', are hexadecimal digits.
99

edits

Navigation menu