Basic Latin (Unicode block)
| Basic Latin or C0 Controls and Basic Latin | |
|---|---|
| Range | U+0000..U+007F (128 code points) |
| Plane | BMP |
| Scripts | Latin (52 characters), Common (76 characters) |
| Major alphabets | English, French, German, Spanish, Vietnamese |
| Symbol sets | Arabic numerals, Punctuation |
| Assigned | 128 code points (33 Control or Format) |
| Unused | 0 reserved code points |
| Source standards | ISO/IEC 8859, ISO 646 |
| Unicode version history | |
| 1.0.0 (1991) | 128 (+128) |
| Unicode documentation | |
| Code chart ∣ Web page | |
| Note: [1][2] | |
The Basic Latin Unicode block,[3] sometimes informally called C0 Controls and Basic Latin,[4] is the first block of the Unicode Standard and the only block whose characters are each encoded as a single byte in UTF-8. The block contains every character of the ASCII encoding, including its control codes. It spans U+0000 to U+007F (128 code points) and comprises the C0 controls, ASCII punctuation and symbols, ASCII digits, the uppercase and lowercase letters of the English alphabet, and one further control character (Delete).
The Basic Latin block was included in its present form from version 1.0.0 of the Unicode Standard, without addition or alteration of the character repertoire.[5] Its block name in Unicode 1.0 was ASCII.[6]
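The single-byte UTF-8 property of the block can be checked directly. The following is a minimal Python sketch rather than anything taken from the Unicode Standard; the helper name `utf8_length` is chosen here purely for illustration.

```python
def utf8_length(code_point: int) -> int:
    """Return the number of bytes UTF-8 uses to encode one code point."""
    return len(chr(code_point).encode("utf-8"))


# Basic Latin covers U+0000..U+007F.
basic_latin = range(0x00, 0x80)

# Every code point in the block encodes to exactly one byte ...
all_single_byte = all(utf8_length(cp) == 1 for cp in basic_latin)

# ... and that byte has the same numeric value as the code point (as in ASCII).
bytes_match = all(chr(cp).encode("utf-8")[0] == cp for cp in basic_latin)

# The first code point after the block, U+0080, already needs two bytes.
first_outside = utf8_length(0x80)

print(all_single_byte, bytes_match, first_outside)  # True True 2
```

Because each encoded byte equals its code point, text that uses only Basic Latin characters is byte-for-byte identical in ASCII and UTF-8.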
Table of characters
- A The character U+005C (\) may be displayed as a yen sign (¥) or won sign (₩) by Japanese or Korean fonts that treat Unicode text (especially UTF-8) as a legacy character set in which these signs replaced the backslash.[7]
Subheadings
The C0 Controls and Basic Latin block contains six subheadings.[8]
C0 controls
The C0 controls, referred to as C0 ASCII control codes in version 1.0, are inherited from ASCII and other 7-bit and 8-bit encoding schemes. The alias names for the C0 controls are taken from the ISO/IEC 6429:1992 standard.[8]
ASCII punctuation and symbols
This subheading covers standard punctuation characters, simple mathematical operators, and symbols such as the dollar sign, percent sign, ampersand, underscore, and vertical bar (pipe).[8]
ASCII digits
The ASCII digits subheading contains the ten standard European digits, 0 to 9.[8]
Uppercase Latin alphabet
The Uppercase Latin alphabet subheading contains the standard 26-letter unaccented Latin alphabet in majuscule (uppercase) form.[8]
Lowercase Latin alphabet
The Lowercase Latin alphabet subheading contains the standard 26-letter unaccented Latin alphabet in minuscule (lowercase) form.[8]
Control character
The Control character subheading contains a single character, the "Delete" character at U+007F.[8]
Number of symbols, letters and control codes
The table below shows the number of letters, symbols and control codes under each subheading of the C0 Controls and Basic Latin block.
| Subheading | Number of characters | Code point range |
|---|---|---|
| C0 controls | 32 control codes | U+0000 to U+001F |
| ASCII punctuation and symbols | 33 punctuation marks and symbols | U+0020 to U+002F, U+003A to U+0040, U+005B to U+0060 and U+007B to U+007E |
| ASCII digits | 10 digits | U+0030 to U+0039 |
| Uppercase Latin Alphabet | 26 unaccented Latin letters in the majuscule. | U+0041 to U+005A |
| Lowercase Latin Alphabet | 26 unaccented Latin letters in the minuscule. | U+0061 to U+007A |
| Control character | 1 control code (the "Delete" character) | U+007F |
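The counts in the table can be recomputed from the listed ranges. The sketch below is illustrative only: the dictionary `SUBHEADINGS` and the helper `count_code_points` are names introduced here, and the ranges are copied from the table above.

```python
# Code point ranges (inclusive) for each subheading, copied from the table.
SUBHEADINGS = {
    "C0 controls": [(0x00, 0x1F)],
    "ASCII punctuation and symbols": [
        (0x20, 0x2F), (0x3A, 0x40), (0x5B, 0x60), (0x7B, 0x7E),
    ],
    "ASCII digits": [(0x30, 0x39)],
    "Uppercase Latin alphabet": [(0x41, 0x5A)],
    "Lowercase Latin alphabet": [(0x61, 0x7A)],
    "Control character": [(0x7F, 0x7F)],
}


def count_code_points(ranges):
    """Count the code points in a list of inclusive (first, last) ranges."""
    return sum(last - first + 1 for first, last in ranges)


counts = {name: count_code_points(r) for name, r in SUBHEADINGS.items()}
total = sum(counts.values())

# Check that the ranges cover U+0000..U+007F exactly once each.
covered = sorted(
    cp
    for ranges in SUBHEADINGS.values()
    for first, last in ranges
    for cp in range(first, last + 1)
)
covers_block = covered == list(range(0x80))

for name, n in counts.items():
    print(f"{n:3d}  {name}")
print(f"{total:3d}  total, covers block exactly: {covers_block}")
```

The six counts (32, 33, 10, 26, 26 and 1) sum to the 128 code points of the block, and the coverage check confirms that the ranges neither overlap nor leave gaps.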
Chart
C0 Controls and Basic Latin[a] (Official Unicode Consortium code chart, PDF)
| | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| U+000x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
| U+001x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
| U+002x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
| U+003x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
| U+004x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
| U+005x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
| U+006x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
| U+007x | p | q | r | s | t | u | v | w | x | y | z | { | \| | } | ~ | DEL |
Variants
Several characters in the block have standardized variants, which are rendered when the character is followed by a variation selector.
A variant is defined for a zero with a short diagonal stroke: U+0030 DIGIT ZERO, U+FE00 VS1 (0︀).[9][10]
Twelve characters (#, *, and the digits) can be followed by U+FE0E VS15 or U+FE0F VS16 to create emoji variants.[11][12][13][14] They are keycap base characters, for example #️⃣ (U+0023 NUMBER SIGN U+FE0F VS16 U+20E3 COMBINING ENCLOSING KEYCAP). The VS15 version is "text presentation" while the VS16 version is "emoji-style".[10]
| U+ | 0023 | 002A | 0030 | 0031 | 0032 | 0033 | 0034 | 0035 | 0036 | 0037 | 0038 | 0039 |
| base | # | * | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| base+VS15+keycap | #︎⃣ | *︎⃣ | 0︎⃣ | 1︎⃣ | 2︎⃣ | 3︎⃣ | 4︎⃣ | 5︎⃣ | 6︎⃣ | 7︎⃣ | 8︎⃣ | 9︎⃣ |
| base+VS16+keycap | #️⃣ | *️⃣ | 0️⃣ | 1️⃣ | 2️⃣ | 3️⃣ | 4️⃣ | 5️⃣ | 6️⃣ | 7️⃣ | 8️⃣ | 9️⃣ |
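The keycap sequences in the table can be assembled from their code points. The following Python sketch assumes only the code points named above; the constant names and the `keycap` helper are hypothetical and introduced here for illustration. Whether a sequence is actually drawn as an emoji depends on the font and renderer.

```python
VS15 = "\uFE0E"    # VARIATION SELECTOR-15: requests text presentation
VS16 = "\uFE0F"    # VARIATION SELECTOR-16: requests emoji presentation
KEYCAP = "\u20E3"  # COMBINING ENCLOSING KEYCAP

# The twelve keycap base characters from the Basic Latin block.
KEYCAP_BASES = "#*0123456789"


def keycap(base: str, emoji: bool = True) -> str:
    """Build base + variation selector + keycap for one base character."""
    if len(base) != 1 or base not in KEYCAP_BASES:
        raise ValueError(f"not a keycap base character: {base!r}")
    selector = VS16 if emoji else VS15
    return base + selector + KEYCAP


sequence = keycap("#")
print(" ".join(f"U+{ord(ch):04X}" for ch in sequence))  # U+0023 U+FE0F U+20E3
```

Passing `emoji=False` yields the VS15 (text presentation) form of the same sequence.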
History
The following Unicode-related documents record the purpose and process of defining specific characters in the Basic Latin block:
| Version | Final code points[a] | Count | UTC ID | L2 ID | WG2 ID | Document |
|---|---|---|---|---|---|---|
| 1.0.0 | U+0000..007F | 128 | | | | (to be determined) |
| | | | UTC/1999-013 | | | |
| | | | | L2/99-176R | | |
| | | | | L2/04-145 | | |
| | | | | L2/04-202 | | |
| | | | | | N3046 | |
| | | | | | N3103 (pdf, doc) | |
| | | | | L2/11-043 | | |
| | | | | L2/11-160 | | |
| | | | | L2/11-261R2 | | |
| | | | | L2/11-438[b][c] | N4182 | |
| | | | | L2/15-107 | | |
| | | | | L2/15-268 | | |
| | | | | L2/15-301[d][c] | | |
| | | | | L2/15-254 | | |
| | | | | L2/17-294 | N4914 | |
| | | | | L2/22-019 | | |
| | | | | L2/22-016 | | |
See also
- Latin script in Unicode
- Latin-1 Supplement
- Character encoding
- ISO/IEC 8859-1
- Latin script
- ISO basic Latin alphabet
References
- [7] Also available at: http://archives.miloush.net/michkap/archive/2005/09/17/469941.html