GB2312 Encoding Table

Complete Simplified Chinese character set — 7,445 characters across 94 zones. Browse by zone, search by character, and inspect EUC-CN encoding.

Copied!

How to Browse the GB2312 Table

Four simple steps to navigate the encoding standard.

1
Select a Zone
Use the color-coded zone tabs to navigate 94 zones. Yellow = symbols (01-09), Green = Level 1 Chinese (16-55), Blue = Level 2 Chinese (56-87), Gray = reserved zones (10-15, 88-94).
2
Browse the Character Grid
Each zone displays up to 94 characters in a 10-column grid. Every cell shows the character and its 2-digit position code within that zone.
3
Click for Encoding Details
Click any character to see its GB2312 zone-position code (区位码), EUC-CN hexadecimal encoding, and Unicode code point — all in a detail popup.
4
Search by Character
Type a Chinese character into the search box to instantly find and navigate to the zone containing it. Copy characters with one click.

What is GB2312?

GB2312 (GB0, 国家标准 2312) is the People's Republic of China's national standard for Simplified Chinese character encoding, published by the China National Bureau of Standards in 1980 and effective since May 1, 1981. It was the first standardized Chinese character encoding and formed the foundation for all subsequent Chinese encoding standards including GBK, GB18030, and influenced the design of Unicode's CJK Unified Ideographs block.

GB2312 encodes 6,763 Chinese characters (3,755 Level 1 + 3,008 Level 2) plus 682 full-width symbols — including Latin letters, Greek letters, Cyrillic letters, Japanese hiragana and katakana, Chinese punctuation, and typographic symbols — for a total of 7,445 assigned code points out of 8,836 possible positions.

GB2312 Zone-Position Structure (区位码)

GB2312 uses a two-dimensional coordinate system: 94 zones (区, qū) numbered 01–94, each containing 94 positions (位, wèi) numbered 01–94. This gives 94 × 94 = 8,836 possible code points. A character's location is expressed as its 区位码 (qūwèi mǎ): the zone number followed by the position number, e.g., 啊 is at zone 16, position 01, written as 16-01.

ZonesContentCountSort Order
01 – 09Full-width symbols & punctuation846By character type
10 – 15Reserved (later expanded by GBK)
16 – 55Level 1 Chinese (一级汉字)3,755By Pinyin (拼音)
56 – 87Level 2 Chinese (二级汉字)3,008By Radical + Stroke (部首/笔画)
88 – 94Reserved (later expanded by GBK)

EUC-CN Encoding: How GB2312 Works in Computers

GB2312 defines a logical coordinate system (zone, position), but computers need a byte-level encoding. The standard computer representation is EUC-CN (Extended Unix Code for China), which encodes each GB2312 character as two bytes:

Byte 1 = Zone  + 0xA0  (0xA1 – 0xFE)
Byte 2 = Position + 0xA0  (0xA1 – 0xFE)

Adding 0xA0 (160 in decimal) to each coordinate ensures that both bytes are always ≥ 0xA1, placing them safely above the ASCII range (0x00–0x7F) and allowing GB2312 text to coexist with ASCII in the same document without ambiguity. For example, the character 啊 at zone 16, position 01 is encoded as bytes 0xB0 0xA1 (16+160=176=0xB0, 1+160=161=0xA1). Each byte is in the range 0xA1–0xFE (161–254), giving 94 possible values each and enabling the full 94×94 coordinate space.

GB2312 vs GBK vs GB18030

StandardYearCharactersEncodingNotes
GB231219807,4452-byte EUC-CNOriginal standard; Simplified Chinese only
GBK199521,8862-byte (extended range)Backward-compatible superset; includes all GB2312 chars + Traditional Chinese + more symbols; uses zones 10-15 and 88-94
GB18030-2005200570,244+1/2/4-byte variableMandatory in China since 2006; covers all of Unicode; fully backward-compatible with GB2312 and GBK
GB18030-2022202287,887+1/2/4-byte variableLatest revision; includes minority scripts

Key point: Any valid GB2312 text is also valid GBK and GB18030. The EUC-CN byte pairs for the original 7,445 GB2312 characters are identical across all three standards — making GB2312 fully forward-compatible.

Level 1 vs Level 2 Chinese Characters

GB2312 splits Chinese characters into two levels based on frequency of use:

Historical Significance

GB2312 was developed at a pivotal moment in Chinese computing history. Before its adoption, Chinese text could not be reliably represented in digital form — dozens of incompatible encoding schemes existed. GB2312 unified Simplified Chinese encoding under a single national standard, enabling the first generation of Chinese-language software, operating systems (CCDOS, early Windows Chinese editions), email, and eventually the Chinese internet. Its design principles — the 94×94 zone grid, the 0xA0 offset for ASCII coexistence, and the pinyin/radical sort orders — directly influenced GBK, GB18030, Big5 (Traditional Chinese), and even the organization of Unicode's CJK Unified Ideographs block. While modern systems now use Unicode (UTF-8), understanding GB2312 remains essential for working with legacy Chinese systems, embedded devices, mainframe terminals, and historical digital archives.