Top Banner
1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28 (replaces L2/19-394) Having been inspired by L2/19-346, which is entitled On Encoding Policy of Gongche Notations and Upcoming Para-ideographs, this document serves as an updated proposal to add a new provisional Unihan Database property tentatively designated kStrange. This property is in- tended to identify Han ideographs that are considered “strange” in one or more ways, and specifies 12 fairly distinct categories. The 12 sections that begin on the next page correspond to each of the 12 categories, and provide the Han ideographs that are tentatively assigned to each category. Below is a summary listing of the 12 categories, their single-letter abbrevia- tions, and their names: • Category A [A]symmetric • Category B [B]opomofo • Category C [C]ursive • Category F [F]ully-reflective • Category H [H]angul Component • Category I [I]ncomplete • Category K [K]atakana Component • Category M [M]irrored • Category O [O]dd Component • Category R [R]otated • Category S [S]troke-heavy • Category U [U]nusual Arrangment/Structure The “Code Point” and “Ideograph” cells of Han ideographs that are assigned to more than one category are cyan-filled. In the proposed data file, the maximum number of categories to which a Han ideograph is currently assigned is two. A total of 697 unique Han ideographs are covered in this proposal. Soon aſter this new Unihan Database property is accepted, I plan to prepare a new UTN (Uni- code Technical Note) that will be a minor extension of this document, and will be updated as additional Han ideographs are added to this property, which can happen through discovery, by establishing a new category, or by the encoding of a new CJK Unified Ideographs block. For example, I have thus far identified 33 kStrange candidates in IRG Working Set 2017, which is expected to become the Extension H block. Data Files The kstrange-data.txt data file, which is a PDF attachment, provides everything that is neces- sary for adding this property to the Unihan Database, and for adding a full description of the property to UAX #38, Unicode Han Database (Unihan).
42

L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

Jul 18, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

1

L2/20-059Title: Proposal for new provisional Unihan Database property: kStrangeAuthor: Ken LundeDate: 2020-01-28 (replaces L2/19-394)

Having been inspired by L2/19-346, which is entitled On Encoding Policy of Gongche Notations and Upcoming Para-ideographs, this document serves as an updated proposal to add a new provisional Unihan Database property tentatively designated kStrange. This property is in-tended to identify Han ideographs that are considered “strange” in one or more ways, and specifies 12 fairly distinct categories. The 12 sections that begin on the next page correspond to each of the 12 categories, and provide the Han ideographs that are tentatively assigned to each category. Below is a summary listing of the 12 categories, their single-letter abbrevia-tions, and their names:

• Category A [A]symmetric• Category B [B]opomofo• Category C [C]ursive• Category F [F]ully-reflective• Category H [H]angul Component• Category I [I]ncomplete• Category K [K]atakana Component• Category M [M]irrored• Category O [O]dd Component• Category R [R]otated• Category S [S]troke-heavy• Category U [U]nusual Arrangment/Structure

The “Code Point” and “Ideograph” cells of Han ideographs that are assigned to more than one category are cyan-filled. In the proposed data file, the maximum number of categories to which a Han ideograph is currently assigned is two. A total of 697 unique Han ideographs are covered in this proposal.Soon after this new Unihan Database property is accepted, I plan to prepare a new UTN (Uni-code Technical Note) that will be a minor extension of this document, and will be updated as additional Han ideographs are added to this property, which can happen through discovery, by establishing a new category, or by the encoding of a new CJK Unified Ideographs block. For example, I have thus far identified 33 kStrange candidates in IRG Working Set 2017, which is expected to become the Extension H block.

Data FilesThe kstrange-data.txt data file, which is a PDF attachment, provides everything that is neces-sary for adding this property to the Unihan Database, and for adding a full description of the property to UAX #38, Unicode Han Database (Unihan).

Page 2: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

2

The kstrange-data-ws2017.txt data file, which is also a PDF attachment, provides the serial number, source reference(s), and kStrange property value(s) for the 33 candidates that have been identified thus far in IRG Working Set 2017.

Representative Glyphs & Code Chart ExcerptsThe representative glyphs in the “Ideograph” and “Reference” columns, along with those in the “Bopomofo” and “Katakana” columns, are from the Hanazono Mincho (花園明朝) fonts, specifically the OpenType/CFF versions: HanaMinA.otf (BMP), HanaMinB.otf (Extension B), and HanaMinC.otf (Extensions C through F). Extension G representative glyphs are not included (their “Ideograph” cells are filled green), except for a very small number whose representative glyphs are from Source Han Sans.The code chart excerpts are based on those for the Unicode Version 13.0 BETA, and those for Extension G are highlighted in yellow.

Category A—[A]symmetricHan ideographs with this property value exhibit a structure that appears to be asymmetric.

Category Block Code Point Ideograph Code Chart Excerpt

A B U+2074E 𠝎A B U+21CFF 𡳿A B U+24C03 𤰃A B U+26A03 𦨃A B U+26B69 𦭩

Category B—[B]opomofoHan ideographs with this property value visually resemble a bopomofo character (common Han ideographs and radicals that resemble a bopomofo character, such as 一 (U+4E00) versus ㄧ (U+3127), have been explicitly excluded). In addition to being tagged with the letter “B,” the code point for the corresponding bopomofo character is also specified.

Category Block Code Point Ideograph Bopomofo Code Chart Excerpt

B URO U+4E02 丂 ㄎB URO U+4E05 丅 ㄒB URO U+4E29 丩 ㄐ

Page 3: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

3

Category Block Code Point Ideograph Bopomofo Code Chart Excerpt

B URO U+4E2B 丫 ㄚB URO U+4E40 乀 ㄟB URO U+4E5C 乜 ㆤB URO U+5DDC 巜 ㄍB URO U+5E00 帀 ㄭB A U+3405 㐅 ㄨB A U+37A2 㞢 ㄓB B U+20000 𠀀 ㄛB B U+20005 𠀅 ㄞB B U+200CA 𠃊 ㆹB B U+200CB 𠃋 ㄥB B U+200D2 𠃒 ㄝB B U+2010E 𠄎 ㄋB B U+206A3 𠚣 ㄉB B U+20AD3 𠫓 ㄊB B U+21C23 𡰣 ㄕB B U+21FE8 𡿨 ㄑB G U+3018A ㄗ

Page 4: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

4

Category C—[C]ursiveHan ideographs with this property value are either cursive or include one or more cursive components that do not adhere to Han ideograph stroke conventions.

Category Block Code Point Ideograph Code Chart Excerpt

C URO U+4E44 乄C B U+201AD 𠆭C B U+201C7 𠇇C B U+2034B 𠍋C B U+20AB3 𠪳C B U+211A2 𡆢C B U+219B9 𡦹C B U+219D1 𡧑C B U+22013 𢀓C B U+26E57 𦹗C F U+2CEF7 𬻷C F U+2CEFF 𬻿C F U+2CF02 𬼂C F U+2D047 𭁇C F U+2D143 𭅃C F U+2D37B 𭍻C F U+2D44A 𭑊

Page 5: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

5

Category Block Code Point Ideograph Code Chart Excerpt

C F U+2D6A5 𭚥C F U+2D92A 𭤪C F U+2D95F 𭥟C F U+2E4E0 𮓠C F U+2E979 𮥹

Category F—[F]ully-reflectiveHan ideographs with this property value are fully-reflective or include components that are fully-reflective, meaning that the mirrored and unmirrored components are arranged side-by-side or stacked top-and-bottom. In addition to being tagged with the letter “F,” the code point for the corresponding unreflected Han ideograph, if any and as shown in the “Reference” col-umn, is also specified.

Category Block Code Point Ideograph Reference Code Chart Excerpt

F URO U+56CD 囍F URO U+71DB 燛F URO U+81E6 臦 𦣦F URO U+81E9 臩F B U+21155 𡅕F B U+221D6 𢇖F B U+22374 𢍴F B U+223FD 𢏽F B U+23960 𣥠 𣥖F B U+244EB 𤓫

Page 6: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

6

Category Block Code Point Ideograph Reference Code Chart Excerpt

F B U+24570 𤕰 㸞F B U+249A1 𤦡F B U+268E9 𦣩 𦣦F B U+286DC 𨛜 䣈F B U+287A0 𨞠F B U+287B0 𨞰F B U+28944 𨥄F B U+28CC8 𨳈 門F B U+28E85 𨺅F B U+28F31 𨼱F B U+28F44 𨽄F B U+28F5D 𨽝F B U+28F61 𨽡F B U+28F69 𨽩F B U+28F74 𨽴F B U+28F75 𨽵F E U+2B935 𫤵F E U+2BA23 𫨣F E U+2BC92 𫲒

Page 7: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

7

Category Block Code Point Ideograph Reference Code Chart Excerpt

F E U+2BE2A 𫸪 弜F E U+2C1BB 𬆻F E U+2C30B 𬌋F E U+2C6FF 𬛿F E U+2CD18 𬴘F E U+2CD1C 𬴜F E U+2CD20 𬴠F E U+2CD21 𬴡F E U+2CD22 𬴢F E U+2CD23 𬴣F E U+2CD24 𬴤F E U+2CD25 𬴥F E U+2CD26 𬴦F F U+2D5B2 𭖲F G U+31044

Category H—[H]angul ComponentHan ideographs with this property value include a hangul component. In addition to being tagged with the letter “H,” the code point for the hangul component is also specified.

Category Block Code Point Ideograph Hangul Code Chart Excerpt

H URO U+5DEA 巪 ㄱ

Page 8: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

8

Category Block Code Point Ideograph Hangul Code Chart Excerpt

H A U+3514 㔔 ㅇH A U+3516 㔖 ㄱH A U+3AB2 㪲 ㄱH A U+3AB3 㪳 ㅇH A U+3AC7 㫇 ㄱH A U+3AC8 㫈 ㅇH A U+439E 䎞 ㄱH B U+200CD 𠃍 ㄱH C U+2A8B3 𪢳 ㄱH C U+2A941 𪥁 ㄱH F U+2D03B 𭀻 ㄱH F U+2D1BE 𭆾 ㄱH F U+2D81A 𭠚 ㄱH F U+2D939 𭤹 ㅅH F U+2D94B 𭥋 ㄱH F U+2DA58 𭩘 ㄱH F U+2E78C 𮞌 ㄱH G U+301C8 ㄱH G U+30255 ㄱ

Page 9: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

9

Category Block Code Point Ideograph Hangul Code Chart Excerpt

H G U+30481 ㄱH G U+30912 ㄱH G U+30BEE ㄱH G U+30C2F ㄱH G U+30D97 ㄱH G U+30F18 ㄱ

Category I—[I]ncompleteHan ideographs with this property value appear to be incomplete versions of an existing or possible Han ideograph (meaning that one or more components appear to be incomplete), without regard to semantics. In addition to being tagged with the letter “I,” the code points for the corresponding complete Han ideographs, if any and as shown in the “Reference” column, are also specified.

Category Block Code Point Ideograph Reference Code Chart Excerpt

I URO U+4E04 丄 上I URO U+4E05 丅 下I URO U+4E52 乒 兵I URO U+4E53 乓 兵I URO U+5187 冇 有I URO U+56EC 囬 面I URO U+5B52 孒 子I URO U+5DDC 巜 巛I URO U+8002 耂 老

Page 10: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

10

Category Block Code Point Ideograph Reference Code Chart Excerpt

I URO U+8080 肀 聿I URO U+9F50 齐 斉I URO U+9FB0 龰 足I A U+382A 㠪 正I A U+39B0 㦰 韱I A U+3C50 㱐 武I A U+4AA3 䪣I B U+20016 𠀖 共I B U+20017 𠀗 共I B U+2002A 𠀪 其I B U+2002B 𠀫 其I B U+20035 𠀵I B U+2003D 𠀽I B U+20063 𠁣 門I B U+20064 𠁤 西I B U+20080 𠂀 丼I B U+20092 𠂒 生I B U+20099 𠂙 耒I B U+2009A 𠂚 乔

Page 11: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

11

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+2009B 𠂛 企I B U+200B3 𠂳 𥾜I B U+200D2 𠃒 世I B U+200DB 𠃛 門I B U+20118 𠄘 承I B U+20119 𠄙 事I B U+20149 𠅉I B U+2015B 𠅛I B U+2017E 𠅾I B U+201B1 𠆱 𬽫I B U+2053E 𠔾 舟I B U+20546 𠕆 有I B U+20936 𠤶I B U+209B1 𠦱I B U+20A64 𠩤 原I B U+20AB1 𠪱 𠪾I B U+20B0A 𠬊I B U+20B35 𠬵 𥸩I B U+20B6B 𠭫

Page 12: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

12

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+20EDA 𠻚 暮I B U+20F28 𠼨I B U+2115B 𡅛I B U+21245 𡉅 吉I B U+21246 𡉆 吉I B U+213CE 𡏎I B U+21428 𡐨 壄I B U+2151C 𡔜 声I B U+21556 𡕖 夆I B U+216F7 𡛷 𪥱I B U+219AD 𡦭I B U+219D8 𡧘 家I B U+21C23 𡰣 尸I B U+21FE8 𡿨 巛I B U+22053 𢁓 布I B U+22064 𢁤I B U+220FB 𢃻I B U+2218D 𢆍 重I B U+221AF 𢆯 糸

Page 13: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

13

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+221BD 𢆽 紗I B U+221CA 𢇊 綤I B U+221CB 𢇋 䋵I B U+22324 𢌤 建I B U+2239E 𢎞 弘I B U+223B1 𢎱 𢎨I B U+223BA 𢎺 𢏛I B U+223C0 𢏀I B U+224B4 𢒴I B U+2267B 𢙻I B U+22779 𢝹 𢞖I B U+22868 𢡨I B U+22877 𢡷I B U+22994 𢦔I B U+22AC2 𢫂I B U+22BBD 𢮽I B U+23D11 𣴑 流I B U+24993 𤦓 𤨎I B U+225A9 𢖩 心

Page 14: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

14

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+22606 𢘆 恒I B U+22634 𢘴 㤻I B U+226C0 𢛀 悶I B U+2298F 𢦏 哉¹I B U+22998 𢦘 或I B U+22A6F 𢩯 𡥅I B U+22AA2 𢪢 挑I B U+22ACE 𢫎 𢬘I B U+22B61 𢭡 挿I B U+22F0B 𢼋 敇I B U+2314A 𣅊 昌I B U+23150 𣅐 旻I B U+23169 𣅩 眉I B U+231B9 𣆹 畳I B U+231D3 𣇓 鼎I B U+232B3 𣊳 曛I B U+23332 𣌲I B U+233C1 𣏁 杲I B U+233CA 𣏊 枂

Page 15: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

15

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+23408 𣐈 栜I B U+2347D 𣑽 梵I B U+23652 𣙒 𪔇I B U+236D0 𣛐I B U+239E8 𣧨 𣨁I B U+23C16 𣰖 𣰚I B U+23C71 𣱱 水I B U+23C96 𣲖 泒I B U+23D2B 𣴫 溊I B U+23D49 𣵉 湙I B U+23DD2 𣷒I B U+2404E 𤁎I B U+24121 𤄡I B U+241D7 𤇗 𤈖I B U+2435E 𤍞 燁I B U+2437B 𤍻I B U+2447B 𤑻 爗I B U+244F0 𤓰 瓜I B U+24642 𤙂

Page 16: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

16

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+248E5 𤣥 玄I B U+248E6 𤣦I B U+24ADD 𤫝I B U+24E48 𤹈I B U+24F3D 𤼽 皁皐I B U+24F6A 𤽪I B U+2506B 𥁫I B U+25100 𥄀I B U+25186 𥆆 𥆨I B U+2549B 𥒛I B U+256C4 𥛄 禛I B U+2574C 𥝌 禾I B U+25844 𥡄 穐I B U+2584C 𥡌 穐I B U+25952 𥥒 䆥I B U+25A5A 𥩚I B U+25A88 𥪈I B U+25B7B 𥭻 箸I B U+25CC1 𥳁 𥲤

Page 17: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

17

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+25D6F 𥵯I B U+25F26 𥼦I B U+25F94 𥾔 納I B U+25FAC 𥾬 絃I B U+25FAD 𥾭I B U+26165 𦅥I B U+2626A 𦉪 四I B U+2626B 𦉫 而I B U+26285 𦊅 突I B U+26316 𦌖I B U+26356 𦍖I B U+26419 𦐙I B U+264D0 𦓐 而I B U+26541 𦕁I B U+265C9 𦗉 𬚦I B U+26612 𦘒 聿I B U+2664D 𦙍 胤I B U+26738 𦜸I B U+268FA 𦣺 臯

Page 18: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

18

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+26965 𦥥I B U+26A0A 𦨊 𦨎I B U+26AF7 𦫷 芘I B U+26B34 𦬴 䒫I B U+26B44 𦭄 茦I B U+26B71 𦭱I B U+26B81 𦮁 𦰙I B U+26B9F 𦮟I B U+26BFA 𦯺 菨I B U+26BFE 𦯾I B U+26C18 𦰘 菧I B U+26C66 𦱦 董I B U+26DC3 𦷃I B U+27268 𧉨 蛩I B U+27475 𧑵I B U+27538 𧔸I B U+275EE 𧗮 衙I B U+27607 𧘇 衣I B U+2761B 𧘛 袨

Page 19: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

19

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+27825 𧠥 𧠵I B U+278D8 𧣘I B U+2795B 𧥛 言I B U+2795C 𧥜 言I B U+27968 𧥨 詍I B U+2796A 𧥪I B U+279B6 𧦶 話I B U+279DF 𧧟 䜟I B U+27A25 𧨥 諙I B U+27C1E 𧰞I B U+27C27 𧰧 豕I B U+27C28 𧰨 豕I B U+27E90 𧺐 𧺙I B U+27FCB 𧿋I B U+28013 𨀓I B U+28029 𨀩 踻I B U+28210 𨈐 身I B U+28211 𨈑 身I B U+2844E 𨑎 𨑾

Page 20: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

20

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+28498 𨒘 𨒪I B U+284B5 𨒵 過I B U+28538 𨔸 遘I B U+28925 𨤥 量I B U+2895D 𨥝 鈲I B U+28973 𨥳 𨦉I B U+28979 𨥹 銫I B U+28C06 𨰆I B U+28CC7 𨳇 門I B U+28D8D 𨶍I B U+28E8A 𨺊I B U+28EDB 𨻛I B U+2907E 𩁾 雭I B U+2909A 𩂚I B U+2928B 𩊋I B U+2944E 𩑎 順I B U+2947F 𩑿 𩒕I B U+2948B 𩒋 𩒕I B U+296D6 𩛖

Page 21: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

21

Category Block Code Point Ideograph Reference Code Chart Excerpt

I B U+29C0A 𩰊 鬥I B U+29C0B 𩰋 鬥I B U+29C1C 𩰜 𩰟I B U+29C86 𩲆 鬽I B U+29C87 𩲇I B U+29D2B 𩴫 𫙎I B U+29D30 𩴰I B U+2A544 𪕄I C U+2AA72 𪩲 朿I D U+2B740 𫝀 五I D U+2B744 𫝄 久I E U+2B820 𫠠 弋戈I E U+2B829 𫠩 兹I E U+2B84F 𫡏 攵I E U+2B851 𫡑 𠂹I E U+2BA51 𫩑 面靣I E U+2BFED 𫿭 斉I E U+2C09B 𬂛 木I E U+2C0C7 𬃇 楹

Page 22: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

22

Category Block Code Point Ideograph Reference Code Chart Excerpt

I E U+2C52D 𬔭I E U+2C889 𬢉I F U+2CEB0 𬺰 才I F U+2CEB1 𬺱 木I F U+2CEB7 𬺷 豕I F U+2CEBB 𬺻 豕I F U+2CECC 𬻌 東I F U+2CECD 𬻍 東I F U+2CF2B 𬼫 惢I F U+2D0B8 𭂸I F U+2D110 𭄐I F U+2D1B1 𭆱I F U+2D1C1 𭇁 吾I F U+2D57F 𭕿 䖝I F U+2D6A0 𭚠 戍I F U+2D80D 𭠍 戈I F U+2D928 𭤨I F U+2D95D 𭥝I F U+2DA72 𭩲

Page 23: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

23

Category Block Code Point Ideograph Reference Code Chart Excerpt

I F U+2DA97 𭪗I F U+2DC3E 𭰾 溋I F U+2DEBD 𭺽 東I F U+2DF9B 𭾛 貞I F U+2E39B 𮎛 色I G U+30006 立I G U+3002A 彡I G U+300E6 向I G U+30333

I G U+30367 小I G U+30368 小I G U+304A8 幾I G U+306C4 水I G U+306C5 水I G U+308B5 画I G U+30B67 老

1 U+2298F 𢦏 is an extremely productive component that appears in 46 additional Han ideo-graphs in the BMP and Plane 2: 㘽㦲㦳䳒䵧截戴栽烖胾臷蛓裁載载酨韯戴𢎇𢦛𢦷𢦼𢧇𢧑𢧜𢧨𢧭𢨆𢨎𢨣𤱱𥅤𥅰𦀂𧟭𧧟𧧬𧯥𨚵𪭋𪭒𫻭𬰫𬲏𮚂𮧇. It also appears in three Han ideo-graphs in Plane 3 (Extension G): U+30666 ⿹𢦏止, U+309F8 ⿹𢦏示, and U+31267 ⿹𢦏鸟. In other words, the kStrange property value for U+2298F 𢦏 consists of “I” following by 50 colon-separated code points.

Page 24: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

24

Category K—[K]atakana ComponentHan ideographs with this property value include one or more components that visually re-semble a katakana syllable (although 𢀖, the PRC simplified form of 巠, is included, Han ideo-graphs that include it as a component have been explicitly excluded). In addition to being tagged with the letter “K,” the code points for the katakana or katakana-like components are also specified.

Category Block Code Point Ideograph Katakana Code Chart Excerpt

K B U+211A5 𡆥 ト¹K B U+22016 𢀖 スK B U+282A3 𨊣 ト²K C U+2A708 𪜈 モK D U+2B742 𫝂 ツK E U+2B9A4 𫦤 カK E U+2B9AB 𫦫 カナK E U+2BCCD 𫳍 ウツホK E U+2C711 𬜑 カ³K F U+2CEC0 𬻀 サK F U+2CECB 𬻋 サK F U+2CF00 𬼀 シテK F U+2CF61 𬽡 スK F U+2D580 𭖀 スK F U+2D6DD 𭛝 ヱK F U+2DF86 𭾆 ケ

Page 25: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

25

Category Block Code Point Ideograph Katakana Code Chart Excerpt

K F U+2E307 𮌇 スK F U+2E695 𮚕 ケ

1 U+211A5 𡆥 is an abbreviated form that means “library” (トショカン, which is the reading of 図書館 toshokan).

2 U+282A3 𨊣 is an abbreviated form that means “truck” (トラック torakku), as in a type of vehicle (車).

3 U+2C711 𬜑 is an abbreviated form that means “cutter” (カッター kattā), as in a type of ship (舟).

Category M—[M]irroredHan ideographs with this property value are either mirrored or include one or more compo-nents that are mirrored. In addition to being tagged with the letter “M,” the code point for the corresponding unmirrored Han ideograph, if any and as shown in the “Reference” column, is also specified.

Category Block Code Point Ideograph Reference Code Chart Excerpt

M URO U+4EFA 仺M URO U+5350 卐 卍M B U+2009C 𠂜 𠂛M B U+20141 𠅁 亡M B U+2091C 𠤜 𠤗M B U+22044 𢁄M B U+23944 𣥄 正M B U+23957 𣥗 𣥕M B U+2456A 𤕪 㠯M B U+26B62 𦭢¹

Page 26: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

26

Category Block Code Point Ideograph Reference Code Chart Excerpt

M B U+28668 𨙨 邑M B U+2907F 𩁿 雪M G U+30002

M G U+30004 彐1 U+26B62 𦭢 is not actually mirrored, but its construction gives such an impression.

Category O—[O]dd ComponentHan ideographs with this property value include one or more components that are symbol-like or are otherwise considered odd. In addition to being tagged with the letter “O,” the code point for a related character, if any and as shown in the “Reference” column, is also specified.

Category Block Code Point Ideograph Reference Code Chart Excerpt

O A U+3403 㐃O B U+200E0 𠃠¹O B U+20137 𠄷O B U+205F1 𠗱O B U+20696 𠚖O B U+2069C 𠚜O B U+206A1 𠚡O B U+20953 𠥓O B U+20967 𠥧O B U+20969 𠥩O B U+2096A 𠥪

Page 27: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

27

Category Block Code Point Ideograph Reference Code Chart Excerpt

O B U+2096B 𠥫O B U+2096C 𠥬O B U+21261 𡉡O B U+242C5 𤋅O B U+24548 𤕈O B U+26B99 𦮙O B U+2700D 𧀍O B U+291E7 𩇧O B U+2A6D7 合O B U+2A6D8 四O B U+2A6D9 一O B U+2A6DA 上O B U+2A6DB 尺O B U+2A6DC 工O B U+2A6DD 凡O F U+2CF01 𬼁 ʒ²O F U+2CF04 𬼄 ℥³O F U+2D1AC 𭆬O F U+2DF86 𭾆

Page 28: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

28

Category Block Code Point Ideograph Reference Code Chart Excerpt

O F U+2DF8B 𭾋O F U+2DF8F 𭾏O F U+2E34C 𮍌O F U+2E34D 𮍍O F U+2E350 𮍐O F U+2E5D8 𮗘O F U+2EA08 𮨈

1 If Japan were to horizontally-extend U+200E0 𠃠 with JMJ-030425 as its source reference, the two ⼂ components would be rendered as ・ components, hence its assignment to this category.

2 U+0292 ʒ LATIN SMALL LETTER EZH is the symbol for “dram,” and U+2CF01 𬼁 carries the same meaning.

3 U+2125 ℥ OUNCE SIGN is the symbol for “ounce,” and U+2CF04 𬼄 carries the same mean-ing.

Category R—[R]otatedHan ideographs with this property value are either rotated or include one or more compo-nents that are rotated. In addition to being tagged with the letter “R,” the code point for the corresponding unrotated Han ideograph, if any and as shown in the “Reference” column, is also specified.

Category Block Code Point Ideograph Reference Code Chart Excerpt

R B U+2010F 𠄏 了R B U+20114 𠄔 予R B U+20432 𠐲R B U+20544 𠕄 凹R B U+221B4 𢆴 𢆳

Page 29: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

29

Category Block Code Point Ideograph Reference Code Chart Excerpt

R B U+22A0B 𢨋 ¹R B U+23028 𣀨R B U+23952 𣥒 𣥕R B U+24173 𤅳R B U+24489 𤒉 𤒜R B U+24493 𤒓R B U+27951 𧥑R B U+27E42 𧹂 贙R E U+2BA66 𫩦 𠱃R E U+2C886 𬢆R F U+2E5D9 𮗙R G U+304A5 戔R G U+30A07

R G U+30C9E 𫻺

1 U+304B2 ⿱或或 (Extension G) is the equivalent Han ideograph without a rotated compo-nent.

Category S—[S]troke-heavyHan ideographs with this property value have 40 or more strokes, covering 22 Han ideographs. If this threshold were to be decreased to 35 or 30 strokes, the number of Han ideographs with this property value would increase to 71 or 336, respectively. In addition to being tagged with the letter “S,” the kTotalStrokes value is also specified.

Page 30: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

30

Category Block Code Point Ideograph Strokes Code Chart Excerpt

S URO U+9F98 龘 48

S A U+4A3B 䨻 52

S A U+4C9C 䲜 44

S B U+2053B 𠔻 64

S B U+269C4 𦧄 42

S B U+269C5 𦧅 48

S B U+27198 𧆘 43

S B U+278B1 𧢱 44

S B U+291D3 𩇓 40

S B U+291D4 𩇔 48

S B U+29663 𩙣 46

S B U+29664 𩙤 48

S B U+2A4CA 𪓊 41

S B U+2A68D 𪚍 40

S B U+2A68E 𪚎 40

S B U+2A6A5 𪚥 64

S E U+2C6A9 𬚩 53

S F U+2E8F1 𮣱 41

S G U+30EDD ⿺辶⿳穴⿰月⿰⿲⿱幺长⿱言马⿱幺长刂心 43

Page 31: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

31

Category Block Code Point Ideograph Strokes Code Chart Excerpt

S G U+30EDE ⿺辶⿳穴⿰月⿰⿲⿱幺長⿱言馬⿱幺長刂心 58

S G U+30F54 76

S G U+3106C ⿳雲⿲雲龍雲⿰龍龍 84

Category U—[U]nusual Arrangement/StructureHan ideographs with this property value have an unusual structure or component arrange-ment. This includes clusters of four or more identical elements, along with three identical ele-ments in a row arranged horizontally or vertically. Note that while 𠂭/叕/𠈌/㸚/㗊/⿱𢆶𢆶/𠱠 as stand-alone Han ideographs are considered unusual, their use as components of other Han ideographs is not, and Han ideographs that include such components have therefore been explicitly excluded.

Category Block Code Point Ideograph Code Chart Excerpt

U URO U+53D5 叕U URO U+58E8 壨U URO U+6724 朤U URO U+71DA 燚U URO U+833B 茻U URO U+96E6 雦U URO U+971B 霛U A U+35CA 㗊U A U+37A1 㞡U A U+382D 㠭U A U+3D07 㴇

Page 32: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

32

Category Block Code Point Ideograph Code Chart Excerpt

U A U+3D58 㵘U A U+3E1A 㸚U A U+4A3B 䨻U A U+4C9C 䲜U B U+20060 𠁠U B U+20069 𠁩U B U+2006E 𠁮U B U+200AD 𠂭U B U+200EC 𠃬U B U+2011E 𠄞U B U+2011F 𠄟U B U+20120 𠄠U B U+2018F 𠆏U B U+2020C 𠈌U B U+204D9 𠓙U B U+2053B 𠔻U B U+2055E 𠕞U B U+20562 𠕢U B U+20572 𠕲

Page 33: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

33

Category Block Code Point Ideograph Code Chart Excerpt

U B U+20658 𠙘U B U+20674 𠙴U B U+20692 𠚒U B U+20908 𠤈U B U+20AEC 𠫬U B U+20B11 𠬑U B U+20B97 𠮗U B U+20C60 𠱠U B U+20D3F 𠴿U B U+21236 𡈶U B U+21239 𡈹U B U+214FF 𡓿U B U+21547 𡕇U B U+21685 𡚅U B U+2168C 𡚌U B U+216A6 𡚦U B U+2189B 𡢛U B U+21966 𡥦U B U+2198F 𡦏

Page 34: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

34

Category Block Code Point Ideograph Code Chart Excerpt

U B U+219AA 𡦪U B U+219B7 𡦷U B U+21ADF 𡫟U B U+21AE9 𡫩U B U+21AF3 𡫳U B U+21AFC 𡫼U B U+21AFE 𡫾U B U+21B1E 𡬞U B U+21B6F 𡭯U B U+21B90 𡮐U B U+21B9F 𡮟U B U+2201A 𢀚U B U+22191 𢆑U B U+22330 𢌰U B U+2233D 𢌽U B U+223DD 𢏝U B U+22434 𢐴U B U+2244D 𢑍U B U+2247D 𢑽

Page 35: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

35

Category Block Code Point Ideograph Code Chart Excerpt

U B U+224B9 𢒹U B U+22911 𢤑U B U+2295B 𢥛U B U+2295C 𢥜U B U+22973 𢥳U B U+229DF 𢧟U B U+22A55 𢩕U B U+22EA6 𢺦U B U+232AB 𣊫U B U+232AD 𣊭U B U+23320 𣌠U B U+23786 𣞆U B U+2384D 𣡍U B U+23855 𣡕U B U+2387A 𣡺U B U+2387D 𣡽U B U+2387E 𣡾U B U+23B05 𣬅U B U+23C9C 𣲜

Page 36: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

36

Category Block Code Point Ideograph Code Chart Excerpt

U B U+240CA 𤃊U B U+24181 𤆁U B U+243CE 𤏎U B U+2452A 𤔪U B U+2452D 𤔭U B U+2453C 𤔼U B U+24540 𤕀U B U+2459A 𤖚U B U+246ED 𤛭U B U+247B9 𤞹U B U+249A1 𤦡U B U+24CF3 𤳳U B U+24CF5 𤳵U B U+24CF9 𤳹U B U+24D04 𤴄U B U+24D07 𤴇U B U+24D0A 𤴊U B U+24D0C 𤴌U B U+24D10 𤴐

Page 37: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

37

Category Block Code Point Ideograph Code Chart Excerpt

U B U+24D11 𤴑U B U+24D12 𤴒U B U+24FC4 𤿄U B U+2516B 𥅫U B U+253C0 𥏀U B U+25506 𥔆U B U+255C9 𥗉U B U+2597C 𥥼U B U+25AD3 𥫓U B U+25D10 𥴐U B U+25DE0 𥷠U B U+25DF9 𥷹U B U+25F6E 𥽮U B U+2616A 𦅪U B U+26269 𦉩U B U+263F2 𦏲U B U+264CB 𦓋U B U+267ED 𦟭U B U+26817 𦠗

Page 38: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

38

Category Block Code Point Ideograph Code Chart Excerpt

U B U+268F7 𦣷U B U+2697A 𦥺U B U+269F5 𦧵U B U+26A67 𦩧U B U+26B8F 𦮏U B U+26C60 𦱠U B U+26C79 𦱹U B U+26D9B 𦶛U B U+26DF6 𦷶U B U+26F71 𦽱U B U+26FD1 𦿑U B U+27047 𧁇U B U+27172 𧅲U B U+27195 𧆕U B U+27198 𧆘U B U+2749F 𧒟U B U+27517 𧔗U B U+2752F 𧔯U B U+27589 𧖉

Page 39: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

39

Category Block Code Point Ideograph Code Chart Excerpt

U B U+2758E 𧖎U B U+27738 𧜸U B U+27751 𧝑U B U+27763 𧝣U B U+277DC 𧟜U B U+2789B 𧢛U B U+278B0 𧢰U B U+27B5B 𧭛U B U+27B9F 𧮟U B U+27BA6 𧮦U B U+27C8F 𧲏U B U+27F9C 𧾜U B U+27FAD 𧾭U B U+283FF 𨏿U B U+2840B 𨐋U B U+28944 𨥄U B U+28C3B 𨰻U B U+28D1F 𨴟U B U+28DFE 𨷾

Page 40: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

40

Category Block Code Point Ideograph Code Chart Excerpt

U B U+2900F 𩀏U B U+29176 𩅶U B U+291C3 𩇃U B U+291D3 𩇓U B U+291D4 𩇔U B U+2938E 𩎎U B U+29661 𩙡U B U+29663 𩙣U B U+29664 𩙤U B U+29867 𩡧U B U+299DD 𩧝U B U+299E2 𩧢U B U+2A235 𪈵U B U+2A240 𪉀U B U+2A6A5 𪚥U C U+2AC76 𪱶U C U+2AE9A 𪺚U E U+2BB2F 𫬯U E U+2C348 𬍈

Page 41: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

41

Category Block Code Point Ideograph Code Chart Excerpt

U E U+2C47D 𬑽U E U+2C6A9 𬚩U F U+2CEB9 𬺹U F U+2CEDF 𬻟U F U+2D05F 𭁟U F U+2D0BC 𭂼U F U+2D16B 𭅫U F U+2D1E9 𭇩U F U+2D2B2 𭊲U F U+2D2B6 𭊶U F U+2D337 𭌷U F U+2D359 𭍙U F U+2D4DD 𭓝U F U+2D588 𭖈U F U+2D5B2 𭖲U F U+2D600 𭘀U F U+2D651 𭙑U F U+2D6D5 𭛕U F U+2D6D6 𭛖

Page 42: L2/20-059 (Proposal for new provisional Unihan Database ... · 1 L2/20-059 Title: Proposal for new provisional Unihan Database property: kStrange Author: Ken Lunde Date: 2020-01-28

42

Category Block Code Point Ideograph Code Chart Excerpt

U F U+2D8D5 𭣕U F U+2DB74 𭭴U F U+2DEF9 𭻹U F U+2E3B3 𮎳U F U+2EB5F 𮭟U G U+30030

U G U+30031

U G U+3016F

U G U+30649

U G U+3080A

U G U+3088A

U G U+308DE

U G U+30D3F

U G U+30F54

U G U+3106A

U G U+31152

That is all.