mirror of
https://github.com/crystalidea/qt6windows7.git
synced 2025-07-05 00:35:27 +08:00
qt 6.5.1 original
This commit is contained in:
993
util/unicode/data/ArabicShaping.txt
Normal file
993
util/unicode/data/ArabicShaping.txt
Normal file
@ -0,0 +1,993 @@
|
||||
# ArabicShaping-15.0.0.txt
|
||||
# Date: 2022-02-14, 18:50:00 GMT [KW, RP]
|
||||
# © 2022 Unicode®, Inc.
|
||||
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
|
||||
# For terms of use, see https://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
# This file is a normative contributory data file in the
|
||||
# Unicode Character Database.
|
||||
#
|
||||
# This file defines the Joining_Type and Joining_Group property
|
||||
# values for Arabic, Syriac, N'Ko, Mandaic, and Manichaean positional
|
||||
# shaping, repeating in machine readable form the information
|
||||
# exemplified in Tables 9-3, 9-8, 9-9, 9-10, 9-14, 9-15, 9-16, 9-19,
|
||||
# 9-20, 10-4, 10-5, 10-6, 10-7, and 19-5 of The Unicode Standard core
|
||||
# specification. This file also defines Joining_Type values for
|
||||
# Mongolian, Phags-pa, Psalter Pahlavi, Sogdian, Old Uyghur, Chorasmian,
|
||||
# and Adlam positional shaping,
|
||||
# and Joining_Type and Joining_Group values for Hanifi Rohingya positional shaping,
|
||||
# which are not listed in tables in the standard.
|
||||
#
|
||||
# See Sections 9.2, 9.3, 9.5, 10.5, 10.6, 13.5, 14.4, 14.10, 14.11, 16.14, 19.4, and 19.9
|
||||
# of The Unicode Standard core specification for more information.
|
||||
#
|
||||
# Each line contains four fields, separated by a semicolon.
|
||||
#
|
||||
# Field 0: the code point, in 4-digit hexadecimal
|
||||
# form, of a character.
|
||||
#
|
||||
# Field 1: gives a short schematic name for that character.
|
||||
# The schematic name is descriptive of the shape, based as
|
||||
# consistently as possible on a name for the skeleton and
|
||||
# then the diacritic marks applied to the skeleton, if any.
|
||||
# Note that this schematic name is considered a comment,
|
||||
# and does not constitute a formal property value.
|
||||
#
|
||||
# Field 2: defines the joining type (property name: Joining_Type)
|
||||
# R Right_Joining
|
||||
# L Left_Joining
|
||||
# D Dual_Joining
|
||||
# C Join_Causing
|
||||
# U Non_Joining
|
||||
# T Transparent
|
||||
#
|
||||
# See Section 9.2, Arabic for more information on these joining types.
|
||||
# Note that for cursive joining scripts which are typically rendered
|
||||
# top-to-bottom, rather than right-to-left, Joining_Type=L conventionally
|
||||
# refers to bottom joining, and Joining_Type=R conventionally refers
|
||||
# to top joining. See Section 14.4, Phags-pa for more information on the
|
||||
# interpretation of joining types in vertical layout.
|
||||
#
|
||||
# Field 3: defines the joining group (property name: Joining_Group)
|
||||
#
|
||||
# The values of the joining group are based schematically on character
|
||||
# names. Where a schematic character name consists of two or more parts
|
||||
# separated by spaces, the formal Joining_Group property value, as specified in
|
||||
# PropertyValueAliases.txt, consists of the same name parts joined by
|
||||
# underscores. Hence, the entry:
|
||||
#
|
||||
# 0629; TEH MARBUTA; R; TEH MARBUTA
|
||||
#
|
||||
# corresponds to [Joining_Group = Teh_Marbuta].
|
||||
#
|
||||
# Note: The property value now designated [Joining_Group = Teh_Marbuta_Goal]
|
||||
# used to apply to both of the following characters
|
||||
# in earlier versions of the standard:
|
||||
#
|
||||
# U+06C2 ARABIC LETTER HEH GOAL WITH HAMZA ABOVE
|
||||
# U+06C3 ARABIC LETTER TEH MARBUTA GOAL
|
||||
#
|
||||
# However, it currently applies only to U+06C3, and *not* to U+06C2.
|
||||
# To avoid destabilizing existing Joining_Group property aliases, the
|
||||
# prior Joining_Group value for U+06C3 (Hamza_On_Heh_Goal) has been
|
||||
# retained as a property value alias, despite the fact that it
|
||||
# no longer applies to its namesake character, U+06C2.
|
||||
# See PropertyValueAliases.txt.
|
||||
#
|
||||
# When other cursive scripts are added to the Unicode Standard in the
|
||||
# future, the joining group value of all its letters will default to
|
||||
# jg=No_Joining_Group in this data file. Other, more specific
|
||||
# joining group values will be defined only if an explicit proposal
|
||||
# to define those values exactly has been approved by the UTC. This
|
||||
# is the convention exemplified by the N'Ko, Mandaic, Mongolian,
|
||||
# Phags-pa, Psalter Pahlavi, Sogdian, Old Uyghur, Chorasmian, and Adlam scripts.
|
||||
# Only the Arabic, Manichaean, and Syriac scripts currently have
|
||||
# explicit joining group values defined for all characters, including
|
||||
# those which have only a single character in a particular Joining_Group
|
||||
# class. Hanifi Rohingya has explicit Joining_Group values assigned only for
|
||||
# the few characters which share a particular Joining_Group class, but
|
||||
# assigns jg=No_Joining_Group to all the singletons.
|
||||
#
|
||||
# Note: Code points that are not explicitly listed in this file are
|
||||
# either of joining type T or U:
|
||||
#
|
||||
# - Those that are not explicitly listed and that are of General Category Mn, Me, or Cf
|
||||
# have joining type T.
|
||||
# - All others not explicitly listed have joining type U.
|
||||
#
|
||||
# For an explicit listing of all characters of joining type T, see
|
||||
# the derived property file DerivedJoiningType.txt.
|
||||
#
|
||||
# #############################################################
|
||||
|
||||
# Unicode; Schematic Name; Joining Type; Joining Group
|
||||
|
||||
# Arabic Characters
|
||||
|
||||
0600; ARABIC NUMBER SIGN; U; No_Joining_Group
|
||||
0601; ARABIC SIGN SANAH; U; No_Joining_Group
|
||||
0602; ARABIC FOOTNOTE MARKER; U; No_Joining_Group
|
||||
0603; ARABIC SIGN SAFHA; U; No_Joining_Group
|
||||
0604; ARABIC SIGN SAMVAT; U; No_Joining_Group
|
||||
0605; ARABIC NUMBER MARK ABOVE; U; No_Joining_Group
|
||||
0608; ARABIC RAY; U; No_Joining_Group
|
||||
060B; AFGHANI SIGN; U; No_Joining_Group
|
||||
0620; DOTLESS YEH WITH SEPARATE RING BELOW; D; YEH
|
||||
0621; HAMZA; U; No_Joining_Group
|
||||
0622; ALEF WITH MADDA ABOVE; R; ALEF
|
||||
0623; ALEF WITH HAMZA ABOVE; R; ALEF
|
||||
0624; WAW WITH HAMZA ABOVE; R; WAW
|
||||
0625; ALEF WITH HAMZA BELOW; R; ALEF
|
||||
0626; DOTLESS YEH WITH HAMZA ABOVE; D; YEH
|
||||
0627; ALEF; R; ALEF
|
||||
0628; BEH; D; BEH
|
||||
0629; TEH MARBUTA; R; TEH MARBUTA
|
||||
062A; DOTLESS BEH WITH 2 DOTS ABOVE; D; BEH
|
||||
062B; DOTLESS BEH WITH 3 DOTS ABOVE; D; BEH
|
||||
062C; HAH WITH DOT BELOW; D; HAH
|
||||
062D; HAH; D; HAH
|
||||
062E; HAH WITH DOT ABOVE; D; HAH
|
||||
062F; DAL; R; DAL
|
||||
0630; DAL WITH DOT ABOVE; R; DAL
|
||||
0631; REH; R; REH
|
||||
0632; REH WITH DOT ABOVE; R; REH
|
||||
0633; SEEN; D; SEEN
|
||||
0634; SEEN WITH 3 DOTS ABOVE; D; SEEN
|
||||
0635; SAD; D; SAD
|
||||
0636; SAD WITH DOT ABOVE; D; SAD
|
||||
0637; TAH; D; TAH
|
||||
0638; TAH WITH DOT ABOVE; D; TAH
|
||||
0639; AIN; D; AIN
|
||||
063A; AIN WITH DOT ABOVE; D; AIN
|
||||
063B; KEHEH WITH 2 DOTS ABOVE; D; GAF
|
||||
063C; KEHEH WITH 3 DOTS BELOW; D; GAF
|
||||
063D; FARSI YEH WITH INVERTED V ABOVE; D; FARSI YEH
|
||||
063E; FARSI YEH WITH 2 DOTS ABOVE; D; FARSI YEH
|
||||
063F; FARSI YEH WITH 3 DOTS ABOVE; D; FARSI YEH
|
||||
0640; TATWEEL; C; No_Joining_Group
|
||||
0641; FEH; D; FEH
|
||||
0642; QAF; D; QAF
|
||||
0643; KAF; D; KAF
|
||||
0644; LAM; D; LAM
|
||||
0645; MEEM; D; MEEM
|
||||
0646; NOON; D; NOON
|
||||
0647; HEH; D; HEH
|
||||
0648; WAW; R; WAW
|
||||
0649; DOTLESS YEH; D; YEH
|
||||
064A; YEH; D; YEH
|
||||
066E; DOTLESS BEH; D; BEH
|
||||
066F; DOTLESS QAF; D; QAF
|
||||
0671; ALEF WITH WASLA ABOVE; R; ALEF
|
||||
0672; ALEF WITH WAVY HAMZA ABOVE; R; ALEF
|
||||
0673; ALEF WITH WAVY HAMZA BELOW; R; ALEF
|
||||
0674; HIGH HAMZA; U; No_Joining_Group
|
||||
0675; HIGH HAMZA ALEF; R; ALEF
|
||||
0676; HIGH HAMZA WAW; R; WAW
|
||||
0677; HIGH HAMZA WAW WITH COMMA ABOVE; R; WAW
|
||||
0678; HIGH HAMZA DOTLESS YEH; D; YEH
|
||||
0679; DOTLESS BEH WITH TAH ABOVE; D; BEH
|
||||
067A; DOTLESS BEH WITH VERTICAL 2 DOTS ABOVE; D; BEH
|
||||
067B; DOTLESS BEH WITH VERTICAL 2 DOTS BELOW; D; BEH
|
||||
067C; DOTLESS BEH WITH ATTACHED RING BELOW AND 2 DOTS ABOVE; D; BEH
|
||||
067D; DOTLESS BEH WITH INVERTED 3 DOTS ABOVE; D; BEH
|
||||
067E; DOTLESS BEH WITH 3 DOTS BELOW; D; BEH
|
||||
067F; DOTLESS BEH WITH 4 DOTS ABOVE; D; BEH
|
||||
0680; DOTLESS BEH WITH 4 DOTS BELOW; D; BEH
|
||||
0681; HAH WITH HAMZA ABOVE; D; HAH
|
||||
0682; HAH WITH VERTICAL 2 DOTS ABOVE; D; HAH
|
||||
0683; HAH WITH 2 DOTS BELOW; D; HAH
|
||||
0684; HAH WITH VERTICAL 2 DOTS BELOW; D; HAH
|
||||
0685; HAH WITH 3 DOTS ABOVE; D; HAH
|
||||
0686; HAH WITH 3 DOTS BELOW; D; HAH
|
||||
0687; HAH WITH 4 DOTS BELOW; D; HAH
|
||||
0688; DAL WITH TAH ABOVE; R; DAL
|
||||
0689; DAL WITH ATTACHED RING BELOW; R; DAL
|
||||
068A; DAL WITH DOT BELOW; R; DAL
|
||||
068B; DAL WITH DOT BELOW AND TAH ABOVE; R; DAL
|
||||
068C; DAL WITH 2 DOTS ABOVE; R; DAL
|
||||
068D; DAL WITH 2 DOTS BELOW; R; DAL
|
||||
068E; DAL WITH 3 DOTS ABOVE; R; DAL
|
||||
068F; DAL WITH INVERTED 3 DOTS ABOVE; R; DAL
|
||||
0690; DAL WITH 4 DOTS ABOVE; R; DAL
|
||||
0691; REH WITH TAH ABOVE; R; REH
|
||||
0692; REH WITH V ABOVE; R; REH
|
||||
0693; REH WITH ATTACHED RING BELOW; R; REH
|
||||
0694; REH WITH DOT BELOW; R; REH
|
||||
0695; REH WITH V BELOW; R; REH
|
||||
0696; REH WITH DOT BELOW AND DOT WITHIN; R; REH
|
||||
0697; REH WITH 2 DOTS ABOVE; R; REH
|
||||
0698; REH WITH 3 DOTS ABOVE; R; REH
|
||||
0699; REH WITH 4 DOTS ABOVE; R; REH
|
||||
069A; SEEN WITH DOT BELOW AND DOT ABOVE; D; SEEN
|
||||
069B; SEEN WITH 3 DOTS BELOW; D; SEEN
|
||||
069C; SEEN WITH 3 DOTS BELOW AND 3 DOTS ABOVE; D; SEEN
|
||||
069D; SAD WITH 2 DOTS BELOW; D; SAD
|
||||
069E; SAD WITH 3 DOTS ABOVE; D; SAD
|
||||
069F; TAH WITH 3 DOTS ABOVE; D; TAH
|
||||
06A0; AIN WITH 3 DOTS ABOVE; D; AIN
|
||||
06A1; DOTLESS FEH; D; FEH
|
||||
06A2; DOTLESS FEH WITH DOT BELOW; D; FEH
|
||||
06A3; FEH WITH DOT BELOW; D; FEH
|
||||
06A4; DOTLESS FEH WITH 3 DOTS ABOVE; D; FEH
|
||||
06A5; DOTLESS FEH WITH 3 DOTS BELOW; D; FEH
|
||||
06A6; DOTLESS FEH WITH 4 DOTS ABOVE; D; FEH
|
||||
06A7; DOTLESS QAF WITH DOT ABOVE; D; QAF
|
||||
06A8; DOTLESS QAF WITH 3 DOTS ABOVE; D; QAF
|
||||
06A9; KEHEH; D; GAF
|
||||
06AA; SWASH KAF; D; SWASH KAF
|
||||
06AB; KEHEH WITH ATTACHED RING BELOW; D; GAF
|
||||
06AC; KAF WITH DOT ABOVE; D; KAF
|
||||
06AD; KAF WITH 3 DOTS ABOVE; D; KAF
|
||||
06AE; KAF WITH 3 DOTS BELOW; D; KAF
|
||||
06AF; GAF; D; GAF
|
||||
06B0; GAF WITH ATTACHED RING BELOW; D; GAF
|
||||
06B1; GAF WITH 2 DOTS ABOVE; D; GAF
|
||||
06B2; GAF WITH 2 DOTS BELOW; D; GAF
|
||||
06B3; GAF WITH VERTICAL 2 DOTS BELOW; D; GAF
|
||||
06B4; GAF WITH 3 DOTS ABOVE; D; GAF
|
||||
06B5; LAM WITH V ABOVE; D; LAM
|
||||
06B6; LAM WITH DOT ABOVE; D; LAM
|
||||
06B7; LAM WITH 3 DOTS ABOVE; D; LAM
|
||||
06B8; LAM WITH 3 DOTS BELOW; D; LAM
|
||||
06B9; NOON WITH DOT BELOW; D; NOON
|
||||
06BA; DOTLESS NOON; D; NOON
|
||||
06BB; DOTLESS NOON WITH TAH ABOVE; D; NOON
|
||||
06BC; NOON WITH ATTACHED RING BELOW; D; NOON
|
||||
06BD; NYA; D; NYA
|
||||
06BE; KNOTTED HEH; D; KNOTTED HEH
|
||||
06BF; HAH WITH 3 DOTS BELOW AND DOT ABOVE; D; HAH
|
||||
06C0; DOTLESS TEH MARBUTA WITH HAMZA ABOVE; R; TEH MARBUTA
|
||||
06C1; HEH GOAL; D; HEH GOAL
|
||||
06C2; HEH GOAL WITH HAMZA ABOVE; D; HEH GOAL
|
||||
06C3; TEH MARBUTA GOAL; R; TEH MARBUTA GOAL
|
||||
06C4; WAW WITH ATTACHED RING WITHIN; R; WAW
|
||||
06C5; WAW WITH LOOP; R; WAW
|
||||
06C6; WAW WITH V ABOVE; R; WAW
|
||||
06C7; WAW WITH COMMA ABOVE; R; WAW
|
||||
06C8; WAW WITH ALEF ABOVE; R; WAW
|
||||
06C9; WAW WITH INVERTED V ABOVE; R; WAW
|
||||
06CA; WAW WITH 2 DOTS ABOVE; R; WAW
|
||||
06CB; WAW WITH 3 DOTS ABOVE; R; WAW
|
||||
06CC; FARSI YEH; D; FARSI YEH
|
||||
06CD; YEH WITH TAIL; R; YEH WITH TAIL
|
||||
06CE; FARSI YEH WITH V ABOVE; D; FARSI YEH
|
||||
06CF; WAW WITH DOT ABOVE; R; WAW
|
||||
06D0; DOTLESS YEH WITH VERTICAL 2 DOTS BELOW; D; YEH
|
||||
06D1; DOTLESS YEH WITH 3 DOTS BELOW; D; YEH
|
||||
06D2; YEH BARREE; R; YEH BARREE
|
||||
06D3; YEH BARREE WITH HAMZA ABOVE; R; YEH BARREE
|
||||
06D5; DOTLESS TEH MARBUTA; R; TEH MARBUTA
|
||||
06DD; ARABIC END OF AYAH; U; No_Joining_Group
|
||||
06EE; DAL WITH INVERTED V ABOVE; R; DAL
|
||||
06EF; REH WITH INVERTED V ABOVE; R; REH
|
||||
06FA; SEEN WITH DOT BELOW AND 3 DOTS ABOVE; D; SEEN
|
||||
06FB; SAD WITH DOT BELOW AND DOT ABOVE; D; SAD
|
||||
06FC; AIN WITH DOT BELOW AND DOT ABOVE; D; AIN
|
||||
06FF; KNOTTED HEH WITH INVERTED V ABOVE; D; KNOTTED HEH
|
||||
|
||||
# Syriac Characters
|
||||
|
||||
070F; SYRIAC ABBREVIATION MARK; T; No_Joining_Group
|
||||
0710; ALAPH; R; ALAPH
|
||||
0712; BETH; D; BETH
|
||||
0713; GAMAL; D; GAMAL
|
||||
0714; GAMAL GARSHUNI; D; GAMAL
|
||||
0715; DALATH; R; DALATH RISH
|
||||
0716; DOTLESS DALATH RISH; R; DALATH RISH
|
||||
0717; HE; R; HE
|
||||
0718; WAW; R; SYRIAC WAW
|
||||
0719; ZAIN; R; ZAIN
|
||||
071A; HETH; D; HETH
|
||||
071B; TETH; D; TETH
|
||||
071C; TETH GARSHUNI; D; TETH
|
||||
071D; YUDH; D; YUDH
|
||||
071E; YUDH HE; R; YUDH HE
|
||||
071F; KAPH; D; KAPH
|
||||
0720; LAMADH; D; LAMADH
|
||||
0721; MIM; D; MIM
|
||||
0722; NUN; D; NUN
|
||||
0723; SEMKATH; D; SEMKATH
|
||||
0724; FINAL SEMKATH; D; FINAL SEMKATH
|
||||
0725; E; D; E
|
||||
0726; PE; D; PE
|
||||
0727; REVERSED PE; D; REVERSED PE
|
||||
0728; SADHE; R; SADHE
|
||||
0729; QAPH; D; QAPH
|
||||
072A; RISH; R; DALATH RISH
|
||||
072B; SHIN; D; SHIN
|
||||
072C; TAW; R; TAW
|
||||
072D; PERSIAN BHETH; D; BETH
|
||||
072E; PERSIAN GHAMAL; D; GAMAL
|
||||
072F; PERSIAN DHALATH; R; DALATH RISH
|
||||
074D; SOGDIAN ZHAIN; R; ZHAIN
|
||||
074E; SOGDIAN KHAPH; D; KHAPH
|
||||
074F; SOGDIAN FE; D; FE
|
||||
|
||||
# Arabic Supplement Characters
|
||||
|
||||
0750; DOTLESS BEH WITH HORIZONTAL 3 DOTS BELOW; D; BEH
|
||||
0751; BEH WITH 3 DOTS ABOVE; D; BEH
|
||||
0752; DOTLESS BEH WITH INVERTED 3 DOTS BELOW; D; BEH
|
||||
0753; DOTLESS BEH WITH INVERTED 3 DOTS BELOW AND 2 DOTS ABOVE; D; BEH
|
||||
0754; DOTLESS BEH WITH 2 DOTS BELOW AND DOT ABOVE; D; BEH
|
||||
0755; DOTLESS BEH WITH INVERTED V BELOW; D; BEH
|
||||
0756; DOTLESS BEH WITH V ABOVE; D; BEH
|
||||
0757; HAH WITH 2 DOTS ABOVE; D; HAH
|
||||
0758; HAH WITH INVERTED 3 DOTS BELOW; D; HAH
|
||||
0759; DAL WITH VERTICAL 2 DOTS BELOW AND TAH ABOVE; R; DAL
|
||||
075A; DAL WITH INVERTED V BELOW; R; DAL
|
||||
075B; REH WITH BAR; R; REH
|
||||
075C; SEEN WITH 4 DOTS ABOVE; D; SEEN
|
||||
075D; AIN WITH 2 DOTS ABOVE; D; AIN
|
||||
075E; AIN WITH INVERTED 3 DOTS ABOVE; D; AIN
|
||||
075F; AIN WITH VERTICAL 2 DOTS ABOVE; D; AIN
|
||||
0760; DOTLESS FEH WITH 2 DOTS BELOW; D; FEH
|
||||
0761; DOTLESS FEH WITH INVERTED 3 DOTS BELOW; D; FEH
|
||||
0762; KEHEH WITH DOT ABOVE; D; GAF
|
||||
0763; KEHEH WITH 3 DOTS ABOVE; D; GAF
|
||||
0764; KEHEH WITH INVERTED 3 DOTS BELOW; D; GAF
|
||||
0765; MEEM WITH DOT ABOVE; D; MEEM
|
||||
0766; MEEM WITH DOT BELOW; D; MEEM
|
||||
0767; NOON WITH 2 DOTS BELOW; D; NOON
|
||||
0768; NOON WITH TAH ABOVE; D; NOON
|
||||
0769; NOON WITH V ABOVE; D; NOON
|
||||
076A; LAM WITH BAR; D; LAM
|
||||
076B; REH WITH VERTICAL 2 DOTS ABOVE; R; REH
|
||||
076C; REH WITH HAMZA ABOVE; R; REH
|
||||
076D; SEEN WITH VERTICAL 2 DOTS ABOVE; D; SEEN
|
||||
076E; HAH WITH TAH BELOW; D; HAH
|
||||
076F; HAH WITH TAH AND 2 DOTS BELOW; D; HAH
|
||||
0770; SEEN WITH 2 DOTS AND TAH ABOVE; D; SEEN
|
||||
0771; REH WITH 2 DOTS AND TAH ABOVE; R; REH
|
||||
0772; HAH WITH TAH ABOVE; D; HAH
|
||||
0773; ALEF WITH DIGIT TWO ABOVE; R; ALEF
|
||||
0774; ALEF WITH DIGIT THREE ABOVE; R; ALEF
|
||||
0775; FARSI YEH WITH DIGIT TWO ABOVE; D; FARSI YEH
|
||||
0776; FARSI YEH WITH DIGIT THREE ABOVE; D; FARSI YEH
|
||||
0777; DOTLESS YEH WITH DIGIT FOUR BELOW; D; YEH
|
||||
0778; WAW WITH DIGIT TWO ABOVE; R; WAW
|
||||
0779; WAW WITH DIGIT THREE ABOVE; R; WAW
|
||||
077A; BURUSHASKI YEH BARREE WITH DIGIT TWO ABOVE; D; BURUSHASKI YEH BARREE
|
||||
077B; BURUSHASKI YEH BARREE WITH DIGIT THREE ABOVE; D; BURUSHASKI YEH BARREE
|
||||
077C; HAH WITH DIGIT FOUR BELOW; D; HAH
|
||||
077D; SEEN WITH DIGIT FOUR ABOVE; D; SEEN
|
||||
077E; SEEN WITH INVERTED V ABOVE; D; SEEN
|
||||
077F; KAF WITH 2 DOTS ABOVE; D; KAF
|
||||
|
||||
# N'Ko Characters
|
||||
|
||||
07CA; NKO A; D; No_Joining_Group
|
||||
07CB; NKO EE; D; No_Joining_Group
|
||||
07CC; NKO I; D; No_Joining_Group
|
||||
07CD; NKO E; D; No_Joining_Group
|
||||
07CE; NKO U; D; No_Joining_Group
|
||||
07CF; NKO OO; D; No_Joining_Group
|
||||
07D0; NKO O; D; No_Joining_Group
|
||||
07D1; NKO DAGBASINNA; D; No_Joining_Group
|
||||
07D2; NKO N; D; No_Joining_Group
|
||||
07D3; NKO BA; D; No_Joining_Group
|
||||
07D4; NKO PA; D; No_Joining_Group
|
||||
07D5; NKO TA; D; No_Joining_Group
|
||||
07D6; NKO JA; D; No_Joining_Group
|
||||
07D7; NKO CHA; D; No_Joining_Group
|
||||
07D8; NKO DA; D; No_Joining_Group
|
||||
07D9; NKO RA; D; No_Joining_Group
|
||||
07DA; NKO RRA; D; No_Joining_Group
|
||||
07DB; NKO SA; D; No_Joining_Group
|
||||
07DC; NKO GBA; D; No_Joining_Group
|
||||
07DD; NKO FA; D; No_Joining_Group
|
||||
07DE; NKO KA; D; No_Joining_Group
|
||||
07DF; NKO LA; D; No_Joining_Group
|
||||
07E0; NKO NA WOLOSO; D; No_Joining_Group
|
||||
07E1; NKO MA; D; No_Joining_Group
|
||||
07E2; NKO NYA; D; No_Joining_Group
|
||||
07E3; NKO NA; D; No_Joining_Group
|
||||
07E4; NKO HA; D; No_Joining_Group
|
||||
07E5; NKO WA; D; No_Joining_Group
|
||||
07E6; NKO YA; D; No_Joining_Group
|
||||
07E7; NKO NYA WOLOSO; D; No_Joining_Group
|
||||
07E8; NKO JONA JA; D; No_Joining_Group
|
||||
07E9; NKO JONA CHA; D; No_Joining_Group
|
||||
07EA; NKO JONA RA; D; No_Joining_Group
|
||||
07FA; NKO LAJANYALAN; C; No_Joining_Group
|
||||
|
||||
# Mandaic Characters
|
||||
|
||||
0840; MANDAIC HALQA; R; No_Joining_Group
|
||||
0841; MANDAIC AB; D; No_Joining_Group
|
||||
0842; MANDAIC AG; D; No_Joining_Group
|
||||
0843; MANDAIC AD; D; No_Joining_Group
|
||||
0844; MANDAIC AH; D; No_Joining_Group
|
||||
0845; MANDAIC USHENNA; D; No_Joining_Group
|
||||
0846; MANDAIC AZ; R; No_Joining_Group
|
||||
0847; MANDAIC IT; R; No_Joining_Group
|
||||
0848; MANDAIC ATT; D; No_Joining_Group
|
||||
0849; MANDAIC AKSA; R; No_Joining_Group
|
||||
084A; MANDAIC AK; D; No_Joining_Group
|
||||
084B; MANDAIC AL; D; No_Joining_Group
|
||||
084C; MANDAIC AM; D; No_Joining_Group
|
||||
084D; MANDAIC AN; D; No_Joining_Group
|
||||
084E; MANDAIC AS; D; No_Joining_Group
|
||||
084F; MANDAIC IN; D; No_Joining_Group
|
||||
0850; MANDAIC AP; D; No_Joining_Group
|
||||
0851; MANDAIC ASZ; D; No_Joining_Group
|
||||
0852; MANDAIC AQ; D; No_Joining_Group
|
||||
0853; MANDAIC AR; D; No_Joining_Group
|
||||
0854; MANDAIC ASH; R; No_Joining_Group
|
||||
0855; MANDAIC AT; D; No_Joining_Group
|
||||
0856; MANDAIC DUSHENNA; R; No_Joining_Group
|
||||
0857; MANDAIC KAD; R; No_Joining_Group
|
||||
0858; MANDAIC AIN; R; No_Joining_Group
|
||||
|
||||
# Syriac Supplement Characters
|
||||
|
||||
0860; MALAYALAM NGA; D; MALAYALAM NGA
|
||||
0861; MALAYALAM JA; U; MALAYALAM JA
|
||||
0862; MALAYALAM NYA; D; MALAYALAM NYA
|
||||
0863; MALAYALAM TTA; D; MALAYALAM TTA
|
||||
0864; MALAYALAM NNA; D; MALAYALAM NNA
|
||||
0865; MALAYALAM NNNA; D; MALAYALAM NNNA
|
||||
0866; MALAYALAM BHA; U; MALAYALAM BHA
|
||||
0867; MALAYALAM RA; R; MALAYALAM RA
|
||||
0868; MALAYALAM LLA; D; MALAYALAM LLA
|
||||
0869; MALAYALAM LLLA; R; MALAYALAM LLLA
|
||||
086A; MALAYALAM SSA; R; MALAYALAM SSA
|
||||
|
||||
# Arabic Extended-B Characters
|
||||
|
||||
0870; ALEF WITH ATTACHED FATHA; R; ALEF
|
||||
0871; ALEF WITH ATTACHED TOP RIGHT FATHA; R; ALEF
|
||||
0872; ALEF WITH RIGHT MIDDLE STROKE; R; ALEF
|
||||
0873; ALEF WITH LEFT MIDDLE STROKE; R; ALEF
|
||||
0874; ALEF WITH ATTACHED KASRA; R; ALEF
|
||||
0875; ALEF WITH ATTACHED BOTTOM RIGHT KASRA; R; ALEF
|
||||
0876; ALEF WITH ATTACHED ROUND DOT ABOVE; R; ALEF
|
||||
0877; ALEF WITH ATTACHED RIGHT ROUND DOT; R; ALEF
|
||||
0878; ALEF WITH ATTACHED LEFT ROUND DOT; R; ALEF
|
||||
0879; ALEF WITH ATTACHED ROUND DOT BELOW; R; ALEF
|
||||
087A; ALEF WITH DOT ABOVE; R; ALEF
|
||||
087B; ALEF WITH ATTACHED TOP RIGHT FATHA AND DOT ABOVE; R; ALEF
|
||||
087C; ALEF WITH RIGHT MIDDLE STROKE AND DOT ABOVE; R; ALEF
|
||||
087D; ALEF WITH ATTACHED BOTTOM RIGHT KASRA AND DOT ABOVE; R; ALEF
|
||||
087E; ALEF WITH ATTACHED TOP RIGHT FATHA AND LEFT RING; R; ALEF
|
||||
087F; ALEF WITH RIGHT MIDDLE STROKE AND LEFT RING; R; ALEF
|
||||
0880; ALEF WITH ATTACHED BOTTOM RIGHT KASRA AND LEFT RING; R; ALEF
|
||||
0881; ALEF WITH ATTACHED RIGHT HAMZA; R; ALEF
|
||||
0882; ALEF WITH ATTACHED LEFT HAMZA; R; ALEF
|
||||
0883; TATWEEL WITH OVERSTRUCK HAMZA; C; No_Joining_Group
|
||||
0884; TATWEEL WITH OVERSTRUCK WAW; C; No_Joining_Group
|
||||
0885; TATWEEL WITH TWO DOTS BELOW; C; No_Joining_Group
|
||||
0886; THIN YEH; D; THIN YEH
|
||||
0887; ARABIC BASELINE ROUND DOT; U; No_Joining_Group
|
||||
0888; ARABIC RAISED ROUND DOT; U; No_Joining_Group
|
||||
0889; DOTLESS NOON WITH INVERTED V ABOVE; D; NOON
|
||||
088A; HAH WITH INVERTED V BELOW; D; HAH
|
||||
088B; TAH WITH DOT BELOW; D; TAH
|
||||
088C; TAH WITH 3 DOTS BELOW; D; TAH
|
||||
088D; KEHEH WITH VERTICAL 2 DOTS BELOW; D; GAF
|
||||
088E; VERTICAL TAIL; R; VERTICAL TAIL
|
||||
0890; ARABIC POUND MARK ABOVE; U; No_Joining_Group
|
||||
0891; ARABIC PIASTRE MARK ABOVE; U; No_Joining_Group
|
||||
|
||||
# Arabic Extended-A Characters
|
||||
|
||||
08A0; DOTLESS BEH WITH V BELOW; D; BEH
|
||||
08A1; BEH WITH HAMZA ABOVE; D; BEH
|
||||
08A2; HAH WITH DOT BELOW AND 2 DOTS ABOVE; D; HAH
|
||||
08A3; TAH WITH 2 DOTS ABOVE; D; TAH
|
||||
08A4; DOTLESS FEH WITH DOT BELOW AND 3 DOTS ABOVE; D; FEH
|
||||
08A5; QAF WITH DOT BELOW; D; QAF
|
||||
08A6; LAM WITH DOUBLE BAR; D; LAM
|
||||
08A7; MEEM WITH 3 DOTS ABOVE; D; MEEM
|
||||
08A8; YEH WITH HAMZA ABOVE; D; YEH
|
||||
08A9; YEH WITH DOT ABOVE; D; YEH
|
||||
08AA; REH WITH LOOP; R; REH
|
||||
08AB; WAW WITH DOT WITHIN; R; WAW
|
||||
08AC; ROHINGYA YEH; R; ROHINGYA YEH
|
||||
08AD; LOW ALEF; U; No_Joining_Group
|
||||
08AE; DAL WITH 3 DOTS BELOW; R; DAL
|
||||
08AF; SAD WITH 3 DOTS BELOW; D; SAD
|
||||
08B0; KEHEH WITH STROKE BELOW; D; GAF
|
||||
08B1; STRAIGHT WAW; R; STRAIGHT WAW
|
||||
08B2; REH WITH DOT AND INVERTED V ABOVE; R; REH
|
||||
08B3; AIN WITH 3 DOTS BELOW; D; AIN
|
||||
08B4; KAF WITH DOT BELOW; D; KAF
|
||||
08B5; DOTLESS QAF WITH DOT BELOW; D; QAF
|
||||
08B6; BEH WITH MEEM ABOVE; D; BEH
|
||||
08B7; DOTLESS BEH WITH 3 DOTS BELOW AND MEEM ABOVE; D; BEH
|
||||
08B8; DOTLESS BEH WITH TEH ABOVE; D; BEH
|
||||
08B9; REH WITH NOON ABOVE; R; REH
|
||||
08BA; YEH WITH NOON ABOVE; D; YEH
|
||||
08BB; AFRICAN FEH; D; AFRICAN FEH
|
||||
08BC; AFRICAN QAF; D; AFRICAN QAF
|
||||
08BD; AFRICAN NOON; D; AFRICAN NOON
|
||||
08BE; DOTLESS BEH WITH 3 DOTS BELOW AND V ABOVE; D; BEH
|
||||
08BF; DOTLESS BEH WITH 2 DOTS AND V ABOVE; D; BEH
|
||||
08C0; DOTLESS BEH WITH TAH AND V ABOVE; D; BEH
|
||||
08C1; HAH WITH 3 DOTS BELOW AND V ABOVE; D; HAH
|
||||
08C2; KEHEH WITH V ABOVE; D; GAF
|
||||
08C3; AIN WITH DIAMOND 4 DOTS ABOVE; D; AIN
|
||||
08C4; AFRICAN QAF WITH 3 DOTS ABOVE; D; AFRICAN QAF
|
||||
08C5; HAH WITH DOT BELOW AND 3 DOTS ABOVE; D; HAH
|
||||
08C6; HAH WITH DIAMOND 4 DOTS BELOW; D; HAH
|
||||
08C7; LAM WITH TAH ABOVE; D; LAM
|
||||
08C8; KEHEH WITH ELONGATED HAMZA ABOVE; D; GAF
|
||||
08E2; ARABIC DISPUTED END OF AYAH; U; No_Joining_Group
|
||||
|
||||
# Mongolian Characters
|
||||
|
||||
1806; MONGOLIAN TODO SOFT HYPHEN; U; No_Joining_Group
|
||||
1807; MONGOLIAN SIBE SYLLABLE BOUNDARY MARKER; D; No_Joining_Group
|
||||
180A; MONGOLIAN NIRUGU; C; No_Joining_Group
|
||||
180E; MONGOLIAN VOWEL SEPARATOR; U; No_Joining_Group
|
||||
1820; MONGOLIAN A; D; No_Joining_Group
|
||||
1821; MONGOLIAN E; D; No_Joining_Group
|
||||
1822; MONGOLIAN I; D; No_Joining_Group
|
||||
1823; MONGOLIAN O; D; No_Joining_Group
|
||||
1824; MONGOLIAN U; D; No_Joining_Group
|
||||
1825; MONGOLIAN OE; D; No_Joining_Group
|
||||
1826; MONGOLIAN UE; D; No_Joining_Group
|
||||
1827; MONGOLIAN EE; D; No_Joining_Group
|
||||
1828; MONGOLIAN NA; D; No_Joining_Group
|
||||
1829; MONGOLIAN ANG; D; No_Joining_Group
|
||||
182A; MONGOLIAN BA; D; No_Joining_Group
|
||||
182B; MONGOLIAN PA; D; No_Joining_Group
|
||||
182C; MONGOLIAN QA; D; No_Joining_Group
|
||||
182D; MONGOLIAN GA; D; No_Joining_Group
|
||||
182E; MONGOLIAN MA; D; No_Joining_Group
|
||||
182F; MONGOLIAN LA; D; No_Joining_Group
|
||||
1830; MONGOLIAN SA; D; No_Joining_Group
|
||||
1831; MONGOLIAN SHA; D; No_Joining_Group
|
||||
1832; MONGOLIAN TA; D; No_Joining_Group
|
||||
1833; MONGOLIAN DA; D; No_Joining_Group
|
||||
1834; MONGOLIAN CHA; D; No_Joining_Group
|
||||
1835; MONGOLIAN JA; D; No_Joining_Group
|
||||
1836; MONGOLIAN YA; D; No_Joining_Group
|
||||
1837; MONGOLIAN RA; D; No_Joining_Group
|
||||
1838; MONGOLIAN WA; D; No_Joining_Group
|
||||
1839; MONGOLIAN FA; D; No_Joining_Group
|
||||
183A; MONGOLIAN KA; D; No_Joining_Group
|
||||
183B; MONGOLIAN KHA; D; No_Joining_Group
|
||||
183C; MONGOLIAN TSA; D; No_Joining_Group
|
||||
183D; MONGOLIAN ZA; D; No_Joining_Group
|
||||
183E; MONGOLIAN HAA; D; No_Joining_Group
|
||||
183F; MONGOLIAN ZRA; D; No_Joining_Group
|
||||
1840; MONGOLIAN LHA; D; No_Joining_Group
|
||||
1841; MONGOLIAN ZHI; D; No_Joining_Group
|
||||
1842; MONGOLIAN CHI; D; No_Joining_Group
|
||||
1843; MONGOLIAN TODO LONG VOWEL SIGN; D; No_Joining_Group
|
||||
1844; MONGOLIAN TODO E; D; No_Joining_Group
|
||||
1845; MONGOLIAN TODO I; D; No_Joining_Group
|
||||
1846; MONGOLIAN TODO O; D; No_Joining_Group
|
||||
1847; MONGOLIAN TODO U; D; No_Joining_Group
|
||||
1848; MONGOLIAN TODO OE; D; No_Joining_Group
|
||||
1849; MONGOLIAN TODO UE; D; No_Joining_Group
|
||||
184A; MONGOLIAN TODO ANG; D; No_Joining_Group
|
||||
184B; MONGOLIAN TODO BA; D; No_Joining_Group
|
||||
184C; MONGOLIAN TODO PA; D; No_Joining_Group
|
||||
184D; MONGOLIAN TODO QA; D; No_Joining_Group
|
||||
184E; MONGOLIAN TODO GA; D; No_Joining_Group
|
||||
184F; MONGOLIAN TODO MA; D; No_Joining_Group
|
||||
1850; MONGOLIAN TODO TA; D; No_Joining_Group
|
||||
1851; MONGOLIAN TODO DA; D; No_Joining_Group
|
||||
1852; MONGOLIAN TODO CHA; D; No_Joining_Group
|
||||
1853; MONGOLIAN TODO JA; D; No_Joining_Group
|
||||
1854; MONGOLIAN TODO TSA; D; No_Joining_Group
|
||||
1855; MONGOLIAN TODO YA; D; No_Joining_Group
|
||||
1856; MONGOLIAN TODO WA; D; No_Joining_Group
|
||||
1857; MONGOLIAN TODO KA; D; No_Joining_Group
|
||||
1858; MONGOLIAN TODO GAA; D; No_Joining_Group
|
||||
1859; MONGOLIAN TODO HAA; D; No_Joining_Group
|
||||
185A; MONGOLIAN TODO JIA; D; No_Joining_Group
|
||||
185B; MONGOLIAN TODO NIA; D; No_Joining_Group
|
||||
185C; MONGOLIAN TODO DZA; D; No_Joining_Group
|
||||
185D; MONGOLIAN SIBE E; D; No_Joining_Group
|
||||
185E; MONGOLIAN SIBE I; D; No_Joining_Group
|
||||
185F; MONGOLIAN SIBE IY; D; No_Joining_Group
|
||||
1860; MONGOLIAN SIBE UE; D; No_Joining_Group
|
||||
1861; MONGOLIAN SIBE U; D; No_Joining_Group
|
||||
1862; MONGOLIAN SIBE ANG; D; No_Joining_Group
|
||||
1863; MONGOLIAN SIBE KA; D; No_Joining_Group
|
||||
1864; MONGOLIAN SIBE GA; D; No_Joining_Group
|
||||
1865; MONGOLIAN SIBE HA; D; No_Joining_Group
|
||||
1866; MONGOLIAN SIBE PA; D; No_Joining_Group
|
||||
1867; MONGOLIAN SIBE SHA; D; No_Joining_Group
|
||||
1868; MONGOLIAN SIBE TA; D; No_Joining_Group
|
||||
1869; MONGOLIAN SIBE DA; D; No_Joining_Group
|
||||
186A; MONGOLIAN SIBE JA; D; No_Joining_Group
|
||||
186B; MONGOLIAN SIBE FA; D; No_Joining_Group
|
||||
186C; MONGOLIAN SIBE GAA; D; No_Joining_Group
|
||||
186D; MONGOLIAN SIBE HAA; D; No_Joining_Group
|
||||
186E; MONGOLIAN SIBE TSA; D; No_Joining_Group
|
||||
186F; MONGOLIAN SIBE ZA; D; No_Joining_Group
|
||||
1870; MONGOLIAN SIBE RAA; D; No_Joining_Group
|
||||
1871; MONGOLIAN SIBE CHA; D; No_Joining_Group
|
||||
1872; MONGOLIAN SIBE ZHA; D; No_Joining_Group
|
||||
1873; MONGOLIAN MANCHU I; D; No_Joining_Group
|
||||
1874; MONGOLIAN MANCHU KA; D; No_Joining_Group
|
||||
1875; MONGOLIAN MANCHU RA; D; No_Joining_Group
|
||||
1876; MONGOLIAN MANCHU FA; D; No_Joining_Group
|
||||
1877; MONGOLIAN MANCHU ZHA; D; No_Joining_Group
|
||||
1878; MONGOLIAN MANCHU CHA WITH 2 DOTS; D; No_Joining_Group
|
||||
1880; MONGOLIAN ALI GALI ANUSVARA ONE; U; No_Joining_Group
|
||||
1881; MONGOLIAN ALI GALI VISARGA ONE; U; No_Joining_Group
|
||||
1882; MONGOLIAN ALI GALI DAMARU; U; No_Joining_Group
|
||||
1883; MONGOLIAN ALI GALI UBADAMA; U; No_Joining_Group
|
||||
1884; MONGOLIAN ALI GALI INVERTED UBADAMA; U; No_Joining_Group
|
||||
1885; MONGOLIAN ALI GALI BALUDA; T; No_Joining_Group
|
||||
1886; MONGOLIAN ALI GALI THREE BALUDA; T; No_Joining_Group
|
||||
1887; MONGOLIAN ALI GALI A; D; No_Joining_Group
|
||||
1888; MONGOLIAN ALI GALI I; D; No_Joining_Group
|
||||
1889; MONGOLIAN ALI GALI KA; D; No_Joining_Group
|
||||
188A; MONGOLIAN ALI GALI NGA; D; No_Joining_Group
|
||||
188B; MONGOLIAN ALI GALI CA; D; No_Joining_Group
|
||||
188C; MONGOLIAN ALI GALI TTA; D; No_Joining_Group
|
||||
188D; MONGOLIAN ALI GALI TTHA; D; No_Joining_Group
|
||||
188E; MONGOLIAN ALI GALI DDA; D; No_Joining_Group
|
||||
188F; MONGOLIAN ALI GALI NNA; D; No_Joining_Group
|
||||
1890; MONGOLIAN ALI GALI TA; D; No_Joining_Group
|
||||
1891; MONGOLIAN ALI GALI DA; D; No_Joining_Group
|
||||
1892; MONGOLIAN ALI GALI PA; D; No_Joining_Group
|
||||
1893; MONGOLIAN ALI GALI PHA; D; No_Joining_Group
|
||||
1894; MONGOLIAN ALI GALI SSA; D; No_Joining_Group
|
||||
1895; MONGOLIAN ALI GALI ZHA; D; No_Joining_Group
|
||||
1896; MONGOLIAN ALI GALI ZA; D; No_Joining_Group
|
||||
1897; MONGOLIAN ALI GALI AH; D; No_Joining_Group
|
||||
1898; MONGOLIAN TODO ALI GALI TA; D; No_Joining_Group
|
||||
1899; MONGOLIAN TODO ALI GALI ZHA; D; No_Joining_Group
|
||||
189A; MONGOLIAN MANCHU ALI GALI GHA; D; No_Joining_Group
|
||||
189B; MONGOLIAN MANCHU ALI GALI NGA; D; No_Joining_Group
|
||||
189C; MONGOLIAN MANCHU ALI GALI CA; D; No_Joining_Group
|
||||
189D; MONGOLIAN MANCHU ALI GALI JHA; D; No_Joining_Group
|
||||
189E; MONGOLIAN MANCHU ALI GALI TTA; D; No_Joining_Group
|
||||
189F; MONGOLIAN MANCHU ALI GALI DDHA; D; No_Joining_Group
|
||||
18A0; MONGOLIAN MANCHU ALI GALI TA; D; No_Joining_Group
|
||||
18A1; MONGOLIAN MANCHU ALI GALI DHA; D; No_Joining_Group
|
||||
18A2; MONGOLIAN MANCHU ALI GALI SSA; D; No_Joining_Group
|
||||
18A3; MONGOLIAN MANCHU ALI GALI CYA; D; No_Joining_Group
|
||||
18A4; MONGOLIAN MANCHU ALI GALI ZHA; D; No_Joining_Group
|
||||
18A5; MONGOLIAN MANCHU ALI GALI ZA; D; No_Joining_Group
|
||||
18A6; MONGOLIAN ALI GALI HALF U; D; No_Joining_Group
|
||||
18A7; MONGOLIAN ALI GALI HALF YA; D; No_Joining_Group
|
||||
18A8; MONGOLIAN MANCHU ALI GALI BHA; D; No_Joining_Group
|
||||
18AA; MONGOLIAN MANCHU ALI GALI LHA; D; No_Joining_Group
|
||||
|
||||
# Other
|
||||
|
||||
200C; ZERO WIDTH NON-JOINER; U; No_Joining_Group
|
||||
200D; ZERO WIDTH JOINER; C; No_Joining_Group
|
||||
202F; NARROW NO-BREAK SPACE; U; No_Joining_Group
|
||||
2066; LEFT-TO-RIGHT ISOLATE; U; No_Joining_Group
|
||||
2067; RIGHT-TO-LEFT ISOLATE; U; No_Joining_Group
|
||||
2068; FIRST STRONG ISOLATE; U; No_Joining_Group
|
||||
2069; POP DIRECTIONAL ISOLATE; U; No_Joining_Group
|
||||
|
||||
# Phags-Pa Characters
|
||||
|
||||
A840; PHAGS-PA KA; D; No_Joining_Group
|
||||
A841; PHAGS-PA KHA; D; No_Joining_Group
|
||||
A842; PHAGS-PA GA; D; No_Joining_Group
|
||||
A843; PHAGS-PA NGA; D; No_Joining_Group
|
||||
A844; PHAGS-PA CA; D; No_Joining_Group
|
||||
A845; PHAGS-PA CHA; D; No_Joining_Group
|
||||
A846; PHAGS-PA JA; D; No_Joining_Group
|
||||
A847; PHAGS-PA NYA; D; No_Joining_Group
|
||||
A848; PHAGS-PA TA; D; No_Joining_Group
|
||||
A849; PHAGS-PA THA; D; No_Joining_Group
|
||||
A84A; PHAGS-PA DA; D; No_Joining_Group
|
||||
A84B; PHAGS-PA NA; D; No_Joining_Group
|
||||
A84C; PHAGS-PA PA; D; No_Joining_Group
|
||||
A84D; PHAGS-PA PHA; D; No_Joining_Group
|
||||
A84E; PHAGS-PA BA; D; No_Joining_Group
|
||||
A84F; PHAGS-PA MA; D; No_Joining_Group
|
||||
A850; PHAGS-PA TSA; D; No_Joining_Group
|
||||
A851; PHAGS-PA TSHA; D; No_Joining_Group
|
||||
A852; PHAGS-PA DZA; D; No_Joining_Group
|
||||
A853; PHAGS-PA WA; D; No_Joining_Group
|
||||
A854; PHAGS-PA ZHA; D; No_Joining_Group
|
||||
A855; PHAGS-PA ZA; D; No_Joining_Group
|
||||
A856; PHAGS-PA SMALL A; D; No_Joining_Group
|
||||
A857; PHAGS-PA YA; D; No_Joining_Group
|
||||
A858; PHAGS-PA RA; D; No_Joining_Group
|
||||
A859; PHAGS-PA LA; D; No_Joining_Group
|
||||
A85A; PHAGS-PA SHA; D; No_Joining_Group
|
||||
A85B; PHAGS-PA SA; D; No_Joining_Group
|
||||
A85C; PHAGS-PA HA; D; No_Joining_Group
|
||||
A85D; PHAGS-PA A; D; No_Joining_Group
|
||||
A85E; PHAGS-PA I; D; No_Joining_Group
|
||||
A85F; PHAGS-PA U; D; No_Joining_Group
|
||||
A860; PHAGS-PA E; D; No_Joining_Group
|
||||
A861; PHAGS-PA O; D; No_Joining_Group
|
||||
A862; PHAGS-PA QA; D; No_Joining_Group
|
||||
A863; PHAGS-PA XA; D; No_Joining_Group
|
||||
A864; PHAGS-PA FA; D; No_Joining_Group
|
||||
A865; PHAGS-PA GGA; D; No_Joining_Group
|
||||
A866; PHAGS-PA EE; D; No_Joining_Group
|
||||
A867; PHAGS-PA SUBJOINED WA; D; No_Joining_Group
|
||||
A868; PHAGS-PA SUBJOINED YA; D; No_Joining_Group
|
||||
A869; PHAGS-PA TTA; D; No_Joining_Group
|
||||
A86A; PHAGS-PA TTHA; D; No_Joining_Group
|
||||
A86B; PHAGS-PA DDA; D; No_Joining_Group
|
||||
A86C; PHAGS-PA NNA; D; No_Joining_Group
|
||||
A86D; PHAGS-PA ALTERNATE YA; D; No_Joining_Group
|
||||
A86E; PHAGS-PA VOICELESS SHA; D; No_Joining_Group
|
||||
A86F; PHAGS-PA VOICED HA; D; No_Joining_Group
|
||||
A870; PHAGS-PA ASPIRATED FA; D; No_Joining_Group
|
||||
A871; PHAGS-PA SUBJOINED RA; D; No_Joining_Group
|
||||
A872; PHAGS-PA SUPERFIXED RA; L; No_Joining_Group
|
||||
A873; PHAGS-PA CANDRABINDU; U; No_Joining_Group
|
||||
|
||||
# Manichaean Characters
|
||||
|
||||
10AC0; MANICHAEAN ALEPH; D; MANICHAEAN ALEPH
|
||||
10AC1; MANICHAEAN BETH; D; MANICHAEAN BETH
|
||||
10AC2; MANICHAEAN BETH WITH 2 DOTS ABOVE; D; MANICHAEAN BETH
|
||||
10AC3; MANICHAEAN GIMEL; D; MANICHAEAN GIMEL
|
||||
10AC4; MANICHAEAN GIMEL WITH ATTACHED RING BELOW; D; MANICHAEAN GIMEL
|
||||
10AC5; MANICHAEAN DALETH; R; MANICHAEAN DALETH
|
||||
10AC6; MANICHAEAN HE; U; No_Joining_Group
|
||||
10AC7; MANICHAEAN WAW; R; MANICHAEAN WAW
|
||||
10AC8; MANICHAEAN UD; U; No_Joining_Group
|
||||
10AC9; MANICHAEAN ZAYIN; R; MANICHAEAN ZAYIN
|
||||
10ACA; MANICHAEAN ZAYIN WITH 2 DOTS ABOVE; R; MANICHAEAN ZAYIN
|
||||
10ACB; MANICHAEAN JAYIN; U; No_Joining_Group
|
||||
10ACC; MANICHAEAN JAYIN WITH 2 DOTS ABOVE; U; No_Joining_Group
|
||||
10ACD; MANICHAEAN HETH; L; MANICHAEAN HETH
|
||||
10ACE; MANICHAEAN TETH; R; MANICHAEAN TETH
|
||||
10ACF; MANICHAEAN YODH; R; MANICHAEAN YODH
|
||||
10AD0; MANICHAEAN KAPH; R; MANICHAEAN KAPH
|
||||
10AD1; MANICHAEAN KAPH WITH DOT ABOVE; R; MANICHAEAN KAPH
|
||||
10AD2; MANICHAEAN KAPH WITH 2 DOTS ABOVE; R; MANICHAEAN KAPH
|
||||
10AD3; MANICHAEAN LAMEDH; D; MANICHAEAN LAMEDH
|
||||
10AD4; MANICHAEAN DHAMEDH; D; MANICHAEAN DHAMEDH
|
||||
10AD5; MANICHAEAN THAMEDH; D; MANICHAEAN THAMEDH
|
||||
10AD6; MANICHAEAN MEM; D; MANICHAEAN MEM
|
||||
10AD7; MANICHAEAN NUN; L; MANICHAEAN NUN
|
||||
10AD8; MANICHAEAN SAMEKH; D; MANICHAEAN SAMEKH
|
||||
10AD9; MANICHAEAN AYIN; D; MANICHAEAN AYIN
|
||||
10ADA; MANICHAEAN AYIN WITH 2 DOTS ABOVE; D; MANICHAEAN AYIN
|
||||
10ADB; MANICHAEAN PE; D; MANICHAEAN PE
|
||||
10ADC; MANICHAEAN PE WITH DOT ABOVE; D; MANICHAEAN PE
|
||||
10ADD; MANICHAEAN SADHE; R; MANICHAEAN SADHE
|
||||
10ADE; MANICHAEAN QOPH; D; MANICHAEAN QOPH
|
||||
10ADF; MANICHAEAN QOPH WITH DOT ABOVE; D; MANICHAEAN QOPH
|
||||
10AE0; MANICHAEAN QOPH WITH 2 DOTS ABOVE; D; MANICHAEAN QOPH
|
||||
10AE1; MANICHAEAN RESH; R; MANICHAEAN RESH
|
||||
10AE2; MANICHAEAN SHIN; U; No_Joining_Group
|
||||
10AE3; MANICHAEAN SHIN WITH 2 DOTS ABOVE; U; No_Joining_Group
|
||||
10AE4; MANICHAEAN TAW; R; MANICHAEAN TAW
|
||||
10AEB; MANICHAEAN ONE; D; MANICHAEAN ONE
|
||||
10AEC; MANICHAEAN FIVE; D; MANICHAEAN FIVE
|
||||
10AED; MANICHAEAN TEN; D; MANICHAEAN TEN
|
||||
10AEE; MANICHAEAN TWENTY; D; MANICHAEAN TWENTY
|
||||
10AEF; MANICHAEAN HUNDRED; R; MANICHAEAN HUNDRED
|
||||
|
||||
# Psalter Pahlavi Characters
|
||||
|
||||
10B80; PSALTER PAHLAVI ALEPH; D; No_Joining_Group
|
||||
10B81; PSALTER PAHLAVI BETH; R; No_Joining_Group
|
||||
10B82; PSALTER PAHLAVI GIMEL; D; No_Joining_Group
|
||||
10B83; PSALTER PAHLAVI DALETH; R; No_Joining_Group
|
||||
10B84; PSALTER PAHLAVI HE; R; No_Joining_Group
|
||||
10B85; PSALTER PAHLAVI WAW-AYIN-RESH; R; No_Joining_Group
|
||||
10B86; PSALTER PAHLAVI ZAYIN; D; No_Joining_Group
|
||||
10B87; PSALTER PAHLAVI HETH; D; No_Joining_Group
|
||||
10B88; PSALTER PAHLAVI YODH; D; No_Joining_Group
|
||||
10B89; PSALTER PAHLAVI KAPH; R; No_Joining_Group
|
||||
10B8A; PSALTER PAHLAVI LAMEDH; D; No_Joining_Group
|
||||
10B8B; PSALTER PAHLAVI MEM-QOPH; D; No_Joining_Group
|
||||
10B8C; PSALTER PAHLAVI NUN; R; No_Joining_Group
|
||||
10B8D; PSALTER PAHLAVI SAMEKH; D; No_Joining_Group
|
||||
10B8E; PSALTER PAHLAVI PE; R; No_Joining_Group
|
||||
10B8F; PSALTER PAHLAVI SADHE; R; No_Joining_Group
|
||||
10B90; PSALTER PAHLAVI SHIN; D; No_Joining_Group
|
||||
10B91; PSALTER PAHLAVI TAW; R; No_Joining_Group
|
||||
10BA9; PSALTER PAHLAVI ONE; R; No_Joining_Group
|
||||
10BAA; PSALTER PAHLAVI TWO; R; No_Joining_Group
|
||||
10BAB; PSALTER PAHLAVI THREE; R; No_Joining_Group
|
||||
10BAC; PSALTER PAHLAVI FOUR; R; No_Joining_Group
|
||||
10BAD; PSALTER PAHLAVI TEN; D; No_Joining_Group
|
||||
10BAE; PSALTER PAHLAVI TWENTY; D; No_Joining_Group
|
||||
10BAF; PSALTER PAHLAVI HUNDRED; U; No_Joining_Group
|
||||
|
||||
# Hanifi Rohingya Characters
|
||||
|
||||
10D00; HANIFI ROHINGYA A; L; No_Joining_Group
|
||||
10D01; HANIFI ROHINGYA BA; D; No_Joining_Group
|
||||
10D02; HANIFI ROHINGYA PA; D; HANIFI ROHINGYA PA
|
||||
10D03; HANIFI ROHINGYA TA; D; No_Joining_Group
|
||||
10D04; HANIFI ROHINGYA TTA; D; No_Joining_Group
|
||||
10D05; HANIFI ROHINGYA JA; D; No_Joining_Group
|
||||
10D06; HANIFI ROHINGYA CA; D; No_Joining_Group
|
||||
10D07; HANIFI ROHINGYA HA; D; No_Joining_Group
|
||||
10D08; HANIFI ROHINGYA KHA; D; No_Joining_Group
|
||||
10D09; HANIFI ROHINGYA PA WITH DOT ABOVE; D; HANIFI ROHINGYA PA
|
||||
10D0A; HANIFI ROHINGYA DA; D; No_Joining_Group
|
||||
10D0B; HANIFI ROHINGYA DDA; D; No_Joining_Group
|
||||
10D0C; HANIFI ROHINGYA RA; D; No_Joining_Group
|
||||
10D0D; HANIFI ROHINGYA RRA; D; No_Joining_Group
|
||||
10D0E; HANIFI ROHINGYA ZA; D; No_Joining_Group
|
||||
10D0F; HANIFI ROHINGYA SA; D; No_Joining_Group
|
||||
10D10; HANIFI ROHINGYA SHA; D; No_Joining_Group
|
||||
10D11; HANIFI ROHINGYA KA; D; No_Joining_Group
|
||||
10D12; HANIFI ROHINGYA GA; D; No_Joining_Group
|
||||
10D13; HANIFI ROHINGYA LA; D; No_Joining_Group
|
||||
10D14; HANIFI ROHINGYA MA; D; No_Joining_Group
|
||||
10D15; HANIFI ROHINGYA NA; D; No_Joining_Group
|
||||
10D16; HANIFI ROHINGYA WA; D; No_Joining_Group
|
||||
10D17; HANIFI ROHINGYA KINNA WA; D; No_Joining_Group
|
||||
10D18; HANIFI ROHINGYA YA; D; No_Joining_Group
|
||||
10D19; HANIFI ROHINGYA KINNA YA; D; HANIFI ROHINGYA KINNA YA
|
||||
10D1A; HANIFI ROHINGYA NGA; D; No_Joining_Group
|
||||
10D1B; HANIFI ROHINGYA NYA; D; No_Joining_Group
|
||||
10D1C; HANIFI ROHINGYA PA WITH 3 DOTS ABOVE; D; HANIFI ROHINGYA PA
|
||||
10D1D; HANIFI ROHINGYA VOWEL A; D; No_Joining_Group
|
||||
10D1E; HANIFI ROHINGYA DOTLESS KINNA YA WITH LEFT-FACING HOOK BELOW; D; HANIFI ROHINGYA KINNA YA
|
||||
10D1F; HANIFI ROHINGYA VOWEL U; D; No_Joining_Group
|
||||
10D20; HANIFI ROHINGYA DOTLESS KINNA YA WITH RIGHT-FACING HOOK BELOW; D; HANIFI ROHINGYA KINNA YA
|
||||
10D21; HANIFI ROHINGYA VOWEL O; D; No_Joining_Group
|
||||
10D22; HANIFI ROHINGYA SAKIN; R; No_Joining_Group
|
||||
10D23; HANIFI ROHINGYA DOTLESS KINNA YA WITH DOT ABOVE; D; HANIFI ROHINGYA KINNA YA
|
||||
|
||||
# Sogdian Characters
|
||||
|
||||
10F30; SOGDIAN ALEPH; D; No_Joining_Group
|
||||
10F31; SOGDIAN BETH; D; No_Joining_Group
|
||||
10F32; SOGDIAN GIMEL; D; No_Joining_Group
|
||||
10F33; SOGDIAN HE; R; No_Joining_Group
|
||||
10F34; SOGDIAN WAW; D; No_Joining_Group
|
||||
10F35; SOGDIAN ZAYIN; D; No_Joining_Group
|
||||
10F36; SOGDIAN HETH; D; No_Joining_Group
|
||||
10F37; SOGDIAN YODH; D; No_Joining_Group
|
||||
10F38; SOGDIAN KAPH; D; No_Joining_Group
|
||||
10F39; SOGDIAN LAMEDH; D; No_Joining_Group
|
||||
10F3A; SOGDIAN MEM; D; No_Joining_Group
|
||||
10F3B; SOGDIAN NUN; D; No_Joining_Group
|
||||
10F3C; SOGDIAN SAMEKH; D; No_Joining_Group
|
||||
10F3D; SOGDIAN AYIN; D; No_Joining_Group
|
||||
10F3E; SOGDIAN PE; D; No_Joining_Group
|
||||
10F3F; SOGDIAN SADHE; D; No_Joining_Group
|
||||
10F40; SOGDIAN RESH-AYIN; D; No_Joining_Group
|
||||
10F41; SOGDIAN SHIN; D; No_Joining_Group
|
||||
10F42; SOGDIAN TAW; D; No_Joining_Group
|
||||
10F43; SOGDIAN FETH; D; No_Joining_Group
|
||||
10F44; SOGDIAN LESH; D; No_Joining_Group
|
||||
10F45; SOGDIAN INDEPENDENT SHIN; U; No_Joining_Group
|
||||
10F51; SOGDIAN ONE; D; No_Joining_Group
|
||||
10F52; SOGDIAN TEN; D; No_Joining_Group
|
||||
10F53; SOGDIAN TWENTY; D; No_Joining_Group
|
||||
10F54; SOGDIAN ONE HUNDRED; R; No_Joining_Group
|
||||
|
||||
# Old Uyghur Characters
|
||||
|
||||
10F70; OLD UYGHUR ALEPH; D; No_Joining_Group
|
||||
10F71; OLD UYGHUR BETH; D; No_Joining_Group
|
||||
10F72; OLD UYGHUR GIMEL-HETH; D; No_Joining_Group
|
||||
10F73; OLD UYGHUR WAW; D; No_Joining_Group
|
||||
10F74; OLD UYGHUR ZAYIN; R; No_Joining_Group
|
||||
10F75; OLD UYGHUR FINAL HETH; R; No_Joining_Group
|
||||
10F76; OLD UYGHUR YODH; D; No_Joining_Group
|
||||
10F77; OLD UYGHUR KAPH; D; No_Joining_Group
|
||||
10F78; OLD UYGHUR LAMEDH; D; No_Joining_Group
|
||||
10F79; OLD UYGHUR MEM; D; No_Joining_Group
|
||||
10F7A; OLD UYGHUR NUN; D; No_Joining_Group
|
||||
10F7B; OLD UYGHUR SAMEKH; D; No_Joining_Group
|
||||
10F7C; OLD UYGHUR PE; D; No_Joining_Group
|
||||
10F7D; OLD UYGHUR SADHE; D; No_Joining_Group
|
||||
10F7E; OLD UYGHUR RESH; D; No_Joining_Group
|
||||
10F7F; OLD UYGHUR SHIN; D; No_Joining_Group
|
||||
10F80; OLD UYGHUR TAW; D; No_Joining_Group
|
||||
10F81; OLD UYGHUR LESH; D; No_Joining_Group
|
||||
|
||||
# Chorasmian Characters
|
||||
|
||||
10FB0; CHORASMIAN ALEPH; D; No_Joining_Group
|
||||
10FB1; CHORASMIAN SMALL ALEPH; U; No_Joining_Group
|
||||
10FB2; CHORASMIAN BETH; D; No_Joining_Group
|
||||
10FB3; CHORASMIAN GIMEL; D; No_Joining_Group
|
||||
10FB4; CHORASMIAN DALETH; R; No_Joining_Group
|
||||
10FB5; CHORASMIAN HE; R; No_Joining_Group
|
||||
10FB6; CHORASMIAN WAW; R; No_Joining_Group
|
||||
10FB7; CHORASMIAN CURLED WAW; U; No_Joining_Group
|
||||
10FB8; CHORASMIAN ZAYIN; D; No_Joining_Group
|
||||
10FB9; CHORASMIAN HETH; R; No_Joining_Group
|
||||
10FBA; CHORASMIAN YODH; R; No_Joining_Group
|
||||
10FBB; CHORASMIAN KAPH; D; No_Joining_Group
|
||||
10FBC; CHORASMIAN LAMEDH; D; No_Joining_Group
|
||||
10FBD; CHORASMIAN MEM; R; No_Joining_Group
|
||||
10FBE; CHORASMIAN NUN; D; No_Joining_Group
|
||||
10FBF; CHORASMIAN SAMEKH; D; No_Joining_Group
|
||||
10FC0; CHORASMIAN AYIN; U; No_Joining_Group
|
||||
10FC1; CHORASMIAN PE; D; No_Joining_Group
|
||||
10FC2; CHORASMIAN RESH; R; No_Joining_Group
|
||||
10FC3; CHORASMIAN SHIN; R; No_Joining_Group
|
||||
10FC4; CHORASMIAN TAW; D; No_Joining_Group
|
||||
10FC5; CHORASMIAN ONE; U; No_Joining_Group
|
||||
10FC6; CHORASMIAN TWO; U; No_Joining_Group
|
||||
10FC7; CHORASMIAN THREE; U; No_Joining_Group
|
||||
10FC8; CHORASMIAN FOUR; U; No_Joining_Group
|
||||
10FC9; CHORASMIAN TEN; R; No_Joining_Group
|
||||
10FCA; CHORASMIAN TWENTY; D; No_Joining_Group
|
||||
10FCB; CHORASMIAN ONE HUNDRED; L; No_Joining_Group
|
||||
|
||||
# Kaithi Number Signs
|
||||
# These are prepended concatenation marks, comparable
|
||||
# to the number signs in the Arabic script.
|
||||
# Listed here for consistency in property values.
|
||||
|
||||
110BD; KAITHI NUMBER SIGN; U; No_Joining_Group
|
||||
110CD; KAITHI NUMBER SIGN ABOVE; U; No_Joining_Group
|
||||
|
||||
# Adlam Characters
|
||||
|
||||
1E900;ADLAM CAPITAL ALIF; D; No_Joining_Group
|
||||
1E901;ADLAM CAPITAL DAALI; D; No_Joining_Group
|
||||
1E902;ADLAM CAPITAL LAAM; D; No_Joining_Group
|
||||
1E903;ADLAM CAPITAL MIIM; D; No_Joining_Group
|
||||
1E904;ADLAM CAPITAL BA; D; No_Joining_Group
|
||||
1E905;ADLAM CAPITAL SINNYIIYHE; D; No_Joining_Group
|
||||
1E906;ADLAM CAPITAL PE; D; No_Joining_Group
|
||||
1E907;ADLAM CAPITAL BHE; D; No_Joining_Group
|
||||
1E908;ADLAM CAPITAL RA; D; No_Joining_Group
|
||||
1E909;ADLAM CAPITAL E; D; No_Joining_Group
|
||||
1E90A;ADLAM CAPITAL FA; D; No_Joining_Group
|
||||
1E90B;ADLAM CAPITAL I; D; No_Joining_Group
|
||||
1E90C;ADLAM CAPITAL O; D; No_Joining_Group
|
||||
1E90D;ADLAM CAPITAL DHA; D; No_Joining_Group
|
||||
1E90E;ADLAM CAPITAL YHE; D; No_Joining_Group
|
||||
1E90F;ADLAM CAPITAL WAW; D; No_Joining_Group
|
||||
1E910;ADLAM CAPITAL NUN; D; No_Joining_Group
|
||||
1E911;ADLAM CAPITAL KAF; D; No_Joining_Group
|
||||
1E912;ADLAM CAPITAL YA; D; No_Joining_Group
|
||||
1E913;ADLAM CAPITAL U; D; No_Joining_Group
|
||||
1E914;ADLAM CAPITAL JIIM; D; No_Joining_Group
|
||||
1E915;ADLAM CAPITAL CHI; D; No_Joining_Group
|
||||
1E916;ADLAM CAPITAL HA; D; No_Joining_Group
|
||||
1E917;ADLAM CAPITAL QAAF; D; No_Joining_Group
|
||||
1E918;ADLAM CAPITAL GA; D; No_Joining_Group
|
||||
1E919;ADLAM CAPITAL NYA; D; No_Joining_Group
|
||||
1E91A;ADLAM CAPITAL TU; D; No_Joining_Group
|
||||
1E91B;ADLAM CAPITAL NHA; D; No_Joining_Group
|
||||
1E91C;ADLAM CAPITAL VA; D; No_Joining_Group
|
||||
1E91D;ADLAM CAPITAL KHA; D; No_Joining_Group
|
||||
1E91E;ADLAM CAPITAL GBE; D; No_Joining_Group
|
||||
1E91F;ADLAM CAPITAL ZAL; D; No_Joining_Group
|
||||
1E920;ADLAM CAPITAL KPO; D; No_Joining_Group
|
||||
1E921;ADLAM CAPITAL SHA; D; No_Joining_Group
|
||||
1E922;ADLAM SMALL ALIF; D; No_Joining_Group
|
||||
1E923;ADLAM SMALL DAALI; D; No_Joining_Group
|
||||
1E924;ADLAM SMALL LAAM; D; No_Joining_Group
|
||||
1E925;ADLAM SMALL MIIM; D; No_Joining_Group
|
||||
1E926;ADLAM SMALL BA; D; No_Joining_Group
|
||||
1E927;ADLAM SMALL SINNYIIYHE; D; No_Joining_Group
|
||||
1E928;ADLAM SMALL PE; D; No_Joining_Group
|
||||
1E929;ADLAM SMALL BHE; D; No_Joining_Group
|
||||
1E92A;ADLAM SMALL RA; D; No_Joining_Group
|
||||
1E92B;ADLAM SMALL E; D; No_Joining_Group
|
||||
1E92C;ADLAM SMALL FA; D; No_Joining_Group
|
||||
1E92D;ADLAM SMALL I; D; No_Joining_Group
|
||||
1E92E;ADLAM SMALL O; D; No_Joining_Group
|
||||
1E92F;ADLAM SMALL DHA; D; No_Joining_Group
|
||||
1E930;ADLAM SMALL YHE; D; No_Joining_Group
|
||||
1E931;ADLAM SMALL WAW; D; No_Joining_Group
|
||||
1E932;ADLAM SMALL NUN; D; No_Joining_Group
|
||||
1E933;ADLAM SMALL KAF; D; No_Joining_Group
|
||||
1E934;ADLAM SMALL YA; D; No_Joining_Group
|
||||
1E935;ADLAM SMALL U; D; No_Joining_Group
|
||||
1E936;ADLAM SMALL JIIM; D; No_Joining_Group
|
||||
1E937;ADLAM SMALL CHI; D; No_Joining_Group
|
||||
1E938;ADLAM SMALL HA; D; No_Joining_Group
|
||||
1E939;ADLAM SMALL QAAF; D; No_Joining_Group
|
||||
1E93A;ADLAM SMALL GA; D; No_Joining_Group
|
||||
1E93B;ADLAM SMALL NYA; D; No_Joining_Group
|
||||
1E93C;ADLAM SMALL TU; D; No_Joining_Group
|
||||
1E93D;ADLAM SMALL NHA; D; No_Joining_Group
|
||||
1E93E;ADLAM SMALL VA; D; No_Joining_Group
|
||||
1E93F;ADLAM SMALL KHA; D; No_Joining_Group
|
||||
1E940;ADLAM SMALL GBE; D; No_Joining_Group
|
||||
1E941;ADLAM SMALL ZAL; D; No_Joining_Group
|
||||
1E942;ADLAM SMALL KPO; D; No_Joining_Group
|
||||
1E943;ADLAM SMALL SHA; D; No_Joining_Group
|
||||
1E94B;ADLAM NASALIZATION MARK; T; No_Joining_Group
|
||||
|
||||
# EOF
|
633
util/unicode/data/BidiMirroring.txt
Normal file
633
util/unicode/data/BidiMirroring.txt
Normal file
@ -0,0 +1,633 @@
|
||||
# BidiMirroring-15.0.0.txt
|
||||
# Date: 2022-05-03, 18:47:00 GMT [KW, RP]
|
||||
# © 2022 Unicode®, Inc.
|
||||
# For terms of use, see https://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
# Unicode Character Database
|
||||
# For documentation, see https://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# Bidi_Mirroring_Glyph Property
|
||||
#
|
||||
# This file is an informative contributory data file in the
|
||||
# Unicode Character Database.
|
||||
#
|
||||
# This data file lists characters that have the Bidi_Mirrored=Yes property
|
||||
# value, for which there is another Unicode character that typically has a glyph
|
||||
# that is the mirror image of the original character's glyph.
|
||||
#
|
||||
# The repertoire covered by the file is Unicode 15.0.0.
|
||||
#
|
||||
# The file contains a list of lines with mappings from one code point
|
||||
# to another one for character-based mirroring.
|
||||
# Note that for "real" mirroring, a rendering engine needs to select
|
||||
# appropriate alternative glyphs, and that many Unicode characters do not
|
||||
# have a mirror-image Unicode character.
|
||||
#
|
||||
# Each mapping line contains two fields, separated by a semicolon (';').
|
||||
# Each of the two fields contains a code point represented as a
|
||||
# variable-length hexadecimal value with 4 to 6 digits.
|
||||
# A comment indicates where the characters are "BEST FIT" mirroring.
|
||||
#
|
||||
# Code points for which Bidi_Mirrored=Yes, but for which no appropriate
|
||||
# characters exist with mirrored glyphs, are
|
||||
# listed as comments at the end of the file.
|
||||
#
|
||||
# Formally, the default value of the Bidi_Mirroring_Glyph property
|
||||
# for each code point is <none>, unless a mapping to
|
||||
# some other character is specified in this data file. When a code
|
||||
# point has the default value for the Bidi_Mirroring_Glyph property,
|
||||
# that means that no other character exists whose glyph is suitable
|
||||
# for character-based mirroring.
|
||||
#
|
||||
# For information on bidi mirroring, see UAX #9: Unicode Bidirectional Algorithm,
|
||||
# at https://www.unicode.org/reports/tr9/
|
||||
#
|
||||
# This file was originally created by Markus Scherer.
|
||||
# Extended for Unicode 3.2, 4.0, 4.1, 5.0, 5.1, 5.2, and 6.0 by Ken Whistler,
|
||||
# and for subsequent versions by Ken Whistler, Laurentiu Iancu, and Roozbeh Pournader.
|
||||
#
|
||||
# Historical and Compatibility Information:
|
||||
#
|
||||
# The OpenType Mirroring Pairs List (OMPL) is frozen to match the
|
||||
# Unicode 5.1 version of the Bidi_Mirroring_Glyph property (2008).
|
||||
# See https://www.microsoft.com/typography/otspec/ompl.txt
|
||||
#
|
||||
# The Unicode 6.1 version of the Bidi_Mirroring_Glyph property (2011)
|
||||
# added one mirroring pair: 27CB <--> 27CD.
|
||||
#
|
||||
# The Unicode 11.0 version of the Bidi_Mirroring_Glyph property (2018)
|
||||
# underwent a substantial revision, to formally recognize all of the
|
||||
# exact mirroring pairs and "BEST FIT" mirroring pairs that had been
|
||||
# added after the freezing of the OMPL list. As a result, starting
|
||||
# with Unicode 11.0, the bmg mapping values more accurately reflect
|
||||
# the current status of glyphs for Bidi_Mirrored characters in
|
||||
# the Unicode Standard, but this listing now extends significantly
|
||||
# beyond the frozen OMPL list. Implementers should be aware of this
|
||||
# intentional distinction.
|
||||
#
|
||||
# ############################################################
|
||||
#
|
||||
# Property: Bidi_Mirroring_Glyph
|
||||
#
|
||||
# @missing: 0000..10FFFF; <none>
|
||||
|
||||
0028; 0029 # LEFT PARENTHESIS
|
||||
0029; 0028 # RIGHT PARENTHESIS
|
||||
003C; 003E # LESS-THAN SIGN
|
||||
003E; 003C # GREATER-THAN SIGN
|
||||
005B; 005D # LEFT SQUARE BRACKET
|
||||
005D; 005B # RIGHT SQUARE BRACKET
|
||||
007B; 007D # LEFT CURLY BRACKET
|
||||
007D; 007B # RIGHT CURLY BRACKET
|
||||
00AB; 00BB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
|
||||
00BB; 00AB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
|
||||
0F3A; 0F3B # TIBETAN MARK GUG RTAGS GYON
|
||||
0F3B; 0F3A # TIBETAN MARK GUG RTAGS GYAS
|
||||
0F3C; 0F3D # TIBETAN MARK ANG KHANG GYON
|
||||
0F3D; 0F3C # TIBETAN MARK ANG KHANG GYAS
|
||||
169B; 169C # OGHAM FEATHER MARK
|
||||
169C; 169B # OGHAM REVERSED FEATHER MARK
|
||||
2039; 203A # SINGLE LEFT-POINTING ANGLE QUOTATION MARK
|
||||
203A; 2039 # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
|
||||
2045; 2046 # LEFT SQUARE BRACKET WITH QUILL
|
||||
2046; 2045 # RIGHT SQUARE BRACKET WITH QUILL
|
||||
207D; 207E # SUPERSCRIPT LEFT PARENTHESIS
|
||||
207E; 207D # SUPERSCRIPT RIGHT PARENTHESIS
|
||||
208D; 208E # SUBSCRIPT LEFT PARENTHESIS
|
||||
208E; 208D # SUBSCRIPT RIGHT PARENTHESIS
|
||||
2208; 220B # ELEMENT OF
|
||||
2209; 220C # [BEST FIT] NOT AN ELEMENT OF
|
||||
220A; 220D # SMALL ELEMENT OF
|
||||
220B; 2208 # CONTAINS AS MEMBER
|
||||
220C; 2209 # [BEST FIT] DOES NOT CONTAIN AS MEMBER
|
||||
220D; 220A # SMALL CONTAINS AS MEMBER
|
||||
2215; 29F5 # DIVISION SLASH
|
||||
221F; 2BFE # RIGHT ANGLE
|
||||
2220; 29A3 # ANGLE
|
||||
2221; 299B # MEASURED ANGLE
|
||||
2222; 29A0 # SPHERICAL ANGLE
|
||||
2224; 2AEE # DOES NOT DIVIDE
|
||||
223C; 223D # TILDE OPERATOR
|
||||
223D; 223C # REVERSED TILDE
|
||||
2243; 22CD # ASYMPTOTICALLY EQUAL TO
|
||||
2245; 224C # APPROXIMATELY EQUAL TO
|
||||
224C; 2245 # ALL EQUAL TO
|
||||
2252; 2253 # APPROXIMATELY EQUAL TO OR THE IMAGE OF
|
||||
2253; 2252 # IMAGE OF OR APPROXIMATELY EQUAL TO
|
||||
2254; 2255 # COLON EQUALS
|
||||
2255; 2254 # EQUALS COLON
|
||||
2264; 2265 # LESS-THAN OR EQUAL TO
|
||||
2265; 2264 # GREATER-THAN OR EQUAL TO
|
||||
2266; 2267 # LESS-THAN OVER EQUAL TO
|
||||
2267; 2266 # GREATER-THAN OVER EQUAL TO
|
||||
2268; 2269 # [BEST FIT] LESS-THAN BUT NOT EQUAL TO
|
||||
2269; 2268 # [BEST FIT] GREATER-THAN BUT NOT EQUAL TO
|
||||
226A; 226B # MUCH LESS-THAN
|
||||
226B; 226A # MUCH GREATER-THAN
|
||||
226E; 226F # [BEST FIT] NOT LESS-THAN
|
||||
226F; 226E # [BEST FIT] NOT GREATER-THAN
|
||||
2270; 2271 # [BEST FIT] NEITHER LESS-THAN NOR EQUAL TO
|
||||
2271; 2270 # [BEST FIT] NEITHER GREATER-THAN NOR EQUAL TO
|
||||
2272; 2273 # [BEST FIT] LESS-THAN OR EQUIVALENT TO
|
||||
2273; 2272 # [BEST FIT] GREATER-THAN OR EQUIVALENT TO
|
||||
2274; 2275 # [BEST FIT] NEITHER LESS-THAN NOR EQUIVALENT TO
|
||||
2275; 2274 # [BEST FIT] NEITHER GREATER-THAN NOR EQUIVALENT TO
|
||||
2276; 2277 # LESS-THAN OR GREATER-THAN
|
||||
2277; 2276 # GREATER-THAN OR LESS-THAN
|
||||
2278; 2279 # [BEST FIT] NEITHER LESS-THAN NOR GREATER-THAN
|
||||
2279; 2278 # [BEST FIT] NEITHER GREATER-THAN NOR LESS-THAN
|
||||
227A; 227B # PRECEDES
|
||||
227B; 227A # SUCCEEDS
|
||||
227C; 227D # PRECEDES OR EQUAL TO
|
||||
227D; 227C # SUCCEEDS OR EQUAL TO
|
||||
227E; 227F # [BEST FIT] PRECEDES OR EQUIVALENT TO
|
||||
227F; 227E # [BEST FIT] SUCCEEDS OR EQUIVALENT TO
|
||||
2280; 2281 # [BEST FIT] DOES NOT PRECEDE
|
||||
2281; 2280 # [BEST FIT] DOES NOT SUCCEED
|
||||
2282; 2283 # SUBSET OF
|
||||
2283; 2282 # SUPERSET OF
|
||||
2284; 2285 # [BEST FIT] NOT A SUBSET OF
|
||||
2285; 2284 # [BEST FIT] NOT A SUPERSET OF
|
||||
2286; 2287 # SUBSET OF OR EQUAL TO
|
||||
2287; 2286 # SUPERSET OF OR EQUAL TO
|
||||
2288; 2289 # [BEST FIT] NEITHER A SUBSET OF NOR EQUAL TO
|
||||
2289; 2288 # [BEST FIT] NEITHER A SUPERSET OF NOR EQUAL TO
|
||||
228A; 228B # [BEST FIT] SUBSET OF WITH NOT EQUAL TO
|
||||
228B; 228A # [BEST FIT] SUPERSET OF WITH NOT EQUAL TO
|
||||
228F; 2290 # SQUARE IMAGE OF
|
||||
2290; 228F # SQUARE ORIGINAL OF
|
||||
2291; 2292 # SQUARE IMAGE OF OR EQUAL TO
|
||||
2292; 2291 # SQUARE ORIGINAL OF OR EQUAL TO
|
||||
2298; 29B8 # CIRCLED DIVISION SLASH
|
||||
22A2; 22A3 # RIGHT TACK
|
||||
22A3; 22A2 # LEFT TACK
|
||||
22A6; 2ADE # ASSERTION
|
||||
22A8; 2AE4 # TRUE
|
||||
22A9; 2AE3 # FORCES
|
||||
22AB; 2AE5 # DOUBLE VERTICAL BAR DOUBLE RIGHT TURNSTILE
|
||||
22B0; 22B1 # PRECEDES UNDER RELATION
|
||||
22B1; 22B0 # SUCCEEDS UNDER RELATION
|
||||
22B2; 22B3 # NORMAL SUBGROUP OF
|
||||
22B3; 22B2 # CONTAINS AS NORMAL SUBGROUP
|
||||
22B4; 22B5 # NORMAL SUBGROUP OF OR EQUAL TO
|
||||
22B5; 22B4 # CONTAINS AS NORMAL SUBGROUP OR EQUAL TO
|
||||
22B6; 22B7 # ORIGINAL OF
|
||||
22B7; 22B6 # IMAGE OF
|
||||
22B8; 27DC # MULTIMAP
|
||||
22C9; 22CA # LEFT NORMAL FACTOR SEMIDIRECT PRODUCT
|
||||
22CA; 22C9 # RIGHT NORMAL FACTOR SEMIDIRECT PRODUCT
|
||||
22CB; 22CC # LEFT SEMIDIRECT PRODUCT
|
||||
22CC; 22CB # RIGHT SEMIDIRECT PRODUCT
|
||||
22CD; 2243 # REVERSED TILDE EQUALS
|
||||
22D0; 22D1 # DOUBLE SUBSET
|
||||
22D1; 22D0 # DOUBLE SUPERSET
|
||||
22D6; 22D7 # LESS-THAN WITH DOT
|
||||
22D7; 22D6 # GREATER-THAN WITH DOT
|
||||
22D8; 22D9 # VERY MUCH LESS-THAN
|
||||
22D9; 22D8 # VERY MUCH GREATER-THAN
|
||||
22DA; 22DB # LESS-THAN EQUAL TO OR GREATER-THAN
|
||||
22DB; 22DA # GREATER-THAN EQUAL TO OR LESS-THAN
|
||||
22DC; 22DD # EQUAL TO OR LESS-THAN
|
||||
22DD; 22DC # EQUAL TO OR GREATER-THAN
|
||||
22DE; 22DF # EQUAL TO OR PRECEDES
|
||||
22DF; 22DE # EQUAL TO OR SUCCEEDS
|
||||
22E0; 22E1 # [BEST FIT] DOES NOT PRECEDE OR EQUAL
|
||||
22E1; 22E0 # [BEST FIT] DOES NOT SUCCEED OR EQUAL
|
||||
22E2; 22E3 # [BEST FIT] NOT SQUARE IMAGE OF OR EQUAL TO
|
||||
22E3; 22E2 # [BEST FIT] NOT SQUARE ORIGINAL OF OR EQUAL TO
|
||||
22E4; 22E5 # [BEST FIT] SQUARE IMAGE OF OR NOT EQUAL TO
|
||||
22E5; 22E4 # [BEST FIT] SQUARE ORIGINAL OF OR NOT EQUAL TO
|
||||
22E6; 22E7 # [BEST FIT] LESS-THAN BUT NOT EQUIVALENT TO
|
||||
22E7; 22E6 # [BEST FIT] GREATER-THAN BUT NOT EQUIVALENT TO
|
||||
22E8; 22E9 # [BEST FIT] PRECEDES BUT NOT EQUIVALENT TO
|
||||
22E9; 22E8 # [BEST FIT] SUCCEEDS BUT NOT EQUIVALENT TO
|
||||
22EA; 22EB # [BEST FIT] NOT NORMAL SUBGROUP OF
|
||||
22EB; 22EA # [BEST FIT] DOES NOT CONTAIN AS NORMAL SUBGROUP
|
||||
22EC; 22ED # [BEST FIT] NOT NORMAL SUBGROUP OF OR EQUAL TO
|
||||
22ED; 22EC # [BEST FIT] DOES NOT CONTAIN AS NORMAL SUBGROUP OR EQUAL
|
||||
22F0; 22F1 # UP RIGHT DIAGONAL ELLIPSIS
|
||||
22F1; 22F0 # DOWN RIGHT DIAGONAL ELLIPSIS
|
||||
22F2; 22FA # ELEMENT OF WITH LONG HORIZONTAL STROKE
|
||||
22F3; 22FB # ELEMENT OF WITH VERTICAL BAR AT END OF HORIZONTAL STROKE
|
||||
22F4; 22FC # SMALL ELEMENT OF WITH VERTICAL BAR AT END OF HORIZONTAL STROKE
|
||||
22F6; 22FD # ELEMENT OF WITH OVERBAR
|
||||
22F7; 22FE # SMALL ELEMENT OF WITH OVERBAR
|
||||
22FA; 22F2 # CONTAINS WITH LONG HORIZONTAL STROKE
|
||||
22FB; 22F3 # CONTAINS WITH VERTICAL BAR AT END OF HORIZONTAL STROKE
|
||||
22FC; 22F4 # SMALL CONTAINS WITH VERTICAL BAR AT END OF HORIZONTAL STROKE
|
||||
22FD; 22F6 # CONTAINS WITH OVERBAR
|
||||
22FE; 22F7 # SMALL CONTAINS WITH OVERBAR
|
||||
2308; 2309 # LEFT CEILING
|
||||
2309; 2308 # RIGHT CEILING
|
||||
230A; 230B # LEFT FLOOR
|
||||
230B; 230A # RIGHT FLOOR
|
||||
2329; 232A # LEFT-POINTING ANGLE BRACKET
|
||||
232A; 2329 # RIGHT-POINTING ANGLE BRACKET
|
||||
2768; 2769 # MEDIUM LEFT PARENTHESIS ORNAMENT
|
||||
2769; 2768 # MEDIUM RIGHT PARENTHESIS ORNAMENT
|
||||
276A; 276B # MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT
|
||||
276B; 276A # MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT
|
||||
276C; 276D # MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT
|
||||
276D; 276C # MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT
|
||||
276E; 276F # HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT
|
||||
276F; 276E # HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT
|
||||
2770; 2771 # HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT
|
||||
2771; 2770 # HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT
|
||||
2772; 2773 # LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT
|
||||
2773; 2772 # LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT
|
||||
2774; 2775 # MEDIUM LEFT CURLY BRACKET ORNAMENT
|
||||
2775; 2774 # MEDIUM RIGHT CURLY BRACKET ORNAMENT
|
||||
27C3; 27C4 # OPEN SUBSET
|
||||
27C4; 27C3 # OPEN SUPERSET
|
||||
27C5; 27C6 # LEFT S-SHAPED BAG DELIMITER
|
||||
27C6; 27C5 # RIGHT S-SHAPED BAG DELIMITER
|
||||
27C8; 27C9 # REVERSE SOLIDUS PRECEDING SUBSET
|
||||
27C9; 27C8 # SUPERSET PRECEDING SOLIDUS
|
||||
27CB; 27CD # MATHEMATICAL RISING DIAGONAL
|
||||
27CD; 27CB # MATHEMATICAL FALLING DIAGONAL
|
||||
27D5; 27D6 # LEFT OUTER JOIN
|
||||
27D6; 27D5 # RIGHT OUTER JOIN
|
||||
27DC; 22B8 # LEFT MULTIMAP
|
||||
27DD; 27DE # LONG RIGHT TACK
|
||||
27DE; 27DD # LONG LEFT TACK
|
||||
27E2; 27E3 # WHITE CONCAVE-SIDED DIAMOND WITH LEFTWARDS TICK
|
||||
27E3; 27E2 # WHITE CONCAVE-SIDED DIAMOND WITH RIGHTWARDS TICK
|
||||
27E4; 27E5 # WHITE SQUARE WITH LEFTWARDS TICK
|
||||
27E5; 27E4 # WHITE SQUARE WITH RIGHTWARDS TICK
|
||||
27E6; 27E7 # MATHEMATICAL LEFT WHITE SQUARE BRACKET
|
||||
27E7; 27E6 # MATHEMATICAL RIGHT WHITE SQUARE BRACKET
|
||||
27E8; 27E9 # MATHEMATICAL LEFT ANGLE BRACKET
|
||||
27E9; 27E8 # MATHEMATICAL RIGHT ANGLE BRACKET
|
||||
27EA; 27EB # MATHEMATICAL LEFT DOUBLE ANGLE BRACKET
|
||||
27EB; 27EA # MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET
|
||||
27EC; 27ED # MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET
|
||||
27ED; 27EC # MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET
|
||||
27EE; 27EF # MATHEMATICAL LEFT FLATTENED PARENTHESIS
|
||||
27EF; 27EE # MATHEMATICAL RIGHT FLATTENED PARENTHESIS
|
||||
2983; 2984 # LEFT WHITE CURLY BRACKET
|
||||
2984; 2983 # RIGHT WHITE CURLY BRACKET
|
||||
2985; 2986 # LEFT WHITE PARENTHESIS
|
||||
2986; 2985 # RIGHT WHITE PARENTHESIS
|
||||
2987; 2988 # Z NOTATION LEFT IMAGE BRACKET
|
||||
2988; 2987 # Z NOTATION RIGHT IMAGE BRACKET
|
||||
2989; 298A # Z NOTATION LEFT BINDING BRACKET
|
||||
298A; 2989 # Z NOTATION RIGHT BINDING BRACKET
|
||||
298B; 298C # LEFT SQUARE BRACKET WITH UNDERBAR
|
||||
298C; 298B # RIGHT SQUARE BRACKET WITH UNDERBAR
|
||||
298D; 2990 # LEFT SQUARE BRACKET WITH TICK IN TOP CORNER
|
||||
298E; 298F # RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
|
||||
298F; 298E # LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
|
||||
2990; 298D # RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER
|
||||
2991; 2992 # LEFT ANGLE BRACKET WITH DOT
|
||||
2992; 2991 # RIGHT ANGLE BRACKET WITH DOT
|
||||
2993; 2994 # LEFT ARC LESS-THAN BRACKET
|
||||
2994; 2993 # RIGHT ARC GREATER-THAN BRACKET
|
||||
2995; 2996 # DOUBLE LEFT ARC GREATER-THAN BRACKET
|
||||
2996; 2995 # DOUBLE RIGHT ARC LESS-THAN BRACKET
|
||||
2997; 2998 # LEFT BLACK TORTOISE SHELL BRACKET
|
||||
2998; 2997 # RIGHT BLACK TORTOISE SHELL BRACKET
|
||||
299B; 2221 # MEASURED ANGLE OPENING LEFT
|
||||
29A0; 2222 # SPHERICAL ANGLE OPENING LEFT
|
||||
29A3; 2220 # REVERSED ANGLE
|
||||
29A4; 29A5 # ANGLE WITH UNDERBAR
|
||||
29A5; 29A4 # REVERSED ANGLE WITH UNDERBAR
|
||||
29A8; 29A9 # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING UP AND RIGHT
|
||||
29A9; 29A8 # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING UP AND LEFT
|
||||
29AA; 29AB # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING DOWN AND RIGHT
|
||||
29AB; 29AA # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING DOWN AND LEFT
|
||||
29AC; 29AD # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING RIGHT AND UP
|
||||
29AD; 29AC # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING LEFT AND UP
|
||||
29AE; 29AF # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING RIGHT AND DOWN
|
||||
29AF; 29AE # MEASURED ANGLE WITH OPEN ARM ENDING IN ARROW POINTING LEFT AND DOWN
|
||||
29B8; 2298 # CIRCLED REVERSE SOLIDUS
|
||||
29C0; 29C1 # CIRCLED LESS-THAN
|
||||
29C1; 29C0 # CIRCLED GREATER-THAN
|
||||
29C4; 29C5 # SQUARED RISING DIAGONAL SLASH
|
||||
29C5; 29C4 # SQUARED FALLING DIAGONAL SLASH
|
||||
29CF; 29D0 # LEFT TRIANGLE BESIDE VERTICAL BAR
|
||||
29D0; 29CF # VERTICAL BAR BESIDE RIGHT TRIANGLE
|
||||
29D1; 29D2 # BOWTIE WITH LEFT HALF BLACK
|
||||
29D2; 29D1 # BOWTIE WITH RIGHT HALF BLACK
|
||||
29D4; 29D5 # TIMES WITH LEFT HALF BLACK
|
||||
29D5; 29D4 # TIMES WITH RIGHT HALF BLACK
|
||||
29D8; 29D9 # LEFT WIGGLY FENCE
|
||||
29D9; 29D8 # RIGHT WIGGLY FENCE
|
||||
29DA; 29DB # LEFT DOUBLE WIGGLY FENCE
|
||||
29DB; 29DA # RIGHT DOUBLE WIGGLY FENCE
|
||||
29E8; 29E9 # DOWN-POINTING TRIANGLE WITH LEFT HALF BLACK
|
||||
29E9; 29E8 # DOWN-POINTING TRIANGLE WITH RIGHT HALF BLACK
|
||||
29F5; 2215 # REVERSE SOLIDUS OPERATOR
|
||||
29F8; 29F9 # BIG SOLIDUS
|
||||
29F9; 29F8 # BIG REVERSE SOLIDUS
|
||||
29FC; 29FD # LEFT-POINTING CURVED ANGLE BRACKET
|
||||
29FD; 29FC # RIGHT-POINTING CURVED ANGLE BRACKET
|
||||
2A2B; 2A2C # MINUS SIGN WITH FALLING DOTS
|
||||
2A2C; 2A2B # MINUS SIGN WITH RISING DOTS
|
||||
2A2D; 2A2E # PLUS SIGN IN LEFT HALF CIRCLE
|
||||
2A2E; 2A2D # PLUS SIGN IN RIGHT HALF CIRCLE
|
||||
2A34; 2A35 # MULTIPLICATION SIGN IN LEFT HALF CIRCLE
|
||||
2A35; 2A34 # MULTIPLICATION SIGN IN RIGHT HALF CIRCLE
|
||||
2A3C; 2A3D # INTERIOR PRODUCT
|
||||
2A3D; 2A3C # RIGHTHAND INTERIOR PRODUCT
|
||||
2A64; 2A65 # Z NOTATION DOMAIN ANTIRESTRICTION
|
||||
2A65; 2A64 # Z NOTATION RANGE ANTIRESTRICTION
|
||||
2A79; 2A7A # LESS-THAN WITH CIRCLE INSIDE
|
||||
2A7A; 2A79 # GREATER-THAN WITH CIRCLE INSIDE
|
||||
2A7B; 2A7C # [BEST FIT] LESS-THAN WITH QUESTION MARK ABOVE
|
||||
2A7C; 2A7B # [BEST FIT] GREATER-THAN WITH QUESTION MARK ABOVE
|
||||
2A7D; 2A7E # LESS-THAN OR SLANTED EQUAL TO
|
||||
2A7E; 2A7D # GREATER-THAN OR SLANTED EQUAL TO
|
||||
2A7F; 2A80 # LESS-THAN OR SLANTED EQUAL TO WITH DOT INSIDE
|
||||
2A80; 2A7F # GREATER-THAN OR SLANTED EQUAL TO WITH DOT INSIDE
|
||||
2A81; 2A82 # LESS-THAN OR SLANTED EQUAL TO WITH DOT ABOVE
|
||||
2A82; 2A81 # GREATER-THAN OR SLANTED EQUAL TO WITH DOT ABOVE
|
||||
2A83; 2A84 # LESS-THAN OR SLANTED EQUAL TO WITH DOT ABOVE RIGHT
|
||||
2A84; 2A83 # GREATER-THAN OR SLANTED EQUAL TO WITH DOT ABOVE LEFT
|
||||
2A85; 2A86 # [BEST FIT] LESS-THAN OR APPROXIMATE
|
||||
2A86; 2A85 # [BEST FIT] GREATER-THAN OR APPROXIMATE
|
||||
2A87; 2A88 # [BEST FIT] LESS-THAN AND SINGLE-LINE NOT EQUAL TO
|
||||
2A88; 2A87 # [BEST FIT] GREATER-THAN AND SINGLE-LINE NOT EQUAL TO
|
||||
2A89; 2A8A # [BEST FIT] LESS-THAN AND NOT APPROXIMATE
|
||||
2A8A; 2A89 # [BEST FIT] GREATER-THAN AND NOT APPROXIMATE
|
||||
2A8B; 2A8C # LESS-THAN ABOVE DOUBLE-LINE EQUAL ABOVE GREATER-THAN
|
||||
2A8C; 2A8B # GREATER-THAN ABOVE DOUBLE-LINE EQUAL ABOVE LESS-THAN
|
||||
2A8D; 2A8E # [BEST FIT] LESS-THAN ABOVE SIMILAR OR EQUAL
|
||||
2A8E; 2A8D # [BEST FIT] GREATER-THAN ABOVE SIMILAR OR EQUAL
|
||||
2A8F; 2A90 # [BEST FIT] LESS-THAN ABOVE SIMILAR ABOVE GREATER-THAN
|
||||
2A90; 2A8F # [BEST FIT] GREATER-THAN ABOVE SIMILAR ABOVE LESS-THAN
|
||||
2A91; 2A92 # LESS-THAN ABOVE GREATER-THAN ABOVE DOUBLE-LINE EQUAL
|
||||
2A92; 2A91 # GREATER-THAN ABOVE LESS-THAN ABOVE DOUBLE-LINE EQUAL
|
||||
2A93; 2A94 # LESS-THAN ABOVE SLANTED EQUAL ABOVE GREATER-THAN ABOVE SLANTED EQUAL
|
||||
2A94; 2A93 # GREATER-THAN ABOVE SLANTED EQUAL ABOVE LESS-THAN ABOVE SLANTED EQUAL
|
||||
2A95; 2A96 # SLANTED EQUAL TO OR LESS-THAN
|
||||
2A96; 2A95 # SLANTED EQUAL TO OR GREATER-THAN
|
||||
2A97; 2A98 # SLANTED EQUAL TO OR LESS-THAN WITH DOT INSIDE
|
||||
2A98; 2A97 # SLANTED EQUAL TO OR GREATER-THAN WITH DOT INSIDE
|
||||
2A99; 2A9A # DOUBLE-LINE EQUAL TO OR LESS-THAN
|
||||
2A9A; 2A99 # DOUBLE-LINE EQUAL TO OR GREATER-THAN
|
||||
2A9B; 2A9C # DOUBLE-LINE SLANTED EQUAL TO OR LESS-THAN
|
||||
2A9C; 2A9B # DOUBLE-LINE SLANTED EQUAL TO OR GREATER-THAN
|
||||
2A9D; 2A9E # [BEST FIT] SIMILAR OR LESS-THAN
|
||||
2A9E; 2A9D # [BEST FIT] SIMILAR OR GREATER-THAN
|
||||
2A9F; 2AA0 # [BEST FIT] SIMILAR ABOVE LESS-THAN ABOVE EQUALS SIGN
|
||||
2AA0; 2A9F # [BEST FIT] SIMILAR ABOVE GREATER-THAN ABOVE EQUALS SIGN
|
||||
2AA1; 2AA2 # DOUBLE NESTED LESS-THAN
|
||||
2AA2; 2AA1 # DOUBLE NESTED GREATER-THAN
|
||||
2AA6; 2AA7 # LESS-THAN CLOSED BY CURVE
|
||||
2AA7; 2AA6 # GREATER-THAN CLOSED BY CURVE
|
||||
2AA8; 2AA9 # LESS-THAN CLOSED BY CURVE ABOVE SLANTED EQUAL
|
||||
2AA9; 2AA8 # GREATER-THAN CLOSED BY CURVE ABOVE SLANTED EQUAL
|
||||
2AAA; 2AAB # SMALLER THAN
|
||||
2AAB; 2AAA # LARGER THAN
|
||||
2AAC; 2AAD # SMALLER THAN OR EQUAL TO
|
||||
2AAD; 2AAC # LARGER THAN OR EQUAL TO
|
||||
2AAF; 2AB0 # PRECEDES ABOVE SINGLE-LINE EQUALS SIGN
|
||||
2AB0; 2AAF # SUCCEEDS ABOVE SINGLE-LINE EQUALS SIGN
|
||||
2AB1; 2AB2 # [BEST FIT] PRECEDES ABOVE SINGLE-LINE NOT EQUAL TO
|
||||
2AB2; 2AB1 # [BEST FIT] SUCCEEDS ABOVE SINGLE-LINE NOT EQUAL TO
|
||||
2AB3; 2AB4 # PRECEDES ABOVE EQUALS SIGN
|
||||
2AB4; 2AB3 # SUCCEEDS ABOVE EQUALS SIGN
|
||||
2AB5; 2AB6 # [BEST FIT] PRECEDES ABOVE NOT EQUAL TO
|
||||
2AB6; 2AB5 # [BEST FIT] SUCCEEDS ABOVE NOT EQUAL TO
|
||||
2AB7; 2AB8 # [BEST FIT] PRECEDES ABOVE ALMOST EQUAL TO
|
||||
2AB8; 2AB7 # [BEST FIT] SUCCEEDS ABOVE ALMOST EQUAL TO
|
||||
2AB9; 2ABA # [BEST FIT] PRECEDES ABOVE NOT ALMOST EQUAL TO
|
||||
2ABA; 2AB9 # [BEST FIT] SUCCEEDS ABOVE NOT ALMOST EQUAL TO
|
||||
2ABB; 2ABC # DOUBLE PRECEDES
|
||||
2ABC; 2ABB # DOUBLE SUCCEEDS
|
||||
2ABD; 2ABE # SUBSET WITH DOT
|
||||
2ABE; 2ABD # SUPERSET WITH DOT
|
||||
2ABF; 2AC0 # SUBSET WITH PLUS SIGN BELOW
|
||||
2AC0; 2ABF # SUPERSET WITH PLUS SIGN BELOW
|
||||
2AC1; 2AC2 # SUBSET WITH MULTIPLICATION SIGN BELOW
|
||||
2AC2; 2AC1 # SUPERSET WITH MULTIPLICATION SIGN BELOW
|
||||
2AC3; 2AC4 # SUBSET OF OR EQUAL TO WITH DOT ABOVE
|
||||
2AC4; 2AC3 # SUPERSET OF OR EQUAL TO WITH DOT ABOVE
|
||||
2AC5; 2AC6 # SUBSET OF ABOVE EQUALS SIGN
|
||||
2AC6; 2AC5 # SUPERSET OF ABOVE EQUALS SIGN
|
||||
2AC7; 2AC8 # [BEST FIT] SUBSET OF ABOVE TILDE OPERATOR
|
||||
2AC8; 2AC7 # [BEST FIT] SUPERSET OF ABOVE TILDE OPERATOR
|
||||
2AC9; 2ACA # [BEST FIT] SUBSET OF ABOVE ALMOST EQUAL TO
|
||||
2ACA; 2AC9 # [BEST FIT] SUPERSET OF ABOVE ALMOST EQUAL TO
|
||||
2ACB; 2ACC # [BEST FIT] SUBSET OF ABOVE NOT EQUAL TO
|
||||
2ACC; 2ACB # [BEST FIT] SUPERSET OF ABOVE NOT EQUAL TO
|
||||
2ACD; 2ACE # SQUARE LEFT OPEN BOX OPERATOR
|
||||
2ACE; 2ACD # SQUARE RIGHT OPEN BOX OPERATOR
|
||||
2ACF; 2AD0 # CLOSED SUBSET
|
||||
2AD0; 2ACF # CLOSED SUPERSET
|
||||
2AD1; 2AD2 # CLOSED SUBSET OR EQUAL TO
|
||||
2AD2; 2AD1 # CLOSED SUPERSET OR EQUAL TO
|
||||
2AD3; 2AD4 # SUBSET ABOVE SUPERSET
|
||||
2AD4; 2AD3 # SUPERSET ABOVE SUBSET
|
||||
2AD5; 2AD6 # SUBSET ABOVE SUBSET
|
||||
2AD6; 2AD5 # SUPERSET ABOVE SUPERSET
|
||||
2ADE; 22A6 # SHORT LEFT TACK
|
||||
2AE3; 22A9 # DOUBLE VERTICAL BAR LEFT TURNSTILE
|
||||
2AE4; 22A8 # VERTICAL BAR DOUBLE LEFT TURNSTILE
|
||||
2AE5; 22AB # DOUBLE VERTICAL BAR DOUBLE LEFT TURNSTILE
|
||||
2AEC; 2AED # DOUBLE STROKE NOT SIGN
|
||||
2AED; 2AEC # REVERSED DOUBLE STROKE NOT SIGN
|
||||
2AEE; 2224 # DOES NOT DIVIDE WITH REVERSED NEGATION SLASH
|
||||
2AF7; 2AF8 # TRIPLE NESTED LESS-THAN
|
||||
2AF8; 2AF7 # TRIPLE NESTED GREATER-THAN
|
||||
2AF9; 2AFA # DOUBLE-LINE SLANTED LESS-THAN OR EQUAL TO
|
||||
2AFA; 2AF9 # DOUBLE-LINE SLANTED GREATER-THAN OR EQUAL TO
|
||||
2BFE; 221F # REVERSED RIGHT ANGLE
|
||||
2E02; 2E03 # LEFT SUBSTITUTION BRACKET
|
||||
2E03; 2E02 # RIGHT SUBSTITUTION BRACKET
|
||||
2E04; 2E05 # LEFT DOTTED SUBSTITUTION BRACKET
|
||||
2E05; 2E04 # RIGHT DOTTED SUBSTITUTION BRACKET
|
||||
2E09; 2E0A # LEFT TRANSPOSITION BRACKET
|
||||
2E0A; 2E09 # RIGHT TRANSPOSITION BRACKET
|
||||
2E0C; 2E0D # LEFT RAISED OMISSION BRACKET
|
||||
2E0D; 2E0C # RIGHT RAISED OMISSION BRACKET
|
||||
2E1C; 2E1D # LEFT LOW PARAPHRASE BRACKET
|
||||
2E1D; 2E1C # RIGHT LOW PARAPHRASE BRACKET
|
||||
2E20; 2E21 # LEFT VERTICAL BAR WITH QUILL
|
||||
2E21; 2E20 # RIGHT VERTICAL BAR WITH QUILL
|
||||
2E22; 2E23 # TOP LEFT HALF BRACKET
|
||||
2E23; 2E22 # TOP RIGHT HALF BRACKET
|
||||
2E24; 2E25 # BOTTOM LEFT HALF BRACKET
|
||||
2E25; 2E24 # BOTTOM RIGHT HALF BRACKET
|
||||
2E26; 2E27 # LEFT SIDEWAYS U BRACKET
|
||||
2E27; 2E26 # RIGHT SIDEWAYS U BRACKET
|
||||
2E28; 2E29 # LEFT DOUBLE PARENTHESIS
|
||||
2E29; 2E28 # RIGHT DOUBLE PARENTHESIS
|
||||
2E55; 2E56 # LEFT SQUARE BRACKET WITH STROKE
|
||||
2E56; 2E55 # RIGHT SQUARE BRACKET WITH STROKE
|
||||
2E57; 2E58 # LEFT SQUARE BRACKET WITH DOUBLE STROKE
|
||||
2E58; 2E57 # RIGHT SQUARE BRACKET WITH DOUBLE STROKE
|
||||
2E59; 2E5A # TOP HALF LEFT PARENTHESIS
|
||||
2E5A; 2E59 # TOP HALF RIGHT PARENTHESIS
|
||||
2E5B; 2E5C # BOTTOM HALF LEFT PARENTHESIS
|
||||
2E5C; 2E5B # BOTTOM HALF RIGHT PARENTHESIS
|
||||
3008; 3009 # LEFT ANGLE BRACKET
|
||||
3009; 3008 # RIGHT ANGLE BRACKET
|
||||
300A; 300B # LEFT DOUBLE ANGLE BRACKET
|
||||
300B; 300A # RIGHT DOUBLE ANGLE BRACKET
|
||||
300C; 300D # [BEST FIT] LEFT CORNER BRACKET
|
||||
300D; 300C # [BEST FIT] RIGHT CORNER BRACKET
|
||||
300E; 300F # [BEST FIT] LEFT WHITE CORNER BRACKET
|
||||
300F; 300E # [BEST FIT] RIGHT WHITE CORNER BRACKET
|
||||
3010; 3011 # LEFT BLACK LENTICULAR BRACKET
|
||||
3011; 3010 # RIGHT BLACK LENTICULAR BRACKET
|
||||
3014; 3015 # LEFT TORTOISE SHELL BRACKET
|
||||
3015; 3014 # RIGHT TORTOISE SHELL BRACKET
|
||||
3016; 3017 # LEFT WHITE LENTICULAR BRACKET
|
||||
3017; 3016 # RIGHT WHITE LENTICULAR BRACKET
|
||||
3018; 3019 # LEFT WHITE TORTOISE SHELL BRACKET
|
||||
3019; 3018 # RIGHT WHITE TORTOISE SHELL BRACKET
|
||||
301A; 301B # LEFT WHITE SQUARE BRACKET
|
||||
301B; 301A # RIGHT WHITE SQUARE BRACKET
|
||||
FE59; FE5A # SMALL LEFT PARENTHESIS
|
||||
FE5A; FE59 # SMALL RIGHT PARENTHESIS
|
||||
FE5B; FE5C # SMALL LEFT CURLY BRACKET
|
||||
FE5C; FE5B # SMALL RIGHT CURLY BRACKET
|
||||
FE5D; FE5E # SMALL LEFT TORTOISE SHELL BRACKET
|
||||
FE5E; FE5D # SMALL RIGHT TORTOISE SHELL BRACKET
|
||||
FE64; FE65 # SMALL LESS-THAN SIGN
|
||||
FE65; FE64 # SMALL GREATER-THAN SIGN
|
||||
FF08; FF09 # FULLWIDTH LEFT PARENTHESIS
|
||||
FF09; FF08 # FULLWIDTH RIGHT PARENTHESIS
|
||||
FF1C; FF1E # FULLWIDTH LESS-THAN SIGN
|
||||
FF1E; FF1C # FULLWIDTH GREATER-THAN SIGN
|
||||
FF3B; FF3D # FULLWIDTH LEFT SQUARE BRACKET
|
||||
FF3D; FF3B # FULLWIDTH RIGHT SQUARE BRACKET
|
||||
FF5B; FF5D # FULLWIDTH LEFT CURLY BRACKET
|
||||
FF5D; FF5B # FULLWIDTH RIGHT CURLY BRACKET
|
||||
FF5F; FF60 # FULLWIDTH LEFT WHITE PARENTHESIS
|
||||
FF60; FF5F # FULLWIDTH RIGHT WHITE PARENTHESIS
|
||||
FF62; FF63 # [BEST FIT] HALFWIDTH LEFT CORNER BRACKET
|
||||
FF63; FF62 # [BEST FIT] HALFWIDTH RIGHT CORNER BRACKET
|
||||
|
||||
# The following characters have no appropriate mirroring character.
|
||||
# For these characters it is up to the rendering system
|
||||
# to provide mirrored glyphs.
|
||||
|
||||
# 2140; DOUBLE-STRUCK N-ARY SUMMATION
|
||||
# 2201; COMPLEMENT
|
||||
# 2202; PARTIAL DIFFERENTIAL
|
||||
# 2203; THERE EXISTS
|
||||
# 2204; THERE DOES NOT EXIST
|
||||
# 2211; N-ARY SUMMATION
|
||||
# 2216; SET MINUS
|
||||
# 221A; SQUARE ROOT
|
||||
# 221B; CUBE ROOT
|
||||
# 221C; FOURTH ROOT
|
||||
# 221D; PROPORTIONAL TO
|
||||
# 2226; NOT PARALLEL TO
|
||||
# 222B; INTEGRAL
|
||||
# 222C; DOUBLE INTEGRAL
|
||||
# 222D; TRIPLE INTEGRAL
|
||||
# 222E; CONTOUR INTEGRAL
|
||||
# 222F; SURFACE INTEGRAL
|
||||
# 2230; VOLUME INTEGRAL
|
||||
# 2231; CLOCKWISE INTEGRAL
|
||||
# 2232; CLOCKWISE CONTOUR INTEGRAL
|
||||
# 2233; ANTICLOCKWISE CONTOUR INTEGRAL
|
||||
# 2239; EXCESS
|
||||
# 223B; HOMOTHETIC
|
||||
# 223E; INVERTED LAZY S
|
||||
# 223F; SINE WAVE
|
||||
# 2240; WREATH PRODUCT
|
||||
# 2241; NOT TILDE
|
||||
# 2242; MINUS TILDE
|
||||
# 2244; NOT ASYMPTOTICALLY EQUAL TO
|
||||
# 2246; APPROXIMATELY BUT NOT ACTUALLY EQUAL TO
|
||||
# 2247; NEITHER APPROXIMATELY NOR ACTUALLY EQUAL TO
|
||||
# 2248; ALMOST EQUAL TO
|
||||
# 2249; NOT ALMOST EQUAL TO
|
||||
# 224A; ALMOST EQUAL OR EQUAL TO
|
||||
# 224B; TRIPLE TILDE
|
||||
# 225F; QUESTIONED EQUAL TO
|
||||
# 2260; NOT EQUAL TO
|
||||
# 2262; NOT IDENTICAL TO
|
||||
# 228C; MULTISET
|
||||
# 22A7; MODELS
|
||||
# 22AA; TRIPLE VERTICAL BAR RIGHT TURNSTILE
|
||||
# 22AC; DOES NOT PROVE
|
||||
# 22AD; NOT TRUE
|
||||
# 22AE; DOES NOT FORCE
|
||||
# 22AF; NEGATED DOUBLE VERTICAL BAR DOUBLE RIGHT TURNSTILE
|
||||
# 22BE; RIGHT ANGLE WITH ARC
|
||||
# 22BF; RIGHT TRIANGLE
|
||||
# 22F5; ELEMENT OF WITH DOT ABOVE
|
||||
# 22F8; ELEMENT OF WITH UNDERBAR
|
||||
# 22F9; ELEMENT OF WITH TWO HORIZONTAL STROKES
|
||||
# 22FF; Z NOTATION BAG MEMBERSHIP
|
||||
# 2320; TOP HALF INTEGRAL
|
||||
# 2321; BOTTOM HALF INTEGRAL
|
||||
# 27C0; THREE DIMENSIONAL ANGLE
|
||||
# 27CC; LONG DIVISION
|
||||
# 27D3; LOWER RIGHT CORNER WITH DOT
|
||||
# 27D4; UPPER LEFT CORNER WITH DOT
|
||||
# 299C; RIGHT ANGLE VARIANT WITH SQUARE
|
||||
# 299D; MEASURED RIGHT ANGLE WITH DOT
|
||||
# 299E; ANGLE WITH S INSIDE
|
||||
# 299F; ACUTE ANGLE
|
||||
# 29A2; TURNED ANGLE
|
||||
# 29A6; OBLIQUE ANGLE OPENING UP
|
||||
# 29A7; OBLIQUE ANGLE OPENING DOWN
|
||||
# 29C2; CIRCLE WITH SMALL CIRCLE TO THE RIGHT
|
||||
# 29C3; CIRCLE WITH TWO HORIZONTAL STROKES TO THE RIGHT
|
||||
# 29C9; TWO JOINED SQUARES
|
||||
# 29CE; RIGHT TRIANGLE ABOVE LEFT TRIANGLE
|
||||
# 29DC; INCOMPLETE INFINITY
|
||||
# 29E1; INCREASES AS
|
||||
# 29E3; EQUALS SIGN AND SLANTED PARALLEL
|
||||
# 29E4; EQUALS SIGN AND SLANTED PARALLEL WITH TILDE ABOVE
|
||||
# 29E5; IDENTICAL TO AND SLANTED PARALLEL
|
||||
# 29F4; RULE-DELAYED
|
||||
# 29F6; SOLIDUS WITH OVERBAR
|
||||
# 29F7; REVERSE SOLIDUS WITH HORIZONTAL STROKE
|
||||
# 2A0A; MODULO TWO SUM
|
||||
# 2A0B; SUMMATION WITH INTEGRAL
|
||||
# 2A0C; QUADRUPLE INTEGRAL OPERATOR
|
||||
# 2A0D; FINITE PART INTEGRAL
|
||||
# 2A0E; INTEGRAL WITH DOUBLE STROKE
|
||||
# 2A0F; INTEGRAL AVERAGE WITH SLASH
|
||||
# 2A10; CIRCULATION FUNCTION
|
||||
# 2A11; ANTICLOCKWISE INTEGRATION
|
||||
# 2A12; LINE INTEGRATION WITH RECTANGULAR PATH AROUND POLE
|
||||
# 2A13; LINE INTEGRATION WITH SEMICIRCULAR PATH AROUND POLE
|
||||
# 2A14; LINE INTEGRATION NOT INCLUDING THE POLE
|
||||
# 2A15; INTEGRAL AROUND A POINT OPERATOR
|
||||
# 2A16; QUATERNION INTEGRAL OPERATOR
|
||||
# 2A17; INTEGRAL WITH LEFTWARDS ARROW WITH HOOK
|
||||
# 2A18; INTEGRAL WITH TIMES SIGN
|
||||
# 2A19; INTEGRAL WITH INTERSECTION
|
||||
# 2A1A; INTEGRAL WITH UNION
|
||||
# 2A1B; INTEGRAL WITH OVERBAR
|
||||
# 2A1C; INTEGRAL WITH UNDERBAR
|
||||
# 2A1E; LARGE LEFT TRIANGLE OPERATOR
|
||||
# 2A1F; Z NOTATION SCHEMA COMPOSITION
|
||||
# 2A20; Z NOTATION SCHEMA PIPING
|
||||
# 2A21; Z NOTATION SCHEMA PROJECTION
|
||||
# 2A24; PLUS SIGN WITH TILDE ABOVE
|
||||
# 2A26; PLUS SIGN WITH TILDE BELOW
|
||||
# 2A29; MINUS SIGN WITH COMMA ABOVE
|
||||
# 2A3E; Z NOTATION RELATIONAL COMPOSITION
|
||||
# 2A57; SLOPING LARGE OR
|
||||
# 2A58; SLOPING LARGE AND
|
||||
# 2A6A; TILDE OPERATOR WITH DOT ABOVE
|
||||
# 2A6B; TILDE OPERATOR WITH RISING DOTS
|
||||
# 2A6C; SIMILAR MINUS SIMILAR
|
||||
# 2A6D; CONGRUENT WITH DOT ABOVE
|
||||
# 2A6F; ALMOST EQUAL TO WITH CIRCUMFLEX ACCENT
|
||||
# 2A70; APPROXIMATELY EQUAL OR EQUAL TO
|
||||
# 2A73; EQUALS SIGN ABOVE TILDE OPERATOR
|
||||
# 2A74; DOUBLE COLON EQUAL
|
||||
# 2AA3; DOUBLE NESTED LESS-THAN WITH UNDERBAR
|
||||
# 2ADC; FORKING
|
||||
# 2AE2; VERTICAL BAR TRIPLE RIGHT TURNSTILE
|
||||
# 2AE6; LONG DASH FROM LEFT MEMBER OF DOUBLE VERTICAL
|
||||
# 2AF3; PARALLEL WITH TILDE OPERATOR
|
||||
# 2AFB; TRIPLE SOLIDUS BINARY RELATION
|
||||
# 2AFD; DOUBLE SOLIDUS OPERATOR
|
||||
# 1D6DB; MATHEMATICAL BOLD PARTIAL DIFFERENTIAL
|
||||
# 1D715; MATHEMATICAL ITALIC PARTIAL DIFFERENTIAL
|
||||
# 1D74F; MATHEMATICAL BOLD ITALIC PARTIAL DIFFERENTIAL
|
||||
# 1D789; MATHEMATICAL SANS-SERIF BOLD PARTIAL DIFFERENTIAL
|
||||
# 1D7C3; MATHEMATICAL SANS-SERIF BOLD ITALIC PARTIAL DIFFERENTIAL
|
||||
|
||||
# EOF
|
363
util/unicode/data/Blocks.txt
Normal file
363
util/unicode/data/Blocks.txt
Normal file
@ -0,0 +1,363 @@
|
||||
# Blocks-15.0.0.txt
|
||||
# Date: 2022-01-28, 20:58:00 GMT [KW]
|
||||
# © 2022 Unicode®, Inc.
|
||||
# For terms of use, see https://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
# Unicode Character Database
|
||||
# For documentation, see https://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# Format:
|
||||
# Start Code..End Code; Block Name
|
||||
|
||||
# ================================================
|
||||
|
||||
# Note: When comparing block names, casing, whitespace, hyphens,
|
||||
# and underbars are ignored.
|
||||
# For example, "Latin Extended-A" and "latin extended a" are equivalent.
|
||||
# For more information on the comparison of property values,
|
||||
# see UAX #44: https://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# All block ranges start with a value where (cp MOD 16) = 0,
|
||||
# and end with a value where (cp MOD 16) = 15. In other words,
|
||||
# the last hexadecimal digit of the start of range is ...0
|
||||
# and the last hexadecimal digit of the end of range is ...F.
|
||||
# This constraint on block ranges guarantees that allocations
|
||||
# are done in terms of whole columns, and that code chart display
|
||||
# never involves splitting columns in the charts.
|
||||
#
|
||||
# All code points not explicitly listed for Block
|
||||
# have the value No_Block.
|
||||
|
||||
# Property: Block
|
||||
#
|
||||
# @missing: 0000..10FFFF; No_Block
|
||||
|
||||
0000..007F; Basic Latin
|
||||
0080..00FF; Latin-1 Supplement
|
||||
0100..017F; Latin Extended-A
|
||||
0180..024F; Latin Extended-B
|
||||
0250..02AF; IPA Extensions
|
||||
02B0..02FF; Spacing Modifier Letters
|
||||
0300..036F; Combining Diacritical Marks
|
||||
0370..03FF; Greek and Coptic
|
||||
0400..04FF; Cyrillic
|
||||
0500..052F; Cyrillic Supplement
|
||||
0530..058F; Armenian
|
||||
0590..05FF; Hebrew
|
||||
0600..06FF; Arabic
|
||||
0700..074F; Syriac
|
||||
0750..077F; Arabic Supplement
|
||||
0780..07BF; Thaana
|
||||
07C0..07FF; NKo
|
||||
0800..083F; Samaritan
|
||||
0840..085F; Mandaic
|
||||
0860..086F; Syriac Supplement
|
||||
0870..089F; Arabic Extended-B
|
||||
08A0..08FF; Arabic Extended-A
|
||||
0900..097F; Devanagari
|
||||
0980..09FF; Bengali
|
||||
0A00..0A7F; Gurmukhi
|
||||
0A80..0AFF; Gujarati
|
||||
0B00..0B7F; Oriya
|
||||
0B80..0BFF; Tamil
|
||||
0C00..0C7F; Telugu
|
||||
0C80..0CFF; Kannada
|
||||
0D00..0D7F; Malayalam
|
||||
0D80..0DFF; Sinhala
|
||||
0E00..0E7F; Thai
|
||||
0E80..0EFF; Lao
|
||||
0F00..0FFF; Tibetan
|
||||
1000..109F; Myanmar
|
||||
10A0..10FF; Georgian
|
||||
1100..11FF; Hangul Jamo
|
||||
1200..137F; Ethiopic
|
||||
1380..139F; Ethiopic Supplement
|
||||
13A0..13FF; Cherokee
|
||||
1400..167F; Unified Canadian Aboriginal Syllabics
|
||||
1680..169F; Ogham
|
||||
16A0..16FF; Runic
|
||||
1700..171F; Tagalog
|
||||
1720..173F; Hanunoo
|
||||
1740..175F; Buhid
|
||||
1760..177F; Tagbanwa
|
||||
1780..17FF; Khmer
|
||||
1800..18AF; Mongolian
|
||||
18B0..18FF; Unified Canadian Aboriginal Syllabics Extended
|
||||
1900..194F; Limbu
|
||||
1950..197F; Tai Le
|
||||
1980..19DF; New Tai Lue
|
||||
19E0..19FF; Khmer Symbols
|
||||
1A00..1A1F; Buginese
|
||||
1A20..1AAF; Tai Tham
|
||||
1AB0..1AFF; Combining Diacritical Marks Extended
|
||||
1B00..1B7F; Balinese
|
||||
1B80..1BBF; Sundanese
|
||||
1BC0..1BFF; Batak
|
||||
1C00..1C4F; Lepcha
|
||||
1C50..1C7F; Ol Chiki
|
||||
1C80..1C8F; Cyrillic Extended-C
|
||||
1C90..1CBF; Georgian Extended
|
||||
1CC0..1CCF; Sundanese Supplement
|
||||
1CD0..1CFF; Vedic Extensions
|
||||
1D00..1D7F; Phonetic Extensions
|
||||
1D80..1DBF; Phonetic Extensions Supplement
|
||||
1DC0..1DFF; Combining Diacritical Marks Supplement
|
||||
1E00..1EFF; Latin Extended Additional
|
||||
1F00..1FFF; Greek Extended
|
||||
2000..206F; General Punctuation
|
||||
2070..209F; Superscripts and Subscripts
|
||||
20A0..20CF; Currency Symbols
|
||||
20D0..20FF; Combining Diacritical Marks for Symbols
|
||||
2100..214F; Letterlike Symbols
|
||||
2150..218F; Number Forms
|
||||
2190..21FF; Arrows
|
||||
2200..22FF; Mathematical Operators
|
||||
2300..23FF; Miscellaneous Technical
|
||||
2400..243F; Control Pictures
|
||||
2440..245F; Optical Character Recognition
|
||||
2460..24FF; Enclosed Alphanumerics
|
||||
2500..257F; Box Drawing
|
||||
2580..259F; Block Elements
|
||||
25A0..25FF; Geometric Shapes
|
||||
2600..26FF; Miscellaneous Symbols
|
||||
2700..27BF; Dingbats
|
||||
27C0..27EF; Miscellaneous Mathematical Symbols-A
|
||||
27F0..27FF; Supplemental Arrows-A
|
||||
2800..28FF; Braille Patterns
|
||||
2900..297F; Supplemental Arrows-B
|
||||
2980..29FF; Miscellaneous Mathematical Symbols-B
|
||||
2A00..2AFF; Supplemental Mathematical Operators
|
||||
2B00..2BFF; Miscellaneous Symbols and Arrows
|
||||
2C00..2C5F; Glagolitic
|
||||
2C60..2C7F; Latin Extended-C
|
||||
2C80..2CFF; Coptic
|
||||
2D00..2D2F; Georgian Supplement
|
||||
2D30..2D7F; Tifinagh
|
||||
2D80..2DDF; Ethiopic Extended
|
||||
2DE0..2DFF; Cyrillic Extended-A
|
||||
2E00..2E7F; Supplemental Punctuation
|
||||
2E80..2EFF; CJK Radicals Supplement
|
||||
2F00..2FDF; Kangxi Radicals
|
||||
2FF0..2FFF; Ideographic Description Characters
|
||||
3000..303F; CJK Symbols and Punctuation
|
||||
3040..309F; Hiragana
|
||||
30A0..30FF; Katakana
|
||||
3100..312F; Bopomofo
|
||||
3130..318F; Hangul Compatibility Jamo
|
||||
3190..319F; Kanbun
|
||||
31A0..31BF; Bopomofo Extended
|
||||
31C0..31EF; CJK Strokes
|
||||
31F0..31FF; Katakana Phonetic Extensions
|
||||
3200..32FF; Enclosed CJK Letters and Months
|
||||
3300..33FF; CJK Compatibility
|
||||
3400..4DBF; CJK Unified Ideographs Extension A
|
||||
4DC0..4DFF; Yijing Hexagram Symbols
|
||||
4E00..9FFF; CJK Unified Ideographs
|
||||
A000..A48F; Yi Syllables
|
||||
A490..A4CF; Yi Radicals
|
||||
A4D0..A4FF; Lisu
|
||||
A500..A63F; Vai
|
||||
A640..A69F; Cyrillic Extended-B
|
||||
A6A0..A6FF; Bamum
|
||||
A700..A71F; Modifier Tone Letters
|
||||
A720..A7FF; Latin Extended-D
|
||||
A800..A82F; Syloti Nagri
|
||||
A830..A83F; Common Indic Number Forms
|
||||
A840..A87F; Phags-pa
|
||||
A880..A8DF; Saurashtra
|
||||
A8E0..A8FF; Devanagari Extended
|
||||
A900..A92F; Kayah Li
|
||||
A930..A95F; Rejang
|
||||
A960..A97F; Hangul Jamo Extended-A
|
||||
A980..A9DF; Javanese
|
||||
A9E0..A9FF; Myanmar Extended-B
|
||||
AA00..AA5F; Cham
|
||||
AA60..AA7F; Myanmar Extended-A
|
||||
AA80..AADF; Tai Viet
|
||||
AAE0..AAFF; Meetei Mayek Extensions
|
||||
AB00..AB2F; Ethiopic Extended-A
|
||||
AB30..AB6F; Latin Extended-E
|
||||
AB70..ABBF; Cherokee Supplement
|
||||
ABC0..ABFF; Meetei Mayek
|
||||
AC00..D7AF; Hangul Syllables
|
||||
D7B0..D7FF; Hangul Jamo Extended-B
|
||||
D800..DB7F; High Surrogates
|
||||
DB80..DBFF; High Private Use Surrogates
|
||||
DC00..DFFF; Low Surrogates
|
||||
E000..F8FF; Private Use Area
|
||||
F900..FAFF; CJK Compatibility Ideographs
|
||||
FB00..FB4F; Alphabetic Presentation Forms
|
||||
FB50..FDFF; Arabic Presentation Forms-A
|
||||
FE00..FE0F; Variation Selectors
|
||||
FE10..FE1F; Vertical Forms
|
||||
FE20..FE2F; Combining Half Marks
|
||||
FE30..FE4F; CJK Compatibility Forms
|
||||
FE50..FE6F; Small Form Variants
|
||||
FE70..FEFF; Arabic Presentation Forms-B
|
||||
FF00..FFEF; Halfwidth and Fullwidth Forms
|
||||
FFF0..FFFF; Specials
|
||||
10000..1007F; Linear B Syllabary
|
||||
10080..100FF; Linear B Ideograms
|
||||
10100..1013F; Aegean Numbers
|
||||
10140..1018F; Ancient Greek Numbers
|
||||
10190..101CF; Ancient Symbols
|
||||
101D0..101FF; Phaistos Disc
|
||||
10280..1029F; Lycian
|
||||
102A0..102DF; Carian
|
||||
102E0..102FF; Coptic Epact Numbers
|
||||
10300..1032F; Old Italic
|
||||
10330..1034F; Gothic
|
||||
10350..1037F; Old Permic
|
||||
10380..1039F; Ugaritic
|
||||
103A0..103DF; Old Persian
|
||||
10400..1044F; Deseret
|
||||
10450..1047F; Shavian
|
||||
10480..104AF; Osmanya
|
||||
104B0..104FF; Osage
|
||||
10500..1052F; Elbasan
|
||||
10530..1056F; Caucasian Albanian
|
||||
10570..105BF; Vithkuqi
|
||||
10600..1077F; Linear A
|
||||
10780..107BF; Latin Extended-F
|
||||
10800..1083F; Cypriot Syllabary
|
||||
10840..1085F; Imperial Aramaic
|
||||
10860..1087F; Palmyrene
|
||||
10880..108AF; Nabataean
|
||||
108E0..108FF; Hatran
|
||||
10900..1091F; Phoenician
|
||||
10920..1093F; Lydian
|
||||
10980..1099F; Meroitic Hieroglyphs
|
||||
109A0..109FF; Meroitic Cursive
|
||||
10A00..10A5F; Kharoshthi
|
||||
10A60..10A7F; Old South Arabian
|
||||
10A80..10A9F; Old North Arabian
|
||||
10AC0..10AFF; Manichaean
|
||||
10B00..10B3F; Avestan
|
||||
10B40..10B5F; Inscriptional Parthian
|
||||
10B60..10B7F; Inscriptional Pahlavi
|
||||
10B80..10BAF; Psalter Pahlavi
|
||||
10C00..10C4F; Old Turkic
|
||||
10C80..10CFF; Old Hungarian
|
||||
10D00..10D3F; Hanifi Rohingya
|
||||
10E60..10E7F; Rumi Numeral Symbols
|
||||
10E80..10EBF; Yezidi
|
||||
10EC0..10EFF; Arabic Extended-C
|
||||
10F00..10F2F; Old Sogdian
|
||||
10F30..10F6F; Sogdian
|
||||
10F70..10FAF; Old Uyghur
|
||||
10FB0..10FDF; Chorasmian
|
||||
10FE0..10FFF; Elymaic
|
||||
11000..1107F; Brahmi
|
||||
11080..110CF; Kaithi
|
||||
110D0..110FF; Sora Sompeng
|
||||
11100..1114F; Chakma
|
||||
11150..1117F; Mahajani
|
||||
11180..111DF; Sharada
|
||||
111E0..111FF; Sinhala Archaic Numbers
|
||||
11200..1124F; Khojki
|
||||
11280..112AF; Multani
|
||||
112B0..112FF; Khudawadi
|
||||
11300..1137F; Grantha
|
||||
11400..1147F; Newa
|
||||
11480..114DF; Tirhuta
|
||||
11580..115FF; Siddham
|
||||
11600..1165F; Modi
|
||||
11660..1167F; Mongolian Supplement
|
||||
11680..116CF; Takri
|
||||
11700..1174F; Ahom
|
||||
11800..1184F; Dogra
|
||||
118A0..118FF; Warang Citi
|
||||
11900..1195F; Dives Akuru
|
||||
119A0..119FF; Nandinagari
|
||||
11A00..11A4F; Zanabazar Square
|
||||
11A50..11AAF; Soyombo
|
||||
11AB0..11ABF; Unified Canadian Aboriginal Syllabics Extended-A
|
||||
11AC0..11AFF; Pau Cin Hau
|
||||
11B00..11B5F; Devanagari Extended-A
|
||||
11C00..11C6F; Bhaiksuki
|
||||
11C70..11CBF; Marchen
|
||||
11D00..11D5F; Masaram Gondi
|
||||
11D60..11DAF; Gunjala Gondi
|
||||
11EE0..11EFF; Makasar
|
||||
11F00..11F5F; Kawi
|
||||
11FB0..11FBF; Lisu Supplement
|
||||
11FC0..11FFF; Tamil Supplement
|
||||
12000..123FF; Cuneiform
|
||||
12400..1247F; Cuneiform Numbers and Punctuation
|
||||
12480..1254F; Early Dynastic Cuneiform
|
||||
12F90..12FFF; Cypro-Minoan
|
||||
13000..1342F; Egyptian Hieroglyphs
|
||||
13430..1345F; Egyptian Hieroglyph Format Controls
|
||||
14400..1467F; Anatolian Hieroglyphs
|
||||
16800..16A3F; Bamum Supplement
|
||||
16A40..16A6F; Mro
|
||||
16A70..16ACF; Tangsa
|
||||
16AD0..16AFF; Bassa Vah
|
||||
16B00..16B8F; Pahawh Hmong
|
||||
16E40..16E9F; Medefaidrin
|
||||
16F00..16F9F; Miao
|
||||
16FE0..16FFF; Ideographic Symbols and Punctuation
|
||||
17000..187FF; Tangut
|
||||
18800..18AFF; Tangut Components
|
||||
18B00..18CFF; Khitan Small Script
|
||||
18D00..18D7F; Tangut Supplement
|
||||
1AFF0..1AFFF; Kana Extended-B
|
||||
1B000..1B0FF; Kana Supplement
|
||||
1B100..1B12F; Kana Extended-A
|
||||
1B130..1B16F; Small Kana Extension
|
||||
1B170..1B2FF; Nushu
|
||||
1BC00..1BC9F; Duployan
|
||||
1BCA0..1BCAF; Shorthand Format Controls
|
||||
1CF00..1CFCF; Znamenny Musical Notation
|
||||
1D000..1D0FF; Byzantine Musical Symbols
|
||||
1D100..1D1FF; Musical Symbols
|
||||
1D200..1D24F; Ancient Greek Musical Notation
|
||||
1D2C0..1D2DF; Kaktovik Numerals
|
||||
1D2E0..1D2FF; Mayan Numerals
|
||||
1D300..1D35F; Tai Xuan Jing Symbols
|
||||
1D360..1D37F; Counting Rod Numerals
|
||||
1D400..1D7FF; Mathematical Alphanumeric Symbols
|
||||
1D800..1DAAF; Sutton SignWriting
|
||||
1DF00..1DFFF; Latin Extended-G
|
||||
1E000..1E02F; Glagolitic Supplement
|
||||
1E030..1E08F; Cyrillic Extended-D
|
||||
1E100..1E14F; Nyiakeng Puachue Hmong
|
||||
1E290..1E2BF; Toto
|
||||
1E2C0..1E2FF; Wancho
|
||||
1E4D0..1E4FF; Nag Mundari
|
||||
1E7E0..1E7FF; Ethiopic Extended-B
|
||||
1E800..1E8DF; Mende Kikakui
|
||||
1E900..1E95F; Adlam
|
||||
1EC70..1ECBF; Indic Siyaq Numbers
|
||||
1ED00..1ED4F; Ottoman Siyaq Numbers
|
||||
1EE00..1EEFF; Arabic Mathematical Alphabetic Symbols
|
||||
1F000..1F02F; Mahjong Tiles
|
||||
1F030..1F09F; Domino Tiles
|
||||
1F0A0..1F0FF; Playing Cards
|
||||
1F100..1F1FF; Enclosed Alphanumeric Supplement
|
||||
1F200..1F2FF; Enclosed Ideographic Supplement
|
||||
1F300..1F5FF; Miscellaneous Symbols and Pictographs
|
||||
1F600..1F64F; Emoticons
|
||||
1F650..1F67F; Ornamental Dingbats
|
||||
1F680..1F6FF; Transport and Map Symbols
|
||||
1F700..1F77F; Alchemical Symbols
|
||||
1F780..1F7FF; Geometric Shapes Extended
|
||||
1F800..1F8FF; Supplemental Arrows-C
|
||||
1F900..1F9FF; Supplemental Symbols and Pictographs
|
||||
1FA00..1FA6F; Chess Symbols
|
||||
1FA70..1FAFF; Symbols and Pictographs Extended-A
|
||||
1FB00..1FBFF; Symbols for Legacy Computing
|
||||
20000..2A6DF; CJK Unified Ideographs Extension B
|
||||
2A700..2B73F; CJK Unified Ideographs Extension C
|
||||
2B740..2B81F; CJK Unified Ideographs Extension D
|
||||
2B820..2CEAF; CJK Unified Ideographs Extension E
|
||||
2CEB0..2EBEF; CJK Unified Ideographs Extension F
|
||||
2F800..2FA1F; CJK Compatibility Ideographs Supplement
|
||||
30000..3134F; CJK Unified Ideographs Extension G
|
||||
31350..323AF; CJK Unified Ideographs Extension H
|
||||
E0000..E007F; Tags
|
||||
E0100..E01EF; Variation Selectors Supplement
|
||||
F0000..FFFFF; Supplementary Private Use Area-A
|
||||
100000..10FFFF; Supplementary Private Use Area-B
|
||||
|
||||
# EOF
|
1624
util/unicode/data/CaseFolding.txt
Normal file
1624
util/unicode/data/CaseFolding.txt
Normal file
File diff suppressed because it is too large
Load Diff
1994
util/unicode/data/DerivedAge.txt
Normal file
1994
util/unicode/data/DerivedAge.txt
Normal file
File diff suppressed because it is too large
Load Diff
10018
util/unicode/data/DerivedNormalizationProps.txt
Normal file
10018
util/unicode/data/DerivedNormalizationProps.txt
Normal file
File diff suppressed because it is too large
Load Diff
2619
util/unicode/data/EastAsianWidth.txt
Normal file
2619
util/unicode/data/EastAsianWidth.txt
Normal file
File diff suppressed because it is too large
Load Diff
1475
util/unicode/data/GraphemeBreakProperty.txt
Normal file
1475
util/unicode/data/GraphemeBreakProperty.txt
Normal file
File diff suppressed because it is too large
Load Diff
9027
util/unicode/data/IdnaMappingTable.txt
Normal file
9027
util/unicode/data/IdnaMappingTable.txt
Normal file
File diff suppressed because it is too large
Load Diff
3597
util/unicode/data/LineBreak.txt
Normal file
3597
util/unicode/data/LineBreak.txt
Normal file
File diff suppressed because it is too large
Load Diff
52
util/unicode/data/NormalizationCorrections.txt
Normal file
52
util/unicode/data/NormalizationCorrections.txt
Normal file
@ -0,0 +1,52 @@
|
||||
# NormalizationCorrections-15.0.0.txt
|
||||
# Date: 2022-05-03, 18:53:00 GMT [KW, LI]
|
||||
# © 2022 Unicode®, Inc.
|
||||
# For terms of use, see https://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
# Unicode Character Database
|
||||
# For documentation, see https://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# This file is a normative contributory data file in the
|
||||
# Unicode Character Database.
|
||||
#
|
||||
# The normalization stability policy of the Unicode Consortium
|
||||
# ordinarily precludes any change to the decomposition
|
||||
# for any character, once established in a relevant version
|
||||
# of the UnicodeData.txt data file. However, under certain
|
||||
# exceptional (and rare) conditions, an error in a decomposition
|
||||
# mapping may be discovered that is truly just an unintended
|
||||
# typo in the data, and not a matter of dubious interpretation.
|
||||
#
|
||||
# Whenever such an error may be found, and if it meets the
|
||||
# requirements for possible exceptions to normalization
|
||||
# stability, the correction is entered in this data file,
|
||||
# so that any implementation depending on absolute stability
|
||||
# of normalization, *including* any errors in the data, can
|
||||
# safely reconstruct the exact state of the data tables at
|
||||
# any given version of Unicode.
|
||||
#
|
||||
# Currently this list has exactly six entries in it, one for the
|
||||
# typo found and corrected in Corrigendum #3, and five for
|
||||
# the typos and misidentifications found and corrected in
|
||||
# Corrigendum #4. All efforts
|
||||
# will be made to keep the entries limited to just those fixes.
|
||||
#
|
||||
# Interpretation of the fields:
|
||||
# Field 0: Unicode code point
|
||||
# Field 1: Original (erroneous) decomposition
|
||||
# Field 2: Corrected decomposition
|
||||
# Field 3: Version of Unicode for which the correction was
|
||||
# entered into UnicodeData.txt, in n.n.n format.
|
||||
# Comment: Indicates the Unicode Corrigendum which documents
|
||||
# the correction
|
||||
#
|
||||
# For more information, see UAX #15, Unicode Normalization Forms.
|
||||
#
|
||||
F951;96FB;964B;3.2.0 # Corrigendum 3
|
||||
2F868;2136A;36FC;4.0.0 # Corrigendum 4
|
||||
2F874;5F33;5F53;4.0.0 # Corrigendum 4
|
||||
2F91F;43AB;243AB;4.0.0 # Corrigendum 4
|
||||
2F95F;7AAE;7AEE;4.0.0 # Corrigendum 4
|
||||
2F9BF;4D57;45D7;4.0.0 # Corrigendum 4
|
||||
|
||||
# EOF
|
3031
util/unicode/data/Scripts.txt
Normal file
3031
util/unicode/data/Scripts.txt
Normal file
File diff suppressed because it is too large
Load Diff
2921
util/unicode/data/SentenceBreakProperty.txt
Normal file
2921
util/unicode/data/SentenceBreakProperty.txt
Normal file
File diff suppressed because it is too large
Load Diff
281
util/unicode/data/SpecialCasing.txt
Normal file
281
util/unicode/data/SpecialCasing.txt
Normal file
@ -0,0 +1,281 @@
|
||||
# SpecialCasing-15.0.0.txt
|
||||
# Date: 2022-02-02, 23:35:52 GMT
|
||||
# © 2022 Unicode®, Inc.
|
||||
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
|
||||
# For terms of use, see https://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
# Unicode Character Database
|
||||
# For documentation, see https://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# Special Casing
|
||||
#
|
||||
# This file is a supplement to the UnicodeData.txt file. It does not define any
|
||||
# properties, but rather provides additional information about the casing of
|
||||
# Unicode characters, for situations when casing incurs a change in string length
|
||||
# or is dependent on context or locale. For compatibility, the UnicodeData.txt
|
||||
# file only contains simple case mappings for characters where they are one-to-one
|
||||
# and independent of context and language. The data in this file, combined with
|
||||
# the simple case mappings in UnicodeData.txt, defines the full case mappings
|
||||
# Lowercase_Mapping (lc), Titlecase_Mapping (tc), and Uppercase_Mapping (uc).
|
||||
#
|
||||
# Note that the preferred mechanism for defining tailored casing operations is
|
||||
# the Unicode Common Locale Data Repository (CLDR). For more information, see the
|
||||
# discussion of case mappings and case algorithms in the Unicode Standard.
|
||||
#
|
||||
# All code points not listed in this file that do not have a simple case mappings
|
||||
# in UnicodeData.txt map to themselves.
|
||||
# ================================================================================
|
||||
# Format
|
||||
# ================================================================================
|
||||
# The entries in this file are in the following machine-readable format:
|
||||
#
|
||||
# <code>; <lower>; <title>; <upper>; (<condition_list>;)? # <comment>
|
||||
#
|
||||
# <code>, <lower>, <title>, and <upper> provide the respective full case mappings
|
||||
# of <code>, expressed as character values in hex. If there is more than one character,
|
||||
# they are separated by spaces. Other than as used to separate elements, spaces are
|
||||
# to be ignored.
|
||||
#
|
||||
# The <condition_list> is optional. Where present, it consists of one or more language IDs
|
||||
# or casing contexts, separated by spaces. In these conditions:
|
||||
# - A condition list overrides the normal behavior if all of the listed conditions are true.
|
||||
# - The casing context is always the context of the characters in the original string,
|
||||
# NOT in the resulting string.
|
||||
# - Case distinctions in the condition list are not significant.
|
||||
# - Conditions preceded by "Not_" represent the negation of the condition.
|
||||
# The condition list is not represented in the UCD as a formal property.
|
||||
#
|
||||
# A language ID is defined by BCP 47, with '-' and '_' treated equivalently.
|
||||
#
|
||||
# A casing context for a character is defined by Section 3.13 Default Case Algorithms
|
||||
# of The Unicode Standard.
|
||||
#
|
||||
# Parsers of this file must be prepared to deal with future additions to this format:
|
||||
# * Additional contexts
|
||||
# * Additional fields
|
||||
# ================================================================================
|
||||
|
||||
# ================================================================================
|
||||
# Unconditional mappings
|
||||
# ================================================================================
|
||||
|
||||
# The German es-zed is special--the normal mapping is to SS.
|
||||
# Note: the titlecase should never occur in practice. It is equal to titlecase(uppercase(<es-zed>))
|
||||
|
||||
00DF; 00DF; 0053 0073; 0053 0053; # LATIN SMALL LETTER SHARP S
|
||||
|
||||
# Preserve canonical equivalence for I with dot. Turkic is handled below.
|
||||
|
||||
0130; 0069 0307; 0130; 0130; # LATIN CAPITAL LETTER I WITH DOT ABOVE
|
||||
|
||||
# Ligatures
|
||||
|
||||
FB00; FB00; 0046 0066; 0046 0046; # LATIN SMALL LIGATURE FF
|
||||
FB01; FB01; 0046 0069; 0046 0049; # LATIN SMALL LIGATURE FI
|
||||
FB02; FB02; 0046 006C; 0046 004C; # LATIN SMALL LIGATURE FL
|
||||
FB03; FB03; 0046 0066 0069; 0046 0046 0049; # LATIN SMALL LIGATURE FFI
|
||||
FB04; FB04; 0046 0066 006C; 0046 0046 004C; # LATIN SMALL LIGATURE FFL
|
||||
FB05; FB05; 0053 0074; 0053 0054; # LATIN SMALL LIGATURE LONG S T
|
||||
FB06; FB06; 0053 0074; 0053 0054; # LATIN SMALL LIGATURE ST
|
||||
|
||||
0587; 0587; 0535 0582; 0535 0552; # ARMENIAN SMALL LIGATURE ECH YIWN
|
||||
FB13; FB13; 0544 0576; 0544 0546; # ARMENIAN SMALL LIGATURE MEN NOW
|
||||
FB14; FB14; 0544 0565; 0544 0535; # ARMENIAN SMALL LIGATURE MEN ECH
|
||||
FB15; FB15; 0544 056B; 0544 053B; # ARMENIAN SMALL LIGATURE MEN INI
|
||||
FB16; FB16; 054E 0576; 054E 0546; # ARMENIAN SMALL LIGATURE VEW NOW
|
||||
FB17; FB17; 0544 056D; 0544 053D; # ARMENIAN SMALL LIGATURE MEN XEH
|
||||
|
||||
# No corresponding uppercase precomposed character
|
||||
|
||||
0149; 0149; 02BC 004E; 02BC 004E; # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
|
||||
0390; 0390; 0399 0308 0301; 0399 0308 0301; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS
|
||||
03B0; 03B0; 03A5 0308 0301; 03A5 0308 0301; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS
|
||||
01F0; 01F0; 004A 030C; 004A 030C; # LATIN SMALL LETTER J WITH CARON
|
||||
1E96; 1E96; 0048 0331; 0048 0331; # LATIN SMALL LETTER H WITH LINE BELOW
|
||||
1E97; 1E97; 0054 0308; 0054 0308; # LATIN SMALL LETTER T WITH DIAERESIS
|
||||
1E98; 1E98; 0057 030A; 0057 030A; # LATIN SMALL LETTER W WITH RING ABOVE
|
||||
1E99; 1E99; 0059 030A; 0059 030A; # LATIN SMALL LETTER Y WITH RING ABOVE
|
||||
1E9A; 1E9A; 0041 02BE; 0041 02BE; # LATIN SMALL LETTER A WITH RIGHT HALF RING
|
||||
1F50; 1F50; 03A5 0313; 03A5 0313; # GREEK SMALL LETTER UPSILON WITH PSILI
|
||||
1F52; 1F52; 03A5 0313 0300; 03A5 0313 0300; # GREEK SMALL LETTER UPSILON WITH PSILI AND VARIA
|
||||
1F54; 1F54; 03A5 0313 0301; 03A5 0313 0301; # GREEK SMALL LETTER UPSILON WITH PSILI AND OXIA
|
||||
1F56; 1F56; 03A5 0313 0342; 03A5 0313 0342; # GREEK SMALL LETTER UPSILON WITH PSILI AND PERISPOMENI
|
||||
1FB6; 1FB6; 0391 0342; 0391 0342; # GREEK SMALL LETTER ALPHA WITH PERISPOMENI
|
||||
1FC6; 1FC6; 0397 0342; 0397 0342; # GREEK SMALL LETTER ETA WITH PERISPOMENI
|
||||
1FD2; 1FD2; 0399 0308 0300; 0399 0308 0300; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND VARIA
|
||||
1FD3; 1FD3; 0399 0308 0301; 0399 0308 0301; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA
|
||||
1FD6; 1FD6; 0399 0342; 0399 0342; # GREEK SMALL LETTER IOTA WITH PERISPOMENI
|
||||
1FD7; 1FD7; 0399 0308 0342; 0399 0308 0342; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND PERISPOMENI
|
||||
1FE2; 1FE2; 03A5 0308 0300; 03A5 0308 0300; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND VARIA
|
||||
1FE3; 1FE3; 03A5 0308 0301; 03A5 0308 0301; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND OXIA
|
||||
1FE4; 1FE4; 03A1 0313; 03A1 0313; # GREEK SMALL LETTER RHO WITH PSILI
|
||||
1FE6; 1FE6; 03A5 0342; 03A5 0342; # GREEK SMALL LETTER UPSILON WITH PERISPOMENI
|
||||
1FE7; 1FE7; 03A5 0308 0342; 03A5 0308 0342; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI
|
||||
1FF6; 1FF6; 03A9 0342; 03A9 0342; # GREEK SMALL LETTER OMEGA WITH PERISPOMENI
|
||||
|
||||
# IMPORTANT-when iota-subscript (0345) is uppercased or titlecased,
|
||||
# the result will be incorrect unless the iota-subscript is moved to the end
|
||||
# of any sequence of combining marks. Otherwise, the accents will go on the capital iota.
|
||||
# This process can be achieved by first transforming the text to NFC before casing.
|
||||
# E.g. <alpha><iota_subscript><acute> is uppercased to <ALPHA><acute><IOTA>
|
||||
|
||||
# The following cases are already in the UnicodeData.txt file, so are only commented here.
|
||||
|
||||
# 0345; 0345; 0399; 0399; # COMBINING GREEK YPOGEGRAMMENI
|
||||
|
||||
# All letters with YPOGEGRAMMENI (iota-subscript) or PROSGEGRAMMENI (iota adscript)
|
||||
# have special uppercases.
|
||||
# Note: characters with PROSGEGRAMMENI are actually titlecase, not uppercase!
|
||||
|
||||
1F80; 1F80; 1F88; 1F08 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI
|
||||
1F81; 1F81; 1F89; 1F09 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND YPOGEGRAMMENI
|
||||
1F82; 1F82; 1F8A; 1F0A 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA AND YPOGEGRAMMENI
|
||||
1F83; 1F83; 1F8B; 1F0B 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA AND YPOGEGRAMMENI
|
||||
1F84; 1F84; 1F8C; 1F0C 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA AND YPOGEGRAMMENI
|
||||
1F85; 1F85; 1F8D; 1F0D 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA AND YPOGEGRAMMENI
|
||||
1F86; 1F86; 1F8E; 1F0E 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI
|
||||
1F87; 1F87; 1F8F; 1F0F 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
|
||||
1F88; 1F80; 1F88; 1F08 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND PROSGEGRAMMENI
|
||||
1F89; 1F81; 1F89; 1F09 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND PROSGEGRAMMENI
|
||||
1F8A; 1F82; 1F8A; 1F0A 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA AND PROSGEGRAMMENI
|
||||
1F8B; 1F83; 1F8B; 1F0B 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA AND PROSGEGRAMMENI
|
||||
1F8C; 1F84; 1F8C; 1F0C 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA AND PROSGEGRAMMENI
|
||||
1F8D; 1F85; 1F8D; 1F0D 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA AND PROSGEGRAMMENI
|
||||
1F8E; 1F86; 1F8E; 1F0E 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI
|
||||
1F8F; 1F87; 1F8F; 1F0F 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI
|
||||
1F90; 1F90; 1F98; 1F28 0399; # GREEK SMALL LETTER ETA WITH PSILI AND YPOGEGRAMMENI
|
||||
1F91; 1F91; 1F99; 1F29 0399; # GREEK SMALL LETTER ETA WITH DASIA AND YPOGEGRAMMENI
|
||||
1F92; 1F92; 1F9A; 1F2A 0399; # GREEK SMALL LETTER ETA WITH PSILI AND VARIA AND YPOGEGRAMMENI
|
||||
1F93; 1F93; 1F9B; 1F2B 0399; # GREEK SMALL LETTER ETA WITH DASIA AND VARIA AND YPOGEGRAMMENI
|
||||
1F94; 1F94; 1F9C; 1F2C 0399; # GREEK SMALL LETTER ETA WITH PSILI AND OXIA AND YPOGEGRAMMENI
|
||||
1F95; 1F95; 1F9D; 1F2D 0399; # GREEK SMALL LETTER ETA WITH DASIA AND OXIA AND YPOGEGRAMMENI
|
||||
1F96; 1F96; 1F9E; 1F2E 0399; # GREEK SMALL LETTER ETA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI
|
||||
1F97; 1F97; 1F9F; 1F2F 0399; # GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
|
||||
1F98; 1F90; 1F98; 1F28 0399; # GREEK CAPITAL LETTER ETA WITH PSILI AND PROSGEGRAMMENI
|
||||
1F99; 1F91; 1F99; 1F29 0399; # GREEK CAPITAL LETTER ETA WITH DASIA AND PROSGEGRAMMENI
|
||||
1F9A; 1F92; 1F9A; 1F2A 0399; # GREEK CAPITAL LETTER ETA WITH PSILI AND VARIA AND PROSGEGRAMMENI
|
||||
1F9B; 1F93; 1F9B; 1F2B 0399; # GREEK CAPITAL LETTER ETA WITH DASIA AND VARIA AND PROSGEGRAMMENI
|
||||
1F9C; 1F94; 1F9C; 1F2C 0399; # GREEK CAPITAL LETTER ETA WITH PSILI AND OXIA AND PROSGEGRAMMENI
|
||||
1F9D; 1F95; 1F9D; 1F2D 0399; # GREEK CAPITAL LETTER ETA WITH DASIA AND OXIA AND PROSGEGRAMMENI
|
||||
1F9E; 1F96; 1F9E; 1F2E 0399; # GREEK CAPITAL LETTER ETA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI
|
||||
1F9F; 1F97; 1F9F; 1F2F 0399; # GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI
|
||||
1FA0; 1FA0; 1FA8; 1F68 0399; # GREEK SMALL LETTER OMEGA WITH PSILI AND YPOGEGRAMMENI
|
||||
1FA1; 1FA1; 1FA9; 1F69 0399; # GREEK SMALL LETTER OMEGA WITH DASIA AND YPOGEGRAMMENI
|
||||
1FA2; 1FA2; 1FAA; 1F6A 0399; # GREEK SMALL LETTER OMEGA WITH PSILI AND VARIA AND YPOGEGRAMMENI
|
||||
1FA3; 1FA3; 1FAB; 1F6B 0399; # GREEK SMALL LETTER OMEGA WITH DASIA AND VARIA AND YPOGEGRAMMENI
|
||||
1FA4; 1FA4; 1FAC; 1F6C 0399; # GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA AND YPOGEGRAMMENI
|
||||
1FA5; 1FA5; 1FAD; 1F6D 0399; # GREEK SMALL LETTER OMEGA WITH DASIA AND OXIA AND YPOGEGRAMMENI
|
||||
1FA6; 1FA6; 1FAE; 1F6E 0399; # GREEK SMALL LETTER OMEGA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI
|
||||
1FA7; 1FA7; 1FAF; 1F6F 0399; # GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
|
||||
1FA8; 1FA0; 1FA8; 1F68 0399; # GREEK CAPITAL LETTER OMEGA WITH PSILI AND PROSGEGRAMMENI
|
||||
1FA9; 1FA1; 1FA9; 1F69 0399; # GREEK CAPITAL LETTER OMEGA WITH DASIA AND PROSGEGRAMMENI
|
||||
1FAA; 1FA2; 1FAA; 1F6A 0399; # GREEK CAPITAL LETTER OMEGA WITH PSILI AND VARIA AND PROSGEGRAMMENI
|
||||
1FAB; 1FA3; 1FAB; 1F6B 0399; # GREEK CAPITAL LETTER OMEGA WITH DASIA AND VARIA AND PROSGEGRAMMENI
|
||||
1FAC; 1FA4; 1FAC; 1F6C 0399; # GREEK CAPITAL LETTER OMEGA WITH PSILI AND OXIA AND PROSGEGRAMMENI
|
||||
1FAD; 1FA5; 1FAD; 1F6D 0399; # GREEK CAPITAL LETTER OMEGA WITH DASIA AND OXIA AND PROSGEGRAMMENI
|
||||
1FAE; 1FA6; 1FAE; 1F6E 0399; # GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI
|
||||
1FAF; 1FA7; 1FAF; 1F6F 0399; # GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI
|
||||
1FB3; 1FB3; 1FBC; 0391 0399; # GREEK SMALL LETTER ALPHA WITH YPOGEGRAMMENI
|
||||
1FBC; 1FB3; 1FBC; 0391 0399; # GREEK CAPITAL LETTER ALPHA WITH PROSGEGRAMMENI
|
||||
1FC3; 1FC3; 1FCC; 0397 0399; # GREEK SMALL LETTER ETA WITH YPOGEGRAMMENI
|
||||
1FCC; 1FC3; 1FCC; 0397 0399; # GREEK CAPITAL LETTER ETA WITH PROSGEGRAMMENI
|
||||
1FF3; 1FF3; 1FFC; 03A9 0399; # GREEK SMALL LETTER OMEGA WITH YPOGEGRAMMENI
|
||||
1FFC; 1FF3; 1FFC; 03A9 0399; # GREEK CAPITAL LETTER OMEGA WITH PROSGEGRAMMENI
|
||||
|
||||
# Some characters with YPOGEGRAMMENI also have no corresponding titlecases
|
||||
|
||||
1FB2; 1FB2; 1FBA 0345; 1FBA 0399; # GREEK SMALL LETTER ALPHA WITH VARIA AND YPOGEGRAMMENI
|
||||
1FB4; 1FB4; 0386 0345; 0386 0399; # GREEK SMALL LETTER ALPHA WITH OXIA AND YPOGEGRAMMENI
|
||||
1FC2; 1FC2; 1FCA 0345; 1FCA 0399; # GREEK SMALL LETTER ETA WITH VARIA AND YPOGEGRAMMENI
|
||||
1FC4; 1FC4; 0389 0345; 0389 0399; # GREEK SMALL LETTER ETA WITH OXIA AND YPOGEGRAMMENI
|
||||
1FF2; 1FF2; 1FFA 0345; 1FFA 0399; # GREEK SMALL LETTER OMEGA WITH VARIA AND YPOGEGRAMMENI
|
||||
1FF4; 1FF4; 038F 0345; 038F 0399; # GREEK SMALL LETTER OMEGA WITH OXIA AND YPOGEGRAMMENI
|
||||
|
||||
1FB7; 1FB7; 0391 0342 0345; 0391 0342 0399; # GREEK SMALL LETTER ALPHA WITH PERISPOMENI AND YPOGEGRAMMENI
|
||||
1FC7; 1FC7; 0397 0342 0345; 0397 0342 0399; # GREEK SMALL LETTER ETA WITH PERISPOMENI AND YPOGEGRAMMENI
|
||||
1FF7; 1FF7; 03A9 0342 0345; 03A9 0342 0399; # GREEK SMALL LETTER OMEGA WITH PERISPOMENI AND YPOGEGRAMMENI
|
||||
|
||||
# ================================================================================
|
||||
# Conditional Mappings
|
||||
# The remainder of this file provides conditional casing data used to produce
|
||||
# full case mappings.
|
||||
# ================================================================================
|
||||
# Language-Insensitive Mappings
|
||||
# These are characters whose full case mappings do not depend on language, but do
|
||||
# depend on context (which characters come before or after). For more information
|
||||
# see the header of this file and the Unicode Standard.
|
||||
# ================================================================================
|
||||
|
||||
# Special case for final form of sigma
|
||||
|
||||
03A3; 03C2; 03A3; 03A3; Final_Sigma; # GREEK CAPITAL LETTER SIGMA
|
||||
|
||||
# Note: the following cases for non-final are already in the UnicodeData.txt file.
|
||||
|
||||
# 03A3; 03C3; 03A3; 03A3; # GREEK CAPITAL LETTER SIGMA
|
||||
# 03C3; 03C3; 03A3; 03A3; # GREEK SMALL LETTER SIGMA
|
||||
# 03C2; 03C2; 03A3; 03A3; # GREEK SMALL LETTER FINAL SIGMA
|
||||
|
||||
# Note: the following cases are not included, since they would case-fold in lowercasing
|
||||
|
||||
# 03C3; 03C2; 03A3; 03A3; Final_Sigma; # GREEK SMALL LETTER SIGMA
|
||||
# 03C2; 03C3; 03A3; 03A3; Not_Final_Sigma; # GREEK SMALL LETTER FINAL SIGMA
|
||||
|
||||
# ================================================================================
|
||||
# Language-Sensitive Mappings
|
||||
# These are characters whose full case mappings depend on language and perhaps also
|
||||
# context (which characters come before or after). For more information
|
||||
# see the header of this file and the Unicode Standard.
|
||||
# ================================================================================
|
||||
|
||||
# Lithuanian
|
||||
|
||||
# Lithuanian retains the dot in a lowercase i when followed by accents.
|
||||
|
||||
# Remove DOT ABOVE after "i" with upper or titlecase
|
||||
|
||||
0307; 0307; ; ; lt After_Soft_Dotted; # COMBINING DOT ABOVE
|
||||
|
||||
# Introduce an explicit dot above when lowercasing capital I's and J's
|
||||
# whenever there are more accents above.
|
||||
# (of the accents used in Lithuanian: grave, acute, tilde above, and ogonek)
|
||||
|
||||
0049; 0069 0307; 0049; 0049; lt More_Above; # LATIN CAPITAL LETTER I
|
||||
004A; 006A 0307; 004A; 004A; lt More_Above; # LATIN CAPITAL LETTER J
|
||||
012E; 012F 0307; 012E; 012E; lt More_Above; # LATIN CAPITAL LETTER I WITH OGONEK
|
||||
00CC; 0069 0307 0300; 00CC; 00CC; lt; # LATIN CAPITAL LETTER I WITH GRAVE
|
||||
00CD; 0069 0307 0301; 00CD; 00CD; lt; # LATIN CAPITAL LETTER I WITH ACUTE
|
||||
0128; 0069 0307 0303; 0128; 0128; lt; # LATIN CAPITAL LETTER I WITH TILDE
|
||||
|
||||
# ================================================================================
|
||||
|
||||
# Turkish and Azeri
|
||||
|
||||
# I and i-dotless; I-dot and i are case pairs in Turkish and Azeri
|
||||
# The following rules handle those cases.
|
||||
|
||||
0130; 0069; 0130; 0130; tr; # LATIN CAPITAL LETTER I WITH DOT ABOVE
|
||||
0130; 0069; 0130; 0130; az; # LATIN CAPITAL LETTER I WITH DOT ABOVE
|
||||
|
||||
# When lowercasing, remove dot_above in the sequence I + dot_above, which will turn into i.
|
||||
# This matches the behavior of the canonically equivalent I-dot_above
|
||||
|
||||
0307; ; 0307; 0307; tr After_I; # COMBINING DOT ABOVE
|
||||
0307; ; 0307; 0307; az After_I; # COMBINING DOT ABOVE
|
||||
|
||||
# When lowercasing, unless an I is before a dot_above, it turns into a dotless i.
|
||||
|
||||
0049; 0131; 0049; 0049; tr Not_Before_Dot; # LATIN CAPITAL LETTER I
|
||||
0049; 0131; 0049; 0049; az Not_Before_Dot; # LATIN CAPITAL LETTER I
|
||||
|
||||
# When uppercasing, i turns into a dotted capital I
|
||||
|
||||
0069; 0069; 0130; 0130; tr; # LATIN SMALL LETTER I
|
||||
0069; 0069; 0130; 0130; az; # LATIN SMALL LETTER I
|
||||
|
||||
# Note: the following case is already in the UnicodeData.txt file.
|
||||
|
||||
# 0131; 0131; 0049; 0049; tr; # LATIN SMALL LETTER DOTLESS I
|
||||
|
||||
# EOF
|
||||
|
34924
util/unicode/data/UnicodeData.txt
Normal file
34924
util/unicode/data/UnicodeData.txt
Normal file
File diff suppressed because it is too large
Load Diff
1468
util/unicode/data/WordBreakProperty.txt
Normal file
1468
util/unicode/data/WordBreakProperty.txt
Normal file
File diff suppressed because it is too large
Load Diff
1320
util/unicode/data/emoji-data.txt
Normal file
1320
util/unicode/data/emoji-data.txt
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user