Category:Articles with short descriptionCategory:Short description is different from Wikidata

Category:Wikipedia pages semi-protected from banned users#Unicode%20subscripts%20and%20superscripts

Category:Use mdy dates from January 2025 Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals.[1] These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

The World Wide Web Consortium and the Unicode Consortium have made recommendations on the choice between using markup and using superscript and subscript characters:

When used in mathematical context (MathML) it is recommended to consistently use style markup for superscripts and subscripts […] However, when super and sub-scripts are to reflect semantic distinctions, it is easier to work with these meanings encoded in text rather than markup, for example, in phonetic or phonemic transcription.[2]

Uses

The difference between superscript/subscript and numerator/denominator glyphs. In many popular computer fonts the Unicode "superscript" and "subscript" characters are actually numerator and denominator glyphs.

The intended use[2] when these characters were added to Unicode was to produce true superscripts and subscripts so that chemical and algebraic formulas could be written without markup. Thus "H₂O" (using a subscript 2 character) is supposed to be identical to "H2O" (with subscript markup).

In reality, many fonts that include these characters ignore the Unicode definition, and instead design the digits for mathematical numerator and denominator glyphs,[3][4] which are aligned with the cap line and the baseline, respectively. When used with the solidus or the Fraction Slash, they produce an almost typographically correct diagonal fraction, such as ³/₄ for the ¾ glyph. Super and subscript markup does not produce a correct fraction (compare markup 3/4 with precomposed ¾). The change also makes the superscript letters useful for ordinal indicators, more closely matching the ª and º characters.

Unicode intended that diagonal fractions be rendered by a different mechanism: the fraction slash U+2044 is visually similar to the solidus, but when used with the ordinary digits (not the superscripts and subscripts), it instructs the layout system that a fraction such as ¾ is to be rendered using automatic glyph substitution.[5][a] User-end support was quite poor for a number of years, but fonts,[b] browsers,[c] word processors,[d] desktop publishing software[e] and others increasingly support the intended Unicode behavior. This browser and your default font render it as 34. (See Slash (punctuation)#Fractions for rendering in various other fonts.)

Superscripts and subscripts block

The most common superscript digits (1, 2, and 3) were included in ISO-8859-1 and were therefore carried over into those code points in the Latin-1 range of Unicode. The remainder were placed along with basic arithmetical symbols, and later some Latin subscripts, in a dedicated block at U+2070 to U+209F. The table below shows these characters together. Each superscript or subscript character is preceded by a baseline x to show the height of subscripting/superscripting.

Six code points in the "Superscripts and Subscripts" block are unassigned, and remain available for future characters. As of November 2024,Category:Articles containing potentially dated statements from November 2024Category:All articles containing potentially dated statements three of these (209D, 209E, and 209F) were provisionally assigned to new subscript characters, namely Latin lowercase w, y, and z.[6][7]

Unicode characters
0123456789ABCDEF
U+00Bx
U+207x x⁰xⁱ x⁴x⁵x⁶x⁷x⁸x⁹x⁺x⁻x⁼x⁽x⁾xⁿ
U+208x x₀x₁x₂x₃x₄x₅x₆x₇x₈x₉x₊x₋x₌x₍x₎
U+209x xₐxₑxₒxₓxₔ xₕxₖxₗxₘ xₙxₚxₛxₜ
  Not yet assigned.
  Other characters from Latin-1 not related to super- or sub-scripts.

Other superscript and subscript characters

Unicode also includes codepoints for subscript and superscript characters that are intended for semantic usage, in the following blocks:[1][8]

Superscript
  • The Latin-1 Supplement block contains the feminine and masculine ordinal indicators ª and º.
  • The Latin Extended-C block contains one superscript, ⱽ.
  • The Latin Extended-D block contains six superscripts: ꝰ ꟲ ꟳ ꟴ ꟸ ꟹ.
  • The Latin Extended-E block contains five superscripts: ꭜ ꭝ ꭞ ꭟ ꭩ.
  • The Latin Extended-F block is entirely superscript IPA letters: 𐞁 𐞂 𐞃 𐞄 𐞅 𐞇 𐞈 𐞉 𐞊 𐞋 𐞌 𐞍 𐞎 𐞏 𐞐 𐞑 𐞒 𐞓 𐞔 𐞕 𐞖 𐞗 𐞘 𐞙 𐞚 𐞛 𐞜 𐞝 𐞞 𐞟 𐞠 𐞡 𐞢 𐞣 𐞤 𐞥 𐞦 𐞧 𐞨 𐞩 𐞪 𐞫 𐞬 𐞭 𐞮 𐞯 𐞰 𐞲 𐞳 𐞴 𐞵 𐞶 𐞷 𐞸 𐞹 𐞺.
  • The Spacing Modifier Letters block has superscripted letters and symbols used for phonetic transcription: ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ˀ ˁ ˠ ˡ ˢ ˣ ˤ.
  • The Phonetic Extensions block has several superscripted letters and symbols: Latin/IPA ᴬ ᴭ ᴮ ᴯ ᴰ ᴱ ᴲ ᴳ ᴴ ᴵ ᴶ ᴷ ᴸ ᴹ ᴺ ᴻ ᴼ ᴽ ᴾ ᴿ ᵀ ᵁ ᵂ ᵃ ᵄ ᵅ ᵆ ᵇ ᵈ ᵉ ᵊ ᵋ ᵌ ᵍ ᵏ ᵐ ᵑ ᵒ ᵓ ᵖ ᵗ ᵘ ᵚ ᵛ, Greek ᵝ ᵞ ᵟ ᵠ ᵡ, Cyrillic ᵸ, other ᵎ ᵔ ᵕ ᵙ ᵜ. These are intended to indicate secondary articulation.
  • The Phonetic Extensions Supplement block has several more: Latin/IPA ᶛ ᶜ ᶝ ᶞ ᶟ ᶠ ᶡ ᶢ ᶣ ᶤ ᶥ ᶦ ᶧ ᶨ ᶩ ᶪ ᶫ ᶬ ᶭ ᶮ ᶯ ᶰ ᶱ ᶲ ᶳ ᶴ ᶵ ᶶ ᶷ ᶸ ᶹ ᶺ ᶻ ᶼ ᶽ ᶾ, Greek ᶿ.
  • The Cyrillic Extended-B block contains two Cyrillic superscripts: ꚜ ꚝ.
  • The Cyrillic Extended-D block contains many Cyrillic superscripts: 𞀰 𞀱 𞀲 𞀳 𞀷 𞀵 𞀶 𞀷 𞀸 𞀹 𞀺 𞀻 𞀼 𞀽 𞀾 𞀿 𞁀 𞁁 𞁂 𞁃 𞁄 𞁅 𞁆 𞁇 𞁈 𞁉 𞁊 𞁋 𞁌 𞁍 𞁎 𞁏 𞁐 𞁫 𞁬 𞁭.
  • The Georgian block contains one superscripted Mkhedruli letter: ჼ.
  • The Kanbun block has superscripted annotation characters used in Japanese copies of Classical Chinese texts: ㆒ ㆓ ㆔ ㆕ ㆖ ㆗ ㆘ ㆙ ㆚ ㆛ ㆜ ㆝ ㆞ ㆟.
  • The Tifinagh block has one superscript letter : ⵯ.
  • The Unified Canadian Aboriginal Syllabics and its Extended blocks contain several mostly consonant-only letters to indicate syllable coda called Finals, along with some characters that indicate syllable medial known as Medials: Main block ᐜ ᐝ ᐞ ᐟ ᐠ ᐡ ᐢ ᐣ ᐤ ᐥ ᐦ ᐧ ᐨ ᐩ ᐪ ᑉ ᑊ ᑋ ᒃ ᒄ ᒡ ᒢ ᒻ ᒼ ᒽ ᒾ ᓐ ᓑ ᓒ ᓪ ᓫ ᔅ ᔆ ᔇ ᔈ ᔉ ᔊ ᔋ ᔥ ᔾ ᔿ ᕀ ᕁ ᕐ ᕑ ᕝ ᕪ ᕻ ᕯ ᕽ ᖅ ᖕ ᖖ ᖟ ᖦ ᖮ ᗮ ᘁ ᙆ ᙇ ᙚ ᙾ ᙿ; Extended block: ᣔ ᣕ ᣖ ᣗ ᣘ ᣙ ᣚ ᣛ ᣜ ᣝ ᣞ ᣟ ᣳ ᣴ ᣵ.
Combining superscript
  • The Combining Diacritical Marks block contains medieval superscript letter diacritics. These letters are written directly above other letters appearing in medieval Germanic manuscripts, and so these glyphs do not include spacing, for example uͤ. They are shown here over the dotted circle placeholder ◌: ◌ͣ ◌ͤ ◌ͥ ◌ͦ ◌ͧ ◌ͨ ◌ͩ ◌ͪ ◌ͫ ◌ͬ ◌ͭ ◌ͮ ◌ͯ.
  • The Combining Diacritical Marks Extended block contains three combining insular letters for the Middle English Ormulum, ◌ᫌ ◌ᫍ ◌ᫎ.[9]
  • The Combining Diacritical Marks Supplement block contains additional medieval superscript letter diacritics, enough to complete the basic lowercase Latin alphabet except for j, q and y, a few small capitals and ligatures (ae, ao, av), and additional letters: ◌᷒ ◌ᷓ ◌ᷔ ◌ᷕ ◌ᷖ ◌ᷗ ◌ᷘ ◌ᷙ ◌ᷚ ◌ᷛ ◌ᷜ ◌ᷝ ◌ᷞ ◌ᷟ ◌ᷠ ◌ᷡ ◌ᷢ ◌ᷣ ◌ᷤ ◌ᷥ ◌ᷦ ◌ᷧ ◌ᷨ ◌ᷪ ◌ᷫ ◌ᷬ ◌ᷭ ◌ᷮ ◌ᷯ ◌ᷰ ◌ᷱ ◌ᷲ ◌ᷳ ◌ᷴ, Greek ◌ᷩ.
  • The Cyrillic Extended-A and -B blocks contains multiple medieval superscript letter diacritics, enough to complete the basic lowercase Cyrillic alphabet used in Church Slavonic texts, also includes an additional ligature (ст): ◌ⷠ ◌ⷡ ◌ⷢ ◌ⷣ ◌ⷤ ◌ⷥ ◌ⷦ ◌ⷧ ◌ⷨ ◌ⷩ ◌ⷪ ◌ⷫ ◌ⷬ ◌ⷭ ◌ⷮ ◌ⷯ ◌ⷰ ◌ⷱ ◌ⷲ ◌ⷳ ◌ⷴ ◌ⷵ ◌ⷶ ◌ⷷ ◌ⷸ ◌ⷹ ◌ⷺ ◌ⷻ ◌ⷼ ◌ⷽ ◌ⷾ ◌ⷿ ◌ꙴ ◌ꙵ ◌ꙶ ◌ꙷ ◌ꙸ ◌ꙹ ◌ꙺ ◌ꙻ ◌ꚞ ◌ꚟ.
  • The Cyrillic Extended-D block has one additional combining character, that being і: ◌𞂏.
Subscript
  • The Latin Extended-C block contains one subscript, ⱼ.
  • The Phonetic Extensions block has several subscripted letters and symbols: Latin/IPA ᵢ ᵣ ᵤ ᵥ and Greek ᵦ ᵧ ᵨ ᵩ ᵪ.
  • The Cyrillic Extended-D block also contains many Cyrillic subscripts: 𞁑 𞁒 𞁓 𞁔 𞁕 𞁖 𞁗 𞁘 𞁙 𞁚 𞁛 𞁜 𞁝 𞁞 𞁟 𞁠 𞁡 𞁢 𞁣 𞁤 𞁥 𞁦 𞁧 𞁨 𞁩 𞁪.
Combining subscript

Latin, Greek, Cyrillic, and IPA tables

A superscript small-cap W may be distinct from a superscript lowercase w in italic typeface, as in this phonetic notation.

Consolidated, the Unicode standard contains superscript and subscript versions of a subset of Latin, Greek and Cyrillic letters. Here they are arranged in alphabetical order for comparison (or for copy and paste convenience). Since these characters appear in different Unicode ranges, they may not appear to be the same size or position due to font substitution by the browser. Shaded cells mark petite capitals that are not very distinct from minuscules in roman typeface, but they may be distinct in italic typeface, as is used in some phonetic notation.

Little punctuation is encoded. Parentheses are shown in the basic superscript block above, and the exclamation mark Category:Pages with plain IPA is shown in the IPA table below. In a supporting font, a question mark may be created with a superscript gelded question mark and a combining dot below: ˀ̣Category:Pages with plain IPA.

Latin superscript and subscript letters
ABCDEFGHIJKLMNOPQRSTUVWXYZ
Superscript capital ᴿ*
Superscript small capital *𐞄 **𐞒𐞖 *𐞪 𐞲
Superscript minuscule ʰʲˡ𐞥ʳˢʷˣʸ
Overscript small capital ◌ᷛ◌ᷞ◌ᷟ◌ᷡ ◌ᷢ
Overscript minuscule ◌ͣ◌ᷨ◌ͨ◌ͩ◌ͤ◌ᷫ◌ᷚ◌ͪ◌ͥ◌ᷜ◌ᷝ◌ͫ◌ᷠ◌ͦ◌ᷮ◌ͬ◌ᷤ◌ͭ◌ͧ◌ͮ◌ᷱ◌ͯ◌ᷦ
Subscript minuscule ***
Underscript minuscule ◌᷊◌ᪿ

*Superscript versions of S, of petite capital A, D, E and P, of ƀ, and subscript versions of w, y and z have been accepted for a future version of the Unicode Standard.[6]

Additional Latin characters
ÆɃƎƏŊ
Superscript capital
Superscript minuscule 𐞃*
Overscript minuscule ◌ᷔ◌ᷪ ◌ᷬ
Subscript minuscule

Some of these superscript capitals are small caps in the source documents in the Unicode proposals.

Shaded cells Greek letters that are indistinguishable from Latin, and so would not be expected to be supported by Unicode.

Greek superscript and subscript letters
ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
Superscript minuscule [A]ᶿ [A]**
Overscript minuscule [A]◌ᷩ
Subscript minuscule ͺ[f]
Underscript minuscule ◌ͅ ◌̫[g]
  1. 1 2 3 In some fonts, Latin alpha ᵅ and upsilon ᶹ can be used as superscript Greek alpha and upsilon. ᵋ and ᶥ are also officially Latin letters, but display the same as Greek.

*Superscript versions of Greek psi and omega have been accepted for a future version of the Unicode Standard.[6]

Russian superscript and subscript letters
АБВГДЕЖЗИКЛМНОП РСТУФХЦЧШЩЪЫЬЭЮЯ
Superscript𞀰𞀱𞀲𞀳𞀴𞀵𞀶𞀷𞀸𞀹𞀺𞀻𞀼𞀽 𞀾𞀿𞁀𞁁𞁂𞁃𞁄𞁅𞁆𞁇𞁈𞁉
Overscript◌ⷶ◌ⷠ◌ⷡ◌ⷢ◌ⷣ◌ⷷ◌ⷤ◌ⷥ◌ꙵ◌ⷦ◌ⷧ◌ⷨ◌ⷩ◌ⷪ◌ⷫ ◌ⷬ◌ⷭ◌ⷮ◌ꙷ◌ꚞ◌ⷯ◌ⷰ◌ⷱ◌ⷲ◌ⷳ◌ꙸ◌ꙹ◌ꙺ◌ⷻ
Subscript𞁑𞁒𞁓𞁔𞁕𞁖𞁗𞁘𞁙𞁚𞁛𞁜𞁝 𞁞𞁟𞁠𞁡𞁢𞁣𞁤𞁥𞁦
Additional modern Cyrillic characters
ӘҐЄЅІЇЈӨҪҮҰЏӀ
Superscript𞁋𞁊𞁌𞁌̈𞁍𞁎𞁫𞁏𞁭𞁐
Overscript◌ꙴ𞂏◌ꙶ
Subscript𞁧𞁩𞁨𞁨̈𞁪
Additional medieval Cyrillic characters
ѠѢѤѦѪѬѲ
Overscript◌ⷹ◌ꙻ◌ⷺ◌ⷼ◌ꚟ◌ⷽ◌ⷾ◌ⷿ◌ⷴ
Superscript 𞁬

Superscript and subscript ё, ї, й, ў etc. are handled with diacritics, 𞀵̈ 𞁌̈ 𞀸̆ 𞁁̆ etc. Many of the Cyrillic characters were added to the Cyrillic Extended-D block, which was added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023.

See also small caps in Unicode.

Superscript IPA

The Latin Extended-F block was created for the remaining superscript IPA letters. They are supported by the free Gentium Plus and Andika fonts. Additional superscript characters for historical and para-IPA letters have been accepted for future versions of the Unicode Standard.[6]

Consonant letters

The Unicode characters for superscript (modifier) IPA and extIPA consonant letters are as follows. The entire Latin Extended-F block is dedicated to superscript IPA. Characters for sounds with secondary articulation are set off in parentheses and placed below the base letters.

IPA and extIPA consonants, along with superscript variants and their Unicode code points
Bi­labial Labio­dental Dental Alveolar Post­alveolar Retro­flex Palatal Velar Uvular Pharyn­geal Glottal
Nasal m Category:Pages with plain IPA
1D50
ɱ Category:Pages with plain IPA
1DAC
n Category:Pages with plain IPA
207F
(Category:Pages with plain IPA)
 
 
(ȵCategory:Pages with plain IPA)
ɳ Category:Pages with plain IPA
1DAF
ɲ Category:Pages with plain IPA
1DAE
ŋ Category:Pages with plain IPA
1D51
ɴ Category:Pages with plain IPA
1DB0
Plosive p Category:Pages with plain IPA
1D56
b Category:Pages with plain IPA
1D47
t Category:Pages with plain IPA
1D57
(ƫ Category:Pages with plain IPA)
1DB5
d Category:Pages with plain IPA
1D48
(Category:Pages with plain IPA)
 
 
(ȶCategory:Pages with plain IPA)
 
 
(ȡCategory:Pages with plain IPA)
ʈ 𐞯Category:Pages with plain IPA
107AF
ɖ 𐞋Category:Pages with plain IPA
1078B
c Category:Pages with plain IPA
1D9C
ɟ Category:Pages with plain IPA
1DA1
k Category:Pages with plain IPA
1D4F
ɡ Category:Pages with plain IPA/g Category:Pages with plain IPA
1DA2/1D4D
q 𐞥Category:Pages with plain IPA
107A5
ɢ 𐞒Category:Pages with plain IPA
10792
ʡ 𐞳Category:Pages with plain IPA
107B3
ʔ ˀCategory:Pages with plain IPA
02C0
Affricate ʦ 𐞬Category:Pages with plain IPA
107AC
ʣ 𐞇Category:Pages with plain IPA
10787
ʧ 𐞮Category:Pages with plain IPA
107AE
(ʨ 𐞫Category:Pages with plain IPA)
107AB
ʤ 𐞊Category:Pages with plain IPA
1078A
(ʥ 𐞉Category:Pages with plain IPA)
10789
 𐞭Category:Pages with plain IPA
107AD
(𝼜Category:Pages with plain IPA)
 𐞈Category:Pages with plain IPA
10788
(𝼙Category:Pages with plain IPA)
Fricative ɸ Category:Pages with plain IPA
1DB2
β Category:Pages with plain IPA
1D5D
f Category:Pages with plain IPA
1DA0
v Category:Pages with plain IPA
1D5B
θ ᶿCategory:Pages with plain IPA
1DBF
ð Category:Pages with plain IPA
1D9E
s ˢCategory:Pages with plain IPA
02E2
(Category:Pages with plain IPA)
z Category:Pages with plain IPA
1DBB
(Category:Pages with plain IPA)
ʃ Category:Pages with plain IPA
1DB4
(ɕ Category:Pages with plain IPA)
1D9D
ʒ Category:Pages with plain IPA
1DBE
(ʑ Category:Pages with plain IPA)
1DBD
ʂ Category:Pages with plain IPA
1DB3
(Category:Pages with plain IPA)
ʐ Category:Pages with plain IPA
1DBC
(Category:Pages with plain IPA)
ç ᶜ̧Category:Pages with plain IPA
1D9C + 0327[h]
ʝ Category:Pages with plain IPA
1DA8
x ˣCategory:Pages with plain IPA
02E3
(ɧ 𐞗Category:Pages with plain IPA)
10797
ɣ ˠCategory:Pages with plain IPA
02E0
χ Category:Pages with plain IPA
1D61
ʁ ʶCategory:Pages with plain IPA
02B6
ħ 𐞕Category:Pages with plain IPA
10795
(ʩ 𐞐Category:Pages with plain IPA)
10790
ʕ ˤCategory:Pages with plain IPA
02E4[i]
h ʰCategory:Pages with plain IPA
02B0
(Category:Pages with plain IPA)
ɦ ʱCategory:Pages with plain IPA
02B1
Approximant ʋ Category:Pages with plain IPA
1DB9
ɹ ʴCategory:Pages with plain IPA
02B4
ɻ ʵCategory:Pages with plain IPA
02B5
j ʲCategory:Pages with plain IPA
02B2
(ɥ Category:Pages with plain IPA)
1DA3
 
 
(ʍ Category:Pages with plain IPA)
AB69
ɰ Category:Pages with plain IPA
1DAD
(w ʷCategory:Pages with plain IPA)
02B7
Tap/flap  𐞰Category:Pages with plain IPA
107B0
ɾ 𐞩Category:Pages with plain IPA
107A9
ɽ 𐞨Category:Pages with plain IPA
107A8
Trill ʙ 𐞄Category:Pages with plain IPA
10784
r ʳCategory:Pages with plain IPA
02B3
ʀ 𐞪Category:Pages with plain IPA
107AA
ʜ 𐞖Category:Pages with plain IPA
10796
ʢ 𐞴Category:Pages with plain IPA
107B4
Lateral fricative ɬ 𐞛Category:Pages with plain IPA
1079B
(ʪ 𐞙Category:Pages with plain IPA)
10799
ɮ 𐞞Category:Pages with plain IPA
1079E
(ʫ 𐞚Category:Pages with plain IPA)
1079A
 𐞝Category:Pages with plain IPA
1079D
𝼅 𐞟Category:Pages with plain IPA
1079F
𝼆 𐞡Category:Pages with plain IPA
107A1
𝼄 𐞜Category:Pages with plain IPA
1079C
Lateral approximant l ˡCategory:Pages with plain IPA
02E1
( Category:Pages with plain IPA)
1DAA
 
 
(ȴCategory:Pages with plain IPA)
ɭ Category:Pages with plain IPA
1DA9
ʎ 𐞠Category:Pages with plain IPA
107A0
ʟ Category:Pages with plain IPA
1DAB
(ɫ Category:Pages with plain IPA)[j]
AB5E
Lateral tap/flap ɺ 𐞦Category:Pages with plain IPA
107A6
𝼈 𐞧Category:Pages with plain IPA
107A7
Implosive ƥCategory:Pages with plain IPA ɓ 𐞅Category:Pages with plain IPA
10785
ƭCategory:Pages with plain IPA ɗ 𐞌Category:Pages with plain IPA
1078C
𝼉Category:Pages with plain IPA  𐞍Category:Pages with plain IPA
1078D
ƈCategory:Pages with plain IPA ʄ 𐞘Category:Pages with plain IPA
10798
ƙCategory:Pages with plain IPA ɠ 𐞓Category:Pages with plain IPA
10793
ʠCategory:Pages with plain IPA ʛ 𐞔Category:Pages with plain IPA
10794
Click release ʘ 𐞵Category:Pages with plain IPA
107B5
ɋCategory:Pages with plain IPA ǀ 𐞶Category:Pages with plain IPA
107B6
ʇCategory:Pages with plain IPA ǃ Category:Pages with plain IPA
A71D
ʗCategory:Pages with plain IPA 𝼊 𐞹Category:Pages with plain IPA
107B9
ψCategory:Pages with plain IPA ǂ 𐞸Category:Pages with plain IPA
107B8
𝼋Category:Pages with plain IPA (ʞCategory:Pages with plain IPA)
Lateral click
release
ǁ 𐞷Category:Pages with plain IPA
107B7
ʖCategory:Pages with plain IPA
Percussive ¡ Category:Pages with plain IPA
A71E[k]

The spacing diacritic for ejective consonants, U+2BC, works with superscript letters despite not being superscript itself: ᵖʼ ᵗʼ ᶜʼ ᵏˣʼCategory:Pages with plain IPA. If a distinction needs to be made, the combining apostrophe U+315 may be used: ̕ ̕ ̕ ᵏˣ̕Category:Pages with plain IPA. The spacing diacritic should be used for a baseline letter with a superscript release, such as [tˢʼ]Category:Pages with plain IPA or [kˣʼ]Category:Pages with plain IPA, where the scope of the apostrophe includes the non-superscript letter, but the combining apostrophe U+315 might be used to indicate a weakly articulated ejective consonant like [ᵗ̕]Category:Pages with plain IPA or [ᵏ̕]Category:Pages with plain IPA, where the whole consonant is written as a superscript, or together with U+2BC when separate apostrophes have scope over the base and modifier letters, as in pʼᵏˣ̕Category:Pages with plain IPA.[10]

Spacing diacritics, as in Category:Pages with plain IPA, cannot be secondarily superscripted in plain text: ᵗʲCategory:Pages with plain IPA. (In this instance, the old IPA letter for [tʲ]Category:Pages with plain IPA, ƫCategory:Pages with plain IPA, has a superscript variant in Unicode, U+1DB5 Category:Pages with plain IPA, but that is not generally the case.)

Among older letters, the most common letters with palatal hook are supported; they are displayed in the table above. IPA once had an idiosyncratic curl on some of the palatalized letters: these are the fricative letters ʆ ʓCategory:Pages with plain IPA. Their superscript forms have been accepted for a future version of the Unicode Standard. Old-style click letters and the retired letters ƞCategory:Pages with plain IPA and ɼCategory:Pages with plain IPA have also been accepted for a future version of the Unicode Standard.[6] The Teuthonista letter Category:Pages with plain IPA (U+A727) is an old graphic variant of ɮCategory:Pages with plain IPA. Its superscript is supported at Category:Pages with plain IPA (U+AB5C).

Among para-IPA letters, superscript variants of Sinological ȡ ȴ ȵ ȶCategory:Pages with plain IPA, of the Bantuist labio-dental plosives ȹCategory:Pages with plain IPA and ȸCategory:Pages with plain IPA, and of central semivowels ɉCategory:Pages with plain IPA, ɥ̶Category:Pages with plain IPA, and Category:Pages with plain IPA have been accepted for a future version of the Unicode Standard.[6]

Vowel letters

The Unicode characters for superscript (modifier) IPA vowel letters, plus a pair of extended letters ᵿCategory:Pages with plain IPA found in English dictionaries, are as follows. Recently retired alternative letters such as ɩ ɷCategory:Pages with plain IPA are also supported; they are set off in parentheses and placed below the standard IPA letters:

IPA vowels and superscript variants
Front Central Back
Close i Category:Pages with plain IPA
2071
y ʸCategory:Pages with plain IPA
02B8
ɨ Category:Pages with plain IPA
1DA4
ʉ Category:Pages with plain IPA
1DB6
ɯ Category:Pages with plain IPA
1D5A
u Category:Pages with plain IPA
1D58
Near-close ɪ Category:Pages with plain IPA
1DA6
(ɩ Category:Pages with plain IPA)
1DA5
ʏ 𐞲Category:Pages with plain IPA
107B2




( Category:Pages with plain IPA)
1DA7


(ᵿCategory:Pages with plain IPA)



(ωCategory:Pages with plain IPA)

ʊ Category:Pages with plain IPA
1DB7
(ɷ 𐞤Category:Pages with plain IPA)
107A4
Close-mid e Category:Pages with plain IPA
1D49
ø 𐞢Category:Pages with plain IPA
107A2
ɘ 𐞎Category:Pages with plain IPA
1078E
ɵ Category:Pages with plain IPA
1DB1
ɤ 𐞑Category:Pages with plain IPA
10791
o Category:Pages with plain IPA
1D52
Mid ə Category:Pages with plain IPA
1D4A
Open-mid ɛ Category:Pages with plain IPA
1D4B
œ Category:Pages with plain IPA
A7F9
ɜ Category:Pages with plain IPA
1D9F
( Category:Pages with plain IPA)
1D4C
ɞ 𐞏Category:Pages with plain IPA
1078F
ʌ Category:Pages with plain IPA
1DBA
ɔ Category:Pages with plain IPA
1D53
Near-open æ 𐞃Category:Pages with plain IPA
10783
ɶ 𐞣Category:Pages with plain IPA
107A3
ɐ Category:Pages with plain IPA
1D44
ɑ Category:Pages with plain IPA
1D45
ɒ Category:Pages with plain IPA
1D9B
Open a Category:Pages with plain IPA
1D43

The precomposed Unicode rhotic vowel letters ɚ ɝCategory:Pages with plain IPA are not directly supported. The rhotic diacritic U+02DE ◌˞Category:Pages with plain IPA should be used instead: ᵊ˞ ᶟ˞Category:Pages with plain IPA.[11]

ɜCategory:Pages with plain IPA and Category:Pages with plain IPA are reversed ɛCategory:Pages with plain IPA. The older IPA turned ɛCategory:Pages with plain IPA, Category:Pages with plain IPA, is also supported, at U+1D4C Category:Pages with plain IPA. However, the briefly resurrected vowel letter ʚCategory:Pages with plain IPA (U+029A) is not supported, only its reversed replacement ɞCategory:Pages with plain IPA is.

Among older letters, Category:Pages with plain IPA (U+1D1C), a graphic variant of ʊCategory:Pages with plain IPA, is supported at Category:Pages with plain IPA (U+1DB8).[12]

Among para-IPA letters, Sinological superscript ɿ ʅ ʮ ʯCategory:Pages with plain IPA and Category:Pages with plain IPA have been accepted for a future version of the Unicode Standard.[6]

Length marks

The two length marks are also supported:

Length marks
Long Half-long
ː 𐞁Category:Pages with plain IPA
10781
ˑ 𐞂Category:Pages with plain IPA
10782

These are used to add length to another superscript, such as Cʰ𐞁Category:Pages with plain IPA or Cʰ𐞂Category:Pages with plain IPA for long aspiration.

Wildcards

Superscript wildcards (full caps) are largely supported: e.g. ᴺCCategory:Pages with plain IPA (prenasalized consonant), ꟲNCategory:Pages with plain IPA (prestopped nasal), PꟳCategory:Pages with plain IPA (fricative release), NᴾFCategory:Pages with plain IPA (epenthetic plosive), CVNᵀCategory:Pages with plain IPA (tone-bearing syllable), CᴸCategory:Pages with plain IPA (liquid or lateral release), CᴿCategory:Pages with plain IPA (rhotic or resonant release), VᴳCategory:Pages with plain IPA (off-glide/diphthong), CⱽCategory:Pages with plain IPA (fleeting vowel). Superscript SCategory:Pages with plain IPA for sibilant release has been accepted for a future version of the Unicode Standard;[6] superscript Category:Pages with plain IPA for fleeting/epenthetic click has not. Other basic Latin superscript wildcards for tone and weak indeterminate sounds, as described in the article on the International Phonetic Alphabet, are mostly supported. (See table in previous section.)

Combining marks and subscripts

In addition, a very few IPA letters beyond the basic Latin alphabet have combining forms or are supported as subscripts:

Additional IPA characters
äɑæ βçðəʃʍ χʔʼ
Overscript ◌ᷲ◌ᷧ◌ᷔ ◌ᷩ◌ᷗ◌ᷙ◌ᷪ◌ᷯ ◌̉[l]◌̓
Subscript
Underscript ◌ᫀ ◌̦

Composite characters

Primarily for compatibility with earlier character sets, Unicode contains a number of characters that compose super- and subscripts with other symbols.[1] In most fonts these render much better than attempts to construct these symbols from the above characters or by using markup.

Notes

  1. For a general overview and technical information on glyph substitution (though not specifically for fractions), see GSUB — Glyph Substitution Table in the OpenType specification on the Microsoft Typography site.
  2. Such as Andika, Arno Pro, Brill, Brioso Pro, Calibri, Candara, Carlito, Cantarell, FiraGO, EB Garamond, Gentium Book, Lato, Linux Libertine, Noto Sans, Noto Serif, Open Sans and Yrsa
  3. Such as Chrome, Firefox and Falkon
  4. Such as LibreOffice Writer
  5. Such as Adobe InDesign and Scribus
  6. ͺCategory:Pages with plain IPA is set lower than a normal subscript. It is equivalent to underscript ◌ͅCategory:Pages with plain IPA on a space.
  7. ◌̫Category:Pages with plain IPA is traditionally typeset as an omega.
  8. Superscript çCategory:Pages with plain IPA is composed of superscript cCategory:Pages with plain IPA and a combining cedilla, which should display properly in a good font. Superscript c was specifically requested for this purpose in Unicode proposal L2/03-180.
  9. U+02E4 ˤ MODIFIER LETTER SMALL REVERSED GLOTTAL STOP is the superscript variant of U+0295 ʕ LATIN LETTER PHARYNGEAL VOICED FRICATIVE and is defined for IPA use. The similar character U+02C1 ˁ MODIFIER LETTER REVERSED GLOTTAL STOP is a reversed U+02C0 ˀ MODIFIER LETTER GLOTTAL STOP, perhaps a gelded reversed question mark. Fonts are inconsistent in whether they look different and what the difference is.
  10. In Microsoft fonts, superscript ɫCategory:Pages with plain IPA was erroneously designed as a superscript Category:Pages with plain IPA.
  11. U+A71D Category:Pages with plain IPA and A71E Category:Pages with plain IPA were adopted as the Africanist equivalents of the IPA characters Category:Pages with plain IPA downstep and Category:Pages with plain IPA upstep. The correspondence of U+A71D Category:Pages with plain IPA to the IPA click letter ǃCategory:Pages with plain IPA is thus accidental. Coincidentally, U+A71E Category:Pages with plain IPA serves as the superscript variant of the extIPA percussive consonant ¡Category:Pages with plain IPA; the other percussive letters, ʬCategory:Pages with plain IPA and ʭCategory:Pages with plain IPA, do not have superscript support in Unicode.
  12. This is actually the Vietnamese diacritic dấu hỏi, not specifically IPA, but graphically both are gelded question marks.

References

  1. 1 2 3 "UCD: UnicodeData.txt". The Unicode Standard. Retrieved May 14, 2016.
  2. 1 2 Martin Dürst, Asmus Freytag (May 16, 2007). "Unicode in XML and other Markup Languages". W3C. Retrieved September 13, 2010.
  3. "fraction | Dart Package". Dart packages. December 27, 2021. Retrieved September 21, 2022.
  4. "MathML | General layout elements | Fractions". data2type GmbH (in German). March 30, 2021. Archived from the original on January 28, 2021. Retrieved January 13, 2022.Category:CS1 German-language sources (de)
  5. Martin Dürst, Asmus Freytag (May 16, 2007). "Fraction Slash". W3C. Retrieved September 13, 2010.
  6. 1 2 3 4 5 6 7 8 "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024.
  7. "Approved Minutes of UTC Meeting 181". Unicode Consortium. January 27, 2025. Retrieved March 8, 2025.
  8. "UCD: Scripts.txt". The Unicode Standard. Retrieved September 21, 2022.
  9. Everson, Michael; West, Andrew (October 5, 2020). "L2/20-268: Revised proposal to add ten characters for Middle English to the UCS" (PDF).
  10. Kirk Miller & Michael Ashby, L2/20-253R Unicode request for IPA modifier letters (b), non-pulmonic.
  11. Kirk Miller & Michael Ashby, L2/20-252R Unicode request for IPA modifier-letters (a), pulmonic
  12. Kirk Miller (January 30, 2024). "L2/24-081: Latin Phonetic The for Middle Tilde" (PDF).
  13. Silva, Eduardo Marín (March 1, 2017). "L2/17-066R: Proposal to encode the Marca Registrada sign" (PDF).
Category:Unicode
Category:Redirects from Unicode characters Category:Unprintworthy redirects