The document provides an overview of the Taigi language, including:
- Taigi is derived from Chinese Min Nan and is also known as Taiwanese.
- It introduces some common Taigi words and their Mandarin translations.
- It describes the main writing systems used for Taigi - Pe?h-┃e-j┤, Taiwanese kana, and Taiwanese Romanization System (TRS).
- It provides details on the consonants, vowels, and tone marks in the Pe?h-┃e-j┤ system.
5. Taigi, ^Tai-wan ue? ̄, si jit kuan ui Man-nan-gu ia?n-
pian lai e gi-gia?n.
Taigi, or literally, ^Taiwanese Language ̄, is derived
from Chinese Min-Nan
13. Writing Systems
¢ P h-┃e-ji? (POJ)eo
C 1820
C has wikipedia language code: zh-min-nan
¢ Taiwanese kana (Katagana based)
C 1931-ish
¢ TRS (Taiwanese Romanization System)
C 2006, derived from POJ
14. POJ
¢ P h-┃e-ji? (POJ; Han-ji?eo 紺徭忖 ) si? 1 khoa?n i┃ng Latin
(L?-ma?) ph┬ng-im he?-thong la?i sia? Ta?i-?an e? gi-gia?n e? su-
bi?n b?n-ji?. In-u?i tong-chh si? th?an-kau-su? in--j p-la?i e?, s -oo ?o ┏o
i i h-u?-la?ng ka? POJ ki┛-ch┛ Kau-h┃e L?-ma?-ji?, h k-chia? si?ao eo
ka?n-chheng Kau-l?. Put-j?-k┛ hia?n-ta?i e? s┣-i┃ng-chia? be?-
chio -si? kau-t , kau-t ma? chin che? be?-hia?u POJ.mm ?o ?o
¢ Creators (1830s):
C Walter Henry Medhurst
C Elihu Doty
C John Van Nest Talmage
15. Consonants and Vowels
¢ p b ph m t th n nng l k g kh h chi ji chhi si
ch j chh s
¢ a ap at ak ah a? ok o? o e e? i ian eng ekoo
i? ai ai? au am om m ong ng u oa oe oai oan
i (i)u?
¢ sng / mng / m
¢ 717 known syllables (from moedict dataset)
16. Marks ○ Tones
1 2 3 4
a a? a ap/at/ak/ah
5 6 7 8
a? ┌ a? p/ t/ k/ hao ao ao ao
17. Interesting letters
¢ ? - U+0207F - SUPERSCRIPT LATIN SMALL
LETTER N
C chiu? / chhia?? / phi?
¢ oo
C o - U+0006F - LATIN SMALL LETTER O
C - U+00358 - COMBINING DOT ABOVE?o
RIGHT
19. Online Tools
¢ Moedict: https://moedict.tw/
C Multi-Dictionary with cross-ref
C Han-based search (some TRS-based search)
C Pronunciation
¢ itaigi: https://itaigi.tw/
C Community content with pronunciations
¢
Forvo
C Community content with speakers of multiple accents
C Han-script based index
¢
POJ Wikipedia
C Peh-oe-ji writing system
20. Tone-less TRS analyzer
¢ Normalize input into Unicode NFD form
¢ Remove all p{Combining_Diacritical_Marks}
¢ Split by non-letters
¢
@tokens =
split(
/[^a-z]/i,
(Unicode::Normalize::NFD($input)
=~ s/p{Combining_Diacritical_Marks}//gr
=~ s/┿/a/gr)
);