We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 4673bba commit 28ad0f7Copy full SHA for 28ad0f7
2 files changed
itn/chinese/rules/cardinal.py
@@ -134,8 +134,9 @@ def build_tagger(self):
134
cardinal |= (number + dot + digits.plus)
135
# cardinal string like 110 or 12306 or 13125617878, used in phone,
136
# 340621199806051223, used in ID card
137
+ idcard_last_char = digits | 'X' | 'x'
138
cardinal |= (digits**3 | digits**4 | digits**5 | digits**11
- | digits**18)
139
+ | (digits**17 + idcard_last_char) | digits**18)
140
141
# 4. 特殊格式的数字 + 标准数字
142
# cardinal string like 23
itn/chinese/test/data/cardinal.txt
@@ -12,6 +12,7 @@
12
十七八万 => 17-8万
13
六十三四万 => 63-4万
14
我的身份证号是三四零二零三一九三七零幺零幺零五幺七 => 我的身份证号是340203193701010517
15
+我的身份证号是三四零二零三一九三七零幺零幺零五幺X => 我的身份证号是34020319370101051X
16
给一三三四五三一二二二一打电话 => 给13345312221打电话
17
给一三三四五三一二二二一拨电话 => 给13345312221拨电话
18
一二三四 => 1234
0 commit comments