Zh tn oct5 update #112
Merged
165 commits
a22da43
Swedish telephone fix (#60)
jimregan 45f2e58
log instead of print in graph_utils.py (#68)
eginhard a7dd550
CER estimation speedup for audio-based text normalization (#73)
vsl9 de622b3
add measure coverage for TN and ITN (#62)
ealbasiri 34e761e
upload es-ES, es-LA, fr-FR and it-IT g2p dicts (#63)
mgrafu 2eb1cd2
add country codes from hu (#77)
jimregan 5f7de06
fix electronic case for username (#75)
ekmb 009f723
0.1.8 release (#79)
ekmb 143ff75
Codeswitched ES/EN ITN (#78)
anand-nv 0263378
electronic verbalizer fallback (#81)
ekmb 1168dc7
minor normalize.py edit for usability (#84)
lleaver 54a9fd4
Swedish ITN (#40)
jimregan 0df10a2
Italian_TN (#67)
GiacomoLeoneMaria 2fd5270
Zh itn (#74)
BuyuanCui 1312367
updated pynini_export.py file to create far files (#88)
BuyuanCui 68f482f
readd Swedish (#87)
jimregan a9aa462
Zh tn 0712 (#89)
BuyuanCui f5fce61
Zh tn char (#95)
BuyuanCui 9e994d1
audio-based TN fix for empty pred_text/text (#92)
ekmb fdad64e
pip 1.2.0
ekmb 9bd65c8
French tn (#91)
mgrafu 7678c51
Add whitelist_tech.tsv (#96)
anand-nv b5ce536
Zhitn 0727 (#93)
BuyuanCui 6fa8cc0
Es tn romans fix (#98)
mgrafu b5b18b4
Change docker image (#102)
anand-nv 4473d6f
Print warning instead exception (#97)
karpnv 2dd40ff
warning regardless of verbose flag (#107)
karpnv 42aa7d3
Unpin setuptools (#106)
pplantinga 9d2b2e3
fixed warnings: File is not always closes. (#113)
XuesongYang a866742
fix bug #111 (ar currencies) (#117)
mgrafu 739e4a2
Logging clean up + IT TN fix (#118)
ekmb a737374
Time_IT_TN (#105)
GiacomoLeoneMaria 1b9800f
IT TN improvement on tests (#120)
mgrafu 304ed7c
add single letter exception for roman numerals (#121)
mgrafu ae0e0bc
rewrote tokenizer
BuyuanCui cd9d786
removed the file and replaced it with char in 1.8
BuyuanCui 071aad3
jenkins file update
BuyuanCui 97b71c2
to fix tn bug@ xuesong
BuyuanCui d61c913
tn bug
BuyuanCui 04e440b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 5951c5a
fixeds and updates
BuyuanCui 27c3887
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 45707d0
adjustments
BuyuanCui 4d41adb
testing commit
BuyuanCui a5de011
removing unsed file
BuyuanCui 633acf8
updated test cases
BuyuanCui 0026e92
updating etst cases
BuyuanCui 25a206c
updates adapting to graphs
BuyuanCui a759797
updated cases for SH tests
BuyuanCui c1c926b
updated cases
BuyuanCui 01f54e6
added some sentences
BuyuanCui cc9e5bf
test cases update
BuyuanCui fca36eb
solving rebase issue, repushing changes
BuyuanCui 0be4e23
resolving conflict
BuyuanCui 0f914cd
Merge branch 'main' into zh_tn_oct5_update
BuyuanCui 78b51a0
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 41174da
fixings according to ci
BuyuanCui 8323fa4
fixings according to the ci
BuyuanCui f98f9ad
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui f5903f4
removed not used
BuyuanCui e8623c9
notused removing
BuyuanCui eb7971d
format issue
BuyuanCui 5b091a3
formt issue
BuyuanCui 193ffe8
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] b1db6c5
removing unused files
BuyuanCui e136933
removing unused files
BuyuanCui 012b8bc
remiving unsed files;
BuyuanCui c46b3c2
removing unsed files
BuyuanCui 06e41df
removing unsed files
BuyuanCui cb8159a
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui 4cb65a7
added sentences as test cases
BuyuanCui 1a126bf
added senetnces as test cases
BuyuanCui 53bf81f
removed commentyed out tests
BuyuanCui b0f0474
updating dates
BuyuanCui 8910562
attemps to fix bug
BuyuanCui 431da95
inprocess of fixing the bug
BuyuanCui 3f352f6
fixing existing issue
BuyuanCui e58d6f8
updated graph_utils, tokenize and classify, and word graphs
BuyuanCui 1ed9f80
added bacl the ppostprocessor far creation
BuyuanCui 58c0c35
updated NEMO_NOT_ALPHA as a new variable
BuyuanCui 7d8aaca
far files
BuyuanCui ad94fe5
Merge branch 'main' into zh_tn_oct5_update
BuyuanCui 8bedcd6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 0ef73b9
combiedn into measure
BuyuanCui 1db8c96
removing and combined to meaasure
BuyuanCui 0a69d22
removing, not used
BuyuanCui cad806c
mergeing for the existring tn update
BuyuanCui fb08fc4
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 2f9473a
updates to fix space issue
BuyuanCui 8edb96d
updates to fix space issue
BuyuanCui 36494cf
updates to fix space issue
BuyuanCui cb9a6c1
updates to solve the space issue
BuyuanCui fce448f
resolving sh issue
BuyuanCui 6ece2bd
resolving sh test issue
BuyuanCui 6efc8d5
adding anands updates
BuyuanCui ef3bd23
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui fa7668f
data updated for measure and whitelist
BuyuanCui 94352a9
updates
BuyuanCui 43d608f
updates
BuyuanCui d2d9076
updates
BuyuanCui f8f2ec3
removing fraction and math part
BuyuanCui 9e2b288
removing comments
BuyuanCui 2506b0b
removing preprocessor, updating measure, adding shitelist cases
BuyuanCui eaf8be3
removing processor, modification for sp test, shitelist and word
BuyuanCui d980c0b
updating zh date
BuyuanCui b726aa2
Merge branch 'main' into zh_tn_oct5_update
BuyuanCui 8b0fa4a
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] fb6d7b8
realized itn being cvommented out, adding back
BuyuanCui d558778
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui 215be06
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 1931fda
trying to run zh tn separately because it takes long time to run
BuyuanCui b28dbd4
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui 3a8faf4
modification to ru zh tn separately
BuyuanCui 43ac3de
independent zh tnitn tests for more time
BuyuanCui 80a43bc
adding lines to save far file
BuyuanCui 376d34c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] fe930e4
updates for reducing testing time
BuyuanCui 6bafca4
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 9254658
for ounct graph
BuyuanCui e15c5c9
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui f8efd81
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 653d365
removing used graphs
BuyuanCui 30564fa
format and removing used comments
BuyuanCui 29ee1d8
removing this one, not used
BuyuanCui eb87a8e
remove unused commentss
BuyuanCui 6245414
removing unsed comments
BuyuanCui d86e123
removing unsed comments
BuyuanCui 4e65a12
removing comments
BuyuanCui 4667e9c
Delete tools/text_processing_deployment/zh directory
BuyuanCui 81710a6
updates according to the github comments
BuyuanCui 3da4b24
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui 1b4c52d
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] fd573fb
removing comments
BuyuanCui 4fa1d76
punct grammar
BuyuanCui 1c15111
Merge branch 'zh_tn_oct5_update' of https://github.com/NVIDIA/NeMo-te…
BuyuanCui d788b65
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 46064b2
Update test_cases_cardinal.txt
BuyuanCui 4e7e9cd
Update Dockerfile
BuyuanCui e4a3e8e
Update launch.sh
BuyuanCui cde7782
Update test_word.py
BuyuanCui 54f0223
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] ea3f78a
Update money.py
BuyuanCui b99634b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 94ee1f5
Update Jenkinsfile
BuyuanCui 2841319
Update utils.py
BuyuanCui 8709751
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] b1faa7d
Update graph_utils.py
BuyuanCui a1129b7
Update measure.py
BuyuanCui 396b212
Update word.py
BuyuanCui e3c2adb
Update measure.py
BuyuanCui 5563b63
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 6340746
Update post_processing.py
BuyuanCui f32af56
Update post_processing.py
BuyuanCui 2142cd3
Update word.py
BuyuanCui 85c99cb
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 329cb13
Update cardinal.py
BuyuanCui 699b5bb
Update word.py
BuyuanCui 4d52a18
Update word.py
BuyuanCui 1403b3a
Update verbalize.py
BuyuanCui 35b556f
Update post_processing.py
BuyuanCui 011a0ff
Update test_sparrowhawk_normalization.sh
BuyuanCui bdfcb46
Update test_ordinal.py
BuyuanCui 38d99a4
Delete nemo_text_processing/text_normalization/zh/taggers/math_symbol.py
BuyuanCui 99ee464
Delete nemo_text_processing/text_normalization/zh/verbalizers/math_sy…
BuyuanCui cc9d49e
Update Jenkinsfile
BuyuanCui
| Original file line number | Diff line number | Diff line change |
|---|---|---|
@@ -70,3 +70,5 @@ | |
| … | ||
| ‧ | ||
| ﹏ | ||
| < | ||
| > | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
@@ -5,3 +5,4 @@ | |
| × 乘 | ||
| ÷ 除 | ||
| ° 度 | ||
| - 减 | ||
211 changes: 0 additions & 211 deletions
nemo_text_processing/text_normalization/zh/data/measure/units_zh.tsv
This file was deleted.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
@@ -168,7 +168,6 @@ Ft 匈牙利福林 | |
| ₪ 以色列谢克尔 | ||
| J$ 牙买加元 | ||
| лв 哈萨克斯坦腾格 | ||
| ₩ 朝鲜园 | ||
| лв 吉尔吉斯斯坦索姆 | ||
| ₭ 老挝基普 | ||
| ден 马其顿代纳尔 | ||
9 changes: 9 additions & 0 deletions
nemo_text_processing/text_normalization/zh/data/number/digit_alt.tsv
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,9 @@ | ||
| 1 一 | ||
| 2 两 | ||
| 3 三 | ||
| 4 四 | ||
| 5 五 | ||
| 6 六 | ||
| 7 七 | ||
| 8 八 | ||
| 9 九 |
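For reviewers who want to sanity-check a two-column table like `digit_alt.tsv`, here is a minimal sketch in plain Python. It assumes the columns are tab-separated (the separator is not visible in this diff view), and the helper name `load_mapping` is hypothetical, not part of the repository. How the grammars in this PR actually consume the file is not shown in the diff.

```python
import csv
import io

# Rows copied from the digit_alt.tsv hunk above.
# Assumption: the two columns are separated by a tab character.
SAMPLE = "1\t一\n2\t两\n3\t三\n4\t四\n5\t五\n6\t六\n7\t七\n8\t八\n9\t九\n"


def load_mapping(handle):
    """Read two-column, tab-separated rows into a dict (hypothetical helper)."""
    mapping = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) != 2:
            # Skip blank or malformed rows instead of failing.
            continue
        key, value = row
        mapping[key] = value
    return mapping


digit_alt = load_mapping(io.StringIO(SAMPLE))
```

Note that the table maps `2` to 两 rather than 二, which looks like the reason it is kept as a separate "alt" digit file.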