-
Notifications
You must be signed in to change notification settings - Fork 145
es and es_en changes for unified models #143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
24 commits
Select commit
Hold shift + click to select a range
f40a62d
enable capitalized itn for es
mgrafu 5fcafae
enable capitalized itn for es_en
mgrafu e473184
Merge branch 'main' into en_es_unified_changes
mgrafu 64fd6c7
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] e36c813
style changes
mgrafu 5c701fe
style changes
mgrafu 16eeecc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 1ea5fd5
fix imports
mgrafu fa0a970
mod eval scripts
mgrafu f9f6945
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] a85e039
update whitelist
mgrafu 2dbaf56
update currencies for es itn
mgrafu e95cb4f
bugfix for es time grammar
mgrafu a00b6a8
update cache for es
mgrafu 6f0e5da
Merge branch 'main' into en_es_unified_changes
mgrafu 52b1493
ordinal tagger fix
mgrafu d67fff6
telephone tagger fix
mgrafu f627075
time tagger fix
mgrafu b7a09c8
time verbalizer fix
mgrafu e232c43
update cache
mgrafu 90980c4
Merge branch 'main' into en_es_unified_changes
mgrafu 86ffe6c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 5d9468c
Merge branch 'main' into en_es_unified_changes
mgrafu 134ffee
Merge branch 'main' into en_es_unified_changes
mgrafu File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
12 changes: 12 additions & 0 deletions
12
nemo_text_processing/inverse_text_normalization/es/data/dates/months_cased.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,12 @@ | ||
| Enero | ||
| Febrero | ||
| Marzo | ||
| Abril | ||
| Mayo | ||
| Junio | ||
| Julio | ||
| Agosto | ||
| Septiembre | ||
| Octubre | ||
| Noviembre | ||
| Diciembre |
11 changes: 11 additions & 0 deletions
11
nemo_text_processing/inverse_text_normalization/es/data/dates/year_suffix_cased.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,11 @@ | ||
| A. N. E. antes de nuestra era | ||
| A. E. C. antes de la era común | ||
| A. C. antes de Cristo | ||
| A. J. C. antes de Jesucristo | ||
| A. P. antes del presente | ||
| N. E. nuestra era | ||
| E. C. era común | ||
| D. C. después de Cristo | ||
| D. D. J. C. después de Jesucristo | ||
| B. C. B C | ||
| A. D. a d | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
1 change: 1 addition & 0 deletions
1
nemo_text_processing/inverse_text_normalization/es/data/money/currency_major_plural.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,6 +1,7 @@ | ||
| € euros | ||
| US$ dólares estadounidenses | ||
| US$ dólares americanos | ||
| CAD$ dólares canadienses | ||
| $ dólares | ||
| $ pesos | ||
| ¥ yenes | ||
|
|
||
75 changes: 75 additions & 0 deletions
75
...processing/inverse_text_normalization/es/data/money/currency_major_plural_capitalized.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,75 @@ | ||
| US$ Dólares Estadounidenses | ||
| US$ dólares Estadounidenses | ||
| US$ Dólares estadounidenses | ||
| US$ Dólares Americanos | ||
| US$ dólares Americanos | ||
| US$ Dólares americanos | ||
| CAD$ dólares Canadienses | ||
| CAD$ Dólares canadienses | ||
| CAD$ Dólares Canadienses | ||
| AR$ Pesos Argentinos | ||
| AR$ pesos Argentinos | ||
| AR$ Pesos argentinos | ||
| BRL Reales Brasileños | ||
| BRL reales Brasileños | ||
| BRL Reales brasileños | ||
| CHF Francos Suizos | ||
| CHF francos Suizos | ||
| CHF Francos suizos | ||
| CLP Pesos Chilenos | ||
| CLP pesos Chilenos | ||
| CLP Pesos chilenos | ||
| CNY Yuan Chinos | ||
| CNY yuan Chinos | ||
| CNY Yuan chinos | ||
| COP Pesos Colombianos | ||
| COP pesos Colombianos | ||
| COP Pesos colombianos | ||
| CRC Colones Costarricenses | ||
| CRC colones Costarricenses | ||
| CRC Colones costarricenses | ||
| CUP Pesos Cubanos | ||
| CUP pesos Cubanos | ||
| CUP Pesos cubanos | ||
| RD$ Pesos Dominicanos | ||
| RD$ pesos Dominicanos | ||
| RD$ Pesos dominicanos | ||
| GBP Libras Esterlinas | ||
| GBP libras Esterlinas | ||
| GBP Libras esterlinas | ||
| HKD Dólares De Hong Kong | ||
| HKD dólares de Hong Kong | ||
| HKD Dólares de hong kong | ||
| INR Rupias Indias | ||
| INR rupias Indias | ||
| INR Rupias indias | ||
| Mex$ Pesos Mexicanos | ||
| Mex$ pesos Mexicanos | ||
| Mex$ Pesos mexicanos | ||
| SVC Colones Salvadoreños | ||
| SVC colones Salvadoreños | ||
| SVC Colones salvadoreños | ||
| UYU Pesos Uruguayos | ||
| UYU pesos Uruguayos | ||
| UYU Pesos uruguayos | ||
| VES Bolívares Soberanos | ||
| VES bolívares Soberanos | ||
| VES Bolívares soberanos | ||
| BOP Pesos Bolivianos | ||
| BOP pesos Bolivianos | ||
| BOP Pesos bolivianos | ||
| CLE Escudos Chilenos | ||
| CLE escudos Chilenos | ||
| CLE Escudos chilenos | ||
| ECS Sucres Ecuatorianos | ||
| ECS sucres Ecuatorianos | ||
| ECS Sucres ecuatorianos | ||
| PEH Soles De Oro | ||
| PEH soles de Oro | ||
| PEH Soles de oro | ||
| VEB Bolívares Venezolanos | ||
| VEB bolívares Venezolanos | ||
| VEB Bolívares venezolanos | ||
| VEF Bolívares Fuertes | ||
| VEF bolívares Fuertes | ||
| VEF Bolívares fuertes |
1 change: 1 addition & 0 deletions
1
nemo_text_processing/inverse_text_normalization/es/data/money/currency_major_singular.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,6 +1,7 @@ | ||
| € euro | ||
| US$ dólar estadounidense | ||
| US$ dólar americano | ||
| CAD$ dólar canadiense | ||
| $ dólar | ||
| $ peso | ||
| ¥ yen | ||
|
|
||
76 changes: 76 additions & 0 deletions
76
...ocessing/inverse_text_normalization/es/data/money/currency_major_singular_capitalized.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,76 @@ | ||
| US$ dólar Estadounidense | ||
| US$ Dólar Estadounidense | ||
| US$ Dólar estadounidense | ||
| US$ dólar Estadounidense | ||
| US$ dólar Americano | ||
| US$ Dólar Americano | ||
| US$ Dólar americano | ||
| CAD$ dólar Canadiense | ||
| CAD$ Dólar canadiense | ||
| CAD$ Dólar Canadiense | ||
| AR$ peso Argentino | ||
| AR$ Peso Argentino | ||
| AR$ Peso argentino | ||
| BRL real Brasileño | ||
| BRL Real Brasileño | ||
| BRL Real brasileño | ||
| CHF franco Suizo | ||
| CHF Franco Suizo | ||
| CHF Franco suizo | ||
| CLP Peso Chileno | ||
| CLP peso Chileno | ||
| CLP Peso chileno | ||
| CNY Yuan Chino | ||
| CNY yuan Chino | ||
| CNY Yuan chino | ||
| COP Peso Colombiano | ||
| COP peso Colombiano | ||
| COP Peso colombiano | ||
| CRC Colón Costarricense | ||
| CRC colón Costarricense | ||
| CRC Colón costarricense | ||
| CUP Peso Cubano | ||
| CUP peso Cubano | ||
| CUP Peso cubano | ||
| RD$ Peso Dominicano | ||
| RD$ peso Dominicano | ||
| RD$ Peso dominicano | ||
| GBP Libra Esterlina | ||
| GBP libra Esterlina | ||
| GBP Libra esterlina | ||
| HKD Dólar De Hong Kong | ||
| HKD dólar de Hong Kong | ||
| HKD Dólar de hong kong | ||
| INR Rupia India | ||
| INR rupia India | ||
| INR Rupia india | ||
| Mex$ Peso Mexicano | ||
| Mex$ peso Mexicano | ||
| Mex$ Peso mexicano | ||
| SVC Colón Salvadoreño | ||
| SVC colón Salvadoreño | ||
| SVC Colón salvadoreño | ||
| UYU Peso Uruguayo | ||
| UYU peso Uruguayo | ||
| UYU Peso uruguayo | ||
| VES Bolívar Soberano | ||
| VES bolívar Soberano | ||
| VES Bolívar soberano | ||
| BOP Peso Boliviano | ||
| BOP peso Boliviano | ||
| BOP Peso boliviano | ||
| CLE Escudo Chileno | ||
| CLE escudo Chileno | ||
| CLE Escudo chileno | ||
| ECS Sucre Ecuatoriano | ||
| ECS sucre Ecuatoriano | ||
| ECS Sucre ecuatoriano | ||
| PEH Sol De Oro | ||
| PEH sol de Oro | ||
| PEH Sol de oro | ||
| VEB Bolívar Venezolano | ||
| VEB bolívar Venezolano | ||
| VEB Bolívar venezolano | ||
| VEF Bolívar Fuerte | ||
| VEF bolívar Fuerte | ||
| VEF Bolívar fuerte |
22 changes: 22 additions & 0 deletions
22
nemo_text_processing/inverse_text_normalization/es/data/ordinals/digit_capitalized.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,22 @@ | ||
| Primero uno | ||
| Primera uno | ||
| Primer uno | ||
| Segundo dos | ||
| Segunda dos | ||
| Tercero tres | ||
| Tercera tres | ||
| Tercer tres | ||
| Cuarto cuatro | ||
| Cuarta cuatro | ||
| Quinto cinco | ||
| Quinta cinco | ||
| Sexto seis | ||
| Sexta seis | ||
| Séptimo siete | ||
| Séptima siete | ||
| Sétimo siete | ||
| Sétima siete | ||
| Octavo ocho | ||
| Octava ocho | ||
| Noveno nueve | ||
| Novena nueve |
18 changes: 18 additions & 0 deletions
18
nemo_text_processing/inverse_text_normalization/es/data/ordinals/hundreds_capitalized.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,18 @@ | ||
| Centésimo ciento | ||
| Centésima ciento | ||
| Ducentésimo doscientos | ||
| Ducentésima doscientos | ||
| Tricentésimo trescientos | ||
| Tricentésima trescientos | ||
| Cuadringentésimo cuatrocientos | ||
| Cuadringentésima cuatrocientos | ||
| Quingentésimo quinientos | ||
| Quingentésima quinientos | ||
| Sexcentésimo seiscientos | ||
| Sexcentésima seiscientos | ||
| Septingentésimo setecientos | ||
| Septingentésima setecientos | ||
| Octingentésimo ochocientos | ||
| Octingentésima ochocientos | ||
| Noningentésimo novecientos | ||
| Noningentésima novecientos |
60 changes: 60 additions & 0 deletions
60
nemo_text_processing/inverse_text_normalization/es/data/ordinals/teen_capitalized.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,60 @@ | ||
| Décimo diez | ||
| Décima diez | ||
| Decimoprimero once | ||
| Decimoprimera once | ||
| Decimoprimer once | ||
| Décimo Primero once | ||
| Décima Primera once | ||
| Décimo Primera once | ||
| Décimo Primer once | ||
| Undécimo once | ||
| Undécima once | ||
| Decimosegundo doce | ||
| Decimosegunda doce | ||
| Décimo Segundo doce | ||
| Décima Segunda doce | ||
| Décimo Segunda doce | ||
| Duodécimo doce | ||
| Duodécima doce | ||
| Decimotercero trece | ||
| Decimotercera trece | ||
| Decimotercer trece | ||
| Décimo Tercero trece | ||
| Décima Tercera trece | ||
| Décimo Tercera trece | ||
| Décimo Tercer trece | ||
| Decimocuarto catorce | ||
| Decimocuarta catorce | ||
| Décimo Cuarto catorce | ||
| Décima Cuarta catorce | ||
| Décimo Cuarta catorce | ||
| Decimoquinto quince | ||
| Decimoquinta quince | ||
| Décimo Quinto quince | ||
| Décima Quinta quince | ||
| Décimo Quinta quince | ||
| Decimosexto dieciséis | ||
| Decimosexta dieciséis | ||
| Décimo Sexto dieciséis | ||
| Décima Sexta dieciséis | ||
| Décimo Sexta dieciséis | ||
| Decimoséptimo diecisiete | ||
| Decimoséptima diecisiete | ||
| Décimo Séptimo diecisiete | ||
| Décima Séptima diecisiete | ||
| Décimo Séptima diecisiete | ||
| Décimo Sétimo diecisiete | ||
| Décimo Sétima diecisiete | ||
| Décima Sétima diecisiete | ||
| Decimosétimo diecisiete | ||
| Decimosétima diecisiete | ||
| Decimoctavo dieciocho | ||
| Decimoctava dieciocho | ||
| Décimo Octavo dieciocho | ||
| Décima Octava dieciocho | ||
| Décimo Octava dieciocho | ||
| Decimonoveno diecinueve | ||
| Decimonovena diecinueve | ||
| Décimo Noveno diecinueve | ||
| Décima Novena diecinueve | ||
| Décimo Novena diecinueve |
15 changes: 15 additions & 0 deletions
15
nemo_text_processing/inverse_text_normalization/es/data/ordinals/ties_capitalized.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,15 @@ | ||
| Vigésimo veinte | ||
| Vigésima veinte | ||
| Trigésimo treinta | ||
| Cuadragésimo cuarenta | ||
| Cuadragésima cuarenta | ||
| Quincuagésimo cincuenta | ||
| Quincuagésima cincuenta | ||
| Sexagésimo sesenta | ||
| Sexagésima sesenta | ||
| Septuagésimo setenta | ||
| Septuagésima setenta | ||
| Octogésimo ochenta | ||
| Octogésima ochenta | ||
| Nonagésimo noventa | ||
| Nonagésima noventa |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.