ParaCrawl: Web-Scale Acquisition of Parallel Corpora M Bañón, P Chen, B Haddow, K Heafield, H Hoang, M Esplà-Gomis, ... | 78* | |
Prompsit’s submission to WMT 2018 Parallel Corpus Filtering shared task VM Sánchez-Cartagena, M Bañón, SO Rojas, G Ramírez-Sánchez Proceedings of the Third Conference on Machine Translation: Shared Task …, 2018 | 41 | 2018 |
Bifixer and Bicleaner: two open-source tools to clean your parallel data G Ramírez-Sánchez, J Zaragoza-Bernabeu, M Bañón, S Ortiz-Rojas Proceedings of the 22nd Annual Conference of the European Association for …, 2020 | 11 | 2020 |
ParaCrawl corpus version 1.0 P Koehn, K Heafield, ML Forcada, M Esplà-Gomis, S Ortiz-Rojas, ... LINDAT/CLARIN digital library at the Institute of Formal and Applied …, 2018 | 5 | 2018 |
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Proceedings of the 23rd Annual Conference of the European Association for …, 2022 | | 2022 |
Human evaluation of web-crawled parallel corpora for machine translation G Ramírez-Sánchez, M Bañón, J Zaragoza-Bernabeu, S Ortiz-Rojas Proceedings of the 2nd Workshop on Human Evaluation of NLP Systems (HumEval …, 2022 | | 2022 |
Icelandic web corpus MaCoCu-is 1.0 M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |
Maltese web corpus MaCoCu-mt 1.0 M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |
Slovene-English parallel corpus MaCoCu-sl-en 1.0 M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |
Bulgarian-English parallel corpus MaCoCu-bg-en 1.0 M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |
Croatian-English parallel corpus MaCoCu-hr-en 1.0 M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |
Turkish-English parallel corpus MaCoCu-tr-en 1.0 M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |
DSI-enriched ParaCrawl 9 en-es corpus M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |
Macedonian-English parallel corpus MaCoCu-mk-en 1.0 M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... Jožef Stefan Institute, 2022 | | 2022 |