Publications
2018
- Mikel Artetxe, Gorka Labaka, Eneko Agirre, Kyunghyun Cho (2018) Unsupervised Neural Machine Translation Sixth International Conference on Learning Representations (ICLR 2018)
- Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola (2018) Konbitzul: an MWE-specific Database for Spanish-BasqueProceedings of the 11th Language Resources and Evaluation Conference, Miyazaki, Japan. LREC-2018.pdf
- Thierry Etchegoyhen, Eva Martínez Garcia, Andoni Azpeitia, Gorka Labaka, Iñaki Alegria, Itziar Cortes Etxabe, Amaia Jauregi Carrera, Igor Ellakuria Santos, Maite Martin eta Eusebi Calonge (2018)Neural Machine Translation of Basque EAMT 2018. Alicante.
- Itziar Cortes, Igor Leturia, Iñaki Alegria, Aitzol Astigarraga, Kepa Sarasola eta Manex Garaio (2018) Massively multilingual accessible audioguides via cell phonesEAMT 2018 (Project/Product Track)
- Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola (2018) Analysing linguistic iniformation about word combinations for a Spanish-Basque Rule-Based Machine Translation systemMultiword Units in Machine Translation and Translation Technologies, John Benjamins Publishing Company (to be published). 41-60.
-
A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings Mikel Artetxe, Gorka Labaka, Eneko Agirre. In ACL 2018
- Generalizing and improving bilingual word embedding mappings with a multi-step framework of linear transformations Mikel Artetxe, Gorka Labaka, Eneko Agirre. In AAAI 2018
2017
- Nora Aranberri (2017) What Do Professional Translators Do when Post-Editing for the First Time? First Insight into the Spanish-Basque Language Pair HERMES - Journal of Language and Communication in Business, 56 97235-199464-1-SM.pdf
- Nora Aranberri, Gorka Labaka (2017) Euskarazko Itzulpen Automatikoa - IXA Taldea Senez, 48 (2017)
- Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola (2017) Aditza+izena konbinazioen itzulpen automatikoa, arau linguistikoen bidezIkerGazte: nazioarteko ikerketa euskaraz. Kongresuko artikulu-bilduma. Giza zientziak eta artea. Inurrieta_IkerGazte2017.pdf
- Learning bilingual word embeddings with (almost) no bilingual data Mikel Artetxe, Gorka Labaka, Eneko Agirre, In ACL 2017
-
Manex Agirrezabal, Iñaki Alegria, Mans Hulden (2017) A Comparison of Feature-Based and Neural Scansion of Poetry Proc. of the RANLP 2017. pp 18-23
-
Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2017. Learning bilingual word embeddings with (almost) no bilingual data. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 451-462. CoreA conference
-
Pablo Gamallo, Jose R. Pichel, Iñaki Alegria (2017) From language identification to language distance Physica A: Statistical Mechanics and its Applications. Volume 484, 15 October 2017, Pages 152–162. Citation Index Q1
-
Pablo Gamallo, Jose Ramom Pichel and Iñaki Alegria (2017) A Perplexity-Based Method for Similar Languages Discrimination Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects, pages 109–114, (EACL workshop)
-
Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola (2017) Rule-based translation of Spanish verb-noun combinations into Basque Proceedings of the 13th Workshop on Multiword Expressions MWE 2017 (EACL2017), pages 149–154,Valencia, Spain, April 4. ©2017 Association for Computational Linguistics
2016
-
Agirre E., Alegria I., Aranberri N., Artetxe M., Barrena A., Branco A., Diaz de Ilarraza A., Gojenola K., Labaka G., Otegi A., Sarasola K. (2015) Lexical semantics, Basque and Spanish in QTLeap: Quality Translation by Deep Language Engineering Approaches Procesamiento del Lenguaje natural, vol. 55, pp. 169-172. ISSN: 1135-5948
-
Manex Aguirrezabal, Iñaki Alegria, Mans Hulden (2016) Machine Learning for the Metrical Analysis of English Poetry Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 772–781, Osaka, Japan. ISBN:978-4-87974-702-0. CoreA conference
-
Iñaki Alegria, Izaskun Etxeberria (2016) EHU at the SIGMORPHON 2016 Shared Task. A Simple Proposal: Grapheme-to-Phoneme for Inflection SIGMORPHON 2016: Workshop on Computational Research in Phonetics, Phonology, and Morphology. At AC2016. ISBN: 978-1-945626-08-1
-
Nora Aranberri, Gorka Labaka, Oneka Jauregi, Arantza Díaz de Ilarraza, Iñaki Alegria, Eneko Agirre (2016) Tectogrammar-based machine translation for English-Spanish and English-Basque
SEPLN, 56, 73-80. ISSN: 1135-5948 -
Nora Aranberri, Gorka Labaka, Arantza Díaz de Ilarraza eta Kepa Sarasola (2016) Ebaluatoia: crowd evaluation for English;Basque machine translation Journal of Language Resources and Evaluation, p1-32, http://link.springer.com/article/ DOI: 10.1007/s10579-016-9335-x Citation Index Q3
-
Nora Aranberri (2016) Ba al dago lekurik euskararako postedizioarentzat? Senez, 47, p. 195-203. ISSN: 1132-2152. http://www.eizie.eus/Argitalpenak/Senez/20161103/13aranberri/senez47_ara...
-
Nora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ondrej Klejch, Martin Popel and Maja Popovic (2016) Tools and Guidelines for Principled Machine Translation Development Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) Pages: 1877-1882 ISBN: 978-2-9517408-9-1.
-
Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2016. Learning principled bilingual mappings of word embeddings while preserving monolingual invariance. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2289-2294. CoreA conference
-
Artetxe M., Labaka G., Saedi C., Rodrigues J., Silva J., Branco A., Agirre E. (2016) Adding syntactic structure to bilingual terminology for improved domain adaptation Proceedings of the 2nd Deep Machine Translation Workshop (DMTW 2016), pages 39-46 Lisbon, Portugal, 21 October 2016. ISBN 978-80-88132-02-8
-
Izaskun Etxeberria, Iñaki Alegria, Larraitz Uria and Mans Hulden (2016) Combining Phonology and Morphology for the Normalization of Historical Texts Latech 2016: Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities. At ACL 2016. ISBN 978-1-945626-09-8
-
Izaskun Etxeberria, Iñaki Alegria, Larraitz Uria and Mans Hulden (2016) Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene LREC 2016 conference. pp: 1064-1069. ISBN 978-2-9517408-9-1
-
Pablo Gamallo, José Ramom Pichel, Iñaki Alegria and Manex Agirrezabal (2016) Comparing two Basic Methods for Discriminating Between Similar Languages and Varieties Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects pages 170–177, Osaka, Japan, December 12 2016 COLING 2016. ISBN:978-4-87974-716-7.
-
R. Gaudio, G. Labaka, E. Agirre, P. Osenova, K. Simov, M. Popel, D. Oele, G. van Noord, L. Gomes, J. Rodrigues, S. Neale, J. Silva, A. Querido, N. Rendeiro and A. Branco (2016) SMT and Hybrid systems of the QTLeap project in the WMT16 IT-task Proceedings of the First Conference on Machine Translation (WMT16) , Volume 2: Shared Task Papers, pages 435–441, Berlin, Germany, August 11-12, 2016.©2016 Association for Computational Linguistics.
-
Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola (2016) Analysing linguistic iniformation about word combinations for a Spanish-Basque Rule-Based Machine Translation system Multiword Units in Machine Translation and Translation Technologies, John Benjamins Publishing Company (to be published 2017)
-
Uxoa Iñurrieta, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola, Itziar Aduriz, John Carroll. (2016) Using Linguistic Data for English and Spanish Verb-Noun Combination Identificatio Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers pages 857–867, Osaka, Japan, December 11-17 2016. ISBN: 978-4-87974-702-0.
-
Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola (2016) Analysing linguistic iniformation about word combinations for a Spanish-Basque Rule-Based Machine Translation system Multiword Units in Machine Translation and Translation Technologies, John Benjamins Publishing Company (to be published)
-
Labaka G., Alegria I. and Sarasola K. (2016) Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). http://www.lrec-conf.org/proceedings/lrec2016/pdf/632_Paper.pdf. pages: 2209-2213. ISBN: 978-2-9517408-9-1.
-
O. Perez-de-Viñaspre and G. Labaka (2016) IXA Biomedical Translation System at WMT16 Biomedical Translation Task Proceedings of the First Conference on Machine Translation, Volume 2: Shared Task Papers, pages 477–482, Berlin, Germany, August 11-12, 2016. 2016 Association for Computational Linguistics (WMT16)
-
Iñaki San Vicente, Iñaki Alegria, Nora Aranberri, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez, Antonio Toral, Arkaitz Zubiaga (2016) TweetMT: A parallel microblog corpus Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Pages: 2936-2941. ISBN: 978-2-9517408-9-1.
-
Xabier Saralegi, Eneko Agirre and Iñaki Alegria (2016) Evaluating translation quality and CLIR performance of Query Sessions LREC 2016 conference. European Language Resources Association (ELRA) ISBN: 978-2-9517408-9-1
-
Arkaitz Zubiaga, Iñaki San Vicente, Pablo Gamallo, José Ramom Pichel, Iñaki Alegria, Nora Aranberri, Aitzol Ezeiza, Víctor Fresno (2015) TweetLID: A Benchmark for Tweet Language Identification. Journal of Language Resources and Evaluation. Springer, 50(4), 729-766. Citation Index Q3
2015
-
Iñaki Alegria, Nora Aranberri, Pere R. Comas, Víctor Fresno, Pablo Gamallo, Lluis Padró, Iñaki San Vicente, Jordi Turmo, Arkaitz Zubiaga (2015) TweetNorm: a benchmark for lexical normalization of Spanish tweets Journal of Language Resources and Evaluation. Volume 49, issue 3, September 2015. ISSN 1574-020X Citation Index Q3
-
Iñaki Alegria, Mikel Artetxe, Gorka Labaka, Kepa Sarasola (2015) EHU at TweetMT: Adapting MT Engines for Formal Tweets CEUS-WS, Vol-1445, 20-24 Proceedings of the Tweet Translation Workshop 2015 co-located with 31st Conference of the Spanish Society for Natural Language Processing (SEPLN 2015) ISSN 1613-0073
-
Mikel Artetxe, Eneko Agirre, Iñaki Alegria, Gorka Labaka (2015) Analyzing English-Spanish Named-Entity enhanced Machine Translation Ninth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-9). NAACL2015
-
Uxoa Iñurrieta (2015) Translation of Spanish Multiword Expressions into Basque: linguistic analysis and detection experiment Actas del XXXI Congreso de la Sociedad Española para el Procesamiento del Lenguaje Natural. ISBN: 978-84-608-1989-9
-
Nora Aranberri (2015) SMT error analysis and mapping to syntactic, semantic and structural fixes Proceedings of SSST-9, Ninth Workshop on Syntax, Semantics and Structure in Statistical Translation , pages 30–38, Denver, Colorado, June 4, 2015.
-
Nora Aranberri, Arantza Diaz de Ilarraza, Gorka Labaka, Kepa Sarasola (2015) Ebaluatoia 2014: ingelesa-euskara itzultzaile automatikoen konparazioa X. Informatikari Euskaldunen Bilkura (IEB2015) Udako Euskal Unibertsitatea. Donostia
-
Aranberri N., Labaka G., Díaz de Ilarraza A. and Sarasola K. (2015) Exploiting portability to build an RBMT prototype for a new source language Proceedings of the 18 th Annual Conference of the European Association for Machine Translation, EAMT-2015, pp. 3-10, Antalya, Turkey
-
Uxoa Iñurrieta (2015) Konbitzul: euskarazko eta gaztelaniazko izen+aditz konbinazioen datu-basea IkerGazte: nazioarteko ikerketa euskaraz. Kongresuko artikulu bilduma, 32-38. ISBN: 978-84-8438-540-0
-
Gorka Labaka, Oneka Jauregi, Arantza Díaz de Ilarraza, Michael Ustaszewski, Nora Aranberri and Eneko Agirre (2015) Deep-syntax TectoMT for English-Spanish MT Proceedings of the 1st Deep Machine Translation Workshop (DMTW 2015), pages 55–63, Praha, Czech Republic,
-
Mikel Artetxe, Gorka Labaka, Kepa Sarasola (2015) Building hybrid machine translation systems by using an EBMT preprocessor to create partial translations Proceedings of the 18th Annual Conference of the European Association for Machine Translation (EAMT2015), pp. 11-18, Antalya, Turkey
-
Iñaki Alegria, Nora Aranberri, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez Garcia, Iñaki San Vicente, Antonio Toral, Arkaitz Zubiaga (2015) Overview of TweetMT: A Shared Task on Machine Translation of Tweets at SEPLN 2015 CEUS-WS, Vol-1445, 8-19 Proceedings of the Tweet Translation Workshop 2015 co-located with 31st Conference of the Spanish Society for Natural Language Processing (SEPLN 2015) ISSN 1613-0073