News & events

April 30 2019 | San Sebastian | First meeting of the consortium to launch the project

May 29 2019 | El grupo Ixa obtiene uno de las ayudas de investigación de la Fundación BBVA

 

May 25 2020 | San Sebastian | Publication in the first year of the project:

  • Aitor Ormazabal, Mikel Artetxe, Gorka Labaka, Aitor Soroa and Eneko Agirre (2019). Analyzing the Limitations of Cross-lingual Word Embedding Mappings. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4990-4995. SCIE Class 1
  • Javier Álvez, Itziar Gonzalez-Dios, German Rigau (2020). Towards Word Sense Disambiguation by Reasoning Vampire 2018 and Vampire 2019. The 5th and 6th Vampire Workshops. EPiC Series in Computing. Pages 19-29. ISSN: 2398-7340
  • Kepa Bengoetxea, Itziar Gonzalez-Dios, Amaia Aguirregoitia (2020). AzterTest: Open source linguistic and stylistic analysis tool. Procesamiento del Lenguaje Natural, 64, 61-68.
  • Nora Aranberri (2020). Can translationese features help users select an MT system for post-editing? Revista Procesamiento del Lenguaje Natural, 64, 93-100.
  • Xabier Soto, Olatz Perez de Viñaspre, Maite Oronoz, Gorka Labaka (2019). Leveraging SNOMED CT terms and relations for machine translation of clinical texts from Basque to Spanish. Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation
  • Mikel Artetxe, Gorka Labaka, Eneko Agirre (2019). An Effective Approach to Unsupervised Machine Translation. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 194-203. SCIE Class 1
  • Mikel Artetxe, Gorka Labaka, Eneko Agirre (2019). Bilingual Lexicon Induction through Unsupervised Machine Translation. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5002-5007. SCIE Class 1
  • Javier Álvez, Itziar Gonzalez-Dios, German Rigau (2019). Commonsense Reasoning Using WordNet and SUMO: a Detailed Analysis. Proceedings of the Tenth Global Wordnet Conference, pp 197--205. ISBN 978-83-7493-108-3
  • Itziar Gonzalez-Dios, German Rigau (2019). Textual genre based approach to use wordnets in language-for-specific-purpose classroom as dictionary. Proceedings of the Tenth Global Wordnet Conference, pp 222--227. ISBN 978-83-7493-108-3

July 1 2021 | San Sebastian | We are excited to report the following awards related to the project:

  • Eneko Agirre, IP of the project, has been awarded the Aritmel SCIE-BBVA 2021 Spanish National Informatics Award
  • Mikel Artetxe, pre-doctoral researcher in the project, has been awarded the SCIE-BBVA 2021 National Prize for Young Researchers in Informatics
  • Mikel Artetxe's doctoral thesis has been awarded the best thesis in Artificial Intelligence in Europe EurAI 2020

July 1 2021 | San Sebastian | Publications in the second year of the project:

  •  Jon Ander Campos, Arantxa Otegi, Aitor Soroa, Jan Deriu, Mark Cieliebak, Eneko Agirre (2020). DoQA - Accessing Domain-Specific FAQs via Conversational QA. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7302-7314. SCIE Class 1.
  • Mikel Artetxe, Gorka Labaka, Eneko Agirre (2020). Translation Artifacts in Cross-lingual Transfer Learning. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pp. 7674–7684. SCIE Class 1.
  • Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre, Ondřej Bojar (2020). Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pp. 255–262.
  • Mikel Artetxe, Gorka Labaka, Noe Casas, Eneko Agirre (2020). Do all roads lead to Rome? Understanding the role of initialization in iterative back-translation. Knowledge-Based Systems, v. 206, pp. 1-6. ISSN 0950-7051. JCR 5.921 (Q1).
  • Aitzol Elu, Gorka Azkune, Oier Lopez de Lacalle, Ignacio Arganda-Carreras, Aitor Soroa, Eneko Agirre (2021). Inferring spatial relations from textual descriptions of images. Pattern Recognition, v. 113, pp. 1-10. ISSN 0031-3203. JCR 7.196 (Q1).
  • Arantxa Otegi, Aitor Agirre, Jon Ander Campos, Aitor Soroa, Eneko Agirre (2020). Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque. Proceedings of the 12th Conference on Language Resources and Evaluation, pp. 429–435. SCIE Class 3.
  • Mikel Artetxe, Sebastian Ruder, Dani Yogatama, Gorka Labaka, Eneko Agirre (2020). A Call for More Rigor in Unsupervised Cross-lingual Learning. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7375–7388. SCIE Class 1.
  • Juan J. Lastra-Díaz, Josu Goikoetxea, Mohamed Ali Hadj Taieb, Ana Garcia-Serrano, Mohamed Ben Aouicha, Eneko Agirre, David Sánchez (2021). A large reproducible benchmark of ontology-based methods and word embeddings for word similarity. Information Systems, v. 96, pp. 1-17. ISSN 0306-4379. JCR Q1.
  • Ander Barrena, Aitor Soroa, Eneko Agirre (2021). Towards zero-shot cross-lingual named entity disambiguation. Expert Systems with Applications v. 184, pp. 1-9. ISSN 0957-4174. JCR 6.954 (Q1).
  • Rodrigo Agerri, Iñaki San Vicente, Jon Ander Campos, Ander Barrena, Xabier Saralegi, Aitor Soroa, Eneko Agirre (2020). Give your Text Representation Models some Love: the Case for Basque. Proceedings of the 12th Conference on Language Resources and Evaluation, pp. 4781–4788. SCIE Class 3.
  • Oier Lopez de Lacalle, Ander Salaberria, Aitor Soroa, Gorka Azkune, Eneko Agirre (2020). Evaluating Multimodal Representations on Visual Semantic Textual Similarity. Proceedings of the Twenty-third European Conference on Artificial Intelligence, pp. 1990-1997. SCIE Class 2.
  • Elena Zotova, Rodrigo Agerri, German Rigau (2021). Semi-automatic generation of multilingual datasets for stance detection in Twitter. Expert Systems with Applications, v. 170, pp. 1-13. ISSN 0957-4174. JCR 5.452 (Q1).
  • Oscar Sainz, German Rigau (2021). Ask2Transformers: Zero-Shot Domain labelling with Pre-trained Language Models. Proceedings of the 11th Global WordNet Conference, pp. 44–52. ISBN 978-9-464027-31-0.
  • Itziar Gonzalez-Dios, Javier Álvez, German Rigau (2020). Towards modeling SUMO attributes through WordNet adjectives: a Case Study on Qualities. Proceedings of the Workshop on Multimodal Wordnets (LREC), pp. 1–6. ISBN 979-10-95546-41-2.
  • Rodrigo Agerri, German Rigau (2020). Projecting Heterogeneous Annotations for Named Entity Recognition. Proceedings of the Iberian Languages Evaluation Forum, pp. 45-51. Winner of the CAPITEL@IberLEF task on Spanish NER.
  • Elena Zotova, Rodrigo Agerri, Manuel Nuñez and German Rigau (2020). Multilingual Stance Detection in Tweets: The Catalonia Independence Corpus. Proceedings of the 12th Language Resources and Evaluation Conference, pp. 1368–1375. SCIE Class 3.
  • Rodrigo Agerri, German Rigau (2020). Language independent sequence labelling for Opinion Target Extraction. Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, pp. 5110-5115. SCIE Class 1.
  • Javier Álvez, Itziar Gonzalez-Dios, German Rigau (2020). Applying the Closed World Assumption to SUMO-based FOL Ontologies for Effective Commonsense Reasoning. Proceedings of the European Conference in Artificial Intelligence. Published as Frontiers in Artificial Intelligence and Applications. Giuseppe De Giacomo, Alejandro Catala, Bistra Dilkina, Michela Milano, Senén Barro, Alberto Bugarín, Jérôme Lang (eds.), v. 325, IOS Press Ebooks, pp. 585-592. ISBN 978-1-64368-100-9 (print) | 978-1-64368-101-6 (online). SCIE class 2.
  • Ainhoa Serna, Aitor Soroa, Rodrigo Agerri (2021). Applying Deep Learning Techniques for Sentiment Analysis to Assess Sustainable Transport. Sustainability 13, no. 4: 2397, pp. 1-19. ISSN 2071-1050. JCR Q2.
  • Nora Aranberri (2020). With or without you? Effects of using machine translation to write flash fiction in the foreign language. Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, pp. 165–174. ISBN 978-989-33-0589-8. SCIE Class 3.
  • Jon Alkorta, Itziar Gonzalez-Dios (2020). Exploring the Enrichment of Basque WordNet with a Sentiment Lexicon. Proceedings of the Workshop on Multimodal Wordnets (LREC), pp. 20–24. ISBN 979-10-95546-41-2.
  • Itziar Gonzalez-Dios, Kepa Bengoetxea, Amaia Aguirregoitia (2020). LagunTest: A NLP Based Application to Enhance Reading Comprehension. 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (LREC), pp. 63–69. ISBN: 979-10-95546-44-3. SCIE Class 3
  • Cecilia Domingo, Tatiana Gonzalez-Ferrero, Itziar Gonzalez-Dios (2021). What is on Social Media that is not in WordNet? A Preliminary Analysis on the TwitterAAE Corpus. Proceedings of the 11th Global Wordnet Conference, pp. 234-242. ISBN 978-9-464027-31-0.
  • Xabier Soto, Olatz Perez-de-Viñaspre, Gorka Labaka, Maite Oronoz (2020). Ixamed's submission description for WMT20 Biomedical shared task: benefits and limitations of using terminologies for domain adaptation. Proceedings of the Fifth Conference on Machine Translation, pp: 873–878. SCIE Class 3
  • Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka, and Eneko Agirre (2021). Beyond offline mapping: Learning cross lingual word embeddings through context anchoring. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 6479–6489. SCIE Class 1