The 7th International Workshop of the ISCA Special Interest Group on Speech and Language Technology for Minority Languages (SaLTMiL: see http://ixa2.si.ehu.es/saltmil), will be held in Malta, on a date between May 17 and May 23, 2010 to be announced, as part of the 2010 International Language Resources and Evaluation Conference (LREC). Entitled "Creation and use of basic lexical resources for less-resourced languages", the workshop is intended to continue the series of SALTMIL/LREC workshops on computational language resources for minority languages, held in Granada (1998), Athens (2000), Las Palmas de Gran Canaria (2002), Lisbon (2004), Genoa (2006) and Marrakech (2008). The Malta 2010 workshop aims to share information on tools and best practice, so that isolated researchers will not need to start from scratch. An important aspect will be the forming of personal contacts, which can minimize duplication of effort. There will be a balance between presentations of existing language resources, and more general presentations designed to give background information needed by all researchers.
09.00 | Registration |
09.30 | Opening |
09.45 | Invited talk: Marc Kemps-Snijders. LAT team at the Max Planck Institute at Nijmegen. "ELAN and RELISH project" |
10.30 | Coffee break |
11.00 | Invited talk: Antton Gurrutxaga and Igor Leturia: Elhuyar Foundation. "Exploiting Internet to build language resources for less resourced languages" |
11.45 | Oral papers (20+5 min.): |
Tommi A Pirinen and Krister Lindén: "Finite-State Spell-Checking with Weighted Language and Error Models–Building and Evaluating Spell-Checkers with Wikipedia" as Corpus | |
Aric Bills, Lori S. Levin, Lawrence D. Kaplan, and Edna Agheak MacLean: "Finite-State Morphology for Iñupiaq" | |
12.35 | Poster session |
Marco Passarotti: "Leaving Behind the Less-Resourced Status. The Case of Latin through the Experience of the Index Thomisticus Treebank" | |
Anna Björk Nikulásdóttir and Matthew Whelpton: "Extraction of Semantic Relations as a Basis for a Future Semantic Database for Icelandic" | |
Gábor Prószéky, Attila Novák, István Endrédy, Beatrix Oszkó, László Fejes, Sándor Szeverényi, Zsuzsa Várnai and Beáta Wagner-Nagy: "Nganasan – Computational Resources of a Language on the Verge of Extinction" | |
Géraldine Walther and Benoît Sagot: "Developing a large-scale lexicon for a less-resourced language: Sorani Kurdish" | |
Hrafn Loftsson, Jökull Yngvason, Sigrún Helgadóttir and Eiríkur Rögnvaldsson: "Developing a PoS-tagged corpus using existing tools" | |
13.20 |
Panel: Less resourced languages and Language technology. Short- and medium-term objectives (SaLTMiL)
|
14.00 | Closing |
28 February 2010 4 March 2010 Deadline for submission
22 March 2010 Notification
29 March 2010 Final version
23 May 2010 Workshop
Registration form available in LREC 2010 site