<?xml version="1.0" encoding="utf-8"?>
<!-- generator="Joomla! 1.5 - Open Source Content Management" -->
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>Home</title>
		<description>Saltmil: SPEECH AND LANGUAGE TECHNOLOGY FOR MINORITY LANGUAGES</description>
		<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28</link>
		<lastBuildDate>Thu, 11 Feb 2016 09:41:30 +0000</lastBuildDate>
		<generator>Joomla! 1.5 - Open Source Content Management</generator>
		<language>en-gb</language>
		<item>
			<title>Report on the 9th SaLTMiL Workshop (Reykjavik, 2014)</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/104-report-on-the-9th-saltmil-workshop-reykjavik-2014</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/104-report-on-the-9th-saltmil-workshop-reykjavik-2014</guid>
			<description><![CDATA[<div id="magicdomid55" class="ace-line">
<h2 class="contentheading" style="text-align: center;">Report on the 9th SaLTMiL Workshop on  “<strong>Free/open-Source Language Resources for the Machine Translation of  Less-Resourced Languages</strong>”<br />by Mikel L. Forcada<br />Tuesday, 27 May 2014.<br />Reykjavik (Iceland)<strong><br />(</strong><span class="moz-txt-link-freetext"><a href="http://ixa2.si.ehu.es/%7Ejipsagak/SALTMIL/LREC_2014_Workshop_Proceedings_Saltmil.pdf" title="Proceedings of LREC2014 Saltmil Workshop">Proceedings</a>)</span></h2>
<p><span class="moz-txt-link-freetext"><br /></span></p>
<span>The 9th SaLTMiL  workshop on Free/open-source language resources  for the machine  translation of less-resourced languages, held as part  of LREC 2014 in  Reykjavík on May 27, 2014, from 09:30 to 13:30, was a  very well attended  event. About 40 people were present, more than the  31 attendees  registered as of 22nd May, 2014.</span></div>
<div id="magicdomid4"><span>After  a brief welcoming address by Mikel  Forcada, there were two oral  sessions, interrupted by the coffee break.  Both sessions ran very  smoothly, with plenty of questions asked from  the audience. </span></div>
<div>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Iñaki Alegria, Unai Cabezon, Unai Fernandez de Betoño, Gorka Labaka, Aingeru<br />Mayor, Kepa Sarasola and Arkaitz Zubiaga</strong><br />Wikipedia and Machine Translation: killing two birds with one stone</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Gideon Kotzé and Friedel Wolff</strong><br />Experiments with syllable-based English-Zulu alignment</p>
</div>
<div><span>In the  second session, chaired by Trond Trosterud, the paper  by Matthew Marting  and Kevin Unhammer was presented by Francis Tyers  as the authors could  not make it to Iceland.</span><br />
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Inari Listenmaa and Kaarel Kaljurand</strong><br />Computational Estonian Grammar in Grammatical Framework</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Matthew Marting and Kevin Unhammer</strong><br />FST Trimming: Ending Dictionary Redundancy in Apertium</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Hrvoje Peradin, Filip Petkovski and Francis Tyers</strong><br />Shallow-transfer rule-based machine translation for the Western group of South Slavic<br />languages</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Alex Rudnick, Annette Rios Gonzales and Michael Gasser</strong><br />Enhancing a Rule-Based MT System with Cross-Lingual WSD</p>
</div>
<div id="magicdomid6"><span>The  workshop ended with a 30-minute general  discussion, moderated by  Francis Tyers. Two main questions were posed  to the audience: </span></div>
<ol>
<li><span>Is research in minority-language machine translation already mainstream </span> </li>
<li><span>What are the main difficulties in building or putting together   free/open-source language resources for small languages, and how  should  they be addressed? Are we pooling these resources correctly?</span></li>
</ol>
<div id="magicdomid16"><span>The audience was also invited to openly discuss other issues if necessary. Here is a detailed summary of what was discussed.</span></div>
<div><span><br /></span></div>
<div id="magicdomid18"><span>Question (1) quickly turned into a discussion on research about rule-based systems. </span></div>
<div id="magicdomid22" style="padding-left: 30px;"><span>Lori   Levin said that minority languages are becoming mainstream and   researchers are publishing in journal venue, that we need to educate   people on the research issues related to rule-based language resources.   She added that "It seems that tinkering with statistical models is   research whereas tinkering with rules is not".</span></div>
<div style="padding-left: 30px;"><span>Robert   Frederking: It is hard to publish papers on rule-based machine   translation, at least in America. Francis Tyers replied that it may be   easier in Europe, perhaps because European and American funding   objectives are different.</span></div>
<div id="magicdomid24" style="padding-left: 30px;"><span>Someone   [not identified in Mikel Forcada's notes: apologies!] said that   linguistic research may help facing some issues. Lori Levin said that   the problem is partly linguistic and partly not, and the key is where to   spend time in rule-based machine translation for maximum impact.</span></div>
<div id="magicdomid26" style="padding-left: 30px;"><span>Francis   Tyers mentions that the statistical machine translation community is   strong partly because they use standardized evaluation measures. Antonio   Toral mentioned that most papers in machine translation conferences  are  on statistical machine translation, and improvements reported are   usually less than one BLEU point, but added that, in view of the results   of a workshop on machine translation evaluation held the day before,   there is very little correlation between BLEU score and productivity   gains. Maja Popović added that there is a tendency in statistical   machine translation towards morphologically rich, under-resourced   languages. Trond is sceptical about research that does not take into   account existing morphologies for these languages or does not aim at   developing them. Maja Popović adds that it is better to have something   than nothing and that all knowledge should be combined.</span><span>The   role of linguists and their involvement is also discussed. Jonathan   Washington explains his experience as a linguist getting involved in   morphologies for Turkic languages and the issues faced. Lori Levin says   that we should get more linguists involved, but that rule writing is a   skill that not everyone has; many people think they can, but we should   educate linguists to be rule writers. Also, on interaction between   linguists and prominent statistical machine translation researchers, she   says that they are very busy people and that it is quite hard to get   into their schedule to discuss these issues. </span></div>
<div id="magicdomid30" style="padding-left: 30px;"><span>Laurette   Pretorius speaks from a South African context. She says that   computational linguistics is not taught in South African Universities,   and stresses the importance of collaboration. She says that linguists   should not be assigned the boring tasks, such as annotation tasks, but   that they should be involved in the whole design. Francis Tyers also   talks about the choice between doing tedious annotation or more   interesting rule writing and understand that people would rather prefer   the second. Mikel Forcada warns about the fact that linguists tend to   get carried away by the low-frequency "jewels" of their languages and   lose sight of the high-frequency "building blocks" needed for working   systems.</span></div>
<div id="magicdomid32" style="padding-left: 30px;"><span>Christian   Buck returns to the fact that the statistical machine translation   community has yearly "shoot-outs" (contests) where they can test their   advancement, and that these contests do drive their research. </span></div>
<div id="magicdomid34" style="padding-left: 30px;"><span>Jonathan   Washington mentions that the computational perspective made him  rethink  many of the issues relating Turkic languages. Mikel mentions  that in  fact, computational linguistics descriptions are the best  descriptions  of language sometimes, and mentions the IXA Group's  "computational  morphology of Basque" or Elaine Uí Dhonnchadha's Irish  morphology as the  best description of their languages' morphologies.</span></div>
<div id="magicdomid36" style="padding-left: 30px;"><span>Lori   Levin stresses the fact that linguists have to be trained to do the   linguistic engineering. For instance, lexical-functional grammars may   teach aspects such as modularity.</span></div>
<div id="magicdomid38" style="padding-left: 30px;"><span>Mikel   L. Forcada mentions two problems in rule-based research: one, that   rule-based MT as a field is very fragmented after the pervasive   irruption of statistical machine translation, and, as a result, we do   not speak with one voice and use inconsistent terminologies which make   it very difficult to articulate ourselves as a field. Another one is   reproducibility: for our rule-based research to be reproducible we have   to make it all available, and this naturally leads to free/open-source   licensing.</span></div>
<div id="magicdomid40" style="padding-left: 30px;"><span>Sjur   Moshagen talks about the fact that resources should be reusable in   other language technologies and explains that in Tromsø they had no   option as they were building the only set of resources for Sámi   languages and they had to be reusable, and that linguists had to be   trained in engineering issues. In fact, one of the uses was the Apertium   machine translation systems for Sámi languages. </span></div>
<div><span><br /></span></div>
<div id="magicdomid42"><span>Sjur  Moshagen opens question (2) about  pooling and resource sharing and asks  where would be a good place to  take all these resources.</span></div>
<div id="magicdomid44" style="padding-left: 30px;"><span>On   reproducibility, it is discussed that commercial rule-based systems   often do not even want to be mentioned by name and ask to be called   "System A" or "System B" in contests. Some judge this to be unfortunate.</span></div>
<div id="magicdomid46" style="padding-left: 30px;"><span>Mikel   Forcada mentions that pooling should pay special attention to metadata   describing how to use the resources. He says that this is a more   difficult problem than licensing.</span></div>
<div id="magicdomid48" style="padding-left: 30px;"><span>Friedel   Wolff questions whether licensing is really an easy problem and talks   about incompatibilities among licenses such as Creative Commons and the   General Public License, and says that when it comes to license derived   data, decisions may be far from being trivial. </span></div>
<div style="padding-left: 30px;"><span><br /></span></div>
<div id="magicdomid50"><span>The  discussion stops here and moderator  Francis Tyers thanks everyone for  the rich discussion. Mikel L. Forcada  thanks everyone for attending and  closes the Workshop.</span></div>]]></description>
			<author>mlf@dlsi.ua.es (Mikel Forcada Zubizarreta)</author>
			<category>Home</category>
			<pubDate>Wed, 28 May 2014 00:00:00 +0000</pubDate>
		</item>
		<item>
			<title>Programme of the 9th SaLTMiL Workshop on “Free/open-Source Language Resources for the Machine Translation of Less-Resourced Languages”</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/101-programme-of-the-9th-saltmil-workshop-on-freeopen-source-language-resources-for-the-machine-translation-of-less-resourced-languages</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/101-programme-of-the-9th-saltmil-workshop-on-freeopen-source-language-resources-for-the-machine-translation-of-less-resourced-languages</guid>
			<description><![CDATA[<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: center;">A half-day workshop at LREC 2014 <br />Tuesday, 27 May 2014.<br />Reykjavik (Iceland)</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: center;">SALTMIL: <a class="moz-txt-link-freetext" href="http://ixa2.si.ehu.es/saltmil/">http://ixa2.si.ehu.es/saltmil/</a> <br />LREC 2014: <a href="http://lrec2014.lrec-conf.org/en/">http://lrec2014.lrec-conf.org/en/</a><br />Website: <a class="moz-txt-link-freetext" href="http://ixa2.si.ehu.es/saltmil/">http://ixa2.si.ehu.es/saltmil/</a></p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: center;"><a class="moz-txt-link-freetext" href="http://ixa2.si.ehu.es/saltmil/"><br /></a><span class="moz-txt-link-freetext"><strong>Registration</strong>: <a href="http://lrec2014.lrec-conf.org/en/registration/">http://lrec2014.lrec-conf.org/en/registration/ </a></span></p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: center;"> </p>
<h1 style="text-align: left;"><strong>Workshop Programme <br />(</strong><span class="moz-txt-link-freetext"><a href="http://ixa2.si.ehu.es/%7Ejipsagak/SALTMIL/LREC_2014_Workshop_Proceedings_Saltmil.pdf" title="Proceedings of LREC2014 Saltmil Workshop">Proceedings</a>)</span></h1>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;">09:00 – 09:30 Welcoming address by Workshop co-chair Mikel L. Forcada</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;">09:30 – 10:30 Oral papers</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Iñaki Alegria, Unai Cabezon, Unai Fernandez de Betoño, Gorka Labaka, Aingeru<br />Mayor, Kepa Sarasola and Arkaitz Zubiaga</strong><br />Wikipedia and Machine Translation: killing two birds with one stone</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Gideon Kotzé and Friedel Wolff</strong><br />Experiments with syllable-based English-Zulu alignment</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;">10:30 – 11:00 Coffee break</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;">11:00 – 13:00 Oral papers</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Inari Listenmaa and Kaarel Kaljurand</strong><br />Computational Estonian Grammar in Grammatical Framework</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Matthew Marting and Kevin Unhammer</strong><br />FST Trimming: Ending Dictionary Redundancy in Apertium</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Hrvoje Peradin, Filip Petkovski and Francis Tyers</strong><br />Shallow-transfer rule-based machine translation for the Western group of South Slavic<br />languages</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 60px;"><strong>Alex Rudnick, Annette Rios Gonzales and Michael Gasser</strong><br />Enhancing a Rule-Based MT System with Cross-Lingual WSD</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;">13:00 – 13:30 General discussion</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;">13:30 Closing</p>
<h1>Workshop Organizers</h1>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;"><strong>Mikel L. Forcada</strong> Universitat d’Alacant, Spain<br /><strong>Kepa Sarasola</strong>. Euskal Herriko Unibertsitatea, Spain<br /><strong>Francis M. Tyers</strong>. UiT Norgga árktalaš universitehta, Norway</p>
<h1>Workshop Programme Committee</h1>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;"><strong>Iñaki Alegria</strong> Euskal Herriko Unibertsitatea, Spain<br /><strong>Lars Borin</strong> Göteborgs Universitet, Sweden<br /><strong>Elaine Uí Dhonnchadha</strong> Trinity College Dublin, Ireland<br /><strong>Mikel L. Forcada</strong> Universitat d’Alacant, Spain<br /><strong>Michael Gasser</strong> Indiana University, USA<br /><strong>Måns Huldén</strong> Helsingin Yliopisto, Finland<br /><strong>Krister Lindén</strong> Helsingin Yliopisto, Finland<br /><strong>Nikola Ljubešić</strong> Sveučilište u Zagrebu, Croatia<br /><strong>Lluís Padró</strong> Universitat Politècnica de Catalunya, Spain<br /><strong>Juan Antonio Pérez-Ortiz</strong> Universitat d’Alacant, Spain<br /><strong>Felipe Sánchez-Martínez </strong>Universitat d’Alacant, Spain<br /><strong>Kepa Sarasola </strong>Euskal Herriko Unibertsitatea, Spain<br /><strong>Kevin P. Scannell</strong> Saint Louis University, USA<br />Antonio Toral Dublin City University, Ireland<br /><strong>Trond Trosterud</strong> UiT Norgga árktalaš universitehta, Norway<br /><strong>Francis M. Tyers</strong> UiT Norgga árktalaš universitehta, Norway</p>
<h1>Introduction</h1>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;">The 9th International Workshop of the Special Interest Group on Speech and Language Technology for Minority Languages (SaLTMiL) will be held in Reykjavík, Iceland, on 27th May 2014, as part of the 2014 International Language Resources and Evaluation Conference (LREC). (For SALTMIL see: http://ixa2.si.ehu.es/saltmil/); it is also framed as one of the activities of European project Abu-Matran (http://www.abumatran.eu). Entitled "Free/open-source language resources for the machine translation of less-resourced languages", the workshop is intended to continue the series of SALTMIL/LREC workshops on computational language resources for minority languages, held in Granada (1998), Athens (2000), Las Palmas de Gran Canaria (2002), Lisbon (2004), Genoa (2006), Marrakech (2008), La Valetta (2010) and Istanbul (2012), and is also expected to attract the audience of Free Rule-Based Machine Translation workshops (2009, 2011, 2012). <br /><br />The workshop aims to share information on language resources, tools and best practice, to save isolated researchers from starting from scratch when building machine translation for a less-resourced language. An important aspect will be the strengthening of the free/open-source language resources community, which can minimize duplication of effort and optimize development and adoption, in line with the LREC 2014 hot topic ‘LRs in the Collaborative Age’ (http://is.gd/LREChot).<br /><br />Papers describe research and development in the following areas:</p>
<ul>
<li>Free/open-source language resources for rule-based machine translation (dictionaries, rule sets)</li>
<li>Free/open-source language resources for statistical machine translation (corpora)</li>
<li>Free/open-source tools to annotate, clean, preprocess, convert, etc. language resources for machine translation</li>
<li>Machine translation as a tool for creating or enriching free/open-source language resources for less-resourced languages</li>
</ul>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify; padding-left: 30px;"> </p>]]></description>
			<author>kepa.sarasola@ehu.es (Kepa Sarasola)</author>
			<category>Home</category>
			<pubDate>Fri, 02 May 2014 18:22:59 +0000</pubDate>
		</item>
		<item>
			<title>Call for Papers: 9th SaLTMiL workshop on “Free/open-source language resources for the machine translation of less-resourced languages” at LREC 2014.</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/99-cfpsaltmil2014</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/99-cfpsaltmil2014</guid>
			<description><![CDATA[<p><!-- H3 { margin-top: 0.49cm; margin-bottom: 0.14cm; direction: ltr; font-variant: normal; color: rgb(0, 0, 0); line-height: 100%; text-align: left; widows: 2; orphans: 2; text-decoration: none; page-break-after: auto; }H3.western { font-family: "Arial",serif; font-style: normal; }H3.cjk { font-family: "Arial"; font-style: normal; }H3.ctl { font-family: "Arial"; font-size: 12pt; font-weight: normal; }P { margin-bottom: 0.21cm; }A:link {  } --></p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify;">A full-day workshop at LREC 2014 <br />Tuesday, 27 May 2014.<br />Reykjavik (Iceland)</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify;">SALTMIL: <a class="moz-txt-link-freetext" href="http://ixa2.si.ehu.es/saltmil/">http://ixa2.si.ehu.es/saltmil/</a> <br />LREC 2014: <a href="http://lrec2014.lrec-conf.org/en/">http://lrec2014.lrec-conf.org/en/</a><br />Website: <a class="moz-txt-link-freetext" href="http://ixa2.si.ehu.es/saltmil/">http://ixa2.si.ehu.es/saltmil/</a> <br />Paper submission: <a class="moz-txt-link-freetext" href="https://www.softconf.com/lrec2014/SaLTMiL/">https://www.softconf.com/lrec2014/SaLTMiL/</a><span style="background: #ffff00;"> </span></p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify;">The 9th International Workshop of the Special Interest Group on Speech and Language Technology for Minority Languages (SaLTMiL) will be held in Reykjavík, Iceland, on May 24, 2014, as part of the 2014 International Language Resources and Evaluation Conference (LREC).  (For SALTMIL see: <a href="http://ixa2.si.ehu.es/saltmil/"><span style="color: #1155cc;"><span style="text-decoration: underline;">http://ixa2.si.ehu.es/saltmil/</span></span></a>); it is also framed as one of the activities of European project Abu-Matran (<span style="color: #1155cc;"><span style="text-decoration: underline;"><a href="http://www.abumatran.eu">http://www.abumatran.</a><a href="http://www.abumatran.eu">eu</a></span></span>).  Entitled "Free/open-source language resources for the machine translation of less-resourced languages", the workshop is intended to continue the series of SALTMIL/LREC workshops on computational language resources for minority languages, held in Granada (1998), Athens (2000), Las Palmas de Gran Canaria (2002), Lisbon (2004), Genoa (2006), Marrakech (2008), La Valetta (2010) and Istanbul (2012), and is also expected to attract the audience of Free Rule-Based Machine Translation workshops (2009, 2011, 2012). The workshop aims to share information on language resources, tools and best practice, to save isolated researchers from starting from scratch when building machine translation for a less-resourced language.  An important aspect will be the strengthening of the free/open-source language resources community, which can minimize duplication of effort and optimize development and adoption, in line with the LREC 2014 hot topic ‘LRs in the Collaborative Age’ (<a href="http://is.gd/LREChot"><span style="color: #1155cc;"><span style="text-decoration: underline;">http://is.gd/LREChot</span></span></a>).</p>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify;">The whole-day workshop will consist of short oral papers, a poster session preceded by a poster-boaster session (2 minutes, 2 slides per poster), and a round table.</p>
<p style="margin-bottom: 0cm">Papers are invited that describe research and development in the following areas:</p>
<ul>
<li>FOS LR for rule-based machine translation (dictionaries, rule sets)</li>
<li>FOS LR for statistical machine translation (corpora)</li>
<li>FOS tools to annotate, clean, preprocess, convert, etc. LRs for machine translation</li>
<li>Machine translation as a tool for creating or enriching FOS LRs for less-resourced languages</li>
</ul>
<ul>
</ul>
<p style="margin-bottom: 0.39cm; line-height: 100%; text-align: justify;">Position papers and (web based) demonstrations will also be considered for presentation.</p>
<p style="margin-bottom: 0cm; text-align: justify;">The best papers, as evaluated by the programme committee, will be presented orally and the remaining paper will be presented in poster format.</p>
<p style="margin-bottom: 0cm; text-align: justify;">We expect short papers of max 6,000 words (up to 6 pages) describing research addressing one of the above topics, to be submitted as PDF documents by using the LREC 2014 START conference management system (<a class="moz-txt-link-freetext" href="https://www.softconf.com/lrec2014/SaLTMiL/">https://www.softconf.com/lrec2014/SaLTMiL/</a><span style="background: #ffff00;"> </span>).</p>
<p style="margin-bottom: 0cm; text-align: justify;">Submissions should be anonymized. When submitting a paper through the START page, authors will be kindly asked to share the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this initiative, please refer to <a href="http://lrec2014.lrec-conf.org/en/calls-for-papers/lrec-2014-special-highlight/"><span style="color: #1155cc;"><span style="text-decoration: underline;">http://lrec2014.lrec-conf.org/en/calls-for-papers/lrec-2014-special-highlight/</span></span></a>.</p>
<p style="margin-bottom: 0cm; text-align: justify;">Submissions of papers should follow the same style as the papers for the main LREC conference (an <a href="http://lrec2014.lrec-conf.org/en/submission/authors-kit/">Author's Kit</a> made of specific guidelines and downloadable templates will be published on the conference web site in due time). All contributions will be included in the workshop proceedings (CD). They will also be published on the SALTMIL website.</p>
<p style="margin-bottom: 0cm; text-align: justify;">The registration fees will be duly announced at the LREC 2014 site. Registration in the workshop willl include a coffee break and the Proceedings of the Workshop. Registration will be handled by the LREC 2014 Secretariat.</p>
<p> </p>
<h3 class="western"><a name="h.ccjxq2278o6q"></a>Important dates</h3>
<p style="margin-bottom: 0cm"><strong> </strong></p>
<p style="margin-bottom: 0cm"><strong>Deadline for paper submission</strong>: <span style="text-decoration: line-through;">February 10 2014</span> February 17, 2014<br /><strong>Notification of acceptance sent</strong>: <span style="text-decoration: line-through;">March 3, 2014 March, 10, 2014</span> March, 14, 2014<br /><strong>Camera-ready paper due</strong>: March 21, 2014</p>
<h3 class="western"><a name="h.vocqkwuhobad"></a>Organizing committee</h3>
<p style="margin-bottom: 0cm">Joint e-mail address: saltmil2014@dlsi.ua.es</p>
<p style="margin-bottom: 0cm">(1) <strong>Dr Francis M Tyers</strong><br />Institutt for språkvitskap<br />Det humanistiske fakultet,<br />N-9037 Universitetet i Tromsø<br /><a href="mailto:ftyers@prompsit.com"><span style="color: #1155cc;"><span style="text-decoration: underline;">ftyers@prompsit.com</span></span></a></p>
<p style="margin-bottom: 0cm">(2) <strong>Dr Kepa Sarasola</strong><br />Computer Science Faculty<br />Dept. of Computer Languages<br />The University of the Basque Country<br />P.K. 649  20080 DONOSTIA<br />Basque Country, Spain<br />Tel: +34 943 01 81 54<br />Fax: +34 943 21 93 06<br />ksarasola@ehu.es<br />http://ixa.si.ehu.es</p>
<p style="margin-bottom: 0cm">(3) <strong>Prof Mikel L. Forcada</strong><br />Dept. Llenguatges i Sistemes informàtics<br />Universitat d’Alacant<br />E-03071 Alacant (Spain)<br />Tel: +34 96 590 9776<br />FAx: +34 96 590 9326<br /><a href="mailto:mlf@ua.es"><span style="color: #1155cc;"><span style="text-decoration: underline;">mlf@ua.es</span></span></a><br />http://www.dlsi.ua.es/~mlf</p>
<p style="margin-bottom: 0cm"> </p>
<h3 class="western">Programme Committee</h3>
<p style="margin-bottom: 0cm">Iñaki <strong>Alegria</strong>, Euskal Herriko Unibertsitatea, Spain<br />Lars <strong>Borin</strong>, Göteborgs Universitet, Sweden.<br />Elaine Uí <strong>Dhonnchadha</strong>, Trinity College Dublin, Ireland<br />Mikel L. <strong>Forcada</strong>, Universitat d’Alacant, Spain<br />Michael <strong>Gasser</strong>, Indiana University, USA<br />Måns <strong>Huldén</strong>, Helsingin Yliopisto, Finland<br />Krister <strong>Lindén</strong>, Helsingin Yliopisto, Finland<br />Nikola <strong>Ljubešić</strong>, Sveučilište u Zagrebu, Croatia<br />Lluís <strong>Padró</strong>, Universitat Politècnica de Catalunya, Spain<br />Juan Antonio <strong>Pérez-Ortiz</strong>, Universitat d’Alacant, Spain<br />Felipe <strong>Sánchez-Martínez</strong>, Universitat d’Alacant<br />Kepa <strong>Sarasola</strong>, Euskal Herriko Unibertsitatea, Spain<br />Kevin P. <strong>Scannell</strong>, Saint Louis University, USA<br />Antonio <strong>Toral</strong>, Dublin City University, Ireland<br />Trond <strong>Trosterud</strong>, Universitet i Tromsø, Norway<br />Francis M. <strong>Tyers</strong>, Universitet i Tromsø, Norway</p>
<p style="margin-bottom: 0cm"> </p>]]></description>
			<author>kepa.sarasola@ehu.es (Kepa Sarasola)</author>
			<category>Home</category>
			<pubDate>Thu, 26 Dec 2013 11:23:16 +0000</pubDate>
		</item>
		<item>
			<title>Report on the 8th LREC workshop. 2012</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/98-report-on-the-8th-lrec-workshop-2012-home</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/98-report-on-the-8th-lrec-workshop-2012-home</guid>
			<description><![CDATA[<p>On May 22nd 2012, SALTMIL held in collaboration with <a href="http://AfLaT.org/">AfLaT</a> a  full-day <a href="http://ixa2.si.ehu.es/saltmil/index.php/en/news-mainmenu-287">workshop on "Language technology for normalisation of less-resourced languages"</a>.  This was a  satellite workshop preceding the biennial LREC (Language  Resources and  Evaluation Conference) in Istanbul, Turkey.</p>
<p>The program started with the invited talk presented by Sjur Moshagen  Nørstebø. This was then followed  by two sessions of four oral  presentations and a poster session with eight contributed  poster  papers. All the presentations and posters stimulated many questions and  discussions.</p>
<p>At 17.30, after a brief presentation by Francys Tyers and Guy De  Pauw, an interesting  discussion took place on "Language technology for  normalisation of less-resourced languages" and then the workshop was  closed  by thanking the audience for their participation in the whole  workshop.</p>
<p>About fourty five people were present in total, from a wide range of   countries, and representing work on a variety of less resourced   languages.</p>
<p>Addional materials related to this workshop are available:</p>
<ul>
<li>Proceedings    (<a href="http://ixa2.si.ehu.es/%7Ejipsagak/SALTMIL2012_AfLat_Proceedings.pdf">pdf</a>)</li>
<li>Slides of Sjur Moshagen Nørstebø's invited talk (<a href="http://ixa2.si.ehu.es/%7Ejipsagak/Moshagen_slides.pdf">pdf</a>)</li>
<li>Posters and slides of some oral presentations. (<a href="http://ixa2.si.ehu.es/%7Ejipsagak/SATLMIL2012_Presentations.zip">zip</a>)</li>
</ul>]]></description>
			<author>kepa.sarasola@ehu.es (Kepa Sarasola)</author>
			<category>Home</category>
			<pubDate>Wed, 04 Jul 2012 07:14:41 +0000</pubDate>
		</item>
		<item>
			<title>LREC-2012: SALTMIL-AfLaT Workshop on “Language technology for normalisation of less-resourced languages”</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/95-lrec-2012-cfp-home</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/95-lrec-2012-cfp-home</guid>
			<description><![CDATA[<p style="text-align: center;"><br />A full-day workshop at LREC 2012 <br />Tuesday, 22 May 2012. <br />Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey <br /> <br />SALTMIL: <a class="moz-txt-link-freetext" href="http://ixa2.si.ehu.es/saltmil/">http://ixa2.si.ehu.es/saltmil/</a> <br />AfLaT: <a class="moz-txt-link-freetext" href="http://aflat.org/">http://AfLaT.org/</a> <br />LREC 2012: <a class="moz-txt-link-freetext" href="http://www.lrec-conf.org/lrec2012/">http://www.lrec-conf.org/lrec2012/</a></p>
<p>WORKSHOP PROGRAMME</p>
<p>09:15–09:30 <strong>Welcome / Opening Session</strong><br />09:30–10:30<strong> Invited Talk: Sjur Moshagen Nørstebø</strong>. How to build language technology resources for the next 100 years<br />10:30–11:00 Coffee Break<br />11:00–13:00 <strong>Oral papers: </strong><strong>Resource Creation</strong></p>
<ul>
<li><strong>Elaine Uí Dhonnchadha, Alessio Frenda and </strong><strong> Brian Vaughan</strong><strong></strong>, Issues in Designing a Spoken Corpus of Irish.</li>
<li><span><strong>Wondwossen </strong></span><strong>Mulugeta and Michael Gasser</strong>, Learning Morphological Rules for Amharic Verbs Using Inductive LogicProgramming</li>
<li><strong>Krist ́n Bjarnadottir</strong>, The Database of Modern Icelandic Inflection</li>
<li><strong>Fadoua Ataa Allah and Siham Boulaknadel</strong>, Natural Language Processing for Amazigh Language: Challenges and Future Directions</li>
</ul>
<p>13:00–14:00 Lunch Break<br />14:00–16:00 <strong>Oral papers:<strong> </strong></strong><strong>Resource Use</strong></p>
<ul>
<li><strong>Tommi A. Pirinen and Francis M. Tyers</strong>. Compiling Apertium morphological dictionaries with HFST and using them in HFST applications.</li>
<li><strong>Borbóla Siklósi, György Orosz, Attila Novák and Gábor Prószéky</strong>. Automatic structuring and correction suggestion system for Hungarian clinical records.</li>
<li><strong>Linda Wiechetek.</strong> Constraint Grammar based Correction of Grammatical Errors for North Sàmi.</li>
<li><strong>Michael Gasser</strong>, Toward a Rule-Based System for English-Amharic Translation. </li>
</ul>
<p>16:00–16:30   Coffee Break<br />16:30–17:30   <strong>Poster Session</strong></p>
<ul>
<li><strong>Emmanuel Cartier and Paola Carrion Gonzalez</strong>, Technological Tools for Dictionary and Corpora Building for Minority Languages: Example of the French-based Creoles.</li>
<li><strong>Denys Duchier, Brunelle Magnana Ekoukou, Yannick Parmentier, Simon Petitjean and Emannuel Schang</strong>, Describing Morphologically-rich Languages using Metagrammars: a Look at Verbs in Ikota.</li>
<li><strong>Tjerk Hagemeijer, Iris Hendrickx, Abigail Tiny and Haldane Amaro</strong>, A Corpus of Santomé.</li>
<li><strong>Sigrún Helgad ́ ttir, Asta Svavarsdóttir, Eiríkur Rögnvaldsson, Kristín Bjarnadóttir and Hrafn Loftsson</strong>, The Tagged Icelandic Corpus (MM). </li>
<li><strong>Laurette Pretorius and Sonja Bosch</strong>, Semi-automated extraction of morphological grammars for Nguni with special reference to Southern Ndebele.</li>
<li><strong>Björn Gambäck</strong>, Tagging and Verifying an Amharic News Corpus.</li>
<li><strong>Guy De Pauw, Gilles-Maurice de Schryver and Janneke van de Loo</strong>. Resource-Light Bantu Part-of-Speech Tagging.</li>
<li><strong>Gulshan Dovudov, Vít Suchomel and Pavel Smerk</strong>, POS Annotated 50M Corpus of Tajik Language. </li>
</ul>
<p><br />CONTEXT AND FOCUS<br /> <br />The   8th International Workshop of the ISCA Special Interest Group on    Speech and Language Technology for Minority Languages (SALTMIL,  <a class="moz-txt-link-freetext" href="http://ixa2.si.ehu.es/saltmil/../undefined/saltmil">http://ixa2.si.ehu.es/saltmil</a>)  and the 4th Workshop on African Language  Technology (<a href="http://aflat.org/content/call-papers-workshop-language-technology-normalisation-less-resourced-languages-aflat2012sal">AfLaT2012</a>)  will  be held as a joint effort in Istanbul, in  May 2012, as part of  the 2012  International Language Resources and  Evaluation Conference  (LREC  2012). <br /> <br />Entitled "Language technology for normalisation  of  less-resourced  languages", the workshop is intended to continue the   series of  SALTMIL/LREC workshops on computational language resources   for minority  languages, held in Granada (1998), Athens (2000), Las   Palmas de Gran  Canaria (2002) and Lisbon (2004), Genoa (2006),   Marrakech (2008) and  Malta (2010) and the series of AfLaT workshops,   held in Athens  (EACL2009), Malta (LREC2010) and Addis Ababa (AGIS11). <br /> <br />The   Istanbul 2012 workshop aims to share information on tools and best    practices, so that isolated researchers will not need to start from    scratch. An important aspect will be the forming of personal contacts,    which can minimize duplication of effort. There will be a balance    between presentations of existing language resources, and more general    presentations designed to give background information needed by all    researchers. <br /> <br />While less-resourced languages and minority   languages often struggle to  find their place in a digital world   dominated by only a handful of  commercially interesting languages, a   growing number of researchers are  working on alleviating this   linguistic digital divide, through  localisation efforts, the   development of BLARKs (basic language resource  kits) and practical   applications of human language technologies. The  joint SALTMIL/AfLaT   workshop on "Language technology for normalisation  of less-resourced   languages" provides a unique opportunity to connect  these researchers   and set up a common forum to meet and share the latest  developments in   the field. <br /> <br /></p>
<p>ORGANIZERS (SALTMIL and AfLaT)</p>
<p>* Mikel L. Forcada (SALTMIL): Machine Translation Group, School of  Computing, Dublin City University, Dublin, Ireland <br />* Guy De Pauw (AfLaT): CLiPS - Computational Linguistics Group,  University of Antwerp, Antwerp, Belgium <br />*   Gilles-Maurice de Schryver(AfLaT): African Languages and Cultures,    TshwaneDJe HLT, South Africa &amp; Ghent University, Belgium <br />* Kepa Sarasola(SALTMIL): Dept. of Computer Languages, University of the  Basque Country <br />* Francis M. Tyers(SALTMIL), Departament de Llenguatges i Sistemes  Informàtics, Universitat d'Alacant, Spain <br />* Peter Waiganjo Wagacha(AfLaT): School of Computing &amp; Informatics,  University of Nairobi, Nairobi, Kenya <br /> <br />PROGRAMME COMMITTEE</p>
<p>* Iñaki Alegria: University of the Basque Country <br />* Núria Bel, Universitat Pompeu Fabra, Barcelona, Spain <br />* Lars Borin, Göteborgs universitet, Sweden <br />* Sonja Bosch, University of South Africa, South Africa <br />* Khalid Choukri (ELRA,ELDA, France) <br />* Mikel L. Forcada, Universitat d’Alacant <br />* Dafydd Gibbon, University of Bielefeld, Germany <br />* Girish Nath Jha, Jawaharlal Nehru University, India <br />* Hrafn Loftsson,  Reykjavik University<br />* Guy De Pauw, CLiPS, Universiteit Antwerpen <br />* Laurette Pretorius, University of South Africa, South Africa <br />* Lori Levin, Carnegie Mellon University, USA <br />* Odetunji Odejobi, Obafemi Awolowo University, Nigeria <br />* Benoît Sagot, INRIA Paris Rocquencourt &amp; Université Paris 7, France<br />* Felipe Sánchez-Martínez, Universitat d'Alacant <br />* Kepa Sarasola, University of the Basque Country <br />* Kevin Scannell, Saint Louis University, USA <br />* Gilles-Maurice de Schryver, Universiteit Gent <br />* Trond Trosterud, Universitetet i Tromsø, Norway<br />* Francis M. Tyers, Universitat d'Alacant <br />* Peter Waiganjo Wagacha, University of Nairobi</p>
<p><strong><strong><br /></strong></strong>REGISTRATION<strong><strong><br /></strong></strong></p>
<p class="Textbody" style="padding-left: 30px;">See <a href="http://www.lrec-conf.org/lrec2012/?-Registration-">Registration</a> in LREC 2012 site</p>]]></description>
			<author>kepa.sarasola@ehu.es (Kepa Sarasola)</author>
			<category>Home</category>
			<pubDate>Fri, 20 Apr 2012 07:35:05 +0000</pubDate>
		</item>
		<item>
			<title>Report on the 7th SALTMIL LREC 2010 workshop</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/90-report-on-the-7th-saltmil-lrec-2010-workshop</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/90-report-on-the-7th-saltmil-lrec-2010-workshop</guid>
			<description><![CDATA[<p>On May 23rd 2010, SALTMIL held a morning workshop on "Creation and use of basic lexical resources for less-resourced languages". This was a satellite workshop preceding the biennial LREC (Language Resources and Evaluation Conference) in Malta.</p>
<p>The program was very full, beginning with two invited talks, each of which stimulated many questions and discussions. This was then followed by two oral presentations and a poster session with five contributed poster papers. Finally, after a brief presentation by Mikel Forcada (see slides), an interesting panel discussion took place on "Less resourced languages and Language technology. Short- and medium-term objectives".  There were many contributions to this discussion, among others those of Per Langgård, Khalid Choukri, Igor Leturia, Trond Trosterud, Benoît Sagot, Kepa Sarasola and Mikel Forcada. At 13.55, Mikel Forcada closed the panel session by thanking the audience for their participation in the panel and in the whole workshop.</p>
<p>About fifty people were present in total, from a wide range of countries, and representing work on a variety of less resourced languages.</p>
<p>Addional material realted to this workshop are available:</p>
<ul>
<li><a href="http://ixa2.si.ehu.es/%7Ejipsagak/SALTMIL2010_Proceedings.pdf">Proceedings    (pdf)</a></li>
<li><a href="http://ixa2.si.ehu.es/saltmil/index.php/en/activities-mainmenu-73/saltmil-workshops-mainmenu-77/84">Call  for papers and programme</a></li>
<li><a href="http://ixa2.si.ehu.es/%7Ejipsagak/panel-trebuchet.pdf">Slides</a> and <a href="http://ixa2.si.ehu.es/saltmil/index.php/en/news-mainmenu-287">summary</a> of the discussion panel.</li>
<li>Slides of invited talks (<a href="http://ixa2.si.ehu.es/~jipsagak/Kemps_Slides.pptx">Kemps-Sneijder</a> and <a href="http://ixa2.si.ehu.es/~jipsagak/GurrutxagaLeturia_Slides.pdf">GurrutxagaLeturia</a>)</li>
<li>Slides of oral presentations (<a href="http://ixa2.si.ehu.es/~jipsagak/Pirinen_Slides.pdf">Pirinen</a> and <a href="http://ixa2.si.ehu.es/~jipsagak/Pirinen_Slides.pdf">Bills</a>)</li>
</ul>]]></description>
			<author>kepa.sarasola@ehu.es (Kepa Sarasola)</author>
			<category>Home</category>
			<pubDate>Wed, 02 Jun 2010 13:12:32 +0000</pubDate>
		</item>
		<item>
			<title>New and temporary composition of SALTMIL committee</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/82-saltmil-committee-members</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/82-saltmil-committee-members</guid>
			<description><![CDATA[<p>The membership of the SALTMIL Committee is changing, with immediate<br />effect. The officers are now the following:<br /><br /></p>
<ul>
<li>Chairperson:  Mikel Forcada (Departament de Llenguatges i Sistemes  Informàtics, Universitat d'Alacant, Spain)</li>
<li>Secretary:    Francis Tyers (Departament de Llenguatges i Sistemes Informàtics, Universitat d'Alacant, Spain)</li>
<li>ISCA contact: Briony Williams (Language Technologies Unit, BangorUniversity, UK)</li>
</ul>
<ul>
<li>Member:       Kepa Sarasola (Lengoaia eta Sistema Informatikoak Saila, Euskal Herriko Unibertsitatea/Department of Computer Languages,University of the Basque Country, Spain)</li>
</ul>
<p><br />This is a temporary arrangement, and will be reviewed at the SALTMIL<br />Annual General Meeting in May 2010, when there will be an opportunity to elect members of<br />the Committee.</p>]]></description>
			<author>b.williams@bangor.ac.uk (Briony Williams)</author>
			<category>Home</category>
			<pubDate>Thu, 27 Aug 2009 14:44:15 +0000</pubDate>
		</item>
		<item>
			<title>MEMBERSHIP</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/13-membership</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/13-membership</guid>
			<description><![CDATA[	    <p>To become a member of the SALTMIL SIG (with voting rights), please send an email to the Secretary, Mikel Forcada, at: <a href="mailto:mlf@dlsi.ua.es">mlf@dlsi.ua.es</a> 	    In the email, please give the following information: 	    </p><ul> 	     	    <li>Your name, email address and&nbsp; affiliation (e.g. your university)</li>  	    <li>Whether or not you are an ISCA member</li> 	    <li>Your ISCA membership number if you are a member</li> 	    <li>Your minority language (optional)</li> 	    </ul> 	     	    <p>Please also consider becoming a member of ISCA (International Speech Communication Association): see <a href="http://www.isca-speech.org/membership.php" target="_blank">http://www.isca-speech.org/membership.php</a>  	    </p>]]></description>
			<author>asoraluce001@ikasle.ehu.es (Administrator)</author>
			<category>Home</category>
			<pubDate>Sat, 31 Mar 2007 19:05:38 +0000</pubDate>
		</item>
		<item>
			<title>SALTMIL AIMS</title>
			<link>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/12-saltmil-aims</link>
			<guid>http://ixa2.si.ehu.es/saltmil/index.php/en/home-mainmenu-28/13-home/12-saltmil-aims</guid>
			<description><![CDATA[<p>The <a href="http://www.isca-speech.org/" target="_blank">ISCA</a> (International Speech Communication Association) Special Interest              Group on Speech and Language Technology for Minority Languages has              the overall aim of promoting research and development in the field              of speech and language technology for lesser-used languages.</p>
<ul>
<li>Fuller statement              of <a href="http://ixa2.si.ehu.es/saltmil/index.php?option=com_content&amp;view=article&amp;id=17&amp;Itemid=77&amp;lang=en">aims and objectives</a> <a href="http://ixa2.si.ehu.es/saltmil/aims/aims/aims-of-isca-sig-speech-and-language-technology-for-minority-languages.html">.</a> </li>
</ul>
<ul>
<li>The <a href="http://ixa2.si.ehu.es/saltmil/index.php?option=com_content&amp;view=article&amp;id=27&amp;Itemid=77&amp;lang=en">proposal</a> for the SIG.</li>
</ul>]]></description>
			<author>asoraluce001@ikasle.ehu.es (Administrator)</author>
			<category>Home</category>
			<pubDate>Sat, 31 Mar 2007 19:03:28 +0000</pubDate>
		</item>
	</channel>
</rss>
