The main objective of this course is presentation of the main concepts and models for the computational treatment of morphology (regular expressions, finite-state automata etc). The course will center mainly on languages with rich morphology. During the course, the student will use the foma tool, a free and open source tool for finite-state processing. The course also introduces neural sequence-to-sequence (seq2seq) models as applied to morphology using the OpenNMT tool.
Syllabus
INTRODUCTION
FORMAL LANGUAGES AND THEIR APPLICATIONS
2.1. Basic concepts
2.2. Regular languages
MORPHOLOGY
3.1. Formalisms for computational morphology
3.2. Morphotactics and Phonology. Phenomena in several languages
3.3. Finite-State Morphology. Examples and problems using FOMA: Lexicon, Morphology, Surface-Syntax
3.4 Morphological analysis and generation using seq2seq methods.