The
Multiling RST Treebank
is a corpus of
specialized texts in several languages, which have been manually
annotated with discourse relations of Rhetorical
Structure Theory (RST) by Mann and Thompson
(1988). Although it is dynamic corpus, currently it contains 15
parallel abstracts from the terminological domain, in English,
Spanish and Basque.RSTTool
(O'Donnel 2000) is the annotation interface used to annotate this
corpus.
In this website,
various contents can be extracted from the corpus:
Occurrences
of any discourse relation.Relations
of any particular text.Linear
segmentation of any particular text.
You are free to use any information from this website, but we would appreciate an acknowledgement. The proper way to cite the Multilingual RST Treebank is the following:Iruskieta, M. da Cunha, I. Taboada, M. 2014.
A Qualitative Comparison Method for Rhetorical Structures: Identifying different discourse structures in multilingual corpora. Languages Resources and Evaluation.
Participants:
- Mikel Iruskieta
- Iria da Cunha
- Maite Taboada
- Esther Miranda
- Kike Fernandez