2012 ParallelDataToolsandInterfacesi

From GM-RKB
Jump to navigation Jump to search

Subject Headings: OPUS; Parallel Corpus; Opensubtitles2011; Europarl Corpus.

Notes

Cited By

Quotes

Author Keywords

Abstract

This paper presents the current status of OPUS, a growing language resource of parallel corpora and related tools. The focus in OPUS is to provide freely available data sets in various formats together with basic annotation to be useful for applications in computational linguistics, translation studies and cross-linguistic corpus studies. In this paper, we report about new data sets and their features, additional annotation tools and models provided from the website and essential interfaces and on-line services included in the project.

References

BibTeX

@inproceedings{2012_ParallelDataToolsandInterfacesi,
  author    = {Jorg Tiedemann},
  editor    = {Nicoletta Calzolari and
               Khalid Choukri and
               Thierry Declerck and
               Mehmet Ugur Dogan and
               Bente Maegaard and
               Joseph Mariani and
               Jan Odijk and
               Stelios Piperidis},
  title     = {Parallel Data, Tools and Interfaces in OPUS},
  booktitle = {Proceedings of the Eighth International Conference on Language Resources
               and Evaluation (LREC 2012)},
  pages     = {2214--2218},
  publisher = {European Language Resources Association (ELRA)},
  year      = {2012},
  url       = {http://www.lrec-conf.org/proceedings/lrec2012/summaries/463.html},
}


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2012 ParallelDataToolsandInterfacesiJorg TiedemannParallel Data, Tools and Interfaces in OPUS2012