Articles
Scholarly (and otherwise) papers on omorfi
This is a collection of scientific work describing or using omorfi. If you find that something is missing, please notify us via the Google group or by other means.
To get citations in BibTeX or other formats, feel free to use Google Scholar.
Papers about omorfi
The following works can be cited when referring to omorfi. To support reproducible research in NLP, include details about the exact version you used, e.g., a footnote containing the URL of the omorfi repository and the version tag from the releases page (or the git short hash, if you used a development version).
- Tommi A Pirinen (2015) Omorfi—Free and open source morphological lexical database for Finnish, in Proceedings of the 20th Nordic Conference of Computational Linguistics, NODALIDA 2015. http://www.ep.liu.se/ecp_article/index.en.aspx?issue=109;article=044
- Tommi A Pirinen (2011) Modularisation of Finnish Finite-State Language Description—Towards Wide Collaboration in Open Source Development of Morphological Analyser in Proceedings of Nodalida 2011 (18).
- Tommi Pirinen (2008), Suomen kielen äärellistilainen automaattinen morfologinen analyysi avoimen lähdekoodin menetelmin [Finite-state automatic morphological analysis of Finnish using open source methods], Master’s Thesis, University of Helsinki (in Finnish)
Papers using omorfi
This list is most likely incomplete; please suggest additions (or removals) if you have any.
Machine Translation
- Raphael Rubino, Tommi Pirinen, Miquel Esplà-Gomis, Nikola Ljubešić, Sergio Ortiz Rojas, Vassilis Papavassiliou, Prokopis Prokopidis and Antonio Toral (2015), Abu-MaTran at WMT 2015 Translation Task: Morphological Segmentation and Web Crawling, at WMT2015
- Lane Schwartz, Bill Bryce, Chase Geigle, Sean Massung, Yisi Liu, Haoruo Peng, Vignesh Raja, Subhro Roy and Shyam Upadhyay (2015), The University of Illinois submission to the WMT 2015 Shared Translation Task, at WMT2015
- Jörg Tiedemann, Filip Ginter and Jenna Kanerva (2015), Morphological Segmentation and OPUS for Finnish-English Machine Translation, at WMT2015
- Ann Clifton, Anoop Sarkar (2011). Combining morpheme-based machine translation with post-processing morpheme prediction, in Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL 2011.
- Ann Clifton, Anoop Sarkar, Morphology Generation for Statistical Machine Translation
- Ann Clifton (2010), Unsupervised Morphological Segmentation For Statistical Machine Translation, Doctoral dissertation, Simon Fraser University.
Universal Dependencies, syntax, treebanking
- Sampo Pyysalo (2015) Universal Dependencies for Finnish, In: Nordic Conference of Computational Linguistics NODALIDA
- Jenna Kanerva et al. (2014), Syntactic n-gram collection from a large-scale corpus of internet Finnish, Proceedings of the Sixth International Conference Baltic HLT.
- Bernd Bohnet, Joakim Nivre, Igor Boguslavsky, Filip Ginter and Jan Hajič (2013), Joint Morphological and Syntactic Analysis for Richly Inflected Languages, in Transactions of the Association for Computational Linguistics
- Kristiina Muhonen, Tanja Purtonen (2012), Rule-Based Detection of Clausal Coordinate Ellipsis, in LREC 2012
- Kristiina Muhonen, Tanja Purtonen (2012), Detecting Semantic Ambiguity: Alternative Readings in Treebanks
- Mozgovoy, Maxim (2010) Extensible dependency grammar for education: ideas and experiments, J Converg (JoC) 1.1
OCR
- Silfverberg, Miikka, and Jack Rueter (2015) Can Morphological Analyzers Improve the Quality of Optical Character Recognition? In: First International Workshop of Computational Linguistics for Uralic Languages, Septentrio Conference Series, No. 2, 2015.
Semantic Web
- Eetu Mäkelä (2014) Combining a REST lexical analysis web service with SPARQL for mashup semantic annotation from text. In: The Semantic Web: ESWC 2014 Satellite Events.
- Reetta Sinkkilä, O. Suominen, E. Hyvönen (2011), Automatic semantic subject indexing of web documents in highly inflected languages, The Semantic Web: Research and …, 2011
- E. Ahonen, Eero Hyvönen (2009), Publishing Historical Texts on the Semantic Web-A Case Study, in Semantic Computing, 2009. ICSC’09.
Spell-Checking
- Tommi A Pirinen (2014), Weighted Finite-State Methods for Spell-Checking and Correction, PhD thesis
- Tommi A Pirinen (2014), State-of-the-Art in Weighted Finite-State Spell-Checking, in CICLing 2014, proceedings in LNCS
- Tommi A Pirinen, Sam Hardwick (2012), Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction, in Proceedings of 10th International Workshop on Finite-State Methods and/in Natural Language Processing FSMNLP 2012
- Tommi A Pirinen, Miikka Silfverberg (2012), Improving Finite-State Spell-Checker Suggestions with Part-of-Speech N-grams, in Proceedings of International Conference on Intelligent Text Processing and Computational Linguistics CICLING 2012
- Miikka Silfverberg, Mirka Hyvärinen, Tommi A Pirinen (2011), Improving Predictive Entry of Finnish Text Messages using IRC Logs, in Proceedings of the Computational Linguistics-Applications Conference 2011.
For further results, see http://scholar.google.fi/scholar?q=omorfi