Machine Translation Archive

Index of Languages

Publications since 2010

Languages M to Z

[For languages A to D click here; for languages E click here, for languages F to L click here]

[For publications 2000-2009 here; for publications before 2000 click here]

[to return to home page click here]

Macedonian to Bulgarian

(2012) Preslav Nakov & Jörg Tiedemann: Combining word-level and character-level models for machine translation between closely-related languages. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.301-305. [PDF, 122KB]

(2011) Tihomir Rangelov: Rule-based machine translation between Bulgarian and Macedonian. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.53-59. [PDF, 164KB]

Macedonian to English

(2013) Jörg Tiedemann & Preslav Nakov: Analyzing the use of character-level translation with sparse and noisy datasets. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.676-684. [PDF, 133KB]

Malagasy

(2014) Rohit Dholakia & Anoop Sarkar: Pivot-based triangulation for low-resource languages. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.315-328. [PDF, 580KB]

Malay

(2012) Preslav Nakov & Hwee Tou Ng: Improving statistical machine translation for a resource-poor language using related resource-rich languages. Journal of Artificial Intelligence Research 44 (2012); pp.179-222. [PDF, 421KB]

Malay to English

(2011) Preslav Nakov & Hwee Tou Ng: Translating from morphologically complex languages: a paraphrase-based approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1298-1307. [PDF, 156KB]

(2012) Pidong Wang, Preslav Nakov, & Hwee Tou Ng: Source language adaptation for resource-poor machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.286-296. [PDF, 171KB]

Malay to Indonesian

(2012) Raymond Hendy Susanto, Septina Dian Larasati, & Francis M.Tyers: Rule-based machine translation between Indonesian and Malaysian. COLING 2012: Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language processing (SANLP), Mumbai, December 2012; pp.191-199. [PDF, 178KB]

Maltese

(2014) Michael Rosner & Kurt Sultana: Automatic methods for the extension of a bilingual dictionary using comparable corpora. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3790-3797. [PDF, 195KB]

 (2012) Mike Rosner & Jan Joachimsen: Maltese: mixed language and multilingual technology [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.27-28. [PDF]

(2010) Dana Dannélls & John J.Camilleri: Verb morphology of Hebrew and Maltese – towards an open source type theoretical resource grammar in GF. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 57-61. [PDF, 326KB]

(2010) Mike Rosner: Maltese on the brink. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 17pp. [PDF, 1286KB]

Manipuri to Bengali

(2012) Thoudam Doren Singh: Bidirectional Bengali script and Meetei Mayek: transliteration of web based Manipuri news corpus. COLING 2012: Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language processing (SANLP), Mumbai, December 2012; pp.181-189. [PDF, 591KB]

Manipuri to English

 (2011) Thoudam Doren Singh & Sivaji Bandyopadhyay: Integration of reduplicated multiword expressions and named entities in a phrase based statistical machine translation system. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1304-1312. [PDF, 658KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Manipuri-English bidirectional statistical machine translation systems using morphology and dependency relations. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.83-91. [PDF, 510KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Manipuri-English example based machine translation system [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 64KB]

Marathi

(2011) Mitesh M.Khapra, Salil Joshi, & Pushpak Bhattacharyya: It takes two to tango: a bilingual unsupervised approach for estimating sense distributions using expectation maximization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.695-704. [PDF, 280KB]

Marathi to English

(2012) M.L.Dhore, S.K.Dixit, & R.M.Dhore: Hindi and Marathi to English NE transliteration tool using phonology and stress analysis. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp.111-118. [PDF, 301KB]

(2012) M.L.Dhore, S.K.Dixit, & R.M.Dhore: Optimizing transliteration for Hindi/Marathi to English using only two weights. COLING: Proceedings of the First International Workshop on Optimization Techniques for Human Language Technology, December 2012; pp. 31-47. [PDF, 293KB]

Marathi to Hindi

(2011) Mitesh M.Khapra, Salil Joshi, Arindam Chatterjee, & Pushpak Bhattacharyya: Together we can: bilingual bootstrapping for WSD. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.561-569. [PDF, 137KB]

Mongolian

(2012) Odbayar Chimeddorj & Atsushi Fujii: Enhancing lemmatization for Monglian and its application to statistical machine translation. COLING 2012: Proceedings of the 10th Workshop on Asian Language Resources, Mumbai, December 2012; pp.115-123. [PDF, 675KB]

Norwegian

(2014) Lars Hellan, Dorothee Beermann, Tore Bruland, Mary Esther Kropp Dakubu, Montserrat Marimon: MultiVal – towards a multilingual valence lexicon. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2478-2485. [PDF, 247KB]

Pashto

(2010) Brian Roberson: Multi-language desktop suite. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

Pashto to English

(2010) Brian Roberson: Multi-language desktop suite. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

Persian

(2014) Weston Feely, Mehdi Manshadi, Robert Frederking, & Lori Levin: The CMU METAL Farsi NLP approach. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4052-4055. [PDF, 206KB]

(2013) Behzad Mirzababaei, Heshaam Faili, & Nava Ehsan: Discourse-aware statistical machine translation as a context-sensitive spell checker. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.475-482. [PDF, 1025KB]

(2012) Chris Irwin Davis: Tajik-Farsi Persian transliteration using statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3988-3995. [PDF, 590KB]

(2012) Masood Ghayoomi: From grammar rule extraction to treebanking: a bootstrapping approach. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1912-1919. [PDF, 1203KB]

Persian to Arabic

(2013) Ahmed El Kholy, Nizar Habash, Gregor Leusch, Evgeny Matusov, & Hassan Sawaf: Language independent connectivity strength features for phrasal pivot statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.412-418. [PDF, 200KB]; revised version.

(2013) Ahmed El Kholy, Nizar Habash, Gregor Leusch, Evgeny Matusov, & Hassan Sawaf: Selective combination of pivot and direct statistical machine translation models. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1174-1180. [PDF, 123KB]

(2010) Evgeny Matusov & Selçuk Köprü: Improving reordering in statistical machine translation from Farsi. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 148KB]

Persian to English

(2013) Nicola Bertoldi, M.Amin Farajian, Prashant Mathur, Nicholas Ruiz, & Marcello Federico: FBK’s machine translation systems for the IWSLT 2013 evaluation campaign. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 134KB]

(2013) Trevor Cohn & Gholamreza Haffari: An infinite hierarchical Bayesian model of phrasal translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.780-790. [PDF, 432KB]

(2013) Michaeel Kazi, Michael Coury, Elizabeth Salesky, Jessica Ray, Wade Shen, Terry Gleason, Tim Anderson, Grant Erdmann, Lane Schwartz, Brian Ore, Raymond Slyh, Jeremy Gwinnup, Katherine Young, & Michael Hutt: The MIT-LL/AFRL IWSLT-2013 MT system. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 195KB]

(2013) Mohammad Sadegh Rasooli, Ahmed El Kholy, & Nizar Habash: Orthographic and morphological processing for Persian-to-English statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1047-1051. [PDF, 106KB]

(2012) Fattaneh Jabbari, Somayeh Bakhshaei, Seyed Mohammad Mohammadzadeh Ziabary, & Shahram Khadivi: Developing an open-domain English-Farsi translation system using AFEC, Amirkabir bilingual Farsi-English corpus. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.17-23. [PDF, 683KB]

(2011) Najmeh Mousavi Nejad, Shahram Khadivi, & Kaveh Taghipour: The Amirkabir machine transliteration system for NEWS 2011: Farsi-English task. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.91-95. [PDF, 116KB]

(2010) Martin Čmejrek & Bowen Zhou: Two methods for extending hierarchical rules from the bilingual chart parsing. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.180-188. [PDF, 200KB]

(2010) Evgeny Matusov & Selçuk Köprü: Improving reordering in statistical machine translation from Farsi. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 148KB]

(2010) Mahsa Mohaghegh, Abdolhossein Sarrafzadeh, & Tom Moir: Improved language modeling for English-Persian statistical machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.75-82. [PDF, 167KB]

(2010) Brian Roberson: Multi-language desktop suite. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Kristen Summers & Hassan Sawaf: User-generated system for critical document triage and exploitation – version 2011. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

Philippine languages see Cebuano, English to Tagalog, Tagalog to English

Polish

(2012) Adam Przepiórkowski: Polish language resources and tools: towards multilinguality [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; p.31. [PDF]

(2011) Maciej Ogrodniczuk & Adam Przepiórkowski: Polish LRTs: CESAR’s story.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 14 slides [PDF of PPT, 324KB]

(2011) Adam Radziszewski & Tomasz Śniatowski: Maca – a configurable tool to integrate Polish morphological data. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.29-36. [PDF, 157KB]

Polish to English

(2014) Rafał Jaworski & Renata Ziemlińska: TRANSLAIDE. The translaide.pl system: an effective real world installation of translation memory searching and EBMT. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; p.53. [PDF-301KB]

(2013) Krzysztof Wolk & Krzysztof Marasek: Polish-English speech statistical machine translation systems for the IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 511KB]

(2012) Krzysztof Marasek: TED Polish-to-English translation system for the IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.126-129. [PDF, 599KB]

(2012) Alexandros Poulis & David Kolovratnik: To post-edit or not to post-edit? Estimating the benefits of MT post-editing for a European organization. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 9pp. [PDF, 390KB]

(2012) Sandra Weiss & Lars Ahrenberg: Error profiling for evaluation of machine-translated text: a Polish-English case study.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1764-1770. [PDF, 386KB]

(2010) Filip Graliński: Mining parenthetical translations for Polish-English lexica [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 58KB]

Portuguese

(2012) Antonio Branco: Language technology for Portuguese: progress and prospects [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.31-32. [PDF]

(2012) Roger Granada, Lucelene Lopes, Carlos Ramisch, Cassia Trojahn, Renata Vieira, & Aline Villavicencio: A comparable corpus based on aligned multilingual ontologies. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.25-31. [PDF, 108KB]

(2012) Preslav Nakov & Hwee Tou Ng: Improving statistical machine translation for a resource-poor language using related resource-rich languages. Journal of Artificial Intelligence Research 44 (2012); pp.179-222. [PDF, 421KB]

(2011) Luís Marujo, Nuno Grazina, Tiago Luís, Wang Ling, Luísa Coheur, & Isabel Trancoso: BP2EP – adaptation of Brazilian Portuguese texts to European Portuguese. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.129-136. [PDF, 352KB]

Portuguese to Chinese

(2011) Fai Wong, Francisco Oliveira, Sam Chao, & Chi-Wai Tang: Handheld machine translation system based on constraint synchronous grammar. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.439-446. [PDF, 158KB]

(2010 Francisco Oliveira, Fai Wong, Sam Chao, & Chi-Wai Tang:  Portuguese-Chinese machine aided translation system. International Translators' Federation (FIT) 6th Asian Translator's Forum, Macau SAR.; 10pp. [PDF, 364KB]

 (2010) Francisco Oliveira, Fai Wong, & Iok-Sai Hong: Systematic processing of long sentences in rule based Portuguese-Chinese machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 58KB]

(2010) Wong Fai & Chao Sam: PCT: Portuguese-Chinese machine translation systems [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 181-196.

Portuguese to English (see also Brazilian Portuguese to English)

(2013) Anabela Barreiro, Johanna Monti, Brigitte Orliac, & Fernando Batista: When multiwords go bad in machine translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.26-33. [PDF, 219KB]

(2012) Ińaki San Vicente & Iker Manterola: PaCo2: a fully automated tool for gathering parallel corpora from the Web.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1-6. [PDF, 415KB]

(2012) Joăo Silva, Luísa Coheur, Ângela Costa, & Isabel Trancoso: Dealing with unknown words in statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3977-3981. [PDF, 406KB]

 (2012) Justina Valotkaite & Munshi Asadullah: Error detection for post-editing rule-based machine translation. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 9pp. [PDF, 117KB]

(2010) Paula Paiva: Corpus representativeness in the selection of medical terms to be used in translation memory tools [abstract]. UCCTS 2010: Using Corpora in Contrastive and Translation Studies, Edge Hill University, UK, 27-29 July 2010; p.15. [PDF, 91KB]

Portuguese to Spanish

(2010) Luciana Cecilia Ramos: Post-editing free machine translation: from a language vendor’s perspective. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 37KB]

Pushto see Pashto

Quechua

(2012) Annette Rios & Anne Göhring: A tree is a Baum is an árbol is a sach’a: creating a trilingual treebank.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1874-1879. [PDF, 360KB]

(2011) Alex Rudnick: A resource-light approach to learning verb valencies. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 54KB]; presentation: 31 slides [PDF of PPT, 697KB]

(2011) Alex Rudnick: Towards cross-language word sense disambiguation for Quechua. [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 201, Hissar, Bulgaria, 13 September 2011; pp.133-138. [PDF, 125KB]

Romance languages

(2014) Yves Scherrer & Benoît Sagot: A language-independent and fully unsupervised approach t lexicon induction and part-of-speech tagging for closely related languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.502-508. [PDF, 184KB]

Romanian

(2012) Dan Cristea & Ionuţ Cristian Pistol: Multilingual linguistic workflows [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.28-30. [PDF]

(2012) Iustina Ilisei, Diana Inkpen, Gloria Corpas, & Ruslan Mitkov: Romanian translational corpora: building comparable corpora for translation studies.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.56-61. [PDF, 357KB]

Romanian to English

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Building specialized bilingual lexicons using word sense disambiguation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.952-956. [PDF, 117KB]

(2013) Dhouha Bouamor, Adrian Popescu, Nasredine Semmar, & Pierre Zweigenbaum: Building specialized bilingual lexicons using large-scale background knowledge. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp. 479-489. [PDF, 246KB]

 (2013) Dan Tufiş, Radu Ion, Ştefan Daniel Dumitrescu, & Dan Ştefănescu: Wikipedia as an SMT training corpus. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.702-709. [PDF, 220KB]

 (2012) Ştefan Daniel Dumitrescu, Radu Ion, Dan Ştefănescu, Tiberiu Boroş, & Dan Tufiş: Romanian to English automatic MT experiments at IWSLT12 – system description paper. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.136-143. [PDF, 812KB]

 (2012) Elena Irimia: Experimenting with extracting lexical dictionaries from comparable corpora for English-Romanian language pair.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.49-55. [PDF, 738KB]

(2012) Dan Ştefănescu: Mining for term translations in comparable corpora.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.98-103. [PDF, 616KB]

(2012) Dan Tufiş & Ştefan Daniel Dumitrescu: Cascaded phrase-based statistical machine translation systems. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.129-136. [PDF, 252KB]

 (2011) Monica Gavrila & Natalia Elita: Comparing corpus-based MT approaches using restricted resources. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politčcnica de Catalunya, Barcelona; pp.43-49. [PDF, 304KB]

 (2011) Monica Gavrila & Natalia Elita: Experiments with small-sized corpora in CBMT. [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 2011, Hissar, Bulgaria, 13 September 2011; pp.67-72. [PDF, 113KB]

(2011) Monica Gavrila: Constrained recombination in an example-based machine translation system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. M ikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.193-200. [PDF, 417KB]; presentation, 36 slides [PDF]

Romanian to German

(2011) Monica Gavrila: Constrained recombination in an example-based machine translation system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. M ikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.193-200. [PDF, 417KB]; presentation, 36 slides [PDF]

 (2011) Monica Gavrila & Natalia Elita: Experiments with small-sized corpora in CBMT. [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 2011, Hissar, Bulgaria, 13 September 2011; pp.67-72. [PDF, 113KB]

Romansch

(2010) Martin Volk, Noah Bubenhofer, Adrian Althaus, Maya Bangerter, Lenz Furrer, & Beni Ruef: Challenges in building a multilingual Alpine heritage corpus. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1653-1659. [PDF, 1296KB]

Russian

(2013) Evgeniy Shin, Sebastian Stüker, Kevin Kilgour, Christian Fügen, & Alex Waibel: Maximum entropy language modeling for Russian ASR. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 167KB]

(2013) Keiko Taguchi, Andrew Finch, Seiichi Yamamoto, & Eiichiro Sumita: Inducing Romanization systems. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.19-26. [PDF, 700KB]

(2013) Svitlana Volkova, Theresa Wilson, & David Yarowsky: Exploring sentiment in social media: bootstrapping subjectivity clues from multilingual Twitter streams.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.505-510. [PDF, 317KB]

(2011) Marco Turchi, Vanni Zavarella, & Hristo Tanev: Pattern learning for event extraction using monolingual statistical machine translation. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.371-377. [PDF, 148KB]

Russian to Arabic

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

Russian to Chinese

(2010) William McIntyre: Translation of Chinese entities in Russian text. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

Russian to English

(2014) Alexey Borisov & Irina Galinskaya: Yandex School of Data Analysis Russian-English machine translation system for WMT14. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.66-70. [PDF, 245KB]

(2014) Irina Burukina: Translating implicit elements in RBMT. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.182-193. [PDF, 209KB]

(2014) Nadir Durrani, Barry Haddow, Philipp Koehn, & Kenneth Heafield: Edinburgh’s phrase-based machine translation systems for WMT-14. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.97-104. [PDF, 358KB]

(2014) Lane O.B.Schwartz, Timothy Anderson, Jeremy Gwinnup, & Katherine M.Young: Machine translation and monolingual postediting: the AFRL WMT-14 system. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.186-194. [PDF, 539KB]

(2014) Philip Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Eva Hasler, & Philipp Koehn: Edinburgh’s syntax-based systems at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.207-214. [PDF, 222KB]

(2013) Waleed Ammar, Victor Chahuneau, Michael Denkowski, Greg Hanneman, Wang Ling, Austin Matthews, Kenton Murray, Nicola Segall, Alon Lavie & Chris Dyer: The CMU machine translation systems at WMT 2013: syntax, synthetic translation options, and pseudo-references.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.70-77. [PDF, 201KB]

(2013) Alexey Borisov,Jacob Dlougach & Irina Galinskaya: Yandex School of Data Analysis machine translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.99-103. [PDF, 125KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB]

(2013) Vladimir Eidelman, Ke Wu, Ferhan Ture, Philip Resnik & Jimmy Lin: Towards efficient large-scale feature-rich statistical machine translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.128-133. [PDF, 342KB]

(2013) Michaeel Kazi, Michael Coury, Elizabeth Salesky, Jessica Ray, Wade Shen, Terry Gleason, Tim Anderson, Grant Erdmann, Lane Schwartz, Brian Ore, Raymond Slyh, Jeremy Gwinnup, Katherine Young, & Michael Hutt: The MIT-LL/AFRL IWSLT-2013 MT system. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 195KB]

(2013) Evgeny Matusov & Gregor Leusch: OmnifluentTM English-to-French and Russian-to-English systems for the 2013 Workshop on Statistical Ma­chine Translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.158-163. [PDF, 136KB]

(2013) Maria Nadejde, Philip Williams, & Philipp Koehn: Edinburgh’s syntax-based machine translation systems. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.170-176. [PDF, 161KB]

(2013) Juan Pino, Aurelien Waite, Tong Xiao, Adriŕ de Gispert, Federico Flego & William Byrne: The University of Cambridge Russian-English system at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.200-205. [PDF, 169KB]

(2013) Hassan Sajjad, Svetlana Smekalova, Nadir Durrani, Alexander Fraser & Helmut Schmid: QCRI-MES submission at WMT13: using transliteration mining to improve statistical machine trans­lation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.219-224. [PDF, 205KB]

(2013) Svetlana Sheremetyeva: On integrating hybrid and rule-based components for patent MT with several levels of output. [MT Summit XIV] Proceedings of the 5th Workshop on Patent Translation, Nice, September 2, 2013; pp.8-15. [PDF, 1061KB]

(2013) Patrick Simianer, Laura Jehl, & Stefan Riezler: The Heidelberg University machine translation systems for IWSLT2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 224KB]

(2013) Yulia Tsvetkov, Chris Dyer, Lori Levin & Archna Bhatia: Generating English determiners in phrase-based translation with synthetic translation options.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.271-280. [PDF, 291KB]

(2013) Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Dur­rani, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions at WMT13: morphological and syntactic processing for SMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.232-239. [PDF, 160KB]

(2012) Andrew Finch, Ohnmar Htun, & Eiichiro Sumita: The NICT translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.121-125. [PDF, 621KB]

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

(2012) Katherine M.Young, Jeremy Gwinnup, & Joshua Reinhart: Reversing the Palladius mapping of Chinese names in Russian text. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 5pp. [PDF, 202KB]; abstract, 1p. [PDF, 74KB]; presentation, 37 slides [PDF of PPT, 1386KB]

(2011) Alexandra Antonova & Alexey Misyurev: Building a web-based parallel corpus and filtering out machine-translated text. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.136-144. [PDF, 217KB]

(2010) Peter Nabende: Mining transliterations from Wikipedia using pair HMMs. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.76-80. [PDF, 145KB]

Russian to Hungarian

(2012) Martina Katalin Szabó, Veronika Vincze, & István Nagy T.: HunOr: a Hungarian-Russian parallel corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2453-2458. [PDF, 396KB]

Russian to Turkish

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

Sámi to Norwegian

(2012) Trond Trosterud & Kevin Brubeck Unhammer: Evaluating North Sámi to Norwegian assimilation RBMT. In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina Espańa-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.13-26. [PDF, 570KB]

Scandinavian languages

(2011) Sara Stymne: Pre- and postprocessing for statistical machine translation into Germanic languages. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.12-17. [PDF, 93KB]

Serbian

(2012) Cvetana Krsteva & Duško Vitas: Construction and exploitation of X-Serbian bitexts [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.25-26. [PDF]

Serbo-Croatian to Macedonian

(2012) Hrvoje Peradin & Francis Tyers: A rule-based machine translation system from Serbo-Croatian to Macedonian. In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina Espańa-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.55-63. [PDF, 431KB]

Slavic languages

(2014) Yves Scherrer & Benoît Sagot: A language-independent and fully unsupervised approach t lexicon induction and part-of-speech tagging for closely related languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.502-508. [PDF, 184KB]

(2011) Marko Tadić: The CESAR project: enabling LRT for 70m+ speakers.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 24 slides [PDF of PPT, 810KB]

Slovenian

(2014) Raphael Rubino, Antonio Toral, Nikola Ljubešić, & Gema Ramírez-Sánchez: Quality estimation for synthetic parallel data generation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1843-1849. [PDF, 248KB]

(2013) Nikola Ljubešić & Darja Fišer: Identifying false friends between closely related languages. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.69-77. [PDF, 342KB]

(2013) Yves Scherrer & Tomaž Erjavec: Modernizing historical Slovene words with character-based SMT. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.58-62. [PDF, 110KB]

(2012) Darja Fišer: Language resources and tools for semantically enhanced processing of Slovene [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; p.22. [PDF]

(2011) Špela Vintar & Darja Fišer: Enriching Slovene WordNet with domain-specific terms. Translation: Computation, Corpora,  Cognition 1 (1), December 2011; pp.29-44. [PDF, 631KB]

Slovenian to English

(2013) Joern Wuebker, Stephan Peitz, Tamer Alkhouli, Jan-Thorsten Peter, Minwei Feng, Markus Freitag & Hermann Ney: The RWTH Aachen machine translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 128KB]

(2012) Nikola Ljubešić, Špela Vintar, & Darja Fišer: Multi-word term extraction from comparable corpora by combining contextual and constituent clues.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.143-147. [PDF, 376KB]

Slovenian to Macedonian

(2010) Jernej Vičič & Petr Homola: Speeding up the implementation process of a shallow transfer machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

Slovenian to Serbian

(2010) Jernej Vičič & Petr Homola: Speeding up the implementation process of a shallow transfer machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

Spanish

(2014) Lars Hellan, Dorothee Beermann, Tore Bruland, Mary Esther Kropp Dakubu, Montserrat Marimon: MultiVal – towards a multilingual valence lexicon. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2478-2485. [PDF, 247KB]

(2013) Juri Ganitkevitch, Benjamin Van Durme, & Chris Callison-Burch: PPDB: the paraphrase database. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.758-764. [PDF, 217KB]

(2013) Svitlana Volkova, Theresa Wilson, & David Yarowsky: Exploring sentiment in social media: bootstrapping subjectivity clues from multilingual Twitter streams.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.505-510. [PDF, 317KB]

(2012) Annette Rios & Anne Göhring: A tree is a Baum is an árbol is a sach’a: creating a trilingual treebank.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1874-1879. [PDF, 360KB]

(2012) V.M.Sánchez-Cartagena, M.Esplŕ-Gomis, F.Sánchez-Martínez, & J.A.Pérez-Ortiz: Choosing the correct paradigm for unknown words in rule-based machine translation systems.  In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina Espańa-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.27-39. [PDF, 502KB]

(2011) Miquel Esplŕ-Gomis, Victor M.Sánchez-Cartagena, & Juan Antonio Pérez-Ortiz: Enlarging monolingual dictionaries for machine translation with active learning and non-expert users. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.339-346. [PDF, 148KB]

(2011) Miquel Esplŕ-Gomis, Víctor M.Sánchez-Cartegna & Juan Antonio Pérez-Ortiz: Multimodal building of monolingual dictionaries for machine translation by non-expert users. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.147-154. [PDF, 163KB]

(2011) Kevin Kilgour, Christian Saam, Christian Mohr, Sebastian Stüker, & Alex Waibel: The 2011 KIT QUAERO speech-to-text system for Spanish. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.199-205. [PDF, 236KB]

 (2011) Elena Lloret & Manuel Palomar: Finding the best approach for multi-lingual text summarisation: a comparative analysis. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.194-201. [PDF, 386KB]

(2011) Marco Turchi, Vanni Zavarella, & Hristo Tanev: Pattern learning for event extraction using monolingual statistical machine translation. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.371-377. [PDF, 148KB]

(2010) Horacio Saggion, Juan-Manuel Torres-Moreno, Iria da Cunha, Eric SanJuan, & Patricia Velázquez-Morales: Multilingual summarization evaluation without human models. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1059-1067. [PDF, 238KB]

Spanish to Aragonese

(2012) Juan Pablo Martínez Cortés, Jim O’Regan, & Francis M.Tyers: Free/open source shallow-transfer based machine translation for Spanish and Aragonese.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2153-2157. [PDF, 316KB]

Spanish to Basque

(2012) Aingeru Mayor, Mans Hulden, & Gorka Labaka: Developing an open-source FST grammar for verb chain transfer in a Spanish-Basque MT system. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.65-69. [PDF, 157KB]

(2011) Cristina Espańa-Bonet, Lluís Mŕrquez, Gorka Labaka, Arantza Díaz de Ilarraza, & Kepa Sarasola: Hybrid machine translation guided by a rule-based system. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.554-561. [PDF, 268KB]

(2011) Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Steps taken in Spanish-Basque speech translation using stochastic finite-state transducers. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23-27 January, 2011; 2pp. [PDF, 90KB]

(2010) Alberto Barrón-Cedeńo, Paolo Rosso, Eneko Agirre, & Gorka Labaka: Plagiarism detection across distant language pairs. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.37-45. [PDF, 274KB]

(2010) Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Potential scope of a fully-integrated architecture for speech translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 448KB]

Spanish to Catalan

(2011) Xavier Ivars-Ribes & Victor M.Sánchez-Cartagena: A widely used machine translation service and its migration to a free/open-source solution: the case of Softcatalŕ. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.61-68. [PDF, 385KB]

(2010) Marta R.Costa-jussŕ, Mireia Farrús, José B.Marińo, & José A.R.Fonollosa: Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1706-1711. [PDF, 335KB]

(2010) Mireia Farrús, Marta R.Costa-jussŕ, José B.Marińo, & José A.R.Fonollosa: Linguistic-based evaluation criteria to identift statistical machine translation errors. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 606KB]

Spanish to Chinese

(2011) Rafael E.Banchs: Comparative evaluation of Spanish segmentation strategies for Spanish-Chinese transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.41-48. [PDF, 198KB]

Spanish to English

(2014) Daniel Beck, Kashif Shah, & Lucia Specia: SHEF-Lite 2.0: sparse multi-task Gaussian processes for translation quality estimation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.307-312. [PDF, 306KB]

(2014) Jeffrey Killman: Vocabulary accuracy of statistical machine translation in the legal context. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.85-98. [PDF, 205KB]

(2014) Isabel Lacruz, Michael Denkowski, & Alon Lavie: Cognitive demand and cognitive effort in post-editing. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.73-84. [PDF, 126KB]

(2014) Arle Lommel, Aljoscha Burchardt, Maja Popović, Kim Harris, Eleftherios Avramidis, & Hans Uszkoreit: Using a new analytic measure for the annotation and analysis of MT errors on real data. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.165-172. [PDF, 1781KB]

(2014) Pilar Sánchez-Gijón, & Olga Torres-Hostench: MT post-editing into the mother tongue or into a foreign language?  Spanish-to-English MT translation output post-edited by translation trainees. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.5-19. [PDF, 226KB]

(2014) Lucia Specia, & Kashif Shah: Predicting human translation quality. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.288-300. [PDF, 717KB]

(2013) Alexandre Allauzen, Nicolas Pécheux, Quoc Khanh Do, Marco Dinarelli, Thomas Lavergne, Au­rélien Max, Hai-Son Le & François Yvon: LIMSI @ WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.62-69. [PDF, 210KB]

(2013) Qing Dou & Kevin Knight: Dependency-based decipherment for resource-limited machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1668-1676. [PDF, 153KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB]

(2013) Nadir Durrani, Alexander Fraser, & Helmut Schmid: Model with minimal translation units, but decode with phrases. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1-11. [PDF, 172KB]

(2013) Mark Fishel: Ranking translations using error analysis and quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.405-407. [PDF, 98KB]

(2013) Chi-kiu Lo & Dekai Wu: MEANT at WMT 2013: A tunable, accurate yet inexpensive semantic frame based MT evaluation metric. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.422-428. [PDF, 563KB]

(2013) Oscar Mendoza Rivera, Ruslan Mitkov, & Gloria Corpas Pastor: A flexible framework for collocation retrieval and translation from parallel and comparable corpora. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.18-25. [PDF, 356KB]

(2013) Lisa N.Michaud & Patricia Ann McCoy: Applying machine translation metrics to student-written translations. [NAACL HLT 2013] Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, Atlanta, Georgia, June 13 2013; pp.306-311. [PDF, 481KB]

(2013) Tsuyoshi Okita, Qun Liu, & Josef van Genabith: Shallow semantically-informed PBSMT and HPBSMT. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.177-184. [PDF, 281KB]

(2013) Matt Post, Gaurav Kumar, Adam Lopez, Damianos Karakos, Chris Callison-Burch, & Sanjeev Khudanpur: Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 192KB]

(2013) Majid Razmara & Anoop Sarkar: Stacking for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.334-339. [PDF, 284KB]

(2013) Raphael Rubino, Antonio Toral, Santiago Cortés Vaíllo, Jun Xie, Xiaofeng Wu, Stephen Doherty, & Qun Liu: The CNGL-DCU-Prompsit translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.213-218. [PDF, 152KB]

(2013) Dan Tufiş, Radu Ion, Ştefan Daniel Dumitrescu, & Dan Ştefănescu: Wikipedia as an SMT training corpus. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.702-709. [PDF, 220KB]

(2013) Ashish Vaswani, Yinggong Zhao, Victoria Fossum, & David Chiang: Decoding with large-scale neural language models improves translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1387-1392. [PDF, 107KB]

(2013) Ivan Vulić & Marie-Francine Moens: A study on bootstrapping bilingual vector spaces from non-parallel data (and nothing else). [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 261KB]

(2013) Heng Yu, Liang Huang, Haitao Mi, & Kai Zhou: Max-violation perceptron and forced decoding for scalable MT training. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1112-1123. [PDF, 374KB]

(2012) Eleftherios Avramidis, Marta R.Costa-jussŕ, Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: A richly annotated, multilingual parallel corpus for hybrid machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2189-2193. [PDF, 433KB]

 (2012) Amittai Axelrod, QingJun Li, & William D.Lewis: Applications of data selection via cross-entropy difference for real-world statistical machine translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.201-208. [PDF, 610KB]

(2012) Julio Castillo & Paula Estrella: Semantic textual similarity for MT evaluation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.52-58. [PDF, 278KB]

(2012) Lluís Formiga, Carlos A.Henríquez Q., Adolfo Hernández, José B.Marińo, Enric Monte, & José A.R.Fonollosa: The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.275-282. [PDF, 500KB]

(2012) Francisco Guzmán, Preslav Nakov, Ahmed Thabet, & Stephan Vogel: QCRI at WMT12: experiments in Spanish-English and German-English machine translation of news text. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.298-303. [PDF, 162KB]

(2012) Philipp Koehn & Barry Haddow: Toward effective use of training data in statistical machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.317-321. [PDF, 89KB]

(2012) Patrik Lambert & Rafael E.Banchs: BIA: a discriminative phrase alignment toolkit. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.43-53. [PDF, 136KB]

(2012) Lemao Liu, Tiejun Zhao, Taro Watanabe, Hailong Cao, & Conghui Zhu: Expected error minimization with ultraconservative update for SMT. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.723-732. [PDF, 144KB]

(2012) Verónica López-Ludeńa, Rubén San-Segundo, & Juan M.Montero: UPM system for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.338-344. [PDF, 223KB]

 (2012) Preslav Nakov & Hwee Tou Ng: Improving statistical machine translation for a resource-poor language using related resource-rich languages. Journal of Artificial Intelligence Research 44 (2012); pp.179-222. [PDF, 421KB]

 (2012) Tsuyoshi Okita: Neural probabilistic language model for system combination. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT, Mumbai, December 2012; pp.65-75. [PDF, 150KB]

 (2012) Tsuyoshi Okita, Raphaël Rubino, & Josef van Genabith: Sentence-level quality estimation for MT system combination. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT, Mumbai, December 2012; pp.55-63. [PDF, 140KB]

 (2012) Tsuyoshi Okita, Antonio Toral, & Josef van Genabith: Topic modeling-based domain adaptation for system combination. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT, Mumbai, December 2012; pp.45-53. [PDF, 135KB]

(2012) Daniele Pighin, Lluís Mŕrquez, & Jonathan May: An analysis (and an annotated corpus) of user responses to machine translation output.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1131-1135. [PDF, 355KB]

(2012) Maja Popović: Morpheme and POS-based IBM1 scores and language model scores for translation quality estimation.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.133-137. [PDF, 110KB]

 (2012) V.M.Sánchez-Cartagena, F.Sánchez-Martínez, & J.A.Pérez-Ortiz: An open-source toolkit for integrating shallow-transfer rules into phrase-based statistical machine translation. In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina Espańa-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.41-54. [PDF, 620KB]

(2012) Ińaki San Vicente & Iker Manterola: PaCo2: a fully automated tool for gathering parallel corpora from the Web.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1-6. [PDF, 415KB]

 (2011) Rafael E.Banchs & Marta R.Costa-jussŕ: A semantic feature for statistical machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.126-134. [PDF, 576KB]

(2011) Nadir Durrani, Helmut Schmid, & Alexander Fraser: A joint sequence translation model with integrated reordering. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1045-1054. [PDF, 254KB]

(2011) Miquel Esplŕ-Gomis, Felipe Sánchez-Martínez & Mikel L.Forcada: Using machine translation in computer-aided translation to suggest the target-side words to change. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.172-179. [PDF,160KB]

(2011) Miquel Esplŕ, Felipe Sánchez-Martínez, & Mikel L.Forcada: Using word alignments to assist computer-aided translation users by marking which target-side words to change or keep unedited. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.81-88. [PDF, 381KB]; presentation, 41 slides [PDF]

 (2011) Christian Federmann, Sabine Hunsicker, Yu Chen, & Rui Wang: DFKI system combination using syntactic information at ML4HMT-2011. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.104-109. [PDF, 294KB]

 (2011) Shasha Liao, Cheng Wu, & Juan Huerta: Evaluating human correction quality for machine translation from crowdsourcing.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.598-603. [PDF, 487KB]

(2011) Véronique López-Ludeńa & Rubén San-Segundo: UPM system for the translation task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.420-425. [PDF, 160KB]

(2011) Daniel Ortiz-Martínez, Luis A.Leiva, Vicent Alabau, Ismael García-Varea, & Francisco Casacuberta: An interactive machine translation system with online learning. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.68-73. [PDF, 508KB]

 (2011) Maja Popović & Hermann Ney: Towards automatic error analysis of machine translation output. Computational Linguistics 37 (4), pp. 657-688. [PDF, 219KB]

(2011) Sujith Ravi & Kevin Knight: Deciphering foreign language. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.12-21. [PDF, 225KB]

 (2011) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Expected BLEU training for graphs: BBN system description for WMT11 system combination task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.159-165. [PDF, 147KB]

 (2011) Victor M.Sánchez-Cartagnena, Felipe Sánchez-Martínez, & Juan Antonio Pérez-Ortiz: Enriching a statistical machine translation system trained on small parallel corpora with rule-based bilingual phrases. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.90-96. [PDF, 125KB]

(2011) Víctor M.Sánchez-Cartagena, Felipe Sánchez-Martínez, & Juan Antonio Pérez-Ortíz: Integrating shallow-transfer rules into phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.562-569. [PDF, 106KB]

(2011) Victor M.Sánchez-Cartagena, Felipe Sánchez-Martínez, & Juan Antonio Pérez-Ortiz: The Universitat d’Alacant hybrid machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.457-463. [PDF, 127KB]

 (2011) Luciana M.Scocco: Machine translation aid to the test: a study on the benefits of post-editing for translation into the weaker language. Dissertation submitted as part requirement for Master of Arts Translation and Interpreting Studies, University of Western Sydney, 4th November 2011; 14pp. [PDF, 170KB]

(2011) Mark Seligman & Mike Dillinger: Real-time multi-media translation for healthcare: a usability study. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.595-602. [PDF, 574KB]

(2011) Antonio Toral & Andy Way: Automatic acquisition of named entities for rule-based machine translation.  Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.37-43. [PDF, 151KB]

(2011) Viet Ha-Thuc & Nicola Cancedda: Confidence-weighted learning of factored discriminitave language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.439-444. [PDF,

 (2011) Petra Wolf, Ulrike Bernardi, Christian Federmann, & Sabine Hunsicker: From statistical term extraction to hybrid machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.225-232. [PDF, 329KB]; presentation, 24 slides [PDF]

(2011) Daguang Xu, Yuan Cao, & Damianos Karakos: Description of the JHU system combination scheme for WMT 2011.  [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.171-176. [PDF, 123KB]

(2010) Vamshi Ambati, Stephen Vogel, & Jaime Carbonell: Active learning and crowd-sourcing for machine translation. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2169-2174. [PDF, 436KB]

(2010) Sankaranarayanan Ananthakrishnan, Rohit Prasad, David Stallard, & Prem Natarajan: Discriminative sample selection for statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.626-635. [PDF, 329KB]

(2010) Michael Denkowski & Alon Lavie: METEOR-NEXT and the METEOR paraphrase tables: improved evaluation support for five target languages. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 339-342. [PDF, 104KB]

(2010) Christian Federmann, Andreas Eisele, Hans Uszkoreit, Yu Chen, Sabine Hunsicker, & Jia Xu: Further experiments with shallow hybrid MT systems.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 77-81. [PDF, 118KB]

(2010) Jesús González-Rubio, Daniel Ortiz-Martínez, & Francisco Casacuberta: Balancing user effort and translation error in interactive machine translation via confidence measures. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.173-177. [PDF, 97KB]

(2010) Jesús González-Rubio, Daniel Ortiz-Martínez, & Francisco Casacuberta: On the use of confidence measures within an interactive-predictive machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 525KB]

(2010) Kenneth Heafield & Alon Lavie: CMU multi-engine machine translation for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 301-306. [PDF, 112KB]

(2010) Carlos A.Henríquez Q., Marta R.Costa-jussŕ, Vidas Daudaravicius, Rafael E.Banchs, & José B.Marińo: Using collocation segmentation to augment the phrase table. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 98-102. [PDF, 1398KB]

(2010) Cong Hui, Hai Zhao, Yan Song, & Bao-Liang Lu: An empirical study on development set selection strategy for machine translation learning.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and  MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 67-71. [PDF, 447KB]

(2010) Prasanth Kolachina, Sriram Venkatapathy, Srinivas Bangalore, Sudheer Kolachina, & Avinesh PVS: Phrase-based decoding using a discriminative model. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.34-42. [PDF, 184KB]

(2010) Patrik Lambert, Simon Petitrenaud, Yanjun Ma & Andy Way: Statistical analysis of alignment characteristics for phrase-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 588KB]; presentation: 26 slides [PDF, 133KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Sushant Narsale: JHU system combination scheme for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 311-314. [PDF, 99KB]

(2010) Matteo Negri & Yashar Mehdad: Creating bi-lingual entailment corpus through translations with Mechanical Turk: $100 for a 10-day rush. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.212-216. [PDF, 101KB]

(2010) Sergio Penkale, Yanjun Ma, Daniel Galron, & Andy Way: Accuracy-based scoring for phrase-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 281KB]

(2010) Juan Pino, Gonzalo Iglesias, Adriŕ de Gispert, Graeme Blackwood, Jamie Brunning, & William Byrne: The CUED HiFST system for the WMT10 translation shared task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 155-160. [PDF, 97KB]

(2010) Narges Sharif Razavian & Stephan Vogel: Fixed length word suffix for factored statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.147-150. [PDF, 268KB]

(2010) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: BBN system description for WMT10 system combination task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 321-326. [PDF, 102KB]

(2010 Lane Schwartz: Reproducible results in parsing-based machine translation: the JHU shared task submission.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 177-182. [PDF, 101KB]

(2010) J.Tomás, A.Canovas, J.Lloret, M.García Pineda, & J.L.Abad: Speech translation in pedagogical environment using additional sources of knowledge.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1521-1524. [PDF, 442KB]

Spanish to French

(2014) Miriam Kaeshammer & Anika Westburg: On complex word alignment configurations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1773-1780. [PDF, 171KB]

(2012) Lorenza Russo, Sharid Loáiciga, & Asheesh Gulati: Improving machine translation of null subjects in Italian and Spanish.  [EACL 2012] Proceedings of the EACL Student Research Workshop, Avignon, France, 26 April 2012; pp. 81-88. [PDF, 116KB]

(2012) Lorenza Russo, Sharid Loáiciga, & Asheesh Gulati: Italian and Spanish null subjects: a case study evaluation in an MT perspective.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1779-1784. [PDF, 372KB]

(2011) Emmanuel Prochasson & Pascale Fung: Rare word translation extraction from aligned comparable documents. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1327-1335. [PDF, 157KB]

Spanish to German

(2013) Maja Popović, Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Sven Schmeier, Cindy Tscherwinka, David Vilar, & Hans Uszkoreit: Learning from human judgments of machine translation output. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.231-238. [PDF, 328KB]

(2012) Eleftherios Avramidis, Aljoscha Burchardt, Christian Federmann, Maja Popović, Cindy Tscherwinka, & David  Vilar: Involving language professionals in the evaluation of machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1127-1130. [PDF, 320KB]]

(2012) Carla Parra Escartin: Design and compilation of a specialized Spanish-German parallel corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2199-2206. [PDF, 1572KB]

Spanish to Guarani

(2013) Alex Rudnick & Michael Gasser: Lexical selection for hybrid MT with sequence labeling. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.102-108. [PDF, 251KB]

Spanish to Portuguese

(2011) Taylor Berg-Kirkpatrick & Dan Klein: Simple effective decipherment via combinatorial optimization. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.313-321. [PDF, 299KB]

 (2010) François Masselot, Petra Ribiczey, & Gema Ramírez-Sánchez: Using the Apertium Spanish-Brazilian Portuguese machine translation system for localization. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 577KB]; presentation: 23 slides [PDF, 569KB]

Spanish to Quechua

(2013) Annette Rios & Anne Göhring: Machine learning disambiguation of Quechua verb morphology. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.13-18. [PDF, 226KB]

(2011) Alex Rudnick: Towards cross-language word sense disambiguation for Quechua. [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 201, Hissar, Bulgaria, 13 September 2011; pp.133-138. [PDF, 125KB]

Swedish

(2010) Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, & Claudia Baur: A multilingual CALL game based on speech translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1531-1538. [PDF, 573KB]

 (2010) Maria Rosenberg: Lexical representation of agentive nominal compounds in French and Swedish [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

Swedish to Bulgarian

(2012) Georgi Iliev & Angel Genov: Expanding parallel resources for medium-density languages for free.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3937-3943. [PDF, 321KB]

Swedish to Dutch

(2010) Gudrun Rawoens: Multilingual corpora in cross-lingusitic research: focus on the compilation of a Dutch-Swedish parallel corpus. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1287-1294. [PDF, 417KB]

Swedish to English

(2012) Maria Holmqvist, Sara Stymne, Lars Ahrenberg, & Magnus Merkel: Alignment-based reordering for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3436-3440. [PDF, 300KB]

(2012) Danielle L.Mowery, Sumithra Velupillai, & Wendy W.Chapman: Medical diagnosis lost in translation – analysis of uncertainty and negation expressions in English and Swedish clinical texts. Proceedings of the 2012 Workshop on Biomedical Natural Language Processing (BioNLP 2012), Montréal, Canada, June 8, 2012; pp.56-64. [PDF, 288KB]

Tagalog

(2014) Peter Baumann & Janet Pierrehumbert: Using resource-rich languages to improve morphological analysis of under-resourced languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3355-3359. [PDF, 143KB]

Tajik to Farsi

(2012) Chris Irwin Davis: Tajik-Farsi Persian transliteration using statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3988-3995. [PDF, 590KB]

Tamil

(2012) Loganathan Ramasamy, Zdeněk Žabokrtský, & Sowmya Vajjala: The study of effect of length in morphological segmentation of agglutinative languages. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.18-24. [PDF, 151KB]

Tamil to English

(2011) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: An algorithm for unsupervised transliteration mining with an application to word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.430-439. [PDF, 171KB]

Tamil to Hindi

(2010) Sobha Lalitha Devi, Vijay Sundar Ram R, Bagyavathi T, & Praveen Pralayankar: Syntactic structure transfer in a Tamil-Hindi MT system – a hybrid approach [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

Tatar to Kazak

(2013) Ilnar Salimzyanov, Jonathan North Washington, & Francis Morton Tyers: A free/open-source Kazakh-Tatar machine translation system. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.175-182. [PDF, 327KB]

Telugu

(2012) Loganathan Ramasamy, Zdeněk Žabokrtský, & Sowmya Vajjala: The study of effect of length in morphological segmentation of agglutinative languages. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.18-24. [PDF, 151KB]

 (2011) Siva Reddy & Serge Sharoff: Cross-language POS taggers (and other tools) for Indian languages: an experiment with Kannada using Telugu resources. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.11-19. [PDF, 214KB]

Thai

(2011) Nattadaporn Lertcheva & Wirote Aroonmanakun: Product name identification and classification in Thai Economic News. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.58-64. [PDF, 129KB]

Thai to English

(2010) Michael Paul, Andrew Finch, & Eiichiro Sumita: Integration of multiple bilingually-learned segmentation schemes into statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 400-408. [PDF, 256KB]

(2010) Glenn Slayden, Mei-Yuh Hwang, & Lee Schwartz: Thai sentence-breaking for large-scale SMT. [Coling 2010] Proceedings of the 1st Workshop on South and Southeast Asian Natural Language Processing, Beijing, China, August 2010; pp.8-16. [PDF, 334KB]

(2010) Chai Wutiwiwatchai & Ausdang Thangthai: Syllable-based Thai-English machine transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.66-70. [PDF, 189KB]

Tunisian Arabic

(2014) Fatiha Sadat & Atefeh Farzindar: The ASMAT project – Arabic social media analysis tools. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.130. [PDF, 296KB]

(2013) Rahma Boujelbane, Mariem Ellouze Khemekhem, Siwar BenAyed, & Lamia Hadrich Belguith: Building bilingual lexicon to create dialect Tunisian corpora and adapt language model. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.88-93. [PDF, 454KB]

(2013) Ahmed Hamdi, Rahma Boujelbane, Nizar Habash, & Alexis Nasr: The effects of factorizing root and pattern mapping in translating between Tunisian Arabic and standard Arabic. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.125-134. [PDF, 672KB]

Turkic languages

(2015) Irina Galinskaya & Farkhat Aminov: Yandex.Translate approach to the translation of Turkic languages [abstract]. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; p.30. [PDF, 456KB]

 (2013) Zhiyang Wang, Yajuan Lü, Meng Sun, & Qun Liu: Stem translation with affix-based rule selection for agglutinative languages. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.364-369. [PDF, 667KB]

Turkish

(2013) Elif Eyigöz, Daniel Gildea & Kemal Oflazer: Multi-rate HMMs for word alignment. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.494-502. [PDF, 570KB]

 (2012) Alper Kanak: MULTISAUND – bridging multilingual technology [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.7-8. [PDF]

(2011) Coşkun Mermer & Murat Saraçlar: Unsupervised Turkish morphological segmentation for statistical machine translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 136KB]; presentation: 41 slides [PDF of PPT, 278KB]

Turkish to Arabic

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

Turkish to English

(2013) Elif Eyigöz, Daniel Gildea, & Kemal Oflazer: Simultaneous word-morpheme alignment for statistical machine translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.32-40. [PDF, 259KB]

(2013) Ertuğrul Yılmaz, İlknur Durgar El-Kahlout, Burak Aydın, Zişan Sıla Özil, & Coşkun Mermer: TÜBİTAK Turkish-English submissions for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 91KB]

(2012) M.Talha Çakmak, Süleyman Acar, & Gülşen Eryiğit: Word alignment for English-Turkish language pair.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2177-2180. [PDF, 577KB]

(2012) İlknur Durgar El-Kahlout: Recent improvements in Turkish-English machine translation [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.11-13. [PDF]

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

(2012) Coşkun Mermer, Hamza Kaya, İlknur Durgar El-Kahlout, & Mehmet Uğur Doğan: The TÜBİTAK statistical machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.144-148. [PDF,630KB]

(2012) N.Ruiz, A.Bisazza, R.Cattoni, & M.Federico: FBK’s machine translation systems for IWSLT 2012’s TED lectures. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.61-68. [PDF, 623KB]; presentation, 27 slides [PDF of PPT, 378KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Methods for smoothing the optimizer instability in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.32-39. [PDF, 150KB]

(2011) Marcello Federico, Arianna Bisazza & Christian Hardmeier: Morphological processing and word reordering for statistical MT of highly inflected languages [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 37 slides [PDF of PPT, 835KB]

(2011) Coşkun Mermer & Murat Saraçlar: Unsupervised Turkish morphological segmentation for statistical machine translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 136KB]; presentation: 41 slides [PDF of PPT, 278KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

(2010) Arianna Bisazza, Ioannis Klasinas, Mauro Cettolo, & Marcello Federico: FBK @ IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.53-58. [PDF, 346KB]

(2010) Evgeny Matusov & Selçuk Köprü: AppTek’s APT machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.29-36. [PDF, 336KB]

(2010) Coşkun Mermer & Ahmet Afşın Akın: Unsupervised search for the optimal segmentation for statistical machine translation. Proceedings of the ACL 2010 Student Research Workshop, Uppsala, Sweden, 13 July 2010; pp.31-36. [PDF, 118KB]

(2010) Coşkun Mermer, Hamza Kaya, & Mehmet Uğur Doğan: The TÜBİTAK-UEKAE statistical machine translation system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.183-188. [PDF, 194KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

(2010) Wade Shen, Tim Anderson, Ray Slyh, & A.Ryan Aminzadeh: The MIT-LL/AFRL IWSLT-2010 MT system. Proceedings of the 7th International Workshop on Spoken L anguage Translation, 2-3 December 2010, Paris, France; pp.127-134. [PDF, 341KB]; presentation: 11 slides [PDF of PPT, 422KB]

(2010) Sirvan Yahyaei & Christoph Monz: Dynamic distortion in a discriminative reordering model for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.353-360. [PDF, 460KB]

(2010) Sirvan Yahyaei & Christof Monz: The QMUL system description for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.157-162. [PDF, 303KB]

Turkish to Russian

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

Ugaritic

(2011) Taylor Berg-Kirkpatrick & Dan Klein: Simple effective decipherment via combinatorial optimization. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.313-321. [PDF, 299KB]

(2010) Benjamin Snyder, Regina Barzilay, & Kevin Knight: A statistical model for lost language decipherment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1048-1057. [PDF, 764KB]

Uighur to Chinese

(2011) Zhiyang Wang, Yajuan Lü, & Qun Liu: Multi-granularity word alignment and decoding for agglutinative language translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.360-367. [PDF, 169KB]

Ukrainian to English

(2014) Eduard Šubert & Ondřej Bojar: Twitter Crowd Translation – design and objectives. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.217-227. [PDF, 324KB]

UNL

(2012) Sameh Alansary: A formalized reference grammar for UNL-based machine translation between English and Arabic. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.33-42. [PDF, 352KB]

(2012) Velislava Stoykova: Bulgarian inflectional morphology in Universal Networking Language. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 423-429. [PDF, 145KB]

(2012) Hiroshi Uchida, Meiying Zhu, & Khan Md.Anwarus Salam: UNL Explorer. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 453-458. [PDF, 411KB]

(2010) Achille Falaise, David Rouquet, Didier Schwab, Hervé Blanchon, & Christian Boitet: Ontology driven content extraction using interlingual annotation of texts in the OMNIA project. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.52-60. [PDF, 359KB]

Urdu

 (2012) Tina Bögel: Urdu-Roman transliteration via finite state transducers. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.25-29. [PDF, 147KB]

 (2012) Smruthi Mukund & Rohini K.Srihari: Analyzing Urdu social media for sentiments using transfer learning with controlled translations. Proceedings of the 2012 Workshop on Language in Social Media (LSM 2012), Montréal, Canada, June 7, 2012; pp.1-8. [PDF, 393KB]

(2010) Smruthi Mukund, Debanjan Ghosh, & Rohini K.Srihari: Using cross-lingual projections to generate semantic role labeled corpus for Urdu – a resource poor language. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.797-805. [PDF, 300KB]

Urdu to English

(2014) Nadir Durrani & Philipp Koehn: Improving machine translation via triangulation and transliteration. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.71-78. [PDF, 452KB]

(2013) Trevor Cohn & Gholamreza Haffari: An infinite hierarchical Bayesian model of phrasal translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.780-790. [PDF, 432KB]

(2013) Mitesh M.Khapra, Ananthakrishnan Ramanathan, & Karthik Visweswariah: Improving reordering performance using higher order and structural features. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.315-324. [PDF, 177KB]

(2013) Karthik Visweswariah, Mitesh M.Khapra, & Ananthakrishnan Ramanathan: Cut the noise: mutually reinforcing reordering and alignments for improved machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1275-1284. [PDF, 250KB]

(2012) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Chris Callison-Burch, Nathaniel W.Filardo, Christine Piatko, Lori Levin, & Scott Miller: Use of modality and negation in semantically-informed syntactic MT. Computational Linguistics 38 (2), pp. 411-438. [PDF, 3140KB]

(2012) Yuan Cao & Sanjeev Khudanpur: Sample selection for large-scale MT discriminative training. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 117KB]

(2012) Ashish Vaswani, Liang Huang, & David Chiang: Smaller alignment models for better translations: unsupervised word alignment with the l0-norm. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.311-319. [PDF, 127KB]

(2012) Jonathan Weese, Chris Callison-Burch, & Adam Lopez: Using categorial grammar to label translation rules. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.222-231. [PDF, 165KB]

(2011) Michael Denkowski & Alon Lavie: Meteor 1.3: automatic metric for reliable optimization and evaluation of machine translation systems. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.85-91. [PDF, 119KB]

(2011) Chris Dyer, Jonathan Clark, Alon Lavie, & Noah A.Smith: Unsupervised word alignment with arbitary features. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.201-210. [PDF, 303KB]

(2011) Manaal Faruqui, Prasenjit Majumder, & Sebastian Padó: Soundex-based translation correction in Urdu-English cross-language information retrieval. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.25-29. [PDF, 326KB]

(2011) Kevin Gimpel & Noah A.Smith: Quasi-synchronous phrase dependency grammars for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.474-485. [PDF, 360KB]

(2011) Lane Schwartz, Chris Callison-Burch, William Schuler, & Stephen Wu: Incremental syntactic language models for phrase-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.620-631. [PDF, 249KB]

(2011) V.Shukla & R.M.K.Sinha: Divergence patterns for Urdu to English and English to Urdu translation. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.21-28. [PDF, 773KB]

(2011) Karthik Visweswariah, Rajakrishnan Rajkumar, Ankur Gandhe, Ananthakrishnan Ramanathan, & Jiri Navratil: A word reordering model for improved machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.486-496. [PDF, 252KB]

(2011) Omar F.Zaidan & Chris Callison-Burch: Crowdsourcing translation: professional quality from non-professionals. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1220-1229. [PDF, 241KB]

(2010) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, & Christine Piatko: A modality lexicon and its use in automatic tagging. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1402-1407. [PDF, 375KB]

(2010) Kathryn Baker, Michael Bloodgood, Chris Callison-Burch, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, Scott Miller, & Christine Piatko: Semantically-informed syntactic machine translation: a tree-grafting approach. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 360KB]

(2010) Michael Bloodgood & Chris Callison-Burch: Bucking the trend: large-scale cost-focused active learning for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.854-864. [PDF, 489KB]

(2010) Michael Bloodgood & Chris Callison-Burch: Using Mechanical Turk to build machine translation evaluation sets. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.208-211. [PDF, 89KB]

(2010) Ann Irvine, Mike Kayser, Zhifei Li, Wren Thornton, & Chris Callison-Burch: Integrating output from specialized modules in machine translation: transliterations in Joshua. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.107-116. [PDF, 152KB]

(2010) Karthik Visweswariah, Vijil Chenthamarakshan, & Nandakishore Kambhatla: Urdu and Hindi: translation and sharing of linguistic resources. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1283-1291. [PDF, 217KB]

(2010) Omar F.Zaidan & Juri Ganitkevitch: An enriched MT grammar for under $100. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.93-98. [PDF, 186KB]

Urdu to Hindi

(2014) Nadir Durrani & Philipp Koehn: Improving machine translation via triangulation and transliteration. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.71-78. [PDF, 452KB]

(2012) Gurpreet Singh Lehal & Tejinder Singh Saini: Development of a complete Urdu-Hindi transliteration system. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.643-652. [PDF,357KB]

(2011) Hassan Sajjad, Nadir Durrani, Helmut Schmid, & Alexander Fraser: Comparing two techniques for learning transliteration models using a parallel corpus. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.129-137. [PDF, 605KB]

(2010) Karthik Visweswariah, Vijil Chenthamarakshan, & Nandakishore Kambhatla: Urdu and Hindi: translation and sharing of linguistic resources. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1283-1291. [PDF, 217KB]

Veneto to English

(2011) Suhel Jaber, Sara Tonelli, & Rodolfo Delmonte: Venetan to English machine translation: issues and possible solutions. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.69-80. [PDF, 873KB]

Vietnamese to English

 (2015) Amittai Axelrod, Ahmed Elgohary, Marianna Martindale, Khánh Nguyen, Xing Niu, Yogarshi Vyas, & Marine Carpuat: The UMD machine translation systems at IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 55-61. [PDF, 2.6MB]

(2015) Takahiro Nomura, Hajime Tsukada, & Tomoyoshi Akiba: Improvement of word alignment models for Vietnamese-to-English translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.112-116. [PDF, 1.3MB]

(2015) Viet Tran Hong, Huyen Vu Thong, Trung Le Tien, Luan Ngia Pham, & Vinh Nuyen Van: The English-Vietnamese machine translation system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.80-83. [PDF, 1.1MB]

(2015) Krzysztof Wolk & Krzysztof Marasek: PJAIT systems for the IWSLT 2015 evaluation campaign enhanced by comparable corpora. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.101-104. [PDF, 2.9MB]

(2012) Quoc Hung-Ngo & Werner Winiwarter: A visualizing annotation tool for semi-automatically building a bilingual corpus.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.67-74. [PDF, 820KB]

Vietnamese to French

(2010) Do Thi Ngoc Diep, Laurent Besacier, & Eric Castelli: Improved Vietnamese-French parallel corpus mining using English language. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.235-242. [PDF, 1182KB]

Vietnamese to Japanese

(2012) Takanori Kusumoto & Tomoyosi Akiba: Statistical machine translation without a source-side parallel corpus using word lattice and phrase extension.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3929-3932. [PDF, 352KB]

Zulu

(2014) Peter Baumann & Janet Pierrehumbert: Using resource-rich languages to improve morphological analysis of under-resourced languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3355-3359. [PDF, 143KB]