Machine Translation Archive

Index of Organizations and Affiliations

for publications from 2005 to 2009

arranged by country, part 4: US-Z

click here for part 1 (A-F), part 2 (G-M), part 3 (N-UK)

for publications since 2010, 2000-2004; 1990-1999; 1970-1989; before 1970

 [to return to home page click here]

 

 

United States of America

Abraxas Corporation

(2008) Jennifer Doyon, Christine Doran, C.Donald Means, & Domenique Parr: Automated machine translation improvement through post-editing techniques: analyst and translator experiments. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.346-353. [PDF, 712KB]

Accenture Technology Labs

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

Adobe Systems Inc.

(2009) Raymond Flournoy & Christine Duran: Machine translation and document localization at Adobe: from pilot to production. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 425-428. [PDF, 123KB]

Advanced Resources Technology Inc.

(2008) Reginald Hobbs, Jamal Laoudi, & Clare R.Voss: MTriage: web-enabled software for the creation, machine translation, and annotation of smart documents.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 433KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2008) Clare R. Voss, Jamal Laoudi, & Jeffrey Micher: Exploitation of an Arabic language resource for MT evaluation: using Buckwalter-based lookup tool to augment CMU alignment algorithm.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 246KB]

Air Force Institute of Technology

(2007) Erik A. Chapla: Statistical machine translation of Japanese. MSc thesis, Air Force Institute of Technology, Wright-Patterson Air Force Base, Ohio, March 2007 (AFIT/GE/ENG/07-06). 96pp. [PDF, 1407KB]

Air Force Research Laboratories

(2009) Wade Shen, Brian Delaney, A.Ryan Aminzadeh, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2009 MT system. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 71-78. [PDF, 398KB]; presentation [PDF of PPT, 1730KB]

(2008) Wade Shen, Brian Delaney, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2008 MT system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 69-76. [PDF, 208KB]; presentation [PDF, 607KB]

(2007) Wade Shen, Brian Delaney, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2007 MT system.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 174KB]; presentation [PDF, 623KB]

(2006) Brian Delaney, Wade Shen, & Timothy Anderson: An efficient graph search decoder for phrase-based statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 197-204 [PDF, 250KB]

(2006) Douglas Jones, Wade Shen, Brian Delaney, Martha Herzog, Michael Emonts, Sabine Atwell, James Dirgin, Neil Granoien, Sargon Jabri, Jurgen Sottung, Timothy Anderson, & Timothy Hunter: Toward an Interagency Language Roundtable based assessment of speech-to-speech translation capabilities. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.82-89 [PDF, 150KB]

(2006) Wade Shen, Brian Delaney, & Tim Anderson: The MIT-LL/AFRL IWSLT-2006 MT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 71-76  [PDF, 804KB]

(2005) Wade Shen, Brian Delaney, & Tim Anderson: The MIT-LL/AFRL MT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 5pp. [PDF, 70KB]

A-Life Medical Inc.

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.24-31 [PDF, 251KB]

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Accultran: automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.50-52 [PDF, 181KB]

Alpnet

(2007) Deryle Lonsdale: From ALPS to Alpnet (and beyond). [priv.publ., January 2007]. 12pp. [PDF, 78KB]

American National Standards Institute

(2006) Gil Francopoulo, Nuria Bel, Monte George, Nicoletta Calzolari, Monica Monachini, Mandy Pet, & Claudia Soria: Lexical markup framework (LMF) for NLP multilingual resources. Coling-ACL 2006: Proceedings of the Workshop on Multilingual Language Resources and Interoperability, Sydney, July 2006; pp.1-8. [PDF, 69KB]

Ames Research Center [see also Research Institute for Advanced Computer Science]

(2009) Beth Ann Hockey & Manny Rayner: Using paraphrases of deep semantic representations to support regression testing in spoken dialogue systems. NAACL-HLT-2009: Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009), Proceedings of the Workshop, June 5, 2009, Boulder, Colorado; pp.14-21. [PDF, 139KB]

(2009) Manny Rayner, Paula Estrella, Pierrette Bouillon, Beth Ann Hockey & Yukie Nakao: Using artificially generated data to evaluate statistical machine translation. ACL-IJCNLP 2009: Workshop on Grammar Engineering Across Frameworks (GEAF 2009), Proceedings of the workshop, 6 August 2009, Suntec, Singapore; pp.54-62. [PDF, 214KB]

(2008) Pierrette Bouillon, Sonia Halimi, Yukie Nakao, Kyoko Kanzaki, Hitoshi Isahara, Nikos Tsourakis, Marianne Starlander, Beth Ann Hockey, & Manny Rayner: Developing non-European translation pairs in a medium-vocabulary medical speech translation system. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 122KB]

(2008) Pierrette Bouillon, Glenn Flores, Maria Georgescul, Sonia Halimi, Beth Ann Hockey, Hitoshi Isahara, Kyoko Kanzaki, Yukie Nakao, Manny Rayner, Marianne Santaholma, Marianne Starlander, & Nikos Tsourakis: Many-to-many multilingual medical speech translation on a PDA. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.314-323. [PDF, 666KB]

(2008) Reginald Hobbs, Jamal Laoudi, & Clare R.Voss: MTriage: web-enabled software for the creation, machine translation, and annotation of smart documents.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 433KB]

(2008) Manny Rayner, Pierrette Bouillon, Jane Brotanek, Glenn Flores, Sonia Halimi, Beth Ann Hockey, Hitoshi Isahara, Kyoko Kanzaki, Elisabeth Kron, Yukie Nakao, Marianne Santaholma, Marianne Starlander, & Nikos Tsourakis: The 2008 MedSLT system. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.32-35. [PDF, 209KB]

(2008) Manny Rayner, Pierrette Bouillon, Glenn Flores, Farzad Ehsani, Marianne Starlander, Beth Ann Hockey, Jane Brotanek, & Lukas Biewald: A small-vocabulary shared task for medical speech translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.60-63. [PDF, 62KB]

(2008) Manny Rayner, Pierrette Bouillon, Beth Ann Hockey, & Yukie Nakao: Almost flat functional semantics for speech translation.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.713-720. [PDF, 96KB]

(2007) Pierrette Bouillon, Glenn Flores, Marianne Starlander, Nikos Chatzichrisafis, Marianne Santaholma, Nikos Tsourakis, Manny Rayner, & Beth Ann Hockey: A bidirectional grammar-based medical speech translator. ACL 2007: Proceedings of the Workshop on Grammar-based approaches to spoken language processing (SPEECHGRAM 2007), June 29, 2007, Prague, Czech Republic; pp.41-48. [PDF, 216KB]

(2007) Pierrette Bouillon, Sonia Halimi, Manny Rayner, & Beth Ann Hockey: Adapting a medical speech to speech translation system (MedSLT) to Arabic. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 41-48 [PDF, 145KB]

(2006) Nikos Chatzichrisafis, Pierrette Bouillon, Manny Rayner, Marianne Santaholma, Marianne Starlander & Beth Ann Hockey: Evaluating task performance for a unidirectional controlled language medical speech translation system.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.9-16 [PDF, 109KB]

(2006) Manny Rayner, Pierrette Bouillon, Nikos Chatzichrisafis, Marianne Santaholma, Marianne Starlander, Beth Ann Hockey, Yukie Nakao, Hitoshi Isahara & Kyoko Kanzaki: MedSLT: a limited-domain unidirectional grammar-based medical speech translator.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.44-47 [PDF, 75KB]

(2005) Manny Rayner, Nikos Chatzichrisafis, Pierrette Bouillon, Yukie Nakao, Hitoshi Isahara, Kyoko Kanzaki, Beth Ann Hockey, Marianne Santaholma, & Marianne Starlander: Japanese speech understanding using grammar specialization. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 26-27. [PDF, 54KB]

(2005) Marianne Starlander, Pierrette Bouillon, Nikos Chatzichrisafis, Marianne Santaholma, Manny Rayner, Beth Ann Hockey, Hitoshi Isahara, Kyoko Kanzaki, & Yukie Nakao: Practicing controlled language through a help system integrated into the medical speech translation system (MedSLT). MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.188-194. [PDF, 169KB]

(2005) Pierrette Bouillon, Manny Rayner, Nikos Chatzichrisafis, Beth Ann Hockey, Marianne Santaholma, Marianne Starlander, Yukie Nakao, Kyoko Kanzaki, & Hitoshi Isahara: A generic multi-lingual open source platform for limited-domain medical speech translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 50-58. [PDF, 182KB]

AppTek Inc.

(2009) Craig Kopris: Endoclitics in Pashto: can they really do that?  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 6pp. [PDF, 485KB]

(2009) Selçuk Köprü: AppTek Turkish-English machine translation system description for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 19-23. [PDF, 309KB]

(2009) Selçuk Köprü & Jude Miller: A unification based approach to the morphological analysis and generation of Arabic. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 615KB]

(2009) Selçuk Köprü & Adnan Yazıcı: Lattice parsing to integrate speech recognition and rule-based machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.469-477. [PDF, 185KB]

(2008) Hassan Sawaf, Braddock Gaskill, & Michael Veronis: Hybrid machine translation applied to media monitoring. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.440-447. [PDF, 963KB]

Army Intelligence Center

(2008) Cecil MacPherson, Devin Rollis, & Irene Zehmisch: Meeting Army foreign language requirements with the aid of machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp. 435-439. [PDF, 518KB]

(2006) Douglas Jones, Wade Shen, Brian Delaney, Martha Herzog, Michael Emonts, Sabine Atwell, James Dirgin, Neil Granoien, Sargon Jabri, Jurgen Sottung, Timothy Anderson, & Timothy Hunter: Toward an Interagency Language Roundtable based assessment of speech-to-speech translation capabilities. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.82-89 [PDF, 150KB]

Army Research Laboratories

(2009) Reginald L.Hobbs, Clare R.Voss & Jamal Laoudi: On beyond TM: when the translator leads the design of a translation support framework. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 513-520. [PDF, 472KB]

(2008) Reginald L.Hobbs & Clare R.Voss: Designing and executing machine translation workflows through the Kepler framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.380-389.  [PDF of PPT presentation, 1204KB]

(2008) Calandra R. Tate: A statistical analysis of automated MT evaluation metrics for assessments in task-based MT evaluation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.182-191. [PDF, 583KB]

(2008) Jean Tavernier, Rosa Cowan, & Michelle Vanni: Holy Moses! Leveraging existing tools and resources for entity translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 74KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2008) Clare R. Voss, Jamal Laoudi, & Jeffrey Micher: Exploitation of an Arabic language resource for MT evaluation: using Buckwalter-based lookup tool to augment CMU alignment algorithm.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 246KB]

(2006) Jamal Laoudi, Calandra R.Tate, & Clare R.Voss: Task-based MT evaluation: from who/when/where extraction to event understanding.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2048-2053 [PDF, 381KB]

(2006) Keith J.Miller & Michelle Vanni: Formal vs. informal: register-differentiated Arabic MT evaluation in the PLATO paradigm.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.161-166 [PDF, 296KB]

(2006) Calandra R. Tate & Clare R. Voss: Combining evaluation metrics via loss functions. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.242-250 [PDF, 106KB]

(2006) Clare R. Voss & Calandra R. Tate: Task-based evaluation of machine translation (MT) engines. Measuring how well people extract who, when, where-type elements in MT output. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.203-212 [PDF, 235KB]

(2005) Keith J. Miller & Michelle Vanni: Inter-rater agreement measures, and the refinement of metrics in the PLATO MT evaluation paradigm. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.125-132. [PDF, 183KB]

Articulate Software Inc.

(2006) Sabri Elkateb, William Black, Piek Vossen, David Farwell, Adam Pease, & Christiane Fellbaum: Arabic WordNet and the challenges of Arabic. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.15-24. [PDF, 295KB]

Artis Tech Inc.

(2009) Reginald L.Hobbs, Clare R.Voss & Jamal Laoudi: On beyond TM: when the translator leads the design of a translation support framework. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 513-520. [PDF, 472KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

AT&T

(2009) Srinivas Bangalore, Stephan Kanthak, & Patrick Haffner: Statistical machine translation through global lexical selection and sentence reconstruction.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.185-203.

(2009) Daniel Galron, Sergio Penkale, Andy Way, & I.Dan Melamed: Accuracy-based scoring for DOT: towards direct error minimization for data-oriented translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.371-380. [PDF, 164KB]

(2008) Vivek Kumar Rangarajan Sridhar, Shrikanth Narayanan, & Srinivas Bangalore: Enriching spoken language translation with dialog acts. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 225-228. [PDF, 383KB]

(2008) Vivek Kumar Sridhar, Srinivas Bangalore, & Shrikanth S.Narayanan: Factored translation models for enriching spoken language translation with prosody. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2723-2726; abstract [PDF, 60KB]

(2007) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Statistical machine translation through global lexical selection and sentence reconstruction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 152-159 [PDF, 458KB]

(2007) Sriram Venkatapathy & Srinivas Bangalore: Three models for discriminative machine translation using global lexical selection and sentence reconstruction. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.96-102 [PDF, 191KB]

(2006) Srinivas Bangalore, Stephan Kanthak, & Patrick Haffner: Finite-state transducer-based statistical machine translation using joint probabilities. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 16-22 [PDF,179 KB]

(2006) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Sequence classification for machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1722; abstract [PDF, 72KB]

BBN Technologies

(2009) Spyros Matsoukas, Antti-Veikko I.Rosti, & Bing Zhang: Discriminative corpus weight estimation for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.708-717. [PDF, 116KB]

(2009) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment with flexible matching for building confusion networks: BBN system description for WMT09 system combination task. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.61-65. [PDF, 115KB]

(2009) Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas & Ralph Weischedel: Effective use of linguistic and contextual information for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.72-80. [PDF, 122KB]

(2009) Matthew G.Snover, Nitin Madnani, Bonnie Dorr, & Richard Schwartz: TER-Plus: paraphrase, semantic, and alignment enhancements to Translation Error Rate [abstract]. Machine Translation 23 (2/3), September 2009; pp.117-127.

(2009) Matthew Snover, Nitin Madnani, Bonnie J.Dorr, & Richard Schwartz: Fluency, adequacy, or HTER? Exploring different human judgments with a tunable MT metric.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.259-268. [PDF, 147KB]

(2008) C.-L. Kao, S.Saleem, R.Prasad, F.Choi, P.Natarajan, D.Stallard, K.Krstovski, & M.Kamali: Rapid development of an English/Farsi speech-to-speech translation system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.166-173 [PDF, 143KB]; presentation [PDF, 95KB]

(2008) Kriste Krstovski, Michael Decerbo, Rohit Prasad, David Stallard, Shirin Saleem, & Premkumar Natarajan: A wearable headset speech-to-speech translation system. ACL-08: HLT. Workshop on Mobile Language Processing. Proceedings of the workshop, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 10-12. [PDF, 117KB]

(2008) Nitin Madnani, Philip Resnik, Bonnie J.Dorr, & Richard Schwartz: Are multiple reference translations necessary? Investigating the value of paraphrased reference translations in parameter optimization. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.143-152. [PDF, 655KB]

(2008) John F.Pitrelli, Burn L.Lewis, Edward A.Epstein, Martin Franz, Daniel Kiecza, Jerome L.Quinn, Ganesh Ramaswamy, Amit Srivastava, & Paola Virga: Aggregating distributed STT, MT, and information extraction engines: the GALE interoperability-demo system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2743-2746; abstract [PDF, 63KB]

(2008) Antti-Veikko I. Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.183-186. [PDF, 61KB]

(2008) Libin Shen, Jinxi Xu, & Ralph Weischedel: A new string-to-dependency machine translation algorithm with a target dependency language model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 577-585. [PDF, 147KB]

(2008) Matthew Snover, Bonnie Dorr, & Richard Schwartz: Language and translation model adaptation using comparable corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.857-866. [PDF, 132KB]

(2007) Antti-Veikko I. Rosti, Spyros Matsoukas, & Richard Schwartz: Improved word-level system combination for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 312-319 [PDF, 135KB]

(2007) Antti-Veikko I.Rosti, Necip Fazil Ayan, Bing Xiang, Spyros Matsoukas, Richard Schwartz, & Bonnie J.Dorr: Combining outputs from multiple machine translation systems. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.228-235 [PDF, 129KB]

(2007)  S.Saleem, K.Subramanian, R.Prasad, David Stallard, Chia-Lin Kao, P.Natarajan, & R.Suleiman: Improvements in machine translation for English/Iraqi speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2445-2448; abstract [PDF, 23KB]

(2007) David Stallard, Fred Choi, Chia-Lin Kao, Kriste Krstovski, P.Natarajan, R.Prasad, S.Saleem, & K.Subramanian: The BBN 2007 displayless English/Iraqi speech-to-speech translation system. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2817-2820; abstract [PDF, 23KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

(2006) Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla, & John Makhoul: A study of translation edit rate with targeted human annotation.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.223-231 [PDF, 185KB]

(2006) David Stallard, Fred Choi, Kriste Krstovski, Prem Natarajan, Rohit Prasad, & Shirin Saleem: A hybrid phrase-based/statistical speech translation system. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1732; abstract [PDF, 74KB]

Bolt, Beranek and Newman see BBN Technologies

Brandeis University

(2009) Jinho D.Choi, Martha Palmer, & Nianwen Xue: Using parallel Propbanks to enhance word-alignments. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.121-124. [PDF, 139KB]

Brigham and Women’s Hospital, Boston

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.24-31 [PDF, 251KB]

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Accultran: automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.50-52 [PDF, 181KB]

Brigham Young University

(2007) Deryle Lonsdale: From ALPS to Alpnet (and beyond). [priv.publ., January 2007]. 12pp. [PDF, 78KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Daniel Marcu & Alan K.Melby: Data-driven machine translation: a conversation with linguistics and translation studies. AMTA 2006:  7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; invited talk [PDF of PPT presentation, 998KB]; expanded version with annotations and revisions, January 2008; PDF, 768KB]

Brown University

(2008) David Ellis, Mathias Creutz, Timo Honkela, & Mikko Kurimo: Speech to speech machine translation: Biblical chatter from Finnish to English.  IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp. 123-130. [PDF, 104KB]

(2005) Heidi J. Fox: Dependency-based statistical machine translation.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 91-96. [PDF, 101KB]

(2005) Sharon Goldwater & David McClosky: Improving statistical MT through morphological analysis.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 676-683. [PDF, 152KB]

Buffalo University

(2008) Anurag Bhardwaj, Damien Jose, & Venu Govindaraju: Script independent word spotting in multilingual documents. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.48-54. [PDF, 122KB]

Butler Hill Group, Seattle

(2009) Takako Aikawa & Achim Ruopp: Chained system: a linear combination of different types of statistical machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.184-189. [PDF, 229KB]

(2007) Takako Aikawa, Lee Schwartz, Ronit King, Mo Corston-Oliver, & Carmen Lozano: Impact of controlled language on translation quality and post-editing in a statistical machine translation environment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.1-7 [PDF, 314KB]

CACI International Inc.

(2009) Anna Borovikov, Eugene Borovikov, Bradley Colquitt, & Kristen Summers: The EDEAL project for automated processing of African languages.  MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 546-549. [PDF, 73KB]

(2008) James E. Andrews & Kristen Summers: Machine translation for triage and exploitation of massive text data. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.291-298 [PDF, 703KB]

(2008) Kristen Summers & Diane Chandler: Language processing for analysis and investigation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.454-456. [PDF, 647KB]

(2008) Jean Tavernier, Rosa Cowan, & Michelle Vanni: Holy Moses! Leveraging existing tools and resources for entity translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 74KB]

California, University of (Berkeley)

(2009) John DeNero, Adam Pauls, & Dan Klein: Asynchronous binarization for synchronous grammars. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.141-144. [PDF, 183KB]

(2009) John DeNero: Estimation problems in machine translation (learning to translate). Presentation at Johns Hopkins University Summer School on Human Language Technology, June 12, 2009; 88 slides [PDF of PPT slides, 3496KB]

(2009) John DeNero, David Chiang, & Kevin Knight: Fast consensus decoding over translation forests. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.567-575. [PDF, 650KB]

(2009) Wei Gao, John Blitzer, Ming Zhou, & Kam-Fai Wong: Exploiting bilingual information to improve web search. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.1075-1083. [PDF,

(2009) Aria Haghighi, John Blitzer, John DeNero & Dan Klein: Better word alignments with supervised ITG models. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.306-314. [PDF, 301KB]

(2009) Kristen Parton, Kathleen R.McKeown, Bob Coyne, Mona T.Diab, Ralph Grishman, Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei Yun Ma, Adam Meyers, Sara Stolbach, Ang Sun, Gokhan Tur, Wei Xu, & Sibel Yaman: Who, what, when, where, why? Comparing multiple approaches to the cross-lingual 5W task. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.423-431. [PDF, 176KB]

(2009) Adam Pauls, John DeNero, & Dan Klein: Consensus training for consensus decoding in machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1418-1427. [PDF, 502KB]

(2009) Jing Zheng, Necip Fazil Ayan, Wen Wang, & David Burkett: Using syntax in large-scale audio document translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2008) David Burkett & Dan Klein: Two languages are better than one (for syntactic parsing).  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.877-886. [PDF, 405KB]

(2008) John DeNero, Alexandre Bouchard-Côté, & Dan Klein: Sampling alignment structure under a Bayesian translation model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.314-323. [PDF, 356KB]

(2008) John DeNero & Dan Klein: The complexity of phrase alignment problems. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 25-28. [PDF, 180KB]

(2008) Frederic Gey, David Kirk Evans, & Noriko Kando: A Japanese-English technical lexicon for translation and language research. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 198KB]

(2008) Aria Haghighi, Percy Liang, Taylor Berg-Kirkpatrick, & Dan Klein: Learning bilingual lexicons from monolingual corpora. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 771-779. [PDF, 257KB]

(2008) Preslav Nakov: Improving English-Spanish statistical machine translation: experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.147-150. [PDF, 110KB]

(2008) Slav Petrov, Aria Haghighi, & Dan Klein: Coarse-fine syntactic machine translation using language projections.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.108-116. [PDF, 306KB]

(2007) John DeNero & Dan Klein: Tailoring word alignments to syntactic machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 17-24 [PDF, 415KB]

(2007) Frederic C.Gey: Search between Chinese and Japanese text collections. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.73-76. [PDF, 283KB]

(2007) Preslav Nakov & Marti Hearst: UCB system description for the WMT 2007 shared task. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 212-215 [PDF, 129KB]

(2006) John DeNero, Dan Gillick, James Zhang, & Dan Klein: Why generative phrase models underperform surface heuristics.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 31-38 [PDF, 393KB]

(2006) David Kauchak: Contributions to research on machine translation. PhD thesis, University of California, San Diego, 2006. xiv,92pp. [PDF, 491KB]

(2006) David Kauchak & Regina Barzilay: Paraphrasing for automatic evaluation. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 455-462 [PDF, 196KB]

(2006) Simon Lacoste-Julien, Ben Taskar, Dan Klein, & Michael I. Jordan: Word alignment via quadratic assignment.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 112-119 [PDF, 177KB]

(2006) Percy Liang, Alexandre Bouchard-Côté, Dan Klein, & Ben Taskar: An end-to-end discriminative approach to machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.761-768. [PDF, 136KB]

(2006) Percy Liang, Ben Taskar, & Dan Klein: Alignment by agreement.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 104-111 [PDF, 150KB]

(2005) Frederic C.Gey: How similar are Chinese and Japanese for cross-language information retrieval? Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 4pp. [PDF, 343KB]

(2005) Ben Taskar, Simon Lacoste-Julien, & Dan Klein: A discriminative matching approach to word alignment. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 73-80. [PDF, 902KB]

California, University of (Santa Cruz)

(2009) Jesse Saba Kirchner, Justin Nuger, & Yi Zhang: An extensible crosslinguistic readability framework. ACL-IJCNLP-2009: Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.11-18. [PDF, 215KB]

(2005) Manny Rayner, Pierrette Bouillon, Nikos Chatzichrisafis, Beth Ann Hockey, Marianne Santaholma, Marianne Starlander, Hitoshi Isahara, Kyoko Kanzaki, & Yukie Nakao: A methodology for comparing grammar-based and robust approaches to speech understanding. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.1877-1880 [PDF, 52KB]; abstract [PDF, 55KB]

Carnegie Mellon University

(2009) Vamshi Ambati & Jaime Carbonell: Proactive learning for building machine translation systems for minority languages. NAACL-HLT-2009: Active Learning for Natural Language Processing (ALNLP-09), Proceedings of the workshop, June 5, 2009, Boulder, Colorado; pp.58-61. [PDF, 101KB]

(2009) Vamshi Ambati, Alon Lavie, & Jaime Carbonell: Extraction of syntactic translation models from parallel data using syntax from source and target languages. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.190-197. [PDF, 251KB]

(2009) Nguyen Bach, Qin Gao, & Stephan Vogel: Source-side dependency tree reordering models and subtree movements and constraints. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 17-24. [PDF, 211KB]

(2009) Nguyen Bach, Roger Hsiao, Matthias Eck, Paisarn Charoenpornsawat, Stephan Vogel, Tanja Schultz, Ian Lane, Alex Waibel, & Alan W.Black: Incremental adaptation of speech-to-speech translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009;  pp.149-152. [PDF, 111KB]

(2009) Reza Bosagh Zadeh: Building strong multilingual aligned corpora. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.176-181. [PDF, 497KB]

(2009) Jaime Carbonell: New paradigms for machine translation. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 35pp. [PDF of PPT, 462KB]

(2009) Jaime Carbonell: How to combine paradigms for machine translation, Translingual Europe 2009, May 13-14, Prague, Czech Republic; 4pp. [PDF of PPT, 83KB]

(2009) Jonathan Clark & Greg Hanneman: Czech-to-English translation [on Stat-XFER]. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.29-31. [PDF, 1116KB]

(2009) Kevin Gimpel & Noah A.Smith: Feature-rich translation by quasi-synchronous lattice parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.219-228. [PDF, 329KB]

(2009) Francisco Guzman, Qin Gao & Stephan Vogel: Reassessment of the role of phrase extraction in PBSMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 49-56. [PDF, 131KB]

(2009) Olivier Hamon, Christian Fügen, Djamel Mostefa, Victoria Arranz, Muntsin Kolss, Alex Waibel, & Khalid Choukri: End-to-end evaluation in simultaneous translation. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.345-353. [PDF, 106KB]

(2009) Greg Hanneman & Alon Lavie: Decoding with syntactic and non-syntactic phrases in a syntax-based machine translation system. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.1-9. [PDF, 124KB]

(2009) Greg Hanneman, Vamshi Ambati, Jonathan H.Clark, Alok Parlikar, & Alon Lavie: An improved statistical transfer system for French-English machine translation. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.140-144. [PDF, 101KB]

(2009) Kenneth Heafield, Greg Hanneman, & Alon Lavie: Machine translation system combination with flexible word ordering. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.56-60. [PDF, 104KB]

(2009) Almut Silja Hildebrand & Stephan Vogel: CMU system combination for WMT’09. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.47-50. [PDF, 101KB]

(2009) Alon Lavie & Michael J.Denkowski: The METEOR metric for automatic evaluation of machine translation [abstract]. Machine Translation 23 (2/3), September 2009; pp.105-115.

(2009) Alon Lavie: Stat-XFER: a general framework for search-based syntax-driven MT. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.19-28 [PDF, 740KB]

(2009) Alon Lavie & Mark Przybocki: Introduction to the special issue on “Automated metrics for machine translation evaluation”. Machine Translation 23 (2/3), September 2009; pp.69-70. [see publication]

(2009) Lori Levin: Adaptable, community-controlled, language technologies for language maintenance. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.8-11 [PDF, 223KB]

(2009) Zhifei Li, Chris Callison-Burch, Sanjeev Khudanpur, & Wren Thornton: Decoding Joshua: open source, parsing-based machine translation. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.47-56. [PDF, 146KB]

(2009) Daniel Chung Yong Lim & Ian Lane: Language identification for speech-to-speech translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Aaron B. Phillips & Ralf D. Brown: Cunei machine translation platform: system description. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.29-36. [PDF, 109KB]; presentation [PDF of PPT, 1356KB]

(2009) Ashish Venugopal & Andreas Zollmann: Grammar based statistical MT on Hadoop: an end-to-end toolkit for large scale PSCFG based MT. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.67-78. [PDF, 142KB]

(2009) Ashish Venugopal, Andreas Zollmann, Noah A.Smith, & Stephan Vogel: Preference grammars: softening syntactic constraints to improve statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 212KB]

(2009) Ying Zhang & Nguyen Bach: Virtual Babel: towards context-aware machine translation in virtual worlds. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 387-394. [PDF, 290KB]

(2008) Abhaya Agarwal & Alon Lavie: Meteor, M-BLEU and M-TER: Evaluation metrics for high-correlation with human rankings of machine translation output.  ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.115-118. [PDF, 148KB]

(2008) Vamshi Ambati & Alon Lavie: Improving syntax driven translation models by re-structuring divergent and non-isomorphic parse tree structures. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.235-244 [PDF, 734KB]

(2008) Nguyen Bach, Qin Gao, & Stephan Vogel: Improving word alignment with language model based confidence scores. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.151-154. [PDF, 607KB]

(2008) Ralf D.Brown: Exploiting document-level context for data-driven machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.46-55. [PDF, 496KB]

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Inductive detection of language features via clustering minimal pairs: towards feature-rich grammars in machine translation.  Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.78-86. [PDF, 246KB]

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Toward active learning in data selection: automatic discovery of language features during elicitation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2008) Matthias Eck, Stephan Vogel, & Alex Waibel: Communicating unknown words in machine translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 303KB]

(2008) Kevin Gimpel & Noah A. Smith: Rich source-side context for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.9-17. [PDF, 211KB]

(2008) Qin Gao & Stephan Vogel: Parallel implementations of word alignment tool. ACL-08 HLT: Software Engineering, Testing, and Quality Assurance for Natural Language Processing, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp.49-57. [PDF, 340KB]

(2008) Greg Hanneman, Edmund Huber, Abhaya Agarwal, Vamshi Ambati, Alok Parlikar, Erik Peterson, & Alon Lavie: Statistical transfer systems for French-English and German-English machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.163-166. [PDF, 72KB]

(2008) Almut Silja Hildebrand & Stephan Vogel: Combination of machine translation systems via hypothesis selection from combined n-best lists. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.254-261 [PDF, 681KB]

(2008) Almut Silja Hildebrand, Kay Rottmann, Mohamed Noamany, Qin Gao, Sanjika Hewavitharana, Nguyen Bach, & Stephan Vogel: Recent improvements in the CMU large scale Chinese-English SMT system. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 77-80. [PDF, 117KB]

(2008) Muntsin Kolss, Matthias Wölfel, Florian Kraft, Jan Niehues, Matthias Paulik, & Alex Waibel: Simultaneous German-English lecture translation. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.174-181 [PDF, 214KB]; presentation [PDF, 205KB]

(2008) Muntsin Kolss, Stephan Vogel, & Alex Waibel: Stream decoding for simultaneous spoken language translation. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2735-2738; abstract [PDF, 58KB]

(2008) Ni Lao, Hideki Shima, Teruko Mitamura & Eric Nyberg: Query expansion and machine translation for robust cross-lingual information retrieval. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp.140-147. [PDF, 545KB]

(2008) Alon Lavie, Alok Parlikar, & Vamshi Ambati: Syntax-driven learning of sub-sentential translation equivalents and translation rules from parsed parallel corpora. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.87-95. [PDF, 1610KB]

(2008) Teruko Mitamura, Eric Nyberg, Hideki Shima, Tsuneaki Kato, Tatsunori Mori, Chin-Yew Lin,  Ruihua Song, Chuan-Jie Lin, Tetsuya Sakai, Donghong Ji, & Noriko Kando: Overview of the NTCIR-7 ACLIA tasks: advanced cross-lingual information access. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp.11-25. [PDF, 1062KB]

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2008) Udhyakumar Nallasamy, Alan W.Black, Tanja Schultz, Robert Frederking, & Jerry Weltman: Speech translation for triage of emergency phonecalls in minority languages. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.48-53. [PDF, 229KB]

(2008) Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, & Robert Frederking: NineOneOne: recognizing and classifying speech for handling minority language emergency calls.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 95KB]

(2008) ThuyLinh Nguyen & Stephan Vogel: Context-based Arabic morphological analysis for machine translation. CoNLL: proceedings of the  Twelfth Conference on Computational Natural Language Learning, 16-17 August 2008, Manchester, UK; pp.135-142. [PDF, 123KB]

(2008) Jan Niehues & Stephan Vogel: Discriminative word alignment via alignment matrix modelling. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.18-25. [PDF, 576KB]

(2008) Tim Schlippe, ThuyLinh Nguyen, & Stephan Vogel: Diacritization as a machine translation problem and as a sequence labeling problem. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.270-278 [PDF, 701KB]

(2008) Tanja Schultz: Rapid language adaptation tools and technologies for multilingual speech processing systems. [abstract of] Invited talk at: First International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU-2008), Hanoi, Vietnam, 5-7 May 2008; 1p. [PDF, 51KB]

(2008) Rohini U, Vamshi Ambati, & Vasudev Varma: Statistical machine translation models for personalized search. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.521-528. [PDF, 432KB]

(2008) Ashish Venugopal, Andreas Zollmann, Noah A. Smith, & Stephan Vogel: Wider pipelines: n-best alignments and parses in MT training. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.192-201. [PDF, 614KB]

(2008) Clare R. Voss, Jamal Laoudi, & Jeffrey Micher: Exploitation of an Arabic language resource for MT evaluation: using Buckwalter-based lookup tool to augment CMU alignment algorithm.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 246KB]

(2008) Andreas Zollmann, Ashish Venugopal, & Stephan Vogel: The CMU syntax-augmented machine translation system: SAMT on Hadoop with n-best alignments.  IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 18-25. [PDF, 208KB]; presentation [PDF, 109KB]

(2007) Alison Alvarez, Lori Levin, Robert Frederking, & Jill Lehman: An assessment of language elicitation without the supervision of a linguist. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.1-10 [PDF, 415KB]; presentation [PDF, 412KB]

(2007) Nguyen Bach, Matthais Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, ThuyLinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, & Alan W.Black: The CMU TransTac 2007 eyes-free two-way speech-to-speech translation system. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 489KB]; presentation [PDF, 1043KB]

(2007) Yi Chang, Ying Zhang, Stephan Vogel, & Jie Yang: Enhancing image-based Arabic document translation using noisy channel correction model. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.89-95 [PDF, 157KB]

(2007) Matthias Eck, Stephan Vogel, & Alex Waibel: Estimating phrase pair relevance for translation model pruning. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.159-165 [PDF, 157KB]

(2007) Matthias Eck, Stephan Vogel, & Alex Waibel: Translation model pruning via usage statistics for statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.21-24 [PDF, 93KB]

(2007) Ariadna Font Llitjós, Jaime Carbonell, & Alon Lavie: Improving transfer-based MT systems with automatic refinements. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.183-190 [PDF, 339KB]

(2007) Ariadna Font Llitjós & Stephan Vogel: A walk on the other side: adding statistical components to a transfer-based translation system. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.72-79 [PDF, 113KB]

(2007) Ariadna Font Llitjós & William A. Ridmann: The inner works of an automatic rule refiner for machine translation. METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 10pp. [PDF, 412KB]

(2007) Christian Fügen, Alex Waibel, & Muntsin Kolss: Simultaneous translation of lectures and speeches [abstract].  Machine Translation 21 (4) December 2007; pp.209-252.

(2007) Sanjika Hewavitharana, Alon Lavie, & Stephan Vogel: Experiments with a noun-phrase driven statistical machine translation system. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.247-253 [PDF, 241KB]

(2007) Jae Dong Kim & Stephan Vogel: Iterative refinement of lexicon and phrasal alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.281-288 [PDF, 121KB]

(2007) Ian Lane, Andreas Zollmann, Thuy Linh Nguyen, Nguyen Bach, Ashish Venugopal, Stephan Vogel, Kay Rottmann, Ying Zhang, & Alex Waibel: The CMU-UKA statistical machine translation systems for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 7pp. [PDF, 150KB]; presentation [PDF, 393KB]

(2007) Alon Lavie & Abhaya Agarwal: METEOR: an automatic metric for MT evaluation with high levels of correlation with human judgments. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 228-231 [PDF, 159KB]

(2007) Einat Minkov, Kristina Toutanova, & Hisami Suzuki: Generating complex morphology for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 128-135 [PDF, 219KB]

(2007) Teruko Mitamura, Frank Lin, Hideki Shima, Mengqiu Wang, Jeongwoo Ko, Justin Betteridge, Matthew Bilotti, Andrew Schlaikjer & Eric Nyberg: JAVELIN III: cross-lingual question answering from Japanese and Chinese documents.  Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; 8pp. [PDF, 426KB]

(2007) Kemal Oflazer & İlknur Durgar El-Kahlout: Exploring different representational units in English-to-Turkish statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 25-32 [PDF, 447KB]

(2007) M. Paulik, K.Rottmann, J.Niehues, S.Hildebrand, & S.Vogel: The ISL phrase-based MT system for the 2007 ACL Workshop on statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 197-202 [PDF, 175KB]

(2007) Aaron B.Phillips: Sub-phrasal matching and structural templates in example-based MT. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.163-170 [PDF, 295KB]

(2007) Aaron B. Phillips, Violetta Cavalli-Sforza, & Ralf D. Brown: Improving example-based machine translation through morphological generalization and adaptation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.369-375 [PDF, 115KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Improving spoken language translation by automatic disfluency removal: evidence from conversational speech transcripts. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.385-389 [PDF, 140KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Optimizing sentence segmentation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2845-2848; abstract [PDF, 23KB]

(2007) Kay Rottmann & Stephan Vogel: Word reordering in statistical machine translation with a POS-based distortion model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.171-180 [PDF, 3725KB]; presentation [PDF, 1129KB]

(2007) Tanja Schultz, Alan W.Black, Sameer Badaskar, Matthew Hornyak, & John Kominek: SPICE: web-based tools for rapid language adaptation in speech processing systems. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2125-2128; abstract [PDF, 23KB]

(2007) Yik-Cheung Tam, Ian Lane, & Tanja Schultz: Bilingual-LSA based LM adaptation for spoken language translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 520-527 [PDF, 162KB]

(2007) Yik-Cheung Tam, Ian Lane, & Tanja Schultz: Bilingual LSA-based adaptation for statistical machine translation [abstract].  Machine Translation 21 (4) December 2007; pp.187-207.

(2007) Yik-Cheung Tam & Tanja Schultz: Bilingual LSA-based translation lexicon adaptation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2461-2464; abstract [PDF, 23KB]

(2007) Ashish Venugopal, Andreas Zollmann, & Stephan Vogel: An efficient two-pass approach to synchronous-CFG driven statistical MT. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.500-507 [PDF, 279KB]

(2007) Ashish Venugopal & Andreas Zollmann: Hierarchical and syntax structured MT. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 52pp. [PDF of PPT presentation, 383KB]

(2007) Ying Zhang & Stephan Vogel: PanDoRA: a large-scale two-way statistical machine translation system for hand-held devices. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.543-550 [PDF, 472KB]

(2007) Bing Zhao, Nguyen Bach, Ian Lane & Stephan Vogel: A log-linear block transliteration model based on bi-stream HMMs. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.364-371 [PDF, 453KB]

(2007) Andreas Zollmann, Ashish Venugopal, Matthias Paulik, & Stephan Vogel: The syntax augmented MT (SAMT) system for the shared task in the 2007 ACL workshop on statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 216-219 [PDF, 163KB]

(2006) Jaime Carbonell: contribution to panel on “Hybrid machine translation”.  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 56KB]

(2006) Violetta Cavalli-Sforza: Machine translation for social impact (introduction to panel on “Machine translation for social impact”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 64KB]

(2006) Violetta Cavalli-Sforza & Alon Lavie: Hybrid machine translation: why and how? (introduction to panel on “Hybrid machine translation”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 64KB]

(2006) Matthias Eck, Ian Lane, Nguyen Bach, Sanjika Hewavitharana, Muntsin Kolss, Bing Zhao, Almut Silja Hildebrand, Stephan Vogel, & Alex Waibel: The UKA/CMU statistical machine translation system for IWSLT 2006. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 130-137 [PDF, 170KB]

(2006) Matthias Eck, Stephan Vogel, & Alex Waibel: A flexible online server for machine translation evaluation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.89-94 [PDF, 111KB]

(2006) Ariadna Font-Llitjós: Can the Internet help improve machine translation?  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 219-222 [PDF, 135KB]

(2006) Rashmi Gangadharaiah, Ralf Brown & Jaime Carbonell: Spectral clustering for example based machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 41-44 [PDF, 63KB]

(2006) Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W,Black, Tanja Schultz, & Alex Waibel: Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1712; abstract [PDF, 94KB]

(2006) Teruko Mitamura, Mengqiu Wang, Hideki Shima, & Frank Lin: Keyword translation accuracy and cross-lingual question answering in Chinese and Japanese. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multilingual Question Answering (MLQA06), Trento, Italy, April 4, 2006; pp.31-38 [PDF, 617KB]

(2006) Christian Monson, Ariadna Font Llitjos, Roberto Aranovich, Lori Levin, Ralf Brown, Eric Peterson, Jaime Carbonell, & Alon Lavie: Building NLP systems for two resource-scarce indigenous languages: Mapudungun and Quechua. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.15-24. [PDF, 124KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) Bing Zhao & Eric P. Xing: BiTAM: bilingual topic admixture models for word alignment. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.969-976. [PDF, 574KB]

(2006) Andreas Zollmann, Ashish Venugopal, Stephan Vogel, & Alex Waibel: The CMU-UKA syntax augmented machine translation system for IWSLT-06. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 138-144 [PDF, 218KB]

(2006) Andreas Zollmann & Ashish Venugopal: Syntax augmented machine translation via chart parsing.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 138-141 [PDF, 99KB]

(2006) Andreas Zollmann, Ashish Venugopal, & Stefan Vogel: Bridging the inflection morphology gap for Arabic statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 201-204 [PDF, 147KB]

(2005) Alison Alvarez, Lori Levin, Robert Frederking, Erik Peterson & Jeff Good: Semi-automated elicitation corpus generation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.388-395. [PDF, 131KB]

(2005) Satanjeev Banerjee & Alon Lavie: METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 65-72.[PDF, 225KB]

(2005) Ralf Brown: Context-sensitive retrieval for example-based translation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.9-15. [PDF, 273KB]

(2005) Ralf D.Brown, Jae Dong Kim, Peter J.Jansen, & Jaime G.Carbonell: Symmetric probabilistic alignment. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp.87-90. [PDF, 109KB]

(2005) Matthias Eck, Stephan Vogel, & Alex Waibel: Low cost portability for statistical machine translation based on n-gram coverage. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.227-234. [PDF, 199KB]

(2005) Matthias Eck & Chiori Hori: Overview of the IWSLT 2005 Evaluation Campaign. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 22pp. [PDF, 256KB]

(2005) Matthias Eck, Stephan Vogel, & Alex Waibel: Low cost portability for statistical machine translation based on n-gram frequency and TF-IDF. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 135KB]

(2005) Herman A. Engelbrecht & Tanja Schultz: Rapid development of an Afrikaans English speech-to-speech translator. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 171KB]

(2005) Ariadna Font Llitjós, Jaime G. Carbonell, & Alon Lavie: A framework for interactive and automatic refinement of transfer-based machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 87-96. [PDF, 468KB]

(2005) Jesús Giménez, Enrike Amigó, & Chiori Hori: Machine translation evaluation inside QARLA. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 91KB]

(2005) Sanjika Hewavitharana, Stephan Vogel, & Alex Waibel: Augmenting a statistical translation system with a translation memory. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 126132. [PDF, 74KB]

(2005) Sanjika Hewavitharana, Bing Zhao, Almut Silja Hildebrand, Matthias Eck, Chiori Hori, Stephan Vogel, & Alex Waibel: The CMU statistical machine translation system for IWSLT2005. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 93KB]

(2005) Almut Silja Hildebrand, Matthias Eck, Stephan Vogel, & Alex Waibel: Adaptation of the translation model for statistical machine translation based on information retrieval. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 133-1142. [PDF, 100KB]

(2005) Fei Huang: Cluster-specific named entity transliteration. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 435-444. [PDF, 169KB]

(2005) Fei Huang, Ying Zhang, & Stephan Vogel: Mining key phrase translations from web corpora. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 483-490. [PDF, 310KB]

(2005) Shyamsundar Jayaraman & Alon Lavie: Multi-engine machine translation guided by explicit word matching.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 101-104. [PDF, 117KB]

(2005) Shyamsundar Jayaraman & Alon Lavie: Multi-engine machine translation guided by explicit word matching. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 143-152. [PDF, 63KB]

(2005) Jae Dong Kim, Ralf D. Brown, Peter J. Jansen, & Jaime G. Carbonell: Symmetric probabilistic alignment for example-based translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 153-159. [PDF, 63KB]

(2005) Lucian Vlad Lita, Monica Rogati, & Alon Lavie: BLANC: learning evaluation metrics for MT. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 33-40. [PDF, 215KB]

(2005) Siddharth Patwardhan, Satanjeev Banerjee & Ted Pedersen: SenseRelate::TargetWord – a generalized framework for word sense disambiguation. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 73-76. [PDF, 54KB]

(2005) M.Paulik, Christian Fügen, Sebastian Stüker, Tanja Schultz, Thomas Schaaf, & Alex Waibel: Document driven machine translation enhanced ASR. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.2261-2264; abstract [PDF, 59KB]

(2005) Ashish Venugopal & Stephan Vogel: Considerations in maximum mutual information and minimum classification error training for statistical machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 271-279. [PDF, 89KB]

(2005) Ashish Venugopal, Andreas Zollmann, & Alex Waibel: Training and evaluating error minimization rules for statistical machine translation.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 208-215. [PDF, 112KB]

(2005) Stephan Vogel: PESA: phrase pair extraction as sentence splitting. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.251-258. [PDF, 192KB]

(2005) Stephan Vogel: Statistical machine translation part I: hands-on introduction. [Tutorial at] IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; p.275. [PDF, 44KB]

(2005) Ying Zhang & Stephan Vogel: Competitive grouping in integrated phrase segmentation and alignment model.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 159-162. [PDF, 437KB]

(2005) Ying Zhang & Stephan Vogel: An efficient phrase-to-phrase alignment model for arbitrarily long phrase and large corpora. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 294-301. [PDF, 187KB]

(2005) Bing Zhao & Stephan Vogel: A generalized alignment-free phrase extraction.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 141-144. [PDF, 174KB]

(2005) Bing Zhao, Niyu Ge, & Kishore Papineni: Inner-outer bracket models for word alignment using hidden blocks. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 177-184. [PDF, 592KB]

(2005) Bing Zhao & Alex Waibel: Learning a log-linear model with bilingual phrase-pair features for statistical machine translation. IJCNLP-05: Fourth SIGHAN  Workshop on Chinese Language Processing. Proceedings of the workshop, 14-15 October 2005, Jeju Island, Korea; pp. 79-86. [PDF, 1301KB]

(2005) Bing Zhao, Eric P.Xing, & Alex Waibel: Bilingual word spectral clustering for statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 25-32. [PDF, 467KB]

Center for Language and Speech Processing (Baltimore)

(2009) Zhifei Li, Chris Callison-Burch, Sanjeev Khudanpur, & Wren Thornton: Decoding Joshua: open source, parsing-based machine translation. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.47-56. [PDF, 146KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren Thornton, Jonathan Weese, & Omar Zaidan: Joshua: open source toolkit for parsing-based machine translation. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 23pp.  [PDF, 933KB]

Center for the Study of Language and Information, Stanford University

(2008) Stephan Oepen: Grammar-based processing and re-usable hybrid MT – an experience report in the wilderness. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 32 slides. [PDF, 254KB]

(2006) Lars Nygaard, Jan Tore Lønning, Torbjørn Nordgård, & Stephan Oepen: Using a bi-lingual dictionary in lexical transfer. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.233-238 [PDF, 133KB]

(2006) Stephan Oepen & Jan Tore Lønning: Discriminant-based MRS banking.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1250-1255 [PDF, 430KB]

(2005) Francis Bond, Stephan Oepen, Melanie Siegel, Ann Copestake, & Dan Flickinger: Open Source machine translation with DELPH-IN  MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.15-22. [PDF, 2700KB]

(2005) Dan Flickinger, Jan Tore Lønning, Helge Dyvik, Stephan Oepen, & Francis Bond: SEM-I rational MT: enriching deep grammars with a semantic interface for scalable machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.165-172. [PDF, 249KB]

(2005) Erik Velldal & Stephan Oepen: Maximum entropy models for realization ranking. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.109-116. [PDF, 217KB]

(2005) Stephan Oepen, Helge Dyvik, Dan Flickinger, Jan Tore Lønning, Paul Meurer, & Victoria Rosén: Holistic regression testing for high-quality MT: some methodological and technological reflections. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 198-204. [PDF, 173KB]

Chicago University

(2009) Sravana Reddy & Sonjia Waxmonsky: Substring-based transliteration with conditional random fields. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.92-95. [PDF, 90KB]

(2006) Irina Matveeva & Gina-Anne Levow: Computing term translation probabilities with generalized latent semantic analysis. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.151-154 [PDF, 90KB]

(2006) Benjamin Wellington, Sonja Waxmonsky, & I.Dan Melamed: Empirical lower bounds on the complexity of translational equivalence. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.977-984. [PDF, 121KB]

City University of New York see New York, City University

Colorado, University of

(2009) Jinho D.Choi, Martha Palmer, & Nianwen Xue: Using parallel Propbanks to enhance word-alignments. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.121-124. [PDF, 139KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

Columbia University

(2009) Marine Carpuat: One translation per discourse. NAACL-HLT-2009 (SEW-2009): Semantic Evaluations: Recent Achievements and Future Directions, Proceedings of the workshop, June 4, 2009, Boulder, Colorado; pp.19-27. [PDF, 112KB]

(2009) Marine Carpuat: Toward using morphology in French-English phrase-based SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.150-154. [PDF, 92KB]

(2009) Jakob Elming, Nizar Habash, & Josep M.Crego: Combination of statistical word alignments based on multiple preprocessing schemes.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.93-110.

(2009) Jakob Elming & Nizar Habash: Syntactic reordering for English-Arabic phrase-based machine translation. Proceedings of the EACL 2009 Workhop on Computational Approaches to Semitic Languages, Athens, Greece, 31 March 2009; pp.69-77. [PDF, 155KB]

(2009) Nizar Habash: REMOOV: a tool for online handling of out-of-vocabulary words in machine translation. MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.217-220. [PDF, 342KB]

(2009) Nizar Habash, Bonnie Dorr, & Christof Monz: Symbolic-to-statistical hybridization: extending generation-heavy machine translation [abstract]. Machine Translation 23 (1), February 2009; pp.23-63.

(2009) Nizar Habash & Jun Hu: Improving Arabic-Chinese statistical machine translation using English as pivot language. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.173-181. [PDF, 265KB]

(2009) Wei-Yun Ma & Kathleen McKeown: Where’s the verb? Correcting machine translation during question answering. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.333-336. [PDF, 170KB]

(2009) Kathleen McKeown: Query-focused summarization using text-to-text generation: when information comes from multilingual sources. [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Language Generation and Summarisation ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; p.3. [PDF, 33KB]

(2009) Kristen Parton, Kathleen R.McKeown, Bob Coyne, Mona T.Diab, Ralph Grishman, Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei Yun Ma, Adam Meyers, Sara Stolbach, Ang Sun, Gokhan Tur, Wei Xu, & Sibel Yaman: Who, what, when, where, why? Comparing multiple approaches to the cross-lingual 5W task. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.423-431. [PDF, 176KB]

(2009) Ibrahim M.Saleh & Nizar Habash: Automatic extraction of lemma-based bilingual dictionaries for morphologically rich languages. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 622KB]

(2008) Josep M. Crego & Nizar Habash: Using shallow syntax information to improve word alignment and reordering for SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.53-61. [PDF, 200KB]

(2008) Nizar Habash: Four techniques for online handling of out-of-vocabulary words in Arabic-English statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 57-60. [PDF, 81KB]

(2008)  Nizar Habash & Hayden Metsky: Automatic learning of morphological variations for handling out-of-vocabulary terms in Urdu-English machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.107-116. [PDF, 551KB]

(2007) Mona Diab, Mahmoud Ghoneim, & Nizar Habash: Arabic diacritization in the context of statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.143-149 [PDF, 105KB]

(2007) Mona Diab & Nizar Habash: Arabic dialect processing tutorial. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.5-6 [PDF, 122KB]

(2007) Jakob Elming & Nizar Habash: Combination of statistical word alignments based on multiple preprocessing schemes. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.25-28 [PDF, 123KB]

(2007) Nizar Habash: Syntactic preprocessing for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.215-222 [PDF, 118KB]

(2007) Nizar Habash: NLG is still relevant to MT.  MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.64-65 [PDF, 478KB]

(2007) Nizar Habash & Owen Rambow: Arabic diacritization through full morphological tagging. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.53-56 [PDF, 99KB]

(2007) Katrin Kirchhoff, Owen Rambow, Nizar Habash, & Mona Diab: Semi-automatic error analysis for large-scale statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.289-296 [PDF, 192KB]

(2006) Mona Diab & Nizar Habash: Arabic dialect processing.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 61pp. [PDF of PPT presentation, 1514KB]

(2006) Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, & Ignacio Thayer: Scalable inference and training of context-rich syntatic translation models.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.961-968. [PDF, 438KB]

(2006) Nizar Habash, Bonnie Dorr & Christof Monz: Challenges in building an Arabic-English GHMT system with SMT components. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.56-65 [PDF, 116KB]

(2006) Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh, & Páraic Sheridan: Design, construction and validation of an Arabic-English conceptual interlingua for cross-lingual information retrieval. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.107-112 [PDF, 408KB]

(2006) Nizar Habash & Owen Rambow: MAGEAD: a morphological analyzer and generator for the Arabic dialects. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.681-688. [PDF, 112KB]

(2006) Nizar Habash & Fatiha Sadat: Arabic preprocessing schemes for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 49-52 [PDF, 72KB]

(2006) Nizar Habash: contribution to panel on “Hybrid machine translation”.  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 230KB]

(2006) Sameer Maskey, Bowen Zhou, & Yuqing Gao: A phrase-level machine translation approach for disfluency detection using weighted finite state transducers. Interspeech 2006: ICSLP Ninth International Conference on Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1886; abstract [PDF, 81KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) Fatiha Sadat & Nizar Habash: Combination of Arabic preprocessing schemes for statistical machine translation.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1-8. [PDF, 104KB]

(2005) Advaith Siddharthan & Kathleen McKeown: Improving multilingual summarization: using redundancy in the input to correct MT errors.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 33-40. [PDF, 255KB]

(2005) Nizar Habash & Owen Rambow: Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 573-580. [PDF, 107KB]

(2005) Nizar Habash, Owen Rambow & George Kiraz: Morphological analysis and generation for Arabic dialects.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 17-24. [PDF, 121KB]

(2005) Martin Jansche: Treebank transfer. IWPT 2005: Proceedings of the Ninth International Workshop on Parsing Technologies, Vancouver, October 2005; pp.74-82. [PDF, 120KB]

Common Sense Advisory Inc.

(2009) Donald A. DePalma: The business case for machine translation. [Tutorial at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26, 2009, Ottawa, Ontario, Canada; 27pp. [PDF of PPT presentation, 481KB]

(2009) Donald A.DePalma: Obama White House calls for machine translation. Global Watchtower (Washington, DC: Common Sense Advisory Inc., 27 Sep 2009); 1 p. [PDF, 12KB]

(2006) Donald A. DePalma: Evaluating MT prior to deployment.  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; presentation on User Track [PDF of PPT presentation, 226KB]

Cornell University

(2007) Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondřej Bojar, Alexandra Constantin, Evan Herbst: Moses: open source toolkit for statistical machine translation. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 177-180 [PDF, 119KB]

Corporation for National Research Initiatives

(2008) Allison L.Powell & Allison Blodgett: The use of machine-generated transcripts during human translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.427-434. [PDF, 649KB]

CSLI see Center for the Study of Language and Information

DARPA see Defense Advanced Research Projects Agency

Defense Advanced Research Projects Agency

(2008) Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, & Haejoong Lee: Management of large annotation projects involving multiple human judges: a case study of GALE machine translation post-editing. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 144KB]

(2008) Gregory A. Sanders, Sébastien Bronsart, Sherri Condon, & Craig Schlenoff: Odds of successful transfer of low-level concepts: a key metric for bidirectional speech-to-speech machine translation in DARPA’s TRANSTAC program.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 177KB]

(2008) Clare R. Voss, Jamal Laoudi, & Jeffrey Micher: Exploitation of an Arabic language resource for MT evaluation: using Buckwalter-based lookup tool to augment CMU alignment algorithm.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 246KB]

(2008) Brian A. Weiss, Craig Schlenoff, Greg Sanders, Michelle P.Steves, Sherri Condon, Jon Phillips, & Dan Parvaz: Performance evaluation of speech translation systems.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 123KB]

(2006) Joseph Olive: Language research at DARPA. AMTA 2006:  7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; invited talk [PDF of PPT presentation, 1045KB]

Defense Intelligence Agency

(2008) Nicholas Bemish: Can MT really help the Department of Defense? [Keynote presentation at] AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; 4pp. [PDF, 473KB]

Defense Language Institute

(2007) Douglas Jones, Martha Herzog, Hussny Ibrahim, Arvind Jairam, Wade Shen, Edward Gibson, & Michael Emonts: ILR-based MT comprehension test with multi-level questions. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.77-80 [PDF, 67KB]

(2006) Douglas Jones, Wade Shen, Brian Delaney, Martha Herzog, Michael Emonts, Sabine Atwell, James Dirgin, Neil Granoien, Sargon Jabri, Jurgen Sottung, Timothy Anderson, & Timothy Hunter: Toward an Interagency Language Roundtable based assessment of speech-to-speech translation capabilities. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.82-89 [PDF, 150KB]

(2005) Douglas Jones, Edward Gibson, Wade Shen, Neil Granoien, Martha Herzog, Douglas Reynolds, & Clifford Weinstein: Measuring human readability of machine generated text: three case studies in speech recognition and machine translation. Proceedings of 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 18-23, 2005, Philadelphia, PA, USA; vol.5, pp. 1009-1012 [PDF, 233KB]

Department of Defense

(2008) Alexander E. Richman & Patrick Schone: Mining Wiki resources for multilingual named entity recognition. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1-9. [PDF, 134KB]

Digital Silk Road LLC

(2009) Achim Ruopp: BigTM.net: gaining a head start on translation projects. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 482-488. [PDF of PPT presentation, 1371KB]

Dolores Labs

(2008) Manny Rayner, Pierrette Bouillon, Glenn Flores, Farzad Ehsani, Marianne Starlander, Beth Ann Hockey, Jane Brotanek, & Lukas Biewald: A small-vocabulary shared task for medical speech translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.60-63. [PDF, 62KB]

Education Testing Service

(2005) Jill Burstein & Daniel Marcu: Translation exercise assistant: automated generation of translation exercises for native-Arabic speakers learning English. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 16-17. [PDF, 53KB]

ENLASCO Corp

(2006) Christian Lieske, Felix Sasaki, & Yves Savourel: W3C internationalization tag set – a gentle introduction.  Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 22pp. [PDF, 126KB]; presentation, 39 slides [PDF, 1406KB]

Fair Isaac Corporation

(2007) Dayne Freitag & Shahram Khadivi: A sequence alignment model based on the averaged perceptron. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 238-247. [PDF, 180KB]

Fluential Inc.

(2008) Farzad Ehsani, Jim Kimzey, Elaine Zuber, Demitrios Master, & Karen Sudre: Speech to speech translation for nurse patient interaction. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.54-59. [PDF, 92KB]

(2008) Manny Rayner, Pierrette Bouillon, Glenn Flores, Farzad Ehsani, Marianne Starlander, Beth Ann Hockey, Jane Brotanek, & Lukas Biewald: A small-vocabulary shared task for medical speech translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.60-63. [PDF, 62KB]

Ford Motor Company

(2006) Nestor Rychtyckyj: Standard language at Ford Motors: a case study in controlled language development and deployment. CLAW 2006: 5th International Workshop on Controlled Language Applications, Cambridge, MA, USA, August 12, 2006; 10pp. [PDF, 70KB]

Fordham University

(2009) Elena Filatova: Directions for exploiting asymmetries in multilingual Wikipedia. NAACL-HLT-2009: The Third International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3), Proceedings of the Workshop, June 4, 2009, Boulder, Colorado; pp.30-37. [PDF, 114KB]

Ge’ez Frontier Foundation

(2006) Daniel Jacob: Unicode for under-resourced languages. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.33-38. [PDF, 176KB]

Georgetown University

(2009) Ibrahim M.Saleh & Nizar Habash: Automatic extraction of lemma-based bilingual dictionaries for morphologically rich languages. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 622KB]

George Washington University

(2009) Yilu Zhou: Maximum n-gram HMM-based name transliteration: experiment in NEWS 2009 on English-Chinese corpus. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.128-131. [PDF, 244KB]

Georgia Institute of Technology

(2009) Yong Zhao & Xiaodong He: Using n-gram based features for machine translation system combination.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.205-208. [PDF, 357KB]

(2007) Eric Nichols, Francis Bond, Darren Scott Appling, & Yuji Matsumoto: Combining resources for open source machine translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.134-143 [PDF, 404KB]; poster [PDF, 67KB]

Google, Inc.

(2009) Dmitriy Genzel, Klaus Macherey & Jakob Uszkoreit: Creating a high-quality machine translation system for a low-resource language: Yiddish. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 41-48. [PDF, 157KB]

(2009) Jan Hajič, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Marti, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štěpánek, Pavel Straňák, Mihai Surdeanu, Naiwen Xue, & Yi Zhang: The CoNLL-2009 Shared task: syntactic and semantic dependencies in multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.1-18. [PDF, 220KB]

(2009) Alon Halevy, Peter Norvig, & Fernando Pereira: The unreasonable effectiveness of data. IEEE Intelligent Systems, March-April 2009; pp.8-12. [PDF, 376KB]

(2009) Liang Huang, Wenbin Jiang & Qun Liu: Bilingually-constrained (monolingual) shift-reduce parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1222-1231. [PDF, 169KB]

(2009) Liang Huang, Hao Zhang, David Gildea & Kevin Knight: Binarization of synchronous context-free grammars. Computational Linguistics 35 (4), pp. 559-595 [PDF, 837KB]

(2009) Martin Jansche & Richard Sproat: Named entity transcription with pair n-gram models. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.32-35. [PDF, 113KB]

(2009) Shankar Kumar, Wolfgang Macherey, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Mausam, Stephen Soderland, Oren Etzioni, Daniel S.Weld, Michael Skinner & Jeff Bilmes: Compiling a massive, multilingual dictionary via probabilistic inference. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.262-270. [PDF, 704KB}

(2009) Roy Tromble & Jason Eisner: Learning linear ordering problems for better translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1007-1016. [PDF, 178KB]

(2009) Ashish Venugopal & Andreas Zollmann: Grammar based statistical MT on Hadoop: an end-to-end toolkit for large scale PSCFG based MT. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.67-78. [PDF, 142KB]

(2009) Peng Xu, Jaeho Kang, Michael Ringgard, & Franz Och: Using a dependency parser to improve SMT for subject-object-verb languages. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.245-253. [PDF, 273KB]

(2008) Jason Katz-Brown & Michael Collins: Syntactic reordering in preprocessing for Japanese®English translation: MIT system description for NTCIR-7 patent translation task. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 409-414. [PDF, 424KB]

(2008) Dekang Lin, Shaojun Zhao, Benjamin Van Durme, & Marius Paşca: Mining parenthetical translations from the web by word alignment. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 994-1002. [PDF, 419KB]

(2008) Wolfgang Macherey, Franz Josef Och, Ignacio Thayer, & Jakob Uszkoreit: Lattice-based minimum error rate training for statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.725-734. [PDF, 304KB]

(2008) Stefan Riezler, Yi Liu, & Alexander Vasserman: Translating queries into snippets for improved query expansion.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.737-744. [PDF, 137KB]

(2008) Roy W.Tromble, Shankar Kumar, Franz Och, & Wolfgang Macherey: Lattice minimum Bayes-risk decoding for statistical machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.620-629. [PDF, 285KB]

(2008) Jakob Uszkoreit & Thorsten Brants: Distributed word clustering for large scale class-based language modelling in machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 755-762. [PDF, 222KB]

(2008) Andreas Zollmann, Ashish Venugopal, Franz Och, & Jay Ponte: A systematic comparison of phrase-based, hierarchical and syntax-augmented statistical MT. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1145-1152. [PDF, 153KB]

(2007) Thorsten Brants, Ashok C.Popat, Peng Xu, Franz J.Och, & Jeffrey Dean: Large language models in machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 858-857. [PDF, 297KB]

(2007) Colin Cherry & Dekang Lin: Inversion transduction grammar for joint phrasal translation modeling. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.17-24 [PDF, 178KB]

(2007) Shankar Kumar, Franz Och, & Wolfgang Macherey: Improving word alignment with bridge languages. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 42-50. [PDF, 183KB]

(2007) Wolfgang Macherey & Franz Josef Och: An empirical study on computing consensus translations from multiple machine translation systems. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 986-995. [PDF, 169KB]

(2007) Stefan Riezler, Alexander Vasserman, Ioannis Tsochantaridis, Vibhu Mittal, & Yi Liu: Statistical machine translation for query expansion in answer retrieval. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 464-471 [PDF, 104KB]

(2006) Colin Cherry & Dekang Lin: Soft syntactic constraints for word alignment through discriminative training. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.105-112. [PDF, 153KB]

(2006) Colin Cherry & Dekang Lin: A comparison of syntactically motivated word alignment spaces. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.145-152 [PDF, 150KB]

(2005) Franz Josef Och: Statistical machine translation: foundations and recent advances MT Summit X, Phuket, Thailand, September 12, 2005, Tutorial Note. [PDF of PPT presentation, 2436KB]

Grape City Inc.

(2006) Jianfeng Gao, Mu Li, Andi Wu, & Chang-Ning Huang: Chinese word segmentation and named entity recognition: a pragmatic approach. Computational Linguistics 31 (4), pp. 531-574. [PDF, 1638KB]

Harvard University

(2007) Rebecca Nesson & Stuart M.Shieber: Extraction phenomena in synchronous TAG syntax and semantics. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.9-16 [PDF, 696KB]

(2007) Stuart M.Shieber: Probabilistic synchronous tree-adjoining grammars for machine translation: the argument from bilingual dictionaries. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.88-95 [PDF, 152KB]

(2006) Rebecca Nesson, Stuart Shieber & Alexander Rush: Induction of probabilistic synchronous tree-insertion grammars for machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.128-137 [PDF, 376KB]

HRL Laboratories LLC

(2005) Emil Ettelaie, Sudeep Gandhe, Panayiotis Georgiou, Kevin Knight, Daniel Marcu, Shrikanth Narayanan, David Traum, & Robert Belvin: Transonics: a practical speech-to-speech translator for English-Farsi medical dialogues.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 89-92  [PDF, 352KB]

IBM see International Business Machines

ICSI

(2007) Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, & Hermann Ney: Improving speech translation with automatic boundary prediction. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2449-2452; abstract [PDF, 61KB]

Illinois, University of

(2009) Patricia Phillips Batoma, Roxana Girju, Elizabeth Lowe, & Patricia Minacori: Educating and assessing the human translator in an age of technology. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 457-458. [PDF, 81KB]

(2009) Ming-Wei Chang, Dan Goldwasser, Dan Roth, & Yuancheng Tu: Unsupervised constraint driven learning for transliteration discovery. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.299-307. [PDF, 580KB]

(2009) Andrea Gesmundo, James Henderson, Paola Merlo, & Ivan Titov: A latent variable model of synchronous syntactic-semantic parsing for multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.37-42. [PDF, 120KB]

(2009) Alexandre Klementiev & Dan Roth: Names entity transliteration and discovery in multilingual corpora.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.79-92.

(2009) Sebastian Riedel & James Clarke: Revisiting optimal decoding for machine translation IBM Model 4. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.5-8. [PDF, 112KB]

(2008) Michael Gamon, Jianfeng Gao, Chris Brockett, Alexandre Klementiev, William B.Dolan, Dmitriy Belenko, & Lucy Vanderwende: Using contextual speller techniques and language modelling for ESL error correction. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.449-456. [PDF, 475KB]

(2008) Dan Goldwasser & Dan Roth: Active sample selection for named entity transliteration. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 53-56. [PDF, 538KB]

(2008) Dan Goldwasser & Dan Roth: Transliteration as constrained optimization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.353-362. [PDF, 645KB]

(2007) Paul C. Davis, Zhuli Xie, & Kevin Small: All links are not the same: evaluating word alignments for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.119-126 [PDF, 122KB]

(2007) Roxana Girju: Improving the interpretation of noun phrases with cross-linguistic information. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 568-575 [PDF, 156KB]

(2007) Alla Rozovskaya & Richard Sproat: Multilingual word sense disambiguation: a comparative cross-linguistic study. ACL 2007: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, June 29, 2007, Prague, Czech Republic; pp.82-87. [PDF, 122KB]

(2007) Su-Youn Yoon, Kyoung-Young Kim, & Richard Sproat: Multilingual transliteration using feature based phonetic method. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 112-119 [PDF, 269KB]

(2006) Alexandre Klementiev & Dan Roth: Weakly supervised named entity transliteration and discovery from multilingual comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.817-824. [PDF, 173KB]

(2006) Alla Rozovskaya, Richard Sproat, & Elabbas Benmamoun: Challenges in processing colloquial Arabic. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.4-14. [PDF, 253KB]

(2006) Richard Sproat, Tao Tao, & ChenXiang Zhai: Named entity transliteration with comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.73-80. [PDF, 232KB]

(2006) Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat, & ChengXiang Zhai: Unsupervised name entity transliteration using temporal and phonetic correlation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 250-257. [PDF, 139KB]

(2006) Raghavendra Udupa U. & Hemanta K.Maji: Computational complexity of statistical machine translation. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.25-32 [PDF, 142KB]

Illinois, University (Urbana-Champaign)

(2007) Dan Goldwasser, Ming-Wei Chang, Yuancheng Tu & Dan Roth: Constraint-driven transliteration discovery. In: Recent Advances in Natural Language Processing: selected papers from RANLP 2007, ed. Nicolas Nikolov, Galia Angelova, Ruslan Mitkov, John Benjamins Publ.Co.: Amsterdam/Philadelphia, 2000; pp.21-40. [PDF, 525KB]

Indiana, University of

(2009) Michael Gasser: Semitic morphological analysis and generation using finite state transducers with feature structures. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.309-317. [PDF, 217KB]

(2006) David M.Rojas & Takako Aikawa: Predicting MT quality as a function of the source language.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2534-2537 [PDF, 613KB]

Information Sciences Institute, University of Southern California

(2009) David Chiang, Kevin Knight, & Wei Wang: 11,001 new features for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.218-226. [PDF, 114KB]

(2009) Steve DeNeefe & Kevin Knight: Synchronous tree adjoining machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.727-736. [PDF, 398KB]

(2009) John DeNero, David Chiang, & Kevin Knight: Fast consensus decoding over translation forests. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.567-575. [PDF, 650KB]

(2009) Ulf Hermjakob: Improved word alignment with statistics and linguistic heuristics. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.229-237. [PDF, 107KB]

(2009) Liang Huang, Hao Zhang, David Gildea & Kevin Knight: Binarization of synchronous context-free grammars. Computational Linguistics 35 (4), pp. 559-595 [PDF, 837KB]

(2009) Kevin Knight: Automata for transliteration and machine translation [abstract]. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; p.27. [PDF, 34KB]

(2009) Kevin Knight & Philipp Koehn: Topics in statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Tutorial abstracts, Suntec, Singapore, 2 August 2009; p.2. [PDF, 57KB]

(2009) Kevin Knight & Richard Sproat: Writing systems, transliteration and decipherment.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Tutorials, Boulder, Colorado, May 31 - June 5, 2009; pp.15-16. [PDF, 68KB]

(2009) Michael Pust & Kevin Knight: Faster MT decoding through pervasive laziness.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.141-144. [PDF, 108KB]

(2009) Sujith Ravi & Kevin Knight: Learning phoneme mappings for transliteration without parallel data. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 198KB]

(2008) David Chiang, Steve DeNeefe, Yee Seng Chan & Hwee Tou Ng: Decomposability of translation metrics for improved evaluation and efficient algorithms.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.610-619. [PDF, 316KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Victoria Fossum & Kevin Knight: Using bilingual Chinese-English word alignments to resolve PP-attachment ambiguity in English. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.245-253 [PDF, 595KB]

(2008) Victoria Fossum, Kevin Knight, & Steven Abney: Using syntax to improve word alignment precision for syntax-based machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.44-52. [PDF, 150KB]

(2008) Ulf Hermjakob, Kevin Knight, & Hal Daumé III: Name translation in statistical machine translation: learning when to transliterate. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 389-397. [PDF, 107KB]

(2008) Sujith Ravi & Kevin Knight: Attacking decipherment problems optimally with low-order n-gram models.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.812-819. [PDF, 659KB]

(2008) Hao Zhang, Daniel Gildea, & David Chiang: Extracting synchronous grammar rules from word-level alignments in linear time. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1081-1088. [PDF, 181KB]

(2008) Jingbo Zhu, Huizhen Wang, & Eduard Hovy: Learning a stopping criterion for active learning for word sense disambiguation and text classification. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.366-372. [PDF, 421KB]

(2007) Yee Seng Chan, Hwee Tou Ng, & David Chiang: Word sense disambiguation improves statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 33-40 [PDF, 516KB]

(2007) David Chiang: Hierarchical phrase-based translation. Computational Linguistics 33 (2), pp. 202-228. [PDF, 597KB]

(2007) Steve DeNeefe, Kevin Knight, Wei Wang & Daniel Marcu: What can syntax-based MT learn from phrase-based MT? EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 755-763. [PDF, 240KB]

(2007) Alexander Fraser & Daniel Marcu: Getting the structure right for word alignment: LEAF. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 51-60. [PDF, 669KB]

(2007) Alexander Fraser & Daniel Marcu: Measuring word alignment quality for statistical machine translation. Computational Linguistics 33 (3), pp. 293-303. [PDF, 163KB]

(2007) Eduard Hovy: Investigating why BLEU penalizes non-statistical systems. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 10pp. [PDF of PPT presentation, 265KB]

(2007) Liang Huang & David Chiang: Forest rescoring: faster decoding with integrated language models. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 144-151 [PDF, 215KB]

(2007) Kevin Knight: Automatic language translation generation help needs badly. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.1-4 [PDF, 1093KB]

(2007) Kevin Knight: Capturing practical natural language transformations [abstract]. Machine Translation 21 (2), June 2007; pp.121-133.

(2007) Kevin Knight & Philipp Koehn: Statistical machine translation. Tutorial at MT Summit XI, 10 September 2007, Copenhagen, Denmark; abstract, 1p. [PDF, 106KB]; part 1 by Philipp Koehn, 128pp. [PDF of PPT presentation, 601KB]; part 2 by Kevin Knight, 180pp. [PDF of PPT presentation, 2292KB]

(2007) Jonathan May & Kevin Knight: Syntactic re-alignment models for machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 360-368. [PDF, 180KB]

(2006) Hal Daumé III & Daniel Marcu: Induction of word and phrase alignments for automatic document summarization. Computational Linguistics 31 (4), pp. 505-530. [PDF, 507KB]

(2006) Alexander Fraser & Daniel Marcu: Semi-supervised training for statistical word alignment. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.769-776. [PDF, 200KB]

(2006) Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, & Ignacio Thayer: Scalable inference and training of context-rich syntatic translation models.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.961-968. [PDF, 438KB]

(2006) Eduard Hovy: A gentle introduction to ontologies.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 60pp. [PDF of PPT presentation, 2050KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

(2006) Bryant Huang & Kevin Knight: Relabeling syntax trees to improve syntax-based machine translation quality.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 240-247 [PDF, 747KB]

(2006) Liang Huang, Kevin Knight & Aravind Joshi: Statistical syntax-directed translation with extended domain of locality. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.66-73 [PDF, 162KB]

(2006) Liang Huang, Kevin Knight, & Aravind Joshi: A syntax-directed translator with extended domain of locality. HLT-NAACL-2006: Computationally Hard Problems and Joint Inferenec in Speech and Language Processing. Proceedings of the workshop, 9 June 2006, New York City, NY, USA; pp.1-8. [PDF, 120KB]

(2006) Kevin Knight, Anish Nair, Nishit Rathod, & Kenji Yamada: Unsupervised analysis for decipherment problems. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.499-506. [PDF, 200KB]

(2006) Daniel Marcu & Alan K.Melby: Data-driven machine translation: a conversation with linguistics and translation studies. AMTA 2006:  7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; invited talk [PDF of PPT presentation, 998KB]; expanded version with annotations and revisions, January 2008; PDF, 768KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Extracting parallel sub-sentential fragments from non-parallel corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.81-88. [PDF, 1598KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics 31 (4), pp. 477-504 [PDF, 1060KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) Radu Soricut & Daniel Marcu: Stochastic language generation using WIDL-expressions and its application in machine translation and summarization.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1105-1112. [PDF, 233KB]

(2006) Hao Zhang, Liang Huang, Daniel Gildea, & Kevin Knight: Synchronous binarization for machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 256-263 [PDF, 206KB]

(2006) Liang Zhou, Chin-Yew Lin, & Eduard Hovy: Re-evaluating machine translation results with paraphrase support.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 77-84. [PDF, 324KB]

(2005) Steve DeNeefe & Kevin Knight: ISI’s 2005 statistical machine translation entries. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 4pp. [PDF, 83KB]

(2005) Steve DeNeefe, Kevin Knight, & Hayward H. Chan: Interactively exploring a machine translation model. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 97-100. [PDF, 123KB]

(2005) Emil Ettelaie, Sudeep Gandhe, Panayiotis Georgiou, Kevin Knight, Daniel Marcu, Shrikanth Narayanan, David Traum, & Robert Belvin: Transonics: a practical speech-to-speech translator for English-Farsi medical dialogues.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 89-92  [PDF, 352KB]

(2005) Alexander Fraser & Daniel Marcu: ISI’s participation in the Romanian-English alignment task. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 91-94. [PDF, 88KB]

(2005) Kevin Knight & Daniel Marcu: Machine translation in the year 2004. Proceedings of 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 18-23, 2005, Philadelphia, PA, USA; vol.5, pp. 965-968 [PDF, 3119KB]

Intel Corporation

(2008) Will Burgett & Julie Chang: The triple advantage factor of machine translation: cost, time-to-market and FAUT. [Keynote presentation at] AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.1-10. [PDF of PPT presentation, 853KB]

Interactive Systems Laboratories see Carnegie Mellon University, Pittsburgh

International Business Machines

(2009) Martin Čmejrek, Bowen Zhou, & Bing Xiang: Enriching SCFG rules directly from efficient bilingual chart parsing. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 136-143. [PDF, 315KB]; presentation [PDF of PPT, 982KB]

(2009) Yonggang Deng & Bowen Zhou: Optimizing word alignment combination for phrase table training. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.229-232. [PDF, 132KB]

(2009) Fei Huang: Confidence measure for word alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.932-940. [PDF, 727KB]

(2009) David Lubensky & Salim Roukos: IBM deployment of real time translation services. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 437-444. [PDF of PPT presentation, 622KB]

(2009) Ruhi Sarikaya, Sameer Maskey, Rong Zhang & Ea-Ee Jan: Iterative sentence-pair extraction from quasi-parallel corpora for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Bing Zhao & Shengyuan Chen: A simplex Armijo downhill algorithm for optimizing statistical machine translation decoding parameters.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.21-24. [PDF, 601KB]

(2008) Niyu Ge, Abe Ittycheriah, & Kishinore Papineni: Multiple reorderings in phrase-based machine translation. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.61-68. [PDF, 1-6KB]

(2008) Fei Huang, Ahmad Emami, & Imed Zitouni: When Harry met Harri, [Arabic name] and [Chinese name]: cross-lingual name spelling normalization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.391-399. [PDF, 165KB]

(2008) John F.Pitrelli, Burn L.Lewis, Edward A.Epstein, Martin Franz, Daniel Kiecza, Jerome L.Quinn, Ganesh Ramaswamy, Amit Srivastava, & Paola Virga: Aggregating distributed STT, MT, and information extraction engines: the GALE interoperability-demo system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2743-2746; abstract [PDF, 63KB]

(2008) John F.Pitrelli, Burn L.Lewis, Edward A.Epstein, Jerome L.Quinn, & Ganesh Ramaswamy: A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.1654-1657; abstract [PDF, 67KB]

(2008) Christoph Tillmann: A rule-driven dynamic programming decoder for statistical MT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.37-45. [PDF, 152KB]

(2008) Bing Zhao & Yaser Al-Onaizan: Generalizing local and non-local word-reordering patterns for syntax-based machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.572-581. [PDF, 538KB]

(2008) Bowen Zhou, Bing Xiang, Xiaodan Zhu, & Yuqing Gao: Prior derivation models for formally syntax-based translation using linguistically syntactic parsing and tree kernels. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.19-27. [PDF, 246KB]

(2008) Imed Zitouni & Radu Florian: Mention detection crossing the language barrier. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.600-609. [PDF, 149KB]

(2007) Yonggang Deng & Yuqing Gao: Guiding statistical word alignment models with prior knowledge. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 1-8 [PDF, 245KB]

(2007) Fei Huang & Kishore Papineni: Hierarchical system combination for machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 277-286. [PDF, 158KB]

(2007) Abraham Ittycheriah & Salim Roukos: Direct translation model 2. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.57-64 [PDF, 238KB]

(2007) Ruhi Sarikaya & Yonggang Deng: Joint morphological-lexical language modeling for machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.145-148 [PDF, 157KB]

(2007) Ruhi Sarikaya, Yonggang Deng, & Yuqing Gao: Context dependent word modeling for statistical machine translation using part-of-speech tags. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2821-2824; abstract [PDF, 23KB]

(2007) Christoph Tillmann & Tong Zhang: A block bigram prediction model for statistical machine translation. ACM Transactions on Speech and Language Processing 4 (3), art.6. [abstract, PDF, 12KB]

(2007) Jia Xu, Yonggang Deng, Yuqing Gao, & Hermann Ney: Domain dependent statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.515-520 [PDF, 289KB]

(2006) Arendse Bernth: EasyEnglish Analyzer: taking controlled language from sentence to discourse level. CLAW 2006: 5th International Workshop on Controlled Language Applications, Cambridge, MA, USA, August 12, 2006; 7pp. [PDF, 116KB]

(2006) Young-Suk Lee: IBM Arabic-to-English translation for IWSLT 2006. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp.45-52  [PDF, 241KB]

(2006) Michael C. McCord: MT for social impact (contribution to panel on “Machine translation for social impact”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 1022KB]

(2006) Sameer Maskey, Bowen Zhou, & Yuqing Gao: A phrase-level machine translation approach for disfluency detection using weighted finite state transducers. Interspeech 2006: ICSLP Ninth International Conference on Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1886; abstract [PDF, 81KB]

(2006) Yaser Al-Onaizan & Kishore Papineni: Distortion models for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.529-536. [PDF, 135KB]

(2006) Salim Roukos: Rosetta: an analyst’s co-pilot. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; 60pp.  [PDF, 2789KB]

(2006) Christoph Tillmann: Efficient dynamic programming search algorithms for phrase-based SMT. HLT-NAACL-2006: Computationally Hard Problems and Joint Inferenec in Speech and Language Processing. Proceedings of the workshop, 9 June 2006, New York City, NY, USA; pp.9-16. [PDF, 183KB]

(2006) Christoph Tillmann & Tong Zhang: A discriminative global training algorithm for statistical MT. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.721-728. [PDF, 179KB]

(2006) Yuqing Gao, Gu Liang, Bowen Zhou, Ruhi Sarikaya, Mohamed Afify, Hong-Kwang Kuo, Wei-zhong Zhu, Yonggang Deng, Charles Prosser, Wei Zhang & Laurent Besacier: IBM MASTOR system: multilingual automatic speech-to-speech translator.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.57-60 [PDF, 199KB]

(2006) Weizhong Zhu, Bowen Zhou, Charles Prosser, Pavel Krbec, & Yuqing Gao: Recent advances of IBM’s handheld speech translation system. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1590; abstract [PDF, 93KB]

(2005) Hany Hassan & Jeffrey Sorensen: An integrated approach for Arabic-English named entity translation. ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 87-93. [PDF, 108KB]

(2005) Abraham Ittycheriah & Salim Roukos: A maximum entropy word aligner for Arabic-English machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 33-40. [PDF, 327KB]

(2005) Young-Suk Lee: IBM statistical machine translation for spoken languages. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 1029KB]

(2005) Kenneth A.McKethan & Graciela White: Demystifying software globalization. Translation Journal 9 (2), April 2005; 7pp. [PDF, 131KB]

(2005) Christoph Tillmann & Tong Zhang: A localized prediction model for statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 557-564. [PDF, 189KB]

(2005) Bing Zhao, Niyu Ge, & Kishore Papineni: Inner-outer bracket models for word alignment using hidden blocks. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 177-184. [PDF, 592KB]

Johns Hopkins University

(2009) Kathy Baker, Steven Bethard, Michael Bloodgood, Ralf Brown, Chris Callison-Burch, Glen Coppersmith, Bonnie Dorr, Wes Filardo, Kendall Giles, Anni Irvine, Mike Kayser, Lori Levin, Justin Martineau, Jim Mayfield, Scott Miller, Aaron Phillips, Andrew Philpot, Christine Piatko, Lane Schwartz, & David Zajic: Semantically informed machine translation (SIMT). Final report of the 2009 Summer Camp for Applied Language Exploration [John Hopkins University], November 2009; 152pp. [PDF, 5705KB]

(2009) Chris Callison-Burch: Fast, cheap, and creative: evaluating translation quality using Amazon’s Mechanical Turk. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.286-295. [PDF, 289KB]

(2009) Chris Callison-Burch, Philipp Koehn, Christof Monz, & Josh Schroeder: Findings of the 2009 Workshop on Statistical Machine Translation. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.1-28. [PDF, 544KB]

(2009) Ken Church: Repetition and language models and comparable corpora [abstract]. ACL-IJCNLP-2009: Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; p.1. [PDF, 61KB]

(2009) Nikesh Garera, Chris Callison-Burch & David Yarowsky: Improving translation lexicon induction from monolingual corpora via dependency contexts and part-of-speech equivalences. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.129-137. [PDF, 450KB]

(2009) Frederick Jelinek: The dawn of statistical ASR and MT. [Paper given on receipt of the ACL’s Life-time Achievement Award in 2009] Computational Linguistics 35 (4), pp. 483-494 [PDF, 952KB]

(2009) Zhifei Li: Training and inference methods over translation forests [abstract]. CWMT 2009: the 5th China Workshop on Machine Translation, Nanjing, China, October 16-17, 2009; 1p. [PDF, 14KB]

(2009) Zhifei Li & Sanjeev Khudanpur: Efficient extraction of oracle-best translations from hypergraphs.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.9-12. [PDF, 112KB]

(2009) Zhifei Li, Jason Eisner, & Sanjeev Khudanpur: Variational decoding for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.593-601. [PDF, 217KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Jonathan Weese, & Omar F.Zaidan: Demonstration of Joshua: an open source toolkit for parsing-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Software Demonstrations, Suntec, Singapore, 3 August 2009; pp.25-28. [PDF, 429KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Jonathan Weese, & Omar F.Zaidan: Joshua: an open source toolkit for parsing-based machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.135-139. [PDF, 109KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren Thornton, Jonathan Weese, & Omar Zaidan: Joshua: open source toolkit for parsing-based machine translation. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 23pp.  [PDF, 933KB]

(2009) Zhifei Li & Jason Eisner: First- and second-order expectation semirings with applications to minimum-risk training on translation forests. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.40-51. [PDF, 406KB]

(2009) Paul McNamee, James Mayfield, & Charles Nicholas: Translation corpus source and size in bilingual retrieval. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.25-28. [PDF, 177KB]

(2009) Yuval Marton, Chris Callison-Burch, & Philip Resnik: Improved statistical machine translation using monolingually-derived paraphrases. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.381-390. [PDF, 160KB]

(2009) Kristen Parton, Kathleen R.McKeown, Bob Coyne, Mona T.Diab, Ralph Grishman, Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei Yun Ma, Adam Meyers, Sara Stolbach, Ang Sun, Gokhan Tur, Wei Xu, & Sibel Yaman: Who, what, when, where, why? Comparing multiple approaches to the cross-lingual 5W task. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.423-431. [PDF, 176KB]

(2009) Roy Tromble & Jason Eisner: Learning linear ordering problems for better translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1007-1016. [PDF, 178KB]

(2009) Balakrishnan Vardarajan & Delip Rao: Î-extension Hidden Markov Models and weighted transducers for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.120-123. [PDF, 129KB]

(2009) Omar F. Zaidan: Z-MERT: a fully configurable open source tool for minimum error rate training of machine translation systems. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.79-88. [PDF, 263KB]

(2009) Omar F.Zaidan & Chris Callison-Burch: Feasibility of human-in-the-loop minimum error rate training. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.52-61. [PDF, 321KB]

(2008) Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz, & Josh Schroeder: Further meta-evaluation of machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.70-106. [PDF, 320KB]

(2008) Chris Callison-Burch: Syntactic constraints on paraphrases extracted from parallel corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.196-205. [PDF, 187KB]

(2008) Chris Callison-Burch, Trevor Cohn, & Mirella Lapata: ParaMetric: an automatic evaluation metric for paraphrasing. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.97-104. [PDF, 232KB]

(2008) Markus Dreyer, Jason R.Smith, & Jason Eisner: Latent-variable modeling of string transductions with finite-state methods.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1080-1089. [PDF, 513KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2008) Nikesh Garera & David Yarowsky: Minimally supervised multilingual taxonomy and translation lexicon induction. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.465-472. [PDF, 356KB]

(2008) Damianos Karakos, Jason Eisner, Sanjeev Khudanpur, & Markus Dreyer: Machine translation system combination using ITG-based alignments. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 81-84. [PDF, 65KB]

(2008) Philipp Koehn & Chris Callison-Burch: Statistical machine translation. Course material, 20th European Summer School in Logic, Language and Information (ESSLLI 2008), Freie und Hansestadt Hamburg, Germany, 4-15 August 2008; 144 slides [PDF of PPT presentation, 4803KB]

(2008) Zhifei Li & Sanjeev Khudanpur: Large-scale discriminative n-gram language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.133-142. [PDF, 643KB]

(2008) Zhifei Li & Sanjeev Khudanpur: A scalable decoder for parsing-based machine translation with equivalent language model state maintenance. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.10-18. [PDF, 569KB]

(2008) Zhifei Li & David Yarowsky: Unsupervised translation induction for Chinese abbreviations using monolingual corpora. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 425-433. [PDF, 181KB]

(2008) Roy W.Tromble, Shankar Kumar, Franz Och, & Wolfgang Macherey: Lattice minimum Bayes-risk decoding for statistical machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.620-629. [PDF, 285KB]

(2007) Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz, & Josh Schroeder: (Meta-) evaluation of machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 136-158 [PDF, 373KB]

(2007) Markus Dreyer, Keith Hall, & Sanjeev Khudanpur: Comparing reordering constraints for SMT using efficient BLEU oracle computation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.103-110 [PDF, 206KB]

(2007) Keith Hall & Petr Němec: Generation in machine translation from deep syntactic trees. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.57-64 [PDF, 237KB]

(2006)Yonggang Deng & William Byrne: MTTK: an alignment toolkit for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 265-268 [PDF, 135KB]

(2006) Paul McNamee & James Mayfield: Translation of multiword expressions using parallel suffix arrays.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.100-109 [PDF, 733KB]

(2006) Brock Pytlik & David Yarowsky: Machine translation for languages lacking bitext via multilingual gloss transduction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.156-165 [PDF, 267KB]

(2006) Jason Riesa & David Yarowsky: Minimally supervised morphological segmentation with applications to machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.185-192 [PDF, 613KB]

(2006) Jason Riesa, Behrang Mohit, Kevin Knight, & Daniel Marcu: Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 2012; abstract [PDF, 80KB]

(2006) Charles Schafer: Novel probabilistic finite-state transducers for cognate and transliteration modeling. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.203-212 [PDF, 250KB]

(2006) Wade Shen, Richard Zens, Nicola Bertoldi, & Marcello Federico: The JHU workshop 2006 IWSLT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 59-63  [PDF, 420KB]

(2006) David Smith & Charles Schafer: An overview of statistical machine translation.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 44pp. [PDF of PPT presentation, 1081KB]

(2006) David A. Smith & Jason Eisner: Quasi-synchronous grammars: alignment by soft projection of syntactic dependencies.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 23-30 [PDF, 207KB]

(2005) Andrea Burbank, Marine Carpuat, Stephen Clark, Markus Dreyer, Pamela Fox, Declan Groves, Keith Hall, Mary Hearne, I. Dan Melamed, Yihai Shen, Andy Way, Ben Wellington, & Dekai Wu: Final report of the 2005 Language Enginering Workshop on statistical machine translation by parsing, Johns Hopkins University, Center for Speech and Language Processing, November 12, 2005. 67pp. [PDF, 645KB]

(2005) Yonggang Deng & William Byrne: HMM word and phrase alignment for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 169-176. [PDF, 189KB]

(2005) Elliott Franco Drábek & David Yarowsky: Induction of fine-grained part-of-speech taggers via classifier combination and crosslingual projection.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 49-56. [PDF, 109KB]

(2005) Shankar Kumar & William Byrne: Local phrase reordering models for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 161-168. [PDF, 193KB]

(2005) Charles Schafer & Elliott Franco Drábek: Models for Inuktitut-English word alignment. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 79-82. [PDF, 183KB]

Language Computer Corporation

(2006) Marian Olteanu, Pasin Suriyentrakorn, & Dan Moldovan: Language models and reranking for machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 150-153 [PDF, 115KB]

Language Technologies Institute, Pittsburgh see Carnegie Mellon University

Language Weaver Inc.  [see also SDL International]

(2009) David Chiang, Kevin Knight, & Wei Wang: 11,001 new features for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.218-226. [PDF, 114KB]

(2009) Kenji Yamada & Ion Muslea: Reranking for large-scale statistical machine translation.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.151-168.

(2008) Daniel Marcu: Language translation solutions for community content. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.420-426. [PDF of PPT presentation, 861KB]

(2008) Mark Tapling: Automated translation 2008: science meets solution. [Keynote presentation at] AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.23-34. [PDF of PPT presentation, 887KB]

(2007) Steve DeNeefe, Kevin Knight, Wei Wang & Daniel Marcu: What can syntax-based MT learn from phrase-based MT? EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 755-763. [PDF, 240KB]

(2006) Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, & Ignacio Thayer: Scalable inference and training of context-rich syntatic translation models.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.961-968. [PDF, 438KB]

(2006) Bryant Huang & Kevin Knight: Relabeling syntax trees to improve syntax-based machine translation quality.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 240-247 [PDF, 747KB]

(2006) Kevin Knight, Anish Nair, Nishit Rathod, & Kenji Yamada: Unsupervised analysis for decipherment problems. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.499-506. [PDF, 200KB]

(2006) Daniel Marcu, Wei Wang, Abdessamad Echihabi, & Kevin Knight: SPMT: statistical machine translation with syntactified target language phrases. EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 44-52. [PDF, 164KB]

(2006) Wei Wang, Kevin Knight & David Marcu: Capitalizing machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 1-8 [PDF, 171KB]

(2006) Statistical machine translation gets real: the LanguageWeaver profile. Global by Design, December 2006; pp. 8-19 [PDF, 2115KB]

(2005) Jill Burstein & Daniel Marcu: Translation exercise assistant: automated generation of translation exercises for native-Arabic speakers learning English. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 16-17. [PDF, 53KB]

(2005) Will Knight: Software learns to translate by reading up. New Scientist, 22 February 2005. [PDF, 70KB]

Lincoln Laboratory, MIT see Massachusetts Institute of Technology

Linguistic Data Consortium

(2008) Lauren Friedman & Stephanie Strassel: Identifying common challenges for human and machine translation: a case study from the GALE program. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.364-369. [PDF, 548KB]

(2008) Lauren Friedman, Haejoong Lee, & Stephanie Strassel: A quality control framework for gold standard reference translations: the process and toolkit developed for GALE. Translating and the Computer 30, 27-28 November 2008, London; 6pp. [PDF, 77KB]

(2008) Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, & Haejoong Lee: Management of large annotation projects involving multiple human judges: a case study of GALE machine translation post-editing. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 144KB]

(2008) Kazuaki Maeda, Xiaoyi Ma, & Stephanie Strassel: Creating sentence-aligned parallel text corpora from a large archive of potential parallel text using BITS and Champollion.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 68KB]

(2008) Zhiyi Song & Stephanie Strassel: Entity translation and alignment in the ACE-07 ET task. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 135KB]

(2007) Christopher Cieri, Stephanie Strassel, Meghan Lammie Glenn, & Lauren Friedman: Linguistic resources in support of various evaluation metrics. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 34pp. [PDF of PPT presentation, 1007KB]

(2006) Xiaoyi Ma: Champollion: a robust parallel text sentence aligner.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.489-492 [PDF, 313KB]

(2006) Xiaoyi Ma & Christopher Cieri: Corpus support for machine translation at LDC.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.859-864 [PDF, 507KB]

(2006) Mohamed Maamouri, Ann Bies, & Seth Kulick: Diacritization: a challenge to Arabic treebank annotation and parsing.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.35-47 [PDF, 223KB]

(2006) Stephanie Strassel, Christopher Cieri, Andrew Cole, Denise DiPersio, Mark Lieberman, Xiaoyi Ma, Mohamed Maamouri, & Kazuaki Maeda: Integrated linguistic resources for language exploitation technologies.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.185-190 [PDF, 465KB]

(2005) Christopher Cieri: Evaluation initiatives: US activities in HLT evaluation. ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 7pp. [PDF from PPT, 237KB]

(2005) Christopher Cieri: HLT evaluation: role of data centers. ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 6pp. [PDF from PPT, 195KB]

Logos Institute

(2009) Bernard Scott & Anabela Barreiro: OpenLogos MT and the SAL representation language. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 19-26. [PDF, 253KB]

Louisiana State University

(2008) Udhyakumar Nallasamy, Alan W.Black, Tanja Schultz, Robert Frederking, & Jerry Weltman: Speech translation for triage of emergency phonecalls in minority languages. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.48-53. [PDF, 229KB]

Marine Corps Forces

(2008) Scott Hourin: Language technology in humanitarian aid and disaster response. [Abstract of keynote speech at] IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, Hawaii, USA.

Maryland, University of

(2009) Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez, & Philipp Koehn: Monte Carlo inference and maximization for phrase-based translation. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.102-110. [PDF, 167KB]

(2009) Chris Dyer: Decoding: phrase-based models. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.42-50 [PDF, 281KB]

(2009) Chris Dyer: Using maximum entropy model to build segmentation lattices for MT.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Chris Dyer, Hendra Setiawan, Yuval Marton, & Philip Resnik: The University of Maryland statistical machine translation system for the Fourth Workshop on Machine Translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.145-149. [PDF, 120KB]

(2009) Nizar Habash, Bonnie Dorr, & Christof Monz: Symbolic-to-statistical hybridization: extending generation-heavy machine translation [abstract]. Machine Translation 23 (1), February 2009; pp.23-63.

(2009) Michal Hrusecky, Tomas Caithaml, & Chris Dyer: Feature function overhaul. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 13pp. [PDF, 88KB]

(2009) Jeremy G.Kahn, Matthew Snover, & Mari Ostendorf: Expected dependency pair match: predicting translation quality with expected syntactic structure [abstract]. Machine Translation 23 (2/3), September 2009; pp.169-179.

(2009) Shankar Kumar, Wolfgang Macherey, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Jonathan Weese, & Omar F.Zaidan: Demonstration of Joshua: an open source toolkit for parsing-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Software Demonstrations, Suntec, Singapore, 3 August 2009; pp.25-28. [PDF, 429KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Jonathan Weese, & Omar F.Zaidan: Joshua: an open source toolkit for parsing-based machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.135-139. [PDF, 109KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren Thornton, Jonathan Weese, & Omar Zaidan: Joshua: open source toolkit for parsing-based machine translation. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 23pp.  [PDF, 933KB]

(2009) Paul McNamee, James Mayfield, & Charles Nicholas: Translation corpus source and size in bilingual retrieval. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.25-28. [PDF, 177KB]

(2009) Yuval Marton, Chris Callison-Burch, & Philip Resnik: Improved statistical machine translation using monolingually-derived paraphrases. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.381-390. [PDF, 160KB]

(2009) Douglas W.Oard: Cross-language information access: looking backward, looking forward [abstract]. NAACL-HLT-2009: The Third International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3), Proceedings of the Workshop, June 4, 2009, Boulder, Colorado; p.1. [PDF, 87KB]

(2009) Kristen Parton, Kathleen R.McKeown, Bob Coyne, Mona T.Diab, Ralph Grishman, Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei Yun Ma, Adam Meyers, Sara Stolbach, Ang Sun, Gokhan Tur, Wei Xu, & Sibel Yaman: Who, what, when, where, why? Comparing multiple approaches to the cross-lingual 5W task. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.423-431. [PDF, 176KB]

(2009) Hendra Setiawan, Min-Yen Kan, Haizhou Li, & Philip Resnik: Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332. [PDF, 504KB]

(2009) Matthew G.Snover, Nitin Madnani, Bonnie Dorr, & Richard Schwartz: TER-Plus: paraphrase, semantic, and alignment enhancements to Translation Error Rate [abstract]. Machine Translation 23 (2/3), September 2009; pp.117-127.

(2009) Matthew Snover, Nitin Madnani, Bonnie J.Dorr, & Richard Schwartz: Fluency, adequacy, or HTER? Exploring different human judgments with a tunable MT metric.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.259-268. [PDF, 147KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) Christopher Dyer, Aaron Cordova, Alex Mont, & Jimmy Lin: Fast, easy, and cheap: construction of statistical machine translation models with MapReduce. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.199-207. [PDF, 228KB]

(2008) Chris Dyer, Alex Mont, Aaron Cordova, & Jimmy Lin: SMT model building with MapReduce. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 91 slides. [PDF, 1850KB]

(2008) Christopher Dyer, Smaranda Muresan, & Philip Resnik: Generalizing word lattice translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1012-1020 . [PDF, 201KB]

(2008) Nitin Madnani, Philip Resnik, Bonnie J.Dorr, & Richard Schwartz: Are multiple reference translations necessary? Investigating the value of paraphrased reference translations in parameter optimization. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.143-152. [PDF, 655KB]

(2008) Yuval Marton & Philip Resnik: Soft syntactic constraints for hierarchical phrased-based translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1003-1011. [PDF, 153KB]

(2008) Allison L.Powell & Allison Blodgett: The use of machine-generated transcripts during human translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.427-434. [PDF, 649KB]

(2008) Matthew Snover, Bonnie Dorr, & Richard Schwartz: Language and translation model adaptation using comparable corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.857-866. [PDF, 132KB]

(2008) Michael Subotin: Generalizing local translation models. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.28-36. [PDF, 148KB]

(2008) Daniel Zeman & Philip Resnik: Cross-language parser adaptation between related languages. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.35-42. [PDF, 150KB]

(2007) Chris Dyer: Decoder-guided backoff: using word lattices to improve translation from morphologically complex languages. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 54pp. [PDF of PPT presentation, 1158KB]

(2007) Christopher J. Dyer: The ‘noisier channel’: translation from morphologically complex languages. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 207-211 [PDF, 127KB]

(2007) Christopher J.Dyer: The University of Maryland translation system for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 174KB]

(2007) Mary Harper, Alex Acero, Srinivas Bangalore, Jaime Carbonell, Jordan Cohen, Barbara Cuthill, Carol Espy-Wilson, Christiane Fellbaum, John Garofolo, Chin-Hui Lee, Jim Lester, Andrew McCallum, Nelson Morgan, Michael Picheney, Joe Picone, Lance Ramshaw, Jeff Reynar, Hadar Shemtov, & Clare Voss: Report on the NSF-sponsored Human Language Technology Workshop on Industrial Centers. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.239-246 [PDF, 94KB]

(2007) Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondřej Bojar, Alexandra Constantin, Evan Herbst: Moses: open source toolkit for statistical machine translation. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 177-180 [PDF, 119KB]

(2007) Adam Lopez: Hierarchical phrase-based translation with suffix arrays. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 976-985. [PDF, 313KB]

(2007) Nitin Madnani, Necip Fazil Ayan, Philip Resnick, & Bonnie J.Dorr: Using paraphrases for parameter tuning in statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 120-127 [PDF, 148KB]

(2007) Antti-Veikko I.Rosti, Necip Fazil Ayan, Bing Xiang, Spyros Matsoukas, Richard Schwartz, & Bonnie J.Dorr: Combining outputs from multiple machine translation systems. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.228-235 [PDF, 129KB]

(2007) Calandra Rilette Tate: An investigation of the relationship between automated machine translation evaluation metrics and user performance on an information extraction task. PhD dissertation, University of Maryland; xv,145pp. [PDF, 1170KB]

(2006) Tod Allman & Stephen Beale: A natural language generator for minority languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.39-46. [PDF, 301KB]

(2006) Necip Fazil Ayan & Bonnie J. Dorr: Going beyond AER: an extensive analysis of word alignments and their impact on MT. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.9-16. [PDF, 274KB]

(2006) Necip Fazil Ayan & Bonnie J. Dorr: A maximum entropy approach to combining word alignments.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 96-103 [PDF, 325KB]

(2006) Nizar Habash, Bonnie Dorr & Christof Monz: Challenges in building an Arabic-English GHMT system with SMT components. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.56-65 [PDF, 116KB]

(2006) Diana Inkpen, Muath Alzghool, Gareth J.F.Jones & Douglas W.Oard: Investigating cross-language speech retrieval for a spontaneous conversational speech collection.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 61-64 [PDF, 43KB]

(2006) Jamal Laoudi, Calandra R.Tate, & Clare R.Voss: Task-based MT evaluation: from who/when/where extraction to event understanding.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2048-2053 [PDF, 381KB]

(2006) Adam Lopez & Philip Resnik: Word-based alignment, phrase-based translation: what’s the link?  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.90-99 [PDF, 183KB]

(2006) G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, & Pavel Pecina: Leveraging reusability: cost-effective lexical acquisition for large-scale ontology translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.945-952. [PDF, 159KB]

(2006) G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, & Pavel Pecina: Leveraging recurrent phrase structure in large-scale ontology translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.141-150 [PDF, 686KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla, & John Makhoul: A study of translation edit rate with targeted human annotation.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.223-231 [PDF, 185KB]

(2006) Calandra R. Tate & Clare R. Voss: Combining evaluation metrics via loss functions. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.242-250 [PDF, 106KB]

(2006) Clare R. Voss & Calandra R. Tate: Task-based evaluation of machine translation (MT) engines. Measuring how well people extract who, when, where-type elements in MT output. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.203-212 [PDF, 235KB]

(2005) Necip Fazil Ayan, Bonnie J. Dorr, & Christof Monz: NeurAlign: combining word alignments using neural networks.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 65-72. [PDF, 139KB]

(2005) Necip Fazil Ayan, Bonnie J.Dorr, & Christof Monz: Alignment link projection using transformation-based learning.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 185-192. [PDF, 132KB]

(2005) Stephen Beale, Sergei Nirenburg, Marjorie McShane, & Tod Allman: Document authoring the Bible for minority language translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.63-70. [PDF, 415KB]

(2005) Clara Cabezas & Philip Resnick: Using WSD techniques for lexical selection in statistical machine translation. College Park, ND: Institute for Advanced Computer Studies, University of Maryland, July 2005 (CS-TR-4736, LAMP-TR-124, UMIACS-TR-2005-42) 12pp. [PDF, 240KB]

(2005) David Chiang: A hierarchical phrase-based model for statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 263-270. [PDF, 203KB]

(2005) David Chiang, Adam Lopez, Nitin Madnani, Christof Monz, Philip Resnik, & Michael Subotin: The Hiero machine translation system: extensions, evaluation and analysis. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 779-786. [PDF, 227KB]

(2005) Philipp Koehn & Christof Monz: Shared task: statistical machine translation between European languages. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 119-124. [PDF, 41KB]

(2005) Adam Lopez & Philip Resnik: Pattern visualization for machine translation output.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 12-13. [PDF, 160KB]

(2005) Adam Lopez & Philip Resnik: Improved HMM alignment models for languages with scarce resources.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 83-86. [PDF, 103KB]

(2005) Marjorie McShane, Sergei Nirenburg, & Stephen Beale: An NLP lexicon as a largely language-independent resource [abstract].  Machine Translation 19 (2), 2005; pp.139-173.

Mary Washington University

(2009) Ahmed Abdelali, Steve Helmreich, & Ron Zacharski: Investigations on standard Arabic geographical classification.  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 193KB]

(2009) William Ogden, Ron Zacharski, Sieun An, & Yuki Ishikawa: User choice as an evaluation metric for web translation services in cross language instant messaging applications. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 97-103. [PDF, 376KB]

Massachusetts Institute of Technology

(2009) Ibrahim Badr, Rabih Zbib, & James Glass: Syntactic phrase reordering for English-to-Arabic statistical machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.86-93. [PDF, 115KB]

(2009) Xavier Carreras & Michael Collins: Non-projective parsing for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.200-209. [PDF, 174KB]

(2009) Wade Shen, Brian Delaney, A.Ryan Aminzadeh, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2009 MT system. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 71-78. [PDF, 398KB]; presentation [PDF of PPT, 1730KB]

(2009) Benjamin Snyder, Tahira Naseem, Jacob Eisenstein, & Regina Barzilay: Adding more languages improves unsupervised multilingual part-of-speech tagging: a Bayesian non-parametric approach. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.83-91. [PDF, 599KB]

(2009) Benjamin Snyder, Tahira Naseem, & Regina Barzilay: Unsupervised multilingual grammar induction. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.73-81. [PDF, 273KB]

(2008) Ibrahim Badr, Rabih Zbib, & James Glass: Segmentation for English-to-Arabic statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers., June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 153-156. [PDF, 89KB]

(2008) Jason Katz-Brown & Michael Collins: Syntactic reordering in preprocessing for Japanese®English translation: MIT system description for NTCIR-7 patent translation task. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 409-414. [PDF, 424KB]

(2008) Wade Shen, Brian Delaney, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2008 MT system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 69-76. [PDF, 208KB]; presentation [PDF, 607KB]

(2008) Benjamin Snyder & Regina Barzilay: Unsupervised multilingual learning for morphological segmentation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 737-745. [PDF, 254KB]

(2008) Yushi Xu & Stephanie Seneff: Two-stage translation: a combined linguistic and statistical machine translation framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.222-231 [PDF, 671B]

(2007) Douglas Jones, Martha Herzog, Hussny Ibrahim, Arvind Jairam, Wade Shen, Edward Gibson, & Michael Emonts: ILR-based MT comprehension test with multi-level questions. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.77-80 [PDF, 67KB]

(2007) Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondřej Bojar, Alexandra Constantin, Evan Herbst: Moses: open source toolkit for statistical machine translation. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 177-180 [PDF, 119KB]

(2007) John Lee, Ming Zhou, & Xiaohua Liu: Detection of non-native sentences using machine-translated training data. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.93-96 [PDF, 188KB]

(2007) Wade Shen, Brian Delaney, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2007 MT system.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 174KB]; presentation [PDF, 623KB]

(2007) Chao Wang, Michael Collins, & Philipp Koehn: Chinese syntactic reordering for statistical machine translation.  EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 737-745. [PDF, 161KB]

(2007) Chao Wang & Stephanie Seneff: Automatic assessment of student translations for foreign language tutoring. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.468-475 [PDF, 124KB]

(2007) Luke S. Zettlemoyer & Robert C. Moore: Selective phrase pair extraction for improved statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.209-212 [PDF, 121KB]

(2006) Brooke Cowan, Ivona Kučerová, & Michael Collins: A discriminative model for tree-to-tree translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 232-241. [PDF, 338KB]

(2006) Brian Delaney, Wade Shen, & Timothy Anderson: An efficient graph search decoder for phrase-based statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 197-204 [PDF, 250KB]

(2006) Douglas Jones, Wade Shen, Brian Delaney, Martha Herzog, Michael Emonts, Sabine Atwell, James Dirgin, Neil Granoien, Sargon Jabri, Jurgen Sottung, Timothy Anderson, & Timothy Hunter: Toward an Interagency Language Roundtable based assessment of speech-to-speech translation capabilities. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.82-89 [PDF, 150KB]

(2006) David Kauchak & Regina Barzilay: Paraphrasing for automatic evaluation. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 455-462 [PDF, 196KB]

(2006) Stephanie Seneff, Chao Wang & John Lee: Combining linguistic and statistical methods for bi-directional English Chinese translation in the flight domain. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.213-222 [PDF, 152KB]

(2006) Stephanie Seneff: Combining interlingua with SMT (contribution to panel on “Hybrid machine translation”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 109KB]

(2006) Wade Shen, Brian Delaney, & Tim Anderson: The MIT-LL/AFRL IWSLT-2006 MT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 71-76  [PDF, 804KB]

(2006) Wade Shen, Richard Zens, Nicola Bertoldi, & Marcello Federico: The JHU workshop 2006 IWSLT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 59-63  [PDF, 420KB]

(2006) Chao Wang & Stephanie Seneff: High-quality speech translation in the flight domain. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1135; abstract [PDF, 72KB]

(2005) Michael Collins, Philipp Koehn, & Ivona Kučerová: Clause restructuring for statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 531-540. [PDF, 129KB]

(2005) Douglas Jones, Edward Gibson, Wade Shen, Neil Granoien, Martha Herzog, Douglas Reynolds, & Clifford Weinstein: Measuring human readability of machine generated text: three case studies in speech recognition and machine translation. Proceedings of 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 18-23, 2005, Philadelphia, PA, USA; vol.5, pp. 1009-1012 [PDF, 233KB]

(2005) Wade Shen, Brian Delaney, & Tim Anderson: The MIT-LL/AFRL MT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 5pp. [PDF, 70KB]

Massachusetts, University of

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2007) Fernando Diaz & Donald Metzler: Pseudo-aligned multilingual corpora. IJCAI-07: Twentieth International Joint conference on Artificial Intelligence, Hyderabad, India, 6-12 January 2007; pp.2727-2732. [PDF, 126KB]

(2005) Vanessa Murdock & W.Bruce Croft: A translation model for sentence retrieval.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 684-691. [PDF, 205KB]

Meadan

(2006) Ed Bice: Meadan global dialogue (contribution to panel on “Machine translation for social impact”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 1472KB]

Meaningful Machines LLC

(2006) Jaime Carbonell, Steve Klein, David Miller, Michael Steinbaum, Tomer Grassiany, & Jochen Frei: Context-based machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.19-28 [PDF, 338KB]

(2006) Evan Ratliff: Me translate pretty one day. Wired 14 (12), 2006; pp.210-213. [PDF, 32KB]

MedQuist Inc.

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.24-31 [PDF, 251KB]

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Accultran: automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.50-52 [PDF, 181KB]

Medtronic Inc.

(2006) Uwe Muegge: Fully automatic high quality machine translation of restricted text: a case study. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 6pp. [PDF, 43KB]

Michigan University

(2008) Victoria Fossum & Kevin Knight: Using bilingual Chinese-English word alignments to resolve PP-attachment ambiguity in English. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.245-253 [PDF, 595KB]

(2008) Victoria Fossum, Kevin Knight, & Steven Abney: Using syntax to improve word alignment precision for syntax-based machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.44-52. [PDF, 150KB]

(2007) Yang Ye, Ming Zhou, & Chin-Yew Lin: Sentence level machine translation evaluation as a ranking problem: one step aside from BLEU.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 240-247 [PDF, 311KB]

(2007) Yang Ye, Karl-Michael Schneider, & Steven Abney: Aspect marker generation in English-to-Chinese machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.521-527 [PDF, 129KB]

(2006) Agam Patel & Dragomir R.Radev: Lexical similarity can distinguish between automatic and manual translations. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1230-1235 [PDF, 326KB]

(2006) Yang Ye, Victoria Li Fossum, & Steven Abney: Latent featrures in automatic tense translation between Chinese and English. Coling-ACL 2006: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, Sydney, 22-23 July 2006; pp.48-55. [PDF, 79KB]

Microsoft Corporation  [see also China: Microsoft Research Asia; India: Microsoft Research India]

(2009) Takako Aikawa & Achim Ruopp: Chained system: a linear combination of different types of statistical machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.184-189. [PDF, 229KB]

(2009) Colin Cherry & Hisami Suzuki: Discriminative substring decoding for transliteration. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1066-1075. [PDF, 583KB]

(2009) Colin Cherry & Hisami Suzuki: NEWS 2009 machine transliteration shared task system description: transliteration with letter-to-phoneme technology. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.69-71. [PDF, 289KB]

(2009) Xiaodong He & Kristina Toutanova: Joint optimization for machine translation system combination. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1202-1211. [PDF, 337KB]

(2009) Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, & Qingsheng Zhu: Mining bilingual data from the web with adaptively learnt patterns. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.870-878. [PDF, 306KB]

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Robert C.Moore & Chris Quirk: Less is more: significance-based n-gram selection for smaller, better language models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.746-755. [PDF, 564KB]

(2009) Chris Wendt & Will Lewis: Pushing the quality of a customized SMT system using shared training data. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 450-456. [PDF of PPT presentation, 250KB]

(2009) Yong Zhao & Xiaodong He: Using n-gram based features for machine translation system combination.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.205-208. [PDF, 357KB]

(2008) Colin Cherry: Cohesive phrase-based decoding for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 72-80. [PDF, 179KB]

(2008) Colin Cherry & Chris Quirk: Discriminative, syntactic language modeling through latent SVMs. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.65-74. [PDF, 609KB]

(2008) Michael Gamon, Jianfeng Gao, Chris Brockett, Alexandre Klementiev, William B.Dolan, Dmitriy Belenko, & Lucy Vanderwende: Using contextual speller techniques and language modelling for ESL error correction. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.449-456. [PDF, 475KB]

(2008) Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, & Robert Moore: Indirect HMM-based hypothesis alignment for combining outputs from machine translation systems.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.98-107. [PDF, 280KB]

(2008) Masaki Itagaki & Takako Aikawa: Post-MT term swapper: supplementing a statistical machine translation system with a user dictionary.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 352KB]

(2008) Arul Menezes & Chris Quirk: Syntactic models for structural word insertion and deletion.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.735-744. [PDF, 178KB]

(2008) Robert C.Moore & Chris Quirk: Random restarts in minimum error rate training for statistical machine translation.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.585-592. [PDF, 552KB]

(2008) Kristina Toutanova, Hisami Suzuki, & Achim Ruopp: Applying morphology generation models to machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 514-522. [PDF, 586KB]

(2008) Jia Xu, Jianfeng Gao, Kristina Toutanova, & Hermann Ney: Bayesian semi-supervised Chinese word segmentation for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1017-1024. [PDF, 137KB

(2007) Takako Aikawa, Lee Schwartz, Ronit King, Mo Corston-Oliver, & Carmen Lozano: Impact of controlled language on translation quality and post-editing in a statistical machine translation environment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.1-7 [PDF, 314KB]

(2007) Guihong Cao, Jianfeng Gao, & Jian-Yun Nie: A system to mine large-scale bilingual dictionaries from monolingual web pages. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.57-64 [PDF, 539KB]

(2007) Pi-Chuan Chang & Kristina Toutanova: A discriminative syntactic word order model for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 9-16 [PDF, 235KB]

(2007) Xiaodong He: Using word dependent transition models in HMM based word alignment for statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 80-87 [PDF, 139KB]

(2007) Masaki Itagaki, Takako Aikawa, & Xiaodong He: Automatic validation of terminology translation consistency with statistical method.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.269-274 [PDF, 416KB]

(2007) Arul Menezes & Chris Quirk: Using dependency order templates to improve generality in translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 1-8 [PDF, 153KB]

(2007) Einat Minkov, Kristina Toutanova, & Hisami Suzuki: Generating complex morphology for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 128-135 [PDF, 219KB]

(2007) Robert C. Moore & Chris Quirk: Faster beam-search decoding for phrasal statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.321-327 [PDF, 241KB]

(2007) Robert C.Moore & Chris Quirk: An iteratively-trained segmentation-free phrase translation model for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 112-119 [PDF, 137KB]

(2007) Patrick Nguyen, Milind Mahajan, & Xiaodong He: Training non-parametric features for statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 72-79 [PDF, 169KB]

(2007) Chris Quirk, Raghavendra Udupa U., & Arul Menezes: Generative models of noisy translations with applications to parallel fragment extraction. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.377-384 [PDF, 249KB]

(2007) Stephen Richardson: Microsoft machine translation: from research to real user. Invited talk at MT Summit XI, 10-14 September 2007, Copenhagen, Denmark; 35pp. [PDF of PPT presentation, 3535KB]

(2007) Kristina Toutanova & Hisami Suzuki: Generating case markers in machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.49-56 [PDF, 168KB]

(2007) Luke S. Zettlemoyer & Robert C. Moore: Selective phrase pair extraction for improved statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.209-212 [PDF, 121KB]

(2006) Chris Brockett, William B.Dolan, & Michael Gamon: Correcting ESL errors using phrasal SMT techniques.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.249-256. [PDF, 109KB]

(2006) Simon Corston-Oliver & Anthony Aue: Dependency parsing with reference to Slovene, Spanish and Swedish. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.196-200. [PDF, 286KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Arul Menezes, Kristina Toutanova, & Chris Quirk: Microsoft Research treelet translation system: NAACL 2006 Europarl evaluation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 158-161 [PDF, 676KB]

(2006) Robert C.Moore, Wen-tau Yih, & Andreas Bode: Improved discriminative bilingual word alignment. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.513-520. [PDF, 121KB]

(2006) Chris Quirk & Simon Corston-Oliver: The impact of parse quality on syntactically-informed statistical machine translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 62-69. [PDF, 203KB]

(2006) Christopher Quirk & Arul Menezes: Dependency treelet translation: the convergence of statistical and example-based machine translation? [abstract] Machine Translation 20 (1),2006; pp.43-65.

(2006) Chris Quirk & Arul Menezes: Do we need phrases? Challenging the conventional wisdom in statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 9-16 [PDF, 97KB]

(2006) David M.Rojas & Takako Aikawa: Predicting MT quality as a function of the source language.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2534-2537 [PDF, 613KB]

(2005) Chris Brockett & William B.Dolan: Support vector machines for paraphrase identification and corpus construction. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 1-8. [PDF, 115KB]

(2005) William B.Dolan & Chris Brockett: Automatically constructing a corpus of sentential paraphrases. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 9-16. [PDF, 115KB]

(2005) Michael Gamon, Anthony Aue, & Martine Smets: Sentence-level MT evaluation without reference translations: beyond language modeling. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 103-111. [PDF, 76KB]

(2005) Arul Menezes & Chris Quirk: Dependency treelet translation: the convergence of statistical and example-based machine-translation? MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.99-108. [PDF, 314KB]

(2005) Arul Menezes & Chris Quirk: Microsoft Research treelet translation system: IWSLT evaluation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 4pp. [PDF, 240KB]

(2005) Robert C. Moore: Association-based bilingual word alignment. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 1-8. [PDF, 102KB]

(2005) Robert C. Moore: A discriminative framework for bilingual word alignment.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 81-88. [PDF, 118KB]

(2005) Chris Quirk, Arul Menezes, & Colin Cherry: Dependency treelet translation: syntactically informed phrasal SMT.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 271-279. [PDF, 118KB]

Millburn High School

(2008)  Nizar Habash & Hayden Metsky: Automatic learning of morphological variations for handling out-of-vocabulary terms in Urdu-English machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.107-116. [PDF, 551KB]

Minnesota, University of

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Jonathan Weese, & Omar F.Zaidan: Demonstration of Joshua: an open source toolkit for parsing-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Software Demonstrations, Suntec, Singapore, 3 August 2009; pp.25-28. [PDF, 429KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Jonathan Weese, & Omar F.Zaidan: Joshua: an open source toolkit for parsing-based machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.135-139. [PDF, 109KB]

(2008) Lane Schwartz: Multi-source translation methods. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.279-288 [PDF, 650KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

(2005) Joel Martin, Rada Mihalcea, & Ted Pedersen: Word alignment for languages with scarce resources. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 65-74. [PDF, 170KB]

(2005) Siddharth Patwardhan, Satanjeev Banerjee & Ted Pedersen: SenseRelate::TargetWord – a generalized framework for word sense disambiguation. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 73-76. [PDF, 54KB]

Mississippi University

(2009) Milam Aiken, Mina Park, Lakisha Simmons, & Tobin Lindblom: Automatic translation in multilingual electronic meetings. Translation Journal 13 (3), July 2009; 9pp. [PDF, 285KB]

MIT see Massachusetts Institute of Technology

MITRE Corporation

(2009) Sherri Condon, Gregory A.Sanders, Dan Parvaz, Alan Rubenstein, Christy Doran, John Aberdeen, & Beatrice Oshika: Normalization for automated metrics: English and Arabic speech translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 33-40. [PDF, 326KB]

(2009) Jennifer DeCamp: Language Technology Resource Center. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 506-512. [PDF, 200KB]

(2009) Jennifer DeCamp: Translation tools: status, practice, and gaps. [Tutorial at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26, 2009, Ottawa, Ontario, Canada; 9pp. [PDF, 662KB], presentation, 45pp. [PDF of PPT, 1124KB]

(2009) Jennifer DeCamp: What is missing in user-centric MT? MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 489-485. [PDF, 454KB]

(2009) Ken Samuel, Alan Rubenstein, Sherri Condon & Alex Yeh: Name matching between Chinese and Roman scripts: machine complements human. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.28-31. [PDF, 454KB]

(2009) Carol Van Ess-Dykema, Susan P.Converse, Dennis Perzanowski, & John S.White: Exploring translation memory for extensibility across genres: implications for usage and metrics. Translating and the Computer 31, 19-20 November 2009, London; 16pp. [PDF, 444KB]

(2008) Sherri Condon, Jon Phillips, Christy Doran, John Aberdeen, Dan Parvaz, Beatrice Oshika, Greg Sanders, & Craig Schlendoff: Applying automated metrics to speech translation dialogs.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 178KB]

(2008) Jennifer DeCamp: Language Technology Resource Center. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; 7pp. [PDF, 678KB]

(2008) Jennifer DeCamp: Working with the U.S.Government: information resources. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.334-338. [PDF, 597KB]

(2008) Jennifer Doyon, Christine Doran, C.Donald Means, & Domenique Parr: Automated machine translation improvement through post-editing techniques: analyst and translator experiments. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.346-353. [PDF, 712KB]

(2008) Roderick Holland & Brendan Keyes: ClipperRSS: a light-weight prototype for the cross-language exploitation of syndicated feeds. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.390-393. [PDF, 661KB]

(2008) Inderjeet Mani, Alex Yeh, & Sherri Condon: Learning to match names across languages. Coling 2008: Proceedings of the 2nd Workshop on Multi-source, Multilingual Information Extraction and Summarization, 23 August 2008, Manchester, UK; pp.2-9. [PDF, 182KB]

(2008) Karine Megerdoomian & Dan Parvaz: Low-density language bootstrapping: the case of Tajiki Persian.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 331KB]

(2008) Gregory A. Sanders, Sébastien Bronsart, Sherri Condon, & Craig Schlenoff: Odds of successful transfer of low-level concepts: a key metric for bidirectional speech-to-speech machine translation in DARPA’s TRANSTAC program.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 177KB]

(2008) Brian A. Weiss, Craig Schlenoff, Greg Sanders, Michelle P.Steves, Sherri Condon, Jon Phillips, & Dan Parvaz: Performance evaluation of speech translation systems.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 123KB]

(2007) Jen Doyon: How to successfully integrate MT. Tutorial at MT Summit XI, 10 September 2007, Copenhagen, Denmark; abstract, 1p. [PDF, 88KB]

(2007) John S.White & Florence Reeder: The Chinese room experiment: the self-organizing feng shui of MT. Workshop at MT Summit XI, 11 September 2007, Copenhagen, Denmark; abstract, 1p. [PDF, 13KB]

(2006) Sherri Condon & Keith J.Miller: Name transliteration: current methods, applications, and evaluation in name transliteration and name translation. Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 46pp. [PDF of PPT presentation, 1103KB]

(2006) Sherri Condon & Keith J. Miller: What’s in a name: current methods, applications, and evaluation in multilingual name search and matching. [Abstract for tutorial, in] HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 299-300 [PDF, 20KB]

(2006) David Day, Galen Williamson, Alex Yeh, Keith Crouch, Sam Bayer, Jennifer DeCamp, Angel Asencio, Seamus Clancy, & Flo Reeder: A platform for the empirical analysis of translation resources, tools and their use.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.39-44. [PDF, 613KB]

(2006) Jennifer DeCamp: Social impact of MT (contribution to panel on “Machine translation for social impact”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 76KB]

(2006) Gil Francopoulo, Nuria Bel, Monte George, Nicoletta Calzolari, Monica Monachini, Mandy Pet, & Claudia Soria: Lexical markup framework (LMF) for NLP multilingual resources. Coling-ACL 2006: Proceedings of the Workshop on Multilingual Language Resources and Interoperability, Sydney, July 2006; pp.1-8. [PDF, 69KB]

(2006) Andrew T. Freeman, Sherri L. Condon & Christopher M. Ackerman: Cross linguistic name matching in English and Arabic: a “one to many mapping” extension of the Levenshtein edit distance algorithm. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 471-478 [PDF, 371KB]

(2006) Keith J.Miller & Michelle Vanni: Formal vs. informal: register-differentiated Arabic MT evaluation in the PLATO paradigm.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.161-166 [PDF, 296KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) Florence Reeder: Direct application of a language learner test to MT evaluation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.166-175 [PDF, 123KB]

(2006) Florence Reeder: Measuring MT adequacy using Latent Semantic Analysis.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.176-184 [PDF, 180KB]

(2005) John Henderson & William Morgan: Gaming fluency: evaluating the bounds and expectations of segment-based translation memory. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 175-182. [PDF, 194KB]

(2005) Keith J. Miller: Evaluation principles and objectives: ISLE, EAGLES (a play in three acts).  ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 48pp. [PDF from PPT, 901KB]

(2005) Keith J. Miller & Michelle Vanni: Inter-rater agreement measures, and the refinement of metrics in the PLATO MT evaluation paradigm. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.125-132. [PDF, 183KB]

Mobile Technologies LLC

(2007) Christian Fügen, Alex Waibel, & Muntsin Kolss: Simultaneous translation of lectures and speeches [abstract].  Machine Translation 21 (4) December 2007; pp.209-252.

(2006) Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W,Black, Tanja Schultz, & Alex Waibel: Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1712; abstract [PDF, 94KB]

Motorola Corporation

(2007) Paul C. Davis, Zhuli Xie, & Kevin Small: All links are not the same: evaluating word alignments for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.119-126 [PDF, 122KB]

National Ground Intelligence Center

(2008) James E. Andrews & Kristen Summers: Machine translation for triage and exploitation of massive text data. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.291-298 [PDF, 703KB]

(2008) Shin Chang-Meadows: MT errors in CH-to-EN MT systems: user feedback. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.324-333.  [PDF of PPT presentation, 778KB]

National Institute of Standards and Technology

(2009) Sherri Condon, Gregory A.Sanders, Dan Parvaz, Alan Rubenstein, Christy Doran, John Aberdeen, & Beatrice Oshika: Normalization for automated metrics: English and Arabic speech translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 33-40. [PDF, 326KB]

(2009) Alon Lavie & Mark Przybocki: Introduction to the special issue on “Automated metrics for machine translation evaluation”. Machine Translation 23 (2/3), September 2009; pp.69-70. [see publication]

(2009) Mark Przybocki, Kay Peterson, Sebastian Bronsart, & Gregory Sanders: The NIST 2008 metrics for machine translation challenge—overview, methodology, metrics, and results [abstract]. Machine Translation 23 (2/3), September 2009; pp.71-103.

(2009) The 2009 NIST machine translation evaluation plan (MT09). [NIST, 2009]; 8pp. [PDF, 360KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Arabic-English . [NIST, 2009]; [HTML, 301KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Urdu-English . [NIST, 2009]; [HTML, 125KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Combination tests . [NIST, 2009]; [HTML, 114KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Progress test . [NIST, 2009]; [HTML, 108KB]

(2008) Sherri Condon, Jon Phillips, Christy Doran, John Aberdeen, Dan Parvaz, Beatrice Oshika, Greg Sanders, & Craig Schlendoff: Applying automated metrics to speech translation dialogs.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 178KB]

(2008) Mark Przybocki, Kay Peterson, & Sébastien Bronsart: Translation adequacy and preference evaluation tool (TAP-ET).  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 605KB]

(2008) Gregory A. Sanders, Sébastien Bronsart, Sherri Condon, & Craig Schlenoff: Odds of successful transfer of low-level concepts: a key metric for bidirectional speech-to-speech machine translation in DARPA’s TRANSTAC program.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 177KB]

(2008) Brian A. Weiss, Craig Schlenoff, Greg Sanders, Michelle P.Steves, Sherri Condon, Jon Phillips, & Dan Parvaz: Performance evaluation of speech translation systems.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 123KB]

(2008) NIST: MetricsMATR challenge. [NIST, 2008]. 6pp. [PDF, 35KB]

(2008)  The 2008 NIST open machine translation evaluation plan (MT08). [NIST, 2008]; 7pp. [PDF, 360KB]

(2008) NIST 2008 open machine translation evaluation – (MT08). Official evaluation results. [NIST, 2008]

(2006) Mark Przybocki, Gregory Sanders, & Audrey Le: Edit distance: a metric for machine translation evaluation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2038-2043 [PDF, 446KB]

(2006) The 2006 NIST machine translation evaluation plan (MT06). [NIST, 2006]; 6pp. [PDF, 92KB]

(2006) NIST 2006 machine translation evaluation. Official results. [NIST, 2006]; [HTML, 263KB]

(2005) The 2005 NIST machine translation evaluation plan (MT-05). [NIST, 2005];6pp. [PDF, 51KB]

(2005) NIST 2005 machine translation evaluation. Official results. [NIST, 2005]; 6pp. [PDF, 38KB]

 

National Science Foundation

(2007) Mary Harper, Alex Acero, Srinivas Bangalore, Jaime Carbonell, Jordan Cohen, Barbara Cuthill, Carol Espy-Wilson, Christiane Fellbaum, John Garofolo, Chin-Hui Lee, Jim Lester, Andrew McCallum, Nelson Morgan, Michael Picheney, Joe Picone, Lance Ramshaw, Jeff Reynar, Hadar Shemtov, & Clare Voss: Report on the NSF-sponsored Human Language Technology Workshop on Industrial Centers. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.239-246 [PDF, 94KB]

National Virtual Translation Center

(2009) Carol Van Ess-Dykema, Susan P.Converse, Dennis Perzanowski, & John S.White: Exploring translation memory for extensibility across genres: implications for usage and metrics. Translating and the Computer 31, 19-20 November 2009, London; 16pp. [PDF, 444KB]

(2009) Carol Van Ess-Dykema, Dennis Perzanowski, Susan Converse, Rachel Richardson, John S.White, & Tucker Maney: Translation memory technology assessment. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 550-559 [PDF of PPT presentation, 2534KB]

(2008) Carol Van Ess-Dykema, Helen G. Gigley, Stephen Lewis, & Emily Vancho Bannister: Embedding technology at the front end of a human translation workflow: an NVTC vision. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.457-463. [PDF, 725KB] 

Naval Postgraduate School

(2007) Wing Shan Shirley Ong: Sequoyah foreign language translation system – business case analysis. Masters thesis, Naval Postgraduate School, Monterey, CA, December 2007. xviii, 71pp. [PDF, 1193KB]

Naval Research Laboratory

(2009) Carol Van Ess-Dykema, Susan P.Converse, Dennis Perzanowski, & John S.White: Exploring translation memory for extensibility across genres: implications for usage and metrics. Translating and the Computer 31, 19-20 November 2009, London; 16pp. [PDF, 444KB]

(2009) Carol Van Ess-Dykema, Dennis Perzanowski, Susan Converse, Rachel Richardson, John S.White, & Tucker Maney: Translation memory technology assessment. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 550-559 [PDF of PPT presentation, 2534KB]

New Jersey, State University of

(2009) Miguel A.Jiménez-Crespo: Conventions in localization: a corpus study of original vs. translated web texts. Journal of Specialised Translation 12 (July 2009); pp.79-102. [PDF, 285KB]

New Mexico State University

(2009) Ahmed Abdelali, Steve Helmreich, & Ron Zacharski: Investigations on standard Arabic geographical classification.  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 193KB]

(2009) William Ogden, Ron Zacharski, Sieun An, & Yuki Ishikawa: User choice as an evaluation metric for web translation services in cross language instant messaging applications. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 97-103. [PDF, 376KB]

(2008) Peter A. Chew & Ahmed Abdelali: The effects of language relatedness on multilingual information retrieval: a case study with Indo-European and Semitic languages. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.1-9. [PDF, 196KB]

(2008) Peter A.Chew, Brett W.Bader, & Ahmed Abdelali: Latent morpho-semantic analysis: multilingual information retrieval with character n-grams and mutual information. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.129-136. [PDF, 117KB]

(2007) Peter A.Chew & Ahmed Abdelali: Benefits of the ‘massively parallel Rosetta stone’: cross-language information retrieval with over 30 languages. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 872-879 [PDF, 145KB]

(2006) Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Hamid Mansouri Rad & Ron Zacharski: Guarani: a case study in resource development for quick ramp-up MT. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.1- 9 [PDF, 344KB]

(2006) David Farwell & Stephen Helmreich: Pragmatics-based MT and the translation of puns. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.187- 193 [PDF, 157KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

Newswatch Inc.

(2008) Teruko Mitamura, Eric Nyberg, Hideki Shima, Tsuneaki Kato, Tatsunori Mori, Chin-Yew Lin,  Ruihua Song, Chuan-Jie Lin, Tetsuya Sakai, Donghong Ji, & Noriko Kando: Overview of the NTCIR-7 ACLIA tasks: advanced cross-lingual information access. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp.11-25. [PDF, 1062KB]

New York, City University of

(2009) Heng Ji: Mining name translations from comparable corpora by creating bilingual information networks. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.34-47. [PDF, 120KB]

(2009) Kristen Parton, Kathleen R.McKeown, Bob Coyne, Mona T.Diab, Ralph Grishman, Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei Yun Ma, Adam Meyers, Sara Stolbach, Ang Sun, Gokhan Tur, Wei Xu, & Sibel Yaman: Who, what, when, where, why? Comparing multiple approaches to the cross-lingual 5W task. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.423-431. [PDF, 176KB]

(2007) Matt Huenerfauth, Liming Zhao, Erdan Gu, & Jan Allbeck: Design and evaluation of an American Sign Language generator. ACL 2007: Proceedings of the Workshop on Embodied Language Processing, June 29, 2007, Prague, Czech Republic; pp.51-58. [PDF, 204KB]

(2007) Kui-Lam Kwok & Norbert Dinstl: NTCIR-6 Chinese monolingual and English-Chinese cross language retrieval experiments using PIRCS. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.92-96. [PDF, 68KB]

(2007) Kui-Lam Kwok, Peter Deng & Norbert Dinstl: NTCIR-6 monolingual Chinese and English-Chinese cross-lingual question answering experiments using PIRCS.  Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; 8pp. [PDF, 195KB]

(2005) Kui-Lam Kwok, Sora Choi, Norbert Dinstl, & Peter Deng: NTCIR-5 Chinese, English, Korean cross-language retrieval experiments using PIRCS. Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 8pp. [PDF, 455KB]

New York University

(2009) Daniel Galron, Sergio Penkale, Andy Way, & I.Dan Melamed: Accuracy-based scoring for DOT: towards direct error minimization for data-oriented translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.371-380. [PDF, 164KB]

(2009) Jan Hajič, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Marti, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štěpánek, Pavel Straňák, Mihai Surdeanu, Naiwen Xue, & Yi Zhang: The CoNLL-2009 Shared task: syntactic and semantic dependencies in multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.1-18. [PDF, 220KB]

(2009) Kristen Parton, Kathleen R.McKeown, Bob Coyne, Mona T.Diab, Ralph Grishman, Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei Yun Ma, Adam Meyers, Sara Stolbach, Ang Sun, Gokhan Tur, Wei Xu, & Sibel Yaman: Who, what, when, where, why? Comparing multiple approaches to the cross-lingual 5W task. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.423-431. [PDF, 176KB]

(2009) Benjamin Wellington, Joseph Turian & I.Dan Melamed: Toward purely discriminative training for tree-structured translation models.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.131-149.

(2008) Shasha Liao: Combining source and target language information for name tagging of machine translation output. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the student research workshop, June 16, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 19-24. [PDF, 98KB]

 (2007) Heng Ji & Ralph Grishman: Collaborative entity extraction and translation. In: Recent Advances in Natural Language Processing: selected papers from RANLP ’97, ed. Nicolas Nikolov, John Benjamins Publ.Co.: Amsterdam/Philadelphia, 2000; pp.73-84. [PDF, 303KB]

(2006) Benjamin Wellington, Joseph Turian, Chris Pike & I.Dan Melamed: Scalable purely-discriminative training for word and tree transducers. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.251-260 [PDF, 176KB]

(2006) Benjamin Wellington, Sonja Waxmonsky, & I.Dan Melamed: Empirical lower bounds on the complexity of translational equivalence. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.977-984. [PDF, 121KB]

NIST see National Institute of Standards and Technology

Northern State University (South Dakota)

(2009) Milam Aiken, Mina Park, Lakisha Simmons, & Tobin Lindblom: Automatic translation in multilingual electronic meetings. Translation Journal 13 (3), July 2009; 9pp. [PDF, 285KB]

North Texas, University of

(2009) Hakan Ceylan & Yookyung Kim: Language identification of search engine queries. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.1066-1074. [PDF, 239KB]

(2009) Ravi Sinha, Diana McCarthy, & Rada Mihalcea: SemEval-2010 Task 2: cross-lingual lexical substitution. NAACL-HLT-2009 (SEW-2009): Semantic Evaluations: Recent Achievements and Future Directions, Proceedings of the workshop, June 4, 2009, Boulder, Colorado; pp.76-81. [PDF,

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2008) Rada Mihalcea & Chee Wee Leong: Toward communicating simple sentences using pictorial representations [abstract].  Machine Translation 22 (3), September 2008; pp.153-173.

(2008) Rada Mihalcea & Vivi Nastase: How to add a new language on the NLP map: building resources and tools for languages with scarce resources. [Abstract of tutorial at] IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.938. [PDF, 249KB]

(2008) Michael Mohler & Rada Mihalcea: BABYLON parallel text builder: gathering parallel texts for low-density languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 63KB]

(2006) Rada Mihalcea & Ben Leong: Toward communicating simple sentences using pictorial representations. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.119-127 [PDF, 219 KB]

(2005) Joel Martin, Rada Mihalcea, & Ted Pedersen: Word alignment for languages with scarce resources. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 65-74. [PDF, 170KB]

Ohio State University

(2007) Dennis N. Mehay & Chris Brew: BLEUÂTRE: flattening syntactic dependencies for MT evaluation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.122-131 [PDF, 370KB]; presentation [PDF, 802KB]

(2007) Michael White, Rajakrishnan Rajkumar, & Scott Martin: Towards broad coverage surface realization with CCG. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.22-30 [PDF, 2020KB]

(2006) Jirka Hana, Anna Feldman, Chris Brew, & Luiz Amaral: Tagging Portuguese with a Spanish tagger using cognates. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 253KB]

Oregon Health Sciences University

(2009) Kevin Knight & Richard Sproat: Writing systems, transliteration and decipherment.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Tutorials, Boulder, Colorado, May 31 - June 5, 2009; pp.15-16. [PDF, 68KB]

Pacific North West National Laboratory

(2005) Thomas Hoeft, Nick Cramer, M.L.Gregory, & Elizabeth Hetzler: Bridging the gap between technology and users: leveraging machine translation in a visual data triage tool.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 2-3. [PDF, 77KB]

Palo Alto Research Center see Xerox Corporation

Pan American Health Organization

(2009) Julia Aymerich & Hermes Camelo: The machine translation maturity model at PAHO. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 403-409. [PDF, 44KB]

(2007) Julia Aymerich & Hermes Camelo: Automatic extraction of entries for a machine translation dictionary using bitexts. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.21-27 [PDF, 88KB]

Pennsylvania, University of

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) Jieun Chae & Ani Nenkova: Predicting the fluency of text with shallow structural features: case studies of machine translation and human-written text.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.139-147. [PDF, 120KB]

(2009) João Graça, Kuzman Ganchev & Ben Taskar: PostCAT – posterior constrained alignment toolkit. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.27-36. [PDF, 134KB]

(2008) Lauren Friedman, Haejoong Lee, & Stephanie Strassel: A quality control framework for gold standard reference translations: the process and toolkit developed for GALE. Translating and the Computer 30, 27-28 November 2008, London; 6pp. [PDF, 77KB]

(2008) Kuzman Ganchev, João V. Graça, & Ben Taskar: Better alignments = better translations? ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 763-770. [PDF, 685KB]

(2008) Haitao Mi & Liang Huang: Forest-based translation rule extraction.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.206-214. [PDF, 170KB]

(2008) Haitao Mi, Liang Huang, & Qun Liu: Forest-based translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 192-199. [PDF, 185KB]

(2007) Liang Huang: Binarization, synchronous binarization, and target-side binarization. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.33-40 [PDF, 161KB]

(2007) Liang Huang & David Chiang: Forest rescoring: faster decoding with integrated language models. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 144-151 [PDF, 215KB]

(2007) Matt Huenerfauth, Liming Zhao, Erdan Gu, & Jan Allbeck: Design and evaluation of an American Sign Language generator. ACL 2007: Proceedings of the Workshop on Embodied Language Processing, June 29, 2007, Prague, Czech Republic; pp.51-58. [PDF, 204KB]

(2007) Sriram Venkatapathy & Aravind K.Joshi: Discriminative word alignment by learning the alignment structure and syntactic divergence between a language pair. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.49-56 [PDF, 172KB]

(2006) Yuan Ding & Martha Palmer: Better learning and decoding for syntax based SMT using PSDIG. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.37-45 [PDF, 391KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

(2006) Liang Huang: Efficient algorithms for richer formalisms: parsing and machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 223-226 [PDF, 87KB]

(2006) Liang Huang, Kevin Knight & Aravind Joshi: Statistical syntax-directed translation with extended domain of locality. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.66-73 [PDF, 162KB]

(2006) Liang Huang, Kevin Knight, & Aravind Joshi: A syntax-directed translator with extended domain of locality. HLT-NAACL-2006: Computationally Hard Problems and Joint Inferenec in Speech and Language Processing. Proceedings of the workshop, 9 June 2006, New York City, NY, USA; pp.1-8. [PDF, 120KB]

(2006) Ryan McDonald, Kevin Lerman, & Fernando Pereira: Multilingual dependency analysis with a two-stage discriminative parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.216-220. [PDF, 79KB]

(2006) Sriram Venkatapathy & Aravind K.Joshi: Using information about multi-word expressions for the word-alignment task. Coling-ACL 2006: Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties, Sydney, July 2006; pp.20-27 [PDF, 120KB]

(2006) Hao Zhang, Liang Huang, Daniel Gildea, & Kevin Knight: Synchronous binarization for machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 256-263 [PDF, 206KB]

(2005) Yuan Ding & Martha Palmer: Machine translation using probabilistic synchronous dependency insertion grammars.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 541-548. [PDF, 369KB]

(2005) Liang Huang, Hao Zhang, & Daniel Gildea: Machine translation as lexicalized parsing with hooks. IWPT 2005: Proceedings of the Ninth International Workshop on Parsing Technologies, Vancouver, October 2005; pp.65-73. [PDF, 130KB]

(2005) Matt Huenerfauth: American Sign Language generation: multimodal NLG with multiple linguistic channels. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 37-42. [PDF, 91KB]

Pittsburgh: Carnegie Mellon University see Carnegie Mellon University

Pittsburgh: Language Technologies Institute see Carnegie Mellon University

Pittsburgh, University of

(2009) Joshua S.Albrecht, Rebecca Hwa, & G.Elisabeta Marai: Correcting automatic translations through collaborations between MT and monolingual target-language users.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.60-68. [PDF, 178KB]

(2009) Behrang Mohit, Frank Liberato, & Rebecca Hwa: Language model adaptation for difficult to translate phrases. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.160-167. [PDF, 343KB]

(2008) Joshua S.Albrecht & Rebecca Hwa: Regression for machine translation evaluation at the sentence level [abstract].  Machine Translation 22 (1/2), March-June 2008; pp.1-27.

(2008) Joshua S. Albrecht & Rebecca Hwa: The role of pseudo reference in MT evaluation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.187-190. [PDF, 53KB]

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2007) Joshua S. Albrecht & Rebecca Hwa: Regression for sentence-level MT evaluation with pseudo references. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 296-303 [PDF, 98KB]

(2007) Joshua S. Albrecht & Rebecca Hwa: A re-examination of machine learning approaches for sentence-level MT evaluation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 880-887 [PDF, 126KB]

(2007) Behrang Mohit & Rebecca Hwa: Localization of difficult-to-translate phrases.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 248-255 [PDF, 117KB]

(2006) Rebecca Hwa, Carol Nichols & Khalil Sima’an: Corpus variations for translation lexicon induction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.74-81 [PDF, 178KB]

(2006) Christian Monson, Ariadna Font Llitjos, Roberto Aranovich, Lori Levin, Ralf Brown, Eric Peterson, Jaime Carbonell, & Alon Lavie: Building NLP systems for two resource-scarce indigenous languages: Mapudungun and Quechua. LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.15-24. [PDF, 124KB]

(2006) Jason Riesa, Behrang Mohit, Kevin Knight, & Daniel Marcu: Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 2012; abstract [PDF, 80KB]

(2005) Carol Nichols & Rebecca Hwa: Word alignment and cross-lingual resource acquisition. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 69-72. [PDF, 331KB]

Powerset Inc.

(2007) Pierrette Bouillon, Glenn Flores, Marianne Starlander, Nikos Chatzichrisafis, Marianne Santaholma, Nikos Tsourakis, Manny Rayner, & Beth Ann Hockey: A bidirectional grammar-based medical speech translator. ACL 2007: Proceedings of the Workshop on Grammar-based approaches to spoken language processing (SPEECHGRAM 2007), June 29, 2007, Prague, Czech Republic; pp.41-48. [PDF, 216KB]

(2007) Pierrette Bouillon, Sonia Halimi, Manny Rayner, & Beth Ann Hockey: Adapting a medical speech to speech translation system (MedSLT) to Arabic. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 41-48 [PDF, 145KB]

Pratt & Whitney Corporation

(2006) David Shaw: Simplified Technical English in the 21st century. CLAW 2006: 5th International Workshop on Controlled Language Applications, Cambridge, MA, USA, August 12, 2006; 1p. [abstract] [PDF, 53KB]

Princeton University

(2006) Sabri Elkateb, William Black, Piek Vossen, David Farwell, Adam Pease, & Christiane Fellbaum: Arabic WordNet and the challenges of Arabic. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.15-24. [PDF, 295KB]

Rochester University

(2009) Tagyoung Chung & Daniel Gildea: Unsupervised tokenization for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.718-726. [PDF, 197KB]

(2009) Liang Huang, Hao Zhang, David Gildea & Kevin Knight: Binarization of synchronous context-free grammars. Computational Linguistics 35 (4), pp. 559-595 [PDF, 837KB]

(2008) Dekang Lin, Shaojun Zhao, Benjamin Van Durme, & Marius Paşca: Mining parenthetical translations from the web by word alignment. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 994-1002. [PDF, 419KB]

(2008) Ding Liu & Daniel Gildea: Improved tree-to-string transducer for machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.62-69. [PDF, 208KB]

(2008) Matt Post & Daniel Gildea: Parsers as language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.172-181. [PDF, 538KB]

(2008) Hao Zhang, Daniel Gildea, & David Chiang: Extracting synchronous grammar rules from word-level alignments in linear time. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1081-1088. [PDF, 181KB]

(2007) Ding Liu & Daniel Gildea: Source-language features and maximum correlation training for machine translation evaluation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.41-48 [PDF, 201KB]

(2007) Hao Zhang & Daniel Gildea: Factorization of synchronous context-free grammars in linear time. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.25-32 [PDF, 151KB]

(2006) Ding Liu & Daniel Gildea: Stochastic iterative alignment for machine translation evaluation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.539-546. [PDF, 201KB]

(2006) Hao Zhang & Daniel Gildea: Efficient search for inversion transduction grammar.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 224-231. [PDF, 172KB]

(2006) Hao Zhang & Daniel Gildea: Inducing word alignments with bilexical synchronous trees. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.953-960. [PDF, 156KB]

(2006) Hao Zhang, Liang Huang, Daniel Gildea, & Kevin Knight: Synchronous binarization for machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 256-263 [PDF, 206KB]

(2005) Liang Huang, Hao Zhang, & Daniel Gildea: Machine translation as lexicalized parsing with hooks. IWPT 2005: Proceedings of the Ninth International Workshop on Parsing Technologies, Vancouver, October 2005; pp.65-73. [PDF, 130KB]

(2005) Ding Liu & Daniel Gildea: Syntactic features for evaluation of machine translation. ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 25-32. [PDF, 158KB]

(2005) Hao Zhang & Daniel Gildea: Stochastic lexicalized inversion transduction grammar for alignment. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 4787-482. [PDF, 102KB]

Rutgers University

(2009) Miguel A.Jiménez-Crespo: Conventions in localization: a corpus study of original vs. translated web texts. Journal of Specialised Translation 12 (July 2009); pp.79-102. [PDF, 285KB]

Saint Louis University

(2006) Kevin P. Scannell: Machine translation for closely related language pairs.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.103-107. [PDF, 139KB]

(2006) Oliver Streiter, Kevin P.Scanell, & Martin Stuflesser: Implementing NLP projects for noncentral languages: instructions for funding bodies, strategies for developers [abstract]. Machine Translation 20 (4),2006; pp.267-289.

Sandia National Laboratories

(2008) Brett W.Bader & Peter A.Chew: Enhancing multilingual latent semantic analysis with term alignment information.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.49-56. [PDF, 295KB]

(2008) Peter A.Chew, Brett W.Bader, & Ahmed Abdelali: Latent morpho-semantic analysis: multilingual information retrieval with character n-grams and mutual information. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.129-136. [PDF, 117KB]

(2008) Peter A. Chew & Ahmed Abdelali: The effects of language relatedness on multilingual information retrieval: a case study with Indo-European and Semitic languages. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.1-9. [PDF, 196KB]

(2007) Peter A.Chew & Ahmed Abdelali: Benefits of the ‘massively parallel Rosetta stone’: cross-language information retrieval with over 30 languages. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 872-879 [PDF, 145KB]

SDL International

(2009) Terry Lawlor: [Contribution to panel]: Converging technologies – what are the benefits for MT users? MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26, 2009, Ottawa, Ontario, Canada; 10pp. [PDF of PPT presentation, 858KB]

Sehda Inc.

(2006) Farzad Ehsani, Jim Kimzey, Demetrios Master, Karen Lesea & Hunil Park: Speech to speech translation for medical triage in Korean.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.17-23 [PDF, 96KB]

(2006) Farzad Ehsani, Jim Kimzey, Demetrios Master, Karen Sudre, David Domingo & Hunil Park: S-MINDS 2-way speech-to-speech translation system.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.48-49 [PDF, 58KB]

(2005) Yookyung Kim, Jun Huang, Youssef Billawala, Demitrios Master, & Farzad Ehsani: Sehda S2MT: incorporation of syntax into statistical translation system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 6pp. [PDF, 116KB]

Smart Communications Inc.

(2006) John Smart: SMART Controlled English. CLAW 2006: 5th International Workshop on Controlled Language Applications, Cambridge, MA, USA, August 12, 2006; 9pp. [PDF, 262KB]

Southern California, University of

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Mitigation of data sparsity in classifier-based translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.1-4. [PDF, 138KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Towards unsupervised training of the classifier-based speech translator. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2739-2742; abstract [PDF, 58KB]

(2008) Vivek Kumar Sridhar, Srinivas Bangalore, & Shrikanth S.Narayanan: Factored translation models for enriching spoken language translation with prosody. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2723-2726; abstract [PDF, 60KB]

(2008) Vivek Kumar Rangarajan Sridhar, Shrikanth Narayanan, & Srinivas Bangalore: Enriching spoken language translation with dialog acts. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 225-228. [PDF, 383KB]

(2006) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth Narayanan: Cross-lingual dialog model for speech to speech translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1858; abstract [PDF, 74KB]

(2006) Jason Riesa, Behrang Mohit, Kevin Knight, & Daniel Marcu: Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 2012; abstract [PDF, 80KB]

(2005) Steve DeNeefe, Kevin Knight, & Hayward H. Chan: Interactively exploring a machine translation model. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 97-100. [PDF, 123KB]

Southern California, University of: Information Sciences Institute see Information Sciences Institute, University of Southern California

Southwestern Texas University

(2008) Pierrette Bouillon, Glenn Flores, Maria Georgescul, Sonia Halimi, Beth Ann Hockey, Hitoshi Isahara, Kyoko Kanzaki, Yukie Nakao, Manny Rayner, Marianne Santaholma, Marianne Starlander, & Nikos Tsourakis: Many-to-many multilingual medical speech translation on a PDA. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.314-323. [PDF, 666KB]

(2008) Manny Rayner, Pierrette Bouillon, Jane Brotanek, Glenn Flores, Sonia Halimi, Beth Ann Hockey, Hitoshi Isahara, Kyoko Kanzaki, Elisabeth Kron, Yukie Nakao, Marianne Santaholma, Marianne Starlander, & Nikos Tsourakis: The 2008 MedSLT system. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.32-35. [PDF, 209KB]

(2008) Manny Rayner, Pierrette Bouillon, Glenn Flores, Farzad Ehsani, Marianne Starlander, Beth Ann Hockey, Jane Brotanek, & Lukas Biewald: A small-vocabulary shared task for medical speech translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.60-63. [PDF, 62KB]

(2008) Marianne Starlander, Pierrette Bouillon, Glenn Flores, Manny Rayner, & Niks Tsourakis: Comparing two different bidirectional versions of the limited-domain medical spoken language translator MedSLT.  EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.176-181. [PDF, 521KB]

SpeechCycle Inc.

(2009) David Suendermann, Jackson Liscombe, Krishna Dayanidhi, & Roberto Pieraccini: Localization of speech recognition in spoken dialog systems: how machine translation can make our lives easier. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

SpeechGear Inc.

(2006) Lisa Cochran: Reaching across the language barrier.  Speech Technology, November/December 2006. [PDF, 86KB]

Spoken Translation, Inc.

(2008) Mark Seligman & Mike Dillinger: Rapid portability among domains in an interactive spoken language translation system. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.40-47. [PDF, 129KB]

(2006) Mike Dillinger & Mark Seligman: ConverserTM : highly interactive speech-to-speech translation for healthcare.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.40-43 [PDF, 71KB]

(2006) Mark Seligman & Mike Dillinger: Usability issues in an interactive speech-to-speech translation system for healthcare. HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.1-8 [PDF, 288KB]

(2006) Mike Dillinger: Tools and techniques for translatable content. Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 25pp. [PDF of PPT presentation, 1398KB]

(2005) Mike Dillinger: Interlinguas and semantic roles. MT Summit X, Phuket, Thailand, September 12, 2005, Tutorial Note. [PDF of PPT presentation, 1945KB]

(2005) Chengqing Zong & Mark Seligman: Toward practical spoken language translation [abstract]. Machine Translation 19 (2), 2005; pp.113-137.

SRA International

(2009) Dmitry Zelenko: Combining MDL transliteration training with discriminative modeling. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.116-119. [PDF, 92KB]

(2006) Dmitry Zelenko & Chinatsu Aone: Discriminative methods for transliteration.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 612-617. [PDF, 342KB]

SRI International

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) Dayne Freitag & Zhiqiang Wang: Name transliteration with bidirectional perceptron edit models. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.132-135. [PDF, 96KB]

(2009) Kristen Parton, Kathleen R.McKeown, Bob Coyne, Mona T.Diab, Ralph Grishman, Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei Yun Ma, Adam Meyers, Sara Stolbach, Ang Sun, Gokhan Tur, Wei Xu, & Sibel Yaman: Who, what, when, where, why? Comparing multiple approaches to the cross-lingual 5W task. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.423-431. [PDF, 176KB]

(2009) Mei Yang & Jing Zheng: Toward smaller, faster, and better hierarchical phrase-based SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.237-240. [PDF, 140KB]

(2009) Jing Zheng, Necip Fazil Ayan, Wen Wang, & David Burkett: Using syntax in large-scale audio document translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2008) Necip Fazil Ayan, Jing Zheng, & Wen Wang: Improving alignments for better confusion networks for combining machine translation systems. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.33-40. [PDF, 228KB]

(2008) Andreas Kathol & Jing Zheng: Strategies for building a Farsi-English SMT system from limited resources. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2731-2734; abstract [PDF, 62KB]

(2007) Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arısoy, Murat Saraçlar, & Andreas Stolcke: Analysis of morph-based speech recognition and the modeling of out-of-vocabulary words across languages. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.380-387 [PDF, 136KB]

(2007) Kristin Precoda, Jing Zheng, Dimitra Vergyri, Horacio Franco, Colleen Richey, Andreas Kathol, & Sachin Kajarekar: IraqComm: a next generation translation system. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2841-2844; abstract [PDF, 23KB]

(2005) John Fry: Assembling a parallel corpus from RSS news feeds MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.59-62. [PDF, 303KB]

(2005) Andreas Kathol, Kristin Precoda, Dimitra Vergyri, Wen Wang, & Susanne Riehemann: Speech translation for low-resource languages: the case of Pashto. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.2273-2276; abstract [PDF, 44KB]

Stanford University [see also Center for the Study of Language and Information]

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, & Christopher D.Manning: Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59. [PDF, 558KB]

(2009) Yu Chen, Martin Kay, & Andreas Eisele: Intersecting multilingual data for faster and better statistical translations.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.128-136. [PDF, 159KB]

(2009) Michel Galley & Christopher D.Manning: Quadratic-time dependency parsing for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.773-781. [PDF, 235KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Sebastian Padó, Daniel Cer, Michel Galley, Dan Jurafsky, & Christopher D.Manning: Measuring machine translation quality as semantic equivalence: a metric based on entailment features [abstract]. Machine Translation 23 (2/3), 2009; pp.181-193.

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky & Chris Manning: Robust machine translation evaluation with entailment features. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.297-305. [PDF, 437KB]

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky, & Christopher D.Manning: Textual entailment features for machine translation evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.37-41. [PDF, 378KB]

(2008) Daniel Cer, Daniel Jurafsky, & Christopher D. Manning: Regularization and search for minimum error rate training. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.26-34. [PDF, 712KB]

(2008) Pi-Chuan Chang, Michel Galley, & Christopher D. Manning: Optimizing Chinese word segmentation for machine translation performance. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.224-232. [PDF, 165KB]

(2008) Andreas Eisele, Christian Federmann, Hans Uszkoreit, Hervé Saint-Amand, Martin Kay, Michael Jellinghaus, Sabine Hunsicker, Teresa Herrmann, & Yu Chen: Hybrid machine translation architectures within and beyond the EuroMatrix project. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.27-34. [PDF, 552KB]

(2008) Andreas Eisele, Christian Federmann, Hans Uszkoreit, Hervé Saint-Amand, Martin Kay, Michael Jellinghaus, Sabine Hunsicker, Teresa Herrmann, & Yu Chen: Hybrid architectures for multi-engine machine translation. Translating and the Computer 30, 27-28 November 2008, London; 12pp. [PDF, 170KB]

(2008) Michel Galley & Christopher D.Manning: A simple and effective hierarchical phrase reordering model. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.848-856. [PDF, 230KB]

(2007) Pi-Chuan Chang & Kristina Toutanova: A discriminative syntactic word order model for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 9-16 [PDF, 235KB]

(2007) Stephan Oepen, Erik Velldal, Jan Tore Lønning, Paul Meurer, Victoria Rosén, & Dan Flickinger: Towards hybrid quality-oriented machine translation – on linguistics and probabilities in MT. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.144-153 [PDF, 328KB]

(2006) Jan Tore Lønning & Stephan Oepen: Re-usable tools for precision machine translation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.53-56. [PDF, 213KB]

(2005) David Vickrey, Luke Biewald, Marc Teyssier, & Daphne Koller: Word-sense disambiguation for machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 771-778. [PDF, 149KB]

StreamSage Inc.

(2005) Hemali Majithia, Philip Rennart & Evelyne Tzoukermann: Rapid ramp-up for statistical machine translation: minimal training for maximal coverage . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.438-444. [PDF, 160KB]

Systran Software Inc. [see also France: Systran SA]

(2009) Jin Yang, Satoshi Enoue, Jean Senellart, & Tristan Croiset: SYSTRAN Chinese-English and English-Chinese hybrid machine translation systems. CWMT 2009: the 5th China Workshop on Machine Translation, Nanjing, China, October 16-17, 2009; 8pp. [PDF, 158KB]

(2007) John S.White & Florence Reeder: The Chinese room experiment: the self-organizing feng shui of MT. Workshop at MT Summit XI, 11 September 2007, Copenhagen, Denmark; abstract, 1p. [PDF, 13KB]

(2006) Jean Senellart & John S. White: First strategies for integrating hybrid approaches into established systems (contribution to panel on “Hybrid machine translation”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 68KB]

Technical Support Working Group

(2008) Kathleen Egan, Francis Kubala, & Allen Sears: User-centered development and implementation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.354-363. [PDF of PPT presentation, 1017KB]

TeleNav Inc.

(2008) Farzad Ehsani, Jim Kimzey, Elaine Zuber, Demitrios Master, & Karen Sudre: Speech to speech translation for nurse patient interaction. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.54-59. [PDF, 92KB]

Tennessee, University of

(2005) Daniel J. Walker: The Open A.I. KitTM: general machine learning modules from statistical machine translation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.1-6. [PDF, 162KB]

Texas University

(2007) Yuk Wah Wong & Raymond J.Mooney: Generation by inverting a semantic parser that uses statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.172-179 [PDF, 192KB]

(2006) Tod Allman & Stephen Beale: A natural language generator for minority languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.39-46. [PDF, 301KB]

(2006) Marian Olteanu, Chris Davis, Ionut Volosen, & Dan Moldovan: Phramer – an open source statistical pharse-based translator.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 146-149 [PDF, 125KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

(2006) Yuk Wah Wong & Raymond J. Mooney: Learning for semantic parsing with statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 439-446 [PDF, 205KB]

(2005) Stephen Beale, Sergei Nirenburg, Marjorie McShane, & Tod Allman: Document authoring the Bible for minority language translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.63-70. [PDF, 415KB]

(2005) Jonas Kuhn: Parsing word-aligned parallel corpora in a grammar induction context. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 17-25. [PDF, 150KB]

Texas University (Dallas)

(2008) Thamar Solorio & Yang Liu: Part-of-speech tagging for English-Spanish code-switched text. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1051-1060. [PDF, 129KB]

TextWise LLC

(2006) Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh, & Páraic Sheridan: Design, construction and validation of an Arabic-English conceptual interlingua for cross-lingual information retrieval. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.107-112 [PDF, 408KB]

Thomson Legal and Regulatory Research and Development Group

(2005) Isabelle Moulinier & Ken Williams: Thomson Legal and Regulatory at NTCIR-5: Japanese and Korean experiments.  Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 5pp. [PDF, 309KB]

Transclick Inc.

(2008) Robert Levin: Real-time translation of IM chat. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.402-411. [PDF of PPT presentation, 689KB]

(2005) Leslie Barrett & Robert Levin: Usability considerations for a cellular-based text translator. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.471-475. [PDF, 235KB]

Translation Optimization Partners

(2009) Mike Dillinger: An introduction to machine translation. [Tutorial at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26, 2009, Ottawa, Ontario, Canada; 20pp. [PDF of PPT presentation, 360KB]

(2009) Mike Dillinger: [Introduction to panel]: Converging technologies – what are the benefits for MT users?  MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26, 2009, Ottawa, Ontario, Canada; 16pp. [PDF of PPT presentation, 1393KB]

Tufts University

(2009) David Bamman & Gregory Crane: Automatically building bilingual dictionaries for Greek and Latin.  EAMT-2009 Workshop Machine Translation for Historical Languages, 13 May 2009, Barcelona; slides [PDF of PPT, 211KB]

US Air Force Research Laboratories see Air Force Research Laboratories

US Army [see also Army Research Laboratories]

(2009) Vaughn Laganosky: United States Army machine foreign language translation (MFLT) requirements and capability. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 521-526. [PDF of PPT presentation, 4909KB]

US Army Intelligence Center see Army Intelligence Center

US Patent and Trademark Office

(2009) Arti Shah: Translation of patent documents at the United States Patent and Trademark Office. [Panel on] “Real world challenges of patent translation”, MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; 10pp. [PDF of PPT presentation, 303KB]

Utah, University of

(2008) Ulf Hermjakob, Kevin Knight, & Hal Daumé III: Name translation in statistical machine translation: learning when to transliterate. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 389-397. [PDF, 107KB]

(2005) Siddharth Patwardhan, Satanjeev Banerjee & Ted Pedersen: SenseRelate::TargetWord – a generalized framework for word sense disambiguation. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 73-76. [PDF, 54KB]

Vivisimo Inc.

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2006) Rebecca Hwa, Carol Nichols & Khalil Sima’an: Corpus variations for translation lexicon induction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.74-81 [PDF, 178KB]

Washington, University of (Seattle)

(2009) Andrei Alexandrescu & Katrin Kirchhoff: Graph-based learning for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.119-127. [PDF, 148KB]

(2009) Janara Christensen, Mausam & Oren Etzioni: A rose is a roos is a ruusu: querying translations for web image search. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.193-196. [PDF, 101KB]

(2009) Jeremy G.Kahn, Matthew Snover, & Mari Ostendorf: Expected dependency pair match: predicting translation quality with expected syntactic structure [abstract]. Machine Translation 23 (2/3), September 2009; pp.169-179.

(2009) Mausam, Stephen Soderland, Oren Etzioni, Daniel S.Weld, Michael Skinner & Jeff Bilmes: Compiling a massive, multilingual dictionary via probabilistic inference. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.262-270. [PDF, 704KB}

(2009) Stephen Soderland, Christopher Lim, Mausam, Bo Qin, Oren Etzioni, & Jonathan Pool: Lemmatic machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.128-135. [PDF, 214KB]

(2009) Mei Yang, Amittai Axelrod, Kevin Duh, & Katrin Kirchhoff: The University of Washington machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 124-128. [PDF, 264KB]; poster [PDF, 1111KB]

(2009) Mei Yang & Jing Zheng: Toward smaller, faster, and better hierarchical phrase-based SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.237-240. [PDF, 140KB]

(2008) Amittai Axelrod, Mei Yang, Kevin Duh, & Katrin Kirchhoff: The University of Washington machine translation system for ACL WMT 2008. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.123-126. [PDF, 65KB]

(2008) Kevin Duh: Ranking vs. regression in machine translation evaluation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.191-194. [PDF, 91KB]

(2008) Kevin Duh & Katrin Kirchhoff: Beyond log-linear models: boosted minimum error rate training for n-best re-ranking. ACL 2008 HLT Short Papers, June 2008, Columbus, Ohio; pp.37-40. [PDF, 102KB]

(2008) Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, & Robert Moore: Indirect HMM-based hypothesis alignment for combining outputs from machine translation systems.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.98-107. [PDF, 280KB]

(2008) Achim Ruopp & Fei Xia: Finding parallel texts on the web using cross-language information retrieval.  IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.18-25. [PDF, 239KB]

(2007) Oren Etzioni, Kobi Reiter, Stephen Soderland, & Marcus Sammer: Lexical translation with application to image searching on the web. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.175-182 [PDF, 603KB]

(2007) Karim Filali & Jeff Bilmes: Generalized graphical abstractions for statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.33-36 [PDF, 155KB]

(2007) Katrin Kirchhoff, Owen Rambow, Nizar Habash, & Mona Diab: Semi-automatic error analysis for large-scale statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.289-296 [PDF, 192KB]

(2007) Katrin Kirchhoff & Mei Yang: The University of Washington machine translation system for the IWSLT 2007 competition.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 106KB]; presentation [PDF, 505KB]

(2007) Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, & Hermann Ney: Improving speech translation with automatic boundary prediction. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2449-2452; abstract [PDF, 61KB]

(2007) Marcus Sammer & Stephen Soderland: Building a sense-distinguished multilingual lexicon from monolingual corpora and bilingual lexicons. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.399-406 [PDF, 263KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Katrin Kirchhoff, Kevin Duh, & Chris Lim: The University of Washington machine translation system for IWSLT 2006. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 145-152  [PDF, 168KB]

(2006) Jonathan Pool: Can controlled languages scale to the Web? CLAW 2006: 5th International Workshop on Controlled Language Applications, Cambridge, MA, USA, August 12, 2006; 12pp. [PDF, 136KB]

(2006) Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff & Oren Etzioni: Ambiguity reduction for machine translation: human-computer collaboration.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.193-202 [PDF, 247KB]

(2006) Mei Yang & Katrin Kirchhoff: Phrase-based backoff models for machine translation of highly inflected languages. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.41-48 [PDF, 209KB]

(2005) Katrin Kirchhoff & Mei Yang: Improved language modeling for statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 125-128. [PDF, 92KB]

West Virginia University

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.24-31 [PDF, 251KB]

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Accultran: automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.50-52 [PDF, 181KB]

Williams College

(2007) Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondřej Bojar, Alexandra Constantin, Evan Herbst: Moses: open source toolkit for statistical machine translation. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 177-180 [PDF, 119KB]

Wisconsin, Medical College of

(2007) Pierrette Bouillon, Glenn Flores, Marianne Starlander, Nikos Chatzichrisafis, Marianne Santaholma, Nikos Tsourakis, Manny Rayner, & Beth Ann Hockey: A bidirectional grammar-based medical speech translator. ACL 2007: Proceedings of the Workshop on Grammar-based approaches to spoken language processing (SPEECHGRAM 2007), June 29, 2007, Prague, Czech Republic; pp.41-48. [PDF, 216KB]

Xerox Corporation: Palo Alto Research Center

(2006) Martin Kay: A life of language. Computational Linguistics 31 (4), pp. 425-438. [PDF, 97KB]

(2006) Tracy Holloway King: [review of] Hristo Georgiev, English algorithmic grammar. Machine Translation 20 (3),2006; pp.217-220. [see publication]

(2006) Stefan Riezler & John T. Maxwell III: Grammatical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 248-255 [PDF, 299KB]

(2005) Martin Kay: A life of language. [Paper given in acknowledgement of the Life-time Achievement Award at the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, Michigan, 27th June, 2005]; 13pp. [PDF, 1232KB]

(2005) Stefan Riezler & John T.Maxwell III: On some pitfalls in automatic evaluation and significance testing for MT. ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 57-64. [PDF, 115KB]

Yahoo Research

(2009) Hakan Ceylan & Yookyung Kim: Language identification of search engine queries. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.1066-1074. [PDF, 239KB]

(2009) Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, & Christopher D.Manning: Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59. [PDF, 558KB]

(2008) Niyu Ge, Abe Ittycheriah, & Kishinore Papineni: Multiple reorderings in phrase-based machine translation. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.61-68. [PDF, 1-6KB]

(2007) Fei Huang & Kishore Papineni: Hierarchical system combination for machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 277-286. [PDF, 158KB]

(2007) Christoph Tillmann & Tong Zhang: A block bigram prediction model for statistical machine translation. ACM Transactions on Speech and Language Processing 4 (3), art.6. [abstract, PDF, 12KB]

(2006) Christoph Tillmann & Tong Zhang: A discriminative global training algorithm for statistical MT. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.721-728. [PDF, 179KB]

Yale University

(2005) Michael S.Blekhman, Olga Bezhanova, & Marina Byezhanova: Comparative analysis of the translation quality produced by three MT systems. International Journal of Translation 17 (1-2), Jan-Dec 2005; pp.39-85. [PDF, 236KB]

 

Vietnam

Lac-Viet Computing Group

(2009) Vinh Van Nguyen, Akira Shimazu, Minh Le Nguyen & Thai Phuong Nguyen: Improving a lexicalized hierarchical reordering model using maximum entropy. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 89-96. [PDF, 172KB]

MICA Center

(2009) Thi-Ngoc-Diep Do, Viet-Bac Le, Brigitte Bigi, Laurent Besacier, & Eric Castelli: Mining a comparable text corpus for a Vietnamese-French statistical machine translation system.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.165-172. [PDF, 152KB]

Vietnam National University

(2009) Vinh Van Nguyen, Akira Shimazu, Minh Le Nguyen & Thai Phuong Nguyen: Improving a lexicalized hierarchical reordering model using maximum entropy. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 89-96. [PDF, 172KB]

(2008) Thai Phuong Nguyen, Akira Shimazu, Tu-Bao Ho, Minh Le Nguyen, & Vinh Van Nguyen: A tree-to-string phrase-based model for statistical machine translation. CoNLL: proceedings of the  Twelfth Conference on Computational Natural Language Learning, 16-17 August 2008, Manchester, UK; pp.143-150. [PDF, 392KB]