Machine Translation Archive

Index of Organizations and Affiliations

for publications 2010-2014

arranged by country, part 1: A-F

click here for part 2 (G-M), part 3 (N-UK), part 4 (US-Z)

Click for publications 2005-2009, 2000-2004; 1990-1999; 1970-1989; before 1970

 [to return to home page click here]

 

International

International Writers’ Group

(2012) Jost Zetzsche: Using MT in today’s CAT tools. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; [abstract, 2pp. [PDF, 744KB]

(2010) Jost Zetzsche: Crowdsourcing and the professional translator. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 1p. [PDF, 54KB]

United Nations Organization

(2013) Bruno Pouliquen, Cecilia Elizalde, Marcin Junczys-Dowmunt, Christophe Mazenc, & José García-Verdugo: Large-scale multiple language translation accelerator at the United Nations. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.345-352. [PDF, 566KB]

(2012) Yu Chen & Andreas Eisele: MultiUN v2: UN documents with multilingual alignments.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2500-2504. [PDF, 405KB]

(2012) Cecilia Elizalde, Bruno Pouliquen, Christophe Mazenc & José García-Verdugo: TAPTA4UN: collaboration on machine translation between the World Intellectual Property Organization and the United Nations. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 168KB]

(2012) Bruno Pouliquen, Christophe Mazenc, Cecilia Elizalde, & Jose Garcia-Verdugo: Statistical machine translation prototype using UN parallel documents. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.12-19. [PDF, 251KB]

(2010) Andreas Eisele & Yu Chen: MultiUN: a multilingual corpus from United Nation documents. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2868-2872. [PDF, 333KB]

(2010) Li Zuo: Machine translation from English to Chinese: a study of Google’s performance with the UN documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

World Intellectual Property Organization

(2015) Bruno Pouliquen: Full-text patent translation at WIPO; scalability, quality and usability. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Sixth Workshop on Patent and Scientific Literature Translation (PSLT6); pp.1-8. [PDF, 800KB]

(2014) Marcin Junczys-Dowmunt & Bruno Pouliquen: SMT of German patents at WIPO: decompounding and verb structure pre-reordering.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.217-220. [PDF, 369KB]

(2013) Bruno Pouliquen, Cecilia Elizalde, Marcin Junczys-Dowmunt, Christophe Mazenc, & José García-Verdugo: Large-scale multiple language translation accelerator at the United Nations. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.345-352. [PDF, 566KB]

(2012) Cecilia Elizalde, Bruno Pouliquen, Christophe Mazenc & José García-Verdugo: TAPTA4UN: collaboration on machine translation between the World Intellectual Property Organization and the United Nations. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 168KB]

(2012) Bruno Pouliquen, Christophe Mazenc, Cecilia Elizalde, & Jose Garcia-Verdugo: Statistical machine translation prototype using UN parallel documents. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.12-19. [PDF, 251KB]

(2011) Bruno Pouliquen & Christophe Mazenc: Automatic translation tools at WIPO. Translating and the Computer 33, 17-18 November 2011, London; 6pp. [PDF, 407KB]

(2011) Bruno Pouliquen & Christophe Mazenc: COPPA, CLIR and TAPTA: three tools to assist in overcoming the patent barrier at WIPO. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.24-30. [PDF, 896KB]

(2011) Bruno Pouliquen, Christophe Mazenc & Aldo Iorio: Tapta: a user-driven translation system for patent documents based on domain-aware statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.5-12. [PDF, 342KB]; presentation, 15 slides [PDF, 1642KB]

World Trade Organization

(2011) Karim Benzineb & Jacques Guyot: Building blocks to integrate a Moses MT engine into the production translation work-flow with reference to the World Trade Organization and other clients.  Translating and the Computer 33, 17-18 November 2011, London; 11pp. [PDF, 194KB]

(2010) Olivier Pasteur: Challenges at the World Trade Organization: evaluation and implementation of a statistical machine translation system. Translating and the Computer 32, 18-19 November 2010, London; 7pp. [PDF, 419KB]

World Wide Web Consortium

(2013) Felix Sasaki: Metadata for the multilingual web: introducing internationalization tag set (ITS) 2.0. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.423. [PDF, 205KB]

(2011) Richard Ishida: The multilingual web: latest developments at the W3C/IETF. Translating and the Computer 33, 17-18 November 2011, London; presentation 55 slides. [PDF of PPT, 4551KB]

 

Algeria

Institut Supérieur Arabe de Traduction

(2012) Houda Saadane, Ouafa Benterki, Nasredine Semmar, & Christian Fluhr: Using Arabic transliteration to improve word alignment from French-Arabic parallel corpora. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.38-46. [PDF,

 

Argentina

Córdoba, Universidad Nacional de

(2011) Manny Rayner, Paula Estrella, & Pierrette Bouillon: Bootstrapping a statistical speech translator from a rule-based one. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.21-28. [PDF, 199KB]

(2011) Marianne Starlander & Paula Estrella: Looking for the best evaluation method for interlingua-based spoken language translation in the medical domain. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.81-92. [PDF, 905KB]

(2010) Manny Rayner, Paula Estrella, & Pierrette Bouillon: A bootstrapped interlingua-based SMT architecture. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 611KB]; presentation: 30 slides [PDF, 737KB]

Ocean Translations S.R.L.

(2010) Luciana Cecilia Ramos: Post-editing free machine translation: from a language vendor’s perspective. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 37KB]

Universidad Nacional de Córdoba

(2012) Julio Castillo & Paula Estrella: Semantic textual similarity for MT evaluation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.52-58. [PDF, 278KB]

Universidad Tecnológica Nacional – Facultad Regional Córdoba

(2012) Julio Castillo & Paula Estrella: Semantic textual similarity for MT evaluation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.52-58. [PDF, 278KB]

 

Australia

Macquarie University

(2012) Bevan Jones, Jacob Andreas, Daniel Bauer, Karl Moritz Hermann, & Kevin Knight: Semantics-based machine translation with hyperedge replacement grammars. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1359-1376. [PDF, 169KB]

(2011) Susan Howlett & Mark Dras: Clause restructuring for SMT not absolutely helpful . ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.384-388. [PDF, 91KB]

(2011) François Lareau, Mark Dras, Benjamin Börschinger, & Robert Dale: Collocations in multilingual natural language generation: lexical functions meet lexical functional grammar. [ALTA 2011] Proceedings of Australasian Language Technology Association Workshop, Canberra, Australia, 1-2 December 2011; pp.95-104. [PDF, 302KB]

(2010) Susan Howlett & Mark Dras: Dual-path phrase-based statistical machine translation. Australasian Language Technology Workshop 2010 (ALTA Workshop 2010), 9-10 December 2008, Melbourne, Australia; pp.32-40. [PDF, 633KB]

Melbourne, University of

(2015) Oliver Adams, Graham Neubig, Trevor Cohn & Steven Bird: Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.248-255. [PDF, 2.8MB]

(2014) Yvette Graham, Nikita Mathur, & Timothy Baldwin: Randomized significance tests in machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.266-274. [PDF, 358KB]

(2014) Xingyi Song, Lucia Specia, & Trevor Cohn: Data selection for discriminative training in statistical machine translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.45-52. [PDF, 415KB]

(2013) Yvette Graham: A dependency-constrained hierarchical model with Moses. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.464-471. [PDF, 341KB]

 (2013) Yvette Graham, Bahar Salehi, & Timothy Baldwin: Umelb: cross-lingual textual entailment with word alignment and string similarity features. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.133-137. [PDF, 91KB]

(2012) Steven Bird & David Chiang: Machine translation for language preservation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.125-134. [PDF, 924KB]

 (2012) James Breen, Timothy Baldwin, & Francis Bond: Segmentation and translation of Japanese multi-word loanwords. ALTA 2012: Proceedings of Australasian Language Technology Association Workshop, 4-6 December 2012, Otago University, Dunedin, New Zealand; pp. 61-69. [PDF, 249KB]

 (2012) Yvette Graham, Timothy Baldwin, Aaron Harwood, Alistair Moffat, & Justin Zobel: Measurement of progress in machine translation. ALTA 2012: Proceedings of Australasian Language Technology Association Workshop, 4-6 December 2012, Otago University, Dunedin, New Zealand; pp. 70-78. [PDF, 576KB]

 (2012) Marco Lui & Timothy Baldwin: langid.py: an off-the-shelf language identification tool. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.25-30. [PDF, 91KB]

(2011) Steven Abney & Steven Bird: Towards a data model for the universal corpus. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.120-127. [PDF, 118KB]

(2011) Sumukh Ghodke, Steven Bird, & Rui Zhang: A breadth-first representation for tree matching in large scale forest-based translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.785-793. [PDF, 941KB]

(2010) Timothy Baldwin, Jonathan Pool, & Susan M.Colowick: PanLex and LEXTRACT: translating all words of all languages of the world. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Demonstrations volume; pp.37-40. [PDF, 309KB]

Monash University

(2013) Trevor Cohn & Gholamreza Haffari: An infinite hierarchical Bayesian model of phrasal translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.780-790. [PDF, 432KB]

(2013) Majid Razmara, Maryam Siahbani, Gholamreza Haffari, & Anoop Sarkar: Graph propagation for paraphrasing out-of-vocabulary words in statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1105-1115. [PDF, 325KB]

(2013) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Scalable variational inference for extracting hierarchical phrase-based translation rules. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.438-446. [PDF, 310KB]

(2012) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Compact rule extraction for hierarchical phrase-based translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 342KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

Queensland University of Technology

(2010) Ling-Xiang Tang, Shlomo Geva, Andrew Trotman, & Yue Xu: A voting mechanism for named entity translation in English-Chinese question answering. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.43-51. [PDF, 554KB]

Western Sydney University

(2011) Luciana M.Scocco: Machine translation aid to the test: a study on the benefits of post-editing for translation into the weaker language. Dissertation submitted as part requirement for Master of Arts Translation and Interpreting Studies, University of Western Sydney, 4th November 2011; 14pp. [PDF, 170KB]

(2010) Ignacio Garcia: Does Google know better? Translators and machine translation. Translating and the Computer 32, 18-19 November 2010, London; presentation, 16 slides [PDF of PPT, 473KB]

(2010) Ignacio Garcia: Translation training 2010: forward thinking, work ready [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 275-282.

 

Austria

Innsbruck, Universität

(2013) Christoph Schmidt, Oscar Koller, Hermann Ney, Thomas Hoyoux, & Justus Piater: Using viseme recognition to improve a sign language translation system. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 727KB]

Kaleidoscope GmbH

(2015) Klaus Fleischmann: From occasional quality control to collaborative quality assurance. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp .24-26. [PDF, 141KB]

(2013) Klaus Fleischmann: Collaborative terminology in the age of social web 2.0. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 4pp. [PDF, 246KB]; presentation, 31 slides [PDF of PPT, 3384KB]

Wien, Universität

(2012) Gerhard Budin: Terminological ontologies in multi-lingual cross-domain communities of practice [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; p.19. [PDF]

(2012) Quoc Hung-Ngo & Werner Winiwarter: A visualizing annotation tool for semi-automatically building a bilingual corpus.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.67-74. [PDF, 820KB]

(2012) Vesna Lušicky & Tanja Wissik: Terminology: don’t only collect it, use it!  [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 9pp. [PDF, 1257KB], presentation: 27 slides [PDF, 2179KB]

 

Belgium

Antwerp, University

(2011) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: Towards English-Swahili machine translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23-27 January, 2011; 2pp. [PDF, 77KB]

(2010) Guy De Pauw, Naomi Maajabu, & Peter Waiganjo Wagacha: A knowledge-light approach to Luo machine translation and part-of-speech tagging. [LREC 2010]: Proceedings of the Second Workshop on African Language Technology, AFLAT 2010, Valetta, Malta; pp.15-20. [PDF, 141KB]

Brussel, Vrije Universiteit

(2014) Koen Kerremans: Representing intra- and interlingual terminological variation in a new type of translation resource: a prototype proposal. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.45-50. [PDF, 377KB]

Cross Language NV

(2014) Falko Schaefer, Joeri van de Walle, & Joachim van den Bogaert: Moses SMTas an aid to translators in the production process.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.89-92. [PDF, 425KB]

(2012) John Tinsley, Alexandru Ceausu, Jian Zhang, Heidi Depraetere, & Joeri Van de Walle: IPTranslator: facilitating patent search with machine translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 419KB]

(2011) Heidi Depraetere, Joachim Van den Bogaert, & Joeri Van de Walle: Bologna translation service: online translation of course syllabi and study programmes in English. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.29-34. [PDF, 164KB]

(2010) Heidi Depraetere & Pablo Vazquez: Machine translation engine selection in the enterprise. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 4pp. [PDF, 480KB]; presentation: 14 slides [PDF, 1631KB]

Crossminder

(2010) Andrés Domínguez: Crossminder. Contribution to session “Language Technology in Practice: Needs – Problems – Wishes”. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 2 slides [PDF of PPT]

Ghent University

(2015) Joke Daems, Sonia Vandepitte, Robert Hartsuker, & Lieve Macken: The impact of machine translation error types on post-editing effort indicators. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Fourth Workshop on Post-editing Technology and Practice (WPTP 4); p.31-45. [PDF, 658KB]

(2014) Joke Daems, Lieve Macken, & Sonia Vandepitte: On the origin of errors: a fine-grained analysis of MT and PE errors and their relationship. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.62-68. [PDF, 120KB]

(2013) Joke Daems, Lieve Macken, & Sonia Vandepitte: Quality as the sum of its parts: a two-step approach for the identification of translation problems and translation quality assessment for HT and MT+PE.  Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 63-71. [PDF, 209KB]

(2013) Els Lefever & Véronique Hoste: SemEval-2013 Task 10: cross-lingual word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.158-166. [PDF, 307KB]

(2012) Els Lefever, Véronique Hoste, & Martine De Cock: Discovering missing Wikipedia inter-language links by means of cross-lingual word sense disambiguation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.841-846. [PDF, 345KB]

(2011) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: Towards English-Swahili machine translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23-27 January, 2011; 2pp. [PDF, 77KB]

(2011) Els Lefever & Véronique Hoste: An evaluation and possible improvement path for current SMT behavior on ambiguous nouns. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.52-60. [PDF, 164KB]

(2011) Els Lefever, Véronique Hoste, & Martine De Cock: ParaSense or how to use parallel corpora for word sense disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.317-322. [PDF, 119KB]

(2010) Els Lefever & Véronique Hoste: Construction of a benchmark data set for cross-lingual word sense disambiguation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1584-1590. [PDF, 465KB]

(2010) Lieve Macken: An annotation scheme and gold standard for Dutch-English word alignment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3369-3374. [PDF, 379KB]

(2010) Gudrun Rawoens: Multilingual corpora in cross-lingusitic research: focus on the compilation of a Dutch-Swedish parallel corpus. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1287-1294. [PDF, 417KB]

Ghent University College

(2013) Els Lefever & Véronique Hoste: SemEval-2013 Task 10: cross-lingual word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.158-166. [PDF, 307KB]

(2012) Els Lefever, Véronique Hoste, & Martine De Cock: Discovering missing Wikipedia inter-language links by means of cross-lingual word sense disambiguation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.841-846. [PDF, 345KB]

(2011) Els Lefever & Véronique Hoste: An evaluation and possible improvement path for current SMT behavior on ambiguous nouns. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.52-60. [PDF, 164KB]

(2011) Els Lefever, Véronique Hoste, & Martine De Cock: ParaSense or how to use parallel corpora for word sense disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.317-322. [PDF, 119KB]

Hasselt University

(2015) Jan Van den Bergh, Mieke Haesen, Eva Geurts, Donald Degraen, Karin Coninx, & Iulianna van der Lek-Ciudin: Recommendations for translation environments to improve translators’ workflows. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.106-119. [PDF, 213KB]

Lessius University College (Antwerp)

(2010) Tom Vanallemeersch: Belgisch Staatsblad corpus: retrieving French-Dutch sentences from official documents. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3413-3416. [PDF, 273KB]

Leuven, University

(2015) Jan Van den Bergh, Mieke Haesen, Eva Geurts, Donald Degraen, Karin Coninx, & Iulianna van der Lek-Ciudin: Recommendations for translation environments to improve translators’ workflows. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.106-119. [PDF, 213KB]

(2014) Carlos S.C. Teixeira: Perceived vs. measured performance in the post-editing of suggestions from machine translation and translation memories. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.45-59. [PDF, 168KB]

(2014) Tom Vanallemeersch & Vincent Vandeghinste: Improving fuzzy matching through syntactic knowledge. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.90-99. [PDF, 430KB]

(2013) Ivan Vulić & Marie-Francine Moens: A study on bootstrapping bilingual vector spaces from non-parallel data (and nothing else). [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 261KB]

(2012) Gideon Kotzé, Vincent Vandeghinste, Scott Martens, & Jörg Tiedemann: Large aligned treebanks for syntax-based machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.467-473. [PDF, 498KB]

(2012) Ivan Vulić & Marie-Francine Moens: Detecting highly confident word translations from comparable corpora without any prior knowledge. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 449-459. [PDF, 339KB]

(2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]

(2011) Ivan Vulić, Wim De Smet, & Marie-Francine Moens: Identifying word translations from comparable corpora using latent topic models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.479-484. [PDF, 153KB]

(2010) Ineke Schuurman & Vincent Vandeghinste: Cultural aspects of spatiotemporal analysis in multilingual applications. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3308-3313. [PDF, 318KB]

(2010) Tom Vanallemeersch: Belgisch Staatsblad corpus: retrieving French-Dutch sentences from official documents. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3413-3416. [PDF, 273KB]

(2010) Vincent Vandeghinste & Scott Martens: Bottom-up transfer in example-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 943KB]; presentation: 26 slides [PDF, 771KB]

Liege, Université

(2010) Philippe Dreuw, Jens Forster, Yannick Gweth, Daniel Stein, Hermann Ney, Gregorio Martinez, Jaume Verges Llahi, Onno Crasborn, Ellen Ormel, Wei Du, Thomas Hoyoux, Justus Piater, Jose Miguel Moya, & Mark Wheatley: SignSpeak – understanding, recognition, and translation of sign languages. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.65-72. [PDF, 6730KB]

Louvain, University

(2010) Bruno Cartoni & Marie-Aude Lefer: The MuLeXFoR database: representing word-formation processes in a multilingual lexicographic environment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.840-843. [PDF, 284KB]

Supreme Headquarters Allied Powers Europe (NATO)

(2011) Ian Jones: The NATO terminology programme and NATOTerm. Translating and the Computer 33, 17-18 November 2011, London; 22pp. [PDF, 1159KB]

 

Brazil

Minas Gerais, Universidade Federal de

(2014) Sheila Castilho, Sharon O’Brien, Fabio Alves, & Morgan O’Brien: Does post-editing increase usability? A study with Brazilian Portuguese as target language. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.183-190. [PDF, 754KB]

(2011) Fabio Alves & Daniel Couto Vale: On drafting and revision in translation: a corpus linguistics oriented analysis of translation process data. Translation: Computation, Corpora,  Cognition 1 (1), December 2011; pp.105-122. [PDF, 1117KB]

(2011) Fabio Alves, Adrianna Pagano, & Igor L. da Silva: Modeling (un)packing of meaning in translation: insights from effortful text production. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.153-163. [PDF, 1064KB]

Pontificia Universidade Católica do Rio Grande del Sul

(2012) Roger Granada, Lucelene Lopes, Carlos Ramisch, Cassia Trojahn, Renata Vieira, & Aline Villavicencio: A comparable corpus based on aligned multilingual ontologies. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.25-31. [PDF, 108KB]

Rio Grande del Sul, Universidade do

(2014) B.R.Laranjeira, V.P.Moreira, A.Villavicencio, C.Ramisch, & M.J.Finatto: Comparing the quality of focused crawlers and of the translation resources obtained from them. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3572-3578. [PDF, 803KB]

(2012) Roger Granada, Lucelene Lopes, Carlos Ramisch, Cassia Trojahn, Renata Vieira, & Aline Villavicencio: A comparable corpus based on aligned multilingual ontologies. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.25-31. [PDF, 108KB]

São Carlos, Universidade de

(2011) Daniel Emilio Beck: Syntax-based statistical machine translation using tree automata and tree transducers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.36-40. [PDF, 124KB]

São Paulo, Universidade de

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Helena de Medeiros Caseli, Bruno Akio Sugiyama, & Junia Coutinho Anacleto: Using common sense to generate culturally contextualized machine translation. Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, Los Angeles, CA, June 2010; pp.24-31. [PDF, 353KB]

(2010) Lucia Silva: Fine-tuning in Brazilian Portuguese-English statistical transfer machine translation: verbal tenses.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Student Research Workshop… June 2, 2010, Los Angeles, California; pp.58-63. [PDF, 251KB]

Universidade Estadual Paulista

(2010) Paula Paiva: Corpus representativeness in the selection of medical terms to be used in translation memory tools [abstract]. UCCTS 2010: Using Corpora in Contrastive and Translation Studies, Edge Hill University, UK, 27-29 July 2010; p.15. [PDF, 91KB]

 

Bulgaria

BTB Group

(2010) Diana Santos, Luís Miguel Cabral, Corina Forascu, Pamela Forner, Frederic Gey, Katrin Lamm, Thomas Mandl, Petya Osenova, Anselmo Peñas, Álvaro Rodrigo, Julia Schulz, Yvonne Skalban, & Erik Tjong Kim Sang: GikiCLEF: crosscultural issues in multilingual information access. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2346-2353. [PDF, 408KB]

Bulgarian Academy of Sciences

(2012) Galia Angelova & Ivelina Nikolova: Cross-lingual extraction of concepts and relations in the medical domain [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.34-35. [PDF]

(2012) Galia Angelova & Svetla Boytcheva: Multilingual aspects of information extraction from medical texts in Bulgarian [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.15-16. [PDF]

(2012) Georgi Iliev & Angel Genov: Expanding parallel resources for medium-density languages for free.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3937-3943. [PDF, 321KB]

(2012) Svetla Koeva: ATLAS – human language technologies integrated within a multilingual web content management system. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.72-76. [PDF, 331KB]

(2012) Svetla Koeva, Borislav Rizov, Ivelina Stoyanova, Svetlozara Leseva, Rositsa Dekova, Angel Genov, Ekaterina Tarpomanova, Tsvetana Dimitrova, & Hristina Kukova: Application of clause alignment for statistical machine translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.102-110. [PDF, 135KB]

(2012) Rui Wang, Petya Osenova, & Kiril Simov: Linguistically-enriched models for Bulgarian-to-English machine translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.10-19. [PDF, 134KB]

(2012) Rui Wang, Petya Osenova, & Kiril Simov: Linguistically-augmented Bulgarian-to-English statistical machine translation model. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.119-128. [PDF, 163KB]

(2011) Kiril Simov, Petya Osenova, Laska Laskova, Aleksandar Savkov, & Stanislava Kancheva: Bulgarian-English parallel treebank: word and semantic level alignment. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.29-38. [PDF, 396KB]

Institute for Bulgarian Language

(2012) Anelia Belogay, Diman Karagyozov, Svetla Koeva, Cristina Vertan, Adam Przepiórkowski, Polivios Raxis, & Dan Cristea: Harnessing NLP technologies in the processes of multilingual content management.  [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 6-10. [PDF, 367KB]

(2012) Velislava Stoykova: Bulgarian inflectional morphology in Universal Networking Language. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 423-429. [PDF, 145KB]

(2011) Svetla Koeva: Furthering natural language processing in Bulgaria. META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 40 slides [PDF of PPT, 3276KB]

New Bulgarian University

(2011) Maria Stambolieva: Parallel corpora in aspectual studies of non-aspect languages. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.31-42. [PDF, 233KB]

Ontotext AD

(2013) Meritxell Gonzàlez, Maria Mateva, Ramona Enache, Cristina España, Lluís Màrquez, Borislav Popov, & Aarne Ranta: MT techniques in a retrieval system of semantically enriched patents. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.273-280. [PDF, 908KB]

Tetracom Interactive Solutions Ltd.

(2012) Anelia Belogay, Diman Karagyozov, Svetla Koeva, Cristina Vertan, Adam Przepiórkowski, Polivios Raxis, & Dan Cristea: Harnessing NLP technologies in the processes of multilingual content management.  [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 6-10. [PDF, 367KB]

 

Canada

Alberta, University of

(2013) Mohammad Salameh, Colin Cherry, & Grzegorz Kondrak: Reversing morphological tokenization in English-to-Arabic SMT. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Student Research Workshop, 13 June 2013, Atlanta, Georgia; pp.47-53. [PDF, 161KB]

(2012) Grzegorz Kondrak & Mohammad Salameh: Transliteration experiments on Chinese and Arabic. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.71-75. [PDF, 73KB]

(2011) Aditya Bhargava & Grzegorz Kondrak: How do you pronounce your name? Improving G2P with transliterations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.399-408. [PDF, 271KB]

 (2011) Aditya Bhargava, Bradley Hauer, & Grzegorz Kondrak: Leveraging transliterations from multiple languages. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.36-40. [PDF, 111KB]

(2010) Sittichai Jiampojamarn, Kenneth Dwyer, Shane Bergsma, Aditya Bhargava, Qing Dou, Mi-Young Kim, & Grzegorz Kondrak: Transliteration generation and mining with limited training resources. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.39-47. [PDF, 143KB]

Centre for Information Technology Innovation see Montréal, Université

Centre de Recherche Informatique de Montréal

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

Institute for Information Technology

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

KeaText

(2011) Alexandre Patry & Philippe Langlais: Going beyond word cooccurrences in global lexicon selection for statistical machine translation using a multilayer perceptron. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.658-666. [PDF, 428KB]

(2011) Alexandre Patry & Philippe Langlais: Identifying parallel documents from a large bilingual collection of texts: application to parallel article extraction in Wikipedia. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.87-95. [PDF, 147KB]

Montréal, Ecole Polytechnique

(2010) Horacio Saggion, Juan-Manuel Torres-Moreno, Iria da Cunha, Eric SanJuan, & Patricia Velázquez-Morales: Multilingual summarization evaluation without human models. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1059-1067. [PDF, 238KB]

(2010) Bin Wei & Christopher Pal: Cross lingual adaptation: an experiment in sentiment classifications. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.258-262. [PDF, 109KB]

Montréal, Université de [see also Recherche Appliquée en Linguistique Informatique]

 (2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2013) Fethi Lamraoui & Philippe Langlais: Yet another fast, robust and open source sentence aligner: time to reconsider sentence alignment? Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.77-84. [PDF, 610KB]

(2013) Philippe Langlais: Mapping source to target strings without alignment by analogical learning: a case study with transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.684-689. [PDF, 221KB]

(2012) Julien Bourdaillet & Philippe Langlais: Identifying infrequent translations by aligning non parallel sentences. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 331KB]

(2011) Brian Harris: ¡Cuéntame cómo pasó! – a memoir of machine translation in Montreal circa 1970. In: María Jesús Blasco Mayor & María Amparo Jimenez Ivars (eds.) Interpreting naturally: a tribute to Brian Harris. (Bern: Peter Lang, 2011); pp.11-32. [note]

(2011) Stéphane Huet & Philippe Langlais: Identifying the translations of idiomatic expressions using TransSearch. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.45-56. [PDF, 1123KB]

(2011) Alexandre Patry & Philippe Langlais: Going beyond word cooccurrences in global lexicon selection for statistical machine translation using a multilayer perceptron. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.658-666. [PDF, 428KB]

(2011) Alexandre Patry & Philippe Langlais: Identifying parallel documents from a large bilingual collection of texts: application to parallel article extraction in Wikipedia. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.87-95. [PDF, 147KB]

(2010) Lucia Specia & Atefeh Farzindar: Estimating machine translation post-editing effort with HTER. JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.33-41. [PDF, 178KB]

(2010) Audrey Laroche & Philippe Langlais: Revisiting context-based projection methods for term-translation spotting in comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.617-625. [PDF, 224KB]

MultiCorpora

(2012) Daniel Gervais: MultiTrans Prism version 5.5. [In: Technology Showcase Guide]. AMTA-2012. Proceedings, San Diego, November 1, 2012; 2pp.

National Research Council Canada (see also Institute for Information Technology)

(2014) Marine Carpuat, Cyril Goutte, & George Foster: Linear mixture models for robust machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.499-509. [PDF, 329KB]

(2014) Boxing Chen & Colin Cherry: A systematic comparison of smoothing techniques for sentence-level BLEU. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.362-367. [PDF, 316KB]

(2014) Boxing Chen, Roland Kuhn, & George Foster: A comparison of mixture and vector space techniques for translation model adaptation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.124-138 [PDF, 521KB]

(2014) Michel Simard: Clean data for training statistical MT: the case of MT contamination. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.69-82. [PDF, 533KB]

(2014) Darlene Stewart, Roland Kuhn, Eric Joanis, & George Foster: Coarse “split and lump” bilingual language models for richer source information in SMT. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.28-41. [PDF, 1190KB]

 (2013) Marine Carpuat: NRC: a machine translation approach to cross-lingual word sense disambiguation (SemEval-2013 Task 10). [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.188-192. [PDF, 100KB]

(2013) Marine Carpuat: A semantic evaluation of machine translation lexical choice. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp.1-10. [PDF, 145KB]

(2013) Boxing Chen, Roland Kuhn, & George Foster: Vector space model for adaptation in statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1285-1293. [PDF, 207KB]

(2013) Colin Cherry: Improved reordering for phrase-based translation using sparse features. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.22-31. [PDF, 187KB]

(2013) George Foster, Boxing Chen, & Roland Kuhn: Simulating discriminative training for linear mixture adaptation in statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.183-190. [PDF, 556KB]

(2013) Eric Joanis, Darlene Stewart, Samuel Larkin & Roland Kuhn: Transferring markup tags in statistical machine translation: a two-stream approach.  Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 73-81. [PDF, 599KB]

(2013) Mohammad Salameh, Colin Cherry, & Grzegorz Kondrak: Reversing morphological tokenization in English-to-Arabic SMT. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Student Research Workshop, 13 June 2013, Atlanta, Georgia; pp.47-53. [PDF, 161KB]

(2013) Michel Simard & George Foster: PEPr: post-edit propagation using phrase-based statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.191-198. [PDF, 465KB]

(2012) Marine Carpuat & Michel Simard: The trouble with SMT consistency. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.442-449. [PDF, 106KB]

(2012) Boxing Chen, Roland Kuhn, & George Foster: Improving AMBER, an MT evaluation metric. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.59-63. [PDF, 117KB]

(2012) Boxing Chen, Roland Kuhn, & Samuel Larkin: PORT: a precision-order-recall MT evaluation metric for tuning. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.930-939. [PDF, 165KB]

(2012) Colin Cherry: Decoding. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 83 slides [PDF of PPT, 413KB]

(2012) Colin Cherry, Robert C.Moore, & Chris Quirk: On hierarchical re-ordering and permutation parsing for phrase-based decoding.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.200-209. [PDF, 284KB]

(2012) Colin Cherry & Barry Haddow: Sparse features in Moses. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 10 slides [PDF of PPT, 55KB]

(2012) Colin Cherry: Word-based models. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 57 slides [PDF of PPT, 226KB]

(2012) Hal Daumé III, Marine Carpuat, Alex Fraser, & Chris Quirk: Domain adaptation in machine translation: findings from the 2012 Johns Hopkins University Summer Workshop. Keynote [abstract]. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 1p. [PDF, 158KB]

(2012) Cyril Goutte, Marine Carpuat, & George Foster: The impact of sentence alignment errors on phrase-based machine translation performance. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 147KB]

(2012) J.Howard Johnson: Conditional significance pruning: discarding more of huge phrase tables. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 241KB]

(2012) Majid Razmara, George Foster, Baskaran Sankaran, & Anoop Sarkar: Mixing multiple translation models in statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.940-949. [PDF, 229KB]

(2012) Michel Simard & Atsushi Fujita: A poor man’s translation memory using machine translation evaluation metrics. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 958KB]

(2012) Michel Simard: Book review: Bitext alignment [by] Jörg Tiedemann. Computational Linguistics 38 (4); pp.799-825. [PDF, 37KB]

(2012) Marco Turchi, Cyril Goutte, & Nello Cristianini: Learning machine translation from in-domain and out-of-domain data. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.305-312. [PDF, 160KB]

(2011) Boxing Chen & Roland Kuhn: AMBER: a modified BLEU, enhanced ranking metric. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.71-77. [PDF, 148KB]

(2011) Boxing Chen, Roland Kuhn, & George Foster: Semantic smoothing and fabrication of phrase pairs for SMT. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.144-150. [PDF, 335KB]

(2011) Boxing Chen, Roland Kuhn, George Foster, & Howard Johnson: Unpacking and transforming feature functions: new ways to smooth phrase tables.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.269-275. [PDF, 158KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Bilingual sense similarity for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.834-843. [PDF, 185KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Fast consensus hypothesis regeneration for machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 11-16. [PDF, 120KB]

(2010) Colin Cherry: Book review: Statistical machine translation, [by] Philipp Koehn. Computational Linguistics 36 (4), pp. 773-776 [PDF, 49KB]

(2010) Alain Désilets: Collaborative translation: technology, crowdsourcing, and the translator perspective. Introduction to workshop at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 2pp. [PDF, 114KB]

(2010) Alain Désilets: WeBiText: multilingual concordancer built from public high quality web content. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) George Foster, Cyril Goutte, & Roland Kuhn: Discriminative instance weighting for domain adaptation in statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.451-459. [PDF, 256KB]

(2010) George Foster, Pierre Isabelle, & Roland Kuhn: Translating structured documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 132KB]

(2010) Roland Kuhn, Boxing Chen, George Foster, & Evan Stratford: Phrase clustering for smoothing TM probabilities – or, how to extract paraphrases from phrase tables. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.608-616. [PDF, 328KB]

(2010) Samuel Larkin, Boxing Chen, George Foster, Ulrich Germann, Eric Joanis, Howard Johnson, & Roland Kuhn: Lessons from NRC’s Portage system at WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 127-132. [PDF, 124KB]

NLP Technologies Inc.

(2014) Fabrizio Gotti, Philippe Langlais, & Atefeh Farzindar: Hashtag occurrences, layout and translation: a corpus-driven analysis of tweets published by the Canadian government.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2254-2261. [PDF, 486KB]

(2014) Fatiha Sadat & Atefeh Farzindar: The ASMAT project – Arabic social media analysis tools. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.130. [PDF, 296KB]

(2012) Atefeh Farzindar & Wael Khreich: Evaluation of domain adaptation techniques for TRANSLI in a real-world environment. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 5pp. [PDF, 136KB]

(2010) Atefeh Farzindar: TRANSLI: trusted automated translation at the service of justice. Translating and the Computer 32, 18-19 November 2010, London; presentation, 19 slides [PDF of PPT, 1192KB]

(2010) Lucia Specia & Atefeh Farzindar: Estimating machine translation post-editing effort with HTER. JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.33-41. [PDF, 178KB]

Onscope Group

(2010) Graham Russell: Moz: translation of structured terminology-rich text. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 157KB]

Ottawa University

(2012) Iustina Ilisei, Diana Inkpen, Gloria Corpas, & Ruslan Mitkov: Romanian translational corpora: building comparable corpora for translation studies.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.56-61. [PDF, 357KB]

(2012) Elizabeth Marshman: In the driver’s seat: perceptions of control as indicators of language professionals’ satisfaction with technologies in the workplace. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 12pp. [PDF, 450KB], presentation: 22 slides [PDF, 372KB]

 (2010) Lynne Bowker & Elizabeth Marshman: Toward a model of active and situated learning in the teaching of computer-aided translation: introducing the CERTT project [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 199-226.

Québec à Montréal, Université de

(2014) Fatiha Sadat & Atefeh Farzindar: The ASMAT project – Arabic social media analysis tools. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.130. [PDF, 296KB]

(2013) Fatiha Sadat: Towards a hybrid rule-based and statistical Arabic-French machine translation system. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.579-583. [PDF, 510KB]

 (2013) Rahma Sellami, Fatiha Sadat, & Lamia Hadrich Belguith: Exploiting multiple resources for Japanese to English patent translation. [MT Summit XIV] Proceedings of the 5th Workshop on Patent Translation, Nice, September 2, 2013; pp.34-39. [PDF, 1300KB]

(2010) Fatiha Sadat: Using co-occurrence tendencies to improve cross-language information retrieval. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1101-1111. [PDF, 1846KB]

(2012) Rahma Sellami, Fatiha Sadat, & Lamia Hadrich Belguith: Exploiting Wikipedia as a knowledge base for the extraction of linguistic resources: application on Arabic-French comparable corpora and bilingual lexicons. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.72-79. [PDF, 889KB]

RALI see Recherche Appliquée en Linguistique Informatique

Recherche Appliquée en Linguistique Informatique, Université de Montréal

(2014) Fabrizio Gotti, Philippe Langlais, & Atefeh Farzindar: Hashtag occurrences, layout and translation: a corpus-driven analysis of tweets published by the Canadian government.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2254-2261. [PDF, 486KB]

(2013) Fethi Lamraoui & Philippe Langlais: Yet another fast, robust and open source sentence aligner: time to reconsider sentence alignment? Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.77-84. [PDF, 610KB]

(2013) Philippe Langlais: Mapping source to target strings without alignment by analogical learning: a case study with transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.684-689. [PDF, 221KB]

(2010) Stéphane Huet, Julien Bourdaillet, Alexandre Patry, & Philippe Langlais: The RALI machine translation system for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 103-109. [PDF, 115KB]

(2010) Audrey Laroche & Philippe Langlais: Revisiting context-based projection methods for term-translation spotting in comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.617-625. [PDF, 224KB]

Sherbrooke, Université de

(2014) Anne Marie Taravella: Affective impact of the use of technology on employed language specialists: an exploratory qualitative study. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.202-210. [PDF, 151KB]

(2013) Anne Marie Taravella & Alain O.Villeneuve: Acknowledging the needs of computer-assisted translation tools users: the human perspective in human-machine translation. Journal of Specialised Translation, issue 19, January 2013; pp.62-74. [PDF, 361KB]

Simon Fraser University

(2015) Hassan Shavarani, Maryam Siahbani, Ramtin Mehdizadeh Seraj, & Anoop Sarkar:  Learning segmentations that balance latency versus quality in spoken language translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.217-224. [PDF, 3.5MB]

(2014) Rohit Dholakia & Anoop Sarkar: Pivot-based triangulation for low-resource languages. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.315-328. [PDF, 580KB]

(2014) Baskaran Sankaran & Anoop Sarkar: Bayesian iterative-cascade framework for hierarchical phrase-based translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.15-27. [PDF, 728KB]

(2014) Maryam Siahbani & Anoop Sarkar: Expressive hierarchical rule extraction for left-to-right translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.1-14. [PDF, 814KB]

(2013) Majid Razmara, Maryam Siahbani, Gholamreza Haffari, & Anoop Sarkar: Graph propagation for paraphrasing out-of-vocabulary words in statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1105-1115. [PDF, 325KB]

(2013) Majid Razmara & Anoop Sarkar: Ensemble triangulation for statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.252-260. [PDF, 240KB]

(2013) Majid Razmara & Anoop Sarkar: Stacking for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.334-339. [PDF, 284KB]

(2013) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Scalable variational inference for extracting hierarchical phrase-based translation rules. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.438-446. [PDF, 310KB]

(2013) Maryam Siahbani, Baskaran Sankaran, & Anoop Sarkar: Efficient left-to-right hierarchical phrase-based translation with improved reordering. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1089-1099. [PDF,351KB]

 (2013) Mahsa Yarmohammadi, Vivek K.Rangarajan Sridhar, Srinivas Bangalore, & Baskaran Sankaran: Incremental segmentation and decoding strategies for simultaneous translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1032-1036. [PDF, 192KB]

(2012) Majid Razmara, Baskaran Sankaran, Ann Clifton, & Anoop Sarkar: Kriya –the SFU system for translation task at WMT-12. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.356-361. [PDF, 130KB]

(2012) Majid Razmara, George Foster, Baskaran Sankaran, & Anoop Sarkar: Mixing multiple translation models in statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.940-949. [PDF, 229KB]

(2012) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Compact rule extraction for hierarchical phrase-based translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 342KB]

(2012) Baskaran Sankaran, Majid Razmara, & Anoop Sarkar: Kriya – an end-to-end hierarchical phrase-based MT system. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.83-98. [PDF,

(2011) Ann Clifton & Anoop Sarkar: Combining morpheme-based machine translation with post-processing morpheme prediction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.32-42. [PDF, 287KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

(2010) Baskaran Sankaran, Ajeet Grewal, & Anoop Sarkar: Incremental decodings for phrase-based statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 216-223. [PDF, 137KB]

Synchromedia

(2011) Amir Hazem, Emmanuel Morin & Sebastian Peña Saldarriaga: Bilingual lexicon extraction from comparable corpora as metasearch. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.35-43. [PDF, 159KB]

Toronto University

(2012) Ulrich Germann: Syntax-aware phrase-based statistical machine translation: system description. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.292-297. [PDF, 94KB]

(2012) Oliver Kutz, Christoph Lange, Till Mossakowski, C.Maria Keet, Fabian Neuhaus, & Michael Gruninger: The Babel of the Semantic Web tongues – in search of the Rosetta stone of interoperability. Semantic Web Conference 2012; 6pp. [PDF, 813KB]

(2010) Eric Corlett & Gerald Penn: An exact A* method for deciphering letter-substitution ciphers. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1040-1047. [PDF, 132KB]

Waterloo, University of

(2010) Roland Kuhn, Boxing Chen, George Foster, & Evan Stratford: Phrase clustering for smoothing TM probabilities – or, how to extract paraphrases from phrase tables. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.608-616. [PDF, 328KB]

 

China

Baidu Inc.

(2015) Xiaoguang Hu, Wei Li, Xiang Lan, Hua Wu, & Haifeng Wang: Improved beam search with constrained softmax for NMT. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.297-309. [PDF, 1,178KB]

(2013) Mo Yu, Tiejun Zhao, Yalong Bai, Hao Tian, & Dianhai Yu: Cross-lingual projections between languages from differenr families. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.312-317. [PDF, 187KB]

(2013) Chao Zhang, Shiqi Zhao, & Haifeng Wang: Bootstrapping large-scale named entities using URL-text hybrid patterns. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.293-301. [PDF, 249KB]

(2013) Jiang Zhao, Man Lan, & Zheng-Yu Niu: ECNUCS: recognizing cross-lingual textual entailment using multiple text similarity and text difference measures. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.118-123. [PDF, 100KB]

(2013) Xiaoning Zhu, Zhongjun He, Hua Wu, Haifeng Wang, Conghui Zhu, & Tiejun Zhao: Improving pivot-based statistical machine translation using random walk. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.524-534. [PDF, 291KB]

(2012) Hanna Béchara, Raphaël Rubino, Yifan He, Yanjun Ma, & Josef van Genabith: An evaluation of statistical post-editing systems applied to RBMT and SMT systems. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.215-230. [PDF, 243KB]

(2012) Jinsong Su, Hua Wu, Haifeng Wang, Yidong Chen, Xiaodong Shi, Huailin Dong, & Qun Liu: Translation model adaptation for statistical machine translation with monolingual topic information. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.459-468. [PDF, 368KB]

(2012) Xianchao Wu, Takuya Matsuzaki, & Jun’ichi Tsujii: Akamon: an open source toolkit for tree/forest-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.127-132. [PDF, 501KB]

(2011) Wei He, Shiqi Zhao, Haifeng Wang, & Ting Liu: Enriching SMT training data via paraphrasing. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.803-810. [PDF, 157KB]

(2011) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Rich linguistic features for translation memory-inspired consistent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.456-463. [PDF, 159KB]

(2011) Zhanyi Liu, Haifeng Wang, Hua Wu, Ting Liu, & Sheng Li: Reordering with source language collocations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1036-1044. [PDF, 462KB]

(2011) Yanjun Ma, Yifan He, Andy Way, & Josef van Genabith: Consistent translation using discriminative learning: a trnslation memory-inspired approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1239-1248. [PDF, 193KB]

(2011) Yanjun Ma, Yifan He, & Josef van Genabith: From the confidence estimation of machine translation to the integration of MT and translation memory. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.13-15. [PDF, 75KB]

(2011) Ankit K.Srivastava, Yanjun Ma, & Andy Way: Oracle-based training for phrase-based statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.169-176. [PDF, 428KB]

(2011) Haifeng Wang: Web-based machine translation [abstract]. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; p.1. [PDF, 149KB]

(2010) Zhanyi Liu, Haifeng Wang, Hua Wu, & Sheng Li: Improving statistical machine translation with monolingual collocation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.825-833. [PDF, 405KB]

(2010) Shiqi Zhao, Haifeng Wang, Xiang Lan, & Ting Liu: Leveraging multiple MT engines for paraphrase generation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1326-1334. [PDF, 597KB]

Beihang University

(2010) Qian Duoxiu: Pedagogical reflections on computer-aided translation as a course [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 13-25.

Beijing Foreign Studies University

(2011) Ying Qin & Guohua Chen: Forward-backward machine transliteration between English and Chinese based on combined CRFs. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.82-85. [PDF, 168KB]

Beijing Institute of Technology

(2010) Huang Heyan: Researches on MT and language information processing of CCLIE & BIT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 37pp. [PDF, 7229KB]

Beijing Jiaotong University

(2011) Wenliang Chen, Jun’ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, & Haizhou Li: SMT helps bitext dependency parsing. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.73-83. [PDF, 583KB]

Beijing Lingosail Tech Co

(2015) Xiaona Ren, Yongpeng Wei, & Rile Hu: Simplify sentence structure for improving human post-editing efficiency on Chinese-to-English patent machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Sixth Workshop on Patent and Scientific Literature Translation (PSLT6); pp.33-43. [PDF, 909KB]

Beijing Normal University

(2015) Hongzheng Li, Kai Zhao, Renfen Hu, Yun Zhu, & Yaohong Jin: A hybrid system for Chinese-English patent machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Sixth Workshop on Patent and Scientific Literature Translation (PSLT6); pp.52-67. [PDF, 1125KB]

Beijing University

(2010) Chang Baobao, Kang Byeong-kwu, & Yu Shiwen: Developing CAT tools for translating Chinese scientific monographs [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 165-179.

Centre of Computer & Language Information Engineering

(2010) Huang Heyan: Researches on MT and language information processing of CCLIE & BIT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 37pp. [PDF, 7229KB]

China Patent Information Center

(2011) Hongfei Jiang: Expanding the applications of MT in patent translation. Invited talk at [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.17-36. [PDF, 7898KB]

Chinese Academy of Sciences

(2010) Sun Le: A user-adaptive framework for computer-aided translation system [abstract]. Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 155-64.

City University of Hong Kong

(2012) Billy T.M.Wong & Chunyu Kit: Extending machine translation evaluation metrics with lexical cohesion to document level. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1060-1068. [PDF, 125KB]

(2011) Oi Yee Kwong: English-Chinese personal name transliteration by syllable-based maximum matching. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.96-100. [PDF, 256KB]

(2011) Bin Lu, Ka Po Chow, & Benjamin K.Tsou: The cultivation of a Chinese-English-Japanese trilingual parallel corpus from comparable patents. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.472-479. [PDF, 207KB]

(2011) Billy Wong & Chunyu Kit: Comparative evaluation of term informativeness measures in machine translation evaluation metrics. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.537-544. [PDF, 147KB]

(2010) Chan Sin-wai: A new curriculum for the teaching of translation technology: the teaching of a translation project course as a case study [abstract]. Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 83-154.

(2010) Cong Hui, Hai Zhao, Yan Song, & Bao-Liang Lu: An empirical study on development set selection strategy for machine translation learning.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and  MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 67-71. [PDF, 447KB]

(2010) Bin Lu, Tao Jiang, Kapo Chow, & Benjamin K. Tsou: Building a large English-Chinese parallel corpus from comparable patents and its experimental application to SMT.  [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.42-49. [PDF, 271KB]

(2010) Yan Song, Chunyu Kit, & Hai Zhao: Reranking with multiple features for better transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.62-65. [PDF, 395KB]

(2010) Billy Tak-Ming Wong: Semantic evaluation of machine translation.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2884-2888. [PDF, 574KB]

(2010) Billy T-M Wong & Chunyu Kit: The parameter-optimized ATEC metric for MT evaluation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 360-364. [PDF, 114KB]

Dalian University of Technology

(2011) Ma Jianjun, Huang Degen, Liu Haixia, & Sheng Wenfeng: POS tagging of English particles for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.57-63. [PDF, 222KB]

(2010) Degen Huang, Lian Zhao, Lishuang Li, & Haitao Yu: Mining large-scale comparable corpora from Chinese-English news collections. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.472-480. [PDF, 255KB]

Digital Butter (Hong Kong)

(2011) Simon Shi, Pascale Fung, Emmanuel Prochasson, Chi-kiu Lo, & Dekai Wu: Mining parallel documents using low bandwidth and high precision CLIR from the heterogeneous web. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.420-428. [PDF, 512KB]

East China Normal University

(2013) Jiang Zhao, Man Lan, & Zheng-Yu Niu: ECNUCS: recognizing cross-lingual textual entailment using multiple text similarity and text difference measures. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.118-123. [PDF, 100KB]

Fudan University

(2010) Yuejie Zhang, Yang Wang, Lei Cen, Yanxia Su, Cheng Jin, Xiangyang Xue, & Jianping Fan: Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1435-1443. [PDF, 551KB]

Fujitsu Research and Development Company (China)

(2011) Zhongguang Zheng, Yao Meng, & Hao Yu: Lexical-based reordering model for hierarchical phrase-based machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.389-396. [PDF, 385KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Extending the hierachical phrase based model with maximum entropy based BTG. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 104KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Learning phrase boundaries for hierarchical phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.383-390. [PDF, 378KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Maximum entropy based phrase reordering for hierarchical phrase-based translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.555-563. [PDF, 144KB]

Harbin Institute of Technology, Harbin University of Science and Technology

(2013) Wanxiang Che, Mengqiu Wang, Christopher D.Manning, & Ting Liu: Named entity recognition with bilingual constraints. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.52-62. [PDF, 289KB]

(2013) Lei Cui, Xilun Chen, Dongdong Zhang, Shujie Liu, Mu Li, & Ming Zhou: Multi-domain adaptation for SMT using multi-task learning. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1055-1065. [PDF, 302KB]

(2013) Lei Cui, Dongdong Zhang, Shujie Liu, Mu Li, & Ming Zhou: Bilingual data cleaning for SMT using graph-based random walk.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.340-345. [PDF, 259KB]

(2013) Tingting Li, Tiejun Zhao, Andrew Finch, & Chunyue Zhang: A tightly-coupled unsupervised clustering and bilingual alignment model for transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.393-398. [PDF, 698KB]

 (2013) Lemao Liu, Taro Watanabe, Eiichiro Sumita, & Tiejun Zhao: Additive neural networks for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.791-801. [PDF, 261KB]

(2013) Lemao Liu, Tiejun Zhao, Taro Watanabe, & Eiichiro Sumita: Tuning SMT with a large number of features via online feature grouping. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.279-285. [PDF, 153KB]; revised, 7pp. [PDF]

(2013) Mengqiu Wang, Wanxiang Che, & Christopher D.Manning: Joint word alignment and bilingual named entity recognition using dual decomposition. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1073-1082. [PDF, 424KB]

(2013) Mo Yu, Tiejun Zhao, Yalong Bai, Hao Tian, & Dianhai Yu: Cross-lingual projections between languages from differenr families. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.312-317. [PDF, 187KB]

(2013) Conghui Zhu, Taro Watanabe, Eiichiro Sumita, & Tiejun Zhao: Hierarchical phrase table combination for machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.802-810. [PDF, 1464KB]

(2013) Junguo Zhu, Muyun Yang, Sheng Li, & Tiejun Zhao: Repairing incorrect translation with examples. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.967-971. [PDF, 523KB]

(2013) Xiaoning Zhu, Zhongjun He, Hua Wu, Haifeng Wang, Conghui Zhu, & Tiejun Zhao: Improving pivot-based statistical machine translation using random walk. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.524-534. [PDF, 291KB]

(2012) Wei He, Hua Wu, Haifeng Wang, & Ting Liu: Improve SMT quality with automatically extracted paraphrase rules. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.979-987. [PDF, 330KB]

(2012) Xinxin Li, Xuan Wang, & Wingwei Liao: Simple maximum entropy models for multilingual coreference resolution. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.83-87. [PDF, 205KB]

(2012) Lemao Liu, Tiejun Zhao, Taro Watanabe, Hailong Cao, & Conghui Zhu: Expected error minimization with ultraconservative update for SMT. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.723-732. [PDF, 144KB]

(2012) Lemao Liu, Hailong Cao, Taro Watanabe, Tiejun Zhao, Mo Yu, & CongHui Zhu: Locally training the log-linear model for SMT. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.402-411. [PDF, 289KB]

(2012) Shujie Liu, Chi-Ho Li, Mu Li & Ming Zhou: Learning translation consensus with structured label propagation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.302-310. [PDF, 204KB]

(2012) Shujie Liu, Chi-Ho Li, Mu Li, & Ming Zhou: Re-training monolingual parser bilingually for syntactic SMT. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.854-862. [PDF, 708KB]

(2012) Chunye Zhang, Tingting Li, & Tiejun Zhao: Syllable-based machine transliteration with extra phrase features.  [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.52-56. [PDF, 163KB]

(2012) Xiaoning Zhu, Yiming Cui, Conghui Zhu, Tiejun Zhao, & Hailong Cao: The HIT-LTRC machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.77-80. [PDF, 576KB]; presentation, 14 slides [PDF of PPT, 1018KB]

(2011) Lei Cui, Dongdong Zhang, Mu Li, & Ming Zhou: Function word generation in statistical machine translation systems. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.139-146. [PDF, 184KB]

(2011) Nan Duan, Mu Li, Ming Zhou, & Lei Cui: Improving phrase extraction via MBR phrase scoring and pruning. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.189-197. [PDF, 302KB]

(2011) Ruiji Fu, Bing Qin, & Ting Liu: Generating Chinese named entity data from a parallel corpus. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.264-272. [PDF, 450KB]

(2011) Wei He, Shiqi Zhao, Haifeng Wang, & Ting Liu: Enriching SMT training data via paraphrasing. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.803-810. [PDF, 157KB]

(2011) Chi-Ho Li, Shujie Liu, Chenguang Wang, & Ming Zhou: ENGtube: an integrated subtitle environment for ESL. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.373-377. [PDF, 360KB]

(2011) Lemao Liu, Tiejun Zhao, Chao Wang, & Hailong Cao: A unified and discriminative soft syntactic constraint model for hierarchical phrase-based translation.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.253-260. [PDF, 197KB]

(2011) Shujie Liu, Chi-Ho Li & Ming Zhou: Statistic machine translation boosted with spurious word deletion. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.72-79. [PDF, 239KB]

(2011) Shujie Liu, Chi-Ho Li, & Ming Zhou: A unified SMT framework combining MIRA and MERT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.181-188. [PDF, 233KB]

 (2011) Yupeng Liu, Tiejun Zhao, & Sheng Li: Hypergraph training and decoding of system combination in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.570-577 [PDF, 235KB]

(2011) Zhanyi Liu, Haifeng Wang, Hua Wu, Ting Liu, & Sheng Li: Reordering with source language collocations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1036-1044. [PDF, 462KB]

(2011) Yinggong Zhao, Shujie Liu, Yangsheng Ji, Jiajun Chen, & Guodong Zhou: Transductive minimum error rate training for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.641-648. [PDF, 214KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: A joint rule selection model for hierarchical phrase-based translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.6-11. [PDF, 213KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: Hybrid decoding: decoding with partial hypotheses combination over multiple SMT systems. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.214-222. [PDF, 497KB]

(2010) Shui Liu, Sheng Li, Tiejun Zhao, Min Zhang, & Pengyuan Liu: Head-modifier relation based non-lexical reordering model for phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.748-756. [PDF, 297KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Discriminative pruning for discriminative ITG alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.316-324. [PDF, 972KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Improved discriminative ITG alignment using hierarchical phrase pairs and semi-supervised training. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.730-738. [PDF, 1050KB]

(2010) Zhanyi Liu, Haifeng Wang, Hua Wu, & Sheng Li: Improving statistical machine translation with monolingual collocation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.825-833. [PDF, 405KB]

(2010) Shiqi Zhao, Haifeng Wang, Xiang Lan, & Ting Liu: Leveraging multiple MT engines for paraphrase generation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1326-1334. [PDF, 597KB]

(2010) Junguo Zhu, Muyun Yang, Bo Wang, Sheng Li, & Tiejun Zhao: All in strings: a powerful string-based automatic MT evaluation metric with multiple granularities. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1533-1540. [PDF, 159KB]

Henan University of Technology

(2012) Baoli Li: Learning to model multilingual unrestricted coreference in OntoNotes. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.129-135. [PDF, 130KB]

Hong Kong, City University

(2015) Chaochao Wang, Deyi Xiong, Min Zhang, & Chunyu Kit: Learning bilingual distributed phrase represenations for statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.32-43. [PDF, 668KB]

(2013) Xiaojun Quan, Chunyu Kit, & Yan Song: Non-monotonic sentence alignment via semisupervised learning.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.622-630. [PDF, 348KB]

 (2013) Chengzhi Zhang, Xuchen Yao & Chunyu Kit: Finding more bilingual webpages with high credibility via link analysis. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.138-143. [PDF, 499KB]

Hong Kong Institute of Education

(2011) Bin Lu, Ka Po Chow, & Benjamin K.Tsou: The cultivation of a Chinese-English-Japanese trilingual parallel corpus from comparable patents. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.472-479. [PDF, 207KB]

(2011) Benjamin K.Tsou & Bin Lu: Machine translation between uncommon language pairs via a third common language: the case of patents. Translating and the Computer 33, 17-18 November 2011, London; 13pp. [PDF, 185KB]

Hong Kong University

(2014) Jessica Xiangyu Liu: Task-based teaching of computer-aided translation in a progressive manner. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.235-242. [PDF, 249KB]

Hong Kong University of Science and Technology

(2015) Meriem Beloucif, Markus Saers, & Dekai Wu: Improving semantic SMT via soft semantic role label constraints on ITG alignmens. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.333-345. [PDF, 647KB]  

(2014) Anik Dey & Pascale Fung: A Hindi-English code-switching corpus. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2410-2413. [PDF, 427KB]

(2014) Chi-kiu Lo & Dekai Wu: On the reliability and inter-annotator agreement of human semantic MT evaluation via HMEANT. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.602-607. [PDF, 279KB]

(2013) Chi-kiu Lo & Dekai Wu: Human semantic MT evaluation with HMEANT for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 317KB]

(2013) Chi-kiu Lo, Karteek Addanki, Markus Saers & Dekai Wu: Improving machine translation by training against an automatic semantic frame based evaluation metric. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.375-381. [PDF, 370KB]

(2013) Chi-kiu Lo, Meriem Beloucif, & Dekai Wu: Improving machine translation into Chinese by tuning against Chinese MEANT. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 284KB]

(2013) Chi-kiu Lo & Dekai Wu: Can informal genres be better translated by tuning on automatic semantic metrics? Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.93-100. [PDF, 598KB]

(2013) Chi-kiu Lo & Dekai Wu: MEANT at WMT 2013: A tunable, accurate yet inexpensive semantic frame based MT evaluation metric. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.422-428. [PDF, 563KB]

(2013) Markus Saers, Karteek Addanki & Dekai Wu: Combining top-down and bottom-up search for unsupervised induction of transduction grammars. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 48-57. [PDF, 431KB]

(2013) Markus Saers, Karteek Addanki & Dekai Wu: Segmenting vs.chunking rules: unsupervised ITG induction via minim conditional description length. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.584-592. [PDF, 388KB]

(2013) Markus Saers & Dekai Wu: Unsupervised learning of bilingual categories in inversion transduction grammar induction. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 223KB]

(2013) Markus Saers, Karteek Addanki, & Dekai Wu: Unsupervised transduction grammar induction via minimum description length. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.67-73. [PDF, 418KB]

(2013) Markus Saers & Dekai Wu: Bayesian induction of bracketing inversion transduction grammars. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1158-1166. [PDF, 483KB]

(2013) Dekai Wu, Karteek Addanki, Markus Saers, & Meriem Beloucif: Learning to Freestyle: hip hop challenge-response induction via transduction rule segmentation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.102-112. [PDF, 398KB]

(2013) Dekai Wu, Karteek Addanki, & Markus Saers: Modeling hip hop challenge-response lyrics as machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.109-116. [PDF, 408KB]

(2012) Karteek Addanki, Chi-kui Lo, Markus Saers, & Dekai Wu: LTG vs. ITG coverage of cross-lingual verb frame alternations. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.295-302. [PDF, 283KB]

(2012) Ondřej Bojar & Dekai Wu: Towards a predicate-argument evaluation for MT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.30-38. [PDF, 126KB]

(2012) Chi-kiu Lo, Anand Karthik Tumuluru, & Dekai Wu: Fully automatic semantic MT evaluation.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.243-252. [PDF, 379KB]

(2012) Chi-kiu Lo & Dekai Wu: Unsupervised vs. supervised weight estimation for semantic MT evaluation metrics. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.49-56. [PDF, 151KB]

(2012) Markus Saers, Karteek Addanki, & Dekai Wu: From finite-state to inversion transductions: toward unsupervised bilingual grammar induction. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2325-2339. [PDF, 222KB]

(2012) Ping Xu & Pascale Fung: Cross-lingual language modelling with syntactic reordering for low-resource speech recognition. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.766-776. [PDF, 275KB]

(2011) Christian Hardmeier, Jörg Tiedemann, Markus Saers, Marcello Federico, & Mathur Prashant: The Uppsala-FBK systems at WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.372-378. [PDF, 115KB]

(2011) Chi-kiu Lo & Dekai Wu: MEANT: an inexpensive, high-accuracy, semi-automatic metric for evaluating translation utility via semantic frames. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.220-229. [PDF, 935KB]

(2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

(2011) Chi-kiu Lo & Dekai Wu: Structured vs. flat semantic role representations for machine translation evaluation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.10-20. [PDF, 758KB]

(2011) Emmanuel Prochasson & Pascale Fung: Rare word translation extraction from aligned comparable documents. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1327-1335. [PDF, 157KB]

(2011) Markus Saers, Dekai Wu, & Chris Quirk: On the expressivity of linear transductions. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.431-438. [PDF, 184KB]

(2011) Markus Saers & Dekai Wu: Linear transduction grammars and zipper finite-state transducers.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.640-647. [PDF, 144KB]

(2011) Markus Saers & Dekai Wu: Principled induction of phrasal bilexica. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.313-320. [PDF, 373KB]

(2011) Markus Saers & Dekai Wu: Reestimation of reified rules in semiring parsing and biparsing. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.70-78. [PDF, 237KB]

(2011) Simon Shi, Pascale Fung, Emmanuel Prochasson, Chi-kiu Lo, & Dekai Wu: Mining parallel documents using low bandwidth and high precision CLIR from the heterogeneous web. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.420-428. [PDF, 512KB]

(2011) Dekai Wu: Syntactic SMT and semantic SMT. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.12-13. [PDF, 75KB]

(2010) Chen Yuncong & Pascale Fung: Unsupervised synthesis of multilingual Wikipedia articles. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.197-205. [PDF, 1040KB]

(2010) Pascale Fung, Emmanuel Prochasson, & Simon Shi: Trillions of comparable documents. [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.26-34. [PDF, 199KB]

(2010) Chi-kiu Lo & Dekai Wu: Evaluating machine translation utility via semantic role labels. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2873-2877. [PDF, 417KB]

(2010) Chi-kiu Lo & Dekai Wu: Semantic vs. syntactic vs. n-gram structure for machine translation evaluation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.52-60. [PDF, 401KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Linear inversion transduction grammar alignments as a second translation path. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 167-171. [PDF, 111KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: A systematic comparison between inversion transduction grammar and linear transduction grammar for word alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.10-18. [PDF, 279KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Word alignment with stochastic bracketing linear inversion transduction grammar.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.341-344. [PDF, 97KB]

HuanJian Group Co. Ltd.

(2010) Huang Heyan: Researches on MT and language information processing of CCLIE & BIT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 37pp. [PDF, 7229KB]

ICT  see Institute of Computing Technology

Institute of Automation (Chinese Academy of Sciences)

(2013) Yufeng Chen, Chengqing Zong; & Keh-Yi Su: A joint model to identify and align bilingual named entities. Computational Linguistics 39 (2); pp.229-266. [PDF, 708KB]

(2013) Xingyuan Peng, Xiaoyin Fu, Wei Wei, Zhenbiao Chen, Wei Chen, & Bo Xu: The CASIA machine translation system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 158KB]

(2013) Mei Tu, Yu Zhou, & Chengqing Zong: A novel translation framework based on rhetorical structure theory.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.370-374. [PDF, 610KB]

(2013) Kun Wang, Chengqing Zong, & Key-Yih Su: Integrating translation memory into phrase-based machine translation during decoding. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.11-21. [PDF, 1776KB]

(2013) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Handling ambiguities of bilingual predicate-argument structures for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1127-1136. [PDF, 269KB]

 (2013) Jiajun Zhang & Chengqing Zong: Learning a phrase-based translation model from monolingual data with application to domain adaptation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1425-1434. [PDF, 476KB]

 (2013) Guangyou Zhou, Fang Liu, Yang Liu, Shizhu He, & Jun Zhao: Statistical machine translation improves question retrieval in community question answering via matrix factorization. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.852-861. [PDF, 814KB]

(2012) Shixiang Lu, Wei Wei, Xiaoyin Fu, & Bo Xu: Translation model based cross-lingual language model adaptation: from word models to phrase models. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.512-522. [PDF, 196KB]

(2012) Mei Tu, Yu Zhou & Chengqing Zong: A universal approach to translating numerical and time expressions. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.209-216. [PDF, 756KB]

(2012) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Machine translation by modelling predicate-argument structure transformation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3019-3036. [PDF, 1473KB]

(2012) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Tree-based translation without using parse trees. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3037-3054. [PDF, 1800KB]

(2012) Guangyou Zhou, Kang Liu, & Jun Zhao: Exploiting bilingual translation for question retrieval in community-based question answering. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3153-3170. [PDF, 352KB]

(2011) Yufeng Chen & Chengqing Zong: A semantic-specific model for Chinese named entity translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.138-146. [PDF, 282KB]

(2011) Maoxi Li, Chenqing Zong, & Hwee Tou Ng: Automatic evaluation of Chinese translation output: word-level or character-level? ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.159-164. [PDF, 151KB]

(2011) Wei Wei & Bo Xu: Effective use of discontinuous phrases for hierarchical phrase-based translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.397-404. [PDF, 148KB]

(2011) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Simple but effective approaches to improving tree-to-tree model. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.261-268. [PDF, 225KB]

(2011) Jiajun Zhang, Feifei Zhai, & Chenqing Zong: Augmenting string-to-tree translation models with fuzzy use of source-side syntax. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.204-215. [PDF, 317KB]

(2011) Guangyou Zhou, Li Cai, Jun Zhao, & Kang Liu: Phrase-based translation model for question retrieval in community question answer archives. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.653-662. [PDF, 759KB]

(2010) Yufeng Chen, Chenqing Zong, & Keh-Yih Su: On jointly recognizing and aligning bilingual named entities. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.631-639. [PDF, 151KB]

(2010) Yanqing He, Yu Zhou, Chengqing Zong, & Huilin Wang: A novel reordering model based on multi-layer phrase for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.447-455. [PDF, 234KB]

Institute of Computational Linguistics

(2010) Shui Liu, Sheng Li, Tiejun Zhao, Min Zhang, & Pengyuan Liu: Head-modifier relation based non-lexical reordering model for phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.748-756. [PDF, 297KB]

(2010) Xiaojun Wan, Huiying Li, & Jianguo Xiao: Cross-language document summarization based on machine translation quality prediction. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.834-843. [PDF, 318KB]

Institute of Computer Science and Technology

(2013) Xiaojun Wan: Co-regression for cross-language review rating prediction.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.526-531. [PDF, 220KB]

Institute of Computing Technology (Chinese Academy of Sciences)

(2014) Liangyou Li, Xiaofeng Wu, Santiago Cortés Vaíllo, Jun Xie, Andy Way, & Qun Liu: The DCU-ICTCAS MT system at WMT 2014 on German-English translation task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.136-141. [PDF, 306KB]

(2014) Xiaofeng Wu, Hui Yu, & Qun Liu: RED: DCU-CASICT participation in WMT2014 metrics task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.420-425. [PDF, 301KB]

(2014) Sitong Yang, Heng Yu, Hongmei Zhao, Qun Liu, & Yajuan Lü: Review and analysis of China workshop on machine translation 2013 evaluation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.208-221. [PDF, 767KB]

(2013) Guisheng Ben, Deyi Xiong, Zhiyang Teng, Yajuan Lü, & Qun Liu: Bilingual lexical cohesion trigger model for document-level machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.382-386. [PDF, 155KB]

(2013) Qun Liu, Zhaopeng Tu, & Shouxun Lin: A novel graph-based compact representation of word alignment. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.358-363. [PDF, 211KB]

(2013) Fandong Meng, Jun Xie, Linfeng Song, Yajuan Lü, & Qun Liu: Translation with source constituency and dependency trees. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1066-1076. [PDF, 341KB]

(2013) Raphael Rubino, Antonio Toral, Santiago Cortés Vaíllo, Jun Xie, Xiaofeng Wu, Stephen Doherty, & Qun Liu: The CNGL-DCU-Prompsit translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.213-218. [PDF, 152KB]

(2013) Zhiyang Wang, Yajuan Lü, Meng Sun, & Qun Liu: Stem translation with affix-based rule selection for agglutinative languages. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.364-369. [PDF, 667KB]

(2013) Xiaofeng Wu, Hui Yu & Qun Liu: DCU participation in WMT2013 metrics task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.435-439. [PDF, 175KB]

 (2013) Tian Xia, Zongcheng Ji, Shaodan Zhai, Yidong Chen, Qun Liu, & Shaojun Wang: Improving alignment of system combination by using multi-objective optimization. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.535-544. [PDF, 204KB]

 (2013) Heng Yu, Liang Huang, Haitao Mi, & Kai Zhou: Max-violation perceptron and forced decoding for scalable MT training. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1112-1123. [PDF, 374KB]

(2013) Heng Yu, Jinsong Su, Yajuan Lü, & Qun Liu: A topic-triggered language model for statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.447-454. [PDF, 130KB]

(2012) Yang Feng, Dongdong Zhang, Mu Li, Ming Zhou, & Qun Liu: Hierarchical chunk-to-string translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.950-958. [PDF, 152KB]

(2012) Yang Feng, Yang Liu, Qun Liu, & Trevor Cohn: Left-to-right tree-to-string decoding with prediction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1191-1200. [PDF, 511KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Joseph van Genabith: Head-driven hierarchical phrase-based translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.33-37. [PDF, 268KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Josef van Genabith: Using syntactic head information in hierarchical phrase-based translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.232-242. [PDF, 283KB]

(2012) Jinsong Su, Hua Wu, Haifeng Wang, Yidong Chen, Xiaodong Shi, Huailin Dong, & Qun Liu: Translation model adaptation for statistical machine translation with monolingual topic information. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.459-468. [PDF, 368KB]

(2012) Zhaopeng Tu, Yang Liu, Yifan He, Josef van Genabith, Qun Liu, & Shouxun Lin: Combining multiple alignments to improve machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1249-1260. [PDF, 178KB]

(2012) Xinyan Xiao, Deyi Xiong, Min Zhang, Qun Liu, & Shouxun Lin: A topic similarity model for hierarchical phrase-based translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.750-758. [PDF, 630KB]

(2012) Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, & Shouxun Lin: Unsupervised discriminative induction of synchronous grammar for machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2883-2898. [PDF, 252KB]

(2012) Hao Xiong & Qun Liu: ICT: system description for CoNLL-2012. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.71-75. [PDF, 557KB]

(2011) Wenbin Jiang, Qun Liu, & Yajuan Lü: Relaxed cross-lingual projection of constituent syntax. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1192-1201. [PDF, 278KB]

(2011) Xianhua Li, Yajuan Lü, Yao Meng, Qun Liu, & Hao Yu: Feedback selecting of manually acquired rules using automatic evaluation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.52-59. [PDF, 185KB]

(2011) Yang Liu, Qun Liu, & Yajuan Lü: Adjoining tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1278-1287. [PDF, 159KB]

(2011) Linfeng Song, Haitao Mi, Yajuan Lü, & Qun Liu: Bagging-based system combination for domain adaptation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.293-298. [PDF,

(2011) Zhaopeng Tu, Yang Liu, Qun Liu, & Shouxun Lin: Extracting hierarchical rules from a weighted alignment matrix. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1294-1303. [PDF, 238KB]

(2011) Ashish Vaswani, Haitao Mi, Liang Huang & David Chiang: Rule Markov models for fast tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.856-864. [PDF, 122KB]

(2011) Zhiyang Wang, Yajuan Lü, & Qun Liu: Multi-granularity word alignment and decoding for agglutinative language translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.360-367. [PDF, 169KB]

(2011) Xinyan Xiao, Yang Liu, Qun Liu, & Shouxun Lin: Fast generation of translation forest for large-scale SMT discriminative training. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.880-888. [PDF, 818KB]

(2011) Jun Xie, Haitao Mi & Qun Liu: A novel dependency-to-string model for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.216-226. [PDF, 992KB]

(2011) Daqi Zheng, Yifan He, Yang Liu & Qun Liu: Maximum rank correlation training for statistical machine translation.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.48-55. [PDF, 184KB]

(2010) Yang Feng, Haitao Mi, Yang Liu, & Qun Liu: An efficient shift-reduce decoding algorithm for phrase-based machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.285-293. [PDF, 175KB]

(2010) Liang Huang & Haitao Mi: Efficient incremental decoding for tree-to-string translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.273-283. [PDF, 266KB]

(2010) Yang Liu & Liang Huang: Tree-based and forest-based translation. ACL 2010:Tutorial abstracts, Uppsala, Sweden, 11 July, 2010; p.2. [PDF, 58KB]

(2010) Yang Liu, Qun Liu, & Shouxun Lin: Discriminative word alignment by linear modeling. Computational Linguistics 36 (3), pp. 303-339 [PDF, 327KB]

(2010) Yang Liu & Qun Liu: Joint parsing and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.707-715. [PDF, 211KB]

(2010) Haitao Mi & Qun Liu: Constituency to dependency translation with forests. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1433-1442. [PDF, 190KB]

(2010) Haitao Mi, Liang Huang, & Qun Liu: Machine translation with lattices and forests. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.837-845. [PDF, 268KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

(2010) Jinsong Su, Yang Liu, Yajuan Lü, Haitao Mi, & Qun Liu: Learning lexicalized reordering models from reordering graphs. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.12-16. [PDF, 1365KB]

(2010) Zhaopeng Tu, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Dependency forest for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1092-1100. [PDF, 215KB]

(2010) Zhiyang Wang, Yajuan Lü, Qun Liu, & Young-Sook Hwang: Better filtration and augmentation for hierarchical phrase-based translation rules. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.142-146. [PDF, 135KB]

(2010) Xinyan Xiao, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Joint tokenization and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1200-1208. [PDF, 342KB]

(2010) Hao Xiong, Jun Xie, Hui Yu, Kai Liu, Wei Luo, Haitao Mi, Yang Liu, Yajuan Lü, & Qun Liu: The ICT statistical machine translation system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.73-79. [PDF, 510KB]

Institute of Intelligent Machines

(2013) Zede Zhu, Miao Li, Lei Chen, & Zhenxin Yang: Building comparable corpora based on bilingual LDA model.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.278-281. [PDF, 127KB]

(2010) Wen Li, Lei Chen, Wudabala, & Miao Li: Chained machine translation using morphemes as pivot language. [Coling 2010] Proceedings of the 8th Workshop on Asian Language Resources, Beijing, China, 21-22 August 2010; pp.169-177. [PDF, 170KB]

Institute of Scientific and Technical Information

(2010) Yanqing He, Yu Zhou, Chengqing Zong, & Huilin Wang: A novel reordering model based on multi-layer phrase for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.447-455. [PDF, 234KB]

Jiangxi Normal University

(2013) Maoxi Li, Aiwen Jiang, & Mingwen Wang: Listwise approach to learning to rank for automatic evaluation of machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.51-59. [PDF, 277KB]

(2012) Li Maoxi & Wang Mingwen: Confusion network based system combination for Chinese translation output: word-level or character-level? COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.17-24. [PDF, 282KB]

Key Laboratory of Computational Linguistics

(2012) Xinfan Meng, Furu Wei, Ge Xu, Longkai Zhang, Xiaohua Liu, Ming Zhou, & Houfeng Wang: Lost in translations? Building sentiment lexicons using context based machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.829-838. [PDF, 232KB]

(2010) Xiaojun Wan, Huiying Li, & Jianguo Xiao: Cross-language document summarization based on machine translation quality prediction. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.834-843. [PDF, 318KB]

Key Laboratory of Intelligent Technology and Systems

(2013) Yang Liu: A shift-reduce parsing algorithm for phrase-based string-to-dependency translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1-10. [PDF, 290KB]

Key Laboratory of Medical Image Computing

(2012) Tong Xiao, Jingbo Zhu, Hao Zhang, & Qiang Li: NiuTrans: an open source toolkit for phrase-based and syntax-based machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.19-24. [PDF, 258KB]

(2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

(2010) Tong Xiao, Jingbo Zhu, Hao Zhang, & Muhua Zhu: An empirical study of translation rule extraction with multiple parsers. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1345-1353. [PDF, 388KB]

Leshan Normal University

(2012) Yunqing Xia, Guoyu Tang, Peng Jin, & Xia Yang: CLTC: a Chinese-English cross-lingual topic corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.532-537. [PDF, 955KB]

Macau University

(2014) Yi Lu, Longyue Wang, Derek F.Wong, Lidia S.Chao, Yiming Wang, & Francisco Oliveira: Domain adaptation for medical text translation using web resources. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.233-238. [PDF, 485KB]

(2014) Liang Tian, Derek F.Wong, Lidia S.Chao, Paula Quaresma, Francisco Oliveira, Yi Lu, Shuo Li, Yiming Wang, & Longyue Wang: UM-Corpu: a large English-Chinese parallel corpus for statistical machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1837-1842. [PDF, 605KB]

(2014) Longyue Wang, Yi Lu, Derek F.Wong, Lidia Chao, Yiming Wang, & Francisco Oliveira: Combining domain adaptation approaches for medical text translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.254-259. [PDF, 348KB]

(2013) Aaron Li-Feng Han, Derek F.Wong, Lidia S.Chao, Liangye He, Yi Li, Junwen Xing, & Xiaodong Zeng: Language-independent model for machine translation evaluation with reinforced factors. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.215-222. [PDF, 623KB]

(2013) Shuo Li, Derek F.Wong & Lidia S.Chao: Experiments with POS-based restructuring and alignment-based re-ordering for statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.82-87. [PDF, 273KB]

 (2013) Aaron Li-Feng Han, Yi Lu, Derek F. Wong, Lidia S. Chao, Liangye He & Junwen Xing: Quality estimation for machine translation using the joint method of evaluation criteria and statisti­cal modeling. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.365-372. [PDF, 309KB]

(2013) Francisco Oliveira, Derek F.Wong, Lidia S.Chao, Liang Tian, & Liangye He: Influence of part-of-speech and phrasal category universal tag-set in tree-to-tree translation models. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.907-911. [PDF, 127KB]

(2013) Longyue Wang, Derek F.Wong, Lidia S.Chao, Junwen Xing, Yi Lu, & Isabel Trancoso: Edit distance: a new data selection criterion for domain adaptation in SMT. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.727-732. [PDF, 231KB]

(2012) Aaron L.F.Han, Derek F.Wong, & Lidia S.Chao: LEPOR: a robust evaluation metric for machine translation with augmented factors. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.441-450. [PDF, 546KB]

(2011) Liang Tian, Fai Wong, & Sam Chao: Word alignment using GIZA++ on Windows. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.369-372. [PDF, 192KB]

(2011) Fai Wong, Francisco Oliveira, Sam Chao, & Chi-Wai Tang: Handheld machine translation system based on constraint synchronous grammar. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.439-446. [PDF, 158KB]

(2010 Francisco Oliveira, Fai Wong, Sam Chao, & Chi-Wai Tang:  Portuguese-Chinese machine aided translation system. International Translators' Federation (FIT) 6th Asian Translator's Forum, Macau SAR.; 10pp. [PDF, 364KB]

(2010) Francisco Oliveira, Fai Wong, Iok-Sai Hong & Ming-Chui Dong: Parsing extended constraint synchronous grammar in Chinese-Portuguese machine translation. International Conference on Computational Processing of Portuguese Language (PROPOR 2010), Porto Alegre, Brazil, 2010; 6pp. [PDF, 91KB]

(2010) Liang Tian, Fai Wong, & Sam Chao: An improvement of translation quality with adding key-words in parallel corpus. Ninth International Conference on Machine Learning and Cybernetics Qingdao, China, 11-14 July 2010; pp. 1273-1278. [PDF, 226KB]

(2010) Hari Venkatesan: Teaching translation memory systems: SDL Trados 2007 [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 71-81.

(2010) Wong Fai & Chao Sam: PCT: Portuguese-Chinese machine translation systems [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 181-196.

Microsoft Research Asia

(2013) Yuki Arase & Ming Zhou: Machine translation detection from monolingual web-text.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1597-1607. [PDF, 591KB]

(2013) Lei Cui, Dongdong Zhang, Shujie Liu, Mu Li, & Ming Zhou: Bilingual data cleaning for SMT using graph-based random walk.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.340-345. [PDF, 259KB]

(2013) Lei Cui, Xilun Chen, Dongdong Zhang, Shujie Liu, Mu Li, & Ming Zhou: Multi-domain adaptation for SMT using multi-task learning. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1055-1065. [PDF, 302KB]

(2013) Nan Yang, Shujie Liu, Mu Li, Ming Zhou, & Nenghai Yu: Word alignment modeling with context dependent deep neural network. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.166-175. [PDF, 554KB]

(2012) Nan Duan, Mu Li, & Ming Zhou: Forced derivation tree based model training to statistical machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.445-454. [PDF, 761KB]

(2012) Yang Feng, Dongdong Zhang, Mu Li, Ming Zhou, & Qun Liu: Hierarchical chunk-to-string translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.950-958. [PDF, 152KB]

(2012) Seung-Wook Lee, Dongdong Zhang, Mu Li, Ming Zhou, & Hae-Chang Rim: Translation model size reduction for hierarchical phrase-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.291-295. [PDF, 106KB]

(2012) Shujie Liu, Chi-Ho Li, Mu Li & Ming Zhou: Learning translation consensus with structured label propagation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.302-310. [PDF, 204KB]

(2012) Shujie Liu, Chi-Ho Li, Mu Li, & Ming Zhou: Re-training monolingual parser bilingually for syntactic SMT. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.854-862. [PDF, 708KB]

(2012) Xinfan Meng, Furu Wei, Ge Xu, Longkai Zhang, Xiaohua Liu, Ming Zhou, & Houfeng Wang: Lost in translations? Building sentiment lexicons using context based machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.829-838. [PDF, 232KB]

(2012) Hong Sun & Ming Zhou: Joint learning of a dual SMT system for paraphrase generation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.38-42. [PDF, 121KB]

(2012) Xianchao Wu, Takuya Matsuzaki, & Jun’ichi Tsujii: Akamon: an open source toolkit for tree/forest-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.127-132. [PDF, 501KB]

(2012) Nan Yang, Mu Li, Dongdong Zhang, & Nenghai Yu: A ranking-based approach to word reordering for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.912-920. [PDF]

(2012) Dong Yu: Who can understand your speech better – deep neural network of Gaussian mixture model? [Abstract of] Keynote speech at: IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; p.9. [PDF, 546KB]

(2011) Daniel Andrade, Takuya Matsuzaki, & Jun’ichi Tsujii: Learning the optimal use of dependency-parsing information for finding translations with comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.10-18. [PDF, 651KB]

(2011) Lei Cui, Dongdong Zhang, Mu Li, & Ming Zhou: Function word generation in statistical machine translation systems. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.139-146. [PDF, 184KB]

(2011) Nan Duan, Mu Li, & Ming Zhou: Hypothesis mixture decoding for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1258-1267. [PDF, 1027KB]

(2011) Nan Duan, Mu Li, Ming Zhou, & Lei Cui: Improving phrase extraction via MBR phrase scoring and pruning. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.189-197. [PDF, 302KB]

(2011) Chi-Ho Li, Shujie Liu, Chenguang Wang, & Ming Zhou: ENGtube: an integrated subtitle environment for ESL. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.373-377. [PDF, 360KB]

(2011) Shujie Liu, Chi-Ho Li & Ming Zhou: Statistic machine translation boosted with spurious word deletion. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.72-79. [PDF, 239KB]

(2011) Shujie Liu, Chi-Ho Li, & Ming Zhou: A unified SMT framework combining MIRA and MERT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.181-188. [PDF, 233KB]

(2011) Junichi Tsujii: Resource-rich research on natural language processing and understanding. Keynote at: IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: A joint rule selection model for hierarchical phrase-based translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.6-11. [PDF, 213KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: Hybrid decoding: decoding with partial hypotheses combination over multiple SMT systems. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.214-222. [PDF, 497KB]

(2010) Nan Duan, Mu Li, Dongdong Zhang, & Ming Zhou: Mixture model-based minimum Bayes risk decoding using multiple machine translation systems. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.313-321. [PDF, 372KB]

(2010) Nan Duan, Hong Sun, & Ming Zhou: Translation model generalization using probability averaging for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.304-312. [PDF, 296KB]

(2010) Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, & Li Deng: The MSRA machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.135-138. [PDF, 254KB]; presentation: 17 slides [PDF of PPT, 477KB]

(2010) Gae-won You, Seung-won Hwang, Young-In Song, Long Jiang, & Zaiqing Nie: Mining name translations from entity graph mapping.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.430-439. [PDF, 1542KB]

(2010) Mu Li, Yinggong Zhao, Dongdong Zhang, & Ming Zhou: Adaptive development data selection for log-linear model in statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.662-670. [PDF, 213KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Discriminative pruning for discriminative ITG alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.316-324. [PDF, 972KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Improved discriminative ITG alignment using hierarchical phrase pairs and semi-supervised training. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.730-738. [PDF, 1050KB]

(2010) Tetsuya Sakai: Multilinguality at NTCIR, and moving on…[abstract]. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; p.2. [PDF, 56KB]

Minjiang University

(2012) Xinfan Meng, Furu Wei, Ge Xu, Longkai Zhang, Xiaohua Liu, Ming Zhou, & Houfeng Wang: Lost in translations? Building sentiment lexicons using context based machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.829-838. [PDF, 232KB]

Nanjing University

 (2013) Ashish Vaswani, Yinggong Zhao, Victoria Fossum, & David Chiang: Decoding with large-scale neural language models improves translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1387-1392. [PDF, 107KB]

(2013) Chengzhi Zhang, Xuchen Yao & Chunyu Kit: Finding more bilingual webpages with high credibility via link analysis. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.138-143. [PDF, 499KB]

(2012) Ning Xi, Guangchao Tang, Xinyu Dai, Shujian Huang, & Jiajun Chen: Enhancing statistical machine translation with character alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.285-290. [PDF, 171KB]

(2011) Shujian Huang, Stephan Vogel, & Jiajun Chen: Dealing with spurious ambiguity in learning ITG-based word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.379-383. [PDF, 182KB]

(2011) Ning Xi, Guangchao Tang, Boyuan Li, & Yinggong Zhao: Word alignment combination over multiple word segmentation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.1-5. [PDF, 329KB]

(2011) Yinggong Zhao, Shujie Liu, Yangsheng Ji, Jiajun Chen, & Guodong Zhou: Transductive minimum error rate training for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.641-648. [PDF, 214KB]

(2010) Shujian Huang, Kangxi Li, Xinyu Dai, & Jiajun Chen: Improving word alignment by semi-supervised ensemble. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.135-143. [PDF, 585KB]

(2010) Mu Li, Yinggong Zhao, Dongdong Zhang, & Ming Zhou: Adaptive development data selection for log-linear model in statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.662-670. [PDF, 213KB]

National Institute for Informatics

(2012) Xianchao Wu, Takuya Matsuzaki, & Jun’ichi Tsujii: Akamon: an open source toolkit for tree/forest-based statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.127-132. [PDF, 501KB]

National Institute for Information and Communication Technology

(2013) Rui Wang, Masao Utiyama, Isao Goto, Eiichiro Sumita, Hai Zhao, & Bao-Liang Lu: Converting continuous-space language models into n-gram language models for statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.845-850. [PDF, 571KB]

National Laboratory for Information Science and Technology

(2010) Peng Li, Maosong Sun, & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.710-718. [PDF, 267KB]

Nokia Research Center (Beijing)

(2010) Mikko Kurimo, William Byrne, John Dines, Philip N.Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, & Junichi Yamagishi: Personalising speech-to-speech translation in the EMIME project. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.48-53. [PDF, 679KB]

Northeastern University

(2012) Feiliang Ren: A practical Chinese-English ON translation method based on ON’s distribution characteristics on the web. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 239-246. [PDF, 102KB]

(2012) Tong Xiao, Jingbo Zhu, Hao Zhang, & Qiang Li: NiuTrans: an open source toolkit for phrase-based and syntax-based machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.19-24. [PDF, 258KB]

(2012) Jingbo Zhu, Tong Xiao, & Chunliang Zhang: Learning better rule extraction with translation span alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.280-284. [PDF, 235KB]

(2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

(2011) Jingbo Zhu & Tong Xiao: Improving decoding generalization for tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.418-423. [PDF, 311KB]

(2010) Tong Xiao, Jingbo Zhu, Muhua Zhu, & Huizhen Wang: Boosting-based system combination for machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.739-748. [PDF, 380KB]

(2010) Tong Xiao, Jingbo Zhu, Hao Zhang, & Muhua Zhu: An empirical study of translation rule extraction with multiple parsers. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1345-1353. [PDF, 388KB]

Peking University

(2015) Dong Zhan & Hiromi Nakaiwa: Automatic detection of antecedents of Japanese zero pronouns using a Japanese-English bilingual corpus. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.66-79. [PDF, 863KB]

(2012) Minwei Feng, Weiwei Sun, & Hermann Ney: Semantic cohesion model for phrase-based SMT. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.867-878. [PDF, 227KB]

(2011) Chi-Ho Li, Shujie Liu, Chenguang Wang, & Ming Zhou: ENGtube: an integrated subtitle environment for ESL. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.373-377. [PDF, 360KB]

(2010) Yu Jingsong & Wang Huashu: A proposed strategy for computer-aided translation education: a brief summary for the teaching practice of the CAT master’s program at Peking University [abstract]. Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 237-249.

Pohang University of Science and Technology

(2013) Taesung Lee & Seung-won Hwang: Bootstrapping entity translation on weakly comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.631-640. [PDF, 624KB]

(2013) Gae-won You, Young-rok Cha, Jinhan Kim, & Seung-won Hwang: Enriching entity translation discovery using selective temporality. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.201-205. [PDF, 651KB]

Shanghai Jiao Tong University

(2013) Rui Wang, Masao Utiyama, Isao Goto, Eiichiro Sumita, Hai Zhao, & Bao-Liang Lu: Converting continuous-space language models into n-gram language models for statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.845-850. [PDF, 571KB]

(2012) Jiří Maršík & Ondřej Bojar: TrTok: a fast and trainable tokenizer for natural languages. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.75-85. [PDF, 141KB]

(2012) Chunyang Wu & Hai Zhao: Regression with phrase indicators for estimating MT quality. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.152-156. [PDF, 119KB]

(2011) Duo Ding: Integrate multilingual web search results using cross-lingual topic models. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.20-24. [PDF, 367KB]

(2010) Cong Hui, Hai Zhao, Yan Song, & Bao-Liang Lu: An empirical study on development set selection strategy for machine translation learning.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and  MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 67-71. [PDF, 447KB]

(2010) Yan Song, Chunyu Kit, & Hai Zhao: Reranking with multiple features for better transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.62-65. [PDF, 395KB]

(2010) Yan Tian: YanFa: an online automatic scoring and intelligent feedback system of student English-Chinese translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Demonstrations volume; pp.25-28. [PDF, 227KB]

Shengyang Aerospace University

(2015) Guiping Zhang, Na Ye, Fang Cai, Chuang Wu, Xiangkui Sun, Jinfu Yuan, & Dongfeng Cai: Productivity promotion strategies for collaborative translation on huge volume technical documents. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; pp.248-258. [PDF, 1MB]

(2011) Wenjing Lang, Qiaoli Zhou, Guiping Zhang, & Dongfeng Cai: Multi-stage Chinese dependency parsing based on dependency direction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.64-71. [PDF, 128KB]

(2011) Na Ye & Guiping Zhang: Study on the impact factors of the translators’ post-editing efficiency in a collaborative translation environment. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.579-586. [PDF, 277KB]

Soochow University

(2015) Chaochao Wang, Deyi Xiong, Min Zhang, & Chunyu Kit: Learning bilingual distributed phrase represenations for statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.32-43. [PDF, 668KB]

(2013) Guisheng Ben, Deyi Xiong, Zhiyang Teng, Yajuan Lü, & Qun Liu: Bilingual lexical cohesion trigger model for document-level machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.382-386. [PDF, 155KB]

(2013) Xinyan Xiao & Deyi Xiong: Max-margin synchronous grammar induction for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.255-264. [PDF, 885KB]

(2013) Deyi Xiong, Yang Ding, Min Zhang, & Chew Lim Tan: Lexical chain based cohesion models for document-level statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1563-1573. [PDF, 254KB]

(2012) Gong Zheng Xian, Zhang Min, Tan ChewLim, & Zhou GuoDong: Classifier-based tense model for SMT. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.411-420. [PDF, 190KB]

(2012) Zhengxian Gong, Min Zhang, Chewlim Tan, & Guodong Zhou: N-gram-based tense models for statistical machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.276-285. [PDF, 265KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Josef van Genabith: Using syntactic head information in hierarchical phrase-based translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.232-242. [PDF, 283KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Joseph van Genabith: Head-driven hierarchical phrase-based translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.33-37. [PDF, 268KB]

(2012) Li LiangYou, Gong ZhengXian, & Zhou GuoDong: Phrase-based evaluation for machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.663-672. [PDF, 201KB]

(2011) Zhengxian Gong, Min Zhang, & Guodong Zhou: Cache-based document-level statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.909-919. [PDF, 330KB]

(2011) Zhengxian Gong, Guodong Zhou, & Liangyou Li: Improve SMT with source-side “topic-document” distributions. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.496-501. [PDF, 297KB]

(2011) Yinggong Zhao, Shujie Liu, Yangsheng Ji, Jiajun Chen, & Guodong Zhou: Transductive minimum error rate training for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.641-648. [PDF, 214KB]

(2010) Yanhui Feng, Yu Hong, Zhenxiang Yan, Jianmin Yao, & Qiaoming Zhu: A novel method for bilingual web page acquisition from search engine web records. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.294-302. [PDF, 184KB]

Star Corporation

(2010) Bin Lu, Tao Jiang, Kapo Chow, & Benjamin K. Tsou: Building a large English-Chinese parallel corpus from comparable patents and its experimental application to SMT.  [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.42-49. [PDF, 271KB]

Sun Yat-sen University

(2010) Luo Xuejuan: The course design of CAT for undergraduate English majors of Sun Yat-sen University [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 251-272.

Symantec Software (Beijing) Co.Ltd.

(2011) Yanli Sun, Juan Li, & Yi Li: Deploying MT into a localisation workflow: pains and gains. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.239-243. [PDF, 83KB]

Tianjin University

(2012) Hong Sun & Ming Zhou: Joint learning of a dual SMT system for paraphrase generation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.38-42. [PDF, 121KB]

(2011) Nan Duan, Mu Li, & Ming Zhou: Hypothesis mixture decoding for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1258-1267. [PDF, 1027KB]

(2011) Nan Duan, Mu Li, Ming Zhou, & Lei Cui: Improving phrase extraction via MBR phrase scoring and pruning. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.189-197. [PDF, 302KB]

(2010) Nan Duan, Hong Sun, & Ming Zhou: Translation model generalization using probability averaging for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.304-312. [PDF, 296KB]

(2010) Nan Duan, Mu Li, Dongdong Zhang, & Ming Zhou: Mixture model-based minimum Bayes risk decoding using multiple machine translation systems. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.313-321. [PDF, 372KB]

(2010) Junguo Zhu, Muyun Yang, Bo Wang, Sheng Li, & Tiejun Zhao: All in strings: a powerful string-based automatic MT evaluation metric with multiple granularities. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1533-1540. [PDF, 159KB]

Toshiba Corporation (China)

(2011) Hirokazu Suzuki: Automatic post-editing based on SMT and its selective application by sentence-level automatic quality evaluation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.156-163. [PDF, 725KB]

Tsinghua University

(2015) Gerard de Melo: Wiktionary-based word embeddings. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.346-359. [PDF, 709KB] 

(2014) Lars Borin, Jens Allwood, & Gerard de Melo: Bring vs.MTRoget: evaluating automatic thesaurus translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2115-2121. [PDF, 252KB]

(2013) Peng Li, Yang Liu & Maosong Sun: Recursive autoencoders for ITG-based translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.567-577. [PDF, 304KB]

(2013) Yang Liu: A shift-reduce parsing algorithm for phrase-based string-to-dependency translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1-10. [PDF, 290KB]

(2012) Yang Feng, Yang Liu, Qun Liu, & Trevor Cohn: Left-to-right tree-to-string decoding with prediction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1191-1200. [PDF, 511KB]

(2012) Peng Li, Yang Liu, & Maosung Sun: A beam search algorithm for ITG word alignment. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.673-682. [PDF, 471KB]

(2012) Chunyang Liu, Qi Liu, Yang Liu, & Maosong Sun: THUTR: a translation retrieval system. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 321-328. [PDF, 306KB]

(2012) Zhaopeng Tu, Yang Liu, Yifan He, Josef van Genabith, Qun Liu, & Shouxun Lin: Combining multiple alignments to improve machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1249-1260. [PDF, 178KB]

(2012) Yunqing Xia, Guoyu Tang, Peng Jin, & Xia Yang: CLTC: a Chinese-English cross-lingual topic corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.532-537. [PDF, 955KB]

(2012) Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, & Shouxun Lin: Unsupervised discriminative induction of synchronous grammar for machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2883-2898. [PDF, 252KB]

(2011) Jing He & Hongyu Liang: Word-reordering for statistical machine translation using trigram language model. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1288-1293. [PDF, 513KB]

(2011) Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, & Haizhou Li: Joint alignment and artificial data generation: an empirical study of pivot-based machine transliteration. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1207-1215. [PDF, 319KB]

University of the Chinese Academy of Sciences

(2014) Sitong Yang, Heng Yu, Hongmei Zhao, Qun Liu, & Yajuan Lü: Review and analysis of China workshop on machine translation 2013 evaluation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.208-221. [PDF, 767KB]

(2013) Fandong Meng, Jun Xie, Linfeng Song, Yajuan Lü, & Qun Liu: Translation with source constituency and dependency trees. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1066-1076. [PDF, 341KB]

University of Science and Technology of China

(2013) Nan Yang, Shujie Liu, Mu Li, Ming Zhou, & Nenghai Yu: Word alignment modeling with context dependent deep neural network. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.166-175. [PDF, 554KB]

(2013) Zede Zhu, Miao Li, Lei Chen, & Zhenxin Yang: Building comparable corpora based on bilingual LDA model.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.278-281. [PDF, 127KB]

(2012) Nan Yang, Mu Li, Dongdong Zhang, & Nenghai Yu: A ranking-based approach to word reordering for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.912-920. [PDF,

(2010) Wen Li, Lei Chen, Wudabala, & Miao Li: Chained machine translation using morphemes as pivot language. [Coling 2010] Proceedings of the 8th Workshop on Asian Language Resources, Beijing, China, 21-22 August 2010; pp.169-177. [PDF, 170KB]

Xiamen University

(2013) Xiaodong Shi, Yidong Chen, & Xiuping Huang: Key problems in conversion from simplified to traditional Chinese characters. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.287-293. [PDF, 273KB]

 (2013) Tian Xia, Zongcheng Ji, Shaodan Zhai, Yidong Chen, Qun Liu, & Shaojun Wang: Improving alignment of system combination by using multi-objective optimization. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.535-544. [PDF, 204KB]

(2013) Heng Yu, Jinsong Su, Yajuan Lü, & Qun Liu: A topic-triggered language model for statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.447-454. [PDF, 130KB]

(2012) Jinsong Su, Hua Wu, Haifeng Wang, Yidong Chen, Xiaodong Shi, Huailin Dong, & Qun Liu: Translation model adaptation for statistical machine translation with monolingual topic information. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.459-468. [PDF, 368KB]

(2011) Xiaodong Shi, Xiang Zhu, & Yidong Chen: Improving the hierarchical phrase-based translation model. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.383-388. [PDF, 174KB]

Xi’an University of Technology

(2011) Jie Jiang, Jinhua Du, & Andy Way: Incorporating source-language paraphrases into phrase-based SMT with confusion networks. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.31-40. [PDF, 1325KB]

Yahoo (China)

(2010) Lei Shi, Rada Mihalcea, & Mingjun Tian: Cross language text classification by model translation and semi-supervised learning. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.1057-1067. [PDF, 525KB]

Youdao Inc

(2012) Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, & Shouxun Lin: Unsupervised discriminative induction of synchronous grammar for machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2883-2898. [PDF, 252KB]

Yuan Ze University

(2012) Guo-Hau Lai, Ying-Mei Guo, & Richard Tzong-Han Tsai: Unsupervised Japanese-Chinese opinion word translation using dependency distance and feature-opinion association weight. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1503-1517. [PDF, 254KB]

Zhejiang University

(2012) Yong Zhuang, Guillaume Wisniewski, & François Yvon: Non-linear models for confidence estimation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.157-162. [PDF, 427KB]

 

Colombia

Universidad Nacional de Colombia

(2013) Sergio Jimenez, Claudia Becerra, & Alexander Gelbukh: SOFTCARDINALITY: learning to identify directional cross-lingual entailment from cardinalities and SMT. [SEMEVAL-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation (SemEval 2013), Atlanta, Georgia, June 14-15, 2013; pp.118-123. [PDF, 144KB]

 

Croatia

Institute of Croatian Language and Linguistics

(2012) Damir Cávar: Bootstrapping NLP and MT resources for under-resourced languages. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; p.31. [PDF]

Rijeka University

(2012) Sanja Seljan, Tomislav Vičić, & Marija Brkić: BLEU evaluation of machine-translated English-Croatian legislation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2143-2148. [PDF, 332KB]

 (2011) Marija Brkić, Sanja Seljan, & Maja Matetić: Machine translation evaluation for Croatian-English and English-Croation language pairs. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.93-104. [PDF, 768KB]

Zagreb University

 (2014) Miquel Esplà-Gomis, Filip Klubička, Nikola Ljubešić, Sergio Ortiz-Rojas, Vassilis Papavassiliou, & Prokopis Prokopidis: Comparing two acquisition systems for automatically building an English-Croatian parallel corpus from multilingual websites.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1252-1258. [PDF, 158KB]

(2014) Nikola Ljubešić & Antonio Toral: caWaC – a web corpus of Catalan and its application to language modeling an machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1728-1732. [PDF, 138KB]

(2014) Raphael Rubino, Antonio Toral, Nikola Ljubešić, & Gema Ramírez-Sánchez: Quality estimation for synthetic parallel data generation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1843-1849. [PDF, 248KB]

(2013) Marianna Apidianaki, Nikola Ljubešić, & Darja Fišer: Cross-lingual WSD for translation extraction from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.1-10. [PDF, 388KB]

 (2013) Nikola Ljubešić & Darja Fišer: Identifying false friends between closely related languages. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.69-77. [PDF, 342KB]

(2012) Nikola Ljubešić, Špela Vintar, & Darja Fišer: Multi-word term extraction from comparable corpora by combining contextual and constituent clues.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.143-147. [PDF, 376KB]

(2012) Hrvoje Peradin & Francis Tyers: A rule-based machine translation system from Serbo-Croatian to Macedonian. In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina España-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.55-63. [PDF, 431KB]

(2012) Sanja Seljan, Tomislav Vičić, & Marija Brkić: BLEU evaluation of machine-translated English-Croatian legislation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2143-2148. [PDF, 332KB]

(2012) Max Silberztein, Tamás Váradi, & Marko Tadić: Open source multi-platform NooJ for NLP. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 401-408. [PDF, 557KB]

(2012) Tamás Váradi & Marko Tadić: Central and South-East European resources in META-SHARE. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 431-437. [PDF, 1205KB]

 (2011) Marija Brkić, Sanja Seljan, & Maja Matetić: Machine translation evaluation for Croatian-English and English-Croation language pairs. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.93-104. [PDF, 768KB]

(2011) Darja Fišer & Nikola Ljubešić: Bilingual lexicon extraction from comparable corpora for closely related languages.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.125-131. [PDF, 95KB]

(2011) Darja Fišer, Nikola Ljubešić, Špela Vintar, & Senja Pollak: Building and using comparable corpora for domain-specific bilingual lexicon extraction. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.19-26. [PDF, 287KB]

(2011) Marko Tadić: The CESAR project: enabling LRT for 70m+ speakers.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 24 slides [PDF of PPT, 810KB]

 

Cuba

Matanzas University

(2013) Yoan Gutiérrez, Yenier Castañeda, Andy González, Rainel Estrada, Dennys D.Ping, Jose I.Abreu, Roger Pérez, Antonio Fernández Orquín, Andrés Mintoyo, Rafael Muñoz, & Franc Camara: UMICC_DLSI: reinforcing a ranking algorithm with sense frequencies and multidimensional semantic resources to solve multililingual word sense disambiguation. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.241-249. [PDF, 489KB]

 

Czech Republic

Charles University, Prague

(2014) Petra Barančíková: Parmesan: Meteor without paraphrases with paraphrased references. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.355-361. [PDF, 305KB]

(2014) Ondřej Bojar, Vojtěch Diatka, Pavel Rychlý, Pavel Straňák, Vit Suchomel, Aleš Tamchyna, & Daniel Zeman: HindEnCorp – Hindi-English and Hindi-only corpus for machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3550-3555. [PDF, 107KB]

(2014) Petra Barančiková, Rudolf Rosa, & Aleš Tamchyna: Improving evaluation of English-Czech MT through paraphrasing.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 127KB]

(2014) Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, & Daniel Zeman: Machine translation of medical texts in the Khresmoi project. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.221-228. [PDF, 513KB]

(2014) Bushra Jawaid & Ondřej Bojar: Two-step machine translation with lattices. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.682-686. [PDF, 67KB]

(2014) Jindřich Libovický & Pavel Pecina: Tolerant BLEU: a submission to the WMT14 metrics task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.409-413. [PDF, 360KB]

(2014) Matouš Macháček & Ondřej Bojar: Results of the WMT14 metrics shared task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.293-301. [PDF, 331KB]

(2014) Aleš Tamchyna, Martin Popel, Rudolf Rosa, & Ondřej Bojar: CUNI in WMT14: Chimera still awaits Bellerophon. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.195-200. [PDF, 178KB]

(2014) Zdeňka Urešová, Ondřej Dušek, Jan Hajič, & Pavel Pecina: Multilingual test sets for machine translation of search queries for cross-lingual information retrieval in the medical domain. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3244-3247. [PDF, 116KB]

(2014) Pavel Vondřička: Aligning parallel texts with InterText. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1875-1879. [PDF, 476KB]

(2014) Nianwen Xue, Ondřej Bojar, Jan Hajič, Martha Palmer, Zdeňka Urešová, & Xiuhong Zhang: Not an interlingua, but close: comparison of English AMRs to Chinese and Czech. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1765-1772. [PDF, 417KB]

(2013) Karel Bílek & Daniel Zeman: CUni multilingual matrix in the WMT 2013 shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.85-91. [PDF, 246KB]

(2013) Ondřej Bojar, Christian Buck, Chris Callison-Burch, Christian Federmann, Barry Haddow, Philipp Koehn, Christof Monz, Matt Post, Radu Soricut & Lucia Specia: Findings of the 2013 Workshop on Statistical Machine Translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.1-44. [PDF, 1767KB]

(2013) Ondřej Bojar, Rudolf Rosa & Aleš Tamchyna: Chimera – three heads for English-to-Czech translation.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.92-98. [PDF, 175KB]

(2013) Petra Galuščáková, Martin Popel & Ondřej Bojar: PhraseFix: statistical post-editing of TectoMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.141-147. [PDF, 157KB]

(2013) Nathan David Green & Zdeněk Žabokrtský: Improvements to syntax-based machine translation using ensemble dependency parsers. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 19-24. [PDF, 202KB]

(2013) Matouš Macháček & Ondřej Bojar: Results of the WMT13 metrics shared task.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.45-51. [PDF, 153KB]

(2013) Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová, & Eva Hajičová: (Pre-)annotation of topic-focus articulation in Prague Czech-English dependency treebank. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.55-63. [PDF, 990KB]

 (2013) Thomas Meyer & Lucie Poláková: Machine translation with many manually labeled discourse connectives. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.43-50. [PDF, 307KB]

(2013) Michal Novák, Anna Nedoluzhko, & Zdeněk Žabokrtský: Translation of “it” in a deep syntax framework. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.51-59. [PDF, 384KB]

(2013) Michal Novák, Zdeněk Žabokrtský & Anna Nedoluzhko: Two case studies on translating pronouns in a deep syntax framework. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1037-1041. [PDF, 348KB]

(2013) Rudolf Rosa, David Mareček, & Aleš Tamchyna: Deepfix: statistical post-editing of statistical machine translation using deep syntactic analysis. ACL Student Research Workshop, Sofia, Bulgaria, August 4-9 2013; pp.172-179. [PDF, 178KB]

 (2013) Zdenka Uresova, Jana Sindlerova, Eva Fucikova, & Jan Hajic: An analysis of annotation of verb-noun idiomatic combinations in a parallel dependency corpus. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, 13-14 June, 2013; pp.58-63. [PDF, 115KB]

(2012) Eleftherios Avramidis, Marta R.Costa-jussà, Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: A richly annotated, multilingual parallel corpus for hybrid machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2189-2193. [PDF, 433KB]

(2012) Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, & Daniel Zeman: Automatic MT error analysis: Hjerson helping Addicter.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2158-2163. [PDF, 580KB]

(2012) Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, & Aleš Tamchyna: The joy of parallelism with CzEng 1.0.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3921-3928. [PDF, 460KB]

(2012) Ondřej Bojar, Bushra Jawaid, & Amir Kamran: Probes in a taxonomy of factored phrase-based models. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.253-260. [PDF, 134KB]

(2012) Ondřej Bojar & Dekai Wu: Towards a predicate-argument evaluation for MT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.30-38. [PDF, 126KB]

(2012) Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, & David Mareček: Formemes in English-Czech deep syntactic MT. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.267-274. [PDF, 316KB]

(2012) Christian Federmann, Eleftherios Avramidis, Marta R.Costa-jussà, Maite Melero, Pavel Pecina, & Josef van Genabith: The ML4HMT workshop on optimising the division of labour in hybrid machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3430-3435. [PDF, 631KB]

(2012) Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: Towards optimal choice selection for improved hybrid machine translation. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.5-22. [PDF, 268KB]

(2012) Mark Fishel, Ondřej Bojar, & Maja Popović: Terra: a collection of translation error-annotated corpora.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.7-14. [PDF, 367KB]

(2012) Mark Fishel, Rico Sennrich, Maja Popović, & Ondřej Bojar: TerrorCat: a translation error categorization-based MT quality metric. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.64-70. [PDF, 116KB]

(2012) Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučiková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiři Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, & Zdeněk Žabokrtský: Announcing Prague Czech-English dependency treebank 2.0.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3153-3160. [PDF, 544KB]

(2012) Eva Hajičová & Petr Sgall: Formal models and practice of annotation [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.17-18. [PDF]

(2012) Raymond Hendy Susanto, Septina Dian Larasati, & Francis M.Tyers: Rule-based machine translation between Indonesian and Malaysian. COLING 2012: Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language processing (SANLP), Mumbai, December 2012; pp.191-199. [PDF, 178KB]

(2012) Vladislav Kuboň: Machine translation among related Slavic languages [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.13-14. [PDF]

(2012) Septina Dian Larasati: IDENTIC corpus: morphologically enriched Indonesian-English parallel corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.902-906. [PDF, 719KB]

(2012) Septina Dian Larasati: Improving word alignment by exploiting adapted word similarity. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.41-45. [PDF, 227KB]

(2012) Jiří Maršík & Ondřej Bojar: TrTok: a fast and trainable tokenizer for natural languages. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.75-85. [PDF, 141KB]

(2012) Pavel Pecina, Antonio  Toral, Vassilis Papavassiliou, Prokopis Prokopidis, & Josef van Genabith: Domain adaptation of statistical machine translation using web-crawled resources: a case study. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.145-152. [PDF, 201KB]

(2012) Pavel Pecina, Antonio Toral, & Josef van Genabith: Simple and effective parameter tuning for domain adaptation of statistical machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2209-2224. [PDF, 166KB]

(2012) Loganathan Ramasamy, Ondřej Bojar, & Zdeněk Žabokrtský: Morphological processing for English-Tamil statistical machine translation. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.113-122. [PDF, 542KB]

(2012) Loganathan Ramasamy, Zdeněk Žabokrtský, & Sowmya Vajjala: The study of effect of length in morphological segmentation of agglutinative languages. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.18-24. [PDF, 151KB]

(2012) Rudolf Rosa, David Mareček, & Ondřej Dušek: DEPFIX: a system for automatic correction of Czech MT outputs. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.362-368. [PDF, 114KB]

(2012) Rudolf Rosa, Ondřej Dušek, David Mareček, & Martin Popel: Using parallel features in parsing of machine-translated sentences for correction of grammatical errors. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.39-48. [PDF, 186KB]

(2012) Aleš Tamchyna, Petra Galuščáková, Amir Kamran, Miloš Stanojević, & Ondřej Bojar: Selecting data for English-to-Czech machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.374-381. [PDF, 110KB]

(2012) Antonio Toral, Marc Poch, Pavel Pecina, & Gregor Thurmair: Efficiency-based evaluation of aligners for industrial applications. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.57-60. [PDF, 210KB]

(2012) Kateřina Veselovská, Nguy Giang Linh, & Michal Novák: Using Czech-English parallel corpora in automatic identification of it.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.112-120. [PDF, 306KB]

(2012) Daniel Zeman: Data issues of the multilingual translation matrix. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.395-400. [PDF, 225KB]

(2011) Jan Berka, Martin Černý, & Ondřej Bojar: Quiz-based evaluation of machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.77-86. [PDF, 190KB]

 (2011) Ondřej Bojar: Analyzing error types in English-Czech machine translation. Prague Bulletin of Mathematical Linguistics 95, April 2011; pp.63-76. [PDF, 166KB]

 (2011) Ondřej Bojar, Miloš Ercegovčević, Martin Popel, & Omar F.Zaidan: A grain of salt for the WMT manual evaluation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.1-11. [PDF, 191KB]

(2011) Ondřej Bojar & Aleš Tamchyna: Forms wanted: training SMT on monolingual data. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 75KB];  presentation: 20 slides [PDF of PPT, 111KB]

 (2011) Ondřej Bojar & Aleš Tamchyna: Improving translation model by monolingual data . [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.330-336. [PDF, 107KB]

(2011) Ondřej Bojar: Rich morphology and what can we expect from hybrid approaches to MT [abstract]. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; p.12. [PDF, 139KB]

(2011) Nathan Green: Effects of noun phrase bracketing in dependency parsing and machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.69-74. [PDF, 288KB]

(2011) Jan Hajič: Machine translation research in META-NET.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 17 slides [PDF of PPT, 1735KB]

(2011) Bushra Jawaid & Daniel Zeman: Word-order issues in English-to-Urdu statistical machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.87-106. [PDF, 1006KB]

(2011) Septina Dian Larasati & Vladislav Kuboň: A study of Indonesian-to-Malaysian MT system. MALINDO 2010: the 4th International MALINDO Workshop, Depok, Indonesia, August 2nd 2010; 7pp. [PDF, 363KB]

 (2011) Matouš Macháček & Ondřej Bojar: Approximating a deep-syntactic metric for MT evaluation and tuning. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.92-98. [PDF, 130KB]

 (2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Martin Popel, David Mareček, Nathan Green, & Zdeněk Žabokrtský: Influence of parser choice on dependency-based MT. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.433-439. [PDF, 113KB]

(2011) Česlav Przywara & Ondřej Bojar: eppex: epochal phrase table extraction for statistical machine translation. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.89-98. [PDF, 137KB]; presentation, 24 slides [PDF of PPT, 118KB]

(2011) Johanka Spoustová & Miroslav Spousta: Comparable fora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.96-101. [PDF, 80KB]

(2011) Daniel Zeman: Hierarchical phrase-based MT at the Charles University for the WMT 2011 shared task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.496-500. [PDF, 274KB]

(2011) Dan Zeman, Mark Fishel, Jan Berka, & Ondřej Bojar: Addicter: what’s wrong with my translations? Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.79-88. [PDF, 276KB]; presentation, 20 slides [PDF of PPT, 359KB]

(2010) Ondřej Bojar & Kamil Kos: 2010 failures in English-Czech phrase-based MT.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 60-66. [PDF,

(2010) Ondřej Bojar, Kamil Kos, & David Mareček: Tackling sparse data issue in machine translation evaluation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.86-91. [PDF, 132KB]

(2010) Ondřej Bojar, Adam Liška, & Zdeněk Žabokrtský: Evaluating utility of data sources in a large parallel Czech-English corpus CzEng 0.9. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.447-452. [PDF, 359KB]

(2010) Ondřej Bojar, Pavel Straňák, & Daniel Zeman: Data issues in English-to-Hindi machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1771-1777. [PDF, 557KB]

(2010) Jan Hajic: Building bridges using innovative approaches in machine translation. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 24 slides [PDF of PPT, 1109KB]

(2010) Michael Jellinghaus, Alexandros Poulis, & David Kolovratnik: Exodus – exploring SMT for EU institutions.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 110-114. [PDF, 109KB]

(2010) David Mareček, Martin Popel, & Zdeněk Žabokrtský: Maximum entropy translation model in dependency-based MT framework. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 201-206. [PDF, 254KB]

(2010) Drahomira (Johanka) Spoustová & Miroslav Spousta: Dependency parsing as a sequence labeling task. Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.7-14. [PDF, 271KB]

(2010) Jernej Vičič & Petr Homola: Speeding up the implementation process of a shallow transfer machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

(2010) Daniel Zeman: Hierarchical phrase-based MT at the Charles University for the WMT 2010 shared task.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 212-215. [PDF, 269KB]

Codesign s.r.o.

(2012) Petr Homola: A machine translation toolchain for polysynthetic languages. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.65-68. [PDF, 211KB]

Czech Technical University

(2014) Eduard Šubert & Ondřej Bojar: Twitter Crowd Translation – design and objectives. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.217-227. [PDF, 324KB]

(2011) Jan Berka, Martin Černý, & Ondřej Bojar: Quiz-based evaluation of machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.77-86. [PDF, 190KB]

IBM Prague Research Lab

(2013) Martin Čmejrek, Haitao Mi, & Bowen Zhou: Flexible and efficient hypergarph interactions for joint hierarchical and forest-to-string decoding. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.545-555. [PDF, 195KB]

Institute of the Czech Language

(2012) Karel Oliva: Czech contribution to… session “Plans for language technology research and innovations in member states and regions” META-FORUM, Brussels, June 19-21, 2012; 5 slides. [PDF of PPT, 57KB]

Masaryk University

(2014) Ondřej Bojar, Vojtěch Diatka, Pavel Rychlý, Pavel Straňák, Vit Suchomel, Aleš Tamchyna, & Daniel Zeman: HindEnCorp – Hindi-English and Hindi-only corpus for machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3550-3555. [PDF, 107KB]

(2014) Adam Kilgarriff, Miloš Jakubíček, Vojtěch Kovář, Pavel Rychlý, Vít Baisa, & Lucia Kocincová: Extrinsic corpus evaluation with a collocation dictionary task. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.545-552. [PDF, 143KB]

Moravia Worldwide

(2011) Tomáš Hudík & Achim Ruopp: The integration of Moses into localization industry. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.47-53. [PDF, 449KB]

Prague: University see Charles University, Prague

Universita Karlovy see Charles University, Prague

Ústav Formální a Aplikované Lingvistiky MFF, Charles University (Prague) see Charles University, Prague

 

Denmark

Aarhus University

(2011) Tina Paulsen Christensen & Anne Schjoldager: The impact of translation-memory (TM) technology on cognitive processes: student-translators’ retrospective comments in an online questionnaire. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.119-130. [PDF, 863KB]

Copenhagen Business School

(2014) Michael Carl, Mercedes García Martínez, Bartolomé Mesa-Lao, & Nancy Underwood: CFT13: a resource for research into the post-editing process. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1757-1764. [PDF, 1011KB]

(2014) Nancy Underwood, Bartolomé Mesa-Lao, Mercedes García Martínez, Michael Carl, Vicent Alabau,  Jesús González-Rubio, Luis A.Leiva, Germán Sanchis-Trilles, Daniel Ortíz-Martínez, & Francisco Casacuberta: Evaluating the effects of interactivity in a post-editing workbench. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.553-559. [PDF, 268KB]

(2014) Mercedes García-Martínez, Karan Singla, Aniruddha Tammewar, Bartolomé Mesa-Lao, Ankita Thakur, Anusuya M.A., Srinivas Bangalore, & Michael Carl: SEECAT: ASR & Eye-tracking enabled computer-assisted translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.81-88. [PDF, 624KB]

(2013) Vicent Alabau, Ragnar Bonk, Christian Buck, Michael Carl, Francisco Casacuberta, Mercedes García-Martínez, Jesús González, Philipp Koehn, Luis Leiva, Bartolomé Mesa-Lao, Daniel Ortiz, Hervé Saint-Amand, Germán Sanchis, & Chara Tsiukala: Advanced computer aided translation with a web-based workbench. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 55-62. [PDF, 699KB]

(2013) V.Alabau, J.González-Rubio, L.A.Leiva, D.Ortiz-Martínez, G.Sanchis-Trilles, F.Casacuberta, B.Meso-Lao, R.Bonk. M.Carl, & M.García-Martínez: User evaluation of advanced interaction features for a computer-assisted translation workbench. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.361-368. [PDF, 1166KB]

(2013) Philipp Koehn, Michael Carl, Francisco Casacuberta, & Eva Marcos: CASMACAT: Cognitive Analysis and Statistical Methods for Advanced Computer Aided Translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.411. [PDF,  219KB]

(2013) Abhijit Mishra, Pushpak Bhattacharyya, & Michael Carl: Automarically predicting sentence translation difficulty.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.346-351. [PDF, 430KB]

(2012) Ragnar Bonk, Michael Carl, Jakob Elming: CASMACAT workbench prototype 1. [In: Technology Showcase Guide]. AMTA-2012. Proceedings, San Diego, November 1, 2012; 2pp.

(2012) Michael Carl: The CRITT TPR-DB 1.0: a database for empirical human translation process research. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 843KB]

(2012) Michael Carl: Human translation process research and the CRITT TPR-database. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 29 slides [PDF of PPT, 6145KB]

(2012) Michael Carl & Barbara Dragsted: Inside the monitor model: processes of default and challenged translation production. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.127-145. [PDF, 2189KB]

(2012) Andreas Kirkedal: Tree-based hybrid machine translation.  EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.77-86. [PDF, 260KB]

(2012) Iørn Korzen & Morten Gylling: Text structure in a contrastive and translational perspective: on information density and clause linkage in Italian and Danish. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.23-46. [PDF, 601KB]

(2012) Bartolomé Mesa-Lao: The next generation translator’s workbench: post-editing in CASMACAT v.1.0. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 11pp. [PDF, 87KB], presentation: 16 slides [PDF, 311KB]

(2011) Michael Carl, Barbara Dragsted & Arnt Lykke Jakobsen: On the systematicity of human translation processes. Tralogy, Paris, 3-4 March 2011; 11pp. [PDF, 915KB]

(2011) Michael Carl: Patterns of shallow text production in translation. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.143-151. [PDF, 951KB]

(2011) Michael Carl, Barbara Dragsted, Jakob Elming, Daniel Hardt, & Arnt Lykke Jakobsen: The process of post-editing: a pilot study. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.131-142. [PDF, 1078KB]

(2011) Jakob Elming & Martin Haulrich: Reordering by parsing. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.35-42. [PDF, 361KB]

(2011) Fumiko Kano Glückstad: Application of classical psychology theory to terminological ontology alignment. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.227-238. [PDF, 1187KB]

 (2011) Anders Søgaard & Martin Haulrich: Sentence-level instance-weighting for graph-based and transition-based dependency parsing. IWPT 2011: 12th International Confernce on Parsing Technologies, October 5-7, 2011, Dublin City University; pp.43-47. [PDF, 86KB]

(2010) Michael Carl: A computational framework for a cognitive model of human translation processes. Translating and the Computer 32, 18-19 November 2010, London; 19pp. [PDF, 1314KB]

(2010) Michael Carl & Matthias Buch-Kromann: Correlating translation product and translation process data of professional and student translators. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 1205KB]

(2010) Daniel Hardt & Jakob Elming: Incremental re-training for post-editing SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 191KB]

Copenhagen University

 (2013) Hanne Fersøe, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen, & Claus Povlsen: Let’sMT! as a learning platform for SMT. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.353-359. [PDF, 571KB]

(2012) Lene Offersgaard & Dorte Haltrup Hansen: SMT systems for less-resourced languages based on domain-specific data.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.75-80. [PDF, 380KB]

(2012) Bolette Sandford Pedersen: The META-NET language white paper series: overview and key results. META-FORUM, Brussels, June 19-21, 2012; 13 slides. [PDF of PPT, 3758KB]

 (2011) Anders Søgaard & Martin Haulrich: Sentence-level instance-weighting for graph-based and transition-based dependency parsing. IWPT 2011: 12th International Confernce on Parsing Technologies, October 5-7, 2011, Dublin City University; pp.43-47. [PDF, 86KB]

(2012) Jörg Tiedemann, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen, & Matthias Zumpe: A distributed resource repository for cloud-based machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2207,2213. [PDF, 441KB]

(2012) Andrejs Vasiļjevs, Markus Forsberg, Tatiana Gornostay, Dorte H.Hansen, Kristin M.Jóhannsdóttir, Krister Lindén, Gunn I.Lyse, Lene Offersgaard, Ville Oksanen, Sussi Olsen, Bolette S.Pedersen, Eiríkur Rögnvaldsson, Roberts Rozis, Inguna Skadiņa, & Koenraad De Smet: Creation of an open shared language resource repository in the Nordic and Baltic countries.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1076-1083. [PDF, 1336KB]

(2011) Steffen Hedegaard & Jakob Grue Simonsen: Lost in translation: authorship attribution using frame semantics. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.65-70. [PDF, 109KB]

(2011) Christian Rishøj & Anders Søgaard: Factored translation with unsupervised word clusters. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.447-451. [PDF, 163KB]

(2011) Anders Søgaard: Data point selection for cross-language adaptation of dependency parsers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.682-686. [PDF, 124KB]

(2010) Mossab Al-Hunaity: Utilizing web service technology to create Danish Arabic language resources. LREC 2010: Web Services and Processing Pipelines in HLT - WSPP2010 Workshop, 17 May 2010, Valletta, Malta; 20 slides. [PDF of PPT, 505KB]

(2010) Mossab Al-Hunaity, Bente Maegaard, & Dorte Hansen: Using English as a pivot language to enhance Danish-Arabic statistical machine translation. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 108-113. [PDF, 768KB]

(2010) Anders Søgaard: Can inversion transduction grammars generate hand alignments? EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 6pp. [PDF, 602KB]; presentation by Michael Carl: 19 slides [PDF, 122KB]

Danish Language Council

(2010) Sabine Kirchmeier-Andersen: Linguistic diversity and language change – future challenges for MT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 34pp. [PDF, 3094KB]

LanA Consulting

(2013) Svetlana Sheremetyeva: On integrating hybrid and rule-based components for patent MT with several levels of output. [MT Summit XIV] Proceedings of the 5th Workshop on Patent Translation, Nice, September 2, 2013; pp.8-15. [PDF, 1061KB]

SAS Institute

(2010) Ronan Martin: Creating your own translation memory repository. Translating and the Computer 32, 18-19 November 2010, London; 7pp. [PDF, 200KB]

 

Ecuador

Cuenca, Universidad de

(2011) John McCrae, Maurizio Espinoza, Elena Monteil-Ponsoda, Guadalupe Aguado-de-Cea, & Philipp Cimiano: Combining statistical and semantic approaches to the translation of ontologies and taxonomies. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.116-125. [PDF, 576KB]

 

Egypt

Alexandria University

(2012) Sameh Alansary: A formalized reference grammar for UNL-based machine translation between English and Arabic. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.33-42. [PDF, 352KB]

American University in Cairo

(2012) Amira Shoukry & Ahmed Rafea: Preprocessing Egyptian dialect tweets for sentiment mining. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.47-56. [PDF, 1193KB]

(2010) Mohamed Altantawy, Nizar Habash, Owen Rambow, & Ibrahim Saleh: Morphological analysis and generation of Arabic nouns: a morphemic functional approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.851-858. [PDF,               899KB]

Cairo Microsoft Innovation Center

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2010) Kareem Darwish: Transliteration mining with phonetic conflation and iterative training. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.39-47. [PDF, 465KB]

Cairo University

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

IBM Cairo Technology and Development Center

(2010) Sara Noeman & Amgad Madkour: Language independent transliteration mining system using finite state automata framework. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.57-61. [PDF, 120KB]

 

Estonia

Tallin University of Technology

(2010) Maxim Khalilov, Lauma Pretkalniņa, Natalja Kuvaldina, & Veronika Pereseina: SMT of Latvian, Lithuanian and Estonian languages: a comparative study. Human Language Technologies – the Baltic Perpective,  International Conference, Riga, Latvia, October 8, 2010; 8pp. [PDF, 145KB]

Tartu University

(2010) Mark Fishel & Heiki-Jaan Kaalep: CorporAl: a method and tool for handling overlapping parallel corpora. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.67-76. [PDF, 149KB]; presentation [PDF, 1037KB]

(2010) Mark Fishel & Harri Kirik: Linguistically motivated unsupervised segmentation for machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1741-1745. [PDF, 308KB]

Ethiopia

Haramaya University

(2010) Sisay Adugna & Andreas Eisele: English-Oromo machine translation: an experiment using a statistical approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2196-2199. [PDF, 368KB]

 

European Union

European Captioning Institute

(2010) Yota Georgakopoulou: Challenges for the audiovisual industry in the digital age: accessibility and multilingualism. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 16pp. [PDF, 830KB]

European Commission

(2014) Najeh Hajlaoui, David Kolovratnik, Jaakko Väyrynen, Ralf Steinberger, & Daniel Varga: DCEP – digital corpus of the European Parliament. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3164-3171. [PDF, 252KB]

(2012) Roberto Cencioni: Digital content & cognitive systems, language technologies. META-FORUM, Brussels, June 20-21, 2012; 32 slides. [PDF of PPT, 386KB]

(2012) Yu Chen & Andreas Eisele: MultiUN v2: UN documents with multilingual alignments.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2500-2504. [PDF, 405KB]

 (2012) Markus Foti: MT@EC: working with translators. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 95KB], presentation: 23 slides [PDF, 588KB]

(2012) Edouard Geoffrois: National plans and international connections and synergies. META-FORUM, Brussels, June 19-21, 2012; 8 slides. [PDF of PPT, 143KB]

(2012) Márta Nagy-Rothengass: Language technologies in Europe: trends and future perspectives. HLT Days 27-28 September 2012, Warsaw; 23 slides [PDF of PPT, 422KB]

(2012) Kimmo Rossi: European Commission [calls]. HLT Days 27-28 September 2012, Warsaw; 22 slides [PDF of PPT, 320KB]

(2012) Kimmo Rossi: European Commission: upcoming [programmes]. META-FORUM, Brussels, June 19-21, 2012; 13 slides. [PDF of PPT, 257KB]

(2012) Kimmo Rossi: Language technology at the crossroads [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.1-2. [PDF]

(2012) Ralf Steinberger, Andreas Eisele, Szymon Klocek, Spyridon Pilos, & Patrick Schlüter: DGT-TM: a freely available translation  memory in 22 languages.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.454-459. [PDF, 471KB]

(2011) Roberto Cencioni: 3 years already and still a long way to go…  [EC support for LT] META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 18 slides [PDF of PPT, 192KB]

(2011) Andreas Eisele & Caroline Lavecchia: Using statistical machine translation for computer-aided translation at the European Commission. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.3-12. [PDF, 1125KB]

(2011) Maria José Machado & Hilário Leal Fontes: Machine translation: case study – English into Portuguese – evaluation of Moses in DGT Portuguese Language Department using Moses for Mere Mortals. A Folha: Boletim da Língua Portuguesa nas Instituições Europeias, no.37, autumn 2011; English version, 11pp. [PDF, 967KB]

(2011) Rytis Matikonis: Language tools and technologies at the European Commission. Tralogy, Paris, 3-4 March 2011; 12 slides. [PDF, 627KB]

(2011) Spyridon Pilos: Machine translation at the European Commission.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 17 slides [PDF of PPT, 309KB]

(2011) Conrad Toft: Interinstitutional Language Technology Watch: language technology at the European institutions.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 627KB]

(2010) Josep Bonet: Is machine translation ripe for EU translators? Invited talk at: 7th International Workshop on Spoken Language Translation, 2December 2010, Paris, France; 25 slides. [PDF of PPT, 1033KB]

(2010) Roberto Cencioni: EC Info-session on upcoming calls and further actions in support of language technology and innovation. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 35 slides [PDF of PPT, 229KB]

(2010) Hanna Klimek: Towards a business forum for language technology stakeholders. Introduction to session “Language Technology in Practice: Needs – Problems – Wishes”. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 8 slides [PDF of PPT, 9403KB]

(2010) Spyridon Pilos: Machine translation at the European Commission. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 13pp. [PDF, 286KB]

(2010) Michael Jellinghaus, Alexandros Poulis, & David Kolovratnik: Exodus – exploring SMT for EU institutions.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 110-114. [PDF, 109KB]

(2010) Kimmo Rossi: DG INFSO, Directorate E, unit E1: Language technologies, machine translation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 9pp. [PDF, 148KB]

European Economic and Social Committee

(2011) Conrad Toft: Interinstitutional Language Technology Watch: language technology at the European institutions.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 627KB]

European Parliament

(2014) Najeh Hajlaoui: SMT for restricted sublanguage in CAT tool context at the European Parliament. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.228-234. [PDF, 163KB]

(2014) Najeh Hajlaoui, David Kolovratnik, Jaakko Väyrynen, Ralf Steinberger, & Daniel Varga: DCEP – digital corpus of the European Parliament. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3164-3171. [PDF, 252KB]

(2013) Susanne Altenberg: The human interpreter in action – multilingualism at the European Parliament. [Keynote at IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.6, 2013; [abstract] 1p. [PDF, 126KB]

(2012) Konstantinos Chatzitheodorou & Alexandros Poulis: Experiments on domain-specific statistical machine translation at the European Parliament. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 6pp. [PDF, 171KB]; presentation by Alexandros Poulis: 16 slides [PDF, 134KB]

(2012) Felicity Hands: The translation process. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; presentation: 88 slides [PDF, 2912KB]

(2012) Alexandros Poulis & David Kolovratnik: To post-edit or not to post-edit? Estimating the benefits of MT post-editing for a European organization. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 9pp. [PDF, 390KB]

(2010) Michael Jellinghaus, Alexandros Poulis, & David Kolovratnik: Exodus – exploring SMT for EU institutions.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 110-114. [PDF, 109KB]

European Patent Office

(2011) Bertrand Le Chapelain: Machine translation project and activities in EPO “the patent case”. Invited talk at [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.1-16. [PDF, 1271KB]

(2011) Wolfgang Täger: The sentence-aligned European patent corpus. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.177-184. [PDF, 234KB]

(2010) Georg Artelsmair: Recent developments in machine translation policy at the European Patent Office. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 22 slides [PDF of PPT, 709KB]

(2010) Patrizia Biani: The evolution of the European machine translation programme at the European Patent Office. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 20pp. [PDF, 4965KB]

(2010) Patrizia Biani: The evolution of the European machine translation programme at the European Patent Office. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; abstract: 1 p. [PDF, 466KB]; presentation: 19 slides [PDF, 498KB]

European Union Bookshop

(2014) Raivis Skadiņš, Jörg Tiedemann, Roberts Rozis & Daiga Deksne: Billions of parallel words for free: building and using the EU Bookshop corpus.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1850-1855. [PDF, 521KB]

European Union of the Deaf

(2010) Philippe Dreuw, Jens Forster, Yannick Gweth, Daniel Stein, Hermann Ney, Gregorio Martinez, Jaume Verges Llahi, Onno Crasborn, Ellen Ormel, Wei Du, Thomas Hoyoux, Justus Piater, Jose Miguel Moya, & Mark Wheatley: SignSpeak – understanding, recognition, and translation of sign languages. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.65-72. [PDF, 6730KB]

European University Institute

(2011) Philippe Lacour, Any Freitas, Aurélien Bénel, Franck Eyraud & Diana Zambon: Translation and the new digital commons. Tralogy, Paris, 3-4 March 2011; 16 pp. [PDF, 313KB]

Joint Research Centre (Ispra)

(2014) Najeh Hajlaoui, David Kolovratnik, Jaakko Väyrynen, Ralf Steinberger, & Daniel Varga: DCEP – digital corpus of the European Parliament. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3164-3171. [PDF, 252KB]

(2014) Guillaume Jacquet, Maud Ehrmann, & Ralf Steinberger: Clustering of multi-word named entity variants: multilingual evaluation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 217KB]

(2013) Alexandra Balahur & Marco Turchi: Improving sentiment analysis in Twitter using multilingual machine translated data. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.49-55. [PDF, 99KB]

(2012) Alexandra Balahur & Marco Turchi: Multilingual sentiment analysis using machine translation. [ACL 2012] Proceedings of the 3rd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, Jeju, Republic of Korea, 12 July 2012; pp.52-60. [PDF, 133KB]

(2012) Maud Ehrmann: Cross-lingual information access to multilingual documents – an overview of JRC’s media monitoring applications [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; p.35. [PDF]

(2012) Ralf Steinberger, Andreas Eisele, Szymon Klocek, Spyridon Pilos, & Patrick Schlüter: DGT-TM: a freely available translation  memory in 22 languages.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.454-459. [PDF, 471KB]

 (2012) Josef Steinberger & Marco Turchi: Machine translation for multilingual summary content evaluation. Proceedings of the NAACL-HLT Workshop on Evaluation Metrics and System Comparison for Automatic Summarization, Montréal, Canada, June 3-8, 2012; pp.19-27. [PDF, 126KB]

(2012) Marco Turchi, Martin Atkinson, Alastair Wilcox, Brett Crawley, Stefano Bucci, Ralf Steinberger, & Erik van der Goot: ONTS: optima news translation system. [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 25-30. [PDF, 323KB]

(2012) Marco Turchi, Cyril Goutte, & Nello Cristianini: Learning machine translation from in-domain and out-of-domain data. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.305-312. [PDF, 160KB]

(2012) Marco Turchi, Josef Steinberger, & Lucia Specia: Relevance ranking for translated texts. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.153-160. [PDF, 228KB]

(2011) Jakub Piskorski, Jenya Belayeva, & Martin Atkinson: Exploring the usefulness of cross-lingual information fusion for refining real-time news event extraction: a preliminary study.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.210-217. [PDF, 215KB]

(2011) Nadi Tomeh, Marco Turchi, Guillaume Wisinewski, Alexandre Allauzen, & François Yvon: How good are your phrases? Assessing phrase quality with single class classification. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.261-268. [PDF, 691KB]

(2011) Marco Turchi: Multilingual text mining and machine translation activities carried out at the EC’s Joint Research Center. Invited talk at: Sixth Machine Translation Marathon, 5-10 September 2011, Trento, Italy; 52 slides [PDF of PPT, 4286KB]

(2011) Marco Turchi, Vanni Zavarella, & Hristo Tanev: Pattern learning for event extraction using monolingual statistical machine translation. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.371-377. [PDF, 148KB]

 

Finland

Aalto University

(2013) Seppo Enarvi & Mikko Kurimo: Studies on training text selection for conversational Finnish language modeling. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 210KB]

(2011) Jaako Väyrynen, Timo Honkela, & Marcus Dobrinkat: Context in machine translation: challenge and data set. META-NET Workshop: Context in MT, June 14, 2011; 10 slides [PDF of PPT, 146KB]

(2010) Marcus Dobrinkat, Tero Tapiovaara, Jaakko Väyrynen, & Kimmo Kettunen: Evaluating machine translations using mNCD. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.80-85. [PDF, 145KB]

(2010) Marcus Dobrinkat, Jaako Väyrynen, Tero Tapiovaara, & Kimmo Kettunen: Normalized compression distance based measures for MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 343-348. [PDF, 118KB]

(2010) Mikko Kurimo, William Byrne, John Dines, Philip N.Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, & Junichi Yamagishi: Personalising speech-to-speech translation in the EMIME project. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.48-53. [PDF, 679KB]

(2010) Sami Virpioja, Jaako Väyrynen, André Mansikkaniemi, & Mikko Kurimo: Applying morphological decomposition to statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 195-200. [PDF, 116KB]

Helsinki University

(2015) Maarit Koponen: How to teach machine translation post-editing? Experiences from a post-editing course. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Fourth Workshop on Post-editing Technology and Practice (WPTP 4); p.2-15. [PDF, 505KB]

(2013) Maarit Koponen: This translation is not too bad: an analysis of post-editor choices in a machine-translation post-editing task. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 1-9. [PDF, 148KB]

(2012) Maarit Koponen: Comparing human perceptions of post-editing effort with post-editing operations. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.181-190. [PDF, 176KB]

 (2012) Maarit Koponen, Wilker Aziz, Luciana Ramos, & Lucia Specia: Post-editing time as a measure of cognitive effort. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 331KB]

(2012) Jyrki Niemi & Krister Lindén: Representing the translation relation in a bilingual wordnet.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2439-2446. [PDF, 365KB]

(2012) Tommi A.Pirinen & Francis M.Tyers: Compiling Apertium morphological dictionaries with HFST and using them in HFST applications.  [SALTMIL 2012] Workshop on Language Technology for Normalisation of Less-Resourced Languages, 22 May 2012, Istanbul, Turkey; pp.25-28. [PDF, 120KB]

(2012) Andrejs Vasiļjevs, Markus Forsberg, Tatiana Gornostay, Dorte H.Hansen, Kristin M.Jóhannsdóttir, Krister Lindén, Gunn I.Lyse, Lene Offersgaard, Ville Oksanen, Sussi Olsen, Bolette S.Pedersen, Eiríkur Rögnvaldsson, Roberts Rozis, Inguna Skadiņa, & Koenraad De Smet: Creation of an open shared language resource repository in the Nordic and Baltic countries.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1076-1083. [PDF, 1336KB]

(2010) Graham Wilcock: Cloud computing for the humanities. Human Language Technologies—The Baltic Perspective, 4th International Conference, Riga, Latvia, October 7-8, 2010; 28 slides [PDF of PPT, 2265KB]

Kymenlaasko University of Applied Sciences

(2011) Kimmo Kettunen: Packing it all up in search for a language independent MT quality measure tool [abstract]. In: Zygmunt Vetulani (ed.): Human Language Technology: Challenges for Computer Science and Linguistics: 4th Language and Technology Conference, November 6-8, Poznań, Poland (Lecture Notes in Computer Science, 6562; Heidelberg: Springer Verlag, 2011); pp.567-576. [PDF, 14KB] 

(2010) Marcus Dobrinkat, Jaako Väyrynen, Tero Tapiovaara, & Kimmo Kettunen: Normalized compression distance based measures for MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 343-348. [PDF, 118KB]

(2010) Marcus Dobrinkat, Tero Tapiovaara, Jaakko Väyrynen, & Kimmo Kettunen: Evaluating machine translations using mNCD. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.80-85. [PDF, 145KB]

Vaasa, University

(2012) Christer Laurén: Arts and sciences?? New ways of thinking: where are they born? [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; p.20. [PDF]

 

France

Airbus

(2012) Florence Beaujard: Linguistics activities of Airbus design office. META-FORUM, Brussels, June 19-21, 2012; 7 slides. [PDF of PPT, 1355KB]

Aix-Marseille, Université de

(2014) B.R.Laranjeira, V.P.Moreira, A.Villavicencio, C.Ramisch, & M.J.Finatto: Comparing the quality of focused crawlers and of the translation resources obtained from them. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3572-3578. [PDF, 803KB]

 (2013) Marta R.Costa-jussà, Rafael E.Banchs, Reinhard Rapp, Patrik Lambert, Kurt Eberle, & Bogdan Babych: Workshop on Hybrid Approaches to Translation: overview and developments. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 1-6. [PDF, 235KB]

(2013) Ahmed Hamdi, Rahma Boujelbane, Nizar Habash, & Alexis Nasr: The effects of factorizing root and pattern mapping in translating between Tunisian Arabic and standard Arabic. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.125-134. [PDF, 672KB]

Atos Research and Innovation

(2012) Tomás Pariente: Text analytics and big data. META-FORUM, Brussels, June 19-21, 2012; 23 slides. [PDF of PPT, 4103KB]

Avignon, Université de

(2013) Stéphane Huet, Elena Manishina & Fabrice Lefèvre: Factored Machine Translation Systems for Russian-English. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.154-157. [PDF, 98KB]

(2012) Raphaël Rubino, Stéphane Huet, Fabrice Lefèvre, & Georges Linarès: Statistical post-editing of machine translation for domain adaptation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.221-228. [PDF, 225KB]

(2011) Stéphane Huet & Philippe Langlais: Identifying the translations of idiomatic expressions using TransSearch. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.45-56. [PDF, 1123KB]

 (2011) Marion Potet, Raphaël Rubino, Benjamin Lecouteux, Stéphane Huet, Hervé Blanchon, Laurent Besacier, & Fabrice Lefèvre: The LIGA (LIG/LIA) machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.440-446. [PDF, 92KB]

(2010) Horacio Saggion, Juan-Manuel Torres-Moreno, Iria da Cunha, Eric SanJuan, & Patricia Velázquez-Morales: Multilingual summarization evaluation without human models. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1059-1067. [PDF, 238KB]

Caen Basse-Normandie, Université de

(2010) Julien Gosme, Wigdan Mekki, Fathi Debili, Yves Lepage, & Nadine Lucas: The GREYC/LLACAN machine translation systems for the IWSLT 2010 campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.59-65. [PDF, 307KB]

(2010) Adrien Lardilleux, Julien Gosme, & Yves Lepage: Bilingual lexicon induction: effortless evaluation of word alignment tools and production of resources for improbable language pairs. [LREC 2010]: Proceedings of the Second Workshop on African Language Technology, AFLAT 2010, Valetta, Malta; pp.252-256. [PDF, 330KB]

CEA  see Commissariat à l’Énergie Atomique

CLIPS see Communication langagière et interaction personne-système

Commissariat à l’Énergie Atomique

(2014) Nasredine Semmar, Othman Zennaki, & Meriama Laib: Using cross-language information retrieval and statistical language modelling in example-based machine translation. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.36-44. [PDF, 274KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Building specialized bilingual lexicons using word sense disambiguation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.952-956. [PDF, 117KB]

(2013) Dhouha Bouamor, Adrian Popescu, Nasredine Semmar, & Pierre Zweigenbaum: Building specialized bilingual lexicons using large-scale background knowledge. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp. 479-489. [PDF, 246KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Context vector disambiguation for bilingual lexicon extraction from comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.759-764. [PDF, 199KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Towards a generic approach for bilingual lexicon extraction from comparable corpora. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp. 143-150. [PDF, 531KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Using WordNet and semantic similarity for bilingual terminology mining from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.16-23. [PDF, 628KB]

(2013) Nasredine Semmar & Houda Saadane: Using transliteration of proper names from Arabic to Latin script to improve English-Arabic word alignment. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1022-1026. [PDF, 92KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbeaum: Automatic construction of a multi-word expressions bilingual lexicon: a statistical machine translation evaluation perspective. COLING 2012: Proceedings of the 3rd Workshop on Cognitive Aspects of the Lexicon (CogALex-III), Mumbai, December 2012; pp.95-107. [PDF, 210KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Identifying bilingual multi-word expressions for statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.674-679. [PDF, 387KB]

 (2011) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Improved statistical machine translation using multiword expressions. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.15-20. [PDF, 345KB]

(2011) Nasredine Semmar & Dhouha Bouamor: A new hybrid machine translation approach using cross-language information retrieval and only target text corpora. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.75-79. [PDF, 898KB]

(2010)  Nasredine Semmar, Christophe Servan, Gaël de Chalendar, Benoît Le Ny, & Jean-Jacques Bouzaglou: A hybrid word alignment approach to improve translation lexicons with compound words and idiomatic expressions. Translating and the Computer 32, 18-19 November 2010, London; 10pp. [PDF, 95KB]; presentation, 37 slides [PDF of PPT, 934KB]

(2010) Nasredine Semmar & Laib Meriama: Using a hybrid word alignment approach for automatic construction and updating of Arabic to French lexicons. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 114-119. [PDF, 309KB]

Direction Générale de l’Armement 

(2011) Victoria Arranz, Olivier Hamon, Karim Boudahmane, & Martine Garnier-Rizet: Protocol and lessons learnt from the production of parallel corpora for the evaluation of speech translation systems. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.129-135. [PDF, 115KB]

(2011) Karim Boudahmane, Bianka Buschbeck, Eunach Cho, Josep Maria Crego, Markus Freitag, Thomas Lavergne, Hermann Ney, Jan Niehues, Stephan Peitz, Jean Senellart, Artem Sokolov, Alex Waibel, Tonio Wandmacher, Joern Wuebker, & François Yvon: Advances on spoken language translation in the Quaero program. IWSLT 2011: International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011; 7pp. [PDF, 131KB]

ELDA see Evaluations and Language Resources Distribution Agency

Ensimag

(2011) Arnaud Vié, Luis Villarejo Muñoz, Mireia Farrús Cabeceran, & Jimmy O’Regan: Apertium advanced web interface: a first step towards interactivity and language tools convergence. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.45-51. [PDF, 280KB]

Evaluations and Language Resources Distribution Agency

(2014) E.Comelles, J.Atserias, V.Arranz, I.Castellón, & J.Sesé: VERTa: facing the multilingual experience of a linguistically-based MT evaluation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2701-2707. [PDF, 163KB]

(2014) Stelios Piperidis, Harris Papageorgiou, Christian Spurk, Georg Rehm, Khalid Choukri, Olivier Hamon, Nicoletta Calzolari, Riccardo del Gratta, Bernardo Magnini, & Christian Girardi: META-SHARE: one year after. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1208-1211. [PDF, 861KB]

(2012) Núria Bel, Vassilis Papavasiliou, Prokopis Prokopidis, Antonio Toral, & Victoria Arranz: Mining and exploiting domain-specific corpora in the PANACEA platform. [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.24-26. [PDF, 469KB]

(2012) Elisabet Comelles, Jordi Atserias, Victoria Arranz, & Irene Castellón: VERTa: linguistic features in MT evaluation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3944-3950. [PDF, 521KB]

(2012) Christian Federmann, Ioanna Giannopoulou, Christian Girardi, Olivier Hamon, Dimitris Mavroeidis, Salvatore Minutoli, & Marc Schröder: META-SHARE v2: an open network of repositories for language resources including data and tools.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3300-3303. [PDF, 588KB]

(2012) Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Stelios Piperidis, Haris Papageorgiou, Monica Monachini, Francesca Frontini, Thierry Declerck, Gil Francopoulo, Victoria Arranz, & Valerie Mapelli: The META-SHARE metadata scheme for the description of language resources.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1090-1097. [PDF, 710KB]

(2012) Manuela Yapomo, Gloria Corpas, & Ruslan Mitkov: CLIR- and ontology-based approach for bilingual extraction of comparable documents.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.121-125. [PDF, 380KB]

(2011) Victoria Arranz, Olivier Hamon, Karim Boudahmane, & Martine Garnier-Rizet: Protocol and lessons learnt from the production of parallel corpora for the evaluation of speech translation systems. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.129-135. [PDF, 115KB]

(2011) Elisabet Comelles, Irene Castellón, Jordi Atserias, Victoria Arranz, & Olivier Hamon: VERTa: exploring a multidimensional linguistically-motivated metric. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.21-28. [PDF, 309KB]

(2011) Olivier Hamon & Khalid Choukri: Evaluation methodology and results for English-to-Arabic MT.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.480-487. [PDF,

(2010) Jesús Giménez, Lluís Màrquez, Elisabet Comelles, Irene Castellón, & Victoria Arranz: Document-level automatic MT evaluation based on discourse representations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 333-338. [PDF, 112KB]

(2010) Olivier Hamon: Is my judge a good one? LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1735-1740. [PDF, 281KB]

Franche-Comté, Université de

(2012) Max Silberztein, Tamás Váradi, & Marko Tadić: Open source multi-platform NooJ for NLP. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 401-408. [PDF, 557KB]

(2011) Lucas Nunes Vieira & Lucia Specia: A review of translation tools from a post-editing perspective. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.33-42. [PDF, 801KB]

GETA see Groupe d'Etudes pour la Traduction Automatique

Grenoble : Ensimag see Ensimag

Grenoble: Institut National Polytechnique see Institut National Polytechnique de Grenoble

Grenoble: Université III

(2013) Nasredine Semmar & Houda Saadane: Using transliteration of proper names from Arabic to Latin script to improve English-Arabic word alignment. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1022-1026. [PDF, 92KB]

Grenoble, Université Joseph-Fournier

(2015) Zied Elloumi, Hervé Blanchon, Gilles Serasset, & Laurent Besacier: METEOR for multiple target languages using DBnary. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.80-89. [PDF, 569KB]

(2015) Christophe Servan, Ngoc Tien Le, Ngoc Quang Luong, Benjamin Lecouteux, & Laurent Besacier: An open-source toolkit for word-level confidence estimation in machine translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.196-203. [PDF, 3.2MB]

(2014) Ngoc-Quang Luong, Laurent Besacier, & Benjamin Lecouteux: An efficient two-pass decoder for SMT using word confidence estimation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.117-124. [PDF, 621KB]

(2014) Ngoc-Quang Luong, Laurent Besacier, & Benjamin Lecouteux: LIG system for word level QE task at WMT14. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.335-341. [PDF, 392KB]

(2014) Shachar Mirkin & Laurent Besacier: Data selection for compact adapted SMT models. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.301-314. [PDF, 610KB]

(2013) Ngoc Quang Luong, Benjamin Lecouteux & Laurent Besacier: LIG system for WMT13 QE task: investigating the usefulness of features in word confidence estima­tion for MT. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.386-391. [PDF, 202KB]

(2013) Shachar Mirkin & Nicola Cancedda: Assessing quick update methods of statistical translation models.  [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 605KB]

(2013) Carlos Ramish, Laurent Besacier, & Alexander Kobzar: How hard is it to automatically translate phrasal verbs from English to French?  [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.53-61. [PDF, 303KB]

(2013) Violeta Seretan: On translating syntactically-flexible expressions [abstract]. Invited talk at [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; p.11. [PDF, 284KB]

(2013) Lingxiao Wang & Christian Boitet: Online production of HQ parallel corpora and permanent task-based evaluation of multiple MT systems: both can be obtained through iMAGs with no added cost. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 103-110. [PDF, 2109KB]

(2013) Eric Wehrli & Luka Nerima: Anaphora resolution, collocations and translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.12-17. [PDF, 202KB]

(2012) Vincent Berment & Christian Boitet: Heloise – an ARIANE-G5 compatible environment for developing expert MT systems online. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp.9-16. [PDF, 189KB]

(2012) Laurent Besacier, Benjamin Lecouteux, Marwen Azouzi, & Luong Ngoc Quang: The LIG English to French machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp. 102-108. [PDF,625KB]

(2012) Roger Granada, Lucelene Lopes, Carlos Ramisch, Cassia Trojahn, Renata Vieira, & Aline Villavicencio: A comparable corpus based on aligned multilingual ontologies. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.25-31. [PDF, 108KB]

(2012) Ruslan Kalitvianski, Christian Boitet, & Valérie Bellynck: Collaborative computer-assisted translation applied to pedagogical documents and literary works. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 255-260. [PDF, 637KB]

(2012) Marion Potet, Emmanuelle Esperança-Rodier, Laurent Besacier, & Hervé Blanchon: Collection of a large database of French-English SMT output corrections.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.4043-4048. [PDF, 357KB]

(2012) Marion Potet, Laurent Besacier, Hervé Blanchon, & Marwen Azouzi: Towards a better understanding of statistical post-editing. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.284-291. [PDF, 624KB]

(2012) Wang Ling Xiao, Zhang Ying, Christian Boitet, & Valérie Bellynck: Demo of iMAG possibilities: MT-postediting, translation quality evaluation, parallel corpus production. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 475-481. [PDF, 1265KB]

(2011) Christian Boitet, Valérie Bellynck, Achille Falaise, & Nguyen Hong-Thai: Operationalization of interactive multilingual gateways (iMAGs) in the Traouiero project. Translating and the Computer 33, 17-18 November 2011, London; 9pp. [PDF, 1737KB]

 (2011) Benjamin Lecouteux, Laurent Besacier, & Hervé Blanchon: LIG English-French spoken language translation system for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.68-72 [PDF, 98KB]

(2011) Bo Li, Eric Gaussier, & Akiko Aizawa: Clustering comparable corpora for bilingual lexicon extraction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.473-478. [PDF, 216KB]

 (2011) Marion Potet, Raphaël Rubino, Benjamin Lecouteux, Stéphane Huet, Hervé Blanchon, Laurent Besacier, & Fabrice Lefèvre: The LIGA (LIG/LIA) machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.440-446. [PDF, 92KB]

(2011) Marion Potet, Emmanuelle Esperança-Rodier, Hervé Blanchon, & Laurent Besacier: Preliminary experiments on using users’ post-editions to enhance a SMT system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.161-168. [PDF, 346KB]

(2010) Laurent Besacier, Haitem Afli, Do Thi Ngoc Diep, Hervé Blanchon, & Marion Potet: LIG statistical machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.99-104. [PDF, 417KB]

(2010) Christian Boitet, Huynh Cong Phap, Nguyen Hong Thai, & Valérie Bellynck: The iMAG concept: multilingual access gateway to an elected web sites with incremental quality increase through collaborative post-edition of MT pretranslations. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 8pp. [PDF, 1585KB]

(2010) Do Thi Ngoc Diep, Laurent Besacier, & Eric Castelli: A fully unsupervised approach for mining parallel data from comparable corpora. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 856KB]; presentation: 30 slides [PDF, 2621KB]

(2010) Do Thi Ngoc Diep, Laurent Besacier, & Eric Castelli: Improved Vietnamese-French parallel corpus mining using English language. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.235-242. [PDF, 1182KB]

(2010) Achille Falaise, David Rouquet, Didier Schwab, Hervé Blanchon, & Christian Boitet: Ontology driven content extraction using interlingual annotation of texts in the OMNIA project. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.52-60. [PDF, 359KB]

(2010) Najeh Hajlaoui & Christian Boitet: Multilinguization and personalization of NL-based systems. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.79-87. [PDF, 453KB]

(2010) Bo Li & Eric Gaussier: Improving corpus comparability for bilingual lexicon extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.644-652. [PDF, 276KB]

(2010) M.G.Abbas Malik, Christian Boitet, & Pushpak Bhattacharyya: Finite-state scriptural translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.791-800. [PDF, 580KB]

(2010) M.G.Abbas Malik, Christian Boitet, Pushpak Bhattacharyya & Laurent Besacier: Weak translation problems – a case study of scriptural translation. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 241KB]

(2010) Marion Potet, Laurent Besacier, & Hervé Blanchon: The LIG machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 161-166. [PDF, 81KB]

Grenoble: Université Pierre Mendes France

(2012) Marion Potet, Emmanuelle Esperança-Rodier, Laurent Besacier, & Hervé Blanchon: Collection of a large database of French-English SMT output corrections.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.4043-4048. [PDF, 357KB]

(2012) Marion Potet, Laurent Besacier, Hervé Blanchon, & Marwen Azouzi: Towards a better understanding of statistical post-editing. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.284-291. [PDF, 624KB]

(2011) Marion Potet, Emmanuelle Esperança-Rodier, Hervé Blanchon, & Laurent Besacier: Preliminary experiments on using users’ post-editions to enhance a SMT system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.161-168. [PDF, 346KB]

Grenoble: Université Stendhal

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Extraction of domain-specific bilingual lexicon from comparable corpora: compositional translation and ranking. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.745-761. [PDF, 319KB]

(2012) Houda Saadane, Ouafa Benterki, Nasredine Semmar, & Christian Fluhr: Using Arabic transliteration to improve word alignment from French-Arabic parallel corpora. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.38-46. [PDF]

Groupe d’Etudes pour la Traduction Automatique

(2012) Luong Ngoc Quang: Integrating lexical, syntactic  and system-based features to improve word confidence estimation in SMT. JEP-TALN-RECITAL 2012: Actes de la conference conjointe, Grenoble, 4 au 8 juin 2012; vol.3, pp.43-56. [PDF, 219KB]

INaLCO

(2012) Vincent Berment & Christian Boitet: Heloise – an ARIANE-G5 compatible environment for developing expert MT systems online. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp.9-16. [PDF, 189KB]

INRIA see Institut de Recherche en Informatique et Systèmes Aléatoires

Institute for Multilingual and Multimedia Information

(2012) Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Stelios Piperidis, Haris Papageorgiou, Monica Monachini, Francesca Frontini, Thierry Declerck, Gil Francopoulo, Victoria Arranz, & Valerie Mapelli: The META-SHARE metadata scheme for the description of language resources.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1090-1097. [PDF, 710KB]

(2012) Joseph Mariani: Language resources and evaluation for a multilingual Europe [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp. 3-5. [PDF]

(2012) Joseph Mariani: EC collaborative research instruments. META-FORUM, Brussels, June 19-21, 2012; 13 slides. [PDF of PPT, 248KB]

(2012) Joseph Mariani: Priority theme 3: Socially aware interactive assistant. META-FORUM, Brussels, June 19-21, 2012; 15 slides. [PDF of PPT, 813KB]

(2011) Joseph Mariani: The future European multilingual information society.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 26 slides [PDF of PPT, 11,176KB]

Institut de Recherche en Informatique et Systèmes Aléatoires

(2014) Yves Scherrer & Benoît Sagot: A language-independent and fully unsupervised approach t lexicon induction and part-of-speech tagging for closely related languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.502-508. [PDF, 184KB]

(2010) Marianna Apidianaki & Yifan He: An algorithm for cross-lingual sense-clustering tested in a MT evaluation setting. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.219-226. [PDF, 474KB]

(2010) Richard Moot: Semi-automated extraction of a wide-coverage type-logical grammar for French. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 60KB]

(2010) Richard Moot: Wide-coverage French syntax and semantics using Grail. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 121KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

Laboratoire Bordelais de Recherche en Informatique

(2010) Richard Moot: Semi-automated extraction of a wide-coverage type-logical grammar for French. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 60KB]

(2010) Richard Moot: Wide-coverage French syntax and semantics using Grail. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 121KB]

Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier

(2011) Johan Segura & Violaine Prince: Using alignment to detect multiword expressions in bilingual corpora. .  Tralogy, Paris, 3-4 March 2011;  17pp. [PDF, 430KB]

Laboratoire d’Informatique pour la Méchanique et les Sciences d’Ingénieur

(2015) Franck Burlot & François Yvon: Morphology-aware alignments for translation to and from a synthetic language. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 188-195. [PDF, 3.1MB]

(2014) Marianna Apidianaki, Emilia Verzeni, & Diana McCarthy: Semantic clustering of pivot paraphrases. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4270-4275. [PDF, 243KB]

(2014) Quoc Kanh Do, Teresa Herrmann, Jan Niehues, Alexandre Allauzen, François Yvon & Alex Waibel: The KIT-LIMSI translation system for WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.84-89. [PDF, 378KB]

(2014) Nicolas Pécheux, Li Gong, Quoc Khanh Do, Benjamin Marie, Yulia Ivanishcheva, Alexandre Allauzen, Thomas Lavergne, Jan Niehues, Aurélien Max, & François Yvon: LIMSI @ WMT’14 medical translation task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.246-253. [PDF, 316KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2014) Guillaume Wisniewski, Nicolas Pécheux, Alexandre Allauzen, & François Yvon: LIMSI submission for WMT’14 QE task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.348-354. [PDF, 472KB]

(2014) Guillaume Wisniewski, Natalie Kübler, & François Yvon: A corpus of machine translation errors extracted from translation students exercises. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3585-3588. [PDF, 328KB]

(2013) Alexandre Allauzen, Nicolas Pécheux, Quoc Khanh Do, Marco Dinarelli, Thomas Lavergne, Au­rélien Max, Hai-Son Le & François Yvon: LIMSI @ WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.62-69. [PDF, 210KB]

(2013) Marianna Apidianaki, Nikola Ljubešić, & Darja Fišer: Cross-lingual WSD for translation extraction from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.1-10. [PDF, 388KB]

(2013) Marianna Apidianaki: LIMSI: cross-lingual word sense disambiguation using translation sense clustering. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.178-182. [PDF, 113KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Building specialized bilingual lexicons using word sense disambiguation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.952-956. [PDF, 117KB]

(2013) Dhouha Bouamor, Adrian Popescu, Nasredine Semmar, & Pierre Zweigenbaum: Building specialized bilingual lexicons using large-scale background knowledge. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp. 479-489. [PDF, 246KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Context vector disambiguation for bilingual lexicon extraction from comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.759-764. [PDF, 199KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Using WordNet and semantic similarity for bilingual terminology mining from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.16-23. [PDF, 628KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Towards a generic approach for bilingual lexicon extraction from comparable corpora. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp. 143-150. [PDF, 531KB]

(2013) Li Gong, Aurélien Max, & François Yvon: Improving bilingual sub-sentential alignment by sampling-based transpotting. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 431KB]

(2013) Benjamin Marie & Aurélien Max: A study in greedy oracle improvement of translation hypotheses. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 400KB]

(2013) Stephan Peitz, Saab Mansour, Matthias Huck, Markus Freitag, Hermann Ney, Eunah Cho, Teresa Herrmann, Mohammed Mediani, Jan Niehues, Alex Waibel, Alexandre Allauzen, Quoc Khanh Do, Bianka Buschbeck, & Tonio Wandmacher: Joint WMT 2013 Submission of the QUAERO Project. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.185-192. [PDF, 179KB]

(2013) M.Ali Basha Shaik, Zoltan Tüske, Simon Wiesler, Markus Nußbaum-Thom, Stephan Peitz, Ralf Schlüter & Hermann Ney: The RWTH Aachen German and English LVCSR systems for IWSLT-2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 230KB]

(2013) Anil Kumar Singh, Guillaume Wisniewski & François Yvon: LIMSI submission for the WMT’13 quality estimation task: an experiment with n-gram posteriors. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.398-404. [PDF, 204KB]

(2013) Guillaume Wisniewski, Anil Kumar Singh, Natalia Segal, & François Yvon: Design and analysis of a large corpus of post-edited translations: quality estimation, failure analysis and variability of post-edition. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.117-124. [PDF, 541KB]

(2012) Marianna Apidianaki: Measuring the adequacy of cross-lingual paraphrases in a machine translation setting. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.63-72. [PDF, 137KB]

(2012) Marianna Apidianaki, Guillaume Wisniewski, Artem Sokolov, Aurélien Max, & François Yvon: WSD for n-best reranking and local language modeling in SMT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.1-9. [PDF, 108KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbeaum: Automatic construction of a multi-word expressions bilingual lexicon: a statistical machine translation evaluation perspective. COLING 2012: Proceedings of the 3rd Workshop on Cognitive Aspects of the Lexicon (CogALex-III), Mumbai, December 2012; pp.95-107. [PDF, 210KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Identifying bilingual multi-word expressions for statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.674-679. [PDF, 387KB]

(2012) Houda Bouamor, Aurélien Max, Gabriel Illouz, & Anne Vilnat: A contrastive review of paraphrase acquisition techniques.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2653-2658. [PDF, 352KB]

(2012) Houda Bouamor, Aurélien Max, & Anne Vilnat: Validation of sub-sentential paraphrases acquired from parallel monolingual corpora. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 716-725. [PDF, 173KB]

(2012) Markus Freitag, Stephan Peitz, Matthias Huck, Hermann Ney, Jan Niehues, Teresa Herrmann, Alex Waibel, Le Hai-son, Thomas Lavergne, Alexandre Allauzen, Bianka Buschbeck, Josep Maria Crego, & Jean Senellart: Joint WMT 2012 submission for the QUAERO project. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.322-329. [PDF, 113KB]

(2012) Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Stelios Piperidis, Haris Papageorgiou, Monica Monachini, Francesca Frontini, Thierry Declerck, Gil Francopoulo, Victoria Arranz, & Valerie Mapelli: The META-SHARE metadata scheme for the description of language resources.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1090-1097. [PDF, 710KB]

(2012) Li Gong, Aurélien Max, & François Yvon: Towards contextual adaptation for any-text translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.292-299. [PDF, 727KB]

(2012) Adrien Lardilleux, François Yvon, & Yves Lepage: Hierarchical sub-sentential alignment with Anymalign. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.279-286. [PDF, 173KB]

(2012) Hai-Son Le, Thomas Lavergne, Alexandre Allauzen, Marianna Apidianaki, Li Gong, Aurélien Max, Artem Sokolov, Guillaume Wisniewski, & François Yvon: LIMSI @ WMT’12.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.330-337. [PDF, 189KB]

(2012) Joseph Mariani: EC collaborative research instruments. META-FORUM, Brussels, June 19-21, 2012; 13 slides. [PDF of PPT, 248KB]

(2012) Joseph Mariani: Language resources and evaluation for a multilingual Europe [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp. 3-5. [PDF]

(2012) Joseph Mariani: Priority theme 3: Socially aware interactive assistant. META-FORUM, Brussels, June 19-21, 2012; 15 slides. [PDF of PPT, 813KB]

(2012) Artem Sokolov, Guillaume Wisniewski, & François Yvon: Computing lattice BLEU oracle scores for machine translation. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.120-129. [PDF, 258KB]

(2012) Artem Sokolov, Guillaume Wisniewski & François Yvon: Non-linear n-best list reranking with few features. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 326KB]

(2012) Qian Yu, Aurélien Max, & François Yvon: Aligning bilingual literary works: a pilot study. NAACL-HLT Workshop on Computational Linguistics for Literature, Montréal, Canada, June 8, 2012; pp.36-44. [PDF, 232KB]

(2012) Qian Yu, Aurélien Max, & François Yvon: Revisiting sentence alignment algorithms for alignment visualization and evaluation. [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.10-16. [PDF, 430KB]

 (2011) Alexandre Allauzen, Hélène Bonneau-Maynard, Hai-Son Le, Aurélien Max, Guillaume Wisniewski, François Yvon, Gilles Adda, Josep M.Crego, Adrien Lardilleux, Thomas Lavergne, & Artem Sokolov: LIMSI @ WMT11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.309-315. [PDF, 107KB]

 (2011) Karim Boudahmane, Bianka Buschbeck, Eunach Cho, Josep Maria Crego, Markus Freitag, Thomas Lavergne, Hermann Ney, Jan Niehues, Stephan Peitz, Jean Senellart, Artem Sokolov, Alex Waibel, Tonio Wandmacher, Joern Wuebker, & François Yvon: Advances on spoken language translation in the Quaero program. IWSLT 2011: International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011; 7pp. [PDF, 131KB]

(2011) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Improved statistical machine translation using multiword expressions. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.15-20. [PDF, 345KB]

(2011) Houda Bouamor, Aurélien Max, & Anne Vilnat: Monolingual alignment by edit rate computation on sentential paraphrase pairs. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.395-400. [PDF, 138KB]

(2011) Josep M.Crego, François Yvon, & José B.Mariño: Ncode: an open source bilingual n-gram SMT toolkit. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.49-58. [PDF, 190KB]; presentation, 59 slides [PDF of PPT, 878KB]

(2011) Michael Filhol, Line Patris, & Pierre Guitteny: Text-sign parallel corpus study to start designing an automatic translation system.  International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 23 October 2011, Dundee, UK; 7pp. [PDF, 351KB]

 (2011) M.Filhol & A.Braffort: Text to SL translation.  International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 13 slides [PDF of PPT, 1044KB]

(2011) Karën Fort, Gilles Adda, & K.Bretonnel Cohen: Amazon Mechanical Turk: gold mine or coal mine?  Computational Linguistics 37 (2), pp. 413-420 [PDF, 96KB]

(2011) Markus Freitag, Gregor Leusch, Joern Wuebker, Stephan Peitz, Hermann Ney, Teresa Herrmann, Jan Niehues, Alex Waibel, Alexandre Allauzen, Gilles Adda, Josep Maria Crego, Bianka Buschbeck, Tonio Wandmacher, & Jean Senellart: Joint WMT submission of the QUAERO project. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.358-364. [PDF, 107KB]

(2011) Lori Lamel, Sandrine Courcinous, Julien Despres, Jean-Luc Gauvain, Yvan Josse, Kevin Kilgour, Florian Kraft, Viet Bac Le, Hermann Ney, Markus Nußbaum-Thom, Ilya Oparin, Tim Schlippe, Ralf Schlüter, Tanja Schultz, Thiago Fraga da Silva, Sebastian Stüker, Martin Sundermeyer, Bianca Vieru, Ngoc Thang Vu, Alexander Waibel, & Cécile Woehrling: Speech recognition for machine translation in Quaero. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.121-128. [PDF, 310KB]; presentation, 33 slides. [PDF of PPT]

(2011) Thomas Lavergne, Josep Maria Crego, Alexandre Allauzen, & François Yvon: From n-gram-based to CRF-based translation models. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.542-553. [PDF, 197KB]

(2011) Thomas Lavergne, Alexandre Allauzen, Hai-Son Le, & François Yvon: LIMSI’s experiments in domain adaptation for IWSLT11. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.62-67. [PDF, 358KB]

(2011) Joseph Mariani: The future European multilingual information society.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 26 slides [PDF of PPT, 11,176KB]

(2011) Artem Sokolov & François Yvon: Minimum error rate training semiring. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.241-248. [PDF, 418KB]; presentation, 19 slides [PDF]

(2011) Nadi Tomeh, Alexandre Allauzen, & François Yvon: Discriminative weighted alignment matrices for statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.305-312. [PDF, 413KB]; presentation, 38 slides [PDF]

(2011) Nadi Tomeh, Marco Turchi, Guillaume Wisinewski, Alexandre Allauzen, & François Yvon: How good are your phrases? Assessing phrase quality with single class classification. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.261-268. [PDF, 691KB]

(2010) Gilles Adda & Joseph Mariani: Language resources & Amazon Mechanical Turk: ethical, legal and other issues. LREC 2010: Le gal Issues for Sharing Language Resources - LISLR2010 Workshop, 17 May 2010, Valletta, Malta; 21slides. [PDF of PPT, 244KB]

(2010) Alexandre Allauzen, Josep M.Crego, Ílknur Durgar El-Kahlout, & François Yvon: LIMSI’s statistical translation systems for WMT’10.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 54-59. [PDF, 129KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

(2010) Bruno Cartoni & Marie-Aude Lefer: The MuLeXFoR database: representing word-formation processes in a multilingual lexicographic environment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.840-843. [PDF, 284KB]

(2010) Josep Maria Crego & François Yvon: Improving reordering with linguistically informed bilingual n-grams. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.197-205. [PDF, 270KB]

(2010) Josep Maria Crego, Aurélien Max, & François Yvon: Local lexical adaptation in machine translation through triangulation: SMT helping SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.232-240. [PDF, 197KB]

(2010) İlknur Durgar El-Kahlout & François Yvon: The pay-offs of preprocessing for German-English statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.251-258. [PDF, 301KB]

(2010) Le Hai Son, Alexandre Allauzen, Guillaume Wisniewski, & François Yvon: Training continuous space language models: some practical issues. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.778-788. [PDF, 311KB]

(2010) Gregor Leusch, Aurélien Max, Josep Maria Crego, & Hermann Ney: Multi-pivot translation by system combination. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.299-306. [PDF, 303KB]

(2010) Joseph Mariani: Foreword. 7th International Workshop on Spoken Language Translation, 2 December 2010, Paris, France; pp.v-vi. [PDF, 117KB]

(2010) Aurélien Max: Example-based paraphrasing for improved phrase-based statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.656-666. [PDF, 234KB]

(2010) Aurélien Max, Josep Maria Crego, & François Yvon: Contrastive lexical evaluation of machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1753-1757. [PDF, 400KB]

(2010) Nadi Tomeh, Alexandre Allauzen, Guillaume Wisniewski & François Yvon: Refining word alignment with discriminative training. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 203KB]

(2010) Guillaume Wisniewski, Alexandre Allauzen, & François Yvon: Assessing phrase-based translation models with oracle decoding. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.933-943. [PDF, 188KB]

(2010) François Yvon & Viggo Hansen (eds.) EAMT 2010:  Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France.

(2010) Zhemin Zhu, Delphine Bernhard, & Iryna Gurevych: A monolingual tree-based translation model for sentence simplification. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1353-1361. [PDF, 260KB]

Laboratoire d’Informatique Fondamentale

(2010) Reinhard Rapp & Michael Zock: The noisier the better: identifying multilingual word translations using a single monolingual corpus. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.16-25. [PDF, 179KB]

(2010) Reinhard Rapp & Michael Zock: Utilizing citations of foreign words in corpus-based dictionary generation. [Coling 2010] Proceedings of the Second Workshop on NLP Challenges in the Information Explosion Era, Beijing, China, 28 August 2010; pp.50-59. [PDF, 188KB]

Laboratoire Lorrain de Recherche et Informatique et ses Applications

 (2013) David Langlois & Kamel Smaili: LORIA system for the WMT13 quality estimation shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.380-385. [PDF, 177KB]

(2013) Motaz Saad, David Langlois, & Kamel Smaili: Comparing multilingual comparable articles based on opnions. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.105-111. [PDF, 512KB]

 (2011) Sylvain Raybaud, David Langlois, & Kamel Smaïli: Broadcast news speech-to-text translation experiments. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.378-381. [PDF, 64KB]

LABRI see Laboratoire Bordelais de Recherche en Informatique

Langage, Langues et Cultures d’Afrique Noire

(2010) Julien Gosme, Wigdan Mekki, Fathi Debili, Yves Lepage, & Nadine Lucas: The GREYC/LLACAN machine translation systems for the IWSLT 2010 campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.59-65. [PDF, 307KB]

LaTTiCe see Paris, Université de

Le Mans, Université

(2015) Walid Aransa, Holger Schwenk, & Loïc Barrault: Improving continuous space language models auxiliary features. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.151-158. [PDF, 2.9MB]

(2015) Mercedes Garcia Martínez, Loïc Barrault, Anthony Rousseau, Paul Deléglise, & Yannick Estève: The LIUM ASR and SLT systems for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 50-54. [PDF, 1.8MB]

(2014) Anthony Rousseau, Paul Deléglise, & Yannick Estève: Enhancing the TED-LIUM corpus with selected data for language modeling and more TED talks. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3935-3939. [PDF, 115KB]

(2013) Mauro Cettolo, Christophe Servan, Nicola Bertoldi, Marcello Federico, Loïc Barrault, & Holger Schwenk: Issues in incremental adaptation of statistical MT from human post-edits. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 111-118. [PDF, 186KB]

(2013) Marcello Federico, Philipp Koehn, Holger Schwenk, & Marco Trombetti: Matecat: Machine Translation Enhanced Computer Assisted Translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.425. [PDF, 192KB]

(2013) Rico Sennrich, Holger Schwenk & Walid Aransa: A multi-domain translation model framework for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.832-840. [PDF, 1387KB]

(2012) Walid Aransa, Holger Schwenk, & Loic Barrault: Semi-supervised transliteration mining from parallel and comparable corpora. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp. 185-192. [PDF, 650KB]

(2012) Frédéric Blain, Holger Schwenk, & Jean Senellart: Incremental adaptation using translation information and post-editing analysis. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.229-236. [PDF, 873KB]

(2012) Patrik Lambert & Rafael E.Banchs: BIA: a discriminative phrase alignment toolkit. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.43-53. [PDF, 136KB]

(2012) Patrik Lambert, Holger Schwenk, & Frédéric Blain: Automatic translation of scientific documents in the HAL archive.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3933-3936. [PDF, 329KB]

(2012) Patrick Lambert, Jean Senellart, Laurent Romary, Holger Schwenk, Florian Zipser, Patrice Lopez, & Frédéric Blain: Collaborative machine translation service for scientific texts. [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 11-15. [PDF, 523KB]

(2012) Holger Schwenk: Continuous space translation models for phrase-based statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1071-1080. [PDF, 124KB]

(2012) Holger Schwenk, Anthony Rousseau, & Mohammed Attik: Large, pruned or continuous space language models on a GPU for statistical machine translation. NAACL-HLT Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, Montréal, Canada, June 8, 2012; pp.11-19. [PDF, 198KB]

(2012) Christophe  Servan & Simon Petitrenaud: Calculation of phrase probabilities for statistical machine translation by using belief functions. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1101-1110. [PDF, 120KB]

(2012) Christophe Servan, Patrik Lambert, Anthony Rousseau, Holger Schwenk, & Loïc Barrault: LIUM’s SMT machine translation systems for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.369-373. [PDF, 106KB]

(2012) Kashif Shah, Loïc Barrault, & Holger Schwenk: A general framework to weight heterogenous parallel data for model adaptation in statistical machine translation.  AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 194KB]

(2011) LoïcBarrault & Patrik Lambert: Machine translation system combination with MANY for ML4HMT. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.89-92. [PDF, 344KB]

 (2011) Loïc Barrault: MANY improvements for WMT’11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.135-139. [PDF, 151KB]

(2011) Frédéric Blain, Jean Senellart, Holger Schwenk, Mirko Plitt, & Johann Roturier: Qualitative analysis of post-editing for high quality machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.164-171. [PDF, 218KB]

(2011) Patrik Lambert & Rafael Banchs: BIA: a discriminative phrase alignment toolkit. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; 23 slides [PDF of PPT, 288KB]

(2011) Patrik Lambert, Holger Schwenk, Christophe Servan, & Sadaf Abdul-Rauf: Investigations on translation model adaptation using monolingual data. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.284-293. [PDF, 124KB]

 (2011) Anthony Rousseau, Fethi Bougares, Paul Deléglise, Holger Schwenk, & Yannick Estève: LIUM’s systems for the IWSLT 2011 speech translation tasks. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.79-85. [PDF, 250 KB]

(2011) Holger Schwenk, Patrik Lambert, Loïc Barrault, Christophe Servan, Haithem Afli, Sadaf Abdul-Rauf, & Kashif Shah: LIUM’s SMT machine translation systems for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.464-469. [PDF, 119KB]

(2011) Christophe Servan & Holger Schwenk: Optimising multiple metrics with MERT. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.109-117. [PDF, 141KB]; presentation, 20 slides [PDF of PPT, 535KB]

(2011) Kashif Shah, Loïc Barrault, & Holger Schwenk: Parametric weighting of parallel data for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1323-1331. [PDF, 754KB]

(2010) Loïc Barrault: Open source machine translation system combination. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.147-155. [PDF, 163KB]

(2010) Loïc Barrault: MANY: open source MT system combination at WMT’10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 271-275. [PDF, 101KB]

(2010) Patrik Lambert: Statistical machine translation: IBM models and word alignment. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 40 slides. [PDF, 230KB]

(2010) Patrik Lambert, Sadaf Abdul-Rauf, & Holger Schwenk: LIUM SMT machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 121-126. [PDF, 119KB]

(2010) Patrik Lambert, Simon Petitrenaud, Yanjun Ma & Andy Way: Statistical analysis of alignment characteristics for phrase-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 588KB]; presentation: 26 slides [PDF, 133KB]

(2010) Anthony Rousseau, Loïc Barrault, Paul Deléglise, & Yannick Estève: LIUM’s statistical machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.113-117. [PDF, 246KB]

(2010) Francisco Zamora-Martínez, María José Castro-Bleda, & Holger Schwenk: N-gram-based machine translation enhanced with neural networks for the French-English BTEC-IWSLT’10 task.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.45-52. [PDF, 545KB]

(2010) Holger Schwenk: Continuous-space language models for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.137-146. [PDF, 151KB]

(2010) Kashif Shah, Loïc Barrault, & Holger Schwenk: Translation model adaptation by resampling. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 392-399. [PDF, 176KB]

Lexcelera, Innovating Translation

(2012) Lori Thicke: Why do we need language technology? META-FORUM, Brussels, June 19-21, 2012; 15 slides. [PDF of PPT, 1543KB]

(2010) Lori Thicke: What’s driving machine translation and why should we care? EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 11pp. [PDF of PPT presentation, 1539KB]; presentation: 23 slides [PDF, 1568KB]

Lille, Université de

(2010) Ilse Depraetere: What counts as useful advice in a university post-editing training context? Report on a case study. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 560KB]

LIMSI see Laboratoire d’Informatique pour la Méchanique et les Sciences d’Ingénieur

Lingua et Machina

(2013) Benjamin Marie & Aurélien Max: A study in greedy oracle improvement of translation hypotheses. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 400KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Extraction of domain-specific bilingual lexicon from comparable corpora: compositional translation and ranking. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.745-761. [PDF, 319KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Identification of fertile translations in medical comparable corpora: a morpho-compositional approach. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 154KB]

LIPN see Paris, Université de

LIRMM see Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier

LIUM see Le Mans Université

LLACAN see Langage, Langues et Cultures d’Afrique Noire

LORIA see Laboratoire Lorrain de Recherche et Informatique et ses Applications

Lorraine, Université de

(2012) David Langlois, Sylvain Raybaud, & Kamel Smaïli: LORIA system for the WMT12 quality estimation shared task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.114-119. [PDF, 96KB]

Maine, Université du

(2013) Haithem Afli, Loïc Barrault & Holger Schwenk: Multimodal comparable corpora as resources for extracting parallel data: parallel phrases extraction. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.286-292. [PDF, 1586KB]

(2012) Frédéric Blain, Holger Schwenk, & Jean Senellart: Incremental adaptation using translation information and post-editing analysis. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.229-236. [PDF, 873KB]

Montpellier: Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier see Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier

Nancy, Université de see Laboratoire Lorrain de Recherche et Informatique et ses Applications

Nantes, Université de

(2013) Béatrice Daille: TTC: terminology extraction, translation tools and comparable corpora. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.449. [PDF, 272KB]

(2013) Rima Harastani, Béatrice Daille & Emmanuel Morin: Ranking translation candidates acquired from comparable corpora. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.401-409. [PDF, 276KB]

(2013) Amir Hazem & Emmanuel Morin: A comparison of smoothing techniques for bilingual lexicon extraction from comparable corpora.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.24-33. [PDF, 546KB]

(2013) Amir Hazem & Emmanuel Morin: Word co-occurrence counts prediction for bilingual terminology extraction from comparable corpora. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1392-1400. [PDF, 162KB]

(2012) Béatrice Daille: Building bilingual terminologies from comparable corpora: the TTC TermSuite.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.29-32. [PDF, 317KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Extraction of domain-specific bilingual lexicon from comparable corpora: compositional translation and ranking. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.745-761. [PDF, 319KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Identification of fertile translations in medical comparable corpora: a morpho-compositional approach. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 154KB]

(2012) Amir Hazem & Emmanuel Morin: ICA for bilingual lexicon extraction from comparable corpora.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.126-133. [PDF, 474KB]

(2011) Helena Blancafort, Ulrich Heid, Tatiana Gornostay, Claude Méchoulam, Béatrice Daille & Serge Sharoff. User-centred views on terminology extraction tools: usage scenarios and integration into MT and CAT tools. Tralogy, Paris, 3-4 March 2011;  15pp. [PDF, 201KB]

(2011) Amir Hazem, Emmanuel Morin & Sebastian Peña Saldarriaga: Bilingual lexicon extraction from comparable corpora as metasearch. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.35-43. [PDF, 159KB]

(2011) Emmanuel Morin & Emmanuel Prochasson: Bilingual lexicon extraction from comparable corpora enhanced with parallel corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.27-34. [PDF, 125KB]

(2010) Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, & Claudia Baur: A multilingual CALL game based on speech translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1531-1538. [PDF, 573KB]

Orange Labs

(2015) Artem Sokolov, Stefan Riezler, & Tanguy Urvoy: Bandit structured prediction for learning from partial feedback in statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.160-171. [PDF, 611KB]

(2010) Benoît Gaillard, Malek Boualem, & Olivier Collin: Query translation using Wikipedia-based resources for analysis and disambiguation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 579KB]

Paris, Ecole Normale Supérieure

(2011) Philippe Lacour, Any Freitas, Aurélien Bénel, Franck Eyraud & Diana Zambon: Translation and the new digital commons. Tralogy, Paris, 3-4 March 2011; 16 pp. [PDF, 313KB]

Paris, Université de

(2015) Franck Burlot & François Yvon: Morphology-aware alignments for translation to and from a synthetic language. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 188-195. [PDF, 3.1MB]

(2010) Olivier Hamon: Is my judge a good one? LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1735-1740. [PDF, 281KB]

(2010) Lin Sun, Anna Korhonen, Thierry Poibeau, & Cédric Messiant: Investigating the cross-linguistic potential of VerbNet-style classification. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp. 1056-1064. [PDF, 227KB]

Paris Diderot, Université

(2014) Marianna Apidianaki, Emilia Verzeni, & Diana McCarthy: Semantic clustering of pivot paraphrases. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4270-4275. [PDF, 243KB]

(2014) Yves Scherrer & Benoît Sagot: A language-independent and fully unsupervised approach t lexicon induction and part-of-speech tagging for closely related languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.502-508. [PDF, 184KB]

(2014) Guillaume Wisniewski, Natalie Kübler, & François Yvon: A corpus of machine translation errors extracted from translation students exercises. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3585-3588. [PDF, 328KB]

(2011) Michael Filhol, Line Patris, & Pierre Guitteny: Text-sign parallel corpus study to start designing an automatic translation system.  International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 23 October 2011, Dundee, UK; 7pp. [PDF, 351KB]

Paris VII, Université

(2014) Valérie Hanoka & Benoît Sagot: YaMTG: an open-source heavily multilingual translation graph extracted from wiktionaries and parallel corpora. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3179-3186. [PDF, 245KB]

 (2013) Yves Scherrer & Tomaž Erjavec: Modernizing historical Slovene words with character-based SMT. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.58-62. [PDF, 110KB]

Paris Sud, Université de

(2014) Nicolas Pécheux, Li Gong, Quoc Khanh Do, Benjamin Marie, Yulia Ivanishcheva, Alexandre Allauzen, Thomas Lavergne, Jan Niehues, Aurélien Max, & François Yvon: LIMSI @ WMT’14 medical translation task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.246-253. [PDF, 316KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2014) Guillaume Wisniewski, Nicolas Pécheux, Alexandre Allauzen, & François Yvon: LIMSI submission for WMT’14 QE task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.348-354. [PDF, 472KB]

(2014) Guillaume Wisniewski, Natalie Kübler, & François Yvon: A corpus of machine translation errors extracted from translation students exercises. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3585-3588. [PDF, 328KB]

(2013) Alexandre Allauzen, Nicolas Pécheux, Quoc Khanh Do, Marco Dinarelli, Thomas Lavergne, Au­rélien Max, Hai-Son Le & François Yvon: LIMSI @ WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.62-69. [PDF, 210KB]

(2013) Li Gong, Aurélien Max, & François Yvon: Improving bilingual sub-sentential alignment by sampling-based transpotting. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 431KB]

(2013) Benjamin Marie & Aurélien Max: A study in greedy oracle improvement of translation hypotheses. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 400KB]

(2013) Guillaume Wisniewski, Anil Kumar Singh, Natalia Segal, & François Yvon: Design and analysis of a large corpus of post-edited translations: quality estimation, failure analysis and variability of post-edition. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.117-124. [PDF, 541KB]

(2012) Marianna Apidianaki, Guillaume Wisniewski, Artem Sokolov, Aurélien Max, & François Yvon: WSD for n-best reranking and local language modeling in SMT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.1-9. [PDF, 108KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Identifying bilingual multi-word expressions for statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.674-679. [PDF, 387KB]

(2012) Houda Bouamor, Aurélien Max, Gabriel Illouz, & Anne Vilnat: A contrastive review of paraphrase acquisition techniques.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2653-2658. [PDF, 352KB]

(2012) Houda Bouamor, Aurélien Max, & Anne Vilnat: Validation of sub-sentential paraphrases acquired from parallel monolingual corpora. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 716-725. [PDF, 173KB]

(2012) Li Gong, Aurélien Max, & François Yvon: Towards contextual adaptation for any-text translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.292-299. [PDF, 727KB]

(2012) Aimée Lahaussois & Séverine Guillaume: A viewing and processing tool for the analysis of a comparable corpus of Kiranti mythology.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.33-41. [PDF, 778KB]

(2012) Adrien Lardilleux, François Yvon, & Yves Lepage: Hierarchical sub-sentential alignment with Anymalign. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.279-286. [PDF, 173KB]

(2012) Hai-Son Le, Thomas Lavergne, Alexandre Allauzen, Marianna Apidianaki, Li Gong, Aurélien Max, Artem Sokolov, Guillaume Wisniewski, & François Yvon: LIMSI @ WMT’12.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.330-337. [PDF, 189KB]

(2012) Artem Sokolov, Guillaume Wisniewski, & François Yvon: Computing lattice BLEU oracle scores for machine translation. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.120-129. [PDF, 258KB]

(2012) Artem Sokolov, Guillaume Wisniewski & François Yvon: Non-linear n-best list reranking with few features. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 326KB]

(2012) Qian Yu, Aurélien Max, & François Yvon: Aligning bilingual literary works: a pilot study. NAACL-HLT Workshop on Computational Linguistics for Literature, Montréal, Canada, June 8, 2012; pp.36-44. [PDF, 232KB]

(2012) Qian Yu, Aurélien Max, & François Yvon: Revisiting sentence alignment algorithms for alignment visualization and evaluation. [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.10-16. [PDF, 430KB]

(2012) Yong Zhuang, Guillaume Wisniewski, & François Yvon: Non-linear models for confidence estimation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.157-162. [PDF, 427KB]

(2011) Victoria Arranz, Olivier Hamon, Karim Boudahmane, & Martine Garnier-Rizet: Protocol and lessons learnt from the production of parallel corpora for the evaluation of speech translation systems. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.129-135. [PDF, 115KB]

(2011) Josep M.Crego, François Yvon, & José B.Mariño: Ncode: an open source bilingual n-gram SMT toolkit. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.49-58. [PDF, 190KB]; presentation, 59 slides [PDF of PPT, 878KB]

(2011) Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, & François Yvon: Two ways to use a noisy parallel news corpus for improving statistical machine translation. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.44-51. [PDF, 176KB]

(2011) Thomas Lavergne, Alexandre Allauzen, Hai-Son Le, & François Yvon: LIMSI’s experiments in domain adaptation for IWSLT11. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.62-67. [PDF, 358KB]

(2011) Artem Sokolov & François Yvon: Minimum error rate training semiring. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.241-248. [PDF, 418KB]; presentation, 19 slides [PDF]

(2011) Nadi Tomeh, Alexandre Allauzen, & François Yvon: Discriminative weighted alignment matrices for statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.305-312. [PDF, 413KB]; presentation, 38 slides [PDF]

(2010) Alexandre Allauzen, Josep M.Crego, Ílknur Durgar El-Kahlout, & François Yvon: LIMSI’s statistical translation systems for WMT’10.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 54-59. [PDF, 129KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

(2010) Josep Maria Crego, Aurélien Max, & François Yvon: Local lexical adaptation in machine translation through triangulation: SMT helping SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.232-240. [PDF, 197KB]

(2010) İlknur Durgar El-Kahlout & François Yvon: The pay-offs of preprocessing for German-English statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.251-258. [PDF, 301KB]

(2010) Le Hai Son, Alexandre Allauzen, Guillaume Wisniewski, & François Yvon: Training continuous space language models: some practical issues. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.778-788. [PDF, 311KB]

(2010) Gregor Leusch, Aurélien Max, Josep Maria Crego, & Hermann Ney: Multi-pivot translation by system combination. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.299-306. [PDF, 303KB]

(2010) Aurélien Max, Josep Maria Crego, & François Yvon: Contrastive lexical evaluation of machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1753-1757. [PDF, 400KB]

(2010) Aurélien Max: Example-based paraphrasing for improved phrase-based statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.656-666. [PDF, 234KB]

(2010) Nadi Tomeh, Alexandre Allauzen, Guillaume Wisniewski & François Yvon: Refining word alignment with discriminative training. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 203KB]

(2010) Guillaume Wisniewski, Alexandre Allauzen, & François Yvon: Assessing phrase-based translation models with oracle decoding. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.933-943. [PDF, 188KB]

Paul Sabatier, Université de (Toulouse)

(2010) Arianna Bisazza, Ioannis Klasinas, Mauro Cettolo, & Marcello Federico: FBK @ IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.53-58. [PDF, 346KB]

Rouen: Institut National des Sciences Appliquées, Rouen see Institut National des Sciences Appliquées, Rouen

SAP France

(2015) Martin Benjamin, Amar Mukunda & Jeff Allen: Kamusi pre-D-source-side disambiguation and a sense aligned multilingual lexicon. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.27-32. [PDF, 188KB]

Signe

(2011) Michael Filhol, Line Patris, & Pierre Guitteny: Text-sign parallel corpus study to start designing an automatic translation system.  International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 23 October 2011, Dundee, UK; 7pp. [PDF, 351KB]

Simple Shift

(2011) Karim Benzineb & Jacques Guyot: Building blocks to integrate a Moses MT engine into the production translation work-flow with reference to the World Trade Organization and other clients.  Translating and the Computer 33, 17-18 November 2011, London; 11pp. [PDF, 194KB]

Softissimo

(2013) Guillaume Wisniewski, Anil Kumar Singh, Natalia Segal, & François Yvon: Design and analysis of a large corpus of post-edited translations: quality estimation, failure analysis and variability of post-edition. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.117-124. [PDF, 541KB]

(2011) Théo Hoffenberg & Christophe Brun-Franc: An innovative platform to allow full translation of internet sites. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.41-45. [PDF, 316KB]

(2010)  Nasredine Semmar, Christophe Servan, Gaël de Chalendar, Benoît Le Ny, & Jean-Jacques Bouzaglou: A hybrid word alignment approach to improve translation lexicons with compound words and idiomatic expressions. Translating and the Computer 32, 18-19 November 2010, London; 10pp. [PDF, 95KB]; presentation, 37 slides [PDF of PPT, 934KB]

Sogitec

(2011) Helena Blancafort, Ulrich Heid, Tatiana Gornostay, Claude Méchoulam, Béatrice Daille & Serge Sharoff. User-centred views on terminology extraction tools: usage scenarios and integration into MT and CAT tools. Tralogy, Paris, 3-4 March 2011;  15pp. [PDF, 201KB]

Strasbourg, Université de

(2011) Mirabela Navlea & Amalia Todiraşcu: Cognate identification for a French-Romanian lexical alignment system: empirical study. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.145-152. [PDF, 271KB]

Supélec

(2011) Johann Roturier & Anthony Bensadoun: Evaluation of MT systems to translate user generated content. MT  Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.244-251. [PDF, 284KB]

Syllabs

(2011) Helena Blancafort, Ulrich Heid, Tatiana Gornostay, Claude Méchoulam, Béatrice Daille & Serge Sharoff. User-centred views on terminology extraction tools: usage scenarios and integration into MT and CAT tools. Tralogy, Paris, 3-4 March 2011;  15pp. [PDF, 201KB]

Systran SA [see also United States of America: Systran Software Inc.]

(2012) Frédéric Blain, Holger Schwenk, & Jean Senellart: Incremental adaptation using translation information and post-editing analysis. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.229-236. [PDF, 873KB]

(2012) Patrik Lambert, Holger Schwenk, & Frédéric Blain: Automatic translation of scientific documents in the HAL archive.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3933-3936. [PDF, 329KB]

(2012) Patrick Lambert, Jean Senellart, Laurent Romary, Holger Schwenk, Florian Zipser, Patrice Lopez, & Frédéric Blain: Collaborative machine translation service for scientific texts. [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 11-15. [PDF, 523KB]

(2011) Frédéric Blain, Jean Senellart, Holger Schwenk, Mirko Plitt, & Johann Roturier: Qualitative analysis of post-editing for high quality machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.164-171. [PDF, 218KB]

 (2011) Karim Boudahmane, Bianka Buschbeck, Eunach Cho, Josep Maria Crego, Markus Freitag, Thomas Lavergne, Hermann Ney, Jan Niehues, Stephan Peitz, Jean Senellart, Artem Sokolov, Alex Waibel, Tonio Wandmacher, Joern Wuebker, & François Yvon: Advances on spoken language translation in the Quaero program. IWSLT 2011: International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011; 7pp. [PDF, 131KB]

(2010) Petr Knoth, Trevor Collins, Elsa Sklavounou, & Zdenek Zdrahal: Facilitating cross-language retrieval and machine translation by multilingual domain ontologies. [LREC 2010] Workshop on Supporting eLearning with Language Resources  and Semantic Data, Valletta, Malta, 22 May 2010; 42 slides. [PDF of PPT, 440KB]

(2010) Philipp Koehn & Jean Senellart: Convergence of translation memory and statistical machine translation. JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.21-31. [PDF, 188KB]

(2010) Philipp Koehn & Jean Senellart: Fast approximate string matching with suffix arrays and A* parsing. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 178KB]

Toulouse: Université Paul Sabatier  see Paul Sabatier, Université de

Toulouse-Le Mirail, Université de

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

Tours, Université François Rabelais

(2011) Emeline Lecuit, Denis Maurel, & Duško Vitas: A tagged and aligned corpus for the study of proper names in translation. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.11-18. [PDF, 244KB]

Translators Without Borders – Traducteurs sans Frontières

(2010) Lori Thicke: Translation technology for fighting world poverty. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 20pp. [PDF, 12614KB]

Troyes, University of Technology

(2011) Philippe Lacour, Any Freitas, Aurélien Bénel, Franck Eyraud & Diana Zambon: Translation and the new digital commons. Tralogy, Paris, 3-4 March 2011; 16 pp. [PDF, 313KB]

Verbatim Analysis

(2014) Valérie Hanoka & Benoît Sagot: YaMTG: an open-source heavily multilingual translation graph extracted from wiktionaries and parallel corpora. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3179-3186. [PDF, 245KB]

Vocapia Research

(2011) Lori Lamel, Sandrine Courcinous, Julien Despres, Jean-Luc Gauvain, Yvan Josse, Kevin Kilgour, Florian Kraft, Viet Bac Le, Hermann Ney, Markus Nußbaum-Thom, Ilya Oparin, Tim Schlippe, Ralf Schlüter, Tanja Schultz, Thiago Fraga da Silva, Sebastian Stüker, Martin Sundermeyer, Bianca Vieru, Ngoc Thang Vu, Alexander Waibel, & Cécile Woehrling: Speech recognition for machine translation in Quaero. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.121-128. [PDF, 310KB]; presentation, 33 slides. [PDF of PPT]

Xerox Research Centre Europe

(2014) Shachar Mirkin & Laurent Besacier: Data selection for compact adapted SMT models. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.301-314. [PDF, 610KB]

(2013) Wilker Aziz, Marc Dymetman & Sriram Venkatapathy: Investigations in exact inference for hierarchical translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.472-483. [PDF, 581KB]

(2013) Shachar Mirkin & Nicola Cancedda: Assessing quick update methods of statistical translation models.  [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 605KB]

(2013) Shachar Mirkin, Sriram Venkatapathy, & Marc Dymetman: Confidence-driven rewriting for improved translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.257-264. [PDF, 616KB]

(2013) Shachar Mirkin, Sriram Venkatapathy, Marc Dymetman, & Ioan Calapodescu: SORT: an interactive source-rewriting tool for improved translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.85-90. [PDF, 943KB]

 (2013) Sara Stymne, Nicola Cancedda, & Lars Ahrenberg: Generation of compound words in statistical machine translation into compounding languages. Computational Linguistics 39 (4); pp.1067-1108. [PDF, 615KB]

(2012) Nicola Cancedda: Private access to phrase tables for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.23-27. [PDF, 135KB]

(2012) Prasanth Kolachina, Nicola Cancedda, Marc Dymetman, & Sriram Venkatapathy: Prediction of learning curves in machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.22-30. [PDF, 368KB]

(2012) Vassilina Nikoulina, Bogomil Kovachev, Nikolaos Lagos, & Christof Monz: Adaptation of statistical machine translation model for cross-lingual information retrieval in a service context. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.109-119. [PDF, 218KB]

(2012) Vassilina Nikoulina, Agnes Sandor, & Marc Dymetman: Hybrid adaptation of named entity recognition for statistical machine translation. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.1-15. [PDF, 198KB]

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

(2011) Sara Stymne & Nicola Cancedda: Productive generation of compound words in statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.250-260. [PDF, 167KB]

(2011) Viet Ha-Thuc & Nicola Cancedda: Confidence-weighted learning of factored discriminitave language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.439-444. [PDF,

(2010) Samidh Chatterjee & Nicola Cancedda: Minimum error rate training by sampling the translation lattice. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.606-615. [PDF, 248KB]

(2010) Marc Dymetman & Nicola Cancedda: Intersecting hierarchical and phrase-based models of translation: formal aspects and algorithms. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.1-9. [PDF, 354KB]

(2010) Benjamin Roth, Andrew McCallum, Marc Dymetman & Nicola Cancedda: Machine translation using overlapping alignments and SampleRank. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 138KB]

(2010) Lucia Specia, Nicola Cancedda, & Marc Dymetman: A dataset for assessing machine translation evaluation metrics. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3375-3378. [PDF, 362KB]

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]