My research is concerned with language and its role in human communication. In particular, my long-term research goal is to break down barriers in human-human or human-machine communication through the development of natural language processing (NLP) technologies. This includes the development of technology for machine translation, which helps break down barriers in communication for people who speak different languages, and natural language understanding, which helps computers understand and respond to human language. Within this overall goal of breaking down barriers to human communication, I have focused on several aspects of language that both make it interesting as a scientific subject, and hold potential for the construction of practical systems. Specific areas of interest include:
- Multilingual Language Processing
- Machine Translation
- Syntactic and Semantic Analysis
- Cross-lingual Learning
- Natural Language Interfaces to Computers
- Natural Language to Code Generation
- Question Answering and Information Extraction
- Modeling Human-Computer or Human-Human Interaction
- Machine Learning for NLP
- Neural Network Models for NLP
- Unsupervised and Semi-supervised Learning
- 9/2016-onward Carnegie Mellon University (CMU): Assistant Professor
- 4/2012-8/2016 Nara Institute of Science and Technology (NAIST): Assistant Professor
- 4/2010-3/2012 Kyoto University: Doctoral course in Intelligent Information Systems
- 4/2008-3/2010 Kyoto University: Master's course in Intelligent Information Systems
- 8/2006-3/2008 Hyogo Prefectural Government: Coordinator for International Relations
- 9/2005-7/2006 Tajima Agricultural High School: Assistant Language Teacher
- 8/2001-5/2005 University of Illinois, Urbana-Champaign: B.S. Computer Science
Here is a list of a few of my current favorite papers:
- Odette Scharenborg, Laurent Besacier, Alan Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux.
Speech Technology for Unwritten Languages (BibTex)
Transactions on Audio, Speech, and Language Processing. 2020 (To Appear).
- Xinjian Li, Siddharth Dalmia, Juncheng Li, Patrick Littell, Matthew Lee, Jiali Yao, Antonios Anastasopoulos, David Mortensen, Graham Neubig, Alan Black, Florian Metze.
Universal Phone Recognition with a Multilingual Allophone System (BibTex)
2020 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Barcelona. May 2020 (To Appear).
- Chunting Zhou, Jiatao Gu, Graham Neubig.
Understanding Knowledge Distillation in Non-autoregressive Machine Translation (BibTex)
International Conference on Learning Representations (ICLR). Addis Ababa, Ethiopia. April 2020 (To Appear).
- Junxian He, Xinyi Wang, Graham Neubig, Taylor Berg-Kirkpatrick.
A Probabilistic Formulation of Unsupervised Text Style Transfer (BibTex, Code/Data)
International Conference on Learning Representations (ICLR). Addis Ababa, Ethiopia. April 2020 (To Appear). Spotlight.
- Bhuwan Dhingra, Manzil Zaheer, Vidhisha Balachandran, Graham Neubig, Ruslan Salakhutdinov, William W. Cohen.
Differentiable Reasoning over a Virtual Knowledge Base (BibTex)
International Conference on Learning Representations (ICLR). Addis Ababa, Ethiopia. April 2020 (To Appear). Talk.
- Slides for tutorials and classes can be found on my teaching page
- Software and resources that I've developed can be found on my software page.
- Tools for Natural Language Processing
- The Kyoto Free Translation Task: A task that can be used for evaluation of English-Japanese translation systems
- Japanese Parallel Data: A list of various data that can be used to create machine translation systms to/from Japanese