Publications

Brian DuSell and David Chiang. Learning hierarchical structures with differentiable nondeterministic stacks. 2021. arXiv:2109.01982. PDF BibTeX
Isaac Caswell, Julia Kreutzer, Lisa Wang, Ahsan Wahab, Daan van Esch, Nasanbayar Ulzii-Orshikh, Allahsera Tapo, Nishant Subramani, Artem Sokolov, Claytone Sikasote, Monang Setyawan, Supheakmungkol Sarin, Sokhar Samb, Benoît Sagot, Clara Rivera, Annette Rios, Isabel Papadimitriou, Salomey Osei, Pedro Javier Ortiz Suárez, Iroro Orife, Kelechi Ogueji, Rubungo Andre Niyongabo, Toan Q. Nguyen, Mathias Müller, André Müller, Shamsuddeen Hassan Muhammad, Nanda Muhammad, Ayanda Mnyakeni, Jamshidbek Mirzakhalov, Tapiwanashe Matangira, Colin Leong, Nze Lawson, Sneha Kudugunta, Yacine Jernite, Mathias Jenny, Orhan Firat, Bonaventure F. P. Dossou, Sakhile Dlamini, Nisansa de Silva, Sakine Çabuk Ballı, Stella Biderman, Alessia Battisti, Ahmed Baruwa, Ankur Bapna, Pallavi Baljekar, Israel Abebe Azime, Ayodele Awokoya, Duygu Ataman, Orevaoghene Ahia, Oghenefego Ahia, Sweta Agrawal, and Mofetoluwa Adeyemi. Quality at a glance: an audit of Web-crawled multilingual datasets. In Proc. AfricaNLP. 2021. PDF BibTeX
Samuel Grieggs, Bingyu Shen, Greta Rauch, Pei Li, Jiaqi Ma, David Chiang, Brian Price, and Walter Scheirer. Measuring human perception to improve handwritten document transcription. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021. doi:10.1109/TPAMI.2021.3092688. DOI BibTeX
Toan Q. Nguyen, Kenton Murray, and David Chiang. Data augmentation by concatenation for low-resource translation: a mystery and a solution. In Proc. Conference on Spoken Language Translation. 2021. PDF BibTeX
David Chiang and Colin McDonald. Syntax-based attention masking for neural machine translation. In Proc. NAACL Student Research Workshop. 2021. PDF BibTeX
David Chiang, Alexander M. Rush, and Boaz Barak. Named tensor notation. 2021. arXiv:2102.13196. PDF BibTeX
David Chiang and Chung-chieh Shan. Translating recursive probabilistic programs to factor graph grammars. 2020. Presented at PROBPROG 2020. PDF BibTeX
David Chiang and Darcey Riley. Factor graph grammars. In Proc. NeurIPS. 2020. PDF BibTeX
Brian DuSell and David Chiang. Learning context-free languages with nondeterministic stack RNNs. In Proc. CoNLL, 507–519. 2020. PDF BibTeX
Julian Salazar, Davis Liang, Toan Q. Nguyen, and Katrin Kirchhoff. Masked language model scoring. In Proc. ACL, 2699–2712. 2020. doi:10.18653/v1/2020.acl-main.240. PDF BibTeX
Justin DeBenedetto and David Chiang. Representing unordered data using complex-weighted multiset automata. In Hal Daumé III and Aarti Singh, editors, Proc. ICML, volume 119 of Proceedings of Machine Learning Research, 2412–2420. 2020. PDF BibTeX
Toan Q. Nguyen and Julian Salazar. Transformers without tears: improving the normalization of self-attention. In Proc. Workshop on Spoken Language Translation. 2019. doi:10.5281/zenodo.3525484. DOI BibTeX
Kenton Murray, Jeffery Kinnison, Toan Q. Nguyen, Walter Scheirer, and David Chiang. Auto-sizing the Transformer network: improving speed, efficiency, and performance for low-resource machine translation. In Proc. Workshop on Neural Generation and Translation, 231–240. 2019. PDF BibTeX
Arturo Argueta and David Chiang. Accelerating sparse matrix operations in neural networks on graphics processing units. In Proc. ACL, 6215–6224. 2019. PDF BibTeX
Antonios Anastasopoulos, Alison Lui, Toan Q. Nguyen, and David Chiang. Neural machine translation of text from non-native speakers. In Proc. NAACL: HLT, volume 1, 3070–3080. 2019. PDF BibTeX
Xuan Zhang, Gaurav Kumar, Huda Khayrallah, Kenton Murray, Jeremy Gwinnup, Marianna J Martindale, Paul McNamee, Kevin Duh, and Marine Carpuat. An empirical exploration of curriculum learning for neural machine translation. 2018. arXiv:1811.00739. PDF BibTeX
Kenton Murray and David Chiang. Correcting length bias in neural machine translation. In Proc. WMT, 212–223. 2018. PDF BibTeX
Brian Thompson, Huda Khayrallah, Antonios Anastasopoulos, Arya D. McCarthy, Kevin Duh, Rebecca Marvin, Paul McNamee, Jeremy Gwinnup, Tim Anderson, and Philipp Koehn. Freezing subnetworks to analyze domain adaptation in neural machine translation. In Proc. WMT, 124–132. 2018. PDF BibTeX
Xinyi Wang, Salvador Aguinaga, Tim Weninger, and David Chiang. Growing better graphs with latent-variable probabilistic graph grammars. In Proc. Workshop on Mining and Learning with Grammars. 2018. PDF BibTeX
Antonios Anastasopoulos, Marika Lekakou, Josep Quer, Eleni Zimianiti, Justin DeBenedetto, and David Chiang. Part-of-speech tagging on an endangered language: a parallel Griko-Italian resource. In Proc. COLING, 2529–2539. 2018. PDF BibTeX
Marcely Zanon Boito, Antonios Anastasopoulos, Marika Lekakou, Aline Villavicencio, and Laurent Besacier. A small Griko-Italian speech translation corpus. In Proc. Workshop on Spoken Language Technologies for Under-Resourced Languages. 2018. BibTeX
Arturo Argueta and David Chiang. Composing finite state transducers on GPUs. In Proc. ACL, 2697–2705. 2018. PDF BibTeX
Justin DeBenedetto and David Chiang. Algorithms and training for weighted multiset automata and regular expressions. In Proc. Conference on Implementation and Applications of Automata, 146–158. 2018. PDF BibTeX
Antonios Anastasopoulos and David Chiang. Leveraging translations for speech transcription in low-resource settings. In Proc. INTERSPEECH. 2018. PDF BibTeX
Corey Pennycuff, Satyaki Sikdar, Catalina Vajiac, David Chiang, and Tim Weninger. Synchronous hyperedge replacement graph grammars. In Proc. Conference on Graph Transformations. 2018. BibTeX
Antonios Anastasopoulos and David Chiang. Tied multitask learning for neural speech translation. In Proc. NAACL: HLT, volume 1, 82–91. 2018. PDF BibTeX
Toan Nguyen and David Chiang. Improving lexical choice in neural machine translation. In Proc. NAACL: HLT, volume 1, 334–343. 2018. PDF BibTeX
Huadong Chen, Shujian Huang, David Chiang, Xinyu Dai, and Jiajun Chen. Combining character and word information in neural machine translation using a multi-level attention. In Proc. NAACL: HLT, volume 1, 1284–1293. 2018. PDF BibTeX
Salvador Aguinaga, David Chiang, and Tim Weninger. Learning hyperedge replacement grammars for graph generation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(3):625–638, 2019. doi:10.1109/TPAMI.2018.2810877. PDF BibTeX
David Chiang, Frank Drewes, Daniel Gildea, Adam Lopez, and Giorgio Satta. Weighted DAG automata for semantic graphs. Computational Linguistics, 44(1):119–186, 2018. PDF BibTeX
Graham Neubig, Chris Dyer, Yoav Goldberg, Austin Matthews, Waleed Ammar, Antonios Anastasopoulos, Miguel Ballesteros, David Chiang, Daniel Clothiaux, Trevor Cohn, Kevin Duh, Manaal Faruqui, Cynthia Gan, Dan Garrette, Yangfeng Ji, Lingpeng Kong, Adhiguna Kuncoro, Gaurav Kumar, Chaitanya Malaviya, Paul Michel, Yusuke Oda, Matthew Richardson, Naomi Saphra, Swabha Swayamdipta, and Pengcheng Yin. DyNet: the dynamic neural network toolkit. 2017. arXiv:1701.03980. PDF BibTeX
Toan Q. Nguyen and David Chiang. Transfer learning across low-resource, related languages for neural machine translation. In Proc. IJCNLP, volume 2, 296–301. 2017. PDF BibTeX
Huadong Chen, Shujian Huang, David Chiang, Xin-Yu Dai, and Jiajun Chen. Top-rank enhanced listwise optimization for statistical machine translation. In Proc. CoNLL, 90–99. 2017. PDF BibTeX
Antonios Anastasopoulos, Sameer Bansal, David Chiang, Sharon Goldwater, and Adam Lopez. Spoken term discovery for language documentation using translations. In Proc. Workshop on Speech-Centric NLP, 53–58. 2017. PDF BibTeX
Antonios Anastasopoulos and David Chiang. A case study on using speech-to-translation alignments for language documentation. In Proc. Workshop on Use of Computational Methods in Study of Endangered Languages, 170–178. 2017. PDF BibTeX
Huadong Chen, Shujian Huang, David Chiang, and Jiajun Chen. Improved neural machine translation with a syntax-aware encoder and decoder. In Proc. ACL, volume 1, 1936–1945. 2017. PDF BibTeX
Arturo Argueta and David Chiang. Decoding with finite-state transducers on GPUs. In Proc. EACL, volume 1, 1044–1052. 2017. PDF BibTeX
Ulf Hermjakob, Qiang Li, Daniel Marcu, Jonathan May, Sebastian J. Mielke, Nima Pourdamghani, Michael Pust, Xing Shi, Kevin Knight, Tomer Levinboim, Kenton Murray, David Chiang, Boliang Zhang, Xiaoman Pan, Di Lu, Ying Lin, and Heng Ji. Incident-driven machine translation and name tagging for low-resource languages. Machine Translation, 32(1–2):59–89, 2018. doi:10.1007/s10590-017-9207-1. DOI BibTeX
Antonios Anastasopoulos, David Chiang, and Long Duong. An unsupervised probability model for speech-to-translation alignment of low-resource languages. In Proc. EMNLP, 1255–1263. 2016. PDF BibTeX
Salvador Aguiñaga, Rodrigo Palacios, David Chiang, and Tim Weninger. Growing graphs from hyperedge replacement graph grammars. In Proc. CIKM, 469–478. 2016. doi:10.1145/2983323.2983826. DOI BibTeX
Long Duong, Antonios Anastasopoulos, David Chiang, Steven Bird, and Trevor Cohn. An attentional model for speech translation without transcription. In Proc. NAACL: HLT, 949–959. 2016. PDF BibTeX
Kenton W. Murray and Jayant Krishnamurthy. Probabilistic neural programs. In Proc. Workshop on Neural Abstract Machines and Program Induction. 2016. PDF BibTeX
Tomer Levinboim and David Chiang. Supervised phrase table triangulation with neural word embeddings for low-resource languages. In Proc. EMNLP, 1079–1083. 2015. PDF BibTeX
Tomer Levinboim and David Chiang. Multi-task word alignment triangulation for low-resource languages. In Proc. NAACL: HLT, 1221–1226. 2015. PDF BibTeX
Kenton Murray and David Chiang. Auto-sizing neural networks: with applications to \(n\)-gram language models. In Proc. EMNLP, 908–916. 2015. PDF BibTeX
Tomer Levinboim, Ashish Vaswani, and David Chiang. Model invertibility regularization: sequence alignment with or without parallel data. In Proc. NAACL: HLT, 609–618. 2015. PDF Code BibTeX
Steven Bird, David Chiang, Friedel Frowein, Florian Hanke, and Ashish Vaswani. Documentary linguistics and computational linguistics: a response to Brooks. Language Documentation and Conservation, 9:10–11, 2015. BibTeX