- Papers about our work
- Please visit the ARK @ CMU for software and datasets developed there.
- idea_relations, a framework to identify relations between ideas in temporal text corpora (ACL 2017)
- twitter_langid, hierarchical character-word neural network for word-level language identification in tweets (SocialNLP 2016)
Noah's ARK is an informal collection of researchers within the Natural Language Processing group at the University of Washington and formerly at Carnegie Mellon University's Language Technologies Institute.
Available for download: numerous software tools and datasets for and papers about natural language processing.Read More
ARK researchers and alumni are leaders in natural language processing, machine learning, and computational social science.Read More
Dallas Card, CMU Ph.D. student: NLP and ML for social science
Elizabeth Clark, UW Ph.D. student: NLP
Jesse Dodge, CMU Ph.D. student: NLP
Jeffrey Flanigan, CMU Ph.D. student:semantic parsing and generation; machine translation
Yangfeng Ji, UW post-doc: discourse, machine learning for NLP
Lingpeng Kong, CMU Ph.D. student: structured prediction for NLP
Lucy Lin, UW Ph.D. student: NLP for social science
Nelson Liu, UW undergraduate: text-driven forecasting
Kelvin Luu, UW Ph.D. student: NLP for education
George Mulcaire, UW Ph.D. student: multilingual NLP
Hao Peng, UW Ph.D. student: NLP
Nikko Rush UW undergraduate: NLP
Nathan Schneider defended his Ph.D. thesis, Lexical Semantic Analysis in Natural Language Text, in 2014 at CMU. He is a postdoctoral researcher at the University of Edinburgh.
Tae Yano defended her Ph.D. thesis, Text as Actuator: Text-Driven Response Modeling and Prediction in Politics, in 2013 at CMU. She is a research developer at Microsoft.
Kevin Gimpel defended his Ph.D. thesis, Discriminative Feature-Rich Modeling for Syntax-Based Machine Translation, in 2012 at CMU. He is an assistant professor at the Toyota Technological Institute at Chicago.
André Martins defended his Ph.D. thesis, The Geometry of Constrained Structured Prediction: Applications to Inference and Learning of Natural Language Syntax, in 2012 at CMU and the Instituto Superior Técnico in Lisbon. He is head of research at Unbabel.
Dipanjan Das defended his Ph.D. thesis, Semi-Supervised and Latent-Variable Models of Natural Language Semantics, in 2012 at CMU. He is a research scientist at Google.
Shay Cohen defended his Ph.D. thesis, Computational Learning of Probabilistic Grammars in the Unsupervised Setting, in 2011 at CMU. He is a lecturer at the University of Edinburgh.
- Chris Dyer (CMU post-doc, 2010–12), assistant professor at Carnegie Mellon University
- Fei Liu (CMU post-doc, 2013–15), assistant professor at the University of Central Florida
- Behrang Mohit (CMU-Q post-doc, 2010–12), research scientist at Ask.com
- Cari (Sisson) Bader (CMU MS, 2008), Nuance
- Victor Chahuneau (CMU MS, 2013)
- Mengqiu Wang (CMU MS 2007)
- Rohan Ramanath (CMU MS 2015), LinkedIn
- Miguel Ballesteros (CMU visitor, 2014–16)
- Rishav Bhowmick (CMU-Q BS, 2010)
- Desai Chen (CMU BS, 2010), Ph.D. student at MIT
- Philip Gianfortoni (CMU research programmer, 2009–10)
- Swapna Gottipati (CMU visitor, 2012), Singapore Management University
- Mohammad Haque (CMU BS, 2010), Endeca
- Dimitry Levin (CMU BS, 2010), Citigroup
- Zack McCord (CMU BS)
- Bill McDowell (CMU research programmer, 2013–14)
- Zach Paine (CMU research programmer, 2008), Apple
- Daniel Preoțiuc-Pietro (CMU visitor, 2013)
- Naomi Saphra (CMU BS, 2013), Ph.D. student at Johns Hopkins University/University of Edinburgh
- Neel Shah (CMU BS, 2010)
- Dan Tasse (CMU BS, 2008), Ph.D. student at CMU
- Mattt Thompson (CMU BA, 2009)