Jacob Collard

National Institute of Standards and Technology

Information Technology Laboratory



I am a post-doctoral researcher at the National Institute of Standards and Technology, having received my Ph.D in linguistics in 2020 from Cornell University. My main research interests are minimally supervised, linguistically-motivated learning algorithms for syntax and semantics and computational models of semantics and pragmatics. In my spare time, I enjoy hiking, reading, creative writing, drawing, and game design.

Selected Publications

Unsupervised Formal Grammar Induction with Confidence, New Orleans, Louisiana, October 2019. Association for Computation in Linguistics. [PDF] [BIB]
Talapady N Bhat; Eswaran Subrahmanian; Ursula Kattner; John Elliott; Carelyn Campbell; Ira Monarch; and Jacob Collard. Generating domain ontologies using root- and rule-based terms. Journal of the Washington Academy of Science, 2018. [BIB]
Talapady N Bhat; Eswaran Subrahmanian; John Elliott; Ira Monarch; Brian McMahon; Peter R Strickland; and Jacob Collard. R&R - a de novo method to create search terms for IUCr documents. Acta Crystallographica, 2018. [BIB]
Finite State Reasoning for Presupposition Satisfaction, Santa Fe, New Mexico, August 2018. Association for Computation in Linguistics. [ONLINE] [PDF] [BIB]


  • APLN 505: Semantics and Pragmatics (Spring 2021; Montclair State University)
  • COGST 1101: Introduction to Cognitive Science (Spring 2018; Cornell University) — with Professor Jennifer Schwade
  • LING 1101: Introduction to Linguistics (Fall 2017; Cornell University) — with Professor Miloje Despic
  • LING 4424: Computational Linguistics (Spring 2017; Cornell University) — with Professor John Hale
  • LING 1100-103: Endangered Languages (Fall 2016; Cornell University)


I was on the editing team for Semantics and Linguistic Theory 26 and 27 and on the organizing committee for NELS 49. I have also served several volunteer positions for the Cornell Linguistics Circle, including president and web administrator, and served as the organizer for two reading groups: the semantics reading group and the computational linguistics reading group.