I am a PhD candidate at MIT in the Department of Brain & Cognitive Sciences (BCS), and a member of the GenLM research consortium, where we are building an open-source ecosystem for language model probabilistic programming. I have previously interned in AI Research at Apple. I am grateful for my many wonderful mentors and collaborators across these communities.
My research, which is funded by the NSF GRFP and an MIT Presidential Fellowship, draws from diverse disciplines spanning cognitive science, Bayesian machine learning, and NLP. Currently, I'm focused on developing train-time and test-time algorithms for reliably controlling language models. I am particularly interested in tasks involving goal inference, long-horizon planning, and sparse reward.
Select Projects ‣ Sampling Algorithms & Programming Models for LLMs ‣ AWRS [COLM]: Fast randomized algorithm for constrained decoding as posterior inference.
‣ GenLM [ICLR Oral]: Controlling LLM generation via programmable constraints and sequential Monte Carlo.
‣ Decoding [GitHub]: An open-source library for compositional language model programs.
‣ Reasoning, Pragmatics, & World Knowledge ‣ ProbSem [CogSci]: Pragmatic semantic parsing via LLM-mediated approximate inference. ‣ LINC [EMNLP Outstanding Paper]: Combining LLMs with SMT solvers for provably consistent reasoning. ‣ EWoK [TACL]: Benchmarking LLMs on core knowledge and world modeling.
‣ AI for Code & Mathematics
‣ BrainCode [NeurIPS]: An investigation of how LLMs encode computer programs.
‣ HumanMath [NeurIPS Math AI Workshop]: Opinion piece on the communicative role of mathematics.
Open Source I care about open source and allocate a portion of my time towards community contributions. These have previously included the development and evaluation of code models with the Star Coder project by Hugging Face and Service Now, the first implementation of CFG-guided text generation for the Outlines library by dottxt-ai, and contributions to AI for mathematics with Project Numina.
June 2024: Joined Project Numina as a contributor, facilitating the release of the models, datasets, and code used to win the first AIMO progress prize.