I am a final-year PhD candidate at MIT EECS, where I am advised by Aleksander Mądry. Before starting my PhD, I was a pre-doc at Microsoft Research, where I worked with Praneeth Netrapalli and Prateek Jain. I studied Statistics and Computer Science at the University of Illinois at Urbana-Champaign.


Research

Do Language Models Robustly Acquire New Knowledge?
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity of MoE Language Models
ContextCite: Attributing Model Generation to Context
Decomposing and Editing Predictions by Modeling Model Computation
ModelDiff: A Framework for Comparing Learning Algorithms
Do Input Gradients Highlight Discriminative Features?
The Pitfalls of Simplicity Bias in Neural Networks
Growing Attributed Networks through Local Processes