Hi! I am a first year Computer Science PhD student at University of Wisconsin–Madison working with Tengyang Xie and Fred Sala. As an undergraduate at Brown University, I was fortunate to work with Stephen Bach.

My research interests include reasoning, data-centric ML, and synthetic data. My recent emphasis has been on understanding the capabilities of RL post-training, and understanding the role that data plays. Feel free to reach out for a chat!

Education

2024—Present

University of Wisconsin—Madison

Ph.D. in Computer Science

Working with: Prof. Fred Sala, Prof. Tengyang Xie

2020—2024

Brown University

Sc.B. in Mathematics-Computer Science

Worked with: Prof. Stephen Bach

Publications

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

Albert Ge, Tzu-Heng Huang, John Cooper, Avi Trost, Ziyi Chu, Satya Sai Srinath Namburi GNVV, Ziyang Cai, Kendall Park, Nicholas Roberts, Frederic Sala

Reclustering data and reusing training gradients for data mixing.

Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation

ACL Findings 2024

Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation

Nihal V. Nayak, Yiyang Nan, Avi Trost, Stephen H. Bach

Training a model to generate conditional synthetic data for task adaptation.