Research | Jiawei Zhao

Selected Publications

Deep Think with Confidence

Yichao Fu* , Xuewei Wang , Yuandong Tian , and Jiawei Zhao*

In Preprint , 2025

arXiv Code
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Jiawei Zhao, Zhenyu Zhang , Beidi Chen , Zhangyang Wang , Anima Anandkumar , and Yuandong Tian

In International Conference on Machine Learning (ICML) , 2024

Oral presentation (top 1.5%)

arXiv Code
Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts

Haizhong Zheng , Yang Zhou , Brian R. Bartoldson , Bhavya Kailkhura , Fan Lai , Jiawei Zhao, and Beidi Chen

In Advances in Neural Information Processing Systems (NeurIPS) , 2025

arXiv
GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection

DiJia Su , Andrew Gu , Jane Xu , Yuandong Tian , and Jiawei Zhao

In Preprint , 2025

arXiv
ZerO Initialization: Initializing Neural Networks with only Zeros and Ones

Jiawei Zhao, Florian Tobias Schaefer , and Anima Anandkumar

Transactions on Machine Learning Research (TMLR), 2022

arXiv Code
signSGD with Majority Vote is Communication Efficient and Fault Tolerant

Jeremy Bernstein* , Jiawei Zhao*, Kamyar Azizzadenesheli , and Anima Anandkumar

In International Conference on Learning Representations (ICLR) , 2019

arXiv Code

Recent Publications

Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?

Haizhong Zheng , Jiawei Zhao, and Beidi Chen

In Preprint , 2025

arXiv Code
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization

Zechun Liu , Changsheng Zhao , Hanxian Huang , Sijia Chen , Jing Zhang , Jiawei Zhao, Scott Roy , Lisa Jin , Yunyang Xiong , Yangyang Shi , Lin Xiao , Yuandong Tian , Bilge Soran , Raghuraman Krishnamoorthi , Tijmen Blankevoort , and Vikas Chandra

In Advances in Neural Information Processing Systems (NeurIPS) , 2025

arXiv
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications

Ajay Jaiswal , Yifan Wang , Lu Yin , Shiwei Liu , Runjin Chen , Jiawei Zhao, Ananth Grama , Yuandong Tian , and Zhangyang Wang

In International Conference on Machine Learning (ICML) , 2025

arXiv
S^2FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity

Xinyu Yang , Jixuan Leng , Geyang Guo , Jiawei Zhao, Ryumei Nakada , Linjun Zhang , Huaxiu Yao , and Beidi Chen

In Advances in Neural Information Processing Systems (NeurIPS) , 2024

arXiv
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

Cheng Luo , Zefan Cai , Hanshi Sun , Jinqi Xiao , Bo Yuan , Wen Xiao , Junjie Hu , Jiawei Zhao, Beidi Chen , and Anima Anandkumar

In Preprint , 2025

arXiv
Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences Training

Cheng Luo , Jiawei Zhao, Zhuoming Chen , Beidi Chen , and Anima Anandkumar

In Advances in Neural Information Processing Systems (NeurIPS) , 2024

arXiv

Please refer to my Google Scholar for a complete list of publications.