Selected Publications
-
Deep Think with Confidence
Yichao
Fu*
, Xuewei
Wang
, Yuandong
Tian
, and Jiawei
Zhao*
In Preprint , 2025
-
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Jiawei
Zhao, Zhenyu
Zhang
, Beidi
Chen
, Zhangyang
Wang
, Anima
Anandkumar
, and Yuandong
Tian
In International Conference on Machine Learning (ICML) , 2024
Oral presentation (top 1.5%)
-
Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts
Haizhong
Zheng
, Yang
Zhou
, Brian R.
Bartoldson
, Bhavya
Kailkhura
, Fan
Lai
, Jiawei
Zhao, and Beidi
Chen
In Advances in Neural Information Processing Systems (NeurIPS) , 2025
-
GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection
DiJia
Su
, Andrew
Gu
, Jane
Xu
, Yuandong
Tian
, and Jiawei
Zhao
In Preprint , 2025
-
ZerO Initialization: Initializing Neural Networks with only Zeros and Ones
Jiawei
Zhao, Florian Tobias
Schaefer
, and Anima
Anandkumar
Transactions on Machine Learning Research (TMLR), 2022
-
signSGD with Majority Vote is Communication Efficient and Fault Tolerant
Jeremy
Bernstein*
, Jiawei
Zhao*, Kamyar
Azizzadenesheli
, and Anima
Anandkumar
In International Conference on Learning Representations (ICLR) , 2019
Recent Publications
-
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization
Zechun
Liu
, Changsheng
Zhao
, Hanxian
Huang
, Sijia
Chen
, Jing
Zhang
, Jiawei
Zhao, Scott
Roy
, Lisa
Jin
, Yunyang
Xiong
, Yangyang
Shi
, Lin
Xiao
, Yuandong
Tian
, Bilge
Soran
, Raghuraman
Krishnamoorthi
, Tijmen
Blankevoort
, and Vikas
Chandra
In Advances in Neural Information Processing Systems (NeurIPS) , 2025
-
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
Ajay
Jaiswal
, Yifan
Wang
, Lu
Yin
, Shiwei
Liu
, Runjin
Chen
, Jiawei
Zhao, Ananth
Grama
, Yuandong
Tian
, and Zhangyang
Wang
In International Conference on Machine Learning (ICML) , 2025
-
S^2FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity
Xinyu
Yang
, Jixuan
Leng
, Geyang
Guo
, Jiawei
Zhao, Ryumei
Nakada
, Linjun
Zhang
, Huaxiu
Yao
, and Beidi
Chen
In Advances in Neural Information Processing Systems (NeurIPS) , 2024
-
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
Cheng
Luo
, Zefan
Cai
, Hanshi
Sun
, Jinqi
Xiao
, Bo
Yuan
, Wen
Xiao
, Junjie
Hu
, Jiawei
Zhao, Beidi
Chen
, and Anima
Anandkumar
In Preprint , 2025
-
Please refer to my Google Scholar for a complete list of publications.