Four papers accepted for presentation at ICML 2023
The following four papers have been accepted for presentation at the Fourtieth International Conference on Machine Learning (external page ICML 2023) :
High-throughput Generative Inference of Large Language Models with a Single GPU. Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, Beidi Chen, Percy Liang,Ion Stoica, Christopher Re, Ce Zhang
FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization. Zhen WANG, Weirui Kuang, Ce Zhang, Bolin Ding, Yaliang Li
CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks. Jue WANG, Yucheng Lu, Binhang Yuan, Beidi Chen, Percy Liang, Christopher De Sa, Christopher Re, Ce Zhang
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time. Zichang Liu, Jue WANG, Tri Dao, Tianyi Zhou, Binhang Yuan, Zhao Song, Anshumali Shrivastava, Ce Zhang, Yuandong Tian, Christopher Re, Beidi Chen