Towards Optimal Multi-draft Speculative
Decoding
Zhengmian Hu
Tong Zheng
Vignesh Viswanathan
Ziyi Chen
Ryan A. Rossi
Yihan Wu
Dinesh Manocha
Heng Huang
11affiliationtext: Department of Computer Science, University of Maryland, College Park, MD, USA22affiliationtext: Adobe Research, San Jose, CA, USA33affiliationtext: Manning College of Information & Computer Sciences, University of Massachusetts Amherst, MA, USA44affiliationtext: Department of Electrical and Computer Engineering, University of Maryland, College Park, MD, USA