MegaByte: Predicting Million-byte Sequences with Multiscale Transformers

Lili Yu    Dániel Simig    Colin Flaherty    Armen Aghajanyan    Luke Zettlemoyer    Mike Lewis
Machine Learning, ICML