Bohan Zhuang’s Post

View profile for Bohan Zhuang, graphic

Faculty @ Zhejiang University; Adjunct Faculty @ Monash University; Visiting Research Scientist @ DAMO Academy

We'd like to share our latest simple and effective work, ZipAR, which reduces 91% of auto-regressive image generation overhead without any training. 🔮 🔮 Addressing the issue of slow decoding in AR image generation models (such as Emu3, Lumina-mGPT, LlamaGen, and Janus), we propose a training-free parallel decoding framework that achieves up to 91% reduction in image generation overhead while preserving almost all image details and semantic information. This is an early-stage work, and we are continuing to improve it. Feedback and discussions are very welcome! Paper link: https://lnkd.in/gizCB4bK Code link: https://lnkd.in/gjDvmEsh

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics