NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models Paper • 2602.06694 • Published 7 days ago
LittleBit: Ultra Low-Bit Quantization via Latent Factorization Paper • 2506.13771 • Published May 30, 2025
RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs Paper • 2602.05367 • Published 8 days ago