LO-BCQ: Locally Optimal Block Clustered Quantization for 4-bit (W4A4) LLM Inference
Reena Elangovan, Charbel Sakr, Anand Raghunathan, Brucek Khailany
Action editor: Yunhe Wang
https://openreview.net/forum?id=loWISTqGwW
#quantization #quantizing #blocks
#quantizing
Posts tagged #quantizing on Bluesky
0
0
0
0
Accumulator-Aware Post-Training Quantization for Large Language Models
Ian Colbert, Giuseppe Franco, Fabian Grob, Jinjie Zhang, Rayan Saab
Action editor: Jundong Li
https://openreview.net/forum?id=p6l0579yj7
#quantization #quantizing #multiplications
0
0
0
0
New #Featured Certification, #J2C Certification:
LO-BCQ: Locally Optimal Block Clustered Quantization for 4-bit (W4A4) LLM Inference
Reena Elangovan, Charbel Sakr, Anand Raghunathan, Brucek Khailany
https://openreview.net/forum?id=loWISTqGwW
#quantization #quantizing #blocks
0
0
0
0