GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

Published in , 2024