MLSys'24 Best Paper - AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Download MP3Related Videos
1:06:02 57:57 1:06:32 51:34 1:06:21 1:09:26 56:18 1:17:05 1:08:38 0:13 0:26 31:53 1:17:48 1:15:24 1:06:28 1:10:36 1:06:30 51:25 1:08:37 1:06:00