SINQ
Collection
This collection contains models quantized with the SINQ (Sinkhorn-Normalized Quantization) method.
19 items
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding