IQ (importance-matrix quantization)

A family of quantizations (IQ3_M, IQ4_NL, etc.) that use a calibration dataset to identify which weights matter most, applying higher precision selectively. Often better quality than K-quants at the same bit depth, especially at lower bit levels (2–3 bit).

Referenced in