Files
variet_llm/scripts/qwen_latest.txt

24 lines
1.5 KiB
Plaintext

UD-IQ4_NL | pure-GPU minbatch | 65.11 | GPU | 19177
UD-IQ4_NL | pure-GPU nommap small | 65.01 | GPU | 19672
UD-IQ4_NL | pure-GPU row-split | 13.65 | GPU | 19427
UD-IQ4_NL | pure-GPU ts=0.5,0.5 | 64.92 | GPU | 19664
UD-IQ4_NL | pure-GPU all-tricks | 64.72 | GPU | 19171
UD-IQ4_NL | tune t=2 | 64.87 | GPU | 19170
UD-IQ4_NL | tune t=6 | 64.88 | GPU | 19168
UD-IQ4_NL | tune t=8 | 64.5 | GPU | 19168
UD-IQ4_NL | tune ub=256 b=1024 | 64.73 | GPU | 20640
UD-IQ4_NL | tune ub=256 b=2048 | 63.69 | GPU | 20614
UD-IQ4_NL | tune kv=q8_0/q8_0 | 64.78 | GPU | 20422
UD-IQ4_NL | tune kv=f16/f16 | 65.53 | GPU | 22812
UD-IQ4_NL | FINAL | 66.31 | GPU | 22811
MXFP4_MOE | pure-GPU minbatch | 63.06 | GPU | 22747
MXFP4_MOE | pure-GPU nommap small | 63.75 | GPU | 22579
MXFP4_MOE | pure-GPU ts=0.5,0.5 | 62.88 | GPU | 22578
MXFP4_MOE | pure-GPU all-tricks | 62.55 | GPU | 22743
MXFP4_MOE | tune t=2 | 63.07 | GPU | 22601
MXFP4_MOE | tune t=6 | 63.58 | GPU | 22583
MXFP4_MOE | tune t=8 | 62.92 | GPU | 22536
MXFP4_MOE | tune ub=256 b=1024 | 62.76 | GPU | 22874
MXFP4_MOE | tune ub=256 b=2048 | 62.74 | GPU | 22912
MXFP4_MOE | FINAL | 63.71 | GPU | 22566
Q4_K_M | pure-GPU nommap small | 62.29 | GPU | 22975