Nice work! It would be helpful to know the details of the inference setup. Did you use vLLM or Transformers? Are you using a specific evaluation framework such as lighteval or lm-evaluation-harness?