Llama3.1-SuperDeepFuse-CrashCourse12K
Llama3.1-SuperDeepFuse-CrashCourse12K is an 8B parameter language model based on Llama3.1-SuperDeepFuse
and further fine-tuned on agentlans/crash-course.
Model Details
- Base Model: Llama3.1-SuperDeepFuse (8B parameters)
- Fine-tuning Dataset: 12 000 samples from agentlans/crash-course (containing samples from 10 high-quality instruct datasets)
- Model Type: Instruction-tuned language model
- Language(s): Multilingual
- License: Follows standard Llama 3.1 usage terms
Training Procedure
Fine-tuning
- Method: LoRA (Low-Rank Adaptation)
- Optimizer: AdamW
- Learning Rate: 5e-5
- Batch Size: 2 per device
- Gradient Accumulation Steps: 8
- Training Epochs: 1
- Max Sequence Length: 2048
- LoRA Configuration:
- Rank: 8
- Alpha: 16
- Dropout: 0.5
- Target: all layers
- Quantization: 4-bit (bitsandbytes)
- Precision: BF16
- Other Techniques: NEFTune (noise alpha: 5), RS-LoRA
Performance and Limitations
This model potentially offers:
- Enhanced multi-task reasoning
- Improved performance in mathematics and coding tasks
- Better instruction-following abilities
However:
- Performance may be limited compared to larger model variants
- Can produce misleading or incorrect outputs
- Outputs should be independently verified for critical applications
Additional Information
Open LLM Leaderboard Evaluation Results
Detailed results can be found here!
Summarized results can be found here!
| Metric |
Value (%) |
| Average |
27.93 |
| IFEval (0-Shot) |
71.87 |
| BBH (3-Shot) |
31.83 |
| MATH Lvl 5 (4-Shot) |
17.67 |
| GPQA (0-shot) |
8.39 |
| MuSR (0-shot) |
8.60 |
| MMLU-PRO (5-shot) |
29.24 |