Llama3.1-SuperDeepFuse-CrashCourse12K

Llama3.1-SuperDeepFuse-CrashCourse12K is an 8B parameter language model based on Llama3.1-SuperDeepFuse and further fine-tuned on agentlans/crash-course.

Model Details

Base Model: Llama3.1-SuperDeepFuse (8B parameters)
Fine-tuning Dataset: 12 000 samples from agentlans/crash-course (containing samples from 10 high-quality instruct datasets)
Model Type: Instruction-tuned language model
Language(s): Multilingual
License: Follows standard Llama 3.1 usage terms

Method: LoRA (Low-Rank Adaptation)
Optimizer: AdamW
Learning Rate: 5e-5
Batch Size: 2 per device
Gradient Accumulation Steps: 8
Training Epochs: 1
Max Sequence Length: 2048
LoRA Configuration:
- Rank: 8
- Alpha: 16
- Dropout: 0.5
- Target: all layers
Quantization: 4-bit (bitsandbytes)
Precision: BF16
Other Techniques: NEFTune (noise alpha: 5), RS-LoRA

This model potentially offers:

However:

For the original model, see agentlans/Llama3.1-SuperDeepFuse
For the base Llama 3.1 model, including training data and model architecture, refer to the original Llama 3.1 model card.

Detailed results can be found here! Summarized results can be found here!

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

(2)

this model

Quantizations