Pretraining Datasets Collection This collection provides high-quality, large-scale Romanian pretraining datasets derived from FineWeb-2. • 3 items • Updated 27 days ago
Evaluating the Performance of Large Language Models in Competitive Programming: A Multi-Year, Multi-Grade Analysis Paper • 2409.09054 • Published Aug 31, 2024 • 1
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues Paper • 2404.03820 • Published Apr 4, 2024 • 25