grpo-training

meta-llama/Llama-3.2-1B-Instruct • Text Generation • 1B • Updated Oct 24, 2024 • 7.37M • 1.48k
meta-llama/Llama-3.1-8B • Text Generation • 8B • Updated Oct 16, 2024 • 1.3M • 2.25k
epfl-llm/meditron-7b • Text Generation • 7B • Updated Dec 7, 2023 • 2.72k • 322
medalpaca/medalpaca-7b • Text Generation • 7B • Updated Apr 2, 2024 • 1.34k • 91