MrBERT
BSC-LT/MrBERT • Fill-Mask • 0.3B • Updated Mar 26 • 969 • 8
BSC-LT/MrBERT-es • Fill-Mask • 0.2B • Updated Apr 9 • 4.51k • 7
BSC-LT/MrBERT-ca • Fill-Mask • 0.1B • Updated Apr 21 • 66 • 2
BSC-LT/MrBERT-legal • Fill-Mask • 0.3B • Updated Apr 9 • 199 • 1

Salamandra 🦎
BSC-LT/salamandra-7b-instruct • Text Generation • 8B • Updated Oct 22, 2025 • 98.6k • 79
BSC-LT/salamandra-7b • Text Generation • 8B • Updated Oct 22, 2025 • 911 • 29
BSC-LT/salamandra-2b-instruct • Text Generation • 2B • Updated Oct 22, 2025 • 3.12k • 28
BSC-LT/salamandra-2b • Text Generation • 2B • Updated Oct 22, 2025 • 2.05k • 25

Speech models
Models developed by the speech team of the Language Technologies unit
BSC-LT/wavenext-encodec • Updated Sep 12, 2024 • 4
BSC-LT/wavenext-mel • Updated Sep 10, 2024 • 8 • 3
BSC-LT/vocos-mel-22khz • Updated Aug 27, 2024 • 863 • 7
BSC-LT/whisper-large-v3-ca-punctuated-3370h • Automatic Speech Recognition • Updated Oct 28, 2025 • 125 • 1

ALIA
BSC-LT/ALIA-40b-instruct-2601 • Text Generation • 40B • Updated about 10 hours ago • 14.3k • 12
BSC-LT/ALIA-40b-instruct-2601-GGUF • Text Generation • 40B • Updated Feb 20 • 136 • 4

MT Datasets
BSC-LT/Legal_Catalan_Spanish_Parallel_Corpus • Updated 9 days ago • 26
BSC-LT/MULTI_corpus • Viewer • Updated 8 days ago • 468k • 36
BSC-LT/geneval_catalan • Viewer • Updated Apr 9 • 5.25k • 387
BSC-LT/NTEU_Multilingual_Evaluation_Dataset • Updated Nov 4, 2025 • 85 • 1

Speech datasets
Datasets curated by the speech team of the Language Technologies unit
BSC-LT/CAESAR-TINY • Viewer • Updated Apr 7, 2025 • 667 • 27
BSC-LT/CAESAR-TV3 • Viewer • Updated Apr 7, 2025 • 2.96k • 68 • 1
BSC-LT/BSCs_Code_Switching_CA-ES_ASR_Test • Updated Nov 22, 2025 • 18
BSC-LT/distilled-yodas-spanish • Updated Dec 15, 2025 • 150 • 3