Leaderboard
Task-agnostic pre-training only (1 single SSL model for all tasks). AER results reported are obtained with the GRU-32 architecture
Results on test sets
ASR ETAPE : Automatic Speech Recognition - WER (%)
ASR CommonVoice : Automatic Speech Recognition - WER (%)
SLU MEDIA : Spoken Langue Understanding - Concept Error Rate - CER (%)
AST mTEDx : Speech to Text Translation - BLEU (for 3 language pairs: fr-en, fr-es and fr-pt)
AER RECOLA Arousal : Automatic Emotion Recognition - Concordance Correlation Coefficient for Arousal
AER RECOLA Valence : Automatic Emotion Recognition - Concordance Correlation Coefficient for Valence
AER AlloSat : Automatic Emotion Recognition - Concordance Correlation Coefficient for Satisfaction