Meta Platforms' AI lab unveiled Voicebox last week, a machine learning model that converts text into speech. Unlike previous text-to-speech models, Voicebox can also edit audio, remove noise, and transfer styles, tasks for which it was not specifically trained.
Researchers at Meta trained the model using their own methodology. Initial results are encouraging and could power many future applications, but Meta has not released Voicebox publicly, citing ethical concerns about potential misuse.
What We Mean By Flow Matching
It is possible to synthesise speech in English, French,...