VALL-E: AI can now imitate any human voice in just 3-seconds.
oknoob.substack.com
Microsoft has made a major advance in the realm of artificial intelligence (AI) with the release of VALL-E, an AI system that can correctly mimic any person's voice. Unlike standard text-to-speech models that employ waveforms, VALL-E takes a three-second sample of someone's voice, divides it into tokens, and uses these tokens to generate new sounds depending on the rules it has learned. This means that the AI system can detect and imitate characteristics in a person's voice, such as tone, pitch, and speaking style.
VALL-E: AI can now imitate any human voice in just 3-seconds.
VALL-E: AI can now imitate any human voice in…
VALL-E: AI can now imitate any human voice in just 3-seconds.
Microsoft has made a major advance in the realm of artificial intelligence (AI) with the release of VALL-E, an AI system that can correctly mimic any person's voice. Unlike standard text-to-speech models that employ waveforms, VALL-E takes a three-second sample of someone's voice, divides it into tokens, and uses these tokens to generate new sounds depending on the rules it has learned. This means that the AI system can detect and imitate characteristics in a person's voice, such as tone, pitch, and speaking style.