The only way to train an AI voice model is to have lots of samples. As scummy as they are, miscrosoft is not selling your voice recordings with the info required to link them to a specific person
Not true anymore. You can create a reasonable voice clone with like 30 seconds of audio now (11labs for example doesn’t do any kind of authentication). The results are good enough for this kind of thing, especially in a lower bandwidth situation like a phone call.
True for creating voices at all, but that work has already been done.
Now we’re just taking these large AI’s trained to mimic voices and giving them a 30 second audio clip to tell them what to mimic. It can be done quickly and give convincing results especially when hidden by the phonecall quality.
The only way to train an AI voice model is to have lots of samples. As scummy as they are, miscrosoft is not selling your voice recordings with the info required to link them to a specific person
Not true anymore. You can create a reasonable voice clone with like 30 seconds of audio now (11labs for example doesn’t do any kind of authentication). The results are good enough for this kind of thing, especially in a lower bandwidth situation like a phone call.
Or recordings made during customer service calls, maybe a disgruntled employee decides to repurpose the data.
True for creating voices at all, but that work has already been done.
Now we’re just taking these large AI’s trained to mimic voices and giving them a 30 second audio clip to tell them what to mimic. It can be done quickly and give convincing results especially when hidden by the phonecall quality.