#Pronunciation issues
1 messages · Page 1 of 1 (latest)
If it mispronounce words, is because the audio used for the dataset was noisy or you straight up used a short dataset.
The dataset was an hour long and I cleaned it with RX beforehand.
if one hour long length, bad audio inference is the only problem
use clean audio for it
But what if he didn't delete the noise properly?
On the dataset.
noise is fine, as long as its barely audible, the important is clear pronunciation of words
if talking about the dataset if noise isn't treated well, it doesn't destroy the pronunciations, those are just noise, unless they are random bleeds, not mic noise
overlapping sfx and such
also could be model and inference audio is incompatible, use only an audio with the same language or accent from the model or dataset
Ayo? @serene wadi level 15 !!! 