Running my model without an index produces a better-quality file than running it with the generated feature index file. With the index, the voice sounds more like the pretrained model, which ruins the output. The index also adds a sort of lisp. I may be missing a key step here.
Here are some examples that show what I'm talking about:
