Hey everyone, I have a question about the recommended workflow for preparing vocals for an RVC dataset.
Which order generally gives better results?
A) DeEcho/DeReverb → Lead Vocal Separation
B) Lead Vocal Separation → DeEcho/DeReverb
I want to follow the workflow that experienced RVC model creators usually use.
Additional context:
When I dereverb first, I get a cleaner and better overall tone, but the output keeps some adlib reflections, reverb traces, and echo remnants.
When I separate first, most of those reflections disappear, but the vocal tone becomes more unnatural, less isolated, and somewhat degraded.
If anyone who has trained multiple models can share what’s worked best for them, I’d really appreciate it!

