🎵 Sharing My Passion Project: SESA Audio Separation! 🎙️
Hey everyone! I’m thrilled to share a project I’ve been working on solo: SESA Audio Separation. This tool separates audio into vocals, instrumentals, drums, bass, and more, with quality enhancement features. I don’t know coding, so I built this with AI help. I’d love your feedback! 😊
What’s SESA Audio Separation About?
I made this to make audio separation easy for music production or audio processing fans. Despite no coding skills, AI helped me add features to experiment with audio creatively.
- Audio Separation: Split audio into vocals, instrumentals, drums, bass, etc., using models like Vocal Models.
- At the same time, a phase fixer is also available, allowing you to obtain audio with less residual noise.
- all newly made models are available (some of the older models are also available
- Apollo Quality Enhancement: Boost audio with MP3 Enhancer, Lew Vocal Enhancer, or Apollo Universal Model (Normal or Mid/Side methods).
- Auto & Manual Ensemble: Combine model outputs for better results using average, median, min, or max methods.
- Google Drive Integration: Copy files to Google Drive for sharing.
- User-Friendly Interface: Built with Gradio, featuring progress bars and logs.
- Output Formats: Export in WAV, FLAC, MP3, and more.
Tools & Tech (With AI Help)
I relied on AI since I don’t code. Here’s what was used:
- Languages & Libraries: Python, Gradio (UI), Librosa, Soundfile (audio processing), PyTorch (model inference).
- Other Tools: Apollo for enhancement, subprocess for scripts.
- Features: Multi-language support with I18nAuto, progress tracking with tqdm.
Colab: https://colab.research.google.com/drive/1CH2JWd6YculmKSug9zpzuxvM6mSYysdB?usp=sharing
Note: Auto ensemble might have a few bugs, but I’m doing my best to fix them.