#🎵|stable-audio

1 messages · Page 4 of 1

cobalt surge
#

this doesn't tell me anything about the format you were feeding it a couple days ago and what you're using now.

somber cradle
#

Then it gets cut off at 30 seconds. It worked before; now I don't know why it is giving this error.

#

I download the video, go to the audio option, select it and that's it, the audio goes to normal. But when I use the prompt on the audio of the video I downloaded, it gives this error

cobalt surge
#

mp3, wav, mp4, aiff ? ogg, flac, mod, midi, bink ?

somber cradle
#

Sometimes this error occurs too

#

Since the audio is only 30 seconds long

#

I use SnapTube to download videos and to cut I use Kinemaster

gaunt orbit
#

maybeeee

#

try 31 secs

somber cradle
gaunt orbit
#

ah oop

somber cradle
#

I don't know if you saw the other error I showed @gaunt orbit

#

The videos I download are in mp4 format

cobalt surge
cobalt surge
#

that's the contact link for stableaudio

somber cradle
#

Could it be a bug on the site?????

#

What do I put on the ticket?

cobalt surge
#

you describe your problem, what you're trying to do, what type of files you're using as input, the error you get, etc

somber cradle
#

The name of the ticket has to include the error that is occurring, right???

cobalt surge
#

a summary of your problem should be fine.

#

something like that

somber cradle
#

I sent it and now what do I do?

#

I put the name of the ticket as the name of the error, I don't know if it works

#

A noreply message appeared here

#

What do I do now? Just wait???

torn hare
#

filmmaker

cobalt surge
somber cradle
#

My theory is that it's a website bug

#

Why wasn't this happening before?

#

It only started happening now so I think it must be a bug on the site.

#

How long does it usually take for them to respond?

somber cradle
#

@cobalt surge A question: when they answer, does the notification appear here on my cell phone?

cobalt surge
lyric elbow
#

what model are you using exactly?

#

anybody mind telling me about the new open source audio models?

brisk wind
brisk wind
cobalt surge
#

he's not talking about Suno but Stable Audio Open Small

brisk wind
lyric elbow
lyric elbow
brisk wind
cobalt surge
#

oh ok, I thought you were asking specifically about the new model. Didn't see the "s" at the end.

brisk wind
lyric elbow
brisk wind
#

I use yue to produce it and you be the judge

#

the lyrics are nsfw

lyric elbow
#

yeah it seems like yue doesn't cut it

brisk wind
somber cradle
#

@cobalt surge Where do they respond?

somber cradle
# brisk wind Suno

Suno is garbage. It doesn't create the style you want, and sometimes it's really bad. I prefer Stable Audio

brisk wind
#

suno 4 or above is good

somber cradle
brisk wind
somber cradle
#

This only happens when I use audio from a game or something non-original.

brisk wind
somber cradle
brisk wind
#

so is suno, but the free version limits the length of it

somber cradle
#

For example, Stable Audio takes the Devil May Cry song and turns it into a Resident Evil style

somber cradle
#

Besides, it creates extra time for the music.

brisk wind
somber cradle
brisk wind
# somber cradle I asked chatgpt if suno could do this and he said no
somber cradle
#

I want something that does a remake of the audio from the video that I sent. Stable Audio does that.

#

Do you understand???

#

Stable Audio is giving that error in the image I showed. This happens when I use audio, whether it is mp3, mp4 or wav

#

I don't know how to fix it, but a guy here told me to send a ticket. I didn't understand the rest, but he told me to wait for them to respond.

cobalt surge
somber cradle
#

Could someone test if Stable audio is still bugged with the audios?

lyric elbow
#

@south dawn possible scam/phishing attempt

somber cradle
#

@cobalt surge They answered me and now what do I do?

cobalt surge
somber cradle
loud garden
loud garden
somber cradle
#

Stable audio will have an update on June 31st

#

Is it to fix that bug I sent to the guys?

somber cradle
#

Has anyone tested whether the audios in stable audio are working???

supple eagle
#

I was trying to get a horse whinny or neigh from stable-audio-open or stable-audio-open-small, and I can't seem to get anything besides weird human-sounding grunts. I'm wondering if there is a more specific prompt I ought to be using, or if this is just out of scope of what either of these models can do

#

any tips on how to use these for SFX/foley audio

somber cradle
#

I wanted a website where you send your audio and make a remix using a prompt, like Stable Audio

#

Stable audio can do this but it has this bug and cannot use audios

somber cradle
#

Stop bots

somber cradle
somber cradle
#

Has anyone tested if that audio bug has been fixed???

merry hound
merry hound
low fjord
#

passed this on

merry hound
winged raptor
somber cradle
somber cradle
#

Did you mean it was fixed???

somber cradle
#

Does anyone have the paid Stable Audio? With 3 minute audios, if so could you make some songs for me????

winged raptor
somber cradle
#

Does anyone have paid Stable audio?

#

The reason I want someone to make a song from the audio of my full song is that the paid mode allows up to 3 minutes

somber cradle
#

Could someone help me with this that I asked

somber cradle
#

I wish someone would make some music for me

#

But since I don't have the paid version, I only have a 30 second limit.

#

I would like someone to help me make a remix of the audios that are longer than 30 seconds. If anyone can help me, thank you

worldly copper
#

Hi, does anyone have a HeyGen AI membership, or know how to do AI voice dubbing? Contact me

hallow scaffold
#

Hey! 👋
Working on OBSIDIAN-Neural, a VST plugin that does real-time AI audio generation for live performance.
Basically: type "dark techno kick" → AI generates directly in your DAW while you're jamming.
Current stack:

Stable Audio Open (local + server)
VST3 interface
8-track sampler with tempo sync
MIDI triggering

GitHub: https://github.com/innermost47/ai-dj
Demo video: https://www.youtube.com/watch?v=l4KMC5adxVA
Anyone else working on similar stuff? Curious to see what you're building here!

GitHub

🎧 AI-powered VST plugin for real-time music generation using LLM contextual prompts and Stable Audio Open - innermost47/ai-dj

I created the world's first AI jam partner VST!
Watch me demonstrate OBSIDIAN Neural Sound Engine - the VST plugin I developed that transforms AI into your jam partner, not your replacement.

🎯 What you'll see in this video:

  • My AI music plugin generating samples in real-time while I play live instruments
  • LLM-driven prompts from the syste...
somber cradle
#

Will no one help me???

quiet wolf
#

👍

somber cradle
#

I don't think anyone will help me

somber cradle
#

Is there anyone please who has paid Stable audio???

austere fossil
#

Does Comfy UI have UI for Stable audio?

Any recommended open source UI for Stable audio?

quiet wolf
#

amazing

somber cradle
#

Nobody wants to help me 😭😭😭

#

I'm looking for anyone who has paid Stable Audio to make some songs for me but they're ignoring me.

winged raptor
somber cradle
winged raptor
somber cradle
#

The audios I send are cut at 30 seconds

winged raptor
somber cradle
winged raptor
#

yep, you can send audios up to 190 seconds

#

I see now that this is kind of a new thing; before, it was limited to 30 sec.

somber cradle
#

I wish someone had the paid mode to make some music for me

#

I wanted to do some remixes of some audios that I have, but as they are more than 30 seconds long it's not possible. If I keep cutting it into parts it will be horrible.
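To make the cutting problem concrete, here is a minimal sketch of where a longer clip would have to be split to fit under a 30-second limit. The 75-second duration is a made-up example and `segment_bounds` is a hypothetical helper, not part of any Stable Audio tool.

```python
def segment_bounds(total_seconds: float, limit: float = 30.0) -> list[tuple[float, float]]:
    """Return (start, end) times, in seconds, for consecutive chunks no longer than `limit`."""
    bounds = []
    start = 0.0
    while start < total_seconds:
        end = min(start + limit, total_seconds)
        bounds.append((start, end))
        start = end
    return bounds


# Hypothetical 75-second clip: two full 30-second chunks plus a 15-second tail.
for start, end in segment_bounds(75):
    print(f"{start:.0f}s to {end:.0f}s")
```

Each boundary is a hard cut, which is why remixing the parts separately and joining them afterwards tends to sound disjointed.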

#

190 Seconds is 3 minutes???

winged raptor
#

190 sec is 3 minutes + 10 seconds
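The conversion above can be checked with a couple of lines of Python. This is only a small sketch, and `to_minutes_seconds` is a name chosen here for illustration.

```python
def to_minutes_seconds(total_seconds: int) -> tuple[int, int]:
    """Split a duration in seconds into whole minutes and leftover seconds."""
    return divmod(total_seconds, 60)


minutes, seconds = to_minutes_seconds(190)
print(f"{minutes} min {seconds} s")  # prints "3 min 10 s"
```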

wicked prairie
#

👍

quiet wolf
#

I love

wicked prairie
#

ohoh

blazing vault
#

fr

dusk oyster
#

stable audio open 1.0 is really cool. any source or documentation where it is described what this stuff does ?

shut saddle
#

You're more likely to get detailed assistance on the Harmonai discord.

daring surge
#

A cow is grazing on the grassland, and from time to time it lowers its head to look at the crowd in the distance

blissful mauve
brisk wind
#

A folk-inspired ballad echoing the story of a young pilot torn from Oklahoma skies into the heart of distant wars. Set against scenes from Area 88 episode “Fire Ball – Contrails of Destiny,” this song follows the haunting journey from conscript to mercenary, driven by duty, regret, and fading memories of home.

Through gentle piano, banjo,...

main osprey
#

hello

somber cradle
#

Are you guys getting a new audio error in Stable Audio???

somber cradle
#

I'm getting this error and I can't get into Stable Audio.

mental drift
#

Greetings! I'm wondering if any of the devs out there know whether it's possible to either multi-select and download, or be given access to your generated files through some other means. I do a lot of multi-prompting and prompt refining: generating 4 variations multiple times with very similar prompts. This causes a stacking effect, and as a user it's easy to miss some or redownload the same file (also compounded by overwrite protection / auto-indexing of the filenames). Any information you have would be greatly appreciated. I love trying to get AudioSparkz 2 to generate the least through-composed things 🙂
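This does not answer the multi-select question, but the redownload problem can be handled locally after the fact. A minimal sketch, assuming the generated files have already been saved into one folder: group files by a SHA-256 hash of their contents, so byte-identical downloads show up together even when auto-indexing gave them different names. `find_duplicates` and the folder name in the comment are hypothetical.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def find_duplicates(folder: str) -> list[list[Path]]:
    """Group files in `folder` whose contents are byte-for-byte identical."""
    by_digest: dict[str, list[Path]] = defaultdict(list)
    for path in sorted(Path(folder).iterdir()):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            by_digest[digest].append(path)
    # Keep only the groups that contain more than one file.
    return [paths for paths in by_digest.values() if len(paths) > 1]


# Example usage (the folder name is an assumption):
#     for group in find_duplicates("downloads"):
#         print([p.name for p in group])
```

This only catches exact copies of the same file. Two different variations generated from the same prompt will hash differently and are left alone.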

cedar crystal
#

Hi everyone! 👋 A group of musicians and industry experts are currently collaborating to make an online course on how you can earn a 6-figure income through AI music (navigating the watermarking issue, promotion strategies, and every other caveat).

The course is on track to go live a few months from now. In the meantime, feel free to sign up for our email list at https://aimusiccreatorsacademy.com/ and interact with our pages:

vast prairie
cedar crystal
bright dragon
#

Hi friends, do you know of any open-source AI that creates sound effects? I create my videos with WAN 2.2, but I need to add sound. Open-source is preferable because it shouldn't be censored.

pearl flint
#

Any tools I should know about aside from the stock code in the official stable-audio-tools repo before I dive into training stable-audio-open and/or the small model?

cedar crystal
# cedar crystal
Poll: What’s the biggest thing stopping you from making a living wage through AI music?

Winning answer: option 3, "Other." (5 of 5 votes)

tidal lava
#

@low fjord

harsh coral
dry furnace
#

Where tf are the mods? Remove this shit

crystal stream
runic orchid
#

Hi, I've created a vibe data and AI engineering tool, and I would love for others to test it. Anyone in?

plush robin
#

Hey everyone!

Wanted to share a project I've been building that might interest folks here who are into generative AI.
Obsidian Neural is an open-source VST3 plugin that brings Stable Audio Open directly into music DAWs for real-time generation.

What it does:
Instead of generating static audio files, it acts as a live instrument:

Type a prompt → AI generates a loop
Trigger it via MIDI like a sampler
8-track system with instant switching between variations
Everything syncs to your DAW's tempo

Think of it as Stable Audio Open meets live performance.
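For readers wondering what syncing a loop to the DAW's tempo implies, the length of a loop follows directly from the tempo. The sketch below is generic tempo arithmetic, not code from the plugin, and it assumes 4 beats per bar unless told otherwise.

```python
def loop_seconds(bpm: float, bars: int, beats_per_bar: int = 4) -> float:
    """Duration in seconds of a loop of `bars` bars at tempo `bpm`."""
    total_beats = bars * beats_per_bar
    return total_beats * 60.0 / bpm


print(loop_seconds(120, 4))  # prints 8.0
```

A generated sample has to be trimmed or stretched to this duration for it to repeat cleanly against the host tempo.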

Why it's relevant here:

Built on Stability AI's Stable Audio Open model
Explores real-time inference for creative workflows
Open source (AGPL v3.0) — all code is public
LLM-powered contextual prompts for better generation

Similar vibe to what Stable Diffusion did for images, but for musicians who want to play with AI rather than just generate finished outputs.

Links:

GitHub: https://github.com/innermost47/ai-dj
Website: https://obsidian-neural.com
Demo videos: https://www.youtube.com/playlist?list=PL9PCUNVx6wp8gMdbo59a1k3M7sdSPmo5A

GitHub

The sampler that dreams. AI-powered VST3 for real-time music generation. Generate tempo-synced loops, trigger via MIDI, sculpt the unexpected. 8-track sampler meets infinite sound engine. No pre-ma...

First VST for real-time AI music generation. 8-track sampler, LLM brain, live performance ready.

red locust
#

Stop the channel

red locust
#

What problems have you encountered

wide breach
faint perch
#

talented

faint perch
narrow galleon
plush parrot
plush robin
#

macOS Update!
OBSIDIAN Neural - AI-powered VST3 for real-time music generation with text prompts & MIDI control.
Now signed & notarized by Apple - no more security warnings!
Perfect for live performance, jamming & improv. Generate unique audio loops on-the-fly and layer them with your gear.
https://obsidian-neural.com
https://github.com/innermost47/ai-dj

lyric radish
#

hey @all, thank you for your great work!! I want to donate a large amount of ethically sourced audio data files (40+ million files, plus the tools to algorithmically increase the amount available), covering all types of HQ audio assets (songs, loops, instruments, compositions, etc.), for AI training purposes to the stable diffusion project. Just DM me

plush robin
#

I added a "Draw-to-Audio" feature to my AI music generation VST - sketch your sound instead of typing prompts

So I've been working on OBSIDIAN Neural, an open-source VST3 for AI music generation focused on live performance, and just added something weird: a canvas where you can draw what you want to hear.
How it works:

Draw on the canvas (lines, shapes, whatever)
Vision LLM interprets your drawing
Translates it into audio generation prompts
~10-20 seconds later, you've got a sample

Examples:

Chaotic scribbles → distorted aggressive rhythms
Smooth flowing curves → ambient pads
Sharp geometric shapes → structured sequences

It's not meant to replace traditional prompting, but gives you another creative input during composition/live sessions - especially useful when you're in the flow and don't want to stop to type.
The whole project is open source, presented at AES AIMLA 2025 in London. Built for musicians who want AI as an instrument, not a songwriting robot.
Links:

GitHub: https://github.com/innermost47/ai-dj
Website: https://obsidian-neural.com

Would love feedback from other producers/performers experimenting with AI tools!

somber cradle
#

I don't know if anyone will answer me, but could someone send a message to the Stable Audio developers to fix a bug in version 2.5? In this version, the audio tracks you add are not modified. They don't undergo any changes at all. I've already tried several prompts, but the audio doesn't change. Only the older version, 2.0, works well, but that version is terrible. I already tried messaging them, but I didn't know how to explain the problem properly, so it didn't work.

#

Is anyone else here having this problem too?

wide breach
wide breach
terse dagger
#

A slight modification to the default workflow in the Comfy templates section. Supplying a random number to SuperPrompt produces a random prompt that the audio generator tries to match.

 a group of astronauts in spacesuits stand in front of a glowing orb, their faces etched with determination. The orb is a vibrant red, with intricate patterns etched into its surface. The astronauts wear helmets and helmets, and their faces are filled with excitement and anticipation. The scene is set in a futuristic space station, with glowing orbs and glowing screens.
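The idea of feeding a random number in and getting a prompt out can be illustrated without ComfyUI. The sketch below is not SuperPrompt and not the workflow's actual node graph. It swaps in a plain seeded word picker to show that one seed always maps to one prompt; the word lists and `random_prompt` are invented for this example.

```python
import random

# Hypothetical vocab for illustration only.
MOODS = ["dark", "bright", "dreamy", "tense"]
GENRES = ["ambient", "techno", "orchestral", "lo-fi"]
INSTRUMENTS = ["piano", "synth pads", "strings", "drums"]


def random_prompt(seed: int) -> str:
    """Build a text prompt deterministically from an integer seed."""
    rng = random.Random(seed)  # a private generator, so global random state is untouched
    return f"{rng.choice(MOODS)} {rng.choice(GENRES)} with {rng.choice(INSTRUMENTS)}"


print(random_prompt(42))
```

Reusing a seed reproduces the same prompt, which makes it possible to regenerate a result that turned out well.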
olive cedar
#

hey is there an API for this?

cobalt surge
frozen jay
#

What is currently the best way to do AI audio

#

I'm a bit new to SD - really great tool - looking into audio as well

topaz mortar
#

🚀 LOOKING FOR A CREATIVE PARTNER (AI ANIME SERIES) 🚀
The Vision: I am a 15-year-old creator building a high-quality AI Anime Web Series. The goal is to build a solid portfolio and eventually pitch to major platforms like Netflix/Crunchyroll.
Who I am looking for:
🎨 AI Artist: Someone who can generate consistent Anime characters (Stable Diffusion/Niji).
🎬 Video Editor: Someone who can animate images into cinematic scenes.
🌍 Age: Preferably 15-18 years old (Beginners are welcome!).
The Deal: This is a Long-term Partnership (50/50 Profit Share).
No upfront payment, we grow and earn together as a team.
Let’s build a masterpiece from scratch!
Interested? Send me a DM

lost marten
#

Anyone know of any tools like Suno’s audio cover where you can sing and Suno will give back its own polished rendition, but same cadence?

#

I don’t mean cloning or auto tuning

#

But it gives you back an actual polished rendition (with a different voice )

#

Wondering what the tech is behind this

sharp crow
#

Anyone here know if it would be possible to generate a continuous stream of music, such as soundscapes etc?

#

@low fjord sorry for the ping but here is a scammer, couldn't find a moderator role to ping.

worldly nebula
#

@sharp crow hi

sharp crow
weak depot
#

👋 Hi there!

I train Flux/SDXL LoRAs for your socials and also OnlyFans and Patreon. If you need your own AI influencer, I'll be happy to help you with it.

🔗 Portfolio & custom LoRA with stable face:

https://www.behance.net/gallery/243708697/Stable-AI-Influencer-Private-Flux-Face-LoRA

I create realistic AI influencers and stable AI identities. What I create: AI influencers for Instagram, TikTok, and X (Twitter); digital personas for Patreon and OnlyFans; long-term AI characters for branding and monetization; realistic AI faces for lifes...

wet quarry
#

I am trying to understand how I can input a particular sample, and then make its output sound more like sound fx, than music - if you know what I mean. Sort of musique concrete-ish. Are there prompts that can make my input sound like "tumbling pieces of cardboard" or just "metal", as in the object/material "metal"?

carmine obsidian
sharp crow
#

?

sharp crow
#

why do you keep pinging me?

paper kernel
plush robin
orchid bobcat
#

Hello

#

So what are people using these days for AI music/remixes? I never used one yet, only image generation

#

anyone can point me out toward something not too hard to use?

frigid dune
#

AI ADS

carmine obsidian
orchid bobcat
#

asking the same thing as last time

#

What are people using these days for AI music/remixes? I never used one yet, only image generation
anyone can point me out toward something not too hard to use?

orchid bobcat
#

oh i forgot to say, i want something local and free like stable diffusion webuis and so on, if possible

sharp crow
#

through ComfyUI you could; if you start from an audio workflow there shouldn't be much hassle.

orchid bobcat
# pallid whale Try ACE-Step v1.5.

thanks, but man I feel so dumb when reading this GitHub install stuff... sometimes you get the easy thing, sometimes they say "clone shit", copy that, without saying where or how, lol

orchid bobcat
#

managed to install it, it works, but it all sounds pretty bad. I'm trying to remix something, but it never follows the music style I'm telling it to, even if I lower the strength of the original and so on
do you have experience using it @pallid whale ?

pallid whale
#

I can post one of my samples, but I'm not sure if I'm allowed to.

#

I guess everyone else is.

orchid bobcat
#

i can't really talk much right now, but if you're okay with it, you could help me through DMs later

pallid whale
pallid whale
surreal jolt
#

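#✨|sdxl Generate an image for me: a 28-year-old woman with a curvy figure, a full bust, and a pronounced waist-to-hip ratio. She wears a fitted white short-sleeved T-shirt and dark blue high-waisted skinny jeans, with white sneakers on her feet. She stands at one end of a crosswalk on a city street waiting for the traffic light, with traffic and buildings behind her. Her hands hang naturally, a small black bag is slung over one shoulder, her body is turned slightly side-on to the camera, and her head is turned toward the red light across the road. Her expression is quiet and natural, her gaze unfocused as she waits, her lips naturally closed, and she gives off the relaxed feel of an everyday outing.

Photography specs: 8K ultra-high-definition quality, shot on a Fujifilm GFX100 II medium-format camera with an 80mm f/1.8 lens, at f/2.8, shutter speed 1/800 s, ISO 200. Uses natural light at four in the afternoon, soft with a slightly warm tone. Captures her natural look as she gazes ahead, zoned out, while waiting for the red light; her eyes are calm but not vacant, the texture of the white T-shirt and the denim is crisp, and the background is blurred traffic and city buildings with a sense of motion blur.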

stuck dirge
plush robin
#

Anyone with an RTX 3070+ collecting dust?

OBSIDIAN Neural is building a decentralized GPU network for real-time AI music generation — no big cloud, no middleman. Providers get 85% of subscription revenue split equally each month via Stripe.

The infra is live. The network needs builders.

https://github.com/innermost47/obsidian-neural-provider

GitHub

GPU inference server for the OBSIDIAN Neural distributed network. Contribute your GPU, generate music in real time, share the revenue equally. - innermost47/obsidian-neural-provider

carmine obsidian
ancient echo
carmine obsidian
carmine obsidian
carmine obsidian
#

It's meant to be a Disney-like parody about a ridiculous comic adventure between 2 pokemons 🤣

knotty ravine
oblique patrol
knotty ravine
wide breach