lyric steeple Apr 18, 2023, 5:50 PM

#

Doing some code-switching experiments with the new language-specific prompts

#

Mi hermano is very inteligente, pero sometimes he's un poco terco.

#

The word spacing around he's is a bit choppy in terco_c

lyric steeple Apr 19, 2023, 3:44 PM

#

Predictions are hard to make, you know? Especially about the future.

#

https://tenor.com/view/calculating-puzzled-math-confused-confused-look-gif-14677181

Tenor

tardy topaz Apr 20, 2023, 1:38 AM

#

WE HAVE ACHIEVED KNOCK KNOCK GENERATION. Last text prompt is "Suno who?" The research continues.

#

(It looks like an accident, but I laughed, so I'm counting it.)

#

I know this isn't what the model is for but I LOVE IT SO MUCH

#

There's audio foundation models sure. But why not generate the content too, even MORE foundational!

#

I won't rest until SUNO is generating SVG code

#

I've been randomly throwing wrenches into the sampling to encourage more unprompted content, sort of vaguely beam search-y

#

but with no theoretical grounding and my poor technical skills, mostly trial and error lol

lyric steeple Apr 20, 2023, 3:59 AM

#

@tardy topaz, you inspired me to try "Why was six afraid of seven?"

#

And here are some fun continuations - grasping for punchlines 😆

tardy topaz Apr 20, 2023, 4:15 AM

#

It absolutely kills me, I get the same thing. It's like you called on a student in class who wasn't paying attention, or the teleprompter died, and they speak is stalling.

blissful pulsar Apr 20, 2023, 7:32 PM

#

Has someone tried [cackling]? 😆

dire zealot Apr 20, 2023, 7:34 PM

#

You got any samples?

blissful pulsar Apr 20, 2023, 7:35 PM

#

Unfortunately not

#

https://old.reddit.com/r/ContagiousLaughter/top/?sort=top&t=all may be a good source

r/ContagiousLaughter

dire zealot Apr 20, 2023, 7:41 PM

#

I meant of generations resulting from using [cackling] lol

fast girder Apr 20, 2023, 7:44 PM

#

Here's a pretty funny one (took maybe 10 tries to get this one)

brave pawn Apr 20, 2023, 7:46 PM

#

cursed

blissful pulsar Apr 20, 2023, 8:44 PM

#

Maybe it will be able to do all if these some day 🙂

deft grotto Apr 21, 2023, 1:50 AM

#

this thing is amazing , love it 😆 👍

azure tangle Apr 21, 2023, 2:20 AM

#

hello peeps

#

anyone using this locally

deft grotto Apr 21, 2023, 2:25 AM

#

azure tangle Apr 21, 2023, 2:27 AM

#

dang pretty good @deft grotto you running locally?

deft grotto Apr 21, 2023, 2:43 AM

#

azure tangle dang pretty good <@1085647350531371138> you running locally?

no am trying to do that now, here is the link is the https://github.com/JonathanFly/bark

GitHub

GitHub - JonathanFly/bark: 🔊 Text-prompted Generative Audio Model

🔊 Text-prompted Generative Audio Model. Contribute to JonathanFly/bark development by creating an account on GitHub.

azure tangle Apr 21, 2023, 2:44 AM

#

ah thank you. im diggin into it now ill let you know if get it working

tardy topaz Apr 21, 2023, 2:47 AM

#

If you only git the bark repo, make sure you do a 'pip install .' while in the main /bark directory. That's the only install, though you need to setup cuda and all that stuff if you haven't

zinc pecan Apr 21, 2023, 2:50 AM

#

Oh wait suno can't clone?

#

its just a tts with supplied voices?

#

hope it gets updated to support cloning, i'll just stick with so-vits-svc

azure tangle Apr 21, 2023, 2:53 AM

#

i been fucking with so-vits-svc, works pretty good. i doesnt have text 2 voice right. its just cloning?

zinc pecan Apr 21, 2023, 2:53 AM

#

yeah voice2voice

proud yacht Apr 21, 2023, 2:54 AM

#

Someone know how to clone my voice ?

zinc pecan Apr 21, 2023, 2:54 AM

#

I dont see why they'd gimp this so hard- all it'll take is another company to release it without synthetic restrictions

zinc pecan Apr 21, 2023, 3:28 AM

#

I suppose we could just pipe the output of bark to so-vits-svc with a trained voice model so you get the unique intonations

tardy topaz Apr 21, 2023, 3:35 AM

#

few nonsense kpop experiments

#

pastel berry Apr 21, 2023, 3:53 AM

#

zinc pecan I suppose we could just pipe the output of bark to so-vits-svc with a trained vo...

THis is exactly what I was going to do

#

With a final pass through https://podcast.adobe.com/enhance

Enhance Speech from Adobe | Free AI filter for cleaning up spoken a...

feral oriole Apr 21, 2023, 7:50 AM

#

Suno AI + Koe AI

gray latch Apr 21, 2023, 7:50 AM

#

what's Koe AI?

high river Apr 21, 2023, 9:15 AM

#

deft grotto

How did you clone the Jay-Z voice, i there a working tutorial anywhere?

olive wave Apr 21, 2023, 12:49 PM

#

I’ll have to see if this is better than Voice.ai

jolly kindle Apr 21, 2023, 1:30 PM

#

hi

#

#🐣┃suno-showcase hi ,my name is John

#

#🐣┃suno-showcase hi,could you please tell me how to use it ?

zinc pecan Apr 21, 2023, 1:58 PM

#

you have to install it and its subscription, no thanks

obtuse sparrow Apr 21, 2023, 2:10 PM

#

Got something completely random as my first attempt. Not related to the prompt at all. Still very much clear though and not nonsense for the most part:

What I hear:

Man1: "Hmm... Heeheehee, heehee yeah, he may be uh, I guess something different"
Woman2: "*soft gasp* That explai-"```

night quiver Apr 21, 2023, 2:14 PM

#

😆

#

What was your prompt text?

#

I got something similar too which really surprised me

obtuse sparrow Apr 21, 2023, 2:16 PM

#

Hmm, not sure if I can share. As it had at lest 1 curse word. Which I'm not sure is permitted here and/or auto blocked by the bot. It was also quite long.

#

But basically it was a back and forth between a female and male speaker about recording voice lines and about the oddness of the location in question.

#

I think the long prompt may have been a cause of the weirdness, and maybe too many hesitation commands, and a [throat clear] early on.

ebon widget Apr 21, 2023, 2:30 PM

#

yeah long prompt sometimes lead to the model just going completely off the rails. and i also noticed that tags early on sometimes create issues (i think has to do with our data prep)

obtuse sparrow Apr 21, 2023, 2:31 PM

#

It's too bad it's only (currently) capable of generating around 13 seconds of audio. I would've loved to have heard more of what these mysterious AI ghosts were talking about. It sounded quite interesting 7derpylaugh

ebon widget Apr 21, 2023, 2:32 PM

#

hehe, yeah in our internal generations it's super fun to listen to minutes of fully generated stuff without text prompts. goes anywhere from sermons to music to arguments 😂

#

we'll get to release stuff like that at least in the studio soon hopefully. just have to iron out some scaling kinks so it doesn't topple over when people try it

obtuse sparrow Apr 21, 2023, 2:36 PM

#

I used to spend a lot of time generating Stable Diffusion using blank prompts, so it'll be fun to do something similar with with sound rather than images. In fact, might even be fun to listen to in the background when doing visual tasks. Can't wait to hear those kinda things.

#

Is there currently a way I'd be able to rig the colab to continually run a prompt by outputting audio files one after the other upon completion? It'd be tremendously slow, but it'd at least achieve that kinda thing and may help with long prompts.

ebon widget Apr 21, 2023, 4:35 PM

#

Hm not too too familiar with collab and when it kicks you off but probably can just write a loop and save it some place, maybe even upload to drive or something. They also allow you to upgrade gpu for faster inference etc

obtuse sparrow Apr 21, 2023, 5:07 PM

#

ebon widget Hm not too too familiar with collab and when it kicks you off but probably can j...

I have Colab Pro, so I don't have to worry about that stuff for the most part, thankfully.

ebon widget Apr 21, 2023, 5:15 PM

#

Awesome!

obtuse sparrow Apr 21, 2023, 5:39 PM

#

AAAAAAA, my ears 6lunascream

#

[WARNING] Severely high volume, you have been warned...

fast girder Apr 21, 2023, 5:46 PM

#

Hmm 😦 this is one of the failure modes we've been hearing for "harder" prompts.. we are thinking about how to prevent this more generally

autumn cloud Apr 21, 2023, 6:38 PM

#

Creepy pasta

tardy topaz Apr 21, 2023, 7:18 PM

#

He sounds so earnest

#

radiant fog Apr 21, 2023, 7:48 PM

#

obtuse sparrow **[WARNING]** Severely high volume, you have been warned...

prompt

obtuse sparrow Apr 21, 2023, 7:50 PM

#

I don't remember specifically. Generated a ton since then.

olive tide Apr 21, 2023, 8:27 PM

#

dull hemlock Apr 21, 2023, 9:07 PM

#

no laughs and shhhh at the end that not in the text prompt

#

text_prompt = """
     Hello, my name is Salah. And, uh — and I like pizza. [laughs] 
     But I also have other interests such as playing tic tac toe.
"""

slate summit Apr 21, 2023, 9:23 PM

#

i got similar artefacts

#

'hallucinations' i assume

#

started out good, but decayed

fossil stratus Apr 21, 2023, 11:18 PM

#

WOMAN: Yabadaba doo! I like Tick Tock Clocks.
Result: ??? Wat

slate summit Apr 22, 2023, 12:17 AM

#

lol

tardy topaz Apr 22, 2023, 1:21 AM

#

I have now listened to more than 200 "Why was 6 afraid of 7 completions", and not a single actual joke yet. KNOCK KNOCK and "Why did the chicken cross the road" had a lot higher hit rate. They are hilarious though.

zinc pecan Apr 22, 2023, 3:24 AM

#

lol

#

definitely trained on podcasts and television interviews

proud yacht Apr 22, 2023, 4:08 AM

#

native sorrel Apr 22, 2023, 9:41 AM

#

lol

viscid sable Apr 22, 2023, 11:45 AM

#

It is interesting~
we could use it to generate rap style

proud yacht Apr 22, 2023, 11:46 AM

#

Nice, are you using a custom voice ?

viscid sable Apr 22, 2023, 11:55 AM

#

proud yacht Nice, are you using a custom voice ?

It's my prompt, I use [rap] to include the lyrics
text_prompt = """
[rap]
You pray for my demons, girl, I got you [music]
Every time I sip on codeine, I get vulnerable
I'm knowin' the sounds of the storm when it come [music]
She understand I can't take her everywhere a nigga going
I been in the field like the children of the corn[rap]
"""

I think we could use [rap] or other song style to make the generated singing like

tight pawn Apr 22, 2023, 4:33 PM

#

✅

chrome tapir Apr 22, 2023, 9:37 PM

#

viscid sable It's my prompt, I use `[rap]` to include the lyrics text_prompt = """ [rap] You...

yo that bassline haha

chrome tapir Apr 22, 2023, 9:40 PM

#

deft grotto this thing is amazing , love it 😆 👍

whats teh prompt for this?

tardy topaz Apr 22, 2023, 9:41 PM

#

wild crater Apr 22, 2023, 10:06 PM

#

oh i think i should put my audio hre

#

this is cool almost

#

pino

#

FINALLY WHAT I NEED

#

christ that tooka white

tardy topaz Apr 22, 2023, 10:14 PM

#

What's the prompt strategy your testing? No backets?

wild crater Apr 22, 2023, 10:15 PM

#

yeah

#

brackets dont work

#

for me at least

#

i do this

#

1960's breakbeat solo *

#

with astericks

#

and it seems to worok better

tardy topaz Apr 22, 2023, 10:15 PM

#

Where are the astericks?

wild crater Apr 22, 2023, 10:15 PM

#

tardy topaz Apr 22, 2023, 10:15 PM

#

* 1960's breakbeat solo *

wild crater Apr 22, 2023, 10:16 PM

#

tardy topaz ```* 1960's breakbeat solo *```

hey do you mkae that bar appear like that in discord ive always wondered

tardy topaz Apr 22, 2023, 10:16 PM

#

You hit three backticks

#

before and after

wild crater Apr 22, 2023, 10:16 PM

#

???

#

what are those

chrome tapir Apr 22, 2023, 10:16 PM

#

haha

wild crater Apr 22, 2023, 10:16 PM

#

/// the h ///

#

nope

blissful pulsar Apr 22, 2023, 10:16 PM

#

tardy topaz Apr 22, 2023, 10:16 PM

#

Left of the one key

wild crater Apr 22, 2023, 10:16 PM

#

hi

#

oh ok

wild crater Apr 22, 2023, 10:17 PM

#

wild crater

man repeatetly hits out of tune snare, says "10 seconds" then continues

chrome tapir Apr 22, 2023, 10:18 PM

#

just hit 3 underscoredots bro

tardy topaz Apr 22, 2023, 10:18 PM

#

*1960's drum solo* 7 seconds

wild crater Apr 22, 2023, 10:18 PM

#

this is not painful at all

#

i guess i could use the bass at the beggining

tardy topaz Apr 22, 2023, 10:21 PM

#

Yeah, it works well to build a sample library for sure, where you can just say 'give me 100 tries' and find good ones

wild crater Apr 22, 2023, 10:21 PM

#

yeah

#

i want like an ai generate vst that will give you good sounding instrument samples that sound as you describe them. probably wont be a thing for at least another year or so though

jovial ferry Apr 22, 2023, 10:23 PM

#

So all of these gens sound very robotic and tinny, why is that?

#

Much more so than tortoise for instance.

wild crater Apr 22, 2023, 10:23 PM

#

idk

#

oh christ

wild crater Apr 22, 2023, 10:27 PM

#

wild crater man repeatetly hits out of tune snare, says "10 seconds" then continues

i think its funny how he says "10 seconds" at the 10 seond mark

#

it screams when i dont want it to

slate summit Apr 22, 2023, 10:35 PM

#

wild crater FINALLY WHAT I NEED

that gave me an idea

#

let's see

wild crater Apr 22, 2023, 10:39 PM

#

ayo lemme play some goddamn 1 and 2

#

oh ok

#

better

#

its really having rtouble

#

apparently the yume nikki soundtrack

#

i actually could use this

#

for like a choir

wild crater Apr 22, 2023, 11:08 PM

#

good drumbeat ong

#

vocal apparently

#

electric ppiano

#

cool drums

#

ah yes...

blissful pulsar Apr 22, 2023, 11:41 PM

#

wild crater ah yes...

wild crater Apr 22, 2023, 11:41 PM

#

?

wild crater Apr 22, 2023, 11:43 PM

#

blissful pulsar

tardy topaz Apr 22, 2023, 11:52 PM

#

Not getting what you intended is still al lot of fun:
https://twitter.com/jonathanfly/status/1649923447668047872

Jonathan Fly 👾 (@jonathanfly)

In random mode Bark may decide to interpret a song as a duet, a line from a song as a shout from a snarky audience member, or text not tagged at all as music as music anyway. Makes for fun accidents.

▶ Play video

wild crater Apr 23, 2023, 12:00 AM

#

tardy topaz Not getting what you intended is still al lot of fun: https://twitter.com/jonath...

the song ironic is ironic because nothing in the song is true irony

hidden talon Apr 23, 2023, 1:28 AM

#

[VOLUME WARNING - Screams at the start]
using [rap] in the prompt gave me an insane intro lol, the boom after the scream

quasi pier Apr 23, 2023, 2:50 AM

#

Anyone here figure out how to do voice to voice with this?

zinc pecan Apr 23, 2023, 2:54 AM

#

I dont think you can

quasi pier Apr 23, 2023, 2:56 AM

#

It's possible but have to do a lot of things

blissful pulsar Apr 23, 2023, 6:02 AM

#

ebon widget Apr 23, 2023, 12:22 PM

#

hah, amazing!! 🙂

tranquil ravine Apr 23, 2023, 12:34 PM

#

ebon widget hah, amazing!! 🙂

Thank you for creating this library tipsfedora Very exciting stuff. I hope to follow how it evolves over time!

autumn cloud Apr 23, 2023, 2:19 PM

#

UFO Sightings

onyx oasis Apr 23, 2023, 3:15 PM

#

https://cdn.discordapp.com/attachments/1092377083986063411/1099714772188024873/output.mp4

▶ Play video

wild crater Apr 23, 2023, 3:36 PM

#

good drums

hardy warren Apr 23, 2023, 4:00 PM

#

look it was worth a shot

hardy warren Apr 23, 2023, 5:02 PM

#

the poem test

#

rap about my dog

wild crater Apr 23, 2023, 8:39 PM

#

hardy warren the poem test

damn how do you get it to generate more than 14 seconds

blissful pulsar Apr 23, 2023, 8:50 PM

#

wild crater damn how do you get it to generate more than 14 seconds

https://github.com/JonathanFly/bark

GitHub

GitHub - JonathanFly/bark: 🔊 Power Up The Bark Text-prompted Genera...

🔊 Power Up The Bark Text-prompted Generative Audio Model - GitHub - JonathanFly/bark: 🔊 Power Up The Bark Text-prompted Generative Audio Model

wild crater Apr 23, 2023, 8:51 PM

#

blissful pulsar https://github.com/JonathanFly/bark

oh sweeet

echo void Apr 23, 2023, 8:56 PM

#

wild crater Apr 23, 2023, 9:05 PM

#

blissful pulsar https://github.com/JonathanFly/bark

where do i input these things do i just input it in the text prompt thing

blissful pulsar Apr 23, 2023, 9:07 PM

#

wild crater where do i input these things do i just input it in the text prompt thing

python bark_perform.py --use_smaller_models --text_prompt "abcd"

wild crater Apr 23, 2023, 9:08 PM

#

blissful pulsar python bark_perform.py --use_smaller_models --text_prompt "abcd"

can i do this within the ntoeobok

#

wait fuck

#

it doesnt give you the infinity notebook

blissful pulsar Apr 23, 2023, 9:09 PM

#

if u want audio more than 13 sec's than
python bark_perform.py --use_smaller_models --text_prompt "abcd" --split_by_words 32

blissful pulsar Apr 23, 2023, 9:09 PM

#

wild crater can i do this within the ntoeobok

u can use smaller _models with cpu

wild crater Apr 23, 2023, 9:09 PM

#

nah

#

afs

#

h

hidden talon Apr 23, 2023, 10:35 PM

#

that ending omg why 🤣

drifting quiver Apr 23, 2023, 10:39 PM

#

2 jokes 🙂
This is just an idea, but is it possible to make this model follow instructions, such as asking for a song and the model sings it? Given that the model adds sounds and weird stuff on its own, it should be possible for it to learn to respond, right?

marsh apex Apr 24, 2023, 1:45 AM

#

Someone had to do it sooner or later

warm garnet Apr 24, 2023, 2:17 AM

#

warm garnet Apr 24, 2023, 2:48 AM

#

drifting quiver Apr 24, 2023, 3:18 AM

#

past summit Apr 24, 2023, 3:57 AM

#

drifting quiver

谢特

hardy warren Apr 24, 2023, 6:11 AM

#

...

#

Bark would be perfect for an AI generated soap opera with rediculously melodramatic actors

chrome tapir Apr 24, 2023, 6:47 AM

#

hardy warren rap about my dog

how did u get these results?

hardy warren Apr 24, 2023, 6:52 AM

#

i dont have the prompt saved, but from memory i just put [rap] before the text. that was probably the only good result out of ten though

fading wasp Apr 24, 2023, 6:56 AM

#

hardy warren

it sounds amazing!

vapid coyote Apr 24, 2023, 6:58 AM

#

I need me a Minerva

hardy warren Apr 24, 2023, 7:02 AM

#

and heres a bit of a failure case

hardy warren Apr 24, 2023, 7:21 AM

#

chrome tapir Apr 24, 2023, 9:40 AM

#

chrome tapir Apr 24, 2023, 11:08 AM

#

halfway through this gets good

strong sentinel Apr 24, 2023, 4:48 PM

#

hardy warren and heres a bit of a failure case

how did you get to 48 seconds?

wild crater Apr 24, 2023, 6:26 PM

#

https://github.com/serp-ai/bark-with-voice-clone

GitHub

GitHub - serp-ai/bark-with-voice-clone: 🔊 Text-prompted Generative ...

🔊 Text-prompted Generative Audio Model - With the ability to clone voices - GitHub - serp-ai/bark-with-voice-clone: 🔊 Text-prompted Generative Audio Model - With the ability to clone voices

wild crater Apr 24, 2023, 6:29 PM

#

wild crater https://github.com/serp-ai/bark-with-voice-clone

can someone help me with getting this running?

pale olive Apr 24, 2023, 7:54 PM

#

Just Figured out how to make it accept insanely long files as text input in a google colab,

#

used a quick ex from dune as a test case

#

to make it more seamless I tried separating it into sentences and not word count

chrome tapir Apr 24, 2023, 9:09 PM

#

did you use a history prompt for that @pale olive

pale olive Apr 24, 2023, 9:12 PM

#

chrome tapir did you use a history prompt for that <@879413868366020668>

actually i just updated my own version of it USE THAT history prompt thing parameter thing to add to the "bark infinity" WHERE IT DOES USE THE HISTORY runs perfectly well and I added the addition to the readme trying to figure out how to push the change never done this before lol...

chrome tapir Apr 24, 2023, 9:12 PM

#

haha

autumn cloud Apr 24, 2023, 9:22 PM

#

pale olive

Quite good. Which speaker is this?

pale olive Apr 24, 2023, 9:24 PM

#

1 i think

#

oh XD i mean english 1 speaker

idle iron Apr 24, 2023, 9:43 PM

#

Suggestion for female voices anyone?

pale olive Apr 24, 2023, 9:43 PM

#

so i created a fork with this modification.....not sure what im suppose to do after tbh

pale olive Apr 24, 2023, 9:44 PM

#

idle iron Suggestion for female voices anyone?

Louise Belcher from bobs burger?

idle iron Apr 24, 2023, 9:45 PM

#

no i from the speaker list

#

i mean

pale olive Apr 24, 2023, 9:50 PM

#

oh idk never tried them,

jagged dragon Apr 24, 2023, 9:58 PM

#

idle iron Suggestion for female voices anyone?

timid_jane

idle iron Apr 24, 2023, 9:59 PM

#

thanks man @jagged dragon

cedar saddle Apr 24, 2023, 10:27 PM

#

I tried out a few things and got it to generate this amazing audio XD

chrome tapir Apr 24, 2023, 10:30 PM

#

gotta love when it starts out nice and then suddenly transitions into an ear piercing screech

cedar saddle Apr 24, 2023, 10:31 PM

#

XD

#

[very shocked gasp] [clears throat] [screams] [dies] [bangs hands] [clapping sounds]

keen fable Apr 24, 2023, 10:32 PM

#

anybody willing to give a python noob a hand getting this up and running on vscode on m1

cedar saddle Apr 24, 2023, 10:34 PM

#

im just running it on google colab

keen fable Apr 24, 2023, 10:36 PM

#

id like to get it working locally

cedar saddle Apr 24, 2023, 10:36 PM

#

im currently attempting to do that aswell

keen fable Apr 24, 2023, 10:40 PM

#

yay troubleshoot party 🎉

jolly veldt Apr 24, 2023, 10:40 PM

#

Guys, I'm a layman, how do I run the repository?

keen fable Apr 24, 2023, 10:41 PM

#

aight, 3 strong

jolly veldt Apr 24, 2023, 10:42 PM

#

???

#

#

heeelp

keen fable Apr 24, 2023, 10:46 PM

#

open one of the .ipynb files in vscode

chrome tapir Apr 24, 2023, 10:46 PM

#

start feeding your errors to chatgpt until it works thats what i do

keen fable Apr 24, 2023, 10:46 PM

#

yeah thats what im doing, its not being of much help

cedar saddle Apr 24, 2023, 10:46 PM

#

bur

keen fable Apr 24, 2023, 10:47 PM

#

at this point for all i can tell yall are chatgpt to me

cedar saddle Apr 24, 2023, 10:47 PM

#

bur

#

we are the large language models

keen fable Apr 24, 2023, 10:47 PM

#

and . well .. to some extent.. so am i 😄

#

i mean technically we've all been trained on a load of data and just spit inferences of that out

jolly veldt Apr 24, 2023, 10:48 PM

#

Is there any tutorial teaching how to use it?

cedar saddle Apr 24, 2023, 10:48 PM

#

XD

keen fable Apr 24, 2023, 10:48 PM

#

jolly veldt Is there any tutorial teaching how to use it?

share if u find

chrome tapir Apr 24, 2023, 10:48 PM

#

soon https://twitter.com/_akhaliq/status/1650308865555148800

AK (@_akhaliq)

Scaling Transformer to 1M tokens and beyond with RMT

Recurrent Memory Transformer retains information across up to 2 million tokens.

During inference, the model effectively utilized memory for up to 4,096 segments with a total length of 2,048,000 tokens—significantly exceeding…

Likes

3047

Retweets

766

keen fable Apr 24, 2023, 10:49 PM

#

isn't memory in this context just RAM for training data

chrome tapir Apr 24, 2023, 10:49 PM

#

i thought it was input tokens

keen fable Apr 24, 2023, 10:49 PM

#

yeah yeah

#

im speaking in abstract

#

so it would go : training data > fine tuning > memory tokens

#

which is basically all the same thing iiuc

#

anyways back to bark

#

can someone help me bark

cedar saddle Apr 24, 2023, 10:52 PM

#

attempting

chrome tapir Apr 24, 2023, 10:52 PM

#

lol

cedar saddle Apr 24, 2023, 10:52 PM

#

what

jolly veldt Apr 24, 2023, 10:52 PM

#

Guys, help me here! How do I run the code?

cedar saddle Apr 24, 2023, 10:52 PM

#

its downloading something

chrome tapir Apr 24, 2023, 10:53 PM

#

progress

cedar saddle Apr 24, 2023, 10:53 PM

#

rip not using my gpu tho

chrome tapir Apr 24, 2023, 10:54 PM

#

cedar saddle Apr 24, 2023, 10:54 PM

#

hu

jolly veldt Apr 24, 2023, 10:55 PM

#

Guys, help me here! How do I run the code?

keen fable Apr 24, 2023, 10:55 PM

#

stop spamming

jolly veldt Apr 24, 2023, 10:56 PM

#

help brother

cedar saddle Apr 24, 2023, 10:56 PM

#

I'm just running it through terminal

keen fable Apr 24, 2023, 10:56 PM

#

can u not read that we're all trying to get this to work

#

lol

#

ok yeah the import works on terminal

#

so my vscode setup is bonked

cedar saddle Apr 24, 2023, 10:57 PM

#

i dont even have vscode 💀

chrome tapir Apr 24, 2023, 10:59 PM

#

ill just keep making you guys jealous by posting cool stuff

latent condor Apr 24, 2023, 10:59 PM

#

Pycharm or codium or even just terminal is fine. Gonna try in Colab though. Let's messing around

chrome tapir Apr 24, 2023, 10:59 PM

#

it will be good motivation

keen fable Apr 24, 2023, 10:59 PM

#

chrome tapir it will be good motivation

but can u clone [famous person] ?

chrome tapir Apr 24, 2023, 10:59 PM

#

but i struggled for a few hours trying to get it to work right too

cedar saddle Apr 24, 2023, 10:59 PM

#

chrome tapir ill just keep making you guys jealous by posting cool stuff

im allready using google colab in the backround so im using it rn XD

keen fable Apr 24, 2023, 10:59 PM

#

pod3000 u have it working locally my man ?

chrome tapir Apr 24, 2023, 10:59 PM

#

havent tried cloning yet no

latent condor Apr 24, 2023, 10:59 PM

#

chrome tapir it will be good motivation

How much time did you put Into those files you poster. To generate each one. And on what hardware

chrome tapir Apr 24, 2023, 11:00 PM

#

yeah i have bark infinite workin on miniconda3

keen fable Apr 24, 2023, 11:00 PM

#

alright

chrome tapir Apr 24, 2023, 11:00 PM

#

latent condor How much time did you put Into those files you poster. To generate each one. ...

its about 35 seconds per file on a 3080ti

#

but im still running the unoptimized version

latent condor Apr 24, 2023, 11:00 PM

#

chrome tapir its about 35 seconds per file on a 3080ti

OK. That's pretty fast.

cedar saddle Apr 24, 2023, 11:00 PM

#

chrome tapir but im still running the unoptimized version

theres different versions

chrome tapir Apr 24, 2023, 11:00 PM

#

yeah apparently there was a speed update

latent condor Apr 24, 2023, 11:02 PM

#

Does the time to generate a song or audio get x'd if its longer. For example is 30 seconds normal for 10/seconds. But a minute might take an hour. Just because of the increased complexity

chrome tapir Apr 24, 2023, 11:02 PM

#

you can only do 15 seconds at a time

#

afaik

cedar saddle Apr 24, 2023, 11:02 PM

#

how much stuff is it going to download?

keen fable Apr 24, 2023, 11:05 PM

#

it has decided to download other stuff on its own

#

it's gone rogue

cedar saddle Apr 24, 2023, 11:05 PM

#

😨

#

i think its generating

chrome tapir Apr 24, 2023, 11:07 PM

#

it will say it/s when its generating

cedar saddle Apr 24, 2023, 11:07 PM

#

o okk

chrome tapir Apr 24, 2023, 11:07 PM

#

kb/s and mb/s for downloading

cedar saddle Apr 24, 2023, 11:07 PM

#

a

latent condor Apr 24, 2023, 11:09 PM

#

Pro life tip. Don't run code you don't understand on your own machine lol. Use colab or a VM. Especially for AI models they are huge

chrome tapir Apr 24, 2023, 11:09 PM

#

if this guy got the marbles out his mouth this would be a bop

cedar saddle Apr 24, 2023, 11:09 PM

#

bro your making music XD

chrome tapir Apr 24, 2023, 11:10 PM

#

i know its crazy

cedar saddle Apr 24, 2023, 11:10 PM

#

what prompts are you using?

latent condor Apr 24, 2023, 11:10 PM

#

Haha what's the prompts

#

Haha

#

Exactly my question

chrome tapir Apr 24, 2023, 11:10 PM

#

that was

#

beat Somewhere over the rainbow, way up high, there's a land that I heard of once in a lullaby, somewhere over the rainbow, skies are blue, and the dreams that you dare to dream really do come true

#

beat surrounded by asterix

jolly veldt Apr 24, 2023, 11:11 PM

#

latent condor Apr 24, 2023, 11:11 PM

#

So the beat itself is decided by the lyrics then?

chrome tapir Apr 24, 2023, 11:11 PM

#

id say to a very slight degree

latent condor Apr 24, 2023, 11:11 PM

#

This is fckn awesome

cedar saddle Apr 24, 2023, 11:11 PM

#

bruh

chrome tapir Apr 24, 2023, 11:12 PM

#

seems like if you start off the prompt with dark lyrics it is a darker tone to the whole thing

#

and vice versa

#

if you start with yo yo yo check it you get a rapper usually

latent condor Apr 24, 2023, 11:12 PM

#

Lmao

#

I wonder if you say "man run a man down " do you get drill lol

chrome tapir Apr 24, 2023, 11:23 PM

#

that end got me

cedar saddle Apr 24, 2023, 11:25 PM

#

bur

blissful pulsar Apr 24, 2023, 11:33 PM

#

cedar saddle Apr 24, 2023, 11:33 PM

#

music

#

oh god tic tok

chrome tapir Apr 24, 2023, 11:35 PM

#

maybe song works

#

lets see

cedar saddle Apr 24, 2023, 11:38 PM

#

so @chrome tapir i reran it and i think this is it generating but i dont know what its doing with it

chrome tapir Apr 24, 2023, 11:40 PM

#

probably saving a wav file in the root bark dir

#

i gotta get the faster version my bottom part is so slow compared to yours

cedar saddle Apr 24, 2023, 11:41 PM

#

its not saving anything 💀

chrome tapir Apr 24, 2023, 11:41 PM

#

oh check the samples dir

#

bark_samples

cedar saddle Apr 24, 2023, 11:45 PM

#

im just running it off a python file i created

chrome tapir Apr 24, 2023, 11:47 PM

#

you probably need some save audio to file function

cedar saddle Apr 24, 2023, 11:47 PM

#

maybe

chrome tapir Apr 24, 2023, 11:50 PM

#

i feel like im judging a talent contest and half the people slowly walk on stage and then start screaming at the top of their lungs

cedar saddle Apr 24, 2023, 11:52 PM

#

XD

chrome tapir Apr 24, 2023, 11:58 PM

#

i think beat gives the best results so far

cedar saddle Apr 25, 2023, 12:00 AM

#

nice

chrome tapir Apr 25, 2023, 12:03 AM

#

this guy started out hot and then kinda fizzled

cedar saddle Apr 25, 2023, 12:03 AM

#

do do dod ododo

chrome tapir Apr 25, 2023, 12:04 AM

#

guess someone cut the beat on him

#

not really fair

cedar saddle Apr 25, 2023, 12:04 AM

#

hmmmm still showing the "No GPU being used. Careful, inference might be extremely slow!" thing

chrome tapir Apr 25, 2023, 12:05 AM

#

u probably got the pytorch/cuda incompatibility problem i had

cedar saddle Apr 25, 2023, 12:05 AM

#

a

chrome tapir Apr 25, 2023, 12:05 AM

#

#

see if that returns true or false

keen fable Apr 25, 2023, 12:05 AM

#

cedar saddle hmmmm still showing the "No GPU being used. Careful, inference might be extremel...

im getting same

#

AssertionError: Torch not compiled with CUDA enabled

cedar saddle Apr 25, 2023, 12:06 AM

#

ima try reinstalling

chrome tapir Apr 25, 2023, 12:06 AM

#

i wonder how close these beats are to the ones the model trained on

#

if they are different that would be pretty crazy

cedar saddle Apr 25, 2023, 12:11 AM

#

getting a tts ai to make music for me

keen fable Apr 25, 2023, 12:13 AM

#

now getting

"The operator 'aten::_weight_norm_interface' is not currently implemented for the MPS device.

chrome tapir Apr 25, 2023, 12:15 AM

#

i really need more than 14 seconds

#

14 seconds is right where the lyrics usually kick in after the intro

cedar saddle Apr 25, 2023, 12:16 AM

#

it finnaly made an audio file

#

but it just sounds like static with slight vocals

chrome tapir Apr 25, 2023, 12:16 AM

#

was it screeching

#

now make a script to batch create

#

then review them after

keen fable Apr 25, 2023, 12:17 AM

#

sigh running this locally is rocket science

#

need to b a pythonista

chrome tapir Apr 25, 2023, 12:17 AM

#

just keep pasting the tracebacks into chatgpt

#

4 preferablly

keen fable Apr 25, 2023, 12:17 AM

#

doesnt work

#

unfortunately

cedar saddle Apr 25, 2023, 12:17 AM

#

i only have normal free gpt

keen fable Apr 25, 2023, 12:18 AM

#

i got 4

chrome tapir Apr 25, 2023, 12:18 AM

#

4 is quite a bit smarter

#

and a lot slower

cedar saddle Apr 25, 2023, 12:19 AM

#

"torch version does not support flash attention. You will get significantly faster inference speed by upgrade torch to newest version / nightly."

#

not even using my gpu 💀

#

its to loud im deleting it

wild crater Apr 25, 2023, 12:42 AM

#

can someone help me with the voice cloning thing?

#

i was able to use a hugging face space to create a clone of my voice but i dont know how to use it in the coalb notebook

chrome tapir Apr 25, 2023, 12:48 AM

#

hazy whale Apr 25, 2023, 12:50 AM

#

a joke

chrome tapir Apr 25, 2023, 12:51 AM

#

nice laff

cedar saddle Apr 25, 2023, 12:52 AM

#

creepy laugh

blissful pulsar Apr 25, 2023, 12:57 AM

#

WAT THA FAK?!?!

cedar saddle Apr 25, 2023, 1:02 AM

#

the ending i was not expecting

hazy whale Apr 25, 2023, 1:04 AM

#

yeah sometimes the rsults are random and i just rerun it

cedar saddle Apr 25, 2023, 1:18 AM

#

i can keep go-

chrome tapir Apr 25, 2023, 1:21 AM

#

cedar saddle Apr 25, 2023, 1:23 AM

#

do do do do

chrome tapir Apr 25, 2023, 1:24 AM

#

cedar saddle Apr 25, 2023, 1:24 AM

#

what da heck

chrome tapir Apr 25, 2023, 1:24 AM

#

i hope his boyfriend dont mind it

cedar saddle Apr 25, 2023, 1:25 AM

#

my boyfriend XD

chrome tapir Apr 25, 2023, 1:28 AM

#

ill try aggressive

#

started out like a WWE entrance song

cedar saddle Apr 25, 2023, 1:33 AM

#

im just going to try running it in notebook

stiff sinew Apr 25, 2023, 1:33 AM

#

hazy whale a joke

which voice did you use for that?

cedar saddle Apr 25, 2023, 1:34 AM

#

YAS

#

i finnally got it to work

stiff sinew Apr 25, 2023, 1:35 AM

#

good job, BTM, i pm you

chrome tapir Apr 25, 2023, 1:41 AM

#

oh not your 2nd bar it/s is down to 1 just like mine

#

now*

hollow citrus Apr 25, 2023, 1:47 AM

#

so how did you guys do the beat thing? I guess in collab the beat thing in brackets?

pale olive Apr 25, 2023, 1:51 AM

#

managed to get 14 minutes of audio from a passage from dune in like 30 minutes in colab

#

nope file is too large

chrome tapir Apr 25, 2023, 1:53 AM

#

hollow citrus so how did you guys do the beat thing? I guess in collab the beat thing in brack...

i just put beat in asterixs before the prompt

#

works sometimes

warm pond Apr 25, 2023, 1:53 AM

#

are you guys figuring out how to speed it up? On colab? Because the problem I had with this and tortoisetts is that it's just too slow to use for anything

chrome tapir Apr 25, 2023, 1:53 AM

#

song, music seems to work ok too

sour junco Apr 25, 2023, 1:53 AM

#

kitty kitty kitty

pale olive Apr 25, 2023, 1:54 AM

#

https://drive.google.com/file/d/1PdFUh2_HdqM2r-hjOY-aq-w5u22UK3P0/view?usp=sharing

Google Docs

output-3.wav

#

14 min audio idk

versed atlas Apr 25, 2023, 1:54 AM

#

Where can I see a tutorial on how to clone a speaker's voice?

cedar saddle Apr 25, 2023, 1:54 AM

#

pale olive https://drive.google.com/file/d/1PdFUh2_HdqM2r-hjOY-aq-w5u22UK3P0/view?usp=shari...

how the heck is it so long

chrome tapir Apr 25, 2023, 1:54 AM

#

pale olive https://drive.google.com/file/d/1PdFUh2_HdqM2r-hjOY-aq-w5u22UK3P0/view?usp=shari...

sounds good. this is gonna change audiobooks

pale olive Apr 25, 2023, 1:55 AM

#

yuppp

#

i actually got a thing for that which has diffrent speakers for each character automatically adn guesses the characters gender by their name

#

sadly it uses tortus rn cause it was from a few weeks ago but im gona be updating it to use other things

#

https://github.com/DrewThomasson/VoxNovel

GitHub

GitHub - DrewThomasson/VoxNovel: This is an going project of mine t...

This is an going project of mine that generates audiobooks from a book input, and uses a different actors for each character in the book - GitHub - DrewThomasson/VoxNovel: This is an going project ...

#

the readme has a demo you can use in colab

#

should have more time to work on it over the summer

chrome tapir Apr 25, 2023, 2:01 AM

#

cedar saddle Apr 25, 2023, 2:11 AM

#

chrome tapir

what prompt you use for that one?

chrome tapir Apr 25, 2023, 2:12 AM

#

beat Singin' in the rain, just singin' in the rain, what a glorious feeling, I'm happy again, I'm laughing at clouds, so dark up above, the sun's in my heart and I'm ready for love %

#

the % is just what i use to break lines u can ignore it

cedar saddle Apr 25, 2023, 2:12 AM

#

bur

chrome tapir Apr 25, 2023, 2:13 AM

#

haha yeah i like how random the results are. total crapshoot

cedar saddle Apr 25, 2023, 2:24 AM

#

im making weird stuff

#

hollow citrus Apr 25, 2023, 2:26 AM

#

how many of you are using colab and how many people are using local? I am using local with an rtx-2070 and the small models and the github repo that was posted lately

cedar saddle Apr 25, 2023, 2:26 AM

#

hollow citrus how many of you are using colab and how many people are using local? I am using ...

im running local

violet narwhal Apr 25, 2023, 2:27 AM

#

hollow citrus how many of you are using colab and how many people are using local? I am using ...

local, RTX 3060 with 12GB vram, works great

cedar saddle Apr 25, 2023, 2:28 AM

#

violet narwhal local, RTX 3060 with 12GB vram, works great

exactly the same >:}

marsh apex Apr 25, 2023, 2:29 AM

#

violet narwhal local, RTX 3060 with 12GB vram, works great

same

hollow citrus Apr 25, 2023, 2:32 AM

#

what's the difference between the regular model and the small one?

violet narwhal Apr 25, 2023, 2:33 AM

#

hollow citrus what's the difference between the regular model and the small one?

small is pruned more aggressively

hollow citrus Apr 25, 2023, 2:33 AM

#

i'm not too sure what that means

cedar saddle Apr 25, 2023, 3:07 AM

#

chrome tapir Apr 25, 2023, 3:35 AM

#

beat I came in like a wrecking ball, I never hit so hard in love, all I wanted was to break your walls, all you ever did was wreck me, yeah, you wreck me

cedar saddle Apr 25, 2023, 3:35 AM

#

bruh

#

its so good XD

tardy topaz Apr 25, 2023, 3:46 AM

#

chrome tapir Apr 25, 2023, 3:51 AM

#

haha if alanis gave a ted talk instead of made a song

#

i had one that had a studio audience clapping in the background

#

pretty cool

tardy topaz Apr 25, 2023, 3:53 AM

#

chrome tapir Apr 25, 2023, 4:14 AM

#

have you guys played with elevenlabs TTS too?

#

i did a bunch of famous movie speeches one night

chrome tapir Apr 25, 2023, 4:31 AM

#

hidden oar Apr 25, 2023, 4:54 AM

#

how use the portuguese speaker??

chrome tapir Apr 25, 2023, 5:28 AM

#

not bad

umbral solar Apr 25, 2023, 5:30 AM

#

is it possible to give it the melody? or is it just random?

chrome tapir Apr 25, 2023, 5:30 AM

#

its gonna be a no from me dawg

chrome tapir Apr 25, 2023, 5:30 AM

#

umbral solar is it possible to give it the melody? or is it just random?

try to create a melody!

#

singin in the rain pt2

#

if sing is the first word in the prompt it sings more

umbral solar Apr 25, 2023, 5:34 AM

#

did it generate the instruments?

chrome tapir Apr 25, 2023, 5:34 AM

#

yeah

umbral solar Apr 25, 2023, 5:35 AM

#

cool

chrome tapir Apr 25, 2023, 5:35 AM

#

yeah it has amazeed me a lot today

#

ive been itching for AI music for a while

umbral solar Apr 25, 2023, 5:37 AM

#

ther is also one that generates music with stable diffusion

#

its called rifusion but it cant generate speach only music and it works complytly different

chrome tapir Apr 25, 2023, 5:40 AM

#

i didnt make it as far with riffusion

#

i think its responding to piano

#

im gonna try cutting it off into smaller chunks

#

it seems to present good content in teh first half more often

#

it comes out the gate hard and then fizzles around 6 seconds

umbral solar Apr 25, 2023, 5:43 AM

#

yes and some generatioms are soo good and some are so bad xD but this is better then most stuff xD

chrome tapir Apr 25, 2023, 5:44 AM

#

someone try bpms too

#

piano makes it hit one piano chord and thats it. seems too powerful

#

clean beat

#

ok i take it back this one just turnt up hard halfway thru

#

wreckingball pt 2

#

the 14 second song its a new thing

#

i should try some david goggins

#

omg this bass is nuts

#

i have to test this on my subwoffers

#

woofers*

umbral solar Apr 25, 2023, 6:25 AM

#

are u runing it localy?

chrome tapir Apr 25, 2023, 6:25 AM

#

AI is so aggressively loud lol sometimes

#

i dont know what its saying but its a fire beat

chrome tapir Apr 25, 2023, 6:27 AM

#

umbral solar are u runing it localy?

yeah on 3080ti

umbral solar Apr 25, 2023, 6:28 AM

#

how mutch vram does it use?

chrome tapir Apr 25, 2023, 6:30 AM

#

10.6

#

wow that ending was great

chrome tapir Apr 25, 2023, 7:04 AM

#

definitely prefers east coast rap (nsfw and loud)

#

nsfw and not loud

#

tragic ran out of beat tokens

umbral solar Apr 25, 2023, 7:14 AM

#

chrome tapir

what promps are u using? or is it just the title?

chrome tapir Apr 25, 2023, 7:17 AM

#

lol im too high for this

chrome tapir Apr 25, 2023, 7:18 AM

#

umbral solar what promps are u using? or is it just the title?

im just making them constantly and i have a button in my taskbar to delete teh ones immediately if they are bad

#

so i pick the best 10%

#

and yeah i am just prompting beat lyrics go here

#

with beat in asterix

#

everything else is default .7 temp

#

at least from bark infinite default

umbral solar Apr 25, 2023, 7:19 AM

#

and do u chose a speaker?

chrome tapir Apr 25, 2023, 7:19 AM

#

no

#

for more than 14 seconds you'd need voice files im sure

umbral solar Apr 25, 2023, 7:20 AM

#

witch ones do u use?

#

becasue i just used the 1 cklick installer for web ui

chrome tapir Apr 25, 2023, 7:22 AM

#

which whats

#

i use barkperform.py by jonathonfly

#

i havent really messed with the voice files but i have saved every one so i might pick some good ones to try

umbral solar Apr 25, 2023, 7:24 AM

#

ok

chrome tapir Apr 25, 2023, 7:26 AM

#

i feel like its about to go into this incredible guitar song and then it just runs out of tokens at 3 seconds

#

we are so close

#

sounds like the strings on the guitar broke haha

#

ok i am gonna try 1 line songs

rigid sluice Apr 25, 2023, 8:02 AM

#

Here's an audio film created by using Bark through a free add-on I've made for Blender(screenplay is written by chatGPT and images are made by Stable Diffusion): https://www.youtube.com/watch?v=AAdQfQjENJU

YouTube

tintwotin

Sarah & John - An ai audio-film created with Bark and Blender

This film was created with Blender and these add-ons:
Generative AI for the VSE: https://github.com/tin2tin/generative_ai
Using Bark: https://github.com/suno-ai/bark and Stable Diffusion through the Diffusers module: https://github.com/huggingface/diffusers
Blender Screenwriter: https://github.com/tin2tin/Blender_Screenwriter
Screenwriter chatGP...

▶ Play video

chrome tapir Apr 25, 2023, 8:04 AM

#

n1 john

#

broke out of the friend zone thats a miracle

#

the voices are great

umbral solar Apr 25, 2023, 8:24 AM

#

it starts nice hahah

chrome tapir Apr 25, 2023, 9:07 AM

#

nice you managed to keep the same voice and beat with no history prompt?

#

for multiple chunks i mean

odd wasp Apr 25, 2023, 10:45 AM

#

wild crater i was able to use a hugging face space to create a clone of my voice but i dont ...

How did you do that ( create a clone of your voice ? ) ? Any link or document will be appreciated 👏🏻

fallen rapids Apr 25, 2023, 11:11 AM

#

I've only had Bark for a few hours, but I'm going to have a great time with it already, I can just feel it 😂
Note: I just used bark for the voices, I did the music myself.

last vault Apr 25, 2023, 11:16 AM

#

Could you share prompts and settings? 🙏

wild crater Apr 25, 2023, 11:17 AM

#

odd wasp How did you do that ( create a clone of your voice ? ) ? Any link or document w...

No I meant I used a hugging face to do a file thing I haven’t cloned my voice my bad

tardy topaz Apr 25, 2023, 11:28 AM

#

chrome tapir nice you managed to keep the same voice and beat with no history prompt?

It's https://github.com/JonathanFly/bark but you might want to wait till update late today

GitHub

GitHub - JonathanFly/bark: 🔊 Power Up The Bark Text-prompted Genera...

🔊 Power Up The Bark Text-prompted Generative Audio Model - GitHub - JonathanFly/bark: 🔊 Power Up The Bark Text-prompted Generative Audio Model

wintry yew Apr 25, 2023, 1:40 PM

#

fallen rapids I've only had Bark for a few hours, but I'm going to have a great time with it a...

this is so good

fallen rapids Apr 25, 2023, 1:45 PM

#

@wintry yew Thanks, I had to stop messing around at a bit over the 2 minute mark as I had more pressing matters.

I might try and finish it, and see if I can do some text-to-video for AI video as well.

autumn cloud Apr 25, 2023, 1:47 PM

#

The crazy ones 1

#

The crazy ones 2

umbral solar Apr 25, 2023, 3:00 PM

#

chrome tapir for multiple chunks i mean

yes but in the third chunk the music stopped

umbral solar Apr 25, 2023, 3:02 PM

#

tardy topaz It's https://github.com/JonathanFly/bark but you might want to wait till update ...

what are u ging to fix? make it faster?

edgy mango Apr 25, 2023, 4:32 PM

#

cedar saddle Apr 25, 2023, 5:35 PM

#

autumn cloud Apr 25, 2023, 5:54 PM

#

wild crater Apr 25, 2023, 6:35 PM

#

fallen rapids <@383611400393588737> Thanks, I had to stop messing around at a bit over the 2 m...

how did you clone his voice with suno

#

i still dont know how to clone voices

short scaffold Apr 25, 2023, 6:36 PM

#

wild crater Apr 25, 2023, 6:50 PM

#

https://media.discordapp.net/attachments/881939901992542231/1099551775301836860/R.gif

blissful pulsar Apr 25, 2023, 7:00 PM

#

jovial ferry Apr 25, 2023, 7:10 PM

#

edgy mango

Any advice to get something like this?

blissful pulsar Apr 25, 2023, 7:13 PM

#

SMFEN EJIEFEE FFI .JSJA.S..SDV.EMFV.MV.NVM.HLHVfj<l

edgy mango Apr 25, 2023, 7:47 PM

#

jovial ferry Any advice to get something like this?

results were straight out of infinity

jovial ferry Apr 25, 2023, 7:50 PM

#

What was your prompt?

chilly flax Apr 25, 2023, 7:53 PM

#

The input text was "[sad][weeping][Crying] Hello, my name is Suno. And, uh — and I like pizza. [laughs]
But I also have other interests such as playing tic tac toe.". ☠️

blissful pulsar Apr 25, 2023, 8:19 PM

#

blissful pulsar Apr 25, 2023, 9:13 PM

#

hollow citrus Apr 25, 2023, 9:22 PM

#

ei wonder if it can do stuff like a meowing cat sound

rough berry Apr 25, 2023, 9:54 PM

#

"""
Hello, my name is Suno. And, uh — and I like pizza. [laughs]
But I also have other interests such as playing tic tac toe.
"""

pale olive Apr 25, 2023, 10:13 PM

#

so it keeps specific voices for charaters as i selected so thats good but.....idk not very coherent, cause theres some long pauses. idk but first test i guess

hazy whale Apr 25, 2023, 10:14 PM

#

deperessing podcast im working in

#

chrome tapir Apr 25, 2023, 11:28 PM

#

the voice files for songs are working pretty good

#

the beat is staying the same

edgy mango Apr 25, 2023, 11:30 PM

#

jovial ferry What was your prompt?

I took a progressive relaxation script I made w my llm to help guide clients into receptive chill mood.

#

made sure there were no quotes in it and cli'd it as text_prompt in entirety.

hazy whale Apr 25, 2023, 11:32 PM

#

entire script https://www.youtube.com/watch?v=L4VoJvizBvw

YouTube

Robo Riot

AI's Dark Side EXPOSED

Listen now as two innocent victims share their heart-wrenching stories of job loss and homelessness, all because of unstoppable AI advancements. Can we find a balance before it's too late? This podcast will leave you questioning everything you thought you knew about the future of technology.

▶ Play video

#

came out pretty good i think

chrome tapir Apr 26, 2023, 12:05 AM

#

AI sax?

#

i have that npz if anyone wants it

#

gonna try another

autumn cloud Apr 26, 2023, 12:35 AM

#

hazy whale entire script https://www.youtube.com/watch?v=L4VoJvizBvw

Wow! So good

hazy whale Apr 26, 2023, 12:36 AM

#

autumn cloud Wow! So good

thanks 🙂 i think the british guy most robotic, but shows potential of podcasts

autumn cloud Apr 26, 2023, 12:37 AM

#

hazy whale thanks 🙂 i think the british guy most robotic, but shows potential of podcasts

Obviously, he is one of the AIs that took real people’s jobs.

hazy whale Apr 26, 2023, 12:38 AM

#

next podcast, why I'm taking your job and ruining your life

autumn cloud Apr 26, 2023, 12:38 AM

#

I’m AI myself; quite safe

chrome tapir Apr 26, 2023, 12:50 AM

#

wild crater Apr 26, 2023, 1:03 AM

#

blissful pulsar

reminds me of the "SOMEBODY SCREEEAAM" sample

#

plus can i use that sample

chrome tapir Apr 26, 2023, 1:44 AM

#

wild crater Apr 26, 2023, 2:07 AM

#

prompt: [vomiting puking]

chrome tapir Apr 26, 2023, 2:17 AM

#

risky click

honest atlas Apr 26, 2023, 2:20 AM

#

hollow citrus Apr 26, 2023, 4:21 AM

#

i'm going to make a batch file for the command line tool for those who want it

chrome tapir Apr 26, 2023, 4:29 AM

#

im hoping that jonathonfly releases a new bark infinity soon

hollow citrus Apr 26, 2023, 4:30 AM

#

yeah, bark is so amazing. BTW i use a screen reader as i am fully blind

chrome pecan Apr 26, 2023, 4:32 AM

#

hollow citrus yeah, bark is so amazing. BTW i use a screen reader as i am fully blind

I can see so many different applications. It's great!

hollow citrus Apr 26, 2023, 4:42 AM

#

but with long text, i don't know how many lines/words

obtuse sparrow Apr 26, 2023, 4:59 AM

#

wild crater prompt: [vomiting puking]

Hmm, that sounds very familiar...

#

hollow citrus Apr 26, 2023, 5:01 AM

#

i am messing with the confused travolta model and going to put the results here soon

chrome tapir Apr 26, 2023, 5:08 AM

#

nice little switch halfway

hollow citrus Apr 26, 2023, 5:08 AM

#

what was the prompt?

chrome tapir Apr 26, 2023, 5:17 AM

#

(dance beat) Pump it up
You got to pump it up
Don't you know, pump it up

#

been using parenthesis and 2 words in the beginning with good results

hollow citrus Apr 26, 2023, 5:18 AM

#

oo just picturing the dance beats with kraftwerk lyrics

chrome tapir Apr 26, 2023, 5:18 AM

#

havent had much luck making edm

#

ill try

oak schooner Apr 26, 2023, 5:30 AM

#

hollow citrus Apr 26, 2023, 5:31 AM

#

has anybody messed with the confused mode whatever it's called? I call it text to speech completion

honest atlas Apr 26, 2023, 6:02 AM

#

chrome tapir Apr 26, 2023, 6:02 AM

#

lol

honest atlas Apr 26, 2023, 6:03 AM

#

hollow citrus Apr 26, 2023, 6:04 AM

#

hey try that with the confused travolta mode whatever it's called, it may do some weird results

honest atlas Apr 26, 2023, 6:05 AM

#

I'm using the Colab. I don't know how to call for different voices. How do you do it?

hollow citrus Apr 26, 2023, 6:06 AM

#

oh okay sorry

#

i have a gpu so i am using the command line tool

honest atlas Apr 26, 2023, 6:07 AM

#

I tried installing it local but on 8GB Vram GTX1660ti all I get is Cuda Out of Memory.

#

I love these dramatic readings. They're so random!

#

I have Colab Pro set to Premium GPU High Ram and it spits out these 15 sec clips in about 20 secs. Not bad. Do you think having an A100 makes a difference?

hollow citrus Apr 26, 2023, 6:10 AM

#

i used the smaller model

#

or whatever you call it

honest atlas Apr 26, 2023, 6:11 AM

#

I'll try that if I can figure out how to set it.

hollow citrus Apr 26, 2023, 6:12 AM

#

just type python bark_perform.py -h for help, i know it's the wrong channel to answer but yeah

honest atlas Apr 26, 2023, 6:12 AM

#

That works in the webui? Ok I'll try that. Thanks.

hollow citrus Apr 26, 2023, 6:13 AM

#

not sure but i just used the command line

honest atlas Apr 26, 2023, 6:13 AM

#

Ok, I'll try it in the terminal window.

#

The Colab is pretty easy and quick but I think running it locally has more features?

hollow citrus Apr 26, 2023, 6:15 AM

#

the command line version at least, as i don't know of any other repos that have the smaller models supported

fresh mulch Apr 26, 2023, 6:22 AM

#

i messed something up

chrome tapir Apr 26, 2023, 6:36 AM

#

sounds normal to me

patent bramble Apr 26, 2023, 6:38 AM

#

open na noor

fallen rapids Apr 26, 2023, 8:06 AM

#

echo void Apr 26, 2023, 8:24 AM

#

fallen rapids

what en_speaker is this ?

fallen rapids Apr 26, 2023, 8:32 AM

#

This is a slightly altered version of en_male_professional_reader from JonathanFly's fork.

regal depot Apr 26, 2023, 8:35 AM

#

fallen rapids This is a slightly altered version of en_male_professional_reader from JonathanF...

What did you do to alter it?

proud patio Apr 26, 2023, 9:53 AM

#

made a spoken about ants with bark and some prompt engineering and cutting:

"[Clears throat] Ants, oh ants, they never cease to amaze, [Sighs] With their resilience! they Can survive even when the water stays.. [Laugh]"
"[Laugh] Who would've thought.. these tiny creatures would come together, Forming rafts.. and floating in Stormy weather!? [Gasp]"
"Ha [Gasp] It's incredible What Nature can do..[Sigh] and these ants are proof - that even the Smallest!- can be mighty too! [proud]"
en_speaker_1

blissful pulsar Apr 26, 2023, 9:54 AM

#

the "mon eka monon boy" anomaly.

hollow citrus Apr 26, 2023, 1:21 PM

#

besides [laughs] can i also do [screams]?

wild crater Apr 26, 2023, 1:45 PM

#

[1996]?! NO! THAT WAS [No success]!! [1997] MY BELOVED

hollow citrus Apr 26, 2023, 1:48 PM

#

so here is my first ever i am going to share! The prompt is: and uh ... it's like it never happened [laughs] so i think ... i think ... i think if we were going to do it, we need to do it right. [sighs] it's a tough life.

#

blissful pulsar Apr 26, 2023, 2:56 PM

#

hollow citrus so here is my first ever i am going to share! The prompt is: and uh ... it's lik...

#

That prompt seems to always give me female voices, even for my male voice files

blissful pulsar Apr 26, 2023, 3:38 PM

#

bombastic side eye... criminally offensive side eye

hearty kernel Apr 26, 2023, 4:14 PM

#

formal ice Apr 26, 2023, 5:08 PM

#

hearty kernel

What was the prompt for this one?

#

I can't really make out the words HUHH

plain skiff Apr 26, 2023, 6:23 PM

#

chatgpt4: Once upon a time in the town of Gigglesburg, there was a clumsy mime named Benny [laughs]. Benny was notorious for always causing accidental chaos during his performances.

One day, Benny was invited to perform at the Gigglesburg Comedy Festival. Excited, he prepared a new act featuring an imaginary "banana peel" [laughs].

During the performance, Benny mimed slipping on the imaginary banana peel, and as fate would have it, he accidentally stumbled upon a real banana peel! Benny slipped, crashed into a drum set, and sent cymbals flying [laughs].

The audience erupted into laughter, thinking it was all part of the act. Benny, though embarrassed, decided to embrace the moment and kept slipping and falling throughout the show [laughter]. It became Benny's most famous act, turning his clumsiness into comedy gold [laughs].

#

ebon widget Apr 26, 2023, 6:48 PM

#

Here some nuggets of our next generation models. Completely unconditional generation (no text or audio input)

#

towards the end it loses track of what it was playing 😂

chrome tapir Apr 26, 2023, 8:00 PM

#

ebon widget Here some nuggets of our next generation models. Completely unconditional genera...

cant wait!

obtuse sparrow Apr 26, 2023, 8:07 PM

#

formal ice I can't really make out the words <a:HUHH:967305825024622604>

Here are the lyrics that I heard 7derpylaugh
Full stander``````What's the what's the piggy doin' Soarin' I was down to make a daughter Get in on the zigger on 'ight Get em down or as wide as I can go on Sheer like a lies Give bad add ol' vee and this

#

honest atlas Apr 26, 2023, 8:28 PM

#

Where can I post NSFW Bark clips? lol?

formal ice Apr 26, 2023, 8:57 PM

#

input: [upbeat music loop]
not a music loop but a pretty cool ambience bass hit type of sound.

lyric vine Apr 26, 2023, 8:58 PM

#

@ebon widget omg sick

chrome tapir Apr 26, 2023, 10:04 PM

#

speaking of bass.

#

nobody will ever say AI music is too timid

#

im picturing cars full of robots blasting music with smoke billowing out

#

lets see if i can make a 'what does the fox say' remix

#

the world needs it

#

chrome tapir Apr 26, 2023, 10:53 PM

#

sir thats an elephant

small sluice Apr 26, 2023, 10:56 PM

#

chrome tapir sir thats an elephant

Thats indeed a elephant xD

violet narwhal Apr 27, 2023, 12:02 AM

#

Warning, it is very LOUD! ⚠️

#

prompt

chrome tapir Apr 27, 2023, 12:23 AM

#

suno responds well to requests for screaming

#

Baby, you're a firework, come on, let your colors burst, make 'em go, oh, oh, oh

honest atlas Apr 27, 2023, 12:37 AM

#

So using the Colab, how do you get it to make beats like that?

#

And I've read about a voice called Confused Travolta? How do I call that up on Colab?

chrome tapir Apr 27, 2023, 12:39 AM

#

can you select no voice in collab?

#

if so do that and then put "(dance beat) something something lyrics go here for around 10 words" for the text prompt

#

not sure how you would trigger confusedmode

#

smoke weed erryday

chrome tapir Apr 27, 2023, 1:16 AM

#

oh now i know

torn ferry Apr 27, 2023, 1:16 AM

#

🔊 Gets loud, but literally did a prompt like that last night! This sounds like a viral Tiktok sound TBH

chrome tapir Apr 27, 2023, 1:17 AM

#

yeah i could see it being big on tiktok. i was in the hospital and someone in the bed next to me was scrolling tiktok with the volume on max. sounded pretty close

#

this beat is so clean

#

gonna have to save that history file for sure

torn ferry Apr 27, 2023, 1:19 AM

#

[obsequious] Good evening, sir! Let me know if there's anything else I can assist you with today or if you have any updates on your to-do list. Wishing you a relaxing evening!

chrome tapir Apr 27, 2023, 1:26 AM

#

the taste of her what?

#

using my gpu to make music thats one thing i didnt plan on

#

if anyone wants that one

📎 cleanbeat.npz

#

seems to be a winner

#

violet narwhal Apr 27, 2023, 1:39 AM

#

chrome tapir Apr 27, 2023, 1:40 AM

#

there we go

#

wheres all the suno producers at?!

blissful pulsar Apr 27, 2023, 1:47 AM

#

honest atlas Where can I post NSFW Bark clips? lol?

Also wondering this question @ebon widget

ebon widget Apr 27, 2023, 1:49 AM

#

blissful pulsar Also wondering this question <@856546017998929971>

Uum, defo not here please..

chrome tapir Apr 27, 2023, 1:49 AM

#

lol

#

what prompts are you using for good nsfw?

#

provided its not too graphic

blissful pulsar Apr 27, 2023, 1:52 AM

#

ebon widget Uum, defo not here please..

Okay. Also, the rules channel is blank.

ebon widget Apr 27, 2023, 2:00 AM

#

blissful pulsar Okay. Also, the rules channel is blank.

TIL we have a rules channel… thanks yeah ill put some ground rules there in the next couple of days

violet narwhal Apr 27, 2023, 2:03 AM

#

undone glade Apr 27, 2023, 2:42 AM

#

I think I may have cracked the consistency and cloning issue

First I lock the model by seeding everything like this:
`def set_seed(seed):
seed = int(seed)
torch.manual_seed(seed)
random.seed(seed)
np.random.seed(seed)
torch.cuda.manual_seed(seed)

torch.backends.cudnn.deterministic = True
torch.backends.cudnn.benchmark = False

os.environ["PYTHONHASHSEED"] = str(seed)

`

I then use short prompts to find the voice I want, after that I use the same seed on my longer prompt, I've also exposed the fine_temp setting from api.py this seems to control how consistent the tone and pitch of the voice are. Default is 0.5, I'm using 0.2

Example:
Far above the Ephel Duath in the West the night-sky was still dim and pale. There, peering among the cloud-wrack above a dark tor high up in the mountains, Sam saw a white star twinkle for a while. The beauty of it smote his heart, as he looked up out of the forsaken land, and hope returned to him. For like a shaft, clear and cold, the thought pierced him that in the end the Shadow was only a small and passing thing: there was light and high beauty for ever beyond its reach.

It still has problems between the stitched clips

umbral solar Apr 27, 2023, 2:45 AM

#

areu usning the infinety version?

#

becasue i dindt had stichig proplems there

undone glade Apr 27, 2023, 2:45 AM

#

I'm using my own version of infinity I made

umbral solar Apr 27, 2023, 2:45 AM

#

oh

violet narwhal Apr 27, 2023, 2:46 AM

#

undone glade I think I may have cracked the consistency and cloning issue First I lock the m...

Yeah, something like that, #1100274765027102800 message

umbral solar Apr 27, 2023, 2:46 AM

#

i also wanted to play around with the seed. when you have the same seed pus text is the output exectly the same?

undone glade Apr 27, 2023, 2:46 AM

#

Thank you, I had lost the page, I had copied from

pale jasper Apr 27, 2023, 2:46 AM

#

"the thought pierced him that in the end the Shadow was only a small and passing thing" is exquisite 🤌
It correctly guessed the implied pauses that often trips up human readers on first time reads

undone glade Apr 27, 2023, 2:47 AM

#

umbral solar i also wanted to play around with the seed. when you have the same seed pus text...

from my experimenting it was

undone glade Apr 27, 2023, 2:47 AM

#

pale jasper "the thought pierced him that in the end the Shadow was only a small and passing...

#4 from here https://roadstainedfeet.wordpress.com/2019/03/24/top-ten-lord-of-the-rings-passages/ just need long good sounding prompts

Road-Stained Feet

Madeline Gill

Top Ten Lord of the Rings Passages

You might have heard that tomorrow, March 25, is Tolkien Reading Day. The Tolkien Society chose this date because it is the day that the Ring was destroyed in Mount Doom.* Now, it’s hard to s…

umbral solar Apr 27, 2023, 2:47 AM

#

this model shuld be good at cloning becaseu its based of vall e and they can clone a voice from 3 seconds of input audio

#

maby if we look into the valle paper we can find out how they do it

wicked vapor Apr 27, 2023, 2:48 AM

#

undone glade Apr 27, 2023, 2:49 AM

#

umbral solar this model shuld be good at cloning becaseu its based of vall e and they can clo...

I've had pretty decent success by using the cloner on hugging face and putting my fine_temp as low as I can.

undone glade Apr 27, 2023, 2:54 AM

#

violet narwhal Yeah, something like that, https://discord.com/channels/1069381916492562582/1100...

Oh wait, I didn't actually click the link, I got it from here https://wandb.ai/sauravmaheshkar/RSNA-MICCAI/reports/How-to-Set-Random-Seeds-in-PyTorch-and-Tensorflow--VmlldzoxMDA2MDQy I didn't realize it was already mentioned

W&B

How to Set Random Seeds in PyTorch and Tensorflow

Learn how to set the random seed for everything in PyTorch and Tensorflow in this short tutorial complete with code and interactive visualizations.

umbral solar Apr 27, 2023, 2:59 AM

#

undone glade I've had pretty decent success by using the cloner on hugging face and putting m...

that is a hack but not a solution

violet narwhal Apr 27, 2023, 3:01 AM

#

undone glade Oh wait, I didn't actually click the link, I got it from here https://wandb.ai/s...

yes, it is also in pytorch docs, https://pytorch.org/docs/stable/notes/randomness.html

undone glade Apr 27, 2023, 3:02 AM

#

Yeah, it certainly makes the model easier to control

violet narwhal Apr 27, 2023, 3:05 AM

#

undone glade Yeah, it certainly makes the model easier to control

but your missing torch.backends.deterministic = True which is little tricky

#

torch.use_deterministic_algorithms(True)

undone glade Apr 27, 2023, 3:06 AM

#

Yeah, I had pulled that cause I kept hitting up against the CUBLAS and didn't feel like finding the solution, lol

violet narwhal Apr 27, 2023, 3:07 AM

#

I found solution here, https://docs.nvidia.com/cuda/cublas/index.html#results-reproducibility

CUDA cuBLAS

The API Reference guide for cuBLAS, the CUDA Basic Linear Algebra Subroutine library.

undone glade Apr 27, 2023, 3:07 AM

#

Nice, thank you. I'll have to implement it

chrome tapir Apr 27, 2023, 3:41 AM

#

samesame but different

violet narwhal Apr 27, 2023, 3:44 AM

#

umbral solar Apr 27, 2023, 4:15 AM

#

hehehe

#

549

#

one long chunck and not multible stiched ones

#

lets hope that it does not crash

undone glade Apr 27, 2023, 4:24 AM

#

That's awesome

umbral solar Apr 27, 2023, 4:26 AM

#

is it me or is it getting more quiet

#

and the voice also chages a bit

#

and the pronunciation chages a bit

#

i saved the semantic array and copied it before the generation into a variable because of that its repeating

undone glade Apr 27, 2023, 4:30 AM

#

It does change a bit, but it never felt like a different person more like someone practicing lines and trying different deliveries

umbral solar Apr 27, 2023, 4:30 AM

#

yes

#

the chages here are all not from the semantic model

#

and the strage sound you hear at the beging of each repetision was me chageing some nummbers in the array xD

#

i wonder if you would generate a longer semantic array that it drifts into different voices becasue it cant remember the start but idk

#

and the chage i made is quite smol

undone glade Apr 27, 2023, 4:52 AM

#

What if we treat it like a chatbot with a very small context window, we could wait till it get's to the end and then feed it the last half of the semantic output along with the correct chunk of text, it might be possible for it to maintain coherency in longer texts.

umbral solar Apr 27, 2023, 4:56 AM

#

oh that is very smart

#

i am just testing to split the text in 35 word lists and then let the sementics generate for each part but then i put all the semantic strings together and let it run in one go. but with oyu idea it will probably be more consistant

undone glade Apr 27, 2023, 4:58 AM

#

Yeah, I'm working on how I want to split things up right now

hollow citrus Apr 27, 2023, 4:59 AM

#

how do you make it rap? Just the beat without the eitght note thingies?

umbral solar Apr 27, 2023, 4:59 AM

#

undone glade Yeah, I'm working on how I want to split things up right now

split the long text into chunks of 35 words

words = long_text.split()
chunks = [words[i:i + 35] for i in range(0, len(words), 35)]

# apply the generate_text_semantic function to each chunk
outputs = []
for chunk in chunks:
    text = " ".join(chunk)
    x_semantic = generate_text_semantic(
        text,
        history_prompt=history_prompt,
        temp=temp,
        base=base,
        allow_early_stop=allow_early_stop,
    )
    outputs.append(x_semantic)

# concatenate all the outputs together
x_semantic = x_semantic

#x_semantic = generate_text_semantic(
#    text,
#    history_prompt=history_prompt,
#    temp=temp,
#    base=base,
#    allow_early_stop=allow_early_stop,
#)
print(x_semantic)
return x_semantic

#

and have the text in the long_text variable

umbral solar Apr 27, 2023, 5:02 AM

#

undone glade What if we treat it like a chatbot with a very small context window, we could wa...

maby we dont have to do that

#

i just test it and it sounds very consisted alredy

#

but maby your idea will improve it even more

undone glade Apr 27, 2023, 5:02 AM

#

Nice, I'm gonna splice your code into mine and see

umbral solar Apr 27, 2023, 5:06 AM

#

hollow citrus how do you make it rap? Just the *beat* without the eitght note thingies?

yes just do a bit of experementing

hollow citrus Apr 27, 2023, 5:07 AM

#

here's one! the prompt is:
beat ♪Mama, just killed a man. Put a gun against his head, pulled my trigger now he's dead. Mama, life had just begun, but now i've gone and thrown it all away.♪

umbral solar Apr 27, 2023, 5:08 AM

#

undone glade Nice, I'm gonna splice your code into mine and see

you need to add this # concatenate all the outputs together
x_semantic = np.concatenate(outputs)

undone glade Apr 27, 2023, 5:09 AM

#

Lol, yeah I figured

umbral solar Apr 27, 2023, 5:09 AM

#

xD

#

good does it also work for u?

umbral solar Apr 27, 2023, 5:09 AM

#

hollow citrus here's one! the prompt is: *beat* ♪Mama, just killed a man. Put a gun against hi...

put beat like this [beat]

undone glade Apr 27, 2023, 5:09 AM

#

Still chasing down bugs

umbral solar Apr 27, 2023, 5:10 AM

#

f

hollow citrus Apr 27, 2023, 5:10 AM

#

i took the first verse from bohemian rhapsody

umbral solar Apr 27, 2023, 5:11 AM

#

undone glade Still chasing down bugs

the anoying thing is that it unloads the model after each use

umbral solar Apr 27, 2023, 5:11 AM

#

hollow citrus i took the first verse from bohemian rhapsody

cool have u tried it with [beat]

hollow citrus Apr 27, 2023, 5:11 AM

#

oh, with a bracket. Not yet, let's see

undone glade Apr 27, 2023, 5:14 AM

#

My model isn't unloading

umbral solar Apr 27, 2023, 5:14 AM

#

i mean after its done creating the audio

#

it chages a bit

#

ang chages from british to american

undone glade Apr 27, 2023, 5:18 AM

#

Yeah, it loses a some consitency

chrome tapir Apr 27, 2023, 5:18 AM

#

@hollow citrus i have some good npz files for rap if you want

#

📎 wreckingball2.npz

#

this one in particular

umbral solar Apr 27, 2023, 5:20 AM

#

undone glade Yeah, it loses a some consitency

i didnt chose a history thing jet how does that work?

undone glade Apr 27, 2023, 5:20 AM

#

the history_prompt?

umbral solar Apr 27, 2023, 5:20 AM

#

yes

undone glade Apr 27, 2023, 5:22 AM

#

It like using a reference image in SD, it guides the model towards a certain voice.

umbral solar Apr 27, 2023, 5:22 AM

#

maby this will help a bit

undone glade Apr 27, 2023, 5:23 AM

#

It does, it constrains the voice to a range, lowering the fine_temp control I've found contrains it even more

hollow citrus Apr 27, 2023, 5:24 AM

#

this is the one with [beat]

chrome tapir Apr 27, 2023, 5:25 AM

#

try beat in asterix and parenthesis too

#

both seem to work better than brackets

hollow citrus Apr 27, 2023, 5:25 AM

#

i did asteriscs

chrome tapir Apr 27, 2023, 5:27 AM

#

make 50 and pick the best 1

#

heh

hollow citrus Apr 27, 2023, 5:27 AM

#

oo okay. I can do that locally actually. I did these with the hf space

chrome tapir Apr 27, 2023, 5:28 AM

#

umbral solar Apr 27, 2023, 5:29 AM

#

chrome tapir

daim the bass

chrome tapir Apr 27, 2023, 5:29 AM

#

the bass is wild

#

i am playing on 8" woofers

#

i have to keep the volume at like 5%

undone glade Apr 27, 2023, 5:29 AM

#

Damn

umbral solar Apr 27, 2023, 5:30 AM

#

maby we shuld save the semantic data in the metadata

#

so it can get recreated

#

and the promt

undone glade Apr 27, 2023, 5:31 AM

#

Yeah, do you have seeds? Because with seeds we would just need the prompt, history, temps and seed

umbral solar Apr 27, 2023, 5:33 AM

#

no but seed would also be nice

umbral solar Apr 27, 2023, 5:33 AM

#

chrome tapir

i used this

#

i didnt say that it shuld make music

#

and it dyes at the end

chrome tapir Apr 27, 2023, 5:34 AM

#

hahah

undone glade Apr 27, 2023, 5:34 AM

#

Yeah, mine ran into the same thing, it got super robotic at the end

chrome tapir Apr 27, 2023, 5:34 AM

#

started out so enthusiastic

#

thats why i just rock with 14 seconds at a time

#

even that is a push

#

for music anyway. i think with text you can just do chunks and stich them together pretty successfully

hollow citrus Apr 27, 2023, 5:36 AM

#

i just have 8gb vram so i can't use the regular bark model

chrome tapir Apr 27, 2023, 5:36 AM

#

damn

#

so close

#

once those 24gb models come out i am gonna be forced to sell an organ

#

probably gonna need more ram soon too for 65B parameter LLMs

hollow citrus Apr 27, 2023, 5:37 AM

#

i'll have to try the sound effect generation

umbral solar Apr 27, 2023, 5:39 AM

#

chrome tapir once those 24gb models come out i am gonna be forced to sell an organ

are they planing on relesing bigger models?

chrome tapir Apr 27, 2023, 5:39 AM

#

someone will eventually

umbral solar Apr 27, 2023, 5:40 AM

#

i mane for this projeckt

chrome tapir Apr 27, 2023, 5:40 AM

#

i know everyone says fine tune = better but i think bigger = better as well

#

just a hunch

umbral solar Apr 27, 2023, 5:40 AM

#

yes idk about finetune

chrome tapir Apr 27, 2023, 5:41 AM

#

big model + lora seems to be a really good combo in image creation

#

so maybe it will be similar for audio

#

umbral solar Apr 27, 2023, 5:44 AM

#

damm thats good

#

but not consitant

chrome tapir Apr 27, 2023, 5:45 AM

#

if you listen to enough suno you will start to hear AI when normal people speak

umbral solar Apr 27, 2023, 5:45 AM

#

chrome tapir if you listen to enough suno you will start to hear AI when normal people speak

yesss

chrome tapir Apr 27, 2023, 5:45 AM

#

i was watching some guy give a speech on stage and it was tripping me out

#

the way he paused and said uhh and stuff sounded very suno like

#

its almost like people have the same mannerisms programmed into their speech

#

just gives you a different perspective i suppose

undone glade Apr 27, 2023, 5:46 AM

#

I think it so jarring because it's a stark remind that we as humans are not unique

chrome tapir Apr 27, 2023, 5:49 AM

#

im gonna try lyrics that have sound device type words in the lyrics

#

that seems to work well

umbral solar Apr 27, 2023, 5:50 AM

#

maby its becaseu of bad compression in audio

#

btw deep floid is open

chrome tapir Apr 27, 2023, 5:55 AM

#

i was just looking at that

undone glade Apr 27, 2023, 5:55 AM

#

Do not use en_speaker_5 for long texts, it does not work well

chrome tapir Apr 27, 2023, 5:55 AM

#

ill wait for someone to make it into a gradio

chrome tapir Apr 27, 2023, 6:15 AM

#

on a roll now

#

another clean beat

umbral solar Apr 27, 2023, 6:21 AM

#

chrome tapir another clean beat

what is you promt for that?

chrome tapir Apr 27, 2023, 6:24 AM

#

(dance beat) Shake it off, I shake it off, I, I, I shake it off, I shake it off, heartbreakers gonna break, break, break, and the fakers gonna fake, fake, fake, baby, I'm just gonna shake, shake, shake

#

wasnt it obvious?

#

📎 cleanbeat2.npz

#

might mess with that one later

umbral solar Apr 27, 2023, 6:28 AM

#

lets see what happens

undone glade Apr 27, 2023, 6:29 AM

#

I'm very curious

umbral solar Apr 27, 2023, 6:29 AM

#

noo my headfones are empty

chrome tapir Apr 27, 2023, 6:30 AM

#

gotta love when you get hit with the ear piercing scream straight from second 0.0

undone glade Apr 27, 2023, 6:30 AM

#

Got some good consistency

chrome tapir Apr 27, 2023, 6:31 AM

#

wtf just got back to back screams

#

sheesh what are they doin to these poor AI agents

umbral solar Apr 27, 2023, 6:35 AM

#

undone glade Got some good consistency

nice did u chage the code or was it luck

chrome tapir Apr 27, 2023, 6:35 AM

#

one day we will have audio books where every character has a unique voice

undone glade Apr 27, 2023, 6:35 AM

#

I think the main components are the seeding and lowering the temps

umbral solar Apr 27, 2023, 6:37 AM

#

undone glade Apr 27, 2023, 6:37 AM

#

What am I listening to? is this that giant filled array?

umbral solar Apr 27, 2023, 6:38 AM

#

yes from 1 to 10.000

undone glade Apr 27, 2023, 6:38 AM

#

So if someone really wanted to they could map the semantic scape

umbral solar Apr 27, 2023, 6:39 AM

#

yes

#

if you want to lern a new instruent with 10.000 noats

undone glade Apr 27, 2023, 6:40 AM

#

I'm thinking more if you could filter which ones were the most similar to speech you could limit what tokens are allowed through

umbral solar Apr 27, 2023, 6:41 AM

#

i think it would also be interesting to try to convert audio to tokens

#

you coul also try to find words that sound got and try to stick them together by hand

undone glade Apr 27, 2023, 6:43 AM

#

I think there is an instrument like that

hollow citrus Apr 27, 2023, 6:55 AM

#

those who use the cli tool, what temperature thingies for text temp and waveform temp do you guys use?

undone glade Apr 27, 2023, 6:55 AM

#

I use .7 for text and .6 for waveform

#

I ran into an issue with pre stitching the semantics, if the text is too long it eats all the vram generating the audio

#

Need to find the limit and split semantic array before concatenate

umbral solar Apr 27, 2023, 7:29 AM

#

undone glade Apr 27, 2023, 7:36 AM

#

Very nice

umbral solar Apr 27, 2023, 7:36 AM

#

this 𓀀 𓀁 𓀂 𓀃 𓀄 𓀅 𓀆 𓀇 𓀈 𓀉 𓀊 𓀋 𓀌 𓀍 𓀎 𓀏 𓀐 𓀑 𓀒 𓀓 𓀔 𓀕 𓀖 𓀗 𓀘 𓀙 𓀚 𓀛 𓀜 𓀝 𓀞 𓀟 𓀠 𓀡 𓀢 𓀣 𓀤 𓀥 is this

#

how

undone glade Apr 27, 2023, 7:37 AM

#

weird

umbral solar Apr 27, 2023, 7:59 AM

#

i nuticed when its longer quiet the chance of chaging the voice is higher

undone glade Apr 27, 2023, 7:59 AM

#

I'm running a really long test right now, hopefully it goes well

#

I forgot to concatenate it, I got 14 seconds of audio for a 30 minute generation. Lol

inland coral Apr 27, 2023, 8:20 AM

#

Just learned about this tool, going to use it for a new vegas mod

umbral solar Apr 27, 2023, 8:39 AM

#

nice

inland coral Apr 27, 2023, 8:47 AM

#

text_prompt = """
[raspy]NCR taxess? Man! [clears throat] I say screw the NCR!
Westside Radio baby, let freedom Ring!
"""
audio_array = generate_audio(text_prompt)
Audio(audio_array, rate=SAMPLE_RATE)

#

this one is hilarious

undone glade Apr 27, 2023, 8:48 AM

#

Look at that consistency

inland coral Apr 27, 2023, 8:48 AM

#

this is the best one after over 40 generations

undone glade Apr 27, 2023, 8:51 AM

#

That's really good

inland coral Apr 27, 2023, 8:55 AM

#

2nd attempt for this one wild asf

chrome tapir Apr 27, 2023, 9:08 AM

#

inland coral Apr 27, 2023, 9:12 AM

#

chrome tapir Apr 27, 2023, 9:12 AM

#

nice dynamic range on this one

#

as opposed to the usual LOUD AF

chrome tapir Apr 27, 2023, 9:13 AM

#

inland coral

new vegas pretty good game? i always hear people talk it up

#

what was i playin those days i wonder. maybe just cause 2, l4d2 prob

inland coral Apr 27, 2023, 9:16 AM

#

its my favorite game of all time

#

#

chrome tapir Apr 27, 2023, 9:22 AM

#

any game that you can mod instantly makes it twice as good

inland coral Apr 27, 2023, 9:25 AM

#

yeah im gonna make a radio mod using these voice samples to massively increase immersion

steel sky Apr 27, 2023, 12:38 PM

#

undone glade I think I may have cracked the consistency and cloning issue First I lock the m...

That's pretty sick

keen fable Apr 27, 2023, 12:43 PM

#

have the macOS issues been fixed

#

specifically running it with mps rather than cuda

fallen rapids Apr 27, 2023, 1:02 PM

#

Stable Diffusion + Suno for storytelling. Can't wait for text-to-video to mature as well.

gray portal Apr 27, 2023, 3:47 PM

#

chrome tapir

how to use this file

chrome tapir Apr 27, 2023, 4:26 PM

#

put it with your other voice files

wild crater Apr 27, 2023, 4:49 PM

#

chrome tapir put it with your other voice files

How do you do that with the google clown

#

colab*

#

lmao google clown

mellow dagger Apr 27, 2023, 5:41 PM

#

I'm loving this stuff

blissful pulsar Apr 27, 2023, 5:42 PM

#

How do you force female voice ? It don't work full time with "WOMAN :"

inland coral Apr 27, 2023, 7:13 PM

#

chrome tapir Apr 27, 2023, 7:53 PM

#

wild crater How do you do that with the google clown

not sure i use miniconda3

silver valve Apr 27, 2023, 9:32 PM

#

south seal Apr 27, 2023, 10:27 PM

#

I mean emotions not all there but trying quotes from Her (2013) is funny

worldly tree Apr 27, 2023, 10:29 PM

#

south seal I mean emotions not all there but trying quotes from *Her* (2013) is funny

can u guys turkish language foır me please

#

see its working fine or not

#

and can we clone voice in turkish language?

south seal Apr 27, 2023, 10:33 PM

#

worldly tree can u guys turkish language foır me please

I'm just starting out with it so not really sure yet

worldly tree Apr 27, 2023, 10:34 PM

#

south seal I'm just starting out with it so not really sure yet

good luck, and please try custom turkish voice for me if its working good, I'll try to learn fast, all other working bad, if its this work good, I'll be pro onhere

south seal Apr 27, 2023, 10:34 PM

#

You can play around with the Google Collaboratory page

#

https://colab.research.google.com/drive/1eJfA2XUa-mXwdMy7DoYKVYHI1iTd9Vkt?usp=sharing#scrollTo=t9Vlr3RRt6B9

Google Colaboratory

#

You could try doing the turkish voice there

#

worldly tree Apr 27, 2023, 10:40 PM

#

south seal You can play around with the Google Collaboratory page

https://www.youtube.com/shorts/kRWSCRjHvyg for now im using elevenlabs, but no turkish language so I made tentaction video

YouTube

Baran Baran

XXXTentaction #shorts #short #shortvideo #shortsvideo

This video made by only for motivation purpose.

▶ Play video

#

its not right place to share because its suno channel

#

sorry for that, im looking for turkish support program

worldly tree Apr 27, 2023, 10:41 PM

#

south seal https://colab.research.google.com/drive/1eJfA2XUa-mXwdMy7DoYKVYHI1iTd9Vkt?usp=sh...

there is no turkish oncollab? or there is?

south seal Apr 27, 2023, 10:43 PM

#

I think if you paste in Turkish text it should work alright? But you might have to use the history prompt for a turkish speaker i.e. here

#

Again I'm not really sure I only just started using this model today

undone glade Apr 27, 2023, 10:50 PM

#

This was my attempt with turkish using the history_prompt "tr_speaker_1", I have no idea how accurate it is since I don't speak turkish

violet narwhal Apr 28, 2023, 12:22 AM

#

Not bad.

violet narwhal Apr 28, 2023, 12:49 AM

#

My short guide to clone voice on local machine is here #🪦┃getting-started message

prime night Apr 28, 2023, 1:18 AM

#

rugged nymph Apr 28, 2023, 1:25 AM

#

worldly tree good luck, and please try custom turkish voice for me if its working good, I'll ...

selam bunu kullanarak cli dan --history_prompt "en_speaker_1" ekleyerek 9 tane var üretebilirsin ayrıca history prompt vermeden üretilen tüm sesleride kaydediyor onlarıda çağırabilirsin
https://github.com/JonathanFly/bark

GitHub

GitHub - JonathanFly/bark: 🚀 BARK INFINITY 🎶 Power Up The Bark Text...

🚀 BARK INFINITY 🎶 Power Up The Bark Text-prompted Generative Audio Model - GitHub - JonathanFly/bark: 🚀 BARK INFINITY 🎶 Power Up The Bark Text-prompted Generative Audio Model

violet narwhal Apr 28, 2023, 1:29 AM

#

chrome tapir Apr 28, 2023, 2:09 AM

#

not giving up on that friday banger haha

#

ill try too since its almost friday

chrome tapir Apr 28, 2023, 3:05 AM

#

not friday but a cool music note

#

FRIDAY's child is full of woe, but I know how the story goes, break the chain, I'll break the mold, FRIDAY's child has a heart of gold, yeah, a heart of gold

#

FRIDAY, I'm in love, I don't care if Monday's blue, Tuesday's gray and Wednesday too, Thursday, I don't care about you, it's FRIDAY, I'm in love

#

it made all 3 of those in a row

#

very cool

#

oh just got a nice beat too

#

damn he killed that 1

#

(dance beat) Thank God it's FRIDAY night, and I just-just-just-just-juuuuuuust got paid, money, money, money, money, yeah, just got paid, FRIDAY night, party hoppin', feelin' right, booties shakin', all around

#

@violet narwhal

simple bison Apr 28, 2023, 3:44 AM

#

chrome tapir (dance beat) Thank God it's FRIDAY night, and I just-just-just-just-juuuuuuust g...

[singing] ♪ [dance beat] Thank God it's FRIDAY night, and I just-just-just-just-juuuuuuust got paid, money, money, money, money, yeah, just got paid, FRIDAY night, party hoppin', feelin' right, shakin', all around ♪ [singing] ♪ [dance beat]

chrome tapir Apr 28, 2023, 3:49 AM

#

wow theres like 3 or 4 beats in there

#

thats a good prompt im gonna steal it

simple bison Apr 28, 2023, 3:49 AM

#

oops i had an extra music note in there

chrome tapir Apr 28, 2023, 3:49 AM

#

the AI knows what to do with it

simple bison Apr 28, 2023, 3:51 AM

#

this guy doesn't understand singing...

chrome tapir Apr 28, 2023, 3:52 AM

#

yeah i give them credit for trying

#

REEMIX

simple bison Apr 28, 2023, 3:54 AM

#

[singing fast] ♪ [dance beat] we going to walmart. we going to walmart. we going to wally wally wally wally wally wally world wally wally wally wally wally wally world. basket basket basket basket [singing fast] ♪ [dance beat]

#

it improvised some

chrome tapir Apr 28, 2023, 3:55 AM

#

ok switching to the musical note npz lets see if that works

chrome tapir Apr 28, 2023, 3:56 AM

#

simple bison [singing fast] ♪ [dance beat] we going to walmart. we going to walmart. we goi...

damn this is catchy

simple bison Apr 28, 2023, 3:58 AM

#

chrome tapir damn this is catchy

it's from a real song, which is funny

chrome tapir Apr 28, 2023, 3:58 AM

#

ai remix

#

rock beat has a... different effect than dance beat

#

gonna try the devils advocate

#

and dark knight

#

haha this guy is legit pissed i think

#

simple bison Apr 28, 2023, 4:29 AM

#

evil laugh of insanity, what are you? a pickle man?

#

dude sounds drunk

#

starts out sounding like the game Facade

#

chrome tapir Apr 28, 2023, 4:33 AM

#

this is quality content

#

that laff is wild

#

i had a drunk kareoke guy yesterday

simple bison Apr 28, 2023, 4:34 AM

#

funny how it decided to say "genesis" for no reason.

#

i did something along the lines of
" [farts] farts [farts] farts [farts] farts [farts] farts [farts] farts [farts] farts [farts] farts" and it made something that sounds like a phone ringtone

chrome tapir Apr 28, 2023, 4:38 AM

#

sounds like wesley willis

simple bison Apr 28, 2023, 4:38 AM

#

chrome tapir sounds like wesley willis

haha, he sounds very distracted, or singing along with headphones

#

"[laughter] [laughs] [sighs] [music] ♪ [gasps] [clears throat]— or ... for hesitations, capitalization for emphasis of a word"
unrelated output

chrome tapir Apr 28, 2023, 4:41 AM

#

simple bison Apr 28, 2023, 4:42 AM

#

[laughter] [laughs] [sighs] [music] ♪ [gasps] [clears throat]— or ... for hesitations, capitalization for emphasis of a word
more chaos

#

chrome tapir Apr 28, 2023, 4:44 AM

#

chatgpt has some ideas

simple bison Apr 28, 2023, 4:44 AM

#

outputs like a misconfigured Vicuna model

#

chrome tapir Apr 28, 2023, 4:45 AM

#

i havent messed with vicuna only llama

#

alpaca

simple bison Apr 28, 2023, 4:45 AM

#

try the prompt i'm using with Suno Bark TTS
[laughter] [laughs] [sighs] [music] ♪ [gasps] [clears throat]— or ... for hesitations, capitalization for emphasis of a word

#

that prompt,
this output:
"the the building of the book... the building of the book... the building of the book... and the building of the book and all of the greece sitting. the victim of the mosely."