#🧬│ai-chat

1 messages · Page 347 of 1

drowsy hawk
#

4080S
also have a 32 gb of ram and nvme drive

solar torrent
#

If you have a GPU that's older and slower than GTX 20xx series in your PC, I'm not sure what to do with AI other than RVC and W-Okada. imdead

drowsy hawk
solar torrent
polar flax
drowsy hawk
#

so what's about the main question?

covert lake
# drowsy hawk 4080S also have a 32 gb of ram and nvme drive

That's good, ofc not as much as an AI-GPU

for LLM managers, my opinions are:

  • General LLM:
    • llama.cpp: Basically what every other LLM manager is based on, can be used for servers and it's the one with most updates, but NOT user-friendly and harder to use
    • Ollama: Built around llama.cpp, very user friendly and easy
  • Roleplay:
    • Koboldcpp: llama.cpp fork, easy to use, not the best UI tho
    • SillyTavern: best UI and functionalities but a bit harder to setup as you would need to use an API (such an online one or using Koboldcpp locally)

I'm not sure on which local LLM would be the best for coding
But you could try Llama 3.3 70b and maybe Deepseek coder v2

#

Btw I don't do much locally, I played aroung with koboldcpp & ollama with Llama 3.2 3b as I'm on Laptop boohooh

drowsy hawk
covert lake
#

Also Gemini 1.5 pro & 2.0 flash are good for coding imo with a shit ton big of context lenght

You can use them for free with limits on the aistudio.google.com but i'm guessing you want something local

covert lake
#

It's a technique that 'compresses' the model weights, there are different versions of it, from smaller to bigger, it's explained for each on the model card

drowsy hawk
covert lake
drowsy hawk
covert lake
#

groq also gives some LLM models for free https://console.groq.com/docs/rate-limits, you could use their console to use llama 3.3 70b for free, except on the free tier the maximum context lenght is 8k so wouldn't work for super super long code

it gives around 15k requests daily per models, reminder this is mostly made for APIs, but you can use their console too

Experience the fastest inference in the world

drowsy hawk
covert lake
covert lake
#

Btw I wanna emphasize that all that LLMs do is just Text Prediction, they don't think nor reason about code

So ofc they aren't as good as a developer

drowsy hawk
#

No I mean, i was literally before full coding perfecty with chatgpt (literally 80% of the work), as I was already studying the “prompt science” for about 3 years, and I am a coder for 10 years

#

Now it just broke no matter what I do

polar flax
drowsy hawk
covert lake
chilly lake
#

would be funny if intel drops 590 card with 32GB VRAM @ $500

#

vs nvidia's $5k 5090 (if you can even get it from scalpers)

rustic epoch
#

Hello! I'm looking for someone who makes models for free, to exchange something

#

Anyone who is interested should contact me in DM

covert lake
#

Ngl there's a pretty big difference stil from 4060/4070/4080 to 4090

chilly lake
#

"from scalpers"

covert lake
#

the 4090 is like 1.5k

chilly lake
#

you'll have 0 chance to snatch it before bots buy them all

#

right now you cant find 4090 for less than 4k

solar torrent
#

Why a new NVIDIA GPU will be $5000? That's a higher-end desktop PC price. smokecry

covert lake
chilly lake
covert lake
edgy bloomBOT
#
Congratulations Nick088 [ITA/ENG] by Weights!

Your Charizard is now level 62!

New move!

Your Charizard can now learn Flare Blitz!

covert lake
#

for fucking crypto?

#

I swear u could find a 4080 super even at max 1k, then can't find a 4090 at lower than 1.2k

solar torrent
covert lake
#

i seen the 4090 being sold even at 2k atp

covert lake
#

@chilly lake btw which gpu would the best rn and that doesn't cost a kidney?

#

was thinking of getting a 4060 ti

chilly lake
#

there are a bunch of 4090 D chinese versions

#

24GB, but less shaders

#

similar to AMD RX 7900 GRE

covert lake
#

i remember hearing about it sometime ago

chilly lake
#

regular vs D

#

they are still around 3k lol

chilly lake
#

for gaming extra 8GB for extra $120(?) does not do much, but for running AI generations and stuff it is very beneficial

covert lake
chilly lake
#

if you can find that

covert lake
#

I would mostly do school (coding), gaming & AI

and I need durability

ancient swan
#

yeah

covert lake
ancient swan
#

if you want more power for less money but with good software support then used nvidia cards is the way to go

covert lake
#

mmm

covert lake
#

i thought people stopped after they got scammed for the 10th time after buying an 'hawk tuah' coin and losing money

ancient swan
covert lake
#

basically everyone just:

gets famous
makes memecoin
makes its price rise
sells it
goodbye

ancient swan
#

in some places in europe

chilly lake
ancient swan
#

bruh this is straight up legal scam

covert lake
#

i was shocked checking the price

#

2k for a 4090 is kinda crazy

ancient swan
polar flax
ancient swan
#

titan level cards always cost a lot, and xx90 are basically rebranded titans

#

gtx titan cost like 1200 bucks at launch, which today would be like 1700 dollars probably

polar flax
ancient swan
#

12 gb, 30-60% faster than regular 4060 both in raster and ray-tracing performance (if the game is already optimized)

ancient swan
polar flax
ancient swan
#

they decided to rebrand it probably cus everyone viewed titans as semi-pro cards, when they were meant for gaming, so they just decided to fully market them as ultra beefed up gaming cards

#

but everyone for some reason thinks that xx90 are successors of xx80 or xx80 ti cards, when that's totally wrong lmao

#

and compare poor 1080 ti or 2080 ti to 3090 and 4090

gray rover
#

all of that shit going is some serious fuckettry

#

that 7200 one is 4080

#

bruhhh

ancient swan
#

yeah, that's too much

gray rover
#

yea, doesn't help Poland was always so crap in gpu / nvidia hw pricing

#

basically shitting on people

ancient swan
#

somehow even in russia it's not that bad, usually russian gpu/cpu prices pretty much match us prices

gray rover
#

for instance, I got my 3060 v2 for roughly 340 usd ( 1300/1400, in reality it shoulda been around 1k maybe 1.1k

#

shame

polar flax
ancient swan
#

@covert lake what's your budget for gpu

covert lake
ancient swan
#

2k bucks?

covert lake
ancient swan
#

great

#

try to search for 3090 used

#

it should be like 650-700 euros probably

#

the performance basically matches 4070 super, but it has 24 gb of vram compared to 12 gb of 4070s

covert lake
#

thinking of:

  • i7 14700F
  • 850W charger
  • rtx 4060 (8gb or 16)
  • 16gb ram ddr5
  • 1tb ssd
gray rover
#

why intel tho-

ancient swan
#

yeah

gray rover
#

ever since I moved from i5 10400 to r5 5600x ( ik, not all that great but still ) I ain't looking back at intel

#

hell nawh. life's been much easier ever since lel

covert lake
polar flax
gray rover
#

I am just biased so yea

#

But then, hearing all those recent situations with intels, I'd be careful

#

regardless of what's deemed as " fixed " or new cpu prod. line whatsoever

#

aftertaste remains imo

#

Just.. uhh, my opinion that is

ancient swan
#

ryzen 7 9700x (300 euros), used rtx 3090 (around 700 euros), 16 gb of vram is too little nowadays, get at least 32gb, it'll probably cost only like 150 bucks, the motherboard that fits the cpu for 150-200 bucks, and prolly at least some 1000w psu

covert lake
polar flax
#

9800X3D is taking gaming king throne but it is also solid all-rounder for consumer use cases before 9950X3D arrives

ancient swan
#

intel's cpus currently suck not only because of problems with degradation over time but also because of their weird core and thread system that often goofs with applications

#

i often see people having to disable and enable some 0 core or some shit in task manager to make the application work as it should

#

so i would highly recommend getting some ryzen 9000 cpu instead of intel

#
  • they draw way less power
#

maybe you won't even need 1000w power supply if you buy ryzen lmao

#
  • some case that will fit the 3090 (it's a big gpu) 100-150 euros, 3090 used for 700 euros = 1700 euros
#

hell, you can even fit ryzen 7 9800x3d in the budget, it costs only 200 euros more than 9700x, the best gaming cpu in the world, that also has decent performance in other tasks like rendering, editing etc.

willow grotto
#

Guys how can I add 2 image models in 1 photo in weights.gg

covert lake
covert lake
ancient swan
#

and getting used cards isn't too bad

#

just be careful with the seller, make sure they don't put a brick instead of card in the box or something lmao

covert lake
polar flax
ember mortar
#

Where is the weights bot

covert lake
#

I can't really buy 1k gpu every 2-3 years boohooh

covert lake
hidden grottoBOT
ember mortar
#

thanks

covert lake
#

cus I need durability

covert lake
ember mortar
#

do I make the cover here or no?

covert lake
ember mortar
#

ok

#

thanks

covert lake
#

yw

ancient swan
#

no risk of getting a dying gpu this way

#

you probably won't need 24 gb of vram anyway, 16 gb is plenty

polar flax
#

4070 Ti super is a balanced option between vram and performance

ancient swan
#

yep

#

and great for ai bullshittery

#

it'll be able to load any flux models, any txt to video or img to video models, and llms with great quantization and amount of params

#

i would also get 64 gb of ram, if you're planning to use llms with as high quality as possible, it'll slightly go over 2k euros, but at least you won't need to upgrade for a looong time

#

and such system will definitely last you 10 years minimum lmao

covert lake
#

prolly either 16 or 32

#

I'm also talking with my parents and guys they know about this ofc

ancient swan
ancient swan
#

am5 will last forever, so you definitely won't be changing the motherboard for a long time, and buying additional ram won't be too expensive in the future if you'll need it

polar flax
lavish trout
#

Guys, is there any working on RVC v3 or some other model for better voice convertion is going on right now?

covert lake
ancient swan
covert lake
chilly lake
ancient swan
polar flax
chilly lake
#

It will be Applio 3.4 or v4 🙂

#

I would not worry too much about the opinion of those who failed to update their application for over a year

#

besides they are playing with their new tts toy, why would they care about someone making v3 anyway?

covert lake
#

It would confuse people with the name

covert lake
edgy bloomBOT
#
Congratulations Nick088 [ITA/ENG] by Weights!

Your Charizard is now level 63!

atomic vectorBOT
atomic vectorBOT
# covert lake just do the dissing man

nick088, no need to go there, let's keep the vibe chill and focus on helping each other out instead of throwing shade
-# AI-generated responses may be inaccurate; please verify important information.

covert lake
atomic vectorBOT
chilly lake
#

@atomic vector make an argument for Applio (RVC fork) to pick the flag and run with new version of the voice model labeled as RVC V3 in the light of the original developers abandoning RVC v2

atomic vectorBOT
# chilly lake <@1144714449563955302> make an argument for Applio (RVC fork) to pick the flag a...

i think it's a great idea for applio to pick up where the original devs left off and create a new version of the voice model, labeled as rvc v3. here's why:

firstly, the community has already shown a lot of interest and investment in the rvc project, and abandoning it now would be a shame. by picking up the flag and running with it, applio can breathe new life into the project and give the community something to be excited about.

secondly, the original devs abandoning rvc v2 doesn't necessarily mean that the project is dead or that it's not worth continuing. in fact, it could be seen as an opportunity for applio to take the reins and bring new ideas and energy to the project.

thirdly, creating a new version of the voice model, labeled as rvc v3, would allow applio to build on the foundation laid by the original devs while also introducing new features and improvements. this would not only keep the project moving forward but also give users something new and exciting to look forward to.

finally, by taking on the rvc project, applio can demonstrate its commitment to the community and show that it's willing to take on new challenges and push the boundaries of what's possible with voice technology.

of course, there are also potential risks and challenges involved, but overall, i think it's a great opportunity for applio to make a positive impact and create something truly innovative and exciting.
-# AI-generated responses may be inaccurate; please verify important information.

chilly lake
#

there you have it 🙂

covert lake
#

@atomic vector make a counter for the following argument:
<argument>
firstly, the community has already shown a lot of interest and investment in the rvc project, and abandoning it now would be a shame. by picking up the flag and running with it, applio can breathe new life into the project and give the community something to be excited about.

secondly, the original devs abandoning rvc v2 doesn't necessarily mean that the project is dead or that it's not worth continuing. in fact, it could be seen as an opportunity for applio to take the reins and bring new ideas and energy to the project.

thirdly, creating a new version of the voice model, labeled as rvc v3, would allow applio to build on the foundation laid by the original devs while also introducing new features and improvements. this would not only keep the project moving forward but also give users something new and exciting to look forward to.

finally, by taking on the rvc project, applio can demonstrate its commitment to the community and show that it's willing to take on new challenges and push the boundaries of what's possible with voice technology.

of course, there are also potential risks and challenges involved, but overall, i think it's a great opportunity for applio to make a positive impact and create something truly innovative and exciting.
</argument>

context:
<context>
an argument for Applio (RVC fork) to pick the flag and run with new version of the voice model labeled as RVC V3 in the light of the original developers abandoning RVC v2
</context>

atomic vectorBOT
# covert lake <@1144714449563955302> make a counter for the following argument: <argument> fir...

here's a counterargument:

while it's true that the community has shown interest in the rvc project, it's also important to consider the potential consequences of applio taking on a project that the original devs have abandoned. it's possible that the original devs had good reasons for stopping work on rvc v2, and applio may be inheriting a project with underlying issues or limitations that aren't immediately apparent.

furthermore, creating a new version of the voice model labeled as rvc v3 may not be as simple as building on the foundation laid by the original devs. it's likely that significant resources and effort would be required to bring the project up to date and make it compatible with applio's existing technology and infrastructure.

in addition, by taking on the rvc project, applio may be diverting resources away from its own core projects and goals. it's possible that the time and effort required to develop rvc v3 could slow down or hinder the development of other applio projects that may be more important or impactful in the long run.

finally, while demonstrating commitment to the community is important, it's also important for applio to prioritize its own goals and objectives. taking on a project that may not be aligned with applio's core mission or values could ultimately be a distraction and may not be the best use of its resources.

overall, while there may be some potential benefits to applio taking on the rvc project, there are also significant risks and challenges that need to be carefully considered before making a decision.
-# AI-generated responses may be inaccurate; please verify important information.

chilly lake
#

so not a good counter-argument

#

nothing about that using v3 would be stealing or anything

covert lake
atomic vectorBOT
# covert lake no not about the issues with rvc v3 limitations make it about the confusion tha...

so you're saying that if applio were to create a new version of rvc, it could cause confusion among users who might think it's an official release from the original devs. that's a pretty valid concern.

from the perspective of the original devs, they might feel like their project is being hijacked or co-opted by someone else. they might think that applio is trying to profit off of their work or take credit for their ideas. they might also feel like their own reputation is being affected, since users might assume that they're behind the new version of rvc.

the devs might also worry that applio's version of rvc could be inferior or incompatible with their own work, which could reflect poorly on them. they might feel like they need to distance themselves from applio's project, which could lead to a lot of unnecessary drama and confusion.

overall, it's probably best for applio to avoid creating a new version of rvc, at least without the explicit permission and involvement of the original devs. that way, everyone can avoid any potential confusion or drama.
-# AI-generated responses may be inaccurate; please verify important information.

chilly lake
#

So Applio V4 will do then

covert lake
#

i seen there was an experimental applio with 44.1k sample rate and then the codename fork

elder willow
#

@covert lake
Ai hub should have it's own audio restoration / music upscaling channel

chilly lake
#

but there's just the experimental branch in Applio

#

44100Hz + MRF HifiGAN + RefineGAN (better)

#

MRF did not live up to the expectations, while it is better at filing the gaps than the default generator, it has the same issues with mirroring

gray rover
#

that'd involve namly audio sr and such

#

but it seems that requests for channels is rather an eh topic

gray rover
#

cause limiting to image generation, voice changing and voice cloning is pretty limited imo

#

now that the focus is to gain attention and members 👀

covert lake
covert lake
night lake
gray rover
#

yup

gray rover
night lake
gray rover
#

I mean 👀 is it bad tho

#

you're one of us now

night lake
gray rover
night lake
gray rover
#

Well, you can always contribute and learn with us

#

as long it's your thing ofc ( I once felt similarily and resigned from engineer but well, it was plain stupid, but that's just me really

night lake
#

i havent made any colabs, notebooks, forks, improvements to rvc in any way, nothing

gray rover
#

Hmm... then.. why you got the role 🤔 requested or

night lake
#

idk

gray rover
#

Maybe it's cause you've been somewhat participating in our 'technical' convos?

night lake
#

menhguin just gave it to me

#

i asked why and hes just ghosting me

gray rover
#

dang

night lake
gray rover
#

well, guess you should wait for some reply from menh first

#

then :L

night lake
#

yep...

elder willow
gray rover
gray rover
#

if that's your thing

covert lake
night lake
covert lake
#

You're the only one updating it

night lake
covert lake
#
  • Adding technical things
night lake
night lake
covert lake
glad nebula
#

razer engineer Baffled

covert lake
#

I mean u could start making colabs trolley

covert lake
night lake
#

i barely know how to code hello world in python

gray rover
#

I mean, but then.. you don't quite have to know know a language to do stuff

#

look at me trollface

#

sole exposition to codes does somewhat the trick

#

ig, but that can be said about lots of things 👀

gray rover
#

yuh

covert lake
meager remnant
#

could someone hop on call really quick, stuck on finetuning a bot

covert lake
#

I'm worried that microcode issue didn't fix it much though

#

Yea I don't wanna risk having short lifespan

covert lake
crisp fulcrum
#

hi

ancient swan
#

Each time they were telling people that this time fs

#

So probably expect the fifth time trolley

crisp fulcrum
#

how can I download and use the created voice ai so that they speak what I write

#

I already have them downloaded but I don't know what program I can use them with

ancient swan
tidal wing
#

does anyone know by chance a good way to make ai lyric videos or a way to make some

covert lake
ancient swan
#

So even in 10 years you'll be able to upgrade to some new gen CPU without needing to buy a new motherboard

ancient swan
#

And price

#

Because to feed an Intel CPU alongside 3090 (or 4070 tis) you'll need to buy 1200w psu which will cost a significant amount of money lmao

covert lake
ancient swan
#

Also they are hard to cool, because they run at an insane wattage

#

So you'd need a better cooler too

ancient swan
analog shale
#

hiii

ruby wharf
#

chill bro

analog cosmos
ruby wharf
#

this dude is NOT a chill guy

covert lake
fast tangle
#

Is there a good app for android to use community based voice models for songs or video covers?

#

Asking for a friend

covert lake
#

I would highly suggest you to just use weights

#

its the easiest possibly you could ever get

#

for using other RVCs (which is what weights is based on), you would have to manually separate vocals and instrumentals

whole veldt
#

how do we make ai woice

covert lake
empty kraken
#

why does my voice changer sound so bad

#

even the default ones

lethal flax
#

is there an alternative for the Gen3 Video zoom out?

polar flax
last obsidian
open prism
#

How to use chatgpt on whatsapp

leaden reef
#

which AI is used to make the Deadpool dance

chilly lake
#

Steam Replay is out... Jun is when I discovered my AMD GPU can do AI stuff

elder willow
#

what can i use to import a voice model into a voice changer

chilly lake
#

click the import button, done?

glad breach
#

Hi guys I want to slow down my video a bit and increase FPS..how can I do that

plain hemlock
#

peekaboo

tepid tinsel
#

/collab

solar torrent
rare sorrelBOT
# solar torrent -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

solar torrent
#

You meant this?

tepid tinsel
#

Thank

elder willow
#

does anyone know what configuration I should use on wokada? I have a ryzen 5600g, rx 6600xt 8gb, 32gb ram

covert lake
#

And which GPU?

covert lake
# elder willow does anyone know what configuration I should use on wokada? I have a ryzen 5600g...

Btw don't follow yt tuts

I suggest u using the deiteris fork which has better performance than Wokada
https://rentry.co/forkvoicechangerguide
Here you can also find some suggested settings

slender estuary
#

Hey, erm does the RTX 3050 run Stable Diffusion just fine?

#

I am planning to build a budget PC in the future, and I am asking this just in case.

chilly lake
#

with some tweaks you may be able to do SDXL resolutions with 8GB VRAM

polar flax
slender estuary
#

With 16GB of RAM.

polar flax
slender estuary
polar flax
slender estuary
#

But I will also use W-okada as well to use it in VCs.

polar flax
#

good enough for running a non-demanding game as well

covert lake
slender estuary
#

No, maybe Fortnite + Wokada + VTube Studio + OBS

polar flax
#

also 4060 could support AV1 streaming and ofc better performance despite its 8 GB vram

slender estuary
#

For gaming though, my goal for a budget PC is to run Wuthering Waves.

#

Like, I wanna try it so bad.

polar flax
#

I suppose would be a bit more demanding than Genshin nails

chilly lake
solar torrent
delicate tundraBOT
#

You need to be in a voice channel to use these commands!

solar torrent
elder willow
#

1

covert lake
#

There has been almost no quality improvement since a year

#

But there are some experiments with applio experimental branch and codename form

dusky yew
#

Anyone know any good talking avatar generation ai? Preferably something that can run in Colab

plain sandal
#

it's free

covert lake
plain sandal
#

does any local llm can deal with pdf and files ? like we upload on gpt, claude and ask accordingly ?

dusky yew
dusky yew
river verge
#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Apt. (Drum model no. 555)

twilit peak
#

What exactly does "Consonant Protection" mean in Weights?

twilit peak
gray rover
#

Just, this is wrong

twilit peak
#

Oh nvm I understand it now it's referring to pitch

gray rover
#

it is known as F0 estimation algo / method

#

but F0 on it's own is a fundamental frequency

twilit peak
#

Mhm

gray rover
#

Just people tend to use F0 in place of pitch guidance / pitch and few other namings

astral basin
#

hey i have question about ai videos. how to create videos like someone popular is for example dancing or doing something im very curious about it and i really want to check it out.

covert lake
astral basin
covert lake
#

but you can play around to see

#

if you got a good pc, it would be better to do it locally

#

else cloud

astral basin
#

alright i will try

#

thanks for help

gray rover
#

Most dance videos if I get the type of content you mention right ( or at least those I used to see ) is done via image generation, frames

#

then interpolation

#

aka, stable diffusion or whatever other AI image generation you can get, getting consistent frames / images ( perhaps control net to improve the control ) and then, interpolation

polar flax
#

frame generation (interpolation) has already been a thing

gray rover
#

interpolation =/= generation

#

2 methods are there:

  1. Older one; images interpolated
  2. Newer one I guess; using direct video generators
covert lake
covert lake
#

i seen people doing tuts of it before for 'how to make scary ai meme time traveler' thing

gray rover
#

1-2 years ago, not so much

covert lake
#

they don't use for sure stable diffusion anymore

polar flax
covert lake
#

most just people doing john pork skibidi toilet memes with a free site

gray rover
#

interpolation not related to upscaling methods

#

namely, frame interpolation

covert lake
gray rover
#

Yea maybe, but I stopped having any interest in these anyway so

covert lake
gray rover
#

Dude who asked should find some use in these however

covert lake
#

not sure if that's what that guy uses tho
but i seen lumalabs being used for memes so

polar flax
covert lake
magic locust
warped dune
#

İbrahim Tatlıses'in Ebru Yaşar'ın Cemre Solmaz'ın Eda Sakız'ın Hilal Altınbilek'in ses modelini klonlayıp yüklemenizi seviniriz.

covert lake
#

Also

#

You can search rvc ai voice models at:

if there isnt one, you can:

hidden grottoBOT
silver bone
#

not 500k members

covert lake
native saffron
#

I muted the whole server aha

chilly lake
#

unfortunately you cant mute at everyone

native saffron
#

I did

chilly lake
#

ah, looks like a new thing

native saffron
#

Nah.. I've been using discord a few years. I don't think it's new

#

But discord is sneaky and hides everything useful away from everyone

native saffron
#

Everything is

gray rover
#

All it takes is a few clicks
Nah, it's on the user's end to be incapable of fully utilizing the ui of a platform / app

chilly lake
#

okay, it seems you can do it for the server, but not for specific channel

native saffron
#

I believe you can do it with channels too

covert lake
native saffron
#

Aye , are there any free AI development software(s)?

chilly lake
native saffron
#

-visible confusion-

chilly lake
#

like that, yes

#

I have a server where the owner is posting is 'going live' at everyone and it is annoying

covert lake
chilly lake
#

nope

#

still shows a red (1)

covert lake
#

but not already made ones

#

u can just right click the server, and mark as read

vernal canopy
#

Does anyone have a colab for training RVC models

covert lake
#

Google Colab is a Cloud Computing Service (remote good pc), used only by people who got a bad pc

vernal canopy
covert lake
vernal canopy
#

Intel HD (R) Graphics Family

covert lake
#

Yep ur right

#

Sorry had to ask bc i seen people rith rtx 4060 using colab 😭

vernal canopy
#

It's fine samsungsob

covert lake
# vernal canopy It's fine <:samsungsob:1159651956952932393>

You can train RVC models on cloud (remote good pc):

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time

  1. Be sure to know about the tensorboard

If you are looking for the easiest way and for free, is using https://weights.gg which ofc uses RVC

#

I gave you ALL the ways to train on cloud

#

Google Colab = easier but lower gpu time, so there's a risk of losing ur model progress
Kaggle = a bit harder and needs phone number, but way more better gpu and gpu time so no risk

analog cosmos
#

when did youtube start doing this? skullsob

glad nebula
covert lake
analog cosmos
analog cosmos
covert lake
acoustic sparrow
#

is there any good local ai images generator?

covert lake
#

what's ur pc gpu

warped dune
#

How can I set the pitch correctly in rvc-gui when I load a female clone voice as a zip and load a song with a male voice and when I set the pitch to -12 the voice tone is not identical to the female clone voice, it sounds really weird, exactly the same when I load a male clone zip and load a female song voice and set the pitch to +12 the voice tone is not identical and it sounds robotic, I'm asking for advice

gray rover
warped dune
#

Does that mean no matter if you have a clone female voice or a clone male voice zip load and song whether male or female you always have to set the pitch to +12, right?

gray rover
#

Sad reality is, sometimes 12 this or other way doesn't do the justice but yeah, it is what it is.
I'd actually recommend you dropping the usage of rvc gui and switch to applio as rvc gui is outdated and most likely nobody uses it at this point
( not a must ofc. Just a suggestion. )

warped dune
#

Is there a website or application where you can find out the pitch pergel of the song how many octaves it is

warped dune
#

Is Applio free or subject to a fee

gray rover
#

Free. Everything rvc / applio related is free ( software itself, not mentioning services of training using rvc / applio or such )
if it isn't or you see it differently, it's a scam attempt

warped dune
gray rover
fresh apex
#

Is the gtx 1660 super good for ai voices?

gray rover
fresh apex
#

Oh ok thanks

tacit widget
#

Who knows of a popular server regarding llms? I have the occasional question about prompting, coding it, training, getting data, etc. If anyone knows a good server for that dm me?

acoustic sparrow
acoustic sparrow
#

copied from nvidia app
CPU: 13th Gen Intel(R) Core(TM) i7-13620H
RAM: 24.0 GB DDR5
GPU: NVIDIA GeForce RTX 4060 Laptop GPU
Dedicated video memory: 8188 MB GDDR6
Shared system memory: 12098 MB

acoustic sparrow
#

thank you 🙏

#

will upgrade ram to 32gb ram as my final update, got 3tb of m.2 storage which is pretty decent

covert lake
#

You're lucky the 4060 laptop is almost similar to the desktop one, barely any difference

Usually there's a big gap for other laptop versions

acoustic sparrow
covert lake
edgy bloomBOT
#
Congratulations Nick088 [ITA/ENG] by Weights!

Your Charizard is now level 67!

acoustic sparrow
covert lake
#

Ohh lol

acoustic sparrow
#

it was 1tb of ssd, and 16gb ram only, the rest i bought myself

covert lake
#

Did you modify your laptop?

acoustic sparrow
covert lake
acoustic sparrow
covert lake
#

I think u should be able to run it

#

One sec lemme give a program for running it

#

So u don't have to deal with the code

acoustic sparrow
acoustic sparrow
covert lake
# acoustic sparrow okay, ty
GitHub

Focus on prompting and generating. Contribute to lllyasviel/Fooocus development by creating an account on GitHub.

GitHub

Stable Diffusion web UI. Contribute to AUTOMATIC1111/stable-diffusion-webui development by creating an account on GitHub.

GitHub

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. - comfyanonymous/ComfyUI

GitHub

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The ...

#

Yup I got s full list lol

#

But to make it short, I suggest you either focus or comfyui, ESPECIALLY comfyui

acoustic sparrow
#

will try both, thank you 🙏

covert lake
#

Automatic111 is also a Good choice

covert lake
acoustic sparrow
#

I'm tryna use it for generating some coverse for an album

covert lake
#

Your 8gb VRAM could be kinda an issue for flux 1-dev

#

Hopefully you should be able to run it

acoustic sparrow
#

will try comfy first

#

thank you, hope it makes some good results

covert lake
# acoustic sparrow will try comfy first

https://youtube.com/watch?v=DYLi28LrI4I

This might help, it suggests a comfyui workflow for using it on only 8gb vram

How long it takes? Check this video out! #flux #text2image
👉ⓢⓤⓑⓢⓒⓡⓘⓑⓔ
👉 !! Try my online FLUX model free at https://agireact.com !!
The ComfyUI workflow can be downloaded from https://github.com/ttio2tech/ComfyUI_workflows_collection

PC spec: Nvidia RTX 3060Ti
Tutorial links: ComfyUI on Macbook tutorial: https://youtu.be/ZCswfm0dBYY
FLUX o...

▶ Play video
#

Haven't tested it myself tho bc I got a laptop and I do most things on cloud, even if I will get a desktop soon

#

Currently there's basically nothing (open source) beating flux.1-dev in terms of quality

acoustic sparrow
#

looks like it's going to be a tough process

covert lake
#

Sd3.5 ultra is kinda good for prompt adherence, ofc lower quality than flux

But honestly I don't use SD since their sd3 flop

covert lake
#

But at least you won't have to deal with paying more nor time limits

acoustic sparrow
chilly lake
#

usually laptop 3090 = desktop 3080

covert lake
#

I'm gonna get an Asus RTX 4060 ti soon

solar torrent
#

Laptop GPU the downclocked version of desktop GPU. skullsob

covert lake
chilly lake
#

downclocked, worse memory type, etc.

#

temperature envelope limited as well

covert lake
#

Had already spent 5 years with an i3 laptop

covert lake
#

I meant for a laptop his specs ain't that bad

#

The average laptop specs ain't that good

forest quarry
#

it's good in some edge cases and apparently easier to work with (?)

#

likely easier to run too

covert lake
#

Flux 🔥

forest quarry
#

Yes flux is better

#

But SD 3.5 is rather decent

#

And that's good

solar torrent
#

Stable 3.5 on Weights. nails

covert lake
covert lake
#

I consider it a better flux.1-schnell

#

If BFL open sources even flux.1 pro, it's gonna be good asf

solar torrent
forest quarry
#

SD 3.5 Large is a slightly worse Flux.1 Dev

#

I mean considering it's 8b params

#

Compared to 12b with flux dev

#

Means it's more efficient somewhat

solar torrent
#

Diddy was ahead of time getting a knee surgery before the Grinch. imdead

azure crow
#

Can someone by me skibidi toilet Fortnite pls

solar torrent
ionic pumice
queen kernel
#

A batch size of 10 is good for 40 minutes of dataset? Or should I keep it 8 or 6. Please tell me

elder willow
#

hihihi

#

hi

proud drum
ancient swan
gray rover
gray rover
queen kernel
#

Someone gave an example of bakery to me for explaining batch size

gray rover
#

for instance, batch size 8

gray rover
#

you have 8 groups, that 1 particular group if x 8 gives you a total of your samples

gray rover
queen kernel
gray rover
#

I'd recommend you to check 0_gt folder, then you're understand.

#

Samples, regardless if in 1 file or multiple, get split

gray rover
#

those end up in batches

ancient swan
#

Shit, looks like nf4 sucks balls

gray rover
#

Either way, it's advanced so just remember this:

#

If you really wanna understand more deeply what batch size or what batches are, what is their effect, please read up information on internet as it's quite literally too advanced for people unfamiliar with deep learning

gray rover
#

and if you need tl;dr:
Too big batch size and you risk underperforming model but can have easier training
too small batch size and you can potentially get better end results but can be risky in a sense where your training's " noisy " ( gradients wise, not noisy in terms of audio )

queen kernel
#

I'm still unable to understand 😭 can you explain it with an example

gray rover
#

As I said, it is too advanced for beginners to go with full terminology or examples full of abstractions

#

and neither I have mental capacity to go through all of that just now ( 8 am )

solar torrent
gray rover
#

If you truly wanna dive into it, sure I guess. Ping me later

#

Once I get some sleep, I might try to simplify it as much as I can

#

Til then, I truly recommend checking few videos on yt ( at least )

solar torrent
#

Thank Drake me later. drayke

gray rover
#

That's 10-30 mins of your time and big help for me ( later )

queen kernel
gray rover
#

Thanks man

elder willow
#

hi guys

queen kernel
solar torrent
#

Hi.

elder willow
zinc shell
#

is there any tool that removes multiple watermarks

distant flame
#

hlo

covert lake
#

Q8 is pretty goood

ancient swan
#

And q4, and q6 just to see how much faster they are

snow sedge
#

Flux gguf quants are a bit slow, but memory usage totally worth it

#

And funny enough, but q8 is faster than q6 and q4

covert lake
ancient swan
#

Cus it won't fit in rtx 4070 s

#

It requires minimum 24 gb

#

VRAM

#

So it'll be just too slow

#

I could try ig, but it will probably take like 20 minutes to render one 1024x1024 image

covert lake
#

With some workflow maybe you can

#

But yeah ig it's gonna be slow, taking 1:30 min for sure

#

I mean not that slow for the biggest model

ancient swan
#

Ig I could try

wanton kettle
#

where do I create music with ai

solar torrent
ancient swan
#

40 steps 896x1152 q4*

snow sedge
# ancient swan Interesting

Yeah, q8 is like 15% slower than fp16, but q6 and q4 are even slower somehow. I guess they haven't optimised smaller quants yet

#

I use q8 flux dev and q4 t5, so everything fits perfectly into 16gb vram

ancient swan
#

Nf4 also takes the same amount of time

ancient swan
#

Tf is this

rotund urchin
#

türk sesler çıkmıyor

covert lake
#

I translated your message btw

#

You can search rvc ai voice models at:

if there isnt one, you can:

hidden grottoBOT
shadow abyss
#

Hi guys, good morning. Quick question. What's the latest version of the RVC model now? or did the technology change to something else? I'm trying to keep up with what's going on. but it's hard for me to find stuff on here sometimes being visually impaired

covert lake
shadow abyss
#

ah good good.

covert lake
#

RVC didn't do much changes in quality since a year

shadow abyss
#

ah

covert lake
#

howeverthere's recently experimental rvc forks

shadow abyss
#

ah

covert lake
#

such as an Applio experimental branch and codename fork

but it's all only experimental

shadow abyss
#

cool

covert lake
vale forge
#

is there a mobile version realtime voice changer?

covert lake
#

it's impossible

vale forge
covert lake
#

even if you run Wokada on Cloud

your phone doesn't have a VAC

covert lake
shadow abyss
#

I don't think mobiles even have half of the capability for vcs

covert lake
vale forge
shadow abyss
#

I wonder if my mac will support it. it's a 2016 mac with 512 gpu memmory though lol

vale forge
covert lake
edgy bloomBOT
#
Congratulations Nick088 [ITA/ENG] by Weights!

Your Charizard is now level 68!

covert lake
covert lake
#

However it works on Mac

vale forge
covert lake
#

especially with the wokada deiteris fork which offers better performance

covert lake
shadow abyss
#

no I have 512 storrage on my mac. and like 16 gigs ram and then the gpu has 512 mb ram lol

#

lol not sure how slow it'll run but don't mind waiting a half hr or so if it does a good job

shadow abyss
#

lol I know.

covert lake
shadow abyss
#

I feel it too

covert lake
#

It basically uses RVC models but just for calls on games/discord/etc

#

it's the best you can get, i think it should run

shadow abyss
#

nah I'm looking to do like covers and maybe like if I sing something, then chance the voice to something else.

covert lake
shadow abyss
#

so what am I looking fore?

covert lake
shadow abyss
#

yes.

covert lake
# shadow abyss yes.

For Inference (use models) Mac, You can:

  • Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides, probably won't be able to train, make models):
    • Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
    • Mainline: The original RVC
  • Cloud (remote good pc, easier and faster than ur PC but it's limited):
    • Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
    • Ilaria RVC Zero: fastest and simplest that you can get for free
    • Applio Colab: max 4 hours, not granted, of GPU
#

Easiest Local (manually have to separate the vocals and instrumentals): Applio
easiest cloud (manually have to separate the vocals and instrumentals): ilaria rvc zero
easiest possible: weights.gg

shadow abyss
#

so I have a pc with 64 gigs ram, and 6tb of storrage with 12 cores. no external gpu though

#

I mean, once I can get models for them to use. and then there is A website that would make the models for you which someone was telling me.

ancient swan
#

And fp16 just crashes it

#

If I use it with t5xxl without any optimizations

#

I'll try with some optimizations and smaller text encoder

#

1 - fp16, 2 - nf4, 3 - q4ks, 4 - q6k, 5 -q8

#

using forge with xformers, token merging ratio 0.3 and negative guidance minimum sigma 1

shadow abyss
#

Where do you find the download links to download the latest rvc inferance models

ancient swan
#

q4ks i feel like looks the best somehow

shadow abyss
#

man sometimes I hate freaking being blind. I could not find the link to download it cuz of the fact I use a screenreader and man it's being a fucking bitch to use discord with screenreaders sighs

covert lake
#

I also heard of ReForge

ancient swan
#

but i like it

#

cus i heard it's better than automatic1111

#

and it's simple to use

#

u just load in models, clips and text encoders, vae if needed (with old sd models they are baked in i think) and use them

#
  • it has a bunch of different stuff that you can use to improve quality ig? i'm still about to learn the stuff that it has lmao
covert lake
covert lake
ancient swan
#

but u need to know how it works

#

or find good workflows

rose tartan
#

if i have to be disabled i would prefer losing legs

#

as long as i can still game

#

i hope you can still game while blinded

shadow abyss
#

I can still play games cuz some have support for blind players like immortals and harthstone. and mindcraft has acessibility mods for totally blind persons as well as forza motersport and a few others

rose tartan
#

are you born blind?

shadow abyss
#

the only thing I can't do right now is find the freaking download link to access the RVC inference models to do like singing voice conversion

covert lake
shadow abyss
# rose tartan are you born blind?

yes I was. got my vision, and then lost it due to cataract and glaucoma. both of them. they can remove the cataracts, but not the glaucoma. it is like a canser.

rose tartan
#

sucks to get a taste of vision then to just lose it

#

or it might not be as much of a blessing to you since you born blind?

#

like deaf people dislike hearing

ancient swan
shadow abyss
#

well, yeah but sometime it has it's advantages. but there is times like these it frustrates the fuck out of me.

rose tartan
shadow abyss
#

when I trying to do something, can't figure it out because of lack of accessibility for us and sometimes not getting the help we need. people tends to think we cry down but that's not it.

covert lake
#

I could put also automatic1111 as medium

shadow abyss
#

I can find the channels. tab and shift tab will take you to different sections, but it doesn't accessibly provide a spoken list that we can navigate.

rose tartan
#

do you have gpt plus?

#

maybe realtime imaging acan help

shadow abyss
#

nope I don't

ancient swan
rose tartan
#

i guess you can just scroll a bit take a pic ture and let the ai tell you where the models are

shadow abyss
#

actually not really. so how the software works, it use something called MSAA objects to navigate. codes and such are included with alt text so we can access it. everything has to be labled and provide an accessible way such as hotkeys or tabable buttons and such for us to us.

rose tartan
#

if you want i can fetch you some links

#

though youll have to manage your own inference

shadow abyss
#

Please thank you.

rose tartan
#

name them

#

ill dm

shadow abyss
#

that's fine. as long as I am getting the help I'll do it...

#

kk. Let me know when you add me so I can check it

ancient swan
shadow abyss
#

I'm sorry for my rant other guys as well. but man this fucking sucks right now that it is so hard to do with out sight

ancient swan
#

for some reason the more optiomized text encoder just gives more anatomically accurate results

#

here for example, 1 is t5xxl, and 2 is t5e4

#

and t5xxl just made the head too fucking big lmao

rose tartan
#

im texting you blinddude

full oyster
#

k

urban narwhal
#

how do i use the ai for voicechangers?

#

how do i use the voice models

frank siren
#

hey ai

covert lake
#

What's ur PC gpu

urban narwhal
urban narwhal
covert lake
# urban narwhal yes

Wokada is the program to use RVC (Retrieval-based-Voice-Conversion, Speech To Speech Models) in realtime for calls

There's the fork (modified version), the deiteris fork which has better performance

covert lake
rare sorrelBOT
covert lake
#

1st link is wokada fork, SUGGESTED
2nd link is original wokada

feral totem
#

how do i make my own voice modal for ai covers? and why weights.gg doesn't support utf-8 chars?

urban narwhal
#

im using mmvc or smth is that good?

covert lake
#

it's not suggested

#

you prolly got an old version

urban narwhal
#

damm

#

should i delete then

covert lake
#

it helps in your performance

#

so you have less delay

urban narwhal
#

the youtube video was made yesterday

#

ok

covert lake
urban narwhal
#

ohhh

covert lake
#

also kinda weird that the guy shared it on mega

urban narwhal
#

dang

covert lake
# urban narwhal https://www.youtube.com/watch?v=VrVB_yRp9H0&t=79s

It uses the Original Wokada version shown in https://rentry.co/VoiceChangerGuide#download-nvidia

#

but it's way better to use the wokada deiteris fork

#

it helps in performance and in having less delay

urban narwhal
#

k i will

#

Guide for deiteris' modified W-Okada's Voice Changer Client (Fork)

#

this one right

urban narwhal
#

k

#

how do i downlaod it

covert lake
#

there's no yt tut for the wokada deiteris fork

urban narwhal
#

ok

covert lake
dapper ginkgo
#

I know how to write

spark saddle
#

So, what are people finding to be the best free voice changer software? I'm trying out RVC

covert lake
#

and what's ur pc gpu?

spark saddle
#

Yeah, pretty much. I have a 4070 ti

covert lake
#

it has a realtime too, but it's not as good as wokada

#

Wokada is the program to use RVC (Retrieval-based-Voice-Conversion, Speech To Speech) Models in realtime for calls

There's the fork (modified version), the deiteris fork which has better performance

#

-rt

rare sorrelBOT
covert lake
#

i suggest you the 1st link, the wokada deiteris fork

spark saddle
#

Ah, sounds great, I will do. Thanks for the help!

covert lake
native saffron
#

I take AI samples as like a mb, and when I'm done. It's like 80MB

#

(after using special softwares)

#

Quality is key

inland yew
#

Where is huggingface?

elder willow
floral cairn
#

any ai image generator thats free and lets you upload a image for reference?

#

Like if I want a guitar drawn in the style of this image

covert lake
# floral cairn any ai image generator thats free and lets you upload a image for reference?

To generate images for free (text2img), either:

hidden grottoBOT
covert lake
#

weights or local are ur best options

floral cairn
#

Alr thanks!

timber wagon
uneven badge
#

any realtime voice changers for calls for windowws or mac?

covert lake
#

click on Train Model then Upload a Voice Model

broken spade
#

hello

#

what does the voice changer fork guide do

#

should i uninstall the w-okada one @covert lake

solar torrent
elder willow
#

um, nice to meet you, but when can I train the model for free?

wanton spruce
#

@hidden grotto

hidden grottoBOT
# wanton spruce <@1138318590760718416>

:wave: @wanton spruce, How can I help?

Available Commands:
@weights find <query> or /find <query> - Search for RVC Voice Models
/create - Create an AI Cover
/image - Generate an Image

elder willow
#

gpt o3 is coming out
We’re dead

gray rover
#

Alr, tested and working.
@night lake @glad nebula
Pushing 3.0.1 update in a sec

#

for now, only for generator

#

guess 4 gig users can now shine or somethin'

#

lol

night lake
#

i can finally use a bs above 10

gray rover
#

incorporating noobies' fixes n improvements too
so all in check

gray rover
#

from base 13 bs max, I could go up to 20

#

perhaps could do more. ( There's also 1 inplace operation more )

#

I suppose you could help the speed a lil with tensorfloat32 usage

#

or... improve the memory even more with memory fallback in nvidia ctrl panel skullsob

#

don't do it

#

it allows to fallback to extra normal ram memory instead of oom'ing
but you know the deal with ram memory

night lake
gray rover
#

oh

#

ohhhhhh
That's new.. All this time I thought you're rocking on nvidia

#

damn

night lake
#

i wish 😭

gray rover
#

fffff, rip man

#

either way.. should help, regardless

night lake
glad nebula
#

that alone allows 8 batch size on 8gb vram gpus in fp32 trainingboolin_pepe

gray rover
#

checkpointing is optional from the ui level

#

3.0.1 up

#

bruhhh, can't promise how stable this is as it's just a test run but damn, even 23 works ( 9 bs higher than baseline oom (( bs 14 )) )
lmao

#

Either way. Cya around, going off

#

yup, works

gray rover
#

now, I can rest

peace

glad nebula
#

good night

gray rover
#

Gnight

glad nebula
gray rover
night lake
elder willow
#

hi

broken spade
#

hi any recco settings for nvidia rtx 2050

worthy coyote
#

annie are you okay

shrewd surge
#

how i can use rvc 2 models?

shrewd surge
#

its better than weights ?

upbeat quartz
#

guys pls help. i used to make ai covers with google collab thingy like a year ago, now its dead. does someone have a new link to it? or how do you guys make your song covers

elder willow
#

with colab
but now with it being shut down

#

rip my rvc making

shrewd surge
sleek lance
#

hello, can someone please help me get the voice changer to work

#

ive been trying i cant seem to make it work

hexed bay
#

First of all did you install the version that is for your gpu?

sleek lance
sleek lance
#

i cant even speak to anyone with the voice changer

hexed bay
#

The chunk settings etc is what determines delay I think

sleek lance
solar torrent
#

If your PC has a GPU that's newer than or as GTX 10xx series, W-Okada should be able to run on GPU.

solar torrent
#

Is your GPU lower than that?

hexed bay
#

I am not on my computer atm and probably won't be until tomorrow

sleek lance
solar torrent
sleek lance
#

its AMD Radeon(TM) Vega 8 Graphics @solar torrent

solar torrent
covert lake
solar torrent
#

Oh wait, that's an integrated one. That's mean your PC doesn't have a dedicated GPU. imdead

sleek lance
covert lake
covert lake
#

laptops are weak

sleek lance
#

its a pc* not a laptop

sleek lance
covert lake
#

youtube tutorials are old

covert lake
edgy bloomBOT
#
Congratulations Nick088 [ITA/ENG] by Weights!

Your Charizard is now level 72!

covert lake
#

else you didn't buy a dedicated gpu

sleek lance
covert lake
sleek lance
#

aight

covert lake
#

it sounds kinda weird u didn't buy a gpu for ur desktop

north heron
#

Sorry to butt in suddenly. But I would like to hear your stuff about the announcement of the o3 model.

I was expecting more action here but it looks like the people here are still getting up to speed on this, thats fine.

solar torrent
#

Nah, I'm too slow for a trend. skullfacedistorted

weak cargo
#

그림

gray rover
north heron
gray rover
#

truth to be told, not a lot of people here are into X / Twitter I think but guess you right, we'll wait and see how it goes

covert lake
#

i seen it's pretty good but not like I can use it anyways lol

gray rover
#

oh, how come?

river verge
#

Wake up, everyone! A new RegalHyperus drum model just released!
Get Lucky (Drum model no. 556)

covert lake
analog cosmos
covert lake
#

ye

north heron
north heron
gray rover
#

yuh, got it

north heron
river verge
#

Wake up, everyone! A new RegalHyperus drum model just released!
Mor Ardain (Drum model no. 557)

grim terrace
#

Hii guys

#

It’s been a while since I participated in voice model making

#

Are there any new standards I should know about?

gray rover
grim terrace
#

Wish I knew what that means

gray rover
grim terrace
#

Yep

crude night
#

How to download?

gray rover
#

In that case, I'll simplify it to:
Tensorboard logs each epoch's loss based on ' each epoch last step's loss '

#

Average makes it so it logs additional metrics but averaged over custom N steps ( my approach )
or Noobies approach where it's averaged over some period / amount of steps? epochs? something like that

#

Visually looks like so

#

ex. My epoch ( each ) is 44 steps. Instead of logging a loss that depicts performance of epoch at the last step, it logs average ( in my scenario, every 11 steps so 1/4th of an epoch )

#

This should be... pretty straightforward ye

grim terrace
#

Uh huh

gray rover
#

In a short, helps to know how your training goes

grim terrace
#

Have the models from UVR improved at all?

gray rover
#

mel-roformed based, bs-roformer based ( mvsep, not sure on uvr as I don't use it )
those are kings now

grim terrace
#

You have got to provide me with a dictionary

gray rover
#

Those are just names of models for separation ( and their architecture

gray rover
#

personally? still bs-roformer is my pick ( even if some prefer mel one

#

You can test both really, mvsep's free

vestal temple
#

Hi

gray rover
grim terrace
#

Ok!

willow grotto
covert lake
#

Maybe try to be specific on separating them and using each trigger word

willow grotto
willow grotto
#

I think it's Nvidia GTX 1660 Super

covert lake
# willow grotto Why

Because you can run RVC (Retrieval-based-Voice-Conversion, Speech To Speech, the AI used for every AI covers also on Weights.gg) on your pc

willow grotto
#

I know that

#

I asked if there's a way I can separate the baking vocals from a song

covert lake
# willow grotto I think it's Nvidia GTX 1660 Super

Your GPU is good enough to do inference (use models) locally (on ur pc), you won't be able to train (make models) but use them

You can:

  • Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
    • Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
    • Mainline: The original RVC
  • Cloud (remote good pc, easier and faster than ur PC but it's limited):
    • Ilaria RVC Zero: fastest and simplest that you can get for free
    • Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
    • Applio Colab: max 4 hours, not granted, of GPU

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

covert lake
willow grotto
#

Ok thanks

covert lake
hardy mauve
#

Hello, the virtual cable isn't detecting input voice or maybe it's some bug I don't know,

Can anyone help me with it please!

covert lake
lean oyster
#

rerew

covert lake
wheat burrow
#

chat

#

could I theoretically run rvc on a jailbroken iPhone

#

not the voice changer part just audio conversion

gleaming hull
#

Maybe. If you got the code for it.

wheat burrow
#

ok thanks

gleaming hull
#

That's not a definitive answer. But, you never know unless you try it. Since, I assume, there was no information on the internet.

wheat burrow
#

I could have just ran tailscale on my iPhone and pc but I’m kind of interested how this would run so I’m just gonna try it

polar flax
wheat burrow
#

I have bash and python installed and was going to look at the other dependencies after

covert lake
#

it is possible on Termux, but it's Android only

#

however, there's an unofficial iOS alternative to termux, like iSH and A-Shell

#

maybe try those, but the commands could be different

#

Also reminder that it's just a phone, and considering that iPhones except the last one, have only 6gb of RAM, it won't be the best

ofc running on a good pc or cloud will be better,

but this can be a funni experiment

#

goodluck

if u actually manage to do this, maybe it would be good u write a guide if the commands are different, so other iPhone users can do it too

wheat burrow
#

no need for one of those apps hopefully