#💬|general-chat
1 messages · Page 13 of 1
i mean you gotta give it some credit man, still draws hands better than i do 😂
Sometimes it's spot-on, but it's usually just one hand. Inpainting with a lot of "rerolls" does the trick for now, but man, there's some mangled up hands out there.
yea usually it works better if i put some stuff into the negative prompt
Not a bad start. Keep in mind that you need to pick generations and you can use negative prompts to help
Keep in mind that less people may do better
generally i look at it this way. AI first struggled making something humans could recognize, then AI succeeds but struggles making faces, then AI succeeds but struggles making hands. and here we are
There are some beautiful hands it can make
oh yea, and i have a hunch that Ai is gonna go suuuper big this year especially with text based stuff. like KoboldAI etc, where you literally just ask it to do you job for you
🙃 That's why I'm buying out a bunch of V100's and P40's since they'll go up in price 100%
and A2's since they're just so small and adorable
I really do think so. I also think ppl don't realize how much AI is used already lol
I think the more apps there are, the more ppl use it, the more it will explode
Which is great
The more people are involved, the better
SD really turned things on its head
oh yea 100%. i kinda feel sorry for some software devs tho because a lot of them might become completely replaceable
or many other jobs in fact might become a bit obsolete
I think that we are still going to be needing plenty of people
Technology always shifts
As it does, new kinds of jobs open up
I think, and this is just my personal opinion
personal opinion or not yea you're right in thinking so i agree
You can never take out the human component of people that people need
There will always be niche markets
Just like people will want to support local communities
There are still plenty of countries
And people that need help
And our help as a community
Talent and skill is never wasted
The jobs may change
People may or may not change
But ultimately, it comes down to choosing what you do with your life
And if you want or need more jobs? Create them
Ask the AI to make up ideas if you can't think of 'em, hahahaha 😂
no matter what happens I'm save in a warm datacenter hall 😛 someone has to be there to fix up them servers or else karen won't be able to watch her netflix hehehehe 😂
Right on the nail HAHAHAHA
We all need and rely on people to keep the backbone of the net going
All the engineers, the people doing hard work, the farmers
The basic needs of life--everyone still needs 'em at the end of the day
yea most people if not 99.99% of people haven't even set a foot or seen what is going on in the background, datacenter technicians and engineers are like on the very end of the chain supporting everything but we don't get recognition really
not that i need any recognition lol, it's just funny how i try to explain to friends or family what I'm doing and I'm like well.... um.... uhhh... how do i explain lol 😂
hahah yea i usually don't bother i just tell them "just internet n' shit" :)
Hahahahahaha! Those that know already know, and those that don't...do some research?
I think dealing with the public/customer service/tech support is probably one of those jobs where it will try your brain in ways you didn't think possible.
oh oh oh hahahha get this, i wish i could EXPLAIN the feeling of when you're standing behind a full row of 48U racks at max power just the hot exhaust going through the hot isle containment and the SCREAM of the fans just sends chills down your spine of the power
My dear mother. When I went to college, aeons ago. She put some of my computer parts in a corndog box....it got put in a freezer and rescued five hours later.
LOL 😂 WHAT
Whiiiiiiirrrrrr!
YEAH. AND IT GOT PUT IN THE FREEZER BY SOMEONE
Never trust a non-techie with computer parts. That was my lesson
Oh god it's insane I tell you, it's even better when you purge the whole pod and go underneath the floor tiles and you get to feel the wind in your hair 😂
o lol i don't trust even some of the people I work with and even I think they know a lot
I've been near server racks, though probably not near as many as you.
I don't trust myself LOL
one time had a guy pull the wrong circuit breaker 😬 it was NOT good
yea I KNOW you'd think that with so many redundancies it would be fine but for some DUMB reason they don't connect the BBU's to the TOR switch so you only have power going into the hosts themselves
so you had a TOR switch powerdown basically so what is even the point of that BBU?!?!
You might as well just say someone toppled over the racks
That's
That's my question
it's designed by someone in the office that literally had no idea what they're doing probably
yea it's a mess like you'd think everything is designed to work perfect... nope 😛
Why is it someone who doesn't know what they're doing makes decisions? Hahahaha
Well
We'd like that to be the case
But that's not life
OH MY GOD tell me about it.... management doesn't have a CLUE most of the time of what I'm even doing
and yet they get the big money
yes :)
Cuz
My brain is 🤣
Off the rails
But there are a lot of amazing people in this community
Talented, well-educated and passionate people
And that's what I love about it
hey look I'm a bit "off" here and there a bit so i get ya 🫠
My brain is made of melted ice cream
laughs
The stream was a really great opportunity to show how people are working together, imho
For the greater good
As a community
(As opposed to people who aren't genuinely invested, etc)
oh i didn't know there was some sort of stream that sounds interesting
Frankly, I think AI is going to make people smarter
Yeah!
I wrote down highlights/transcript
maybe smarter but more lazy too :P
And then
STABILITY STREAM TRANSCRIPT HIGHLIGHTS 12/24/2022 Transcription by Atypical Consortium/Sunny For the deaf and hearing impaired, or for those who simply missed the stream! Basically, I just tried to do the best I could to highlight what they were saying. Though I could have done it word for wo...
There's a tranacript someone gave me, too, also copy/pasted, but the link is
Also there
ah cheers for that
It's a GREAT stream
Very technical
But a lot of GREAT, passionate people who are working on this technology and it's really enjoyable tbh
Lol I don't think human nature will change
I'd be really nice to be more involved in that but i have like so so little time but It's nice to see people working at something as a collective for no profit it's a very heartwarming community
generally not used to that personally, me and my friends would bite and fight each other 24/7 constantly 😂 in a playful way of course, i got some really bad habits from that
Oh, for sure
You know, everything moves so fast, and it can be so hard to keep up
It's so easy to get burnt out
I myself don't keep up with everything
Just because it's too much for one person
i remember back when i was OBSESSED.. and i mean obsessed.. with linux and I'd be in these chats and getting into arguments all the time like literally drawing blood with people over dumb stuff it was great 😛 ............ but i got better....
have to have some peaceful time now ay
And this community is special, I think
Very creative
Linux! It's been a long time since I used it
I used to dual boot but
Eh
creative it is true. but actually what do you think about people like in the art community
they're like oh Ai is strealing stuff
that's just like....... lol
i think it comes down to the fact that they just don't understand how it operates
Well, I have been in the art community for a long time hahaha
But this is not the same kind of feeling
no the same feeling? do tell
As it's blended with a wide variety of computer science, engineering, and various types of art--writing, music, drawing--but also has far reaching hands into applications like education and therapy
Medical application, etc
People from all walks of life
But people who have often been studying these things for years, or their whole lives
Who are putting their knowledge from their careers into AI, or switching their careers to AI
oh yea i can imagine. i used to paint a lot with like oil pains, but kinda lost interest in it over time. or maybe just didn't have time for it or something. I remember chatting with a few people who were like really engaged in art and even in college way back I think it takes a special type of character to pursue this career
same as any career to be honest
I'm an artist, both traditional and digital, and game developer, but I am disabled
A lot of ppl are also here to help disabled people be able to create, help with chronic conditions, etc, too
I try to help people understand art and technology to make it easier to understand and use SD
Artists have worked hard. I went to school, too
I've been on DA for 20? Years now
I think
interesting that you branch out like that. I used to have a big interest in everything and one day I'm like "well.. what is it the only thing that I'm MOST good at" and just specialized in that and focused all my attention on that just to get expertise in that field, and become a mentor for others.
ah very honorable i respect that
I chose game development (Besides loving games) because I love writing, I am a pianist, and an artist. I do some programming. (Done web development, but mostly old school.)
I couldn't pick because I can do all of the things and want to do all of the things.
My body may not be able to keep up, and my mind is on the fritz most of the time
But
so you got the whole shebang of game dev ay
I've spent my life studying
Thank you.<3 You're very kind
no problem friend :) it doesn't cost me anything to be nice
Because I've dedicated my time to learning, I figure that, at the very least, I can do something good with the knowledge I have.
This is what is so great about SD
Because no matter where you are in life
You can bring that knowledge to the table
Maybe you're not an artist, but you're a farmer
Everyone absorbs information a the time
Art, in a lot of ways, is simply an interpretation of that information
That knowledge about being a farmer can be translated to SD
You know about manufacturing parts, what things look at sunrise, how animals act, etc
The rendered image is not remaining on screen on Stable Diffusion. This happening to anyone else?
Key words people may never think of you may know, which gives you a unique perspective
i hold a lot of respect for people who continue to learn, when i was out of school i YEETED the hell out of there and was like nope let me be on my own terms and just get to work on stuff and get wisdom from people more skilled than i
And you can also take photos, learn how to make datasets, and have access to things others may not, etc
I would check #🤝|tech-support
I. Lol
I taught myself HS
Are there any sneak peaks or anything of someone using distilled sd
damn, how old are you?
As a guy who went into all of that also, yea... I can feel ya. I am a programmer by trade, but I am capable of everything else like writing or art. Music is still in the fritz since I am just a beginner there and can only do basic melodies, but yea.
AS OLD AS A CRYPT
(Not that old)
Let's just say around 40)
I've been playing since I was six. But playing an instrument and being an audio engineer are so different. 
I'm surprised
But also pleasantly so
That a lot of people within these spaces seem to be multi-talented
With multi-directional skills
Which is great, because, regardless of AI, that means that there's always something to learn/excel at
Is there a place where I can download textual inversion embeddings for specific known people so that I don’t have to train everyone by myself?
I wonder if I could train a concept using 3 identical images with only 1 change, and appropriately captioned, e.g.
Man sitting on a chair with green eyes Man sitting on a chair with brown eyes Man sitting on a chair with blue eyes
would it learn what the eye color means? Or would it overtrain the chair?
Depends on your images
hey guys
guys is S.D 2.1 is better than S.D 1.4 ?
shoutout to commodore 128
is there name for "anti variant?
👋
as in, thing that does something that is not expected, as a variable?
Hello is there a place where I can learn how to use the danbooru tags?
I am new to this and I want to learn
@cerulean ether You can add your own bot to your own server - but not take this bot and run it in your server.
how can i do that?
please guide me through
@cerulean ether Are you familiar with integrations? Perhaps check #1042896447311454361
u mean like how to use the prompts and all?
I meant building bots inside of your own discord. You'd be communicating across in such that your own API key is used to reference the engine that will allow sending and receiving of prompts and images.
I would suggest checking the pin 📌 message in #1042896447311454361
How can I connect my laptop’s automatic1111 webui to my PC with a decent vram?
@wide juniper You can run the webui on your powerful PC then use port forwarding to connect to it over your vlan. By default it only listens to localhost, but you can setup a tunnel
What do people use for photorealistic images of people? I am found the dreamlike-photoreal does very good but mangles the anatomy a bit. I found the moDi model to get the anatomy just right but very cartoonish. Mixing them 50/50 is giving me very good photorealistic human results. Does anyone have a model or model combo that they think does well for this?
isn't that what the --listen is for?
ahh even easier.
Is there a limit on creating?
If you’re using your own hardware, not really. If you’re using others’ hardware there may be a cost
is anyone having any trouble using stable diffusion atm ?
No, but I run it on my own machine; what are you using?
OK, what does the log say?
Does it cost anything if i use #dreambot channels?
@north stirrup send it to you prvt
No, please.
Yes, DreamBot costs money past a few freebies
File "C:\Users\Name\Documents\AIArt\stable-diffusion-webui\venv\lib\site-packages\gradio\routes.py", line 284, in run_predict
output = await app.blocks.process_api(
File "C:\Users\Name\Documents\AIArt\stable-diffusion-webui\venv\lib\site-packages\gradio\blocks.py", line 983, in process_api
data = self.postprocess_data(fn_index, result["prediction"], state)
File "C:\Users\Name\Documents\AIArt\stable-diffusion-webui\venv\lib\site-packages\gradio\blocks.py", line 930, in postprocess_data
prediction_value = block.postprocess(prediction_value)
File "C:\Users\Name\Documents\AIArt\stable-diffusion-webui\venv\lib\site-packages\gradio\components.py", line 3308, in postprocess
file = processing_utils.save_pil_to_file(img, dir=self.temp_dir)
File "C:\Users\Name\Documents\AIArt\stable-diffusion-webui\modules\ui_tempdir.py", line 18, in save_pil_to_file
shared.demo.temp_file_sets[0] = shared.demo.temp_file_sets[0] | {os.path.abspath(already_saved_as)}
AttributeError: 'Blocks' object has no attribute 'temp_file_sets'
I don’t know that problem; try asking in #🤝|tech-support with the log
thanks
Is more VRAM going to make things faster?
i found somoene on reddit mentioning that problem too. said to disable dream fusion for now. so i turned off the extension and rebooted the terminal and webui
more vram usually comes hand in hand with faster gpu and faster vram, so usually yes
Well I’m looking at rtx3090 not ti. That should be the cheapest 24 GB card right? Does this mean it probably offers the most value right now?
no. its still very much a premium product and is priced as a luxery item not a value item. i've seen 3090 at higher price tags than 3090ti's, only because they were one of the only 24gb cards available
might've been an oc edition of a 3090 too
What’s oc?
overclocked. some manufacturers do it at the factory
Yep, in AI/ML vram is the number one thing and the more you can get the better it is even if it means sacrificing some speed/cuda cores.
What gpu has good value for sd then?
You tell me because I think they are all 50% over priced.
I see
"good value"
Nvidia, and AMD too, got that sweet crypto money and crypto mining is dead but they refuse to let it go back to normal.
I think they realize theres a ML revolution going on that they can bank on too
the day someone gets a library going that lets multiple gpu's share a memory pool for training, it'll be like mining all over
Yes, but in the end they are going to have to give it up but they are up against a wall because they have pro cards they will kneecap the 4090 to protect.
as is tradition
I watched a video and they let the 4090 have a feature (hardware wise) that you only see on A100s while they kneecapped everything else. Due to that one feature it is still, overall, 56% faster than the 3090 in AI/ML.
yeah usually one instruction set changes everything lol
If they had not kneecapped it well over 100% faster
yeah they still got the pro market products that come off the same dies. they give the pro end the best bins and then the ones that don't bin as well have features disabled until they test well enough. then they develop a product around that. It's not always nefarious. a lot of the time it's just how the dies bake out in the lithography machines
"not always"
I am due for an upgrade as my 1060 is near death (sad) but when a 4060 is looking to have 8-10gb of vram that is shit considering it is 3 gens later now so six years.
my vega 64 was something of a 1070 equivalent. i was feeling the belt on that one for the past year
@lilac reef how is it dying?
besides the 500-650 price tag.
it's a 1080p class consumer gaming card. that's why we get 10gb cards still in 2023
Must be ram heat or something I can't measure but I had to redo the tim, and the pads but to try and train SD it throttles itself the longer it goes. Has to be ram related because the card stays cool but this gen it didn't have a way to measure ram temps.
consumer grade with pro-sumer pricing. Gotta love it.
prosumer is one of the most lucrative hardware markets haha
@lilac reef tim?
it's why sony sells a pro line of the playstation hahaha
looks at his 2nd hand, usually 64F running card
thermal insulating material. Iow that thermal paste.
ahh
shouldn't you not put insulation on a gpu?
so is it hard to create my own machine
Mine had all but dried out. The pads aren't perfect as this required 3mm pads but evga was no help, and everyone said 1.5 or 2mm bought both for 30 bucks and said to hell with it I am not buying again.
not hard once it's set up on proper hardware
oh great what hardware do i need?
preferably a gpu with at least 8gb of dedicated vram
or a macbook with one of their fancy neural chips
i have a gaming laptop
gaming laptop might work might not. sometimes they use shared memory banks
look into automatic1111 webui. there's instructions on their githb for installation
i do have 15,8 ram
sure
that's system memory. gpu uses vram
oh i thought you misstyped xd
yeah it gets tedious with all the initials haha
i only have 128 mb vram on display 1
yeah your laptop gpu is a classic "gaming" laptop.. meaning it doesn't have a dedicated gpu. it has an integrated gpu on the cpu die and shares memory with the system
so i need like a pc
or a laptop with a dGPU yeah
damn thats unfortinute
or a mac that has a dedicated neural engine chip thingy
I'm not sure at this point, I'm still a bit confused about EMA weights, maybe try both and compare the results? I think that's what I will do.
Please let me know. My training is to slow to experiment... but like, someone must know these things. It's not alien technology, right?
Yes, but people give confusing answers when I have asked. They say that it isn't useful for training, but for inference only (generating images). Yet other sources says that it IS useful for training, so I'm not sure at this point. Will train with and without EMA to see the difference.
I'd love to know anything that lets me save step economy, as I'm training on a laptop andd it takes forever
It's the fastest consumer card with 24gb of VRAM that takes advantage of the latest CUDA optimization features, besides the 4090. The value depends on what you need it/are going to use it for really and your budget.
Is there some sort of extension which allows you to upload models from interface? Like, the uploader from Prolapse's colab, but within the UI itself?
Anyone good at img2img and know how to outpaint a tv around a picture?
Have tried so many times
I just don’t know how to do that 😅
Hi! I would like to ask if you would be interested in taking part in our exhibition of AI outputs, on the topic "communication / dialogue". I would like as many nationalities as possible to participate in the exhibition. I currently have 25 author outputs ready. The exhibition will be in Zlín in the Czech Republic. Under my auspices and the Tomas Bata University in Zlín. The output can use any text to image model. Any aspect ratio. The exhibition will also have its own website. Each author will receive a catalog from the exhibition. If interested please let me know in DM! Thank you.
Can I run Stable Diffusion locally on my computer?
If you have a decent computer, yes
6GB or more of vram, although preferably more
honestly I'd do it in three passes, working out prompts to get a front orthographic of just a television, secondly the image I want inside, and finally, combine them with traditional image editing.
👋🏻
@grave barn what's your goal? Would you prefer command line or to use a graphical interface?
I use Automatic1111 so I have text boxes and sliders to enter things like positive and negative prompts, steps, sampling method, resolution, etc. on the command line, I'd be formatting a long string of text to do the same thing.
I guess the other question is do you want to run it locally, or like with google collab or another cloud service.
once you know your environment finding guides online will be a lot easier.
Is there a way in A1111 to run img2img, send the output image back and run img2img again, like 10 times or something?
Im doing it manually now but maybe there is an automated method?
Loopback script should do that
so the script knows to select the last image I guess?
yeah it keeps feeding the output into the next iteration
Oh wow! I will try that. Thanks!
only downside is I think it maxes out at 30 iterations
nvmind, the slider only goes to thirty, you can type any value you want.
works perfectly. thanks again
Hey is there a set of docs somewhere? Having trouble finding out how to do the aspect ratio
how do you mean?
like midjourney has the --ar 2:3 for a 2 to three aspect ratio output
I've been crunching it manually.
manual all the way baby
I made a spreadsheet.... I should convert it to a google sheet to share.
You enter the square pixel space, and it gives you resolution at different aspect ratios with similar pixel count
(short side/ 2) x 3 = long side
Thanks! Would you mind dropping a quick example for me of what that prompt would look like?
Thanks, I checked out some prompts and figured it out! Appreciate the help 🙂
Well
Caspian's Tank Diffusion is in the works
It will take me 98 hours of CUDA
But inshAllah it will be worth it
I will make sure to share it with the community inshAllah
So if you want to see the progress of the tank diffuser boi
Hmu
Is this AI capable of outputting images as detailed and specific as mid journey?
not sure, but running locally makes it amazing for many usecases
This ai is more capable than midjourney
If you know how to use it
Don't know if anyone else plays with ratios, but in lieu of a script, here's a spreadsheet I use to adjust ratios while keeping a static pixel count in generations: https://docs.google.com/spreadsheets/d/17LXM_GgHib7W9dj88j7pqgenKsohNht-9f-CaV-yp9w/edit?usp=sharing
the thing about midjourney is that they add extra prompts without you knowing, you can even type "fksdhfkjdhsf" and it will give you a very detailed portrait
you need to learn how SD thinks and it will become a very useful tool
Oh ye? I'll try it out
Where do you download it to run it locally?
I'm very dumb with programming
Midjourney creates very clean images. I haven't had the same success here yet but I just got here.
theres github repos
heh i can't remember off hand but yeah .. i'm using the "diffusers" python library. i've been able to make my own prompt randomizer front-end , assists for running img2img on batches of tiles.. so much fun. Hell in the past 2 months that i've paused from running it there's probably a whole load of new things to try out
Can you feed it images to base it's output off of like in Midjourney?
thats exactly what i'm doing, for example i've made something where i can make a low-res set of tiles in a grid , and my script will cut it up and run batches of different prompts on them
https://github.com/huggingface/diffusers i was using this stuff.
I recently gave it a "gates of hell" prompt and it feels watered down
Midjourney I mean
Seemed less detailed than I expected too
And it does weird deepdream stuff like putting a door inside the door
what i've been doing is stringing together lots of thematic and descriptive words, and spitting out lots of random permutations of those. 5 seconds per image on rtx3080 - its fascinating watching what it'll make
Are you using the cloud or are you doing that locally?
It seems I'll need to have a strong understanding of python to get this thing to work right
Maybe I could pay someone on Fiverr to build a UI for me
yeah i'm sure someone will have done this by now
i must admit i suck at UI programming.. most people running locally are probably just using python - but there's no reason a local UI can't be done
Do you know of a tutorial on YouTube that may be able to at least walk me through how to do the python stuff myself?
maybe leave a question in Reddit and someone can give a better reply. i'm sure there's solutions to this by now.. progress has been rapid
the correct use of sd is "[describing the image you want], [adjectives]".
Example: a painting of the gates of hell, concept art, voodoo, sharp focus, global illumination, red lighting, smoke and fire, classical painting, detailed illustration
this is the result: #1047760914008522782 message
Just wanna inform the following, because like me, someone might be using this version cause of the latest fixes: PyTorch-nightly has been compromised. If you installed this version during December 25-30 UNINSTALL IMMEDIATELY and consider formatting your OS. Source: https://pytorch.org/blog/compromised-nightly-dependency/
i must admit setting up the environment was quite involved, i.e. I had to hunt down some other dependencies (CUDA stuff, "conda", i can't remember details), no idea if there's a betters solution for that now..
How do i check if i installed it on that
Sheesh
It means we are early, but it is also hard work
We're like the old school computer game programmers my dudes
The article provides a code to check if youre affected. Run the code in your installed enviroment, e.g. in the WebUI folder use source ./venv/bin/activate and then run the code provided in the article.
how exactly, there is no directory called venv\bin
Can you get good outputs here in the dreambot channels or is that thing a bit nerfed?
Are you using webui? Did you modify the webui-user script? Are you using windows+nvidia?
yes to all
i AM an old school computer game programmer 🙂 (PS1 games, cut my teeth programming 8/16 bit stuff at home)
I mean what the hell is this output?
Ok, can you pls show your webui-user.bat?
connecting the future with the past, i'm so fired up about retro things again because Stable Diffusion can enhance retro graphics . It's like its given a point to all the nostalgic itches I have
Is it okay to poke around here for people who offer services?
I'm looking for someone who could help me build something
Something with stable diffusion
I need to get it to work and then I need to slap a UI onto it
What exactly
- Pretrained models for different but reliable outputs (e.g. poses, backgrounds or settings)
- Take images and then train on them
- Apply 1 to 2
- Give output (bonus: upscaling)
- Save the output to private individual profiles
- Other stuff: stuff that UI into a sleek mobile app
I prefer SD to MJ in so many ways but can’t deny there’s a crispness / resolution to mj images that has me envious. Is there anything on the horizon that will put SD over the top?
Will we ever see an open source alternative to ChatGPT, and will it be possible to run it offline on a personal computer? Like what Stable Diffusion is to Midjourney and Dall-e? Arte there any projects like this in the making?
i think those models are a LOT bigger. no idea if it would be possible to distill smaller domain specific versions
I'd wager it's just a matter of time.
i've just been reading a big explanation of why someone expects the gap will widen (between what can run locally and in the cloud) r.e. that kind of AI
Yeah, that's what ChatGPT itself told me.
I hope so. I will suck when they take it away and it becomes paid services. I'm getting too used to using it to solve every problem that appears in life, and things are going better that ever!
haha thats why i'm nervous about using it too much despite how much people rave about it :/
At work I've pretty well got my group sold on paying for it when it switches over, we use it so much to start scripting commenting old scripts/programs, etc. lots of business use.
saves dozens of hours a week.
some1 wanna change tha world ?
we could do a social network where people could say if they ok or not with decision that they gov take in theyre name.. respecting democracy for the first time ...
I hope it's like Midjourney, that you pay a fixed price and can use it for the whole month. Not like Jasper, that have limited words per month.
There seems to be an open source alternative in development: https://metaroids.com/news/an-open-source-version-of-chatgpt-is-coming/
heeyeyyy, does anybody knows if i can add the bot to my own server?? Or I cant do that
can sd v2+ be used in Automatic1111's webui?
yes but it needs an extra step
Did this tutorial for installing and running SD locally work recently for anyone?
I'm going to try it in a few days when I'm back at my house with good internet.
what is the difference between ema and no ema by the way?
one is for training the other isn't.
Hello everyone.
hello guys, if someone is good at drawing digitally and editing. please DM me, i need your help
What are embeddings. Aren't they just checkpoints?
what do you need?
hello guys somone can tell me the diference when (()) are used in one prompt?
Linux is getting better recently. A guy I know knew of a Linux distro that can basically run any windows game and apps + customizable and works pretty much like windows. Only problem is with anti cheat, but I just hope it would get fixed soon.
Wait, how are you disabled? Sorry for the question. I'm just curious.
Wat
Been in DA for like 10 years, only gotten around to publishing my digital art at around 5 years ago. It's a welcoming community, and a pretty wholesome one too, once you get past the weirdos.
Sorry, this is what happens if you try to continue a conversation from the opposite side of the time zone.
needed help with a server
Hi guys im new to SD, I was wondering if there are any resources on learning how to create our own anime model
There are many
Search up:
Dreambooth
Textual Inversion
Hypernetworks
Etc.
What are your system specs?
2080 gpu, 32 gb ram, i7 7700k
i see some prompts with thats and i want to know if have any diference
sorry about my eng
You wanna look for "Lora" for dreambooth if you want to train on your own hardware
The documentation can be found in the auto1111 repository
where i can find that?
thx
To use models that use 1.5, do' I need 1,5 in my models folder?
I just have standard 1.4
;w;
anyone know the input for feeding it an image?
anyone know whats a safe place to dl f222 model?
am i able to use stable ai to make fan art of characters from my favourite marvel and disney movies?
yes
Is there any stable diffusion model which is trained on large number of images
isnt any ai art model trained on images?
this should maybe be an obvious answer but
higher LR has higher risk of forgetting cooncepts, right?
but the tradeoff isn't linear.... learning longer and a lower LR won't make it 'forget' the way training a short time at a high LR will?
for context, I'm using individual custom BLIP captions with detailed natural language and no class images (because there are no tokens or classes etc...)
Stable Diffusion...?
5.85 billion images, not enough?
I meant over stable diffusion is there any model that extended the training with large number of images
Well yeah, except those would be specialized models
@wise stratus whens the next
tease 👀
There are 864mn parameters tho
LAION-5B has 5.85 billion images
oh
bruh
time to make 5 billion diffusion
nah
we can optimize it like fuck
by poruing millions of $ into its development
heck
we can make an ai who would optimize the AI
i need to look into how it works behind the scenes
but 2.1 is very good with custom training tho
and embeds, loved it
silly question, but If I'm doing some dreambooth, should I resize all the images to be the same like 256x256 or 512x512? and so I can learn for the future, how do you know?
in stable diffusion 2.1 they remove ability to copy artist style right? or im missing something here
i think so , to reduce complaints. artists are trying to fund campaigns to get this legally restricted
too bad, in 2.1 i can't achieve any style beside photo realistic
I'm still interested in a model that is purely trained on photos to bypass these anti-AI arguments altogther - however it would be good to have both models around of course. One thing I'm wondering is if other filters could emulate hand-painted art styles (there's non-neural algorithms for that, i.e. procedurally trying to rebuild a photo with brushstrokes) - and then you could perhaps finetune (or train a model in the first place) on that
https://en.wikipedia.org/wiki/Non-photorealistic_rendering. EDIT: seems in recent years neural techniques have taken over for this aswell, hah. but i'd guess there's still tools out there that can simulate brushstrokes (i'm sure there was something like this for some of the paint tools which have elaborate brush engines with colour bleed and so on)
It's great that it works for you. For now, I think the best option for me is to revert back to using version 1.5. I rely heavily on inpainting and manual fixes in my drawing program, so I need a style that is more hand-illustrated.
are you using a local installation? embeds might give you what you need in 2.x, they are really powerful
(jargon: is an "embedding" different to a "finetune"? makes it sound like a translation layer perhaps thats easier to distribute than a whole new version of the net?)
Finetune is generally used as a blanket term for things like embeddings and custom models- embeddings are tiny files, usually not even 100 kb in size, that give your prompt a nudge in a certain direction
yes, i do. tbh, I just discovered embeddings today after trying version 2.1, so i'm still not sure exactly what they do. in my mind, it seems like the model is trained to only produce a specific style?
It's not adding something to the model, rather it guides the prompt where you want to go, using vectorized tokens- it's basically a prompt shortcut
That's the fun part- they are really flexible, all an embeddings really contains are very specific prompt tokens, nothing more nothing less
(thanks for th explanation . "nudge a prompt" .. sounds like the vector output from the text encoder, probably just added to whatever prompt you give, ok. maybe its possible to create that vector directly e.g. working back from images, i.e. to give more control than you'd get from text)
you don't need to download an entire new model to get the results you want this way
It's hard to explain not knowing exactly how much you know about all of this haha
But training an embedding is pretty much what you said
instead of changing weights of the model it searches for the part in latent space that contains the style you want
https://discord.com/channels/1002292111942635562/1055676123020795965 just to show what it can do, shameless self promotion
I couldn't implement S.D. from scratch or anything but I've got an overview in my head.. 3 components - (i)text->embedding vector (text encoder), (ii)VAE ( goes to and from a compressed version of image space to work in), (iii) the U-Net (does the main work of diffusion iterations, uses the embedding to guide working on an image in the latent space of the VAE). "custom model" would be a new version of the whole thing
I just woke up so I still have a bit of brain fog, but that sounds pretty correct
i see, after exploring the embeddings in civitai, i noticed that most of them have a semi-realistic style that does not fit my needs. the results are too smooth and perfect, and what i envision is a style like greg rutkowski, with lots of brush strokes, so that i can continue the drawing in my own software
i gather that going from image -> embedding for diffusiuno models is much harder than earlier generative models (VAE's and GANs) because there isn't a simple latent space from which the whole image is generated.. but there is supposedly as process that can actually do this.
I'm sure it can be trained though- my ThisHonor embed (from concept art of the game Dishonored) for instance contains very heavy brush strokes
(probably not your goal but i wonder if you could use img2img to go the other way .. you continue drawing but then feed it back into a diffusion model to make it consistent with its photoreal output..)
or train embeddings on their own style
Any1 knows how do you refine a generated image. Like creating different poses of ur char with face and outift remaining same ??
You're right this is the closest option that meets my needs, i'll give it a try
(and yeah this is of interest to me aswell, i do some handpainted stuff aswell and it would actually be pretty awesome to have a net that can replicate that , fill in from sketches and turn photos into it..)
The plan is for me to write a quick start guide on training embeds soon- that might come in handy for stuff like this
I just need to find the time 😄
that sounds like a great option
i've definitely seen S.D (still using 1.4 here) do very interesting things on my fully shaded handdrawn image, i.e. re-interpreting the shapes with a sense of 3d but substituting elements based on a prompt. I've never actually tried to jjust make it a straight photoreal re-interpretation though.. but from what i see everyone else doing it seems like that should be possible)
If you really want to take the time you can make a composite of inpainted parts, then do a low denoise img2img pass to make it fit together
(i'm guessing that means setting the parameter that says "how strongly to make it follow the prompt" low, i recall 3 parameters.. iterations, strength, and one other EDIT 'guidance scale' is the one that says how strongly it follows)
Well- not really. All img2img really does is add noise to your image and denoise it again- so a high denoise strength means it adds a ton of noise which it then denoises again with the prompt provided
So if you have a composite image with clear cutout parts etc, you can add a little noise and denoise it again- to make it all more coherent, while not changing everything too much
class StableDiffusionImg2ImgPipeline(DiffusionPipeline): .. def call( self, prompt: Union[str, List[str]], image: Union[torch.FloatTensor, PIL.Image.Image] = None, strength: float = 0.8, num_inference_steps: Optional[int] = 50, guidance_scale: Optional[float] = 7.5, ... ): ... I thought it was that 'guidance_scale' parameter that controls how strongly to use the prompt - I also thought the caller to this pipeline would add the noise - i can't see a parameter here , but the example or UI probably does it for you. Now i'm guessing quintusdiast is using webui etc rather than python so there would indeed be a noise parameter somewhere)
Guidance scale is exactly that, but I meant that that wasn't what my earlier denoise img2img comment was about 🙂
(i definitely remember seeing the effect of varying noise .. sorry my experiments are on another machine , i'll dig it up to see what i was doing ..)
I'm not a programmer or technologically gifted though, so my knowledge on that stuff is limited
i saw a reddit post where op used negative CFG scale (for science!), but apparently (now?) AUTO1111 does not allow this 🤔 does this even make sense? is this be the same as swapping positive & negative prompts?
(so what I did was started with the 'diffusers lib' img2img example - I think the actual noise addition you are talking about was implemented there . i made something to generate random prompts, and do tiling textures via feedback hacks, and run batches on a grid of tiles cut out from an input image . i think there's a better way to do repeating tiling tesxtures out of the box now . I use a mac mini M1 as my daily driver and was running StableDiffusion on a desktop PC which is off at the moment. I've got sidetracked doing actual art again lmao .. my intention is to use SD to upscale and vary it, but i'm hedging bets on the copyright/legal debates by focussing on art thats complete without it first..)
What games have you made so far?
Does it make a big difference what checkpoint i use in SD?
for sure. some are better at specific styles, some are easier to prompt, some are focussed on specific results
Oh really? Is there anywhere i can get new ones?
https://www.nytimes.com/2022/12/31/opinion/sarah-andersen-how-algorithim-took-my-work.html
I am pretty disgusted by the take this artist chose- not only does the title put Alt-Right and AI image generation on equal footing suggesting that they are the same or overlapping, she's also completely choosing to ignore fair use and all implications that brings with it. Which is a shame
i see them conflating ai art and NFT/crypto people aswell. i hated NFT/crypto because I wanted GPUs for AI
I’ve been playing around with several generators, and so far none have mimicked my style in a way that can directly threaten my career, a fact that will almost certainly change as A.I. continues to improve. It’s undeniable; the A.I.s know me. Most have captured the outlines and signatures of my comics — black hair, bangs, striped T-shirts. To others, it may look like a drawing taking shape.
So all of this fearmongering in the title and the rest of the article has the conclusion that it's pretty much a childs' impression of her comic when you prompt her name. Yet she felt the need to say
The LAION data sets have also been found to include photos of extreme violence, medical records and nonconsensual pornography. There’s a chance that somewhere in there lurks a photo of you. There are some guardrails for the more well-known A.I. generators, such as limiting certain search terms, but that doesn’t change the fact that the data set is still rife with disturbing material, and that users can find ways around the term limitations.
Which is not at all related to her argument, and does not have anything to do with the image generation process- it just means they are available on the web somewhere that has been scraped
God I hate this haha
Hi, i have a question, how can i set an image size (like for a wallpaper) ?
Oh, many ways lol. I'm like a crypt. I have many rare autoimmune conditions (Lupus, Reynauds, Sjogrens, fibromylagia, etc), deal with paralysis of my limbs, loss of feeling, have weakness, carpal/cubital tunnel, gotta have my knee replaced, starbismus/lazy eye, almost legally blind in my right eye.....along with other rare conditions. I'm a circus with ADHD but I keep on going. I usually use a cane to walk.
In the process of trying to create a society with few guardrails, almost no shame, and no ostracization, people are now terrified of what’s inside other people’s heads. Go figure.
Hallo guys before 1 week chatgpt knows What stable diffusion is but now they dont what this is. Why?
Also, good morning, everyone! I hope you are all well today!
Living the dream.
Damn, respect.
Also, you can do traditional styles with 2.1, etc
Do you mean you want to know what Stable Diffusion is?
Thanks. I do what I can with what I got and try to make the most of it
Can't say I always succeed 😂
But I try
Could you help me with Dreambooth Lora? Im trying to make a model for the community
It's 2023; the slate's clean. Right!?
I have a question rather
A s k away
So i saved the parameteres right after my ckpt save(set it on like, 1k steps?), so when i continue training with the loaded parameters, does it continue from the image it left off or despite the steps, does it go back to the first image in my dataset?
I'm not 100% sure, as I haven't had time to work with Lora. But I would imagine that it works the same as TI or DB locally.
so?
resumes at the image it left off?
I would probably just go with no
I see
I'll double check and if I hear differently, I'll let you know
it isnt a problem either way, cause i can do Lifetime steps/steps per image and delete the images up to that count
from my dataset
nods
I just haven't used it myself, so, unfortunately, I just can't say either way confirmed
think this way, if it saved the index of the image, it would continue from there and dont care about the images before, if it did not save, it would just start at the first image, which would be the correct one since i deleted the processed ones
so either way
anyone having an issue on google colab, that won't appear the public url
yes?
Guided training, or what you might call textual inversion/mini fine tuning 2.0
The cloud model I was using is in error state 😩
I'm not
But who knows
Is there another Finetuned Diffusion that's not the Anzorq one
The Github does a pretty good job of explaining how it works
I will probably try it, at some point, but I'm currently busy making models and tutorials for 2.1
Esp for Dreambooth, etc
1.5 included
Guess I can't generate anything for a while until this is fixed
It says "Preparing Space" and never moves past that point
I'm on Chrome OS so I can't use a local model
Figures the moment I actually have time to make them this happens
Yeah, I see it, too
The owner will prolly need to fix it
@pastel beacon that is a specific model, you'd have to look for something similar someone else has compiled, either as a checkpoint or embedding
so what is the most popular distro these days for local stable diffusion?
@main quail I'd think Automatic 1111 followed by InvokeAI
ive been trying to get invoke working, automatic drives me nuts due to no spacebar support
Hello guys. I have a question about SD. I have Nvidia GeForce 3060 Ti 16gb, but SD Told me. I have Not so much GB. Why?
So GPUs have different kinds of memory
SD is using what's called "Dedicated Video Memory" or "VRAM" for short
VRAM is also
A: a fraction of the total memory on the GPU
And B: Can be impossible to allocate manually depending on the graphics card
Nvidia is unfortunately one of those graphics card companies that doesn't let you change the memory allocation for VRAM
Believe me I have tried
I even went into my PCs BIOS and there was no setting for changing VRAM
Short answer, your 3060 TI has 8GB- there is no 16GB variant 🙂
How much VRAM does it have? That's the important thing.
I mean 8gb
Is 8 GB the total memory on the graphics card or is it the VRAM?
those two values are not interchangable
Speaking of VRAM, is there some way I can get Stable Diffusion UI to access the shared video memory?
are you sure you're not talking about shared memory? because that is your ram, not your GPU
8GB is 100% all of the memory on such a card- the virtual memory you might see in the task manager is 50% of the ram you have
has nothing to do with the actual chips on the gpu
#🏞|general-with-images allows it
Are you saying you're getting an error like," CUBA out of memory?"
Hello. What are the default modules of stable diffusion 2.1. Is there a glossary or index of the modules in alphabetical order?
Ayyy Finetuned finally loaded
Paidjourney
i have got an idea on weather how their ai works and its not what we all think it is
yeah paid shit
and the results are preety narrow
my idea is , its not really just a image gen model
its a tag builder ai
like give it an apple and it automatically adds high quality tags around it (just so the people dont complain about shit results)
its not better than stable diffusion but it is better at pleasing people however
well about nijijourney
still with narrow stuff man i want stable diffusion anime model
when I switch models in a1111, is there a way to autoload a VAE, by specifying it in model.yaml? I've been copy/pasting the VAE file with different names, it adds up
i use colab notebooks so idk
i am here for a question
when are we getting a good text ai that is better than clip or blip
sd is already at its peak but the language models makes harder to generate good images and takes many tries
@crimson veldt this may interest you, someone made a proof of concept which is a lot more accurate https://medium.com/@enryu9000/anifusion-sd-91a59431a6dd
oh thanks
imma be back after reading that shit
I tried it (it's just an SD fork), and though it's rough around the edges it makes images SD/NAI/etc is simply not capable of, and far more accurate for the input tags
so this is a language model built for basically anime focused sd models?
or is this a model like anything v3?
not a model. it replaces the way text prompts work, to be far more accurate for the input
I mean it's a model too, but he forked SD for his model to work. Worth trying one evening, it doesnt take long to set up
oh
if you're familiar with danbooru tags, try it, you'll quickly see what I mean
yeah ive been using anything v3 and other models like yohan
they require danbooru tags but are not "that" acurate
from the char anifusion showed it looks better at understanding tags and prompts
this one is accurate, but wont make flawless images like Anything3. (maybe it does at higher steps, I didnt try since I have a crappy GPU)
free or paid?
free
how long does it take to make an image?
but i just get 4-5 hours
nice
I'm just not gonna associate my real name and phone number with SD gens though. Nice try, Google.
check out camenduru for colab
I'm still holding out for become Prime Minister and booting out that lamer Trudeau
this guy created colabs for most models, you can just run it without writing any code
so it's notebooks with preloaded models?
yep
nice. Have you uploaded your own model before?
nope
its hard since idk about ai stuff and i am new
i am not familier with coding too
so if i get a model i wanna use i just contact camen and he upload a colab
I mostly had a case of ADHD, but damn... that's tough.
ah, you dotn need to know coding, I just meant that if you download a model from some other website (because it's not available on camenduru), you can load it via Google Drive.
ADHD is fun lol
ah that
did it but in early version and things messed up later i found out that novel ai model needed specific settings
from all you've suffered through it makes me sad that people are trying to take it away. :(
then i never tried because i found preloaded models
well, when your attention is all over the place and dopamine is hard to come by for your brain, it tends to go into weird places, really weird places...
But hey, I tend to learn easily and handle well under pressure, so that's a plus. And then I tend to hyperfocus on things that I may or may not want, so there's that.
all these things developed superfast
I'm alive! S o m e h o w. (I had a rare blood disease that almost x'd me multiple times...strokes, etc.) That is what is most important--continuing to persist despite difficulties in life.
That is why I believe, or rather, want to continue to work with SD, because it can do a lot of good
I know there are many good people in the world and I believe, and can see, many good people working together behind this technology
i too want to make a career in machine learning
There are many good people in this community. Kind people
There's so much to learn, too.
ai community is 90% good people unlike artist community
i am not even level 1 lol
Adderall has really helped me
ill start after my exams are over
The Lupus does make it worse
I'm open about my medical stuff, to a degree, simply because I know there are other people who have struggled like me
hey are you both programmers?
well... I had none for all my life. Reason: Money problems.
And I want them to know they can have hope, and to not give up
I know that pain
It sucks being in a third world country
There are a lot of great artist communities, too. Some people are simply misinformed.
It has only been in the past year that I have gotten Adderall.
same. I have known a lot of good people in the art community. While we may differ regarding AI art, at least we still had fun and commune together.
ive fought them and man they never wanna learn . well most of them are at our side and already with ai community
Ive spent most of my life without it.
how did it felt when you finally had it?
i agree they were good and many understand ai
There are a lot of challenges with ADHD.
but i am talking about human only art community
sam does art apparently lost all respect after misinforming thousands of people
It's supposed to help with narcolepsy
As if your head was cleared, like it was at peace. Without way too many thoughts crammed into your head every second?
But did it help with mine? No
damn...
hey whats maven role stands for?
Over time, it helps those pattern of thoughts emerge, and make them easier to access
It's for moderation; I'm a new mod
I just help when I can
any idea how big stability ai team is?
Cuz health lol
well... If I had the chance, once I get a job after college, I'll try it.
I'm sorry but I can only answer general questions
nvm its around 250 people big
It's certainly worth it
It's helped improve my life
Ofc, everyone is going to be different
But it's certainly improved my memory, for sure
will do
So, what brought both of you into the AI community?
mine is kindof funny or shocking
be a teenager
dicover dall e mini for meme
discover midjourney
dicover some other ai
discover unstable diffuion through some blog and reach stable diffusion
do all this for ai por-
Sounds like quite the journey!
(Of course, I'd love to hear anyone else's story, too)
when i heard about stable diffusion it was able to produce shit but when i reached here there were already big things out
i reached here after the releasing of anything v3
kind of funny how i knew about stable diffusion models before SD itself
Started with work stuff. A Game Dev project in Unity with Nav Meshes whilst incorporating AI. Found out about AI dungeon by that time, and since then, I went into the AI rabbithole.
Niiice
Found out about SD from "two minute papers", looked around, poked at the parameters in fast stable diffusion colab, and the rest is history.
for me novel ai leak drama made me fall into a hole
i am continuously using ai after that
yeah short and covers what we wanted to actually hear
same, that's where I got some info about new tech.
i discovered him after image gen ais
airtpreneur was the one for me who introduced stable diffusion stuff
Sometimes I look at that bit there's always so much news on Discord at this point
And so many AI-related Discords
thousands of ai discords
they all happened to exist after SD
now they are contributing in ai, great decision for taking this project open source
ah two minute papers love that yupyup
finally i have found what i want to be after school ends
i'm thinking for creating fictitious layouts, you could then guide it with img2img "i want a coastline here, some buildings over here, now flesh it out.."
also you could make repeating worlds
how about the EDiff-i?
(add something to train looking at a point on a map and hallucinating a panoramic view from ground level .. that could be pretty interesting)
With in-painting, or a game?
that should work for the kind of use case. Either that or inpainting.
as a gamedv tool.. i'd imagine this h appenging offline
I think so
that's strange, he's a year long member. probably a hack?
wait, it's still a leaf. Either a lurker or just hiding.
meh
back to topic
i know of someone who created a EDiff-I plugin for SD, though its still under dev. Not to mention not released as an extendion for the WebUI
There are a lot of developers here
Working on cool projects
There's a lot of application for game development and AI, I think
Can anyone please help me to understand if I use SD locally, do I own the generated output, if it's distinguishable from the training data? Or who owns it?
This isn’t settled law yet, but it’s likely that “raw” prompts would be public domain
at the moment I believe it's a gray area. But if there was someone who'd own it, then it'd be the one who created that image. So either you, or the AI :P
Inpainting, img2img etc. can likely produce copyrightable works
Copyright can’t vest in non-persons, so if “the AI” would own it, it’s PD
yes, that last part was a joke :P
Does it mean that likely there would be no copyright of generated new images and anyone can use it for whatever use cases?
In other words, if I have commercial applications that generate logos to print on T-shirts, then anyone can pick the logos from my gallery?
a funny detail is that if someone were to take a spray can, and paint the side of a building, then they own the copyright to it, and if someone removed it, then they who removed it are doing something very naughty. US law is a little wild sometimes :P
The interesting question is whether you can copyright a prompt, making a “raw prompt” a derivative work of a copyrighted prompt
Uh, that doesn’t actually list anything about model use; it’s just for the website
BECAUSE SOME JURISDICTIONS DO NOT ALLOW THE EXCLUSION OF CERTAIN WARRANTIES OR THE LIMITATION OR EXCLUSION OF CERTAIN CATEGORIES OF DAMAGES, SOME OF THE ABOVE LIMITATIONS MAY NOT APPLY TO YOU. gotta love the only important part of it :P
What is the copyright for using Stable Diffusion generated images?
The area of AI-generated images and copyright is complex and will vary from jurisdiction to jurisdiction.
Eh, vague
What if I get the output of SD, add a couple of details, and put a copyright on that. Is it possible?
Yes, so long as your details are independently copyrightable, though do note that someone copying regions containing only PD content would get away with it
Copyright requires original, creative work
The threshold for “creative” is pretty low but does exist
what's PD?
Public domain, i.e. works without copyright or where it’s expired
Yeah, the jurisdictional inconsistencies will be awful for years
It (DreamStudio) was released under a world-wide license:
(Just look at the DreamStudio ToS for more info.)
The license for you would run locally on your machine you can read on Git, or by just looking in the folder
img2img copyright , hmm. "can't copyright generated images" (sounds fair to me,actually). but then if i draw my own images, i can copyright them , right. Then if i feed to img2img.. whats the result.. my copyright? or not? :/ at least I'm possibly edging toward beleiving that I could use img2img generated images in a game i release.
also is there such a thing as an imagge license like this: "can only be used to train opensourced AI" - kind of like copyleft.
how do you know which version of SD you have?
Maybe not the exact solution you want, but you could always go with the tried and tested "not worrying". Especially if what you're making is for a game, then the game would be the work you want to protect, and even if it's determined that you dont have the rights to protect certain assets, you'll still have copyright protection on the more important bit, the game itself.
Take, for example, something like Gwent. Sure, CDProjektRed has undeniable copyright on all of the art for Gwent cards, but even if every image was in public domain (I personally believe) it wouldnt devalue the final product. They would still have whatever licensing agreement exists for The Witcher game adaptions, and they'd still have the rights to Gwent the Game 🤷♂️
right to be fair if hardly anyone notices it in the first place (very likely) .. its also unliekly that i'd have to worry about the finer points of this. especially if i can show that the images are infact my own designs, just detailed by SD
@bleak matrix I still don't know this. Have you tried making a game before?
tbh i'm unliklely to even finish the game, lmao. SD keeps the mere act of making games its own reward
Out of curiosity, what kinda game stuff are you making with it? I've been working on card-game-esque use cases since it seemed pretty straightforward 😅
Havent found any reliable way to get coherent assets yet, though I've seen some promising proof of concepts
i'm making an FPS. i've looked at it for generating textures, and backgrounds. Agree - its hard to make something coherent and useful. But i've been "img2img" produce interesting results just detailing and varying things . So my current plan is "i make assets at 128x128,256x256" , and SD upscales and varies them to 512x512,1024x1024
i also want to try running it on lowpoly models to texture them, not sure if that will work, but even making multiple views (like oldschool doom sprites) could be interesting, with enough view angles, maybe yoiu could approxiamte that in a manner similar to NeRFS or feed back into photogrametry..
and yeah i've seen people suggest that the best use of SD is infact just as "concept artist"
I guess I haven't paid too much attention to the texturing side of things
I could see that being pretty useful, but I'm a bit too addicted to Substance Designer that I don't think I'd enjoy using it for that kind of stuff.
right its quite possible that Substance Designer will remain superior to what i'm doing :/ but i am addicted to handrawing textures, so i'll see how far I get
If you're going for the ps1/2 demake style I could see the low poly texting working amazingly well, since the whole point of that is creating "too much" detail and then crazily downscaling it
Yes, I have. And yes, I have stuff on Xbox, etc. But I prefer not to give out personal info. 👍
Where can I find Dreambooth training examples besides training faces and styles? I want to see what I can do and what its limitations are before I upgrade my GPU for it
Other stuff I am under NDA
RIght what I make looks somewhere in that era, just with more lighting going on
You'd need to do the standard painful stuff with UV unwrapping and such to finagle the textures into place, but there's your "creative input" anyways. I don't think anyone would argue that creating a game texture out of public domain work means you abandon the game texture's copyright (assuming actual work was done in the conversion, and the texture isn't just a 1-to-1 of the public domain work)
I've wanted to mess around with the demake art style for a bit, but I was always put off by needing a more "realistic" style (I prefer hand-painting stylized models in Substance Painter, too 😅 ), but if I could set up a work flow to generate it, that would be a really cool way to work faster.
i'm not really an artist, i'm a programmer - my main focus is my custom engine , and i'm just improvising art as far as i can . I can draw a little bit. I did a lot of pixelart growing up (16bit days) . it seems like with "img2img" it should be possible to do something thats a lot better than "nothing", or the risk of trying to pay an artist for a game that is unlikely to make any money. I'm building for myself really. (and i'm aware "fair use" -personal use, show friends etc has already been ruled as ok by the courts)
I also am not really an artist, and should be a programmer, but I forgot to keep telling myself that over the years and have probably spent hundred to thousands of hours practicing art at this point
It's all supposedly for game dev projects, but then I keep making more art and keep not making games 
I wanna buy Alice and Sparkle
But yeah, it sounds like you're gonna be A-OK regardless of any legal decisions. I would be baffled if they somehow managed to stick some kind of license on AI-generated art assets that would immediately de-license any products that use them, and even if one day your game textures end up public domain... 🤷♂️ who cares
not a lawyer obviously 😄
It's funny that book made artists get mad lol
And I say this as someone pursuing a career in children's book illustration
You can find examples/models, etc in #1047197565365538826
Also, understanding how it works is fundamental to getting good results
Substance is great
Although I always enjoy seeing new programs, technology, and ways of doing things come to light
Hey, what matters is that you enjoy what you are doing, eh?
What kind of illustrations/genre/medium?
aye does anyone know if there's a way to select the og hires fix for automatic1111, I'm pretty sure it changed
from an img2img kind of thing to a straight up upscaler
can you use stable difussion without CUDA like a boss?
does anyone know who owns "Playgroundai?
I am trying to find a SD model that has the knowledge of Craiyon
I want a version of SD that knows a bunch of obscure game characters.
where i can see the version from sd?
I've been working on this for a while as a local install on a Windows system, as far as I've found, while there are ways to make it work, none of the other methods are as memory efficient as CUDA. In my case, I'm hoping to just gut up and buy an NVIDIA RTX A5000 by March instead of hacking my dual AMD cards into working.
machine learning community secretly being nvidia shills
I understand how it looks that way, but really I think AMD kind of ignored this market segment and are mostly behind on it.
then again maybe MY perception is skewed 🤷
Hey guys!
Do you have any articles or git repos you would recommend for v2.1
Somone know any good nsfw or anime model?
The Anime model I use is Eimis Anime diffusion
But it's one of the first anime models, so I think there are new ones that might be better
berry mix and protogen are my gotos
when using automatic1111, is it possible to have it generate using different prompts during a batch? So it would generate Prompt A for the first generation in a batch and use Prompt B for the second, Prompt A for the third, Prompt B for the fourth etc? or possibly just do them half and half, like for a batch of 4 do the first 2 as Prompt A and the last 2 as Prompt B? Or all of a batch using Prompt A but then change to Prompt B for the second batch, and back to Prompt A for the third etc? basically just some way to do multiple prompts without having to come back and start a new batch each time
Scripts: X/Y Plot, then select Prompt S/R on one axis, then put the variable part of your prompt as the first part of a comma-separated list.
that sounds promising, thanks!
So e.g. with the prompt foo bar you’d put bar, baz, qux in the script to run foo bar, foo baz, and foo qux
exactly what i was looking for, worked like a charm
Huh did something change about automatic1111?
I can't seem to find settings for "first pass size"
Guys, it seems that Unstable Diffusion released their model
is there a good tutorial for how to use embeddings?
Just published a video on the subject of AI & Copyright: AI vs. Blood Mouse — Disney, AI Art, and Copyright
The Copyright Association, representing big corporate interests such as Disney, appears to be making moves against open source AI image tech. A lot of artists are hopping on board after the Concept Art Association started a GoFundMe campaign to join the Copyright Association and lobby for copyright law expansion. The video explores some ways they could potentially try to outlaw open source AI models. Enjoy!
also I really dislike the newest update to Auto1111
I have the Upscaler set to none and no detailed faces and yet it's still doing a second pass on every pic
for no reason
they dont represent big corporates only tho
and if i was in their shoes id make the same step tbh
the reaction within the SD Reddit was absurd and delusional as expected too
to this news
what news
check out the video, i go over Copyright Alliance's official position paper on the subject of AI tech
just watching it 😄
artists joining the copyright alliance including Disney in it against AI art companies
ah, that's probably a good move
really AIs should only source from free use pics and whatnot
i mean i would join too for several reasons
part 2 of my video explores why that's a very bad idea for artists
hey guys, just tried using stable diffusion on Ubuntu Subystem on Windows. Could anyone help me? The process always gets killed and i rly dont understand why. Thank you guys
also ppl on SD Reddit share fake news or generally misleading and false claims
about Adobe and others making their own AI art generator
which is technically true
but
not how those people are selling it on Reddit
so artists wont cry about Adobe Ai Art generator
but anyway, anyone know how to make Auto1111 stop doing the pointless second pass?
second pass?
I tell it to do a batch of 10, it does 20 passes
like when you do detailed faces and it does a second pass on the face
but I don't ahve restore faces on, nor do I have the Upscaler selected, it's set to None
it's only become an issue since he added in the Upscaler direct into txt2img
can't post pics, otherwise I'd just show you
suffice to say, it doubles your batch times for no reason
did you know that there are 22 million people in america that are millionaires or make at least a million a year what's your excuse not being one of them
Procrastination
wasn't born a millionaire
I was born upper middle class and am a physician and I'm still not a millionaire 😦
(being born wealthy is a huge part of it)
fuck u got going on new year new money get yo ass to work
you can be one
I'd like to ask a simple question. How long does 20k steps for embedding take usually? Is it around 3 hours?
(Seems like a good way of knowing whether this is fine, my rtx2070 is nearing 81c as I'm doing this)
how can you tell which models can do depth2img? Like, I know the stable2depth one has to be able to do it, but can others like 2 or 2.1 do it? What about 1.4, or other diffusions like dreamlike-diffusion?
Guys, any colab notebook where I can use pre trained concepts/model?
Did anyone have any semblance of success training a Textual Inversion with Anything V3 through Automatic Web UI? Because so far it’s failing really badly for me.
Guys im trying to do a batch generation of 8
But I run out of memory?
I dont run out of memory without batch?
what is a 2nd pass?
how do you get the 2.1 bot to do a prompt?
dream : ?
I use batch count for that, I think batch size tries to run multiple at the same time
is it in auto111's ui? Also do you know why it takes much longer with high res fix upscaling then with regular extra's upscaling?
Yeah A1111, not sure about the highres setting. I feel it produces better results though
seems to take like 6x as long lol
But I think I need it because Im generating 512x960 images
also my dedicated gpu memory stays relatively high use after doing these gens
on your actual Stable Diff window, you'll see it go from 0-100% once and generate an image. When it's doing a second pass for whatever reason, be it restore faces, hires fix, etc, it will do a second pass instead of spitting out hte image in the folder. So a batch of 10 will get you 20 lines of progress bars filling to 100%
It's stuck at 5gb used after the batch of 8 with 1.7x high res fix upscale
I have everything off and no upscaler running, but it's still doing two passes per image, effectively doubling the render time
why does it double pass yours if all is off?
it's started since they added the built in upscaling on txt2img if you want
I don't know, this is what I'm trying to figure out
I updated Auto1111 after like 3 weeks
I just updated today also
Last batch gen it got to peak of like 7GB gpu ram used, now its at 9GB
It doesnt seem to flush whatever was in the ram 🤔
and now I get error: RuntimeError: CUDA out of memory. Tried to allocate 3.75 GiB (GPU 0; 12.00 GiB total capacity; 7.74 GiB already allocated; 1.78 GiB free; 7.77 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
grr seems completely random if it will spike the memory into failing and then the memory stays maxed unless I shut down auto webui
Been a bug since october..
Good work on this. Styles belong to everyone not just media corporations and individuals. The website techdirt documents many of these same tactics big media has used for decades to steal from artists and the public at large.
Art for Me, not for Thee
people are selling ai art? how do i get in on this
if theres a new tech wave hustle, i want in
its called nfts
@astral goblet it's here man, the number of things that have exploded in the past few months will only increase exponentially
making money there requires being part of the organized crime ring that's trying to launder money
and then hoping you get out of the ponzi scheme before it pops.
i want to get in to new schemes. not the same old scheme
well crypto is still up if you bought it before the pandemic :>
thank you, pls share it with artists who are upset about the open source tech. i really made it mostly to help give them a different perspective
I'm talking to some Anti-AI people, one person is rationally asking questions, the others are doubling down on "Art is not theft!"
In the end they want to stop "AI" from reaching any popularity, they are convinced "we will loose all our JoobsSSS!"
@sinful mango post that video in https://www.reddit.com/r/DefendingAIArt/ if you haven't
one actually told me "learn to draw!" I'm like, oh the dozen programming languages I've learned isn't enough...
I took a picture book illustration class in 2019. right before the pandemic
i have aphantasia, not complete but it's there. i can't visualize images in my mind. instead my primary mode is dialog. i describe everything very rapidly in my mind. like a very fast talker that i manage to keep up with
We made book dummies using traditional tools, and then the following years while I had free time I made dummies for two other books
it has always really prevented me from drawing
i can draw but i can't see any end product, so i'm working blind
The fact that an AI children's book got people heated on Twitter is just funny to me
i've lived the "not all people can draw" argument most of my life
not all people put the time into learning to draw well
thank you i will, i plan to share it on reddit tomorrow morning (i've found sharing stuff late at night things often go unnoticed)
i've got probably 100 sketch books of drawing. i been to visual arts college. i've put the time in. but aphantasia means a blank slate is entirely empty for me. I have a lot of ways aroudn that to still be creative, but it usually involves painting over "stolen" ip or other compromises i make in order to actually see what i'm trying to describe in my head. being mind blind is a huge hinderence that most just don't understand
I wouldn't say all styles are public domain, if you're an artist and developed your own style, a la the Sam Does Arts guy, building models specifically based only on your art to then iterate and generate more stuff that essentially is directly mimicking your work via direct sourcing is pretty messed up, but I'm clearly in the minority on that opinion so I'll leave it at that
@sinful mango 5:14 in rings so true, artists being given the illusion they are joining some grass roots operation
I was reading what an artist said on another discord server about how some artists are scared of the new technology. I can sympathize with them. This group that raised over $200,000 to lobby congress along side media corporations is not the answer. Big media has always stolen from artist and always will. Now they have convinced some artists to go along with them.
sam does art doesn't have a unique style. he has a style that he sticks too, but it's not his or original
@past vortex perhaps, but the primary tool of the beginner artist is to mimic advanced artists they adore
eh, it's still his work, and he shoudl have a say on if AI source his work or not
yeah tbh the entire situation there is really sus...
all digital artists are using AI already or will use it soon
style isn't something you can own, otherwise a corporation would just create a blanket style ownership mat and control it all
it's no different than if I'm in the park with my kids and some dude starts photographing my kids
as a parent I have a right to privacy for my kids
I'm glad to hear that.
you can't just use their likenesses in an ad if they happen to both be wearing Nike hats or whatever
i have compassion for visual artists as well, i was spooked by the tech myself when i first saw what it could really do. for them it must be much worse. that's why i tried to give them the hacker perspective of becoming one with the tech to enhance their own power
we need to give permission
copyright is about as limiting what rights you have as well as giving you rights over your work. one limit is that the style alone isn't yours
I think Sam's argument of "it should not be you ahve to opt out, you should have to opt in" is pretty fair
and reasonable
they may not be using machine learning, but they are using AI. But soon they will be using machine learning based AI tools if they want to work at a studio.
regardless, the actual works are
there are 100 other laws like "trademark law" that apply too. Sam Does Art may be a registered trademark, but i don't think it is
I think you're taking the right approach with these videos.
I can't take one of his pics off insta and then put it in my, say, calendar and sell it for money without his permission
No, but if you look at that pic on insta, that's fine
insta has a license to show it to you
thanks for taking the time to watch it!
to show it, not resell it or reprint it for monetary gain
If Stable Diffusion had trained on Wikipedia images, the artists would have had no say. And the world wouldn't have been any worse off.
yes. So looking at it is fine. The ai models don't store the image after it's looked
that would be an insane claim. 100s of trillions of images in one dvd? insanity
instead what we get is a small 4gb sized neural network configuration that is good at denoising images
the issue is if AI is sourcing its DB off of copyrighted image, even f the content generated is a new image, it was still based off of a copyrighted source and is taking modeling data directly off those source images, it's not the same as me looking at his work adn saying "hey, I think I'll make a drawing in a similar style"
because even though we call it AI, it's not actually intelligent
