#🏞|general-with-images
Cats would totally do this if they could.
"I made you a sammich! Oh by the way bwaauulwgh"
(This was unintentional, the prompt was "a cat making a sandwich out of bread, ham, mustard, cat food, and cheese", but... cat's gotta cat.)
Searge img2img
Yes it is.
☝🏻 Flux and SDXL were discussing how SD3 had far too much to drink...
SDXL img2img
Quick question, do you like it with the butterfly or without the butterfly ?
Without
Silence of the lambs ...
first image is flux, second i did with sd 3
Quick question!
Mojo
Is there any lora or lycoris for yoruichi pose
what is the yoruichi pose?
Search it, it's pretty cool
It's like cat stretching
Investing in crime ^^
A nutmeg grater, silk slippers, and a bodkin!
I'd invest in a scam-free & spam-free Discord experience.
what's a go to proper photo grain simulator ? anyone use anything like that that's solid ?
published my first public model - contains huge amount of research info : https://civitai.com/models/731347
Full Training Tutorial and Guide and Research For a FLUX Style Hugging Face repo with all checkpoints, prompts and more details > https://huggin...
20 seconds in Photoshop, Lightroom or Krita
anyone know what ai model this artist uses?
https://www.instagram.com/jasonscarecrowart/
the lighting is perfect and looks really realistic
which model to generate painting like this?
portrait for business man
hello everyone! would it be possible for anyone to point me to a good checkpoint to create translucent figures, humans and/or animals please ?!
flux dev, base model, does those just fine with the right prompt
downloadable from Civit Ai ?
think it's still only downloadable from the black forest labs github and huggingface spaces.
alright. Thanks for your time!
It's Stable Cascade with a fine-tuned checkpoint for Stage C (Invictus Redmond v1.1) and a fine-tuned model for Stage B from ClownsharkBatwing. Here you can find the workflow and the corresponding model links: #🏞|general-with-images message
https://civitai.com/models/618692/flux @royal monolith
Ducks
Got the missing better cascade? I think it was A
One part is missing in your linked text
yes, it's the ft vae HQ, don't know where I got it from though, you can use clip_l instead anyway
Thank you! ❤️
It's such a nice and friendly community here ... people helping each other ... I like that!
no i just wrote woman with ducks on water
Only Flux creating "hands" for shark 🙂
Good morning coffee!
Made the original idea dynamic to get many more versions. For me SDXL is not working because the shark is missing or wrong:
A beautiful {25|30|38|40|45} year old {natural|red|redhead|blonde|black|colorful painted} {long|short|cool} {curly|straight} hair model woman is relaxing in a {blue|green|red|gray|black} hammock, wearing a sexy {light|dark}-{green|white|black|red|yellow|brown|blue} {fishnet transparent|elegant|luxury|normal|cheap} {normal swimsuit|lingerie|sexy underwear|sexy swimsuit|one piece swimsuit|bra and panties|silk apron}. She relaxes with her perfect legs stretched out, set against the backdrop of a sunny beach, with palm trees swaying in the breeze and the blue {quiet|stormy|waving} sea shimmering in the sunlight. From the water nearby, an anthropomorphic {funny|creepy|friendly|attacker|angry} shark is {standing|jumping|attack} upright, wearing stylish {brown|green|blue|pink|funny} sunglasses and holding a glass of {whiskey|coke|pepsi|wine|beer|caffee} in one fin. The shark has a wide, happy smile, its demeanor laid-back and confident, as it casually sips the cup of drink while observing the woman. Shark flirting with woman, talk something {erotic|funny} to her and {smiling|watching} her. The scene is {playful|funny} and surreal, blending tropical {windy|stormy|windless|summer|rainy} paradise vibes with the humor of a cool, {drinker|drinking} shark, creating a unique, fun, and detailed full-body photo realistic shot.
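The {a|b|c} groups in the prompt above get swapped for one random option per generation. A minimal Python sketch of that expansion, assuming flat (non-nested) groups only; `expand_prompt` is an illustrative name, not the API of any dynamic-prompts extension:

```python
import random
import re

# Matches one flat {option|option|...} group (nesting is not handled).
GROUP = re.compile(r"\{([^{}]*)\}")

def expand_prompt(template, rng=None):
    """Replace every {a|b|c} group with one randomly chosen option."""
    rng = rng or random.Random()
    return GROUP.sub(lambda m: rng.choice(m.group(1).split("|")), template)

# Shortened version of the shark prompt, for illustration only.
template = "A {funny|creepy|friendly} shark holding a glass of {whiskey|coke|wine}"
print(expand_prompt(template, random.Random(0)))
```

Passing a seeded `random.Random` makes a given variation reproducible, which helps when one of the versions is worth keeping.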
🤣
As I see, both pics were made by Flux? 🙂
Yes
Kid shark like milfs (and singing):
what a crocodile 🙂
Flux changed to male ....
Maybe female shark....
As I see "he" have success 🙂
Not too dangerous idea to tie up the hammock to the shark?
I changed it to male 🙂
(too many images where the woman holds the drink instead of the sea creature. How do I hack the prompt if I want the drink in the creature's "hand"?)
I don't give buy for yourself:
As I see, the sharkcrab exists and attacks:
Generate a picture which is featuring with 3D scene, a white chinese buliding beside a jade river, with a gian glowing moon in the background. Jade and golden color theme, Chinese painting in the style of Pixar, C4D rendering, Minimalist Style, warm and peaceful feeling --ar 3:4 --personalize 78yex4m --stylize 550 --v 6.1
Here is your image...
...and guess what, I'm swimming calmly in the sea, suddenly I notice that the surfer in black is stuck in one of my protruding teeth, and I've been dragging the poor guy along for hours
SD3@ClipDrop
Flux accept any unreal mutations 🙂
ahah these are adorable and so detailed!
My favorite topic but for today I finish them 🙂
SDXL img2img using Mobius checkpoint
SDXL img2img using Mobius checkpoint
SDXL img2img using Mobius checkpoint
Used some already-used prompts from online but it turned out good
Nice! I like the lighting
yea looks like old school phone camera 
hi
Open .bat file and delete --dark mode
it follows your system theme, but you can force it to white with --theme light
Bored the shark:
how do I fix this? I already did the upcast setting
/img
?
you have a very old video card and can't use half precision. You need to launch the UI with --no-half as a command line argument. this is not an ideal situation. This doubles the size of the model in memory. fp16 vs fp32.
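Rough arithmetic behind the "doubles the size" remark: fp16 stores each weight in 2 bytes and fp32 in 4. A small Python sketch, where the parameter count is an illustrative round number and not a specific checkpoint:

```python
# Bytes used to store one weight at each precision.
BYTES_PER_PARAM = {"fp16": 2, "fp32": 4}

def weights_gib(param_count, dtype):
    """Memory needed for the weights alone, in GiB (activations not counted)."""
    return param_count * BYTES_PER_PARAM[dtype] / 2**30

params = 1_000_000_000  # hypothetical model size, for illustration only
for dtype in ("fp16", "fp32"):
    print(f"{dtype}: {weights_gib(params, dtype):.2f} GiB")
```

This only counts the weights, so real VRAM use will be higher in both cases.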
RIP to one of the Greatest Kings in history
okay thank you, but I can't type in the cmd, where do I have to put that? and I have an Nvidia 1650 SUPER, is that that old?
if you're using automatic1111 you'd open webui-user.bat in notepad, and add it to the line for command line arguments. assuming you use that bat file to launch your app. It's the recommended way to launch.
1650 is quite old for video card standards
Dave Prowse was the 'physical' actor wearing the Darth Vader costume - and James Earl Jones voiced Darth Vader. R.I.P.
Darth is awesome. The voice is iconic. Such a different vibe too. James switches the inflections completely! I'm glad Prowse's voice wasn't used. I've seen the raw take lol. Good times.
James was just one of the truest actors around. Such a great performer. Knows how to embody a character. One of my favorite moments was his cameo on Big Bang Theory and he was acting so juvenile, taking sheldon to ding dong ditch prank Carrie Fisher late at night.
yeah its old. 1650s were a budget line of cards produced after the 20s came out. Long before the 30s. the 50 series are almost here. You're back on a late generation 10 series.
Thats a monumental amount of time in the GPU space. Especially for machine learning purposes.
@topaz smelt i told you where here
oh wow. WOW. where is my head i answered that post already. you weren't asking again. doie
Time to go outside. Get vitamin D
ARGS!!
truth
🥳
lol wow
its quite hard to not trigger that effect sometimes
sure does look pretty noisy also
kind of old tv scan lines noisy
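Topic: "From Exegesis to Preaching" training class
Time: October 9-11, 2024
Location: Garden Hotel
Instructor: well-known pastor, Pastor Wu Rongchu
Content: includes in-depth study of the Bible, understanding its theological ideas and ways of communicating them, and practical preaching skills, to improve your leadership and the effectiveness of your pastoral work.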
Missing a photo of Trump on the wall
This one?
...or did you mean a signed photo?
😄
It would've had to have been great in the first place. It was only somewhat good.
Would anyone tell me how to get as good as the result that we get in Midjourney for this prompt
A moon with a large circular hole filled with glowing yellow electronics.
Details: Intricate details, photorealistic rendering, textured lunar surface, craters, soft ambient lighting, visible wires, circuits, and chips, warm yellow glow, cinematic lighting, depth of field, volumetric lighting.
Style: beeple, Greg Rutkowski, trending on artstation, hyperrealistic.
there is a secret method to steal an image
- use a captioning model to write your prompt
- use IP adapter to transfer the style
- use depth and canny edge control nets to transfer the composition
- use color transfer tool to transfer the color
stop using greg rutkowski in your prompts - not only did he make it very clear he hates that, but also his name really doesn't get you what you think it does
Why would he hate it o_O ? It only made him more popular among AI people who never heard about him lol
lmao
yeah, and i pointed that out to him. he responded with a nasty gram. he hates it. waste of free publicity in my opinion
his art was never in the dataset. like one or two bits of it. CLIP G and CLIP L, however, associate his name as a token with a lot of that kind of style. With that in mind, you're just referencing the general style of D&D gamebook art, ALL artists in the field and not rutkowski specifically, so you can see the name as a tool to aim for an aesthetic.
As long as someone isn't publishing it as "This is art by Greg Rutkowski" it's 100% in the clear and he can suck nuts
that being said, refines get far away from the base model's clip alignment
doesn't matter if his art is in the data set or not. he doesn't want his name used, so out of respect for another human being, don't use his name
the tool only works so well depending on which model you use
sucks to be him I guess. His name isn't really that valuable
no, it never was in anything but disco diffusion - but people tend to just use what others use
some people demand respect that's just unreasonable. Like some celebrities don't even want fans to look at them. I aint about that
so you're okay with us using your name as a prompt modifier then?
Now, another example. Gary Larson. He doesn't want farside published online at all. Deal. I'll never make a farside lora. Even though i love farside
100% . How could I ever be harmed by that?
you couldn't be, and it might have some cool effects with stable - but some people just don't want their personal stuff touched. have you tried using it, btw?
if you start publishing stuff as if i made it and you're acting like me, that's another story. entirely different context
well that falls under several fraud laws anyway
Some people are obsessive and think they can control the world. Their name is just a name though. Anyone can use it so long as it's not impersonating them maliciously. slander. libel. there are limits, but mostly it's just a name
yeah. The laws have been long established and rutkowski is trying to extend his name rights beyond those
not about that at all
imagine if JK Rowling said all people who had the last name rowling must change it and stop using it
fuck. that.
so being curious, i ran your discord name through flux dev and got these - what it would do in a longer prompt though, not sure
Keep using Greg Rutkowski's name. Since flux1 is a base model (distilled but still close to the base and default clip understanding) it works again. pretty well. Not as good as G though mind you
SD3 has clip G so rutkowski really fires it up there
i'm fine with all of those. it's probably the same as adding any kind of nonsense token though.
sure, but a nonsense token that, by itself, lands in a nice area of latent space
RNG
on flux dev it's likely more to do with aesthetic guidance
regardless of which specific mechanism, it's still a nice result. trying something a bit different
prompt: sunset by artist "nuuideas" - all have a nice art sort of feel to them. so i think your discord handle might be a good modifier
100% fine with that. I doubt it's working for the same reasons as rutkowski would though.
tokens are just tokens. They affect the vectors that aim into the latent dimensions. If someone says not to use a token because it looks like their name, i'm 100% going to tell them to jog on
nuu ideas are always good to experiment with
doesn't matter why it's working, really - unless you're wanting to dig into the wiring.
it's just aiming a vector. not really that big of a deal
me too, and just to make sure it wasn't a fluke, i ran it as the only prompt a second time and got really nice art sort of result. so i'm keeping this as something to use
sure, but sometimes it's nice to be able to aim at a specific target
its like a textual inversion embedding but found manually
yup. which is why not using a specific token because a person is offended by it, is weird.
prompt: sunset by artist "fartfacer".
sometimes a cigar is just a cigar
that goes into the realm of being polite. someone's name is someone's name, and if they don't want people to use it, out of respect for another person, don't use it.
brighter coloring, more saturation though
not quite as nice
courtesy only extends so far. when people ask too much, they're now the ones being impolite passive aggressively. forcing you into a position where your only stance is to obey their inconsiderate demands, or appear rude to them. Oh well.
and more of a vector look to them with sharply defined areas of gradient
it reminds me of the 8 bit digital art era
it does - that's another prompt to play around with. what if you change sunset to something else?
another image. i've got prompt game already.
might be good for a new side-scroller background set
though as i investigate, it seems "fart" is the token that leads stuff to a pixel aesthetic. interesting.
so is it tokenizing it to f-art ?
here's the result with a space in the middle. prompt: nuu ideas
still art, but in a different direction
yeh. you're kind of just doing the legwork to demonstrate my point for me. cool.
part of what i'm doing with this exploration 🙂
i've found a couple modifiers that flux really likes - stick the term yesteryear on the start of any prompt and see what happens
https://sd-tokenizer.rocker.boo/ says that fart is its own token on clip L
Informs you about how your prompt/words gets turned into tokens, privately. For Stable Diffusion models, CLIP models
fartfacer is f-art-fac-er though. But individual tokens don't matter so much when they're in a sequence like that
i'm not sure that page is using Clip-L though. all the options are the same
you used fartface not fartfacer, however - does it tokenize the same way?
no i used fart facer
i read it wrong then
fart face is the more common one thanks to thanos. but wall facers are an awesome thing so i sorta smashed it together
https://www.youtube.com/watch?v=z6hEfK5C-Sg for scientific reference
The business world can be cruel. Aired 10/18/08
yeah, it's a term that's been around a long time, usually coming out of the mouths of small children
didn't know they did a skit with it though
small kids know whats up. they're untainted by lame memes
which - strange connection happens in old memories - reminds me for some reason of a player i had in a DnD game who came to the session one night and said that someone had called his house and left a message on his answering machine with the phrase "the ice cream has no bones. The ice cream... has no bones"
so now i want to use the ice cream has no bones as a prompt
https://youtu.be/GCWIT4IGix0 more scientific reference
This is just madness I dunno what else to say.
youtube rabbit holes
gotta do the research
ya gotta pay the tax!
Cat on the table
prompt: Umbra pegasus on the top of a mountain, lighting crashing through the sky
Good morning coffee
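1. Baguette: long, large and dark brown, freshly baked, with a rough surface and traces of flour.
2. Model: wearing a bikini, with the baguette held in front of her chest.
3. Style: retro style, soft lighting, warm tones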
I took a pill in New York ... ... ...
boy
improve this house and make it a little more realistic
Done...
Ollama img2img
to create link for thread without images
Great concept, you should work in advertisement
I was making some images for a video I am working on and had SD make me T800 breakdancing at a disco. This is the result. Never change SD, never change! lol
lol
Epic!
@nimble mason flux really should be in a lab - prompt: Xylose Phosphate
Thank you! Unicorns are special fun 😄
Good morning coffee
The real reason why Unicorns went extinct
I know that one with dinosaurs from back in the days 😄
Are you advertised by Artstation?
wdym?
I got a mail from Artstation saying: Moofi published artwork "OR-I-DJINN"
Which model is it? I'm struggling to make phoenix =.=
possibly because you are following me you get notifications? Don't know much about how Artstation works in that regard
Ahhh... possible. I don't really use it. Too many social media to watch them all ...
🙂
would be great album cover
Funeral music for astronauts ...
dark trap 😡
https://youtu.be/SpDkEJk9QKE
Provided to YouTube by DistroKid
Stuck on Ishimura · Istasha
The Distortion That Creates Them
℗ Pet Snails
Released on: 2024-06-28
Auto-generated by YouTube.
or anything dark really lol
Haha, fits quite well 😄
@languid pebble
Self-portrait? 😄
whose? 😄
Yours
thought so 😉
can it do painted appearances?
Ollama img2img in ComfyUI
Ollama img2img in ComfyUI
Ollama img2img in ComfyUI
Ollama img2img in ComfyUI
Ollama img2img in ComfyUI
JuggernautXL DPM++2M Karras
really nice meal
anyone who can help me understand why this prompt never gives me realistic looking pics?
realism means art
you just want to say something like (photo:1.3) of a busy office space, people working
what does the :1.3 do? (im very new to this :))
it boosts that word a bit
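To make the (photo:1.3) notation concrete, here is a simplified Python sketch that splits a prompt into text chunks and their weights. It is an assumption-laden illustration of the syntax only, not the actual parser of any UI; nested parentheses and escapes are not handled, and `parse_weights` is a made-up name:

```python
import re

# Matches "(some text:1.3)" with a plain decimal weight.
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def parse_weights(prompt):
    """Split a prompt into (text, weight) pairs; unmarked text gets weight 1.0."""
    parts, pos = [], 0
    for m in WEIGHTED.finditer(prompt):
        if m.start() > pos:
            parts.append((prompt[pos:m.start()], 1.0))
        parts.append((m.group(1), float(m.group(2))))
        pos = m.end()
    if pos < len(prompt):
        parts.append((prompt[pos:], 1.0))
    return parts

print(parse_weights("(photo:1.3) of a busy office space, people working"))
```

So "photo" is emphasized at 1.3 while the rest of the prompt stays at the default 1.0.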
Flux dev
angry salami
You make me wonder what your prompts may be 😄
It's also with an input image of the 4080 here in its open case 🙂
@languid pebble
lol
🙂
btw you can check out stephen gammell if you haven't yet 🙂
Cool!
would be interested with what you come up
he's not the only one in this prompt, yet a pleasant starting point I feel 🙂
Children's book illustrator ...
yeah, well, definitely not, at least the paintings I'm referring to 😄
I never really did a lot of IMG2IMG ....
possibly it's about time? 😄
I'll check ... most of the time I know what I want ... but I add some wildcards sometimes, too ....
The most interesting part with AI is getting what you didn't really expect I feel 🙂
I'm sure there's a market for it 🙂
f markets 😉 😄
Arrgs ... need to check my workflow ...
create an image of a colorful mantis shrimp playing an electric violin as fish look on in amazement. larger than life, pixar style
Have a look here -> https://discord.com/channels/1002292111942635562/1237461679286128730
Whats your avg gen time?
Thinking about returning my 7900 gre and buying a 4070 ti super (not the same as 4080 i know)
I only have juggernaut xl as of now
So have not tested any others
might test flux soon, heard a lot about it, that it's great with text
Let me check SDXL quickly
Yeah it’s the best at text for open source and comparable to the best closed source.
@amber pilot would have to wait a bit because I would have to reboot for a clean value. Currently busy though
With multitasking programs open it delivers around 2.45 it/s
at 1024x1024
Yea thats the same for me with 7900 gre
wait
Windows machine?
that's with dpmpp SDE (karras), which takes about twice the compute of, let's say, euler_a, where it's 4.73 it/s
yep
Alright, would make no difference in me switching to a 4080 then, maybe better with a 4070 ti super idk really.
so you got 4.73 with euler a karras as well?
4.5 i believe
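For the "avg gen time" question, the it/s figures above convert to seconds per image once a step count is fixed. A quick Python sketch; the 30 steps are an assumed value, since nobody states their step count, and model loading and VAE decode are ignored:

```python
def seconds_per_image(steps, its_per_second):
    """Sampling time only: steps divided by iterations per second."""
    return steps / its_per_second

# Speeds quoted in the chat at 1024x1024; the step count is an assumption.
ASSUMED_STEPS = 30
for sampler, speed in {"dpmpp SDE (karras)": 2.45, "euler_a": 4.73}.items():
    print(f"{sampler}: {seconds_per_image(ASSUMED_STEPS, speed):.1f} s")
```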
Good morning coffee!
Very dark ...
Can anybody please speed-up this Ollama/Flux workflow?
Ollama/Flux img2img (took 30 minutes to make!!!) Aha! I changed to "Unloads Model" and now lickety-split!!!
Flux/Ollama - img2txt (Ollama) - txt2img (Flux)
Flux/Ollama - img2txt (Ollama) - txt2img (Flux)
Hi guys, I'd heard of a watermark remover that used Stable Diffusion (I think), do you have a name please?
I'm looking for a good watermark remover to train a LoRA Flux, since nowadays we even monetize digital images x)
You sure you don't mean copyright instead of monetize?
Hmmm make sense x)
If you can copyright AI Art - let the world know; as it is exceedingly difficult to prove agency i.e. "I made that art!", as a court often decides that it is more likely made by an algorithm
Not sure if he was talking about ai art
I'm looking for a good watermark remover to train a LoRA Flux
it's the unicorn of allergies hard at work
It's throwing up candies ... looks like it's kinda censored in Flux
ah. looked like pollen to me
They are way smaller 🙂
whats the difference in using t5xxl_fp8 and t5xxl_fp16?
fp16 better but more vram
Then there are GGUF models and text encoders, which are almost as good, and save a lot of Vram.
Ollama img2txt / Flux txt2img - all one process
Get a real job you scamming tramp
As a cat, you should know that's not a real fish. The girl part is just so that a man feels like they're intellectually superior to this person and can't be scammed. Men are easy to fish in like that
quanting the Unet goes a bit better than quanting the text encoder
can alternatively lower VRAM burden from text encoders by just unloading them
"Cat sitting on the moon, eating a glowing fish, with Earth visible in the background and stars in the dark sky."
Take one for free ...
Slowly figuring out how to make a good longer multi clip FLUX text to video gen lol
IMG2Video?
Ahh... got it. sorry!
Pretty much. Blue section on the left makes the image, rest extends the video
Shit, forgot it was a img 2 video lol. Gonna see if i can make it a text to video xD
Cool ... but needs a lot of time I'd guess ....
I'm looking for something better than SVD for IMG2Vid
add a prompt section in front of it to create the image, then have that image become the one you would have uploaded
I'd think it's better to separate the generations ... cause you don't want to animate a badly created picture
Theres no benefit for text2img
why? the animation could be cool
purz does that all the time and his stuff is very cool
But better use a good created picture for an animation than just a text prompt
As long as the text prompt has no influence on the animation ....
if its gonna take a long time
then yeah maybe want to make sure the input is good
Flux.GGUF_Q8_0
Easy to create 10 Pictures and animate the best. Harder to create 10 Animations ...
most 'not so good' have chunks that are fine, i just use those parts
Prompt section is here where it makes the initial image :P
Sadly SVD doesn't have a txt2img yet
Cause it's the worse concept ....
Flux.GGUF_Q8_0
doesn't need to. it won't know if you created the image first with that, and then you just ran the image into svd, or you uploaded an image
No, i mean, i got a text to video, but not for FLUX
TXT2VID only makes sense if the prompt has an influence in the kind of animation
Otherwise PIC2VID is the better solution ... like SVD
try cogvideox 5b, its the best open model rn and only requires like 5gb vram at the least. Image to video support isn't here yet but should come out this month(most likely next week I believe)
Thanks for the information!
I'll keep an eye on that
yeah here is the link: https://huggingface.co/THUDM/CogVideoX-5b
its not quite kling level but still great.
Oh wow, yeah just seems to have arrived.
At the moment I have pictures of a product willing to make a video ...
you can just try it here: https://huggingface.co/spaces/THUDM/CogVideoX-5B-Space
What does it mean by "negative values"
it thinks you're using photoshop, i think.
1-10 what do you give me for the repaired hand?
Good morning coffee
Can we restore old monochromatic images using Stable Diffusion
You could have a look on SUPIR ... not 100% sure if it fits to your needs ...
Thanks I'll try that
there's a recolor control net out there
SUPIR for an upscale stage would be fantastic too
you might want to look at inpainting for problematic areas
Do you know "Super Menteur" ?
🇫🇷 🥖
That's me, isn't it? 😉
haha, no, but I'll answer you with an image 🙂
Super Liar as an image?
So I just make a two word prompt ... I see!
Flux Q8
What is a Super Menteur?!
It's "Super Menteur" !
Now completely overtaken by his replacements, he's almost outdated.
Chirac! Ooh la la!
You may be too young to understand this reference x)
Sixty-eight
Seriously xD ??
Yeah
Ah, then "Mangez des pommes" ("Eat apples") will ring a bell x)
Jacques baby!!! 😄
Chirac on the back of a bear!!!
Right now I'm working on Super Menteur, he deserves a nice 4k wallpaper
Ah, for that you have to ask Bernadette
Bernadette who?
his wife xD, are you really from '68??
Yeah, but I've forgotten a lot!!! LOL
you don't forget the RPR
"The State? That's me!!!"
🥳
The same with Putin and we're good / up to date ^^
That's the whole artistic side of the photo: Chirac would be the last worthy president, the only one capable of riding a bear...
Like Putin x)
True ^^
Jacques Chirac Fan Club!!! 🥳
made it a little more crispy
but it's a virus, not food 😃
Good morning coffee
Had to show these off somewhere
Thank god not so bad here
donald trump in anime style, sunny v2
Prompt: https://civitai.com/images/29750620
"Snow Moon" - Oberon (Warframe) | Reimagined
Young Donald T. ...
@crimson dawn Well, i've somewhat gotten the gist of how it works, now to put it to the test 
512 x10, and depending on how much memory it uses, maybe by 10x more xD
512 sized tiles is default on forge too, but you can go bigger tiles. like if you can do 1024 images, do 1024 tiles
i think
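To get a feel for what the tile size means for the workload, here is a small Python sketch that lists the tile boxes covering an image. It is a generic illustration and not the code Forge or any tiled diffusion node actually runs; the 4096x4096 size and the function name are assumptions:

```python
def tile_boxes(width, height, tile=512, overlap=0):
    """Return (left, top, right, bottom) boxes that cover the whole image."""
    step = tile - overlap
    if step <= 0:
        raise ValueError("overlap must be smaller than the tile size")
    boxes = []
    for top in range(0, height, step):
        for left in range(0, width, step):
            # Clamp the last row/column of tiles to the image border.
            boxes.append((left, top, min(left + tile, width), min(top + tile, height)))
    return boxes

# Hypothetical 4096x4096 target: compare tile counts at 512 and 1024.
print(len(tile_boxes(4096, 4096, tile=512)))
print(len(tile_boxes(4096, 4096, tile=1024)))
```

Doubling the tile edge cuts the number of tiles to a quarter, at the cost of more memory per tile.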
iopaint leaves some blurred marks. What can I do?
super cool experiment . it is pretty great to work with. i've used it with flux a bit but those experiments got lost in the sauce
Yep, bigger is always better, especially when you're dealing with sizes where one square is still within the same "color palette" lol
Aye. I've been screwing with flux as well. Heck, thanks to ai-toolkit, i made my first lora! Never understood how to get kohya to work. And despite all my effort, haven't found out if toolkit works for 1.5/SDXL 
what's your issue with kohya? it's the dataset structure isn't it?
Yep. Way too damn overwhelming. Meanwhile ai-toolkit was just add images, add text per image with prompts, open its config file, name total steps (i use 100 steps per image, so 600 steps for 6 images), then simply fire up its .py file and its config file, and it's going lol.
this image from the other day was using tiled diffusion its good
this is downscaled to 1k but the image was 4k
Damn lol
@wispy nest What upscaler node/model did you use to shoot it up to a res the sampler dealt with?
CAuse iterative even on 6/10 steps i set it to is a tad slow lol
I always use adaptive token dictionary
yeah it seems that way. the toml files are powerful. All i do though?
I make a folder called "datasetname" and inside that folder i make one called "#_token". In my most recent one it was 5_robot. That tells it to repeat every image in there 5 times, or 1_token would make it do 1 image repeat per epoch. That's how others work by default i think, 1 repeat. Then inside that folder it's the same: txt files paired to photo filenames. To build datasets with multiple concepts, you'd put each concept in its own 1_token folder, or use the repeats to balance the set: 2_datasetwith10images and 1_datasetwith20images. sorta.
i just ignore toml files, use the folder structure, and the folder name is what trains when you drop captions.
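The folder naming described above can be sketched in a few lines of Python. This only mirrors the convention as explained in the chat (repeat count, underscore, token); it is not kohya's own code, and the helper names are made up:

```python
def parse_folder(name):
    """Split a folder name like '5_robot' into (repeats, token)."""
    repeats, token = name.split("_", 1)
    return int(repeats), token

def samples_per_epoch(folders):
    """folders maps a folder name to the number of images inside it."""
    return sum(parse_folder(name)[0] * count for name, count in folders.items())

# The balancing example from the chat: 10 images x 2 repeats, 20 images x 1 repeat.
dataset = {"2_datasetwith10images": 10, "1_datasetwith20images": 20}
print(parse_folder("5_robot"))
print(samples_per_epoch(dataset))
```

With those repeats, each concept contributes 20 samples per epoch, which is the balancing effect being described.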
huh, never heard of that one
Link me it's node
the kohya gui one automates a lot of the tedious stuff too
https://github.com/Phhofm/models/releases/tag/4xNomos2_hq_atd goes into the "upscale with model" and "load upscale model" nodes
Aye. That's what i've tried too, but for some reason, my last attempt was just either no lora effect, or directly one of the training images lol
Or no, not all of it, just #_robot xD
Most of the rest just confuses me lol
love that when you get the image directly back out
sometimes i love that
at least you know the training works haha. like a business that can't handle the volume of customers
Ah ok. What's the differences between nothing, 135k and 205k?
Yep xD
Ship me your SDXL and 1.5 config for kohya, and i'll give it a go :P Howmany images do you use normally?
I prefer the 180k (the default one)
Ah ok. So no numbers is the final version, and rest is just "opinion" based earlier "spitouts"
yeah
i dont really have a config saved. i just use the presets and fiddle with settings everytime. SOON||™|| i'll combine all my ideas and explorations into one preset and use it.
haven't done sdxl for a while. been training on flux since i got back into this stuff. Just recently i figured out that cosine_annealing is just cosine schedule with a minimum learn rate, which is cool beans
if that one is too slow, he released versions for drct-l, dat2 or mosr as well
they will all be faster, but a bit worse
they are in the same repo
adaptive token dictionary sounds hot af. but i dont know why it intrigues me so
must explore that later
its a 🔥 name yeah
I don't even know what cosine is xD
so, i can't really use a upscale model, because upscaling an image is gonna eat vram like mad, unless there's a multidiffusion for upscaling models xD
learn rate schedule. constant would be the same learn rate the whole time. cosine is going from what you set to 0 then back up, in a cosine wave, the sexiest of the waveforms.
annealing just bottoms out at a minimum setting instead of 0. all this time i thought it was fancier math that i needed to be in the right headspace for
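A minimal Python sketch of a cosine schedule with a floor, as described above. It assumes a single decay from the base rate down to the minimum with no restarts, and the learning rates and step count are hypothetical values, not anyone's actual training settings:

```python
import math

def cosine_annealing_lr(step, total_steps, base_lr, min_lr=0.0):
    """Cosine decay from base_lr at step 0 to min_lr at total_steps."""
    progress = min(step, total_steps) / total_steps
    return min_lr + 0.5 * (base_lr - min_lr) * (1 + math.cos(math.pi * progress))

# Hypothetical run: 600 steps, 1e-4 base rate, 1e-5 floor.
for step in (0, 300, 600):
    print(step, cosine_annealing_lr(step, 600, 1e-4, 1e-5))
```

With `min_lr=0.0` this is the plain cosine schedule; a non-zero floor is the only difference.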
oh I think you can tile for most upscalers
Curious if i can mess with the tiled diffusion node to simply change checkpoint model to upscale model 
I know you can tile them in Chainner
https://github.com/LabShuHangGU/Adaptive-Token-Dictionary?tab=readme-ov-file#fast-inference according to this, it's fast on cpu?
its very slow on CPU
oh i misread
I've done it before so its not impossible
ideally, you know the result before you hit bake. so it's something you could click before lunch
yeah I used to queue up a bunch of upscales and then just leave it
the Span or Compact ones are more suitable for CPU like this one https://github.com/Phhofm/models/releases/tag/2xNomosUni_span_multijpg_ldl
or this one for patient people https://github.com/Phhofm/models/releases/tag/4xNomos2_realplksr_dysample
What does it actually do?
it's just like ESRGAN but better
Ho, so it's basically "nvenc vs X265", where nvenc is miles faster, but much worse?
Ah, like when you use hardware video transcoding, gpu is way faster, but it's more a "ignore quality control, just pump out frames", while cpu is thorough
oh I see
its a bit different, cos you can run all of these models on either CPU or GPU
just like stable diffusion itself
For streaming, it's no issue, but it's when said transcode will be the new smaller compressed movie, you want it to look good when natively played, and not a movie to watch at the summer house :P
Gotcha.
You guys wouldn't know of a upscaler model tiler, would you? xD
some people have a GPU so bad that
their speeds are kinda comparable to CPU
Indeed lol.
Meanwhile i wanna do stable diffusion on my steam deck, so the tiled shit is definitely gonna help xD
I meant for comfy with upscale models :P
not sure, sometimes there is a node for things and sometimes not
Cause so far i've made 4-6 or 8k images even if it takes forever with these ones
Also, does Chainner have a resolution limit?
not sure, I haven't hit it
generally with tiling there shouldn't be
but AI apps don't have the most robust code
Nope lol
its likely a very high limit
since Chainner specialises in upscaling
the different codecs lol. i like that analogy. its not perfect, but i get where you coming from
Aye :P
Well, all that remains now is to wait for nvidia's announcement for the rumored "28GB" 5090, and whether it's worthy of its 2.5 grand pricetag so i can increase the damn iterations per sec lol
And the 5090 needs 600 Watt?
maybe yeah
Someone told me it's better to get 2 3090....
I don't think they will shrink 🙂
its more about whether it splits well
sd3.5 is API access only and it's not really "better" than flux. It's worse at humans, prompt following, text rendering. However, it does seem to look better and more stylish compared to flux without anything else, and it also has more knowledge since it's undistilled.
I think we are talking about different things. If I am not wrong with the new TOS they said SD3 wasn't really finished and they will release another version.
well 3.5 is sorta still 3
Is there a 3.5?
Yes, not open source but some people have api access.
Let's hope the best for Open Source ...
👋
I hope you are fine!
Best wishes to you
bizarre unbelievable action photograph of an inlaid grobyk poop beansbattle and statistics infographic eralian greebling wobodogger wearing beh urban scene twist horrific horrific splatter device : thrilled husband legendary artiststroseating homomorphic bean dissection sliced noodles schematic manuscript masterpiece scientific masterpiece bizarre repeating vandalhtaking high tech Hardware excellent use
so not only do you want them to have only a few colors, but you want them tiny?
I am building a gen-model for our product, it's called curtain light,
might prompt for a light brite
the curtain light contains only a few pixels, we wish we could come up with an end-to-end model to generate the low-res pic directly
you don't want an AI for that, that's the wrong tool - just write a script to draw to the screen
the pic i show u is from SD1.5, then resize ,color enhancement, etc
okay, well, it's still the wrong tool. it's like you're trying to use a bulldozer to do the job a shovel would be better suited for
you might try an LLM instead of an Image generator
Maybe
then what is the prompt when llm
we've also tried an LLM
i dunno. i'd go talk to meta and tell it what I wanted to do, and get its suggestions, then ask it for specific code and assistance.
but really, all you need is a script that draws colors to specific pixels on the screen - old school tech
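Something like this is what "a script that draws colors to specific pixels" could look like. It's only a sketch in plain Python: the 8 x 8 grid, the five-color `PALETTE` and all function names are made up here, since the real curtain light's resolution and colors aren't stated.

```python
# Hypothetical palette: the real LED colors are not given in the chat.
PALETTE = [(0, 0, 0), (255, 0, 0), (0, 255, 0), (0, 0, 255), (255, 255, 0)]


def nearest(color, palette=PALETTE):
    """Snap an RGB color to the closest palette entry (squared distance)."""
    return min(palette, key=lambda p: sum((a - b) ** 2 for a, b in zip(color, p)))


def blank(width, height, color=(0, 0, 0)):
    """Make a height x width grid of RGB tuples."""
    return [[color] * width for _ in range(height)]


def draw(grid, points, color):
    """Set the given (x, y) pixels to the palette color nearest to `color`."""
    snapped = nearest(color)
    for x, y in points:
        if 0 <= y < len(grid) and 0 <= x < len(grid[0]):
            grid[y][x] = snapped


def to_ppm(grid):
    """Serialize the grid as a plain-text PPM (P3) image for a quick preview."""
    height, width = len(grid), len(grid[0])
    rows = [" ".join(f"{r} {g} {b}" for r, g, b in row) for row in grid]
    return f"P3\n{width} {height}\n255\n" + "\n".join(rows) + "\n"


grid = blank(8, 8)
draw(grid, [(i, i) for i in range(8)], (250, 10, 10))  # diagonal line, snaps to red
print(to_ppm(grid))
```

The same grid could be sent to the light's controller instead of a PPM file; that part depends on hardware the chat doesn't describe.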
got cha, I'll try it on 4o or maybe the latest 1o, thx buddy
yeah, see what 1o can come up with.
it's o1 lol. i got it wrong
meta is free... just sayin'
Will it still block the use of living artists - their unique styles - in the prompts?
Flux1.Dev + LoRAs
No idea, possibly or possibly not. It should know more camera terms and maybe a bit more styles.
Flux1.Dev + LoRAs
Oh boi, this will take a while to convert SVD_XT_1_1 to tensorRT, and had it prepare it to be able to generate 100 frames 
they've been saying SD3 is unfinished for months. In the same time BFL trained Flux 😅
I don't think they are training sd3 in background. Maybe they do some dpo, but I wouldn't expect any big improvements here
Audioreactive Video Playhead system, now with real-time MIDI control + 21GB of new timelapses, and SD configurations.
LK + UBridge + Smartphone → TDAbleton → TouchDesigner
You can access these project files, plus many more systems, tutorials, and experiments, through: https://linktr.ee/uisato
Someone else loves TouchDesigner!!!! 😄
van Gogh-y type stuff - Ollama img2img, with Flux output
😄
I like the images here, may I ask what LoRAs you're using?
Draw a complete cell
Hi, and thank you for your appreciation of my work! The LoRAs are by andreac75 = Blue Future, Toy Camera and Wraith BW. Have a great day!
hello guys, Who knows what negative value is?
Negative Prompt? Everything you don't wanna see. But I'd try to leave it empty ...
Have a look here: https://discord.com/channels/1002292111942635562/1237461679286128730
img2img with Ollama, Flux + LoRAs
A pirate-themed Furby standing on comically tall peg legs, depicted in the style of an oil painting. The Furby is of normal size but standing on exaggerated peg legs, with stormy seas in the background. The mood is dramatic, with dark clouds and turbulent waters adding to the pirate atmosphere, while the Furby maintains its characteristic fuzzy, round appearance. The scene captures a sense of adventure and whimsy, blending the quirky appearance of the Furby with pirate aesthetics.
Borrowed from DallE Theme of the Day
"Minimalist rugged oil painting in faded earthy blue hues, capturing delicate details in vast solid patches. A weary and anxious queen wearing a night robe standing near an open stained glass window in the high tower of a castle, holding a candle in a simple holder. View from out of the window. She gazes into the starlit night, as a distant search party of horse riders at the bottom of the castle run away on a dirt trail into the trees."
You can try flux for free here https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell 🤗
These are great, I'm using Vintage Movie and I love it. lol
it's like a value less than zero, so imagine going subtracting value when you're already at zero.
"I'm looking for some ComfyUI flows that can quickly and accurately paste product mockups onto white templates (well, 80% accuracy is fine, I can do post-processing later). If anyone has this information, could you share it with me?"
some kind like pasting exactly the brand of left photo into the bare bottle on the right
A candle burning in the vacuum of space, with cosmic swirls of color replacing traditional smoke and flame shapes. The candle's flame merges with celestial elements, creating a blend of fantasy and abstract art, with vibrant colors and surreal space forms dancing around the candle.
Can I ask for general guidance on where to start replicating this process? The processed image is on the left; the original image is on the right. Basically, they turned 2D art into a realistic image while retaining many details of the original art (pose, color, facial expression). I am trying img2img with the epicrealismXl model, with denoising strength at 4, but it's not producing the same result.
Also, for reference for other images like this by the original author on Twitter.
https://www.sotwe.com/N0LS0L
See tweets, replies, photos and videos from @N0LS0L Twitter profile. 35 Followers, 0 Following. Do AIs dream of space hamsters?
https://t.co/O9wYy9cKHs
IP adapter with some realistic images as input would help
as well as injecting some noise before sampling
thanks I will try this now, I will search in youtube for how, never used IP adapter before
no way to tell without looking at the actual image code, and probably not even then
i can do that with photos in photoshop, or generate it with ai
someone is selling an ai and i wanted to know if he is just scamming because my father wanted to buy it
I’m leaning toward ai, human looks weird and code is formatted a bit weirdly too. Could be real tho but human looks a bit too smooth.
selling an AI? yeah - no.
your father can do really good, AI images, for free - he doesn't need to pay that guy a cent
I agree with crystalwizard, flux(open source) could probably create similar images, no need to buy an ai from Facebook, seems very shady.
yeah idk he was throwing weird numbers and comparisons with stable diffusion https://blog.skytells.io/avagen-a-new-era-in-generative-ai/
but in his linkedin he was supposed to be an ml engineer at apple so i did not know if he is legit or not and wanted some proof before telling my father that he is a scammer
who cares what he's supposed to be. not one of us is paying someone to generate AI images. but if he can't install it at home, there are sites like mage.space where he can get a free account.
don't pay the guy - he's probably trying to sell stable diffusion, which is free and open source. he sure isn't trying to sell apple's products, he'd be out of a job fast
ok thank you for the advice
Good morning!
they've got an official github and replicate so maybe they will post stuff on there if they make something
they just seem very small and new, its plausible they trained a small diffusion model but doesn't seem worth using
it only costs about $2,000 to train a model the strength of SD 1.5 from scratch these days
How many steps do you generally use? And you use euler?
I'm getting decent output (laughing at the horns tho) and using 20 steps with euler, but was curious what others did.
Yes, 30 steps, Euler and Simple
And I use the Wraith_BW LoRA
Do you see much difference between 20 and 30? I really don't.
I haven't tried Wraith_BW
I skipped the LoRA for speed and just bypassed that node
Okay, I mean, I keep it in mind but really, it is different but I don't know if it's better.
I'll check it out
I'm loving my workflow tho. I've got a slider node for my steps, an AI prompt generator, a 2d slider for image size and it's working great.
Good to know!
But you don't use B&W? How does it prompt?
If you turn the Strength to 1 in the LoRA - then all output is B&W
How do you attach the json to it to share, tho?
What is it about monsters with spaghetti in their mouth?
Try this for spaghetti-in-the-mouth output
Prompt = bizarre unbelievable action photograph of an inlaid grobyk poop beansbattle and statistics infographic eralian greebling wobodogger wearing beh urban scene twist horrific horrific splatter device : thrilled husband legendary artiststroseating homomorphic bean dissection sliced noodles schematic manuscript masterpiece scientific masterpiece bizarre repeating vandalhtaking high tech Hardware excellent use
Dunno - json is code, screengrab lacks the code
Although a PNG made with the workflow will have the same code as the json - if u d/load the PNG from the server
Yeah, that's what I was looking for. Mine is so spread out that it's hard to share and still see node names.
Cool - and yeah - too small to see 😄
Yeah, hmm
Okay, I only cut out the preview image and I joined the Reactor nodes into a single node, otherwise it seems good
Yours is by Searge - I love the nodes he makes!!!
Sure is, I love that. The thing I added that really helped with loras is text combine tho. I wish that was included in the Searge LLM node
The extra text is just for LoRA triggers to be included along with 'photo-realistic' that I want in every prompt
There are 3 Reactor nodes with accompanying preview image nodes, but that's pretty straightforward. I just load in 3 different images of 3 friends and if I think one of them would like it, I save it and pop it over
I left the strength on 1. heh
I'm new to having a good PC for images, and new to comfy, so I'm pleased
You're doing very well!
I'm liking this Wraith LoRA. I have photo-realistic in the prompt but it's very much an art style.
Yes, it needs careful prompting as it can "take over" the whole picture
To such an extent it is ignoring the details in the prompt
Can someone Image 2 image this?
I wanted it to be more photorealistic, but it always takes away the glow of the flowers
... won't be long ...
Incredibly scammy, the ceo photo itself is ai generated(probably by flux which is open source) which alone tells you enough. The text on his lanyard is gibberish which happens very often if you don't tell flux the text to generate, skin is smooth too.
It's a scam, just use Flux which is open source(you can run it locally or online). It's the most popular model right now which can generate great text, perfect humans with 5 fingers, and has very good prompt following.
A lot of scam offers online at the moment offering unlimited A.I. for one fixed payment. That can't work.
There are already so many free services that run flux, and a bunch of a.i models, no need to pay unless you are training or want to run models on virtual gpus.
I only generate local ... don't wanna be censored or my prompts being published ...
Why not?
I just found this prompt fascinating due to its varied and detailed output, and how it seemed to depict otherworldly objects and creatures. Some of the generations give a Voynich manuscript vibe.
It fits to my topic ... human - machine ...
Good one @nocturne oak
Damn cool! Thank you!
Or hot?
can someone explain to me what this means and why it pops up when i try to use some models? Im very new to this i apologize if this is a dumb question
Three finger hand too
Shadows too... This idiot didn't even try
Says his name is hazeem Ali but the text on the project says hazem AI lol
Anyone know what style this is? Found it on pinterest
similar style
https://imgur.com/gallery/retro-1T3iPkp
Did u do this in SD? If so, how? 🙃
I've made it with Flux, prompting only
"Create a text design for text: "XYZ" with each letter filled with "Your motiv" ...
Ho! Ho! Ho! I’m a mighty pirate!
This one I used m3t4ll0g0 from @vagrant birch (strictly speaking, it is a Flux creation. Getting m3t4ll0g0 to work in Ollama produces too much Alpaca!!!) 😄
Dolphin-esque
Enrique
what would be the easiest workflow for making a person from an image move around a pose i grab from another film?
I would suggest "Animate Anyone" as a keyword for your search. You would need some custom nodes but it should work.
packaging with a xenomorph skull in the meat department of the store
search where?
twitter first
Lmao
They didn't even mention which version of stable diffusion
🤣
guys, any idea what image-2-image workflow for ComfyUI I can use to transform screencaps of 3D models into concept artwork? Something that preserves details well, but also changes the style to a digital painting.
something similar to this maybe?
Try this Ollama/Flux img2img combo - the workflow is in the PNG (Open in browser)
oh I did not mean that specific style
just a workflow that would transfer source style to an image
does this work like that?
Yes, and you can also ask it to replicate a style
searched for Flux\Flux_LoRA_Wraith_Bw.safetensors but can't find this
ah think I found it
Wraith_BW can "swamp" your image - so to tame it - keep its strength quite low
no idea what to do with the image combo module
what image do I need to pull with that and where from exactly?
there's a huge list if I click value but not sure where it's pulling it from
think I figured that out too
but now I get this
what model do I need there?
this is the error btw
ComfyUI\custom_nodes\ComfyUI-IF_AI_tools\IFImagePromptNode.py", line 193, in describe_picture
raise ValueError(error_message)
maybe I'm missing ollama?
or selected model is undefined?
IF_AI_Tools
Select model - llava, or llama3 - d/load under Models at ollama.com
D/load the models by typing ollama run llava:latest
Turn OFF "Keep Alive"
oh ya I missclicked that 😄
erm... is this the command prompt method? I don't really know how to use this 😦
where do I input these commands?
