#🏞|general-with-images
1 messages · Page 134 of 1
Ah, I see, then where do I find the weight (what weight to begin with :-D) and "pick style"?
click on the ipa nodes and click the weight drop down
cat asleep in my arms so typing like shit 😄
with clownschedule B
can´t see anything alike (when right-clicking the IPAdapter Embed node)
ah, you are referring to choosing the weight type? Because they renamed it?
yep
like before it was style transfer SDXL and now it is simply style transfer (and strong style transfer)
yup
yes, thank you 🙂
can someone pls do mexican mike tyson
You'd have to give me AT LEAST 50 dollars to sleep with Mexico Mike
dreadnought warhammer 40 k
Hi. Tried using your workflow but one node is missing. Which one is it?
And where could I find sdxl ai creator checkpoint ?
go to comfyui manager, "install missing custom nodes"
Yes it usualy works but no nodes show up there
can you help with next pront:a photograph of a snake, with large ice wings, an iridescent skin, a hypnotic look, which makes it a mythical creature, primordial force, elegant, captivating, aura of mystery, tail with sting, is on top of a snowy mountain, clouds around him, with little light on his right side.
i think it's the extra samplers node
err, pack i mean
samplercustomnoise maybe
or maybe it's just flat out ksampler and your shi tis fucked up
I have that installed
Hi, can you help me with this promt: A photograph of a fantastic animal that is a western dragon, with angel wings, its body, translucent and dark, reveals its skeleton in a play of light and shadow. This being, its food is souls, it has an indomitable and mysterious power, it is a duality between light and darkness, between the celestial and the earthly, it stands guard on a rock on top of a mountain at sunset. -- ar 3:2
nah that's something else, haven't had problems before
have you updated comfyui in the last 24 hours?
Yes
last 24 hourslast 24 minutes
ftfy
Just tried again 😄
try just adding a ksampler and manually hooking it up in place of the dead node
Sure will try
Here is the image you requested.
Here is the image you requested.
Thanks you
Hello, can you help me with this promt: It generates a realistic photo about a creature, it has a large humanoid body with imposing musculature, crowned with curved buffalo horns that radiates a sense of power in a majestic enchanted forest. His large, majestic wings gracefully reflect the celestial light as they unfold. Masterfully, in her hands she controls fire and makes flames dance at will in the place where ancient trees and shadows move to the beat of ancestral magic. Her intense and penetrating gaze reveals the wisdom of ancestral beings, embodying strength, magic and majesty
Here is the image you requested.
Thank You 🙂
Hello, can you help me with this promt: An art piece in photographic style, full-body view, featuring a mouth resembling Orbea variegata with sharp teeth, the tongue like that of the Xenomorph's Facehugger, the head of the Xenomorph, and the body of the Demogorgon from Stranger Things, with a tail like that of the Xenomorph.
Your image is now loading.
Please Wait Patiently.

Here is the image you requested.

@jovial tiger In an old-world kitchen, a coal-burning coffee maker of polished ebony metal, with intricate silver filigree, commands attention. It's perched on a stone countertop, its firebox aglow with glowing coals. A delicate porcelain cup sits beneath the spout, catching the rich, dark coffee that drips slowly into it. Around it, the air is filled with the robust scent of coffee mingling with the earthy undertone of burning coal, as morning light streams through a nearby window, highlighting the steam that curls gently from the spout.
...almost 100% adherence
only thing missing is coffee dripping dirctly into the porcelain cup
✅ old-world kitchen
✅ coal-burning coffee maker
✅ polished ebony metal
✅ intricate silver filigree
✅ stone countertop
✅ firebox aglow with glowing coals
✅ delicate porcelain cup sits beneath the spout 🤨 catching the rich dark coffee that drips slowly into it 🤨
✅ morning light streams through a nearby window
✅ highlighting the steam that curls gently from the spout
insane
no seed hunting. dpmpp_sde is really good with sigma
@nimble mason A professional headshot of a Chinese woman in a business suit. She has a confident and approachable expression. The suit is a dark color, perhaps navy or black, and she is wearing a crisp white shirt underneath. Her hair is styled neatly. The background is grey, not distracting from the subject. The lighting is soft and flattering, highlighting her features without creating harsh shadows. The photo is taken with a high-resolution camera, ensuring every detail is captured clearly. The composition is centered, with the woman's face in the middle of the frame. The photo should be hyper-realistic, highly detailed, and high-resolution 16k. --ar 3:4 --v 5.2 --style raw --q 2 --s 250
Here is the image you requested.
Hello, can you help me with this prompt? An art piece in photographic style of a creature that is depicted in full-body view, with a mouth resembling the flower of Orbea variegata with sharp teeth, the tongue like that of the Xenomorph's Facehugger, the head of the Xenomorph, and the body of the Demogorgon from Stranger Things, with a tail like that of the Xenomorph.
Here is the image you requested.
Photo where a hybridization is made between the demogorgon from the Netflix series "stanger things" and the xenomorph from the horror movie "Alien"
Here is the image you requested.
Photo where a hybridization is made between the demogorgon from the Netflix series "stanger things" and the xenomorph from the horror movie "Alien"
Hi, can you help me with this promt: A photograph of a fairy, with brown skin, a bright green dress clinging to her body, she has translucent turquoise wings. She who is sitting in the middle of a magical forest, is greeting a spirit that looks like an orange fox. -- ar 3:2
Here is the image you requested.
Photo where a hybridization is made between the demogorgon from the Netflix series "stanger things" and the xenomorph from the horror movie "Alien"
Here is the image you requested.
Here is the image you requested.
Photo where a hybridization is made between the demogorgon from the Netflix series "stanger things" and the xenomorph from the horror movie "Alien"
Here is the image you requested.
can someone help me prompt makeup 
im trying to mainly get eyeshadow/eyeliner but no matter what weight or what keywords i use it all comes out badly or without any makeup
@arctic spear A person wears a Chinese female professional outfit. This dress is deep-colored or black, and she is wearing a loose white shirt inside. Her hair is neatly arranged. Pay attention. The lighting is soft and attractive, it highlights her features, but it won't produce a shadow on the eye. The photo is taken by a high-resolution camera, so it can clearly capture every detail. The photo should be realistic, highly detailed, high resolution 16k. --ar 9:16 --v 5.2 -- original style --q 2 --s 250
Here is the image you requested.
let´s say both are mechs ;-D
This looks more like one 🙂
@arctic spearA girl in a kimono stands on the beach and releases a goldfish
Here is the image you requested.
here they are
wasn't sure if you were doing any img2img stuff
i toss in crap from here all the time and see what i can do with it
awesome
are you still editing stuff with photoshop liquify? i still need to try that
yes, I still do, yet not with the recent ones, nostly for special images only 🙂
looks more like this though 😄 #🏞|general-with-images message
oh haha
Thing is I cant share images
Jelly fish drill
Excat the same problem here ...
Thank you! Found it!
Gold
Here is the image you requested.
How to generate these images gusy?
Can you help me with the next promt: Photograph of a half spider half woman creature, arachne-like, tarantula bottom, scorpion tail abdomen, huge mouth on abdomen with fangs, beautiful woman, blood on her hands, skeletons on the ground, dark forest background, dim light, cobwebs around--ar 2:3
Here is the image you requested.
good morning
Good morning 🙂
Made with SD3
Hello, could you help me generate an image with the prompt: Skin color---Olive-toned complexion; Hairstyle---Dark brown, chin-length bob with bangs; Jewelry---Large, sculptural gold earring; Photo style--- Close-up, high-resolution portrait; Lens--- Shallow depth of field, focus on the face; Background--- Simple, monochromatic pale blue; Light--- Soft, diffuse lighting with a subtle highlight on the cheek and earring.
lol My Youtube video has negative views ^^
pixart-sigma
pixart-sigma
Really happy with it so far. Even single subject images are way more prompt following.
im using llava 1.6 mistral with an input image then literally plugging the resulting prompt in the T5 conditioning
Haven't thought to do that yet. Sigma goes up to a 300 token limit instead of 75 from clip. Clip couldn't handle the longer output from llava but now this can. Neat stuff
exactly
I set it to a low token limit of ~200-250
and even those are extremely long
Hi, Can you help me with the next prompt:Photograph of a creature, inspired by the myth of the leshen in its version of the video game the witcher 3, has a body composed of elements of light poles, the size of a tall lamppost, emits its own light of a warm and dim color, wears black clothes and broken, his head is a cow skull with light pole supports as horns, werewolf claws with metallic texture, lives in alleys of cities, at night, only illuminated by its own light--ar 2: 3
portal 2
YaY!
Here is the image you requested.
crow sharing bathing
Need a picture? 😄
yes 😆
ahahaha 😆
Wait! Is that Crowman in the background?
Muahahahahaha
What is everybody's favorite model for prompt adherence, to get a result as close as possible to the input prompt? For example this prompt: "closeup of a grey cat wearing a blue suit, a red hat and a green tie is sitting on a white table in a room with big windows"
crows bathing lol
i'm doing some testing and I'm curious what checkpoints are usually considered best for prompt adherence
I didn’t expect that my words would be taken as a request 😃
We asked you 😛
Might be sd3
crow cant drink it
yeah but that isn't available yet
I have another one
Is it peeing? o.O
cat
Here's your picture
At this point, pixart sigma.
//
Here is the image you requested.
24 gb is 4090?
If not Notebook version
I don't know ... but it's the benefit of AMD to have more VRAM ... just not so supported by A.I.
i try chap gpt for tech support
chat)
it doesn't give a direct answer, but it does suggest interesting thoughts that you hadn't thought of.
Not sure whether it knows the latest models ...
Looks like only 4090 with 24 GB but there are AMD with 24GB for half price
Ohhh... see: Gigabyte GeForce RTX 3090 Gaming OC 24GB GDDR6X Grafikkarte GV-N3090GAMING OC-24GD Schwarz
But 3090 not really cheaper
4090 and 3090
yeah it doesn't make much sense to buy a new 3090 right now
Pray for Taiwan or prices will be higher 😄
one time i thought asus +nvidia give me quality...
prompt: "high quality, detailed photograph, woman"
I still hope so ... Asus Rog Scar Strix here ^^
and i have 2 dead motherboards and video card with problematic shadows
now i buy gigabyte
I won this from ASUS ^^
south bridge die
not electro?
Electro worth 7k-9k ...
i have electro bike
45KM/H ... so no E-Bike ....
50
You can tweak but not allowed here
is dangerous you every time test speed)
I asked ASUS for a 5k Coupon for their online shop ... I can't drive the bike ...
but is very fun
Big city here ... to much stress
only big problem with cells...
I prefer a new notebook in 1-2 years 🙂
Bike is beeing produced in my city 🙂
Another woman?
my battery fell into the water and some of the cells died
pixart sigma
Bad luck! There are companies that fix it ...
much cheaper than a new cell
Sorry for you
it really is pretty great. I'm loving the new composition over normal sdxl
yeah, it understands fewer concepts for sure, but the ones it understands it understands really well
i'm hoping a finetune can add a lot of those in just like with sdxl base
DJ Neelix at an OpenAir 🙂
@nimble mason @cyan shoal the one thing it's horrifically bad at, is hands. I'm trying denoising, I'm trying hand detailer... they're so bad that neither of these things can fix what pixart is doing.
yeah it's REALLY bad
worse than SD15
worse than SD15 base
i actually haven't seen it generate good hands even once
arms merge into each other too
if schedule == 'DPM-Solver':
if not isinstance(pipe.scheduler, DPMSolverMultistepScheduler):
pipe.scheduler = DPMSolverMultistepScheduler()
num_inference_steps = dpms_inference_steps
guidance_scale = dpms_guidance_scale
so I don't know if I'm going in the wrong direction, but I was looking at teh code.
it's mentioinging a dpmsolvermultistepscheduler. you know where I also saw that scheduler?
ella. so i wonder if we need that here somehow
huh
lol
i'm at the point where I have to throw out the depth controlnet because it's picking up the 57 fingers so it doesn't force the 2nd sampler to do bad fingers, with a 0.7 denoise to fix it all.
i think that might require some aggressive unsampling if it's fixable at all
probably the clownschedule
where is generate
@minor fractal
good to know whatever we get will be the lower quality. 🙂
but i guess that's the nature of it. when we have it downloaded then we can go nuts on it with comfy
exactly
I bet the magic workflow behind this is literally just highresfix (upscale -> img2img)
clownscheduler™ hah
probably. I thought I heard him say at one point that with sdxl there's lots of samplers. with sd3 there's mainly just 1.
hmm
but i don't know how true that is.
pixart sigma
hmm, that'd explain it
@nimble mason
Looks more like DJ Harry 😄
holy shit that's awesome
wonderful
Guys, if I drag one of your pictures to stable diffusion, can I see the prompts that were used for that image ?
I remember someone mentioning this kind of thing in one of the guides
But I can't remember
with comfyui yes
click open in browser and drag the png in
This is not like mid journey, you have to install on your machine
Thank you 
is this an extension ?
Why are you trying to generate such a huge image with a SD1.5 model?
SD1.5 models are trained on 512x512 images; some can go up to 512x768 or even 768x768 on occasion.
You're getting shit because of that.
Do you always give up so easily on things?
Post your settings, show another example.
i need to replicate this, let me show u
into this
need to do that exact same thing for 3 other images
Are you saying you need to add the stuff that is on the left & right sides of the first image?
nono, just enhance the character, add all the details on the face mask, details on the clothing
basically just upscale and make it super detailed like that 2nd image
the background doesn't really matter
So, you are looking for something called "generative upscaling". I don't use Automatic1111 much at all anymore, but I'm sure there are tons of videos on YouTube for A1111 generative upscaling. There's probably also a bunch of people here that use it to do that. The thing is, if you haven't gotten anywhere in a week and need it by tomorrow, you're probably not in the best position right now.
generative upscaling, okay thank you ! at least that's a direction i can follow to try and get what i need
Best of luck.
is this kinda what you have in mind
no ?
what's wrong with that one
here lemme upscale this one for ya
here ya go

a place that is not easy for these poor old people! ... 😀👍
is this better? i tried to keep the composition more like the original this time
sure, thanks
got any others for me to upscale
Can you help me with they next prom: "A creature from ancient mythology, its body is a fusion of the majesty of a dragon and a serpent, adorned with delicate, large butterfly wings, wearing medieval attire. Its body is adorned with resplendent scales, and its eyes are large and hypnotic. The iridescent and ethereal butterfly wings gracefully unfold in the wind, carrying with them an aura of mystery and charm. Its gaze is penetrating, a mixture of ancient wisdom and indomitable ferocity. Its hair is of a fantastical, long color."
1000x1000? weird resolution
workflow embedded
certain brands of frozen pretzeldogs are capable of asexual reproduction
Meow Wolf would be all over this stuff for an art installation
nifty there
The brother of the rabbit from Monty Python's: The Holy Grail. Dude's jacked... Jokes aside, I like that hybrid of a rabbit and a panther
that one is awesome
next
Neelix Sister always says he looks like Harry. That's why I prompt him. Good eyes!
Looks like my A.I. doesn't know GLaDOS ...
here we go
ok
sick
lol
this thing is gonna be stupid with nature
I dont think I want to fiddle with anything
this might be perfect
whoooops
lol
I mean.... common.
lol
very cool 🙂
993 911 Speedster with wide body driving fast on the street, Car is blue and has HRE wheels. Location Times Square New York in the background, dramatic lighting , rain is falling and lightning , --style raw --ar 9:7 --v 6.0
生存一张星空的照片
It looks like infinity with selfcreating holes ...
993 911 Speedster with wide body driving fast on the street, Car is blue and has HRE wheels. Location Times Square New York in the background, dramatic lighting , rain is falling and lightning , --style raw --ar 9:7 --v 6.0
Design an GPU with 'ionet' text
lovely ,3d,ai robot.
hello, does anyone know how to generate different emotional face angles from a base image?
in batches
around 100
example:
Can someone help me with a prompt template for garment generation with specific fixed attributes required in Fashion Garment industry.

what kind of specified attributes? youre best bet is probably going to be some regional prompting and inpaining
wildcards/lists (few different ways to go about this) with different expressions (or run images like that through lava or clip interrogate or something for the expression prompt), do the same thing for pose/angle. probably is a way to tie in control net open pose or depth or similar with this, but im not sure how. add xyz plot, columns for angles and rows for expressions.
or vice verse
8kx6k. absolute overkill upscale 🙂
so many weird artifacts when zoomed in
Get a lora of said person and generate a list of expressions with chat GPT, then feed the list into your prompt as dynamic prompt and let it generate a grid
yeah thats probably way easier than the way i said!
Look at the eyes lmao
You could even fix the seed, that way only the expression should change and the image should almost stay the same
i saw that, i got one even better, his bowtie
Hairy
Hairy eyes ... and I thought the song was about hungry eyes ...
im dying 🤣 its so detailed but so wrong
/help
sighs i guess its time to ditch tilingfor the 8k pictures. its always hair , faces, or eyes all over the place
do you know any cloud platform with these available tools where it can make the process faster?
A painting depicting two stylized figures in an outdoor setting, suggestive of a street or café. The figure on the left has an elongated body, dressed in a dark suit and wearing a fedora. The figure on the right, also with an elongated body, wears a flowing dress and a wide-brimmed hat with a red ribbon. Render the scene in a style that mimics a detailed mosaic or stained glass, with vibrant colors and intricate tile work, offering a stunning visual complexity and transforming the artwork into a visually rich composition.
Im not into cloud services sorry, I was throwing ideas for local installs
You could try lower denoise, not sure what you used to upscale
theres some websites that let you use loras, but your control is going to be limited. as for the prompt, just use chatgpt i guess. im pretty sure there is a free version
yeah i might lower it a bit, but it was pretty low already. im about positive that tiling VAE is responsible for the artifacts. it was supir
which model would be good with lora?
its a delicate balance with noise on there when going that high up in res. need a LITTLE bit to get details, but too much and you get luscious eye lashes and hairy bowties
Depends on what your character is, anime needs anime model and realistic stuff well...
Which Supir workflow are you using
for now anime and realistic
standalone local gradio app. i havent run it on comfy backend yet
Ah was about to say
Then Id check civitai for the top models respectively
yeah i am currently checking it there
have you run on comfy? the thing im skeptical about is that it will increase gpu usage, and with 20gb vram and fp16 settings, 115 steps, 8kx6k, and llava, that took nearly an hour. im not sure what benefit would be had by running in comfy
Beyond 5k you seem to need FP8
Im using a 4090
and that was with tiling vae 😮
thanks guys for the help
115 steps is too much imo
Like way too much
id agree normally, but i was aiming for ridiculously overdone detail. just, not necessarily with the oddities i got
I doubt that anything beyond 30-40 changes a lot besides render time
ill try lower steps, lower denoise, and fp8. is odd though, i normally aim for around 100 when inpainting
Thats the comfy workflow I found yesterday, not mine

the weird thing here, and one of the reasons i used slightly more than normal denoise, is around the mouth where i did some editing in gimp. had assumed it would get fixed. so, somehow hair gets added to eyeballs but a little blurriness goes untouched
amazing
Attributes like - Create a female model of height 5'8 is wearing a size S Top of - Fabric/Material: White Viscose Rayon, Pattern/Print: Monstera leaf print, Leaf Print Motif size : 2 inch, Leaf Print color: Teal #008794 and #90E4C1 on white color base fabric, Cut: Shirt Style, Fit: Regular, Occasion: Casual, Weather: Summer, Length: Regular, Neckline: Mandarin Collar, Sleeves: Three-Quarter Sleeves, Hemline: Curved, Closures: Opaque Buttons, Background Color Palette: White Studio Background, Lighting: soft diffused light, Composition: symmetrical balance, Perspective: Two point, Focus: shallow depth of field, Lighting: Studio white, Color Palette: Bright Catalogue Studio, Activity: Model Shoot, Camera Techniques: ISO: 100 Aperture: f/1.8 or f/2.8 Shutter Speed: 1/125th
not sure you will be able to get that specific with prompting alone.
Can you suggest some alternatives?
studio photoshoot clothing model advertisement, aim for getting it close with prompt & controlnet for initial generation (pos prompt:female mode,slim,(try diff words for specifying size like 2 inch, not sure if it will work) teal leaf prints on white shirt, casual summerwear, white background,soft diffused white light( you can specify lighting direction around here), shallow depth of field, (you can try adding your camera specs in too but its not going to be 1:1 with a real photo
controlnet for the pose on that first gen
inpaint the finer details in like collar, button opaqueness, hemline etc
can I turn an image like this into (for instance) a painting by van Gogh, using img2img and a LORA?
you can try specifying that stuff in prompt but i doubt youre going to get every single detail how you want just doing that, hence the recommendation for inpaint. if youre struggling with that part, try editing in somerthing like gimp. just basic stuff, like if youre changing the neckline you could use the heal or clone tool and make a very rough neck line fix, and then run it through img2img with low denoise or inpaint with same
1.5 model, controlnet inpaint , independent image upload. clip interrogate your pic and use it as prompt. delete anything from interrogate related to detail, photo, realistic, etc. add in oil, painting, artwork, (in the style of van gogh:1.(whatever))
inpaint and controlnet are always the answer
can I use SDXL?
You can use the COSXL edit model to prompt a style
thanks
But that requires a special workflow.
generally the case, but that's different with res_momentumized
highest quality images i've made yet have been about 400 steps (200 for initial generation, 200 for upscale) using this noise schedule via img2img
Were talking about SUPIR tho
Can anyone create an image with this prompts
Gamer playing Water game,hands over the white table, Realistic Water Wave with surfer coming out of gaming mobile phone, Animation,Ultra realistic, Horizontal mobile, both hands
I want a more good image can anyone 🙂?
"I would like to create a poster for Wahaha AD Calcium Milk. The poster should have a vibrant and attractive design that catches the eye.
At the center of the poster, I would like to see a large, prominent image of a refreshing bottle of Wahaha AD Calcium Milk. The bottle should be positioned in a way that highlights its shape and design, making it the focal point of the poster.
Surrounding the bottle, I would like to see a variety of bright and colorful elements that complement the overall design. These could include splashes of vibrant colors, abstract shapes, or even illustrations of happy children enjoying Wahaha AD Calcium Milk.
In terms of text, I would like to see the brand name "Wahaha" prominently displayed at the top of the poster, with the product name "AD Calcium Milk" below it. The font should be clear and easy to read, and the colors should contrast well with the background to ensure maximum visibility.
Finally, I would like the overall tone of the poster to be upbeat and cheerful, reflecting the refreshing and enjoyable nature of Wahaha AD Calcium Milk."
here your image
your muhahaha valhala milk
.
Yeah but where is mobile 🙂?
ai ignore promt)
Can anyone create an image with this prompt female creature from ancient mythology, her body is a fusion between a dragon and a snake, adorned with large and delicate butterfly wings, dressed in medieval attire. Her body is adorned with resplendent scales and her eyes are large and hypnotic. The Iridescent and ethereal butterfly wings unfold gracefully in the wind, carrying with them an aura of mystery and charm. Its gaze is penetrating, a mix of ancient wisdom and indomitable ferocity. It is found in a fantastic forest with exotic flowers
Okay thank for your try 🙂
Leave that I have generated with Leonardo
hello, can you help me with the next promt: full body photograph, heroic pose, mermaid woman, blue scales, with crimson red dragon wings, fully extended, slender body, white skin, short black hair, round face, small thick lips, large black eyes, slender hands. in 2:3.
No.
I've noticed lots of ppl are coming in with that same exact quote, but with different prompts
Starting to wonder if it's bots generating shit for some paid service, idk... Would be pretty funny though considering what they tend to get from us
It's like they come with Midjourney prompts asking us to change them to SD prompts ...
give them crow sharing bathing 😆
Hello, can you help me with the next promt: Mythological creature that combines characteristics of a dragon and a crab. I want it to be giant, with shiny scales and sharp wings. The creature should be in a pose that conveys aggressiveness. Use mostly purple and yellow. The environment in which this creature is found is one of war--ar 3:2.
Hello! Could you help me with this promopt: Photograph of a Cyber Tiger, white tiger, white armor, metallic claws, white fur, black stripes, neon lights, unicorn horn, mystical aura, intense and penetrating eyes, reindeer antlers.
i need some bokoys boxers
First time SUPIR worked here
TBH it was a test how to create a guy with shorts hanging at his knees. Tried that cause I wanted him to sit on a toilet ^^
hello, nice to meet you. I read your post that are looking for gradio dev. I have rich experience that have built gradio app. I want to work with you.
here your image what you requested
Here is the image you requested.
Here is the image you requested
japan animation
had to look it up, not seen then movie and weeeeeell... 😄
they couldve done a better job casting
Here is the image you requested.
hello, can you help me with the next promt: female beast, she is a fusion between a dragon and a snake, she has large and delicate butterfly wings, she has a floral outfit. It has glowing scales and its eyes are large and hypnotic, along with long glowing hair. Butterfly wings are iridescent and ethereal. It is located in a fantastic forest, the color palette is fantasy, full of exotic flowers.
Maybe we should stop kidding around and not answer ... I don't really feel comfortable with it ... but we can't help them all ...
Imagine it is question bot and anwser bot
clownashark can help all)
you bot?
Here is the image you requested.
Here is the image you requested.
its bots for train midjourney 😄
No NSFW 
iron man
extra leg
he is spiderman
Long Dingdong ...
@jovial tiger stable diffusion 3
fell asleep in discord on the keyboard when I woke up it froze due to the fact that 8000 characters were entered there
you should see what happens when you use it as a prompt
you got it yet? 🙂
@cyan shoal @nimble mason so I've been playing with trying to get better quality and faster inference with pixart sigma. I may have stumbled upon something that's fast AND good.
that image is raw, with no upsampling or sdxl refiner.
naaahh
I would tell you guys
if I were allowed to tell you guys
but if I would be then absolutely
if stability allowed me I would also do one of these "I got access, give me prompts"
interesting, and it's ddim too
like generic ddim so its also faster than res momentumized
any cfg rescaling?
yeah, but it's got its own scheduler. batwing lives for schedulers.
I'm doing lots of upscaling etc afterwards, but I was just trying to get stage 1 down.
i still need to do prompt following tests first.
nice
a few interesting noise schedules
yeah ok fine, closeups look amazing with that ddim. arms look all messed up. sigh. can't win with this mdoel.
model.
A priest, he is sticking his fingers into a light socket, electricity is blowing out his face, he likes tree frogs and the Bible, pancakes
maybe the first chunk of denoising with another sampler, then switch?
That man is electric
you didn't specify if he was a man
A duck, walking up to a lemonaid stand, he says to the man, HEY, got any grapes? 4k, cinema, bum bum bum
epic
yeah no text for pixart sigma and SD1.5 ELLA
somewhat surprising from pixart sigma, but its not like they promised text at all
I understood just fine
but the rest of the prompt was nicely followed
Then he waddled away.
I was about to compared this to muse but holy hell, Muse is 3 Billion paremeters, close to SDXL
World war 3, ducks are fighting cats, the cats are taking over the litter box, the ducks have balloons, the fat kid needs to poop, it’s a sunny day, plenty of mushrooms, who’s hat is this?
and bigger than SD3-2B
This is how it starts
ww3 was always gonna start with balloons.
it seems to think ww3 equals a kid in a old timey fisherman's outfit.
Ducks fighting baby kittens, the ducks are holding balloons and the kittens all have catnip, giant smiling faces with moody peanut shaped clouds, 8k cinema 35mm film, Hot and scratchy
Well hmmm
"the aristocrats!"
It is the world record dart throwing competition, the crowd is massive and they are all silent waiting for the award winning dart to be thrown, a kid farts very loud as the dart is being released, the man pees out cotton candy for the kid, everyone is please with the gas, unicorn
Please indeed
that is awesome
A Kitten is addicted to catnip, the dark army descends on the streets of New York lighting up all they see, cinematic, close up, f/1.8 aperture, mirrorless, iso 400, slight color grade, skeletons of the dead unicorns
This is how you win awards
I think that's too random. doesn't give me much more than a cat
Its the close up stuff I bet
@nimble mason this has been supir'ed
pixart sigma actually has tiny subjects, like REALLY tiny ones that still have clear details unlike regular sdxl. so supir actually makes sense for this.
interesting
this is a pixart sigma one and a 4x upscale
it's almost great but the structure needs some clean up
comfyui could really use an image browser...
that's 3.61 i think
close to 3.56 which is my monitor (5120x1440)
a really aggressive denoise with res
that's really good
looks very good, just... some messed up shit and lost some control over the composition
i was using ipa tiled with composition, high strength
yeah, looks really nice
i'll try the clownsched next
200 steps, ready or not here we come lol
wow.. these really wide aspect ratios are a way to cheat more resolution out of this thing
hah
yeah
if we can come up with a way to stabilize the composition and upscale these things... it'd be pretty great
pixart's the only model i've seen that generates >3 AR images without repeating elements
awesome
another thing i'm wondering about here... maybe a single pass of a very light clownsched on these would be the cherry on top
at the same time though it might fight the original comp
i've found when it wants to go a slightly different direction, you end up with a fuzzy image
not badly blurred, but a subtle effect, looks almost like low cfg
4.5-5 or so
heavy clownsched
oops, i added noise
so is that pixart or just regular sdxl?
this is sdxl
pixart seems to not handle upscaling well at all
tiled ksampler... only did one test, but eww
got this
right
Yeah everything I've tried to do with pixart for upscaling has been awful
without exceeding the bounds.
Yeah for sure same experience
The exact depth model is huge too
Haven't tried many
But I know I've noticed that
want my flow?
Sure
should be in there.
of note, there's 2 positive text boxes.
if you know of a way to have the text from one auto copy to the other, let me know.
There is I'll show ya in a min here
i have 2 because they each have their own source of clip or t5 in this case, so can't usually share
oooo this turned out pretty nicely
you want the node pack comfyroll studio to have that same node
but just right click on the text encode nodes, convert text to input
ok cool, thanks
np
ok, so i have a "positive" node as part of the fooooooooocus nodes. so that coupled with the convert text to input should do it
yeah anything that spits out text/prompt types will work
also, btw qrcodemonster can also be helpful for stabilizing a composition
as can the optical illusion one
that's awesome
a menacing squid, positioned as a battlefield commander, with its tentacles gripping a large cannon that shoots rockets. Its eyes gleam with a mischievous laugh, set against a dark, smoky background.
expanded of course.
as expected, i can't quite get teh cannon to fire anything
diffusion_pytorch_model.safetensors: 9%|███▊ | 31.5M/335M [00:26<04:01, 1.25MB/s]
diffusion_pytorch_model.safetensors: 0%| | 0.00/3.46G [00:00<?, ?B/s]
model.safetensors: 5%|███▏ | 73.4M/1.36G [00:27<07:20, 2.93MB/s]
diffusion_pytorch_model.safetensors: 2%|▉ | 83.9M/3.46G [00:27<16:10, 3.48MB/s]
i swear, i want to throttle any dev that does this shit
matteo is great cuz he refuses to do it
now i'm closing comfy and restarting
doesn't rename their stuff from the default? yeeeeeaaahhh
that, and auto downloads
my connection isn't great
so right there that means comfy is outta action for 30 minutes minimum
i'd like to actually be able to download the shit in the background via my browser, instead of having comfy locked up that whole time
PixArt: Not using xformers!
Expect images to be non-deterministic!
Batch sizes > 1 are most likely broken
just noticed that
and i just remembered reading that with pixart alpha, image quality with xformers was supposedly significantly better
looking pretty good now
unsampling version
interesting... it's less faithful than simple gaussian noising... prolly cuz res does so much crazy shit with each step
which looks better to your less biased eye?
out of the last two
also, do you have xformers running with your comfy env?
are frogs supposed to have 3 or 4 toes on their back feet.
because other than a little bit of leopard spot difference, that's pretty much the only change.
no controlnet, noise only, composition strength = 2
hah at the graffiti'ed island house
you know what is it?)
idk
😃
@nimble mason A bottle of white wine is placed on the table. It may be a clear and translucent white wine, such as wine or other fine wines. This drink is often associated with relaxation, celebration, or socializing. The wine bottles are placed on the table, which may be made of wood or other materials, giving a sense of stability and neatness. Through the window, you can see the outdoor scenery, which is full of spring and vitality. The windows may be glass, clearly reflecting the view outside. The scenery outside your window might include green leaves, blooming flowers, a sunny sky, and perhaps fluttering butterflies or busy birds. These elements convey the warmth and vitality of spring. The quietness of the room and the white wine on the table contrast with the vibrant scenery outside the window, showing the beauty of the changing seasons. Create a 9:16 poster
Here is the image you requested.
as always there is a small nuance... 😄
whats does res mean here? i have an idea from context but not sure 😄
@tired basin
i hate when my arms turn into ps1 graphics. always at the worst times
what in god's name happened there?!
dunno, i put in your settings XD
forge
freeU 1.1, 1.1, 0.9, 1.1 then
this is the sort of thing I was getting yesterday with it
a bit grainy and theres consistency issues with the pole and the hand, but otherwise looking pretty good
this is without prompts
dang.. not shabby
the meerkat was Meerkat, fish-eye lens, macro photo with a wide angle lens, animal close up with wide field of view lens
swap meerkat for any baby animal and it works awesome, no neg prompt req
hey a girl popped up this time
yeah, I try to keep the dataset balanced so it doesn't skew either way, and definitely doesn't skew nsfw like some do
I'll let you be the judge, but you can see why I threw it into the mix!
lol, I typed in Pepe
haha.. that's not a frog, nor a frenchman
im having touble getting a unicorn, its just doing fish
da fuq
LOL
🙂
I mean that is cool, but it is not a unicorn
ye, im having trouble getting the unicorn
oh ok, so I had to add the word rainbow to get it
we need 4 more of those
last one
How would I get the ai to make pixel art simliar to this style? I've already done the pixel art stuff I just need help getting it to follow that pictures theme
oooo, thats a good image to make
well this is atreu acid
OMG LOL
wow, godspeed is interesting
idk, I never ticked upscale
draw a picture with cute penguin in a egg
heres an upscale one
which channel we should try
not a unicorn
Nature
/drea
@minor fractal @jovial tiger
At least the arse can fly 😄
So we'll get increasingly better base models maybe
But at least SD 3.0 won't be delayed
But it won't be a permanent problem if it ends up being rather shoddy quality
what model are these from? looks nice
😃
Happy bee day! 🙂
I think you missunderstood the concept! Just kidding! 😂
how can you not understand in the middle of this universe of varieties! . we tend more or less towards the extreme but we always fall towards points of balance in order to regain momentum... 
Let's hope for a balance ...
He will never run out of honey!
he dont want they friends)
We've had bees at school ... I know how to handle them 🙂
No need to harm them for honey ...
I once caught a bee with my hands, released it and it stung a boy in the face 😄
but this summer karma overtook me, I crashed into a flying wasp with my lips....
Awww... I'd prefer a bee ^^
Yes ... didn't win with this one ... but it's one of these I still remember 🙂
Dancing bear 🙂
is not bees 😲
Alien bees ... sounds like prompt meaterial 🙂
Doesn't look healthy 🙂
Hmm summer, cockroaches ...
Some eat them ...
shrimps and cockroaches are in the same family, so we've all eaten them.
Why not ... it's more a brain thingy ....
with filled pockets? 🙂
yep 😃
Honey!
After lykon's tweet yesterday mentioning the changes, my first reaction was that it's going to be months before we can download this thing.
I want summer!!! 😛
Amy is a beautiful girl with braided light brown hair and wearing a light blue dress and A wise, old kindly woman handing a red silk ribbon to Amy beneath the Tree of Dreams
We all would want to meet Amy ^^
#🏞|general-with-images Amy is a beautiful girl with braided light brown hair and wearing a light blue dress and A wise, old kindly woman handing a red silk ribbon to Amy beneath the Tree of Dreams
No she isn't.
need remember winter...
I'll tell my bees to attack winter!
Amy is a beautiful girl with braided light brown hair and wearing a light blue dress and A wise, old kindly woman handing a red silk ribbon to Amy beneath the Tree of Dreams#🏞|general-with-images
Bots offline here
Xi Jinpingprison
nah its like 2-3 weeks probably, if API stuff won't come this week then 3-4 weeks lmao
I just don't know how long these RLHF stuff last
but no longer than 1 month tbh
after 3.0 releases: "I can't wait for 3.1"
well, from the demo video, it does text really well, about 50% to 25% of the time, so a lot of generations are needed. same for hands, same for objects that people interact with. so maybe some of that is in the architecture change side of things.
I think 3.0 coming out in 2-3 weeks should happen as we are dying for a new model
and if we really are getting further trained or modified models then there's no reason to say "but this is the last model and they rushed it, what will we do now"
unlike games, a rushed model is not forever bad, cause we can finetune easily ourselves
and from lykon's images I think as a base model it looks good already
so sd3 is going to segment the userbase. that's always bad. whatever architectural changes that come later, they can't make people have to retrain their checkpoints because that'll just segment the community even more.
and then there's that
800M for SD1.5 users
2B for SDXL users
8B for enthusiasts
2B already looks good and iirc it's almost finished training too
with DPO and stuff
which personally, I think will be the worst part. lykon mentioned you need more than a 4090 to train the biggest version of the model....which means very few will have checkpoints for the biggest one. sad face.
well if you mean dreambooth stuff then we'll still be getting massive finetunes just like before, just maybe slower(?)
but loras will sadly fade away for 8B
yeah
2B and 800M will keep going
Loras, dreambooth, enough vram to train those of course
and 6B is like, the black sheep tbh
like idk what to say about that one
but at that point just use 8B since that's their flagship model, its guaranteed to be better finetuned and trained and everything
it is guaranteed to run on 16GB and 24GB and with T5 quantization around 12GB
6B is also 512px iirc
not like its a problem cause it has the amazing 16 channel VAE (thats why 2B also looks good at 512)
ewww 512
no its good at 512 no joke
I've been fooling around with LLMs a lot, and cutting a language model in half has a big impact. many will say 'minimal quality loss' but that's garbage. the reasoning ability and understand what you wanted is severely impacted through all of my testing.
have you seen models at 4-bit quant using stuff like gguf and exl2?
we're not talking about HQQ 2-bit or whatever
I'm just talking about going from fp16's to fp8's.
ew fp8
something can understand my instructions at fp16 that just flat out can't at fp8.
im literally using int8 with pixart sigma T5 and it works perfectly
and most people say fp8's for llms have hardly any loss.
I do all the time. 🙂
if you use ollama, 8's and often fp4's are the default
The normal mistral 7b is 14-15 gigs. the default ollama run mistral7b is 4 gigs.
I've been running the dolphin-mixtral q8 for a long time now. 46 gigs. fp16 is 96, so I can't fit that. the difference in reasoning is huge between the 2. I've been running against the full size via api and it can easily handle stuff the 46 gig can't.
I just use koboldcpp which has overpowered features such as min_p and dynamic temperature
and I mostly use Q4_K_M and Q5_K_M
Text is really good from what ive tried yeah 
But i dont know if everyone will be able to run the big version with T5
@jovial tiger
its getting CLOSE
API IS RELEASED
id believe that
oh neat api
especially now that the API is out
(t5 can be quantized to 4bit)
maybe 2-3 weeks isnt so unrealistic
especially with those sample images
I understand cherry picking but you cannot cherry pick quality, only how much it really adhered to the prompt
you cannot suddenly have something look like 320px big and then one that is 4K
I just dont get how the other stable assistant images look like if they were from 800M or something
😄