#๐๏ฝgeneral-with-images
1 messages ยท Page 156 of 1
Prompt: https://civitai.com/images/24901772
"Buzz Drop" - Marshmello mask (Flux test) ๐ถ
The signature stuff got out of hand
yeah ๐

Flux@nf4
1/4 nice but with extra arm
3/4 just creepy
A.I. knows us pretty well. I asked for a "No shit sign"
lol
"No human, no cry!" sing
Flux@nf4
There has been a Lady at my last exhibition who would love your lighthouse pictures ...
Send me her email? If you can ...
I'm sorry ... I don't have it. But there are lovers for pictures like that
Flux@nf4
What sites do you exhibit/sell on?
I am at Society6, fineartamerica, redbubble and Printler
Like this ๐
Physical, and not online?
Trying to sell NFTs only scammers stole my time ...
NFT - too many scammers and time-wasters!
That's why I wanted to create the "No shit sign" to stick a post on X that I don't have ETH and don't need promotion ....
fooling around with masking
Hey guys, I'm using schnell Q4_0, and honestly this is worse than normal 1.5 models
Something about the contrast and exposure puts me off
Use Contrast Correction Node in ComfyUI
Or specify cloudy/overcast look?
I'm using stable diffusion forge
I don't think correction will help, the image is almost blended
I'd play that game
Flux@nf4
Ouch! Gloriously good! I like the light across his face
semi animated scanning stuff with mask & inpaint workflow in comfy
using increments and some math nodes to shift the masked area
๐ญ Willy Wonka tea factory
No, the Mad Hatter's tea party
white
that is a good start, small addition:
white lilac color theme anime image
try something like:
white lilac color theme anime image of a small girl with very long hair holding a big rifle, (big rifle in front of her:1.05), (walking with her cat:1.08), sport shoes, black garters, (monochrome:0.6), cat ears, (flat colors, water painting:0.75), empty apocalyptic white background with broken buildings, simple anime style, hand drawn:0.8), bright
change the weight depending on your model, and maybe add some negative prompt
lol I wasn't being serious
though it was a good start. ๐
what did you use for the caption?
Personally, I use the tool in Kohya_SS, Caption W14, with the prefix $ebl4_Flx_v1, .
This gives :
$ebl4_Flx_v1, solo, looking at viewer, short hair, shirt, black hair, 1boy, closed mouth, male focus, black shirt, facial hair, thick eyebrows, portrait, beard, mature male, realistic
You then need to retrieve all the caption prompts and fill in your json like this:
after that, I personally put it on pastebin, then copy the RAW link, and fill it in on the site, in the advanced settings.
The photos should be good, but so should the prompts.
hey guys , what should i do if i get constantly deformed images
deep shrink res adapter hidiffusion
Tell the alien overlords you need a new cat ๐ฑ but actually you are using too big of dimensions or the wrong dimensions for the model.
If you got the model off civitai check the page for the suggested dimensions
Or google the typical dimensions for the model type you are using
thanks all
what does this do ?
they allow you to generate at different dimensions
it actually doing good job but how
with sd15 model its hard to get both the rifle and cat right. cat's gone here.
with and without it , same key
she got neural link, elon already have flux in his claws by the grok 2 partnership
I can't get it to make realistic art.. What am I doing wrong?
nothing, flux is not specialized in realistic stuff. it's always a bit over-realistic in a way
guys do you use comfy ui , i only use stable diffusion , i get really confused in compy
i want to use LivePortrait does it work in stable diffusion ?
are you using A1111?
yes
LivePortrait freely available on Pinokio2
is it worth the effort to migrate to comfy ui ? i am very comfortable in A1111 now
yeah comfy ui is worth it
Uhm- I'm using a lora that's supposed to enhance realism and it generates half anime half realistic images.. not.. over realistic?
I just find it really unstable
idk much about flux loras
maybe I'm using it wrong..
Haven't tried a LoRa with Flux ... have you played around with the weight?
Well I'm starting to think schnell is not what I want to use
quality is just not good...
I remember of having read LoRo only works with Dev ... but not 100% sure
they might be able to make some limited loras for schnell
Never tried schnell ๐
interesting style here, almost an american style like from Animated Batman shows...
Hey guys.. what is wrong with my generation?!?
First one is the official dev version,
second one is a quantization of the dev version,
and the third is schnell?? they are identical!
I swear I'm going crazy
Surely they are not all the same..
hmm..
first one is schnell
hmm..
they're all schnell
...
you downloaded schnell and named it dev ๐
I think forge isn't loading the right model..
๐ญ who is pulling a prank on me
First two are exactly the same metadata
how do I check metadata?
First image is metadata for the first two images. Second is slightly different with the different module.
hmm
You can check metadata on the "PNG Info" tab in your Forge page.
(Or use a program like the above... shameless plug. ๐ )
thanks
no prob
png info doesn't display metadata for some reason
Oh I thought you said you were using Forge.
maybe you need to reinstall it? I dunno
I reinstalled it yesterday but I don't know if I missed something
"webui_forge_cu121_torch231"
/stable
#๐ฅ๏ฝroles Hey guys.. what is wrong with my generation?!?
First one is the official dev version,
second one is a quantization of the dev version,
and the third is schnell?? they are identical!
what-
weird, I dunno. ๐
hlw how to use this stable diffusion
I'm starting to think you are a bot...
you ca'nt use it in this channel.. I think you can still use the artisan service in those channels.
otherwise you can do a local install
it's fine I guess... I could just open it in a text file, tho it is not as pretty
Okay.. what the hell.. forge isn't changing model ๐ญ
I think I got to check my console or something..
yeah something is broken
or is it just a bug in the latest version..?
I think my client was bugged, probably should reload the window more often
I haven't really had much luck with schnell.. never seem to like the results much.
Doing a couple of tests between nf4, nf4 v2, and dev, now.
Not sure- why forge says "[Unload] Trying to free 953674316406250018963456.00 MB for cuda:0 with 0 models keep loaded ..."
that's a lotta megabytes LOL
yea-
restart the thing
make sure you're using the right version of the model
yea this looks right, I'm pretty sure it's dev now but
you should probably not generate ai if you can't find the stable diffusion repository
that's the generation? lol
Not sure what happened
was your prompt "Blurry photo of a carrot" ?
20 steps?
nf4 vs dev
I think I might actually prefer the nf4 one in this case, but just goes to show they can both yield pretty good results.
not sure, what context?
oh, in the model name
BitsnBytes (more info: https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/981 )
๐ wait it says GTX 10XX/20XX might not support NF4 but my gpu is rtx 2060?
it's not a typo right?
not sure, I use a 3060
looks like you're right... might not work with nf4
use original dev version ๐
uoh
render at 512x512 first to make sure it's working... then you can bump the res
I do have the original dev version but I also have a quantized version, does quantization make things worse?
Nobody using Cubes finetuned Model?
It's rather I don't know of it
quantized .. you mean nf4?
dev Q5_0
is flux good with words or with sentences?
I've heard natural sentences are best.
yea seems a bit better, but lacking in realism but maybe that's what I get for doing an anime character
yeah, it is probably just drawing the character
hmm.. I didn'ty even name the character though
strange..
well it does put patterns together I guess
yeah just try something simpler, should spit out something more realistic typically..
hmm
NF4 on my 8Gb RTX 2070 is OK, along with 64Gb RAM - 90 seconds/1024x1024 image
90 seconds, nice, not too bad
currently takes me around 3-4 minutes on 6gb with 32gb ram
practice at low-res first. 512x512, save some time
I run about 3.6-3.7 s/it. about 1 minute 10 seconds to do 1024x1024. (3060, 12GB VRAM)
I think the extra RAM makes a lot of difference
it does probably
something.. is wrong with the rem cosplay
@jaunty zephyr simple wedding-ring with a broken metallic heart and a red gem as blood
Same boat no matter what model you choose tbh
I'm on an Arc a770 16gb. All of the models from 4-bit GGUF to fp8 safetensors are in the range of 2.7-3.7IT/S.
What was this produced with?
If this uses comfyui-3d-pack
I am now sad
Picture with Flux, Animation with Stable Video Diffusion, and some work afterwards
stable video diffusion
I can't use that I believe.
At least I don't know a method that doesn't go through the 3d pack.
SVD you never know what you will get ... but if I like the picture it's worth a try
Later I add a SloMo cause SVD only gives you 4 Seconds, and pump the Video up to 25 Frames/Second instead of 6/Second
๐
Hope you are fine!
i start watch yor film
There's a lot of talk about the need for character consistency in AI Video, but with "REAL" (narrative) Filmmaking, character consistency is just one element of a much larger quest for CONTINUITY. In this video, I outline why Continuity, and not just consistency, is a pivotally important foundation for any AI Filmmaking tool that is meant for st...
Please developers, please listen to that.
to a certain extent yes but
a ton of very popular movies have lots of massive continuity errors
Of course but nowhere near as much as AI does now.
How it would work is- do NOT generate your scene at once with prompts, that will never give you the degree of control a pro filmmaker will need )or graphic artist for that matter.) Generate all characters doing their thing separately against a "blue screen". Or neutral grey background... Also generate major objects against a neutral background in comfy, then render the background. The remove all the elements and recomposite them against the new background. Then relight everything using C-Light or whatever the future equivalent of it will be- much like a real director, arrange your scene and light it... then the generations can begin... basically they are the equivalents of "takes" in movies, you'll need several and pick the best. And then paint in motion and lip-synch and music.
More or less this could be the process, youd still need a script and storyboard of course.
compositing yeah
All this 3D stuff gives me hope because it shows that AI is starting to understand things 3 dimensionally. That means it will eventually udnerstand that character how it would look from any angle.
Downloaded q8_0
since that's literally as close to fp16 as you get with slightly more vram compared to fp8
Flux is going to text2video next ....
with gguf tech thats gonna be nuts
flux video at 4_0 quant
Flux beated SD3 .... so let's hope the best ...
what are you talking about?
its better but not in all things
movies
i know but you talk about svd?
well whatever the tools will be, svd is definitely not there yet, its just how the workflow and pipeline will look like conceptually
Mojo you start watch gremlins?
Watched first part .... second still waiting ...
i watch 50 % your move
about ufo
oh not true
40 min
want them in my home
repair broken things
Next trial ...
Just a question for anyone else using this node.
Do you have to place the base prompt in prepend_tags and then the LLM prompt in text?
I ask because I put an identical prompt in both slots and it gave me a full prompt instead of leaving out the main prompt in the output.
Damn, wrong figures but the shadow is accurate.
Wario's up to somethin' folks.
็ๆไธๅผ ้ฃๆฏๅพ
own
be wary of people trashing on forge. subversive efforts are afoot by people that want to monetize workflows that do everything forge does
you do know who posted that, correct?
Creator of control net framework
Flowers, cars, water, space, these elements design a poster with the theme of "No waste"
#๐ ๏ฝshow-and-tell Flowers, cars, water, space, these elements design a poster with the theme of "No waste"
Good Morning Coffee
Happy with it?
It's definitely geared to a more artistic look - but it made claims for more coherent text - which didn't produce!
Moonface ๐
SD3 GOLD checkpoint by Dice_AI
I went to the Organ Donation Centre. Didn't have the heart to go in!!!
Love those grapes--what a fun image!
I'm sure nobody will notice him...
"Mr Mothbee?!" ๐
AuraFlow0.3
SD3 "Leg Thru The Bathtub!!!"
Boys gone fishing! SD3 via SwarmUI
flux.1-dev
Bro really loves bathtubs and crabs and bagpipes on the beach
@celest sigil the model cars look great
lol!
Good wine year here.... Prost!
๐
(its me on the picture (LoRA))
Trying to get stuff like SVD to work on arc is more painful it seems
I can't do full res.
Doing 768x432 instead.
If this video could be continued constantly until it loops and maintain consistency
That'd be fascinating for 3d model usage
That would be nice!
At least I found out how to make them longer ...
photo of a cat driving a car on the moon, cinematic, holding a cardboard with the text "FLUX ON 512P"
there are 3 moons lmao
Lol
256 nice though
the modularity of resolutions with flux is a nice change
Let's see what happens if I do 256 with the 8_0 variant
Clean? Yes.
Is it the same as 4_0 at numbers at 256? Yes.
8_0
4_0
2001 Chrysler Aspen
That's part of why there's a base and max shift, vs just having a single shift value, and why width and height are also factored into the equation. In the comfyui section of this channel, I was explaining it a day or two ago and how it all factors into the output shift value.
Probably let's it navigate the network better, and it obviously shows in the results
And yet it was done with the 4_0 quant of flux.1 dev
used ollama prompt encode for better prompts
I'm just hearing about flux now, I'm watching a video as we speak, it is very impressive
Dare I say almost perfect
Sadly not. It is missing a large percentage of NSFW data and certain things that would make it all-round better
However it is fantastic for a start.
If that's your mission
True
I like the results I've been getting with SDXL, it blows early stuff like dalle out of the water but still has those trademark glitches pretty often
But sometimes I like that chaotic style, it feels like the signature of ai that we will soon throw in the garbage
There will be little reason to use it once flux catches up on the tools front
But it is still a good model nonetheless
I like the ethereal vibe you get from glitched out sidewalks and dramatic lighting and color
Let me share something
Made in fooocus with negative prompts but no adjusting weights or inpainting
It's really not bad
But maybe I can go back and fix the back of the jacket and the bottom of her hand
You can bake in anatomy that's NSFW and just make it inaccessible (likely through captioning and censoring images in the dataset), which is what they did. I've had flux periodically spit out people missing tops or pants and it's babrie/ken dolled. You'll see it mostly when you prompt for things involving many people in the scene, like crowds
Sd3 screwed the pooch by attempting to redirect concepts instead and they likely just removed a lot of NSFW stuff from the dataset rather than censor it(costly)
i2i with controlnet is pretty spiffy
Good Morning Coffee ready ...
ๆๆฏไธไธชๅพฎๅ็่ฟ่ฅ็ผ่พ๏ผๆ่ฆๅไธ็ฏๅ ณไบๅจๆฟๅฎๅ จๆ่ฒ็ๅๆ๏ผ่ฏฆ็ปๅ ๅฎนๅฆไธใ่ฏทๅธฎๆ็ๆๅ ๅฎนๅฐ้ข๏ผๆฝ่ฑก้ฃๆ ผ๏ผไธ่ฆๆๆๅญๅบ็ฐใ ๅๆ๏ผๆ ้ข๏ผ ๐ฑๅจๆฟๅคงโ้ทๅบโ๏ผ่ฟไบไปถไบๅไธไธ่ฝๅ๏ผ#ๅฎๅ จๆ่ฒ #ๅจๆฟ็ฆๅฟ ๅ ๅฎน๏ผ ๐ฑๅฆๅฆไปฌๆณจๆๅฆ๏ผๅจๆฟ้้ขๆๅ ไปถไบๅไธไธ่ฝๅ๐ ๐ฒ็ฌฌไธ๏ผๆง็ๆฐไธๆฌกๆฒกๆ็๏ผๅซไน ๆฏๆงๅคๆง๏ผไธ็ถๅฏ่ฝไผ็็ธ๏ผ็ๆฐไผๅทๅบ๏ผ็ญๆฃๅผๅ่ฏ๐ฐ ๐จ็ฌฌไบ๏ผๆๆ็ซๆถๅซๅจๆ่พนๅ้ข็ฒ๏ผ้ข็ฒ็ฒๅฐ้็ซไผ็็ธ๏ผ็งไผคๅฏไธๆฏ้น็็ฉ็๐ซ ๐็ฌฌไธ๏ผๅ้ฅญๆถๆๆบๅซๆพ็ถๅฐ่พน๏ผ็ตๆฑ ๆธฉๅบฆ่ฟ้ซไผ็็ธ็๐ต ๐ฃ็ฌฌๅ๏ผๆฒก่งฃๅป็ๅทๅป้ฃๅๅซ็ดๆฅไธๆฒน้ ๏ผไผ่ฎฉๆฒนๅผ้ๅชๅฆๆฒธ่ พ๏ผๅผๅ็ซ็พ็่ณ็็ธ๐ซ ๐ฉ็ฌฌไบ๏ผๅซๆ้ธก่ๆพๅพฎๆณข็ๅ ็ญ๏ผ่ๅฃณ้ปๆญขๆฐไฝ่จ่๏ผๅๅๅคงๅฐฑ็็ธๅฆ๐
Here is the image you requested.
Its so buggy for me right now but I wanted to test the new imagen 3 with some random prompts I found online and check this out, only about 7 seconds to generate
Long trains of text have yet to find an able-enough Gen AI generator
More likely to be the prompt. The clear text is the only bit I prompted for ๐
INK GRAPHIC STYLE COMIC, psychedelic hippie style 1960s advertising poster, magazine cover,, Norman Rockwell style, Arthur Rackham style, woodcut, bright rich colors COLORS,, She poses for the flirts, straight hair, , beautiful detailed face, a little smile happyness girl SUNGLASS , on big sup board
american young woman girl half-turned SHORT BIKINI, lively natural pose, camping on the beach, twilight , SUNSET, big waves,
ART-DECO, Hokusai style, Egon Schiele style,
SD3
AuraFlow0.3
Hi
Hello
read the information in this channel -> #artisan-faq
Want cake? ๐
Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.
If you have any questions, feel free to ask us!
Your dashboard
Help
Support server
Other languages
en: help
ja: help Japanese
prompt?
dogshit take right here
but only grandpas and lazy people fall for AI generated shit
if anyone outside of that demographic cares about debunking something as AI, they can do so with a bit of critical thinking and observation
You see tons of it online and on social media and just don't recognize it as you scroll by because a lot of it has already jumped the uncanny valley gap.
Obviously, if you spent time analyzing every single thing you see, you might start to notice it more, but the point is that a lot of it doesn't stand out like a deformed thumb anymore at a 0.5sec glance
and fail badly
Hahaha what is this
Bro tried to make an Apple commerical
This might one of my best images ever created with ai
something silly he made and posted
Even if ai had real feelings I would still oppress them
Technology was built to serve us and to suggest the opposite is madness. That's why I don't like social media
then why are you using discord?
Discord isn't optimized for user retention like Instagram
I use this like sms
It's not physiologically addictive like other platforms
neither are a lot of other social media sites. but discord is still social media
Semantics.
what you mean is that you dont' like instagram
Or TikTok, or Facebook, or YouTube these days, or Snapchat
But those apps all have something in common
the fact that they aren't apps? just websites that you can use an app to access?
I'm finished discussing this
You're trying to win an argument with semantics instead of seeing or refuting my point
the only point i see that you have is that you don't like instagram. not sure why you're feeling the need to discuss that on this discord, in this channel
We were talking about technology as tools
you were a little off the deep end then, i think - if you think that it would be acceptable to oppress an intelligence that had real feelings and was aware of them just because it's your slave
Guys I just found the best solution to cutting down image generation time. I was using flux schnell fp8 for a little bit now and while it is my favorite by no doubt, it takes forever compared to my previous main generator. After following a tip from Reddit saying to reduce your desktop resolution, I went from 70-100s to 22 on my first try
Two snakes, with extended tails, not so symmetrical. One tail extends to the left and wraps around a pine branch. The other tail stretches to the right, coiling around a flower
how do you know it's not the other way around?
While that does reduce GPU overhead by a tiny amount, I highly doubt it would make that level of difference for most people. Realistically, you were running out of vram probably by a hundred megabytes and hitting the shared memory pool. By lowering the desktop resolution, you lowered the frame buffer size (think windows defaults to triple buffering, so desktop height * width * 3 color channels * 8 or 10 or 12 bits per channel * 3 frames for the buffer).
Comfyui has an added launch flag to reserve more or less vram. It defaults to 500mb, so raise it to like 0.6 or 0.7 (the launch flag is in gb)
yeah so I wrote that too early, I realized after a bit it was because I ran the same prompt.
for some reason with flux, on average it takes me 40 seconds for the prompt processing then 20 seconds or so for the actual image
Oh that long portion was it loading everything then. They don't load the models until the hit the prompt encoder and ksampler, at least with the gguf versions
Open your task manager and look at the drive graphs
When it's going slow
Well I stopped it a while ago cause I was done making images, but do you know how I could decrease that time to process the prompt?
I understand that the reason flux is so good is probably because of how it comprehends prompts, but it's just annoying how long the wait is when I could cut it in half and generate twice the images
move your models to a faster drive like an ssd or ideally, an nvme drive. your average 50USD nvme drive can load like 3-5 gigabytes per second. regular sata3 SSDs cap out at around 400-500 megabytes per second
I have only one drive, it's a 512GB nvme ssd. I have an external 4TB drive however it's not actually that fast
oh and mechanical HDDs cap out around 100MB/s
the external might be limited by your USB level
like if it's only USB3 and not 3.1 or something
it might not be doing superspeed mode
and they cap out around 100MB/s in that slower mode if i recall correctly
so a big 12GB model will take a a while. Ten seconds per GB, so 120 seconds for 12GB
(if only loading at 100MB/s)
Create a logo for "Grills Inn" that embodies the essence of a modern, inviting grill restaurant. The logo should feature elements that convey warmth, flavor, and a passion for grilling. Incorporate imagery such as a stylized grill, flames, or a sizzling steak. Use a bold, approachable color palette that reflects the rich, hearty nature of grilled food. The font should be clean and contemporary, ensuring that the restaurant's name is prominent and easy to read. The overall design should feel both trendy and welcoming, appealing to a broad range of diners.
Using the rbchar0al Flux.Dev LoRA - it adds an inimitable style!!! Hats off to its creator โญ
For some reason, Flux.Dev has started to work at 2 minutes/image instead of 20!!! Kudos
Luckily, I keep all models on an SSD
Same. I actually want to move the bigger ones over to one of my nvme drives though like the t5xxl and flux models since they're big. They'll load in like 2 seconds on my 5GB/s nvme
Flux.Dev and rbcharc0al LoRA
Flux.Dev with rbcharc0al LoRA
what's the theme of the lora?
ty
last one in this group I think is top 5 AI images I've ever seen

Flux.Dev
Flux.Dev and rbcharc0al LoRA (red and black charcoal)
a Gundam head
"Don't mess with me!" ^^
lovely
Just started up Flux+IPAdapter ...
@fleet goblet Ty for cool images
So cool!! What'd you do for them?
How'd you make the video?
Used LLM-Party with a set systemprompt and base prompt to endlessly make it make images
Very interesting! I've not heard of those systems. May I ask your prompt?
...The prompt is AI generated. That's what I am getting at.
Lol
Oooh gotcha gotcha. Sorry, I misunderstood
Also, if I remember right it should be embedded straight into the file.
It's been a long day, my heads totally not in the game rn ๐
This is my first result after trying the new nf4 model of flux. How come it's so bad?
What's the prompt?
I'd recommend the GGUF models over the nf4 one
4_K_S will do you a far better job
I think I installed the gguf thing or something
That's the nf4 safetensors
you put these in the unet folder in models
and run them with the gguf nodes
I also want to say that I was told nf4 was faster as well, and I literally got like a 20s difference. It still takes forever for me to generate one image
i took your image and florenced it for a prompt
then ran that through my llm that enhanced prompts
It's gonna take forever if you don't have the VRAM required for the model, quantized or not
Im on the schnell files page, which one do I get? I'm super inexperienced with flux
Less to do with flux and more to do with quants
Ive been doing this in lowvram mode
Someone said it's better for 8GB
Although in settings I think it says 16GB are dedicated so I'm confused
I'd first try 4_K_S and go from there
Hey guys
Whats the difference?
Perplexity, mainly. Lower quant = less bits per weight.
Dan do you have the workflow for this new gguf thing?
No I mean is there a workflow, also where do I put the gguf file cause its not a checkpoint apparently
yeah
Is that downloadable via manager? Thats how I got the nf4 loader
You would know this if you had simply just looked at the github page i sent
@fleet goblethave u made gguf work with new control net ?
I don't use the current controlnet because it's still in alpha
Also why wouldnt the controlnet work
yeah i get error
the weights haven't changed
Oh mb I didnt look at that yet, I was too busy on trying to download the file from hf
oh the flux controlnet union is beta now
Ok so I ran the portable comfyui command in the terminal as it said on the github but I dont think Im seeing it
Do I have to restart the comfyui
ye
Ok so I got the node up, but theres no clip to drag, I don't think I downloaded any from the hf or github
Yeah, because it isn't embedded into the model.
You need to download clip_l and t5xxl seperate and load it with dualcliploader for these gguf quants
Specifically Clip_l and t5xxl safetensors
Omggg so much workkk
No shit, sherlock. This is AI.
๐ญ
I can download the fp8 clip right? It should be faster for me since I'm on a mid-end pc
lol
YEAHH
I cant deal with these 40+ second wait times
It's so much better than realvision for everything besides realistic stuff, but its legit 12x longer
Flux will take 40 seconds no matter what
WAIT
At 1024x1024, schnell will at minimum by itself
BEFORE YOU SAY WHY
It's cause of the prompt right? I did the tests and I realized that almost 70% of the time taken to generate an image is spent before the actual sampling process
nope
Can you send me that enhanced prompt thing?
I tried scrolling up and its not there
Huh? That's a different install entirely alongside LLM-Party
I have ollama set up
No I mean the prompt you made
I want to try out this new setup
By the way are these settings right? I didnt know if that was part of the reason that t-rex earlier looked absolutely cooked
Oh. You want the enhanced T-rex prompt.
I don't have it anymore since I enhanced
a different prompt
Ok thats fine
But are those settings right?
Thats from the workflow from forever ago
schnell can do simple, normal and beta scheduler fine I think
Yep
that's basically it
Which it will no matter what
Is flux only long because they have some super advanced prompt understanding thing cause I swear it takes 2/3 of the time just stuck there
Yeah cause 8GB
Wait my memory is not even 60%
Does that mean I shouldnt be in lowvram
It's not loaded into your vram
Lowvram loads clip text encoders into CPU, under sysram
Normal, if you want it to load into vram
However it might try to keep models loaded. That's the problem
What does that mean?
But I'll set it normal, high is prob way too much for me until I finally upgrade my gpu
As in, it might try to keep models loaded into VRAM despite it going to the next node.
Ok
Also just wanted to check the previous generations before I got all this new stuff, turns out that this is actually my slowest in a while at 108 seconds
But prob due to the cpu thing
Wait what happens if I dont set a vram mode?
ill just set it to normal
I started it again after setting it to normalvram
YO
I dont know if it kept my prompt remembered so it didnt have to take long on it but if it didn't, it took 55 SECONDS
Dan thank you bro
You literally cut it in half
Give me a new prompt so I can see if it was a fluke or noty
@fleet goblet
You will not believe this. I gave chatgpt the prompt to make a new version in a random style, and it did it this time in 33 seconds not counting that little wait time at the vae decode thing when it turns the data into an image
Also sorry for spamming, I'm just excited this works now
Thank you kind sir
Anyways after putting this version through firefly to remove the 6th and 7th finger, as well as some random floating sword, check this paper mache version out:
WOW
I do think you should lower your denoise thing though. I did that and ever since, my photorealistic pictures have come out way better since the noise looks like texture to the skin
Or you can manually add noise but I like keeping mine between 0.85 and 0.95
Look how cute this little guy is
This is way better actually, adds more depth I think
I'm happy that it does.
I basically do the chatgpt thing you did but locally
๐คทโโ๏ธ
Dude you're literally a life saver
Did you make that just now in case I said that to you
Nope. That's an older image.
LOL
I do have it running right now, though.
I run the 8_0 variants since I have the VRAM for it. I don't have the VRAM for the full precision ones
But unlike you I'm on Intel Arc.
Btw, I'm using firefly to edit these images since some things look weird, but I cant figure out how to get this license plate to say something like "GOFAST"
What was the prompt for this image?
See intel arc might be one of my favorite gpu's however I'm sticking with nvidia due to its compatibility with the games that I play, especially this space sim I bought a while ago that is NOTORIOUSLY buggy with AMD, so I dont even wanna think about it trying to run on arc
A car with rocket boosters emitting purple gas speeds through a space tunnel with swirling red, yellow, and black colors, all captured with intense motion blur to emphasize the high-speed movement
This is gonna seem funny to you
Would you like to know what I added to the prompt to add those words?
๐ค
The license plate of the car has the words "GOFAST" on it.
Same prompt. I only added that.
Weird
Is it possible that whenever I get those like annoying errors when everything else works fine that it's just a seed thing?
Are some seeds destined for me to fail?
Some seeds can straight up be not good
however, keep in mind you are running lower-precision quantizations of the models
Oh right
Though I will say the differences are not nearly as big as you'd expect...
As you can clearly see they really aren't that different at all lmao
Some people have gotten better images than expected with 4_0, directly on par with FP16.
not fail, but since the AI does math and uses the seed for the starting point of that math, the math could land the final result in a strange area of latent space. and then you wind up with odd, strange, and bazaar data
Are there seeds that are just godly and give you most of what you want
There are certain seeds that do have the right values for the latent space diffusion to appear a certain way.
no. the AI is doing matrix multiplication to find locations on an XYZ grid and retrieve the data that it stored at those locations (called vectors) when it trained. if you use the term apple, it's going to find all the information it learned about apples. but if the seed just happens to be the wrong random number it could land at a vector where some data about apple, some about apple tree, and some about a dog named apple is stored right next to each other. and it might give you a furry apple
Can you sum that down in like 1 sentence
I started reading and my brain overheated
So it's a no, but you then explain yourself in such a way that makes it seem like a Yes
and because every AI stores the data itself, and decides the vectors itself, when it trains, there are not 'special seeds' that work for all ais
woh, i made an image to image workflow of flux by mixing two workflows
because I'm pretty sure a furry apple is a failure when you try to generate a normal apple.
it's only a yes IF you explore one specific AI enough to know that seed xxxxxxxxxxxx gives you good results when used with teh tokens for yyyyy
that seed will only work with that specific AI and those specific tokens along with the all teh other specific settings such as cfg, sampler, scheduler, etc
so save that if you run across it so you can use it again
But that is an impossibility, as one seed that gives you good results regarding one specific set of tokens might be good with a different prompt.
Too many factors.
it's a random number. when you hit geneate, the first thing that happens is that all your prompt words are turned into numbers we call tokens. then the AI starts doing math. and it finds ALL the data for those tokens. then it rolls another random number and picks a chunk of that data to use
Okkk so a no then
it's a no
Which is just an elaboration of what I just said I assume?
it's a clarification of what happens for those that aren't aware of how this works
there are no magic numbers. period
you can lock your settings and math in place by using the same seed each time, which will let you change the prompt and see how the tokens work
There are magic numbers in very niche, very specific cases where you know a seed outputs what.
That's basically what you said.
Lol
welcome to the world of actual mathmatics. i said there are no magic numbers. and I'm very very very tired of you trying to twist what I said to make it mean what you want. i'm sick and tired of trolls.
so i'm done with you. i wasnt' talking to you, anyway.
Supposedly
I am a troll
and not just stoned and having a good time
๐คทโโ๏ธ
go jump in a lake and don't come back out
I did not even think this was an argument.
Alright.
Telling me to end my life over something like that, is pitiful.
Ironically, it seems it's my turn to block you.
Yall I just wanted to know how seeds worked regarding prompt accuracy why you fighting over calculus ๐
We aren't fighting over anything. He's mad I explain things in a different way.
I never even insulted him. Seems just questions were enough for an individual who dare not have the patience.
I didnt really read any of it cause I got too confused on the first message so
Yeah he took that WAY too farf
Also the last part is actually just me memeing on him since I noticed his pointless anger
Aka this
By the way, I generated the image again and it gave me this
I think this is the best of the three
Nice.
But how can I get it so it makes a space time tunnel and not an actual tunnel:
A car with rocket boosters emitting purple gas speeds through a space time tunnel with swirling red, yellow, and black colors, all captured with intense motion blur to emphasize the high-speed movement. the license plate reads "GOFAST"
when you put the words in " " you tell the AI to write them.
don't use the word tunnel
I wanted the thing from like the peabody and sherman movie if you know what I mean
try wormhole
But like then what do I replace it with?
OH SHOOT thats so much smarter
think about sci-fi and the terms used in movies
But again thank you Dan for helping me generate like quality images
and add the term "motion blur' to your prompt
Im now averaging 30s an image, way better than 100
I did, I put intense motion blur
Still giving me a road
i screen shared ur render using flux
cars run on roads, the AI knows that. what's your prompt?
Sandland how did you end up choosing 0.74 as your denoise, its so specifid
A car with rocket boosters emitting purple gas speeds through a space time wormhole with swirling red, yellow, and black colors, all captured with intense motion blur to emphasize the high-speed movement. the license plate reads "GOFAST"
Maybe I should write flying
try: rocket car flying through a wormhole, motion blur
see what just that gets you
A flying car with purple gas rocket boosters speeds through a space wormhole with swirling red, yellow, and black colors, all captured with intense motion blur to emphasize the high-speed flight. the license plate reads "GOFAST"
Let me try that first
I gotta redo it cause it made some weird flying contraption
Let me try 1 more time
try the short prompt i suggested first
Yeha it didnt work, Ill try yours
See I like that tunnel but with a flying car
Do you think you can inpaint it for me and add the car I showed earlier?
so now you ahve a good idea of how the AI is thyinking
so i would take that into photoshop, remove the rocket, take the other into photoshop, remove the background, and then composite
Got it I can try
not bad
If the tires worked and maybe I colored the car slightly cooler, it would be perfect
you have photoshop - generate a few more cars
Ideogram v2 released (The below image is from ideogram v2, the rest are flux)
Seems quite good as well.
^
Flux: Taking the dog for a walk
What resolution is this
thanks for WF
Seems luigi found a sparkler
Other than the L being replaced with the utorrent logo
Stable Video Diffusion
new ideogram 2.0 looking good
AuraFlow0.3
4032x2304 It's a 3x upscale.
Flux: Taking the dog for a walk
how is the speed on ur gpu ?
Flux.Dev with rbcharc0al LoRA
Yes, flux
you can get much more realism with flux. It's hard to prompt, though
Flux.Dev with rbcharc0al LoRA
Flux works on my RTX2070 8Gb VRAM PC
yeah the smallest quants may be ok
6 biggy bytes might be a tight squeeze ... try it!
unless you are doing extremely "secret" images
would recommend cloud
its like 1-2 dollars per hour for 40-80GB VRAM gpus
Secret images ๐
lol
Ngl I'm not gonna pay, I don't have a commercial use for this at the moment
yeah I understand. cloud fees having been costing a lot lately
SDXL is already leaps and bounds past what I remember ai generation looking like
oh I am still on SDXL lol
It is frustrating when it can't get something but sometimes I like the glitches
I think when our outputs become flawless it will become less interesting to me and I will be more likely to side against it
Companies like Adobe are already trying to leverage ai into workflows but idk if I like that
yeah I spent most of my time trying to break the model in funny ways
That honestly looks real
I just find that every model makes the pictures a little too high contrast/almost beveled looking
Some grading/filtering might fix that though
there is a genre of lora called boring reality that you might like
otherwise stock photo or amateur photo models
it looks pretty good
yeah I don't have anything aftermarket for fooocus
it comes with a regular model, realistic and anime
22GB
You can download others
Yes
just needs to be a .bat file right
I wasn't aware SD had video capabilities! I must investigate! Thank you! Any YT videos that might help me on my way to get started?
ipiv motion
Hey guys! I have been trying to generate Iron Man Prime for over a year now but, every model available cannot make anything other than MCU/Movie iron man. Any tips?
a lora would be ideal
Flux.Dev
No, they're .safetensor files
ok
I launch my program through bat files but I see the safetensors in advanced settings
Loving Flux 1 so far
damn that looks real
soogsx lora
Isn't nanotechnology great? (Plus, he's got a good ceramic coating applied. ๐คฃ )
Sorry in Cologne at the moment so can't answer everything.
wow that's impressive
wait ima need 2nd image prompt thats good
Real photos from Cologne ๐
im fucking stupid i didnt read the top