#🏞|general-with-images
1 messages · Page 160 of 1
ah need to download at ollama.com
cmd yes
Either direct d/load from ollama
Or open a command prompt and type ollama run llava2:latest
don't know if command prompt works for me
ok
where do I put the model if I download it from ollama?
It knows where it s going automatically
ah it's an executable
Yes, double-click when it has finished d/loading
still gives the invalid model selected error
there's no model to select
no model when I type ollama list either
finally figured out
llava2:latest dows not exist for some reason
had to pull llava
then run it in command
and now I have the model in the module
no idea what's going on here
how is it pulling the text string?
I provided this image
and gave it this prompt in the IF image2prompt : What is this an image of? Make it look like a Heroes of Might and Magic VII artwork.
and then I get this prompt: Question: What is this an image of? Make it look like a Heroes of Might and Magic VII artwork.
Response: Best quality, professional, Product photo shoot, This image features a character that resembles the style of a Heroes of Might and Magic (HoMM) VII artwork. The character is a woman with red hair, styled in a way that might be reminiscent of fantasy or role-playing game aesthetics. She has a confident expression and is holding what appears to be a staff or wand. The background is dark and moody, which could suggest an epic or adventurous setting often associated with the HoMM series.
In the style of HoMM VII artwork, this character might represent a hero that the player would control in their quests within the game. The art style includes detailed shading and lighting effects to give depth and dimension to the character and the environment. Mythical, surreal, magical, otherworldly, imaginative,
and the result is this
oh forgot to lower the strenght on the LORA
Well, it needs careful prompting - some'll work - and others it'll miss by a mile!!!
what's a good low strength?
0.20?
also, insetad of what is this image of, I should describe the source image right?
I just need to apply a painting effect to the 3D model, that's all 😅
Make the image look like a painting by van Gogh or Dali or Picasso or Geiger or xxxxxxxxxxx
The source image prompt follows on from the initial question - what is this an image of?
ya I descibed the source image, but results are similar tot he first one
it shows a closeup of the character, not the whole character
I nned the same pose and same details
just a digital painting / concept art effect applied to the source image
Mebbe you actually need Controlnet to go with that ...
ComfyUI wikipedia, a online manual that help you use ComfyUI and Stable Diffusion
Well search for ipadapter style and composition, this should do the trick. But you input image without background has not much noise to create nice painting backgrounds
here's a raw output from my newly installed Flux Dev installation through ComfyUI. Isn't it incredibly grainy? Am I stupid? This is 10 steps.
yeah that's pretty grainy
I can't believe I got it running on an 8gb RTX 3070, and only 16gb system memory. Apparently, 20 to 30 steps is normal when using Flux [dev], this is 26 steps:
Vaguely abstracted modernist oil painting in an expressionist painterly style. A young child stands by the glass wall of a zoo, a gorilla sitting on the other side in its leafy green enclosure. They both hold up a hand to sign a kind "I love you". Subtle imperfections and splattery effect. Bold textures."
Say less
I tried with control net and IPAdapters, problem is I need to keep denoise to 0.2 - 0.3 in order to maintain maximum detail, everything over that distorts details or hallucinates new unwanted ones. Then there's the balance between detail and painting effect: if I keep denoise to 0.3, the painting effect is almost non-existent.
I don't need the backgrounds, the result will be cut-out
Yes it is a delicate balance - keep trying until you find the sweetspot
Openpose Controlnet will give you the subject's pose
Canny Controlnet will give you the contours and outline of your subject
ya I used canny and depth
is there any ipadapter for flux that works in forge?
in this tutorial i am gonna show you how you can install and run ipadapter using flux GGUF model on both Comfyui and forge webui #comfyui #forge #flux #fluxnf4 #fluxggguf #ipadapter
Chapitres
00:00 Intro
00:25 Installation Part
02:57 Workflow Overview
04:57 Forge webUI
07:43 IPA Results
My Upwork Profile
https://www.upwork.com/freelancers/~01...
thanks
Prompt: https://civitai.com/images/30615175
"The Keeper" - OC (Glory Day)
Good morning!
Just had Flux crash my entire PC. GG
Don't try to make fp16 work with 8gb VRAM and 16gb system memory... lol
Yes, that's a little low ....
You also have something to eat 😄
the whole thing instantly locked up and both my monitors turned off with a "no signal" message. my PC stayed on, but it was unresponsive. had to restart the whole thing
There are lower versions that should work ...
This is Flux1[dev] running in ComfyUI. I can understand that NF4 is easier on a lower VRAM/system RAM system, but is the image quality significantly different or are the two indistinguishable?
small differences
its definitely fine
not indistinguishable though
TBH I didn't try lower configs ...
Thanks man, appreciate the help. If I set the weight_dtype to fp8_e4m3fn with clip_name1 set to t5xxl_fp16.safetensors, I can make it work - it does slow down my system severely though. Would it be worth it to try NF4 or should I stick to my current setup?
I'm gonna clarify that I'm not doing any type of productivity though. I just like to gen for fun.
its important to keep the entire diffusion model in VRAM
personally if I had low VRAM, I would cache text embeddings
i.e. make my text embeddings earlier and save them
then use a workflow that loads existing text embeddings instead of using an encoder
I believe there is smaller than NF4 by the way
the GGUF quants go really low
What resources would you recommend I look at to understand what this means? I'm pretty new in this space - I'm just a gamer trying to make some cool pictures with my RTX 3070. I'm not shy to try to learn about how these things work, though. It just seems a little confusing and overwhelming. Thanks in advance.
I think I can explain each bit
so
you have DRAM on your motherboard, which is what is called RAM most of the time
and super fast VRAM on your graphics card
you want things that require speed to use the VRAM
the text encoder is t5 and clip
and the diffusion model is the main Flux bit
I'm with you so far... :D
overall Flux is 4 models
clip, t5, vae and the main diffusion bit
so
if you put clip and t5 on your DRAM that is not so bad
but if you put the main diffusion bit on DRAM its really slow
I am actually not sure about the VAE. Its small anyway so I guess put it on VRAM
Because I need to use the tensor cores in my GPU to handle the diffusion, yeah?
yeah
there is a thing called CPU-offloading
where the model is mostly on GPU but some layers are on CPU
I don't like it but some people do that
Do you know how I would specifically allocate the clip_l and t5xxl_fp16 processes to be handled by the DRAM rather than the VRAM?
by far the easiest thing would be to just use ComfyUI
and it does all this for you
it can even do it dynamically
so it can load and offload parts during the workflow, automatically
I'm already using ComfyUI, and I actually have it all loaded up as we speak.
like offload clip after clip has finished
okay there should be comfy settings that get what you want, as far as I know
would recommend asking on the comfy discord
I don't do low vram stuff personally so I don't know
I'll do that. Thank you so much dude.
I always feel bad asking for help with these things as it's unpaid labor LOL
its okay people like talking about AI and hardware stuff
If we can help many like to do ...
I personally put everything FP8 cos there is a speedup just for FP8
on recent GPUs
this is the --fast flag in comfy
you do lose a little bit of lora following with anything other than FP16 though
I really appreciate it!
So, it'd be better for me to use t5xxl_fp8 rather than t5xxl_fp16?
your GPU wouldn't get the FP8 speed boost so there is not special advantage for FP8
its not huge anyway its like 30%
ah, because I'm not on more recent architecture?
I use cloud so I jump around GPU a lot
That makes sense. I've tried cloud but I find local rendering really interesting. I don't know if my power bill does.
Working local with A.I. for 2 years now and don't really have an issue ...
its mostly upscaling without tiling that requires the crazy high VRAM
I've started using tiling more these days anyway
Another expensive hobby for me... oh well.
ye its a costly hobby
video is taking ages to get build from an image on my macbook.. any suggestions?
video?
Wait for NVIDIA 5090 😄
I gotta wait to hit the jackpot before anything else LOL
Thank god I have a 5000.- Coupon for the ASUS store ^^
Is that dollars or another currency?
€ ...
Jealousy sets in...
You'll be able to buy exactly three quarters of the RTX 5090 with 5000 EUR. ;)
with how GPU prices are evolving... lol
Bad luck AMD missed the A.I. train ....
At least the AI boom has resulted in more companies looking into fabricating semiconductors, which hopefully will drive down the prices considering there's gonna be more competition.
We'll see about that last part though.
image to video
... on a Macbook?
yes
Stable Video Diffusion?
yes
Remember to bring the fire extinguisher!
It has to calculate 25 pictures ... that takes some time ... even on a 4090
ok
must be some serious computing power behind something like this
meanwhile my system is at a grinding halt to make something like this:
I use SVD to generate at 3 FPS and boost it to 25 FPS afterwards ...
"Hawaii 5-oh-9-0!!!"
Plotagraph does a lot like this - or the Motion Timeline in Photoshop! It's pretty cool!
Sometimes it helps if you are an oldie with A.I. 😄
Yes, most video is as yet underwhelming. But I am trying Klingai, and RunwayML Gen-3 for a better, more joined-up experience! 😄
Suggested price-point $1,600 - https://www.dexerto.com/tech/nvidia-rtx-5090-2301788/
That would be fair ...
But I think 4090 has been around 2k for some time
When 5090 is released - buy some 4090's like there's no tomorrow!!! What with the price-drop ...
Not sure about that. Rumors say they already clear the storage ...
But the price on used market should drop ...
Banana! 🤪
A close-up, intense view of a baseball catcher signaling a low curveball, with the focus on the right hand giving the sign. The catcher’s fingers are clearly extended downward, hidden behind his legs, showing two fingers as he discreetly calls for the pitch. His left hand holds the mitt low to the ground, but the real attention is on the precise and subtle movement of the fingers, communicating strategy in a tense, pressure-filled moment. The dirt-covered ground and beads of sweat on his hand add to the intensity, while the shadowy atmosphere heightens the focus on the sign itself.
Did you generate this prompt with ChatGPT?
I'd think so ... A.I. won't know how to signal a curveball
DallE Theme of the Day Discord - I borrowed it!!! 😄
GGUF Flux + LoRAs
At first the video card progressed for the sake of games, now for the sake of AI
Does anybody knows of an AI that could turn Oil Painting to realisitic 3d model?
logo chinese style
A charming chalk drawing of a futuristic spacescape, featuring a campsite with tents, sleeping bags, and outdoor essentials, the sky is a glimpse of outer space with stars and comets. The landscape radiates warmth and comfort, bathed in a golden glow that entices viewers to explore its hidden secrets. Looming over the campsite is a sleek, modern space station, connecting to the lunar surface via a shimmering energy bridge that glows with life.
look at this shit lool
the boy with the raven almost looks like one of his screen shots though. what's your prompt?
the same as hugginface a girl holding a large black raven close to her face, eerie and mysterious, in the style of BSstyle004....trying using 1024 x 1024
nice, detail is crazy. what are you using?
Stable Cascade with Invictus Redmond v1.1 checkpoint for stage C (most important), for sharpness the CSBW refined lite stage b and HQ vae with 21 steps along dmpp SDE karras on Stage C and euler_a karras at 31 steps on stage B. Can link you the models if you like
here is as well the workflow (it´s using an input image in this case)
This one? https://civitai.com/models/316681/invictusredmond-stable-cascade-stage-c-finetune-generalist-model
will try it out later
yep
@scenic yew here is the workflow
Q8_GGUF Flux
guy
|
Im trying to install these files, what folder do i put them in when i get them?>
you keep posting psychedelic pictures again😃
User hasn't listened to enough albums (0 of required 9) for a chart this size.
Please try a smaller chart or a bigger time period (weekly/monthly/quarterly/half/yearly/alltime).
like novelai alternative
A vintage Polaroid camera with a picture emerging from it. The photograph distorts reality, showing trees, buildings, and faces that twist into impossible shapes. Skyscrapers spiral upwards, melting into the clouds, while the horizon folds in on itself. A human figure stretches like elastic, caught between dimensions. The once-clear landscape morphs into a kaleidoscope of colors and surreal shapes, defying logic. The image emerging from the Polaroid frame contains this surreal world where reality and the impossible blur together.
so cute ❤️
Намалюй Логотип Pavlo Ruban School Діагностика ходової частини та Встановлення кутів коліс . Елементи автомобіля: Включи силует автомобіля або його деталей (наприклад, колеса або підвіску), щоб одразу зрозуміти, що моя школа пов'язана з автомобільною діагностикою.
Інструменти: Додай іконки інструментів, які використовуються при діагностиці авто (наприклад, гайкові ключі або рівні), щоб підкреслити практичний аспект навчання.
Текст: Використовуй сучасний шрифт, щоб написати назву "Pavlo Ruban School". Можна виділити слово "School", щоб акцентувати на освіті, або "Ruban" для індивідуальності.
Кольори: Використовуй кольори, що асоціюються з автомобілями, такі як синій, чорний або червоний. також можеш додати металеві відтінки, щоб підкреслити технічну тематику.
Символи: Можеш додати символи, такі як компас або шестерня, що може символізувати точність та якість навчання.
Draw Pavlo Ruban School Logo Diagnostics of the running gear and setting of wheel angles. Car elements: Include a silhouette of a car or its parts (such as wheels or suspension) to immediately understand that my school is related to automotive diagnostics.
Tools: Add icons of tools used in car diagnostics (such as wrenches or levels) to emphasize the practical aspect of learning.
Text: Use a modern font to write the name "Pavlo Ruban School". You can highlight the word "School" to emphasize education, or "Ruban" for individuality.
Colors: Use colors associated with cars, such as blue, black or red. you can also add metallic shades to emphasize the technical theme.
Symbols: You can add symbols, such as a compass or a gear, which can symbolize accuracy and quality of learning.
Trying the "vintage travel poster" look. It's tough.
A photorealistic image of a lighthouse standing on a cliff by the sea, with its beacon shining through turbulent weather. The light pierces through stormy clouds, symbolizing hope and guidance. The image captures the contrast between the dark sky and the bright, unwavering lighthouse light, representing direction in times of uncertainty.
Spaceballs? 😄
Cheesus Christ!
Just post 1
People will see full version clicking it ...
yeah I know.. I just don't like to flood the chat 🙂
I'd think if you post 2 pictures seperated it should be OK
Hi dicordos
Fine here and you? Sounds like there's a new Flux-Controlnet upscaler
you use flux local&
Yes ... but AFAIK there's no downloadable workflow yet ...
I tried online and that's the result ...
dont understand its work on yor pc &
or online only
a cant run xl models...if i start flux/...
its kill my pc
There's an online version of the upscaler at the moment but to many requests: https://huggingface.co/spaces/jasperai/Flux.1-dev-Controlnet-Upscaler Willing to use it local when I find a workflow ....
Here you get a few credits for free every 6 hours. Better only produce 1 picture per trial: https://fluxpro.art/create
SD1.5 is still good for some things!
The only bad thing is to upscale it in a good way ...
you know what is it?😃
Kinda mask?
same game
So one second I'm generating a cute girl or something and then I add an innocent seeming word and suddenly stable diffusion feels like generating an eldritch abomination
Why is this?
Like tf
I spend every Saturday night hanging out at the Turex Naterfed Hat Lime.
This just looks like a very computationally expensive bicubic scale.
Not perfect but if you need a bigger print OK
would you mind posting the prior upscale version, would like to compare to upscaling locally
Here
Some details gone, some added ... if you need a biger print it's OK
Florence2/GGUF_Flux img2img
I'd like to make other images like this but do an inpainting with a new image. What's the best way of doing that for a large number of different images? I'd like the cards to have different images, but the same background.
Has anyone tried high CFG scale (30 CFG), using the Integrated Dynamic Thresholding (CFG fix) in forge UI for Flux models, in order to create details? The image is from SD 1.5, automatic 1111 for reference (!)
I haven’t used SD 1.5 for a long time and forgot that you need to be an promt engineer to get simple things😃
Hi all, does anyone here understand why Stable Diffusion doesn't understand prompts and makes siamese and many inconsistencies?
Hi there, would go with attention mask and the ipadapter. Create a workflow in comfy use the developer tools to make an api out of it and then use a simple python script to send the large number of card prompts to your api.
Please don't post NSFW images in this Discord. You're liable to get banned. Also, I don't know what type of model that is, but if it's based off of SD1.5 then its base resolution is 512x512. If you go much higher in terms of pixel count, you will start to get repeating features.
I'll experiment with some of that but it seems beyond my level of knowledge. I think I found something to help with implementing that tho. https://github.com/SaladTechnologies/comfyui-api
Sorry I wasn't thinking when I shared I forgot to blur 😬
amazing news
wow
that's gorgeous
"My treasure..."
pretty sure these are safe to eat
Wanted for murder ...
Prompt: https://civitai.com/images/32038635
"I'm everywhere I'm so..." - Luigi (Nintendo)
"Minimalist rugged oil painting in faded earthy green hues, capturing delicate details in vast solid patches. A post-apocalyptic city lies in smoking ruins, beginning to be overtaken by nature. Vines crawl up the sides of half-crumbled skyscrapers and leaves form in mats in the streets. Intense bluish mist slightly obscure the setting sun in warm salmon tones. Low angle cityscape from the ground on the other side of the river near the city."
First posting of mine.
I am "Testing the Waters" as it were
must be done with the Prime Minister of Israel
Nice 🤗
Does anyone know any model which can create anime screenshots / anime screencap style images
pretty sure that hasn't been done
Ohh damnn :(
But i see people making pic that are straight up from anime but ai generated
70+ text prompts from today's img2img using Florence2
good
Still easier than that mess of cobwebs in "comfy" UI!
comfyui is 100% the way to go
I believe it, but I'm not very intelligent lol. Other people's workflows hardly ever work, there are 10s to 20s of dependency plugins for every workflow. I couldn't even figure out how to get a lora to work in it without downloading a workflow and it was still not right.
In Forge you put your lora in and prompt it and you're gold.
my main advice would just be to not use other people's workflows
if you build it yourself out of the nodes its easier
it takes a while but if you learn what each node does
you don't rly need to look up workflows after that cos it fits like lego blocks anyway
Character:
Anna, a friendly ghost, is floating gently, with a soft, translucent glow surrounding her. She has long, wavy blonde hair that drifts slightly around her face. Her facial features are warm and kind, with gentle eyes that convey wisdom and care. She’s wearing a faint outline of a factory worker’s uniform – a simple, ghostly version of workwear, with a button-up shirt and utility pants that are slightly faded to fit her ghostly appearance.
Background:
In the background, you can see a whimsical, old-fashioned factory floor. It’s a light and airy space, with soft, muted colors that give it a dreamlike feel. There are conveyor belts, machines with rounded edges, and the sense of light work being done. Ghostly workers in the background are faint and almost translucent like Anna, giving a sense of her past life in the factory.
Style:
The overall illustration is playful and charming, with bright, soft colors. Anna, though a ghost, feels approachable and kind, making her a great character for children to relate to.
#🏞|general-with-images Character:
Anna, a friendly ghost, is floating gently, with a soft, translucent glow surrounding her. She has long, wavy blonde hair that drifts slightly around her face. Her facial features are warm and kind, with gentle eyes that convey wisdom and care. She’s wearing a faint outline of a factory worker’s uniform – a simple, ghostly version of workwear, with a button-up shirt and utility pants that are slightly faded to fit her ghostly appearance.
Background:
In the background, you can see a whimsical, old-fashioned factory floor. It’s a light and airy space, with soft, muted colors that give it a dreamlike feel. There are conveyor belts, machines with rounded edges, and the sense of light work being done. Ghostly workers in the background are faint and almost translucent like Anna, giving a sense of her past life in the factory.
Style:
The overall illustration is playful and charming, with bright, soft colors. Anna, though a ghost, feels approachable and kind, making her a great character for children to relate to.
with flux schnell
any model like niji journey?
Just for comparison when upscaled locally with realESRGANplus x4
Thanks Moofi
Flux Pro is just another level. Overtaking DALL3 as well in realism
does anyone know how to prompt an output like this? i love the style
which front-end are you using?
anyone knows good lora for icons or maybe prompt that limits main subject to like 80% of image size?
Like this thingy
well, that contradicts my explanation...bare minimum - it needs to be completely inside image, not to be cut out
, but if it wuld be like 80-100% that's even better
like...there's example of what I'm trying to avoid:
it often gets worse...that's somewhat usable, but you get the idea 
Stole your prompt and changed it to be photographic.
I used on macbook draw things as the application
What exactly is this, anyone can tell me please?
Trying to be as exact as possible with the tons of information you added to your question.
It is a screenshot with black background white letters which shows python code. It checks for some variables and wether they are true or false. As the variables are named nsfw_preview and nsfw I would say it is code to prevent showing images with nsfw content in the preview (maybe if an specific option is not set or set).
And how do i change this, can u tell me please?
First by adding some more information, and second by maybe using a channel like #🤝|tech-support
What information you might ask yourself. Well start of by what do you want to change, what are you trying to do, what tool are you trying to get running, maybe the operation system, how could anyone try to recreate the problem,…
Ok thanks i ll ask there
ok guys what will i have to do to get tis image colored in stable diffusion? New to this so please go easy on me lol
Hmm, Flux gives me quite different results. I don't have your custom nodes, so maybe I mistook your negative for a positive in the metadata.
This is Searge SDXL Nodes
#artisan-1 people running on the beach
Women face many challenges to their safety in Nepal. Domestic violence is widespread, reflected in the fact that Nepal is one of the few countries in the world where women have a lower life expectancy than men. The problem is compounded by the low number of female police officers, which makes it dif
4
6
2
how
Congrats!
in an alternate reality, Frieza is actually Barack Obama...
The entire fate of humanity rested on a young Sayian, who had grown up on Earth, and was called “Trump”...
With colorize you mean like this?
Why an online service, you could use CogX img2vid Workflow or?
As usual the last 5 Frames are out of order and my prompt was kind of simplistic: a blond man in front of a mechanical device get burned by an explosion
Always trying different ways ...
And I didn't install Cog5 local yet 😄
Understandable because it has some limitations for now. For example the resolution is fixed with 720 x 480 so the cropping was necessary. Pretty sure if time goes by it will be easier to use out of the box
I've used it once online and it took me 1200 seconds 😄
llustration of a 5 year old boy at school.

Illustration of Picasso at McDonalds 🥳
A.I. generated Podcast based on my X (Twitter) profile.
Mojo
dicordos 🙂
where did you make your video?
The one with podcast just showing a picture?
last your video
Audio is from here: https://notebooklm.google.com/
I have an open tab with it but don't think I've tried it ...
😃
its ai dictors?
All A.I. with my twitter as only input 😄
poor pigs ^^
don't worry the fried meat went to kebab
good morning
Good morning 🤗
i not sleep need to go😎
Searge SDXL + Clownshark and Future Cubism LoRAs
Guys what would u say the best model rn is for realistic images on people?
Flux
Figured XD just checking on what people answer with
Not only are the basic model(s) incredible, but the lora works 10 times better than on any SD version.
Question what loras you recommend for realism
I mean I think it’s known also now that Flux tends to also make plasticity skin
All this realism, 4k and others lora like this are just dogshit.
There are a lot of people who don't really understand what they're doing, and give a biased view of their experience. Take Flux1dev, or flux1devQ8, then the smaller models, there's already a world between the 2.
In any case, Flux is currently the most realistic.
Fair enough I just struggle with doing prompts
Personally, I don't think the skin looks plasticky...
ayo prompt needed
`a woman, sexy, black tanktop with a mini skirt, on a futuristic Yamaha motorcycle, a pistol in her right hand, futuristic cyberpunk city, night, neon lights, shadow, sharp, low angle view, dark colors, lora:le@hFluxv1:0.6
Steps: 20,
Sampler: Euler, Schedule type: Simple,
CFG scale: 1, Distilled CFG Scale: 3.5,
Seed: 4266710719,
Size: 1366x768,
Model hash: 52cfce60d7,
Model: flux1-dev-Q8_0, Lora hashes: "le@hFluxv1: e3b0c44298fc", Version: f2.0.1v1.10.1-previous-488-g79b25a82, Diffusion in Low Bits: Automatic (fp16 LoRA), Module 1: ae, Module 2: clip_l, Module 3: t5-v1_1-xxl-encoder-Q8_0`
(hires fix x1.5, 40 steps 4xultrasharp, EXTRA tab x2 4xultrasharp upscale)
What’s that lora u had on?
A lora I made
Ah ok nice my friend
Ty for the information also
Appreciate it I remember u also tryna help me with getting flux in MacBook with Forge
I am trying to use controlnet openpose and it seems to be ignoring the poses I gave it entirely.
is it my model?
its works with many poses?
It doesn't work with any pose at all as far as I can tell
yes, it just ignores it entirely
what in cmd when you start it?
I only added it today, but the model is from almost a year ago so it might simply be incompatible with something
I'm trying that next
what your contol weight
1
work for me 😃
rerefence in left
which one do you use?
soo weird, I have the exact same one
but I'll try reinstalling it
you know where put it?
in the controlnet folder, right?
did you get it from civitai or hugging face?
Isn't there supposed to be a ph or y-whatever file?
all new?
yes
lets try!
it works now
just finished, trying it now
first try it with preprocessor and picture
👍
do you have controlnet 1.1? I have 1.1.455
dont remember how check
at the top left
now all works?
I don't think so
ControlNet v1.1.455
use simple pose
if not work check cmd
hmmm
model conflict
it says it's not compatible with my sd version
so, I have to get a different version of SD altogether?
update your version maybe
idk how to to that lol
hm
neq cn model? I assume that's 'neq controlnet model'?
if i was you i will try different cn model or different checkpoint
can you change
idk
Stable Diffusion checkpoint
you have 1.5 based model?
where do I check that
I'm not sure that my model that I gave you is compatible
first try download 1.5 based model an try
oh
you need cn model for xl
could I just grab xl cn model then
oh okay
lemme do that
first, delete the old files
you need use only right cn model
dont know 100% what model for you
maybe there
works now?
what is it)
trying that one next
I dont understand
it gets it
it gets the pose
but it doesn't seem to apply it
the thing about control net is
the difference in quality between different ones is huge
so you've gotta do a bunch of research each time
to make sure you have the best one for the task you want to do
how do I even check if it's following my poseing and not my prompt?
just did that, trying again
I mean,
it's smilling, but it seems to match
yeah nvm
it's not lol
not sure open pose was ever a good idea TBH
compared to just using canny and depth at low strengths and durations
the main reason I wanted poses above everything else is that I want to train a lora trainer for a chracter I made so I can make a comic with them
And I CANT because this damn poser doesn't want to listen to me
what now in cmd?
i2i Searge SDXL
titanic?
yep, historic photo actually
@nimble mason @clever oar
I could fix flux out of memory error, I forgot adding swap memory to windows using the SSDs 😁
the current staircase looks older than the one from the past😃
its message fom me ?😃
Idk xd
Im good, and you?
Ooh nice
first monster make with flux
second sd 3
I think flux has less brightness in pictures
Yeah probably, in my opinion flux is the best local model but sd3 has better textures most of the time (by default)
but at the same time they are an order of magnitude better quality
hands and geometry
i want mix with thix models
all good from flux and from sd3
Yeah it can do a lot of things better
But its not perfect
DMD2 is another model that is one of my favourites, the effects or lighting of it is epic
too heavy for the system
xl based?
Yep
I used LeoSam´sHelloworld as secondary model or something like that, I dont remember how it worked
It generated that in 4 steps, fast af
Maybe we could build kinda snowman ....
Would u lot say flux is also good with anime or I rather just use a dedicated anime model?
tensor.art is good
"old washed up cosplayer sick of posing"
i2i Searge SDXL + LoRAs
It’s actually pretty decent but if you don’t care about prompt following or text rendering, anime-specific models might be better.
There are a few anime specific flux models too, you can try them out.
yeah that's the thing, Flux has very high image quality but
its not a specialist model
and the anime community have some specialist ones
Yeah I’ve seen some Flux Loras for anime like the flat anime colour lora and a few others
Idk about anime Flux models I have not seen them
i ask chat gpt what gpu is needed for run flux local
answer:
GeForce GTX 1660 Super
GeForce RTX 2060
GeForce RTX 3070
😃
I'm running Flux.Dev on RTX2070 8Gb VRAM, but I also have 64Gb RAM
these are not the minimum, these are recommended 🧐
GTX 1660, possible, but with low size models like Q2
maybe I didn’t specify the model, that’s why this is the result
Intel Core i7 AMD Ryzen 7
NVIDIA GeForce RTX 2070 AMD Radeon RX 6800 XT
chat gpt said is recommended
strange
don't use an AMD GPU, RocM and Zluda are much slower than CUDA, and require separate installation of branches and forks.
now i dont buy amd gpu -)
if you were in france, I would have suggested to sell you my old 2070 Super x)
what's the price? -)
ChatGPT doesn’t know much about flux, flux is pretty recent and chatgpt has a knowledge cutoff. It can use web search but is not very accurate for things like this.
I believe 3090 is usually the best bang for the buck for sd, and just general ai, plenty of vram and pretty fast while being not too expensive.
good choice
Amd gpus are pretty amazing too, better then most nvidia gpus I believe but it’s just that most libraries don’t really support it.
i think about new gpu /
first i want 3060 becouse is cheap and 12 gb
now i think need something better
Used 3060 are good for its price (I got mine for $150), but a used 3090 would be a better option for AI because it has 24gb of vram
Let it grow ... let it grow ... let it grow ...
Using Flux 1.1 Pro API is afaik 0.04$ ...might also be an idea
AFAIK you can use it for free @ gliff
120€, but originally it was a PNY dual fan, I put 2 Noctua chromax black (92x14 mm), because with Stable Diffusion, I couldn't stand the noise of the fans anymore.
As a result, it's probably the quietest of the dual fans.
nice price in my country more expensive used gpu
does anyone know what software is used to make an image like this?
diffusers
stable diffusion?
@nimble mason my glorious king
do you notice something off with dimensions or something ?
or in this is there something off?
Image / Create an eerie, gothic world filled with whimsical, exaggerated characters who inhabit twisted, shadowy landscapes. The scene should blend dark, muted colors with vibrant accents, capturing a sense of fantasy and isolation, while evoking both beauty and unease.
Here is the image you requested.
Yeah, that rock in the middle make the man in the right seem like a giant or something if i look fast haha
cool gen! Is this flux?
This is eery and creepy and beautiful 🙏🏿
yes that seems to be the case, ty. Do you think that changing the rock will fix it ? Still the position of the man is weird maybe
It is not flux, but I don't quite remember if I used cascade and then a lot of inpaint and upscale with SD XL
or jus SD XL
the first one was an img2img with flux
and the inpaint and upscale with SDXL
sometimes I really hate SD as it seems to fix some foot is a nightmare
like why it do it so bad
I got the feeling no one here ever inpaint or correct image
Amazing quality, how was it done?
what model, flux, SD3?
Flux
flux dev? do you use some kind of img2img there?
regular flux dev took like 7 minutes to generate here
with 1 megapixel dimension
Flux dev, no img2img or cnet etc
there is always somee kind of pattern repetition (pardon the redundancy) in your generations, it may have to do with the workflows you made? usually there is carbon copy of stuff
It made me remember this
What do you mean by carbon copy
like the pink trees
That's prolly the prompt
The screen cap above is pretty much the entire WF
All that's missing is the prompt and the load model and save image stuff
the way the flowers appear with the same design
it is something that happens in many generation but in yours I noticed specially, I though it has to do with the weird wolkflow stuff
The originals are produced by Searge SDXL i2i; then fed into GGUF_Flux/Florence2 (i2i/t2i) to produce the final image.
flux colours so nice
Have you produced a Flux version for RES4LYF?
yeah he did, it works on flux now 🙂
Anyone got a w/f or embedded png at all?
Cascade RES4LYF produces extraordinarily unique looks - and 11 minutes/image on my 8Gb VRAM PC 😄
here you go
just delete other stuff in the workflow if needed
Thank you
probably should delete Mega Modifier node its weird
Florence2/GGUF_Flux is so cool!
yeah I've been using Florence2 since the day it came to Comfy
as well as Moondream and WDtagger
Clownsampler and Sharksampler fail to load ...
yeah they aren't in the manager
but there are install instructions here https://github.com/ClownsharkBatwing/RES4LYF
that's a lora
FLUX is my fav model
i2i Florence2/GGUF_Flux
Getting somewhat hang of kohya for SDXL, and training just isn't quite there yet
This was the result using ai-toolkit and flux. Instant perfect result lol
made with leonardoAI's anime XL, still can't find anything not commercial close in terms of style and flexibility
Ive tried i think everything and i just cant find a way to do it.
Something like this?
YEAH this is wayyyy better than anything ive ever gotten
how did you manage to do it
Used the ipadapter and a simple juggernaut sdxl model.
hmm odd ive tried it before but I cant seem to make it work
maybe im missing some models?
So the whole workflow to create it was:
-
used a LLM to describe me your drawing in a prompt. Result:
"A cute anime girl with long, wavy orange hair tied into two large pigtails. She has light skin and is wearing glasses with round lenses, giving her a studious yet playful vibe. Her outfit consists of a lavender sailor-style top with dark stripes on the sleeves, paired with a matching pleated skirt in a muted purple tone. She has an expressive, confident stance, hands clenched into fists at her sides. She also wears knee-high purple leg warmers over dark shoes. Her eyes are large and sparkling, in shades of light purple, adding to her charming appearance. The art style is clean and simple with bold outlines, in a vibrant and light color palette." -
Used a comfyui workflow with a simple JuggernautSDXL Model. (changed the prompt to old woman instead of girl). Added the ipadapter and used your drawing as Composition Input and one image from the Lora you linked (the one with the organge Hair) as source for the style.
If you would take for example the black haired style image (more glossy 3d render look) you might need to change the hair color in advance because it might come out black instead of orange otherwise.
would it be better if I used comfyui instead of stable diffusion?
Should work with webui and for many people webui frontend is easier to handle. I prefer comfyui because of the structured way of the node editor.
I see thank you for the help and info btw :D
You're welcome
hey, if I try to upload an image to cloud storage such as Edgestore using nextjs and if I send it to the image generation AI endpoint for (image-to-image) generation, I get error: input not instance of Blob
London.
Flux1.Dev.fp8 in PortraitMaster
these are so good, do you have any tips on making almost very precise anime character fan arts? I'm new to ai
The details here are amazing!
sorry, I don't usually do fan arts, but you should consider loras etc trained on specific character
Oh okay, thank you
You can try flux, it’s pretty good at prompt following and text rendering. For example Naruto in dragon ball style.
Isn't flux a paid one. Can it do multiple art styles? Like I'm going for a bit of a anime realistic style which is clean. like this
Try with this one 😄
https://civitai.com/models/818292?modelVersionId=915018
❤️
some images made with scribble controlnet on SDXL, still tuning the parameters
Flux(dev and schnell) is open source like stable diffusion, you can run it locally. There is also a "pro" one but its pretty similar to flux dev and closed source. The image I created was from the base model(not for anime but general) and still performs pretty good, you can try out an anime finetune as well.
I got this from quantized Flux.1 dev, you could probably get better results from some anime finetune of flux.1 dev
Quickie ...
Oh dam that really looks pretty good for general without any anime art style checkpoints. How did you manage to make luffi exactly like the one in anime with sll the details like scars and stuff. Lora?
