#🏞|general-with-images
Nagens for dinner today
he's in an operating theater surrounded by sharp knives. be very, very quiet...
ok I'll be quiet like a ninja
Stable Diffusion 3.5 Medium alongside other open-image base models. This model only requires 9.9 GB of VRAM
Anyone have a good comparison between XL/2/3/3.5?
I'm curious how much of an improvement 3.5 is
that sounds like a good project for you :)
notebook girl with name keli style Barbie
the channels to generate in are the Artisan channels. start by reading the information here #artisan-faq
Bilbo Baggins, SD 3.5 Large > Flux1Dev > Upscaler
My lonely NES is killing me...
eye
Flux RF Inversion
I gave a super random prompt, and all Flux wants to generate is macro photos of insects.
1.5 (less than 1b) knows the most art styles I believe but has very very bad prompt following, and the base model is really low quality. got a lot of community support, lots of finetunes, ip adapters, and controlnets. Finetuned models are great at their specific domains though.
2 (less than 1b) didn't get much community improvement, should have slightly better prompt following (still very bad) than 1.5 but not that great image quality either. Finetuning it helps generate better images but it's not as good as 1.5.
sdxl (3b) has better prompt following (not great compared to modern models) and better image quality, pretty bad text rendering but could write single words unlike the above, got a lot of community support and has lots of finetunes, ip adapters, controlnets and techniques like lcm
sd3 medium (2b) was ok, should have more detail and better prompt following than the above but honestly the aesthetic was worse than sdxl, and humans were horrible especially in different poses, but decent text rendering, however very little community support
sd3.5 large (8b) is a considerable improvement, knows lots of art styles, great prompt following, nice image quality, good text rendering. seems to have lots of support. sd3.5 medium (2.5b) is alright, the aesthetic quality is worse than sdxl but prompt following is better, not too great text rendering, humans are similar-ish to base sdxl.
Has anyone tried to add lying Sigmas to Cascade Ultrapixel?
prompt? That's cool
thx
Create a freddy krueger with realistic body
Billions must make the official Catholic Church mascot a lora
I am trying to create images like the following, but they always come out blurry. I don't have any problems with other images with all the same settings. I also sometimes get blurriness with photos of white sand beaches. Is it the brightness? What is it about this prompt? "flat white background, 3D Unreal Engine render of an old green wooden chair"
https://github.com/ClownsharkBatwing/UltraCascade guessing you've seen this
but this is much more ready to use than the other implementations just fyi
I have two GPUs, one with 12 GB of VRAM and one with 16 GB. Is there any possibility to run stable diffusion video using these two? It would be a great help. I am a new learner.
Get 200 credits for the best voted new A.I. https://www.recraft.ai/invite/W7Usp3Pk31
Need to find my most complex prompt and try it there. They say it can do loooong texts
works with -"in cafe"
what does it do
A new A.I. Model best voted by users at the moment.
thx maybe i try it later
I think the XL model is the best because it is available on moderately expensive video cards with good quality
XL should be fine ...
more freedom)
i can use 1.5 only XD
recraft looks really strong yeah
might need to run a different model as refiner over it sometimes cos the photorealism is a bit off sometimes
but it does layouts very well
Flux didn't do the prompt that good ...
It's paying. We can't use it locally for free...
If you open account using ref link you get 200 instead of 50 credits. The referer gets 500 credits ... so better refer
You like a result, you want to tinker with it, and met with this. This will take a while to install nodes for 
Especially if i have to uninstall "conflicting nodes" and install them manually lol.
Rhythmic gymnastics leotard
#artisan-1 rhythmic gymnastics leotard
#rhythmic gymnastics leotard
Rhythmic Gymnastics
New Discord stuff.
you very like gold😎
Thanks. Super Flux with my LoRa 🙂
When you don't render in 30 Steps but 3 times 10 Steps
What workflow does that?
Literally none of them I've seen re-iterates the same image three times at 10 steps.
Selfmade Workflow .... bad luck I don't know how to add the Detail Demon
Mojo
is the text also ai
Yes! Flux Model!
some results with Mr P Brawl Stars LoRA that I trained for fun on weights.gg
Didn’t expect Flux.1-dev lora training to be good with just 8 images lol
wow that's actually crazy
There's a new model that can do even better
do you use comfyui
Yes but this new model is available only online ...
https://www.recraft.ai/blog/recraft-introduces-a-revolutionary-ai-model-that-thinks-in-design-language If you want I can send you an invite. 200 Credits for you (instead of 50), but also 500 for me ...
Recraft V3 is state-of-the-art in image generation
I tested it a bit, flux dev actually performs better.
One of the worst things about recraft is that the text is positioned horribly and looks like you photoshopped it on the image.
Good realism tho but prompt following is also pretty eh imo compared to things like sd3.5 L and Flux dev
You can also place text and an image manual and let the A.I. bring it together
the song must not be the same
vegetable fashion week
Hello! I am new to this community, I'm not sure where to ask for help. How would one go about creating such images? The base model for all of these is SDXL, and I can see that the creator has used specific checkpoints, but my generations are nowhere near what is shown in the image.
what are you using for prompts?
I use comfyui, and usually just manually input the prompts
I was experimenting with image-to-image upscaling using KSampler and it added way more detail (upscaled twice)
Is that the standard practice to add details into a base image?
there are lots of things, especially if you're using comfyUI, that you can add in that can enhance an image.
Could you tell me what else are some good practices?
one thing you probably want to experiment with is changing the number of steps. and changing the cfg values. also, the sampler and scheduler you use will have a huge effect - and some models will not work with some samplers or schedulers.
You should be able to see on CivitAI what others use
yeah. read through civit and see what people are saying they are doing and using
I have been using Euler, samplers converge after some amount of points right? Will adding more steps increase the details to a huge extent? I was under the assumption that more steps will have diminishing returns on details beyond a point, and my gpu is pretty shit
euler isn't always the best choice
don't assume. experiment
Yeah, I have been doing that till now, but couldn't wrap my head around how I could possibly get more details into the images
Alright, will do
that's why you do a grid. just use the word 'apple' and then try generating it with various numbers of steps. or various cfg values, and so on
and study the changes in it
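A minimal sketch of the grid idea above, assuming you only need the list of settings to feed into whatever UI or workflow you use. The step counts, cfg values, seed, and the `build_grid` helper are hypothetical placeholders, not anything a specific tool requires.

```python
from itertools import product

# Hypothetical values to sweep; pick ranges that suit your model.
STEPS = [10, 20, 30, 40]
CFG_VALUES = [3.0, 5.0, 7.0]


def build_grid(prompt, seed, steps_list, cfg_list):
    """Return one settings dict per (steps, cfg) pair, with prompt and seed held fixed."""
    return [
        {"prompt": prompt, "seed": seed, "steps": steps, "cfg": cfg}
        for steps, cfg in product(steps_list, cfg_list)
    ]


# Keeping the prompt trivial ("apple") and the seed fixed means any change
# between cells comes from steps or cfg alone.
grid = build_grid("apple", seed=1234, steps_list=STEPS, cfg_list=CFG_VALUES)
for cell in grid:
    print(cell)
```

Each dict is one cell of the grid; generate one image per cell and lay them out by steps and cfg to compare.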
There are always a lot of things to try ... like Detail Daemon ... SuperFlux ... you can see some of the add ons in youtube videos
I could achieve these results just by increasing the steps and playing with the basic variables I assume?
never assume
if nothing else, contact the guy that created those and ask him what he did
Thank you, I havent heard of these will take a look into it 😁
And I don't think you will get the same picture ... different graphic cards, software versions ... etc.
Did that haha, waiting for his reply
Yee sure sure, I wanted to know how to achieve such details, not the exact images
Thank you both
And every Model is different ....
They also often need a different kind of prompting ....
It's not as easy as they might tell you
what he said - thus my admonition to explore
his images look to me like he has hundreds of hours in figuring out how to make it work
"it" being ai generative imagery
Started spring 2022 😄
hundreds of hours :) i'm just a few months ahead of you
For any reason I can't add nodes at the moment ^^
comfy without nodes is pretty rough
I have a lot, but can't install 4. I think it's cause they have different requirements ... but not 100% sure
Or 4 Node Packs
Clowny is pretty deep into that topic ....
you can get a decent amount of the benefits if you use DPM++ 2SA and then put Eta and S_noise as high as you can
DPM++ 2SA can keep up with clown a lot better than euler
git clone
I think Flux is better with DPM++ 2M but all that stuff is confusing me ... getting too much
flux is better put in the darkest possible drawer and locked away
Manual doesn't work, too ... waiting for new Comfy, now ...
what nodes are you not able to add?
I do like DPM++ 2M a lot as well
I think one was the WAS?
clown?
No ... not from Clown ...
clownsampler
has he actually created a new sampler and named it after himself?
haha it works with a second node called sharksampler 😂
😄
oh this is much too good :)
and a batwing scheduler?
that would be good yeah
And Lying Sigmas ....
he tends to use a big noodle chain of schedulers I don't use that bit
I prefer manual sigmas to schedulers
an entire comedy branch of math
I think it's clown scheduler and shark sampler. Something like that anyway.
also the names change over time (sadly)
there's some other stuff in the repo I haven't even tried yet
I would recommend you to search for the tile controlnet if you want more detailed images...
Will do, thank you!
Example of the tile controlnet left base generation and right the tile upscaled one
Its past midnight here, so will have to see how this goes tomorrow haha
Btw, how intensive is creating a 3d model from an image? I have gtx 1650, with 4gb vram
there aren't any good at home open source text or image to 3d AI generators
CRM seems pretty impressive tho? Im not sure if its open source or not
Are they not good at the mesh generation or material generation more specifically?
even the paid ones aren't that great at mesh generation yet
you'll be better off just using blender for at least another year
Shit, here i was thinking i could maybe get this to give some passable results
you might be able to. try and see
2dgame background pixel sky cloud
Here is the image you requested.
what does this option do? it appears to have no effect on my clip skip slider usage.
eyeball teddy train, eyeballs falling, eyeball rain, ancient sad eyeballs
I guess it doesn't know Thunderbirds very well...?
doesn't know a lot of the old kids shows
what model are you using?
I'm trying several Flux variants
oh. well - yeah, not surprised flux doesn't know.
but it knows some. it knows there are things that fly
PixelWave Flux
elmobirds?
that's ... i have so many questions...
I have no answers 🤷🏻‍♂️
Only more questions!
That was also a Magic Roundabout prompt.
what's Magic Roundabout?
An old kids TV series
Nice 🙂
I like this one, too.
Netflix film for 18+
guys, where can i learn nsfw art?
Youtube or Maybe better: CivitAI
not sure whether it's forbidden to discuss that ... but for more learning a chat isn't really the best place. We are not paid teachers. We help each other a bit ...but this is not a course
Try unstable diffusion discord. But avoid the nsfw niche channel 🙂
any link to it?
just search google
the blur in the background or something else?
yeah
using pony realism with comfyui
why do i generate these blurs when i am using pony realism?
I had a similar problem when using a "wrong" sampler (IIRC DPM++ 2M) with pony realism in A1111. Changed to SDE sampler and it worked. Changing to a different checkpoint (Cyberrealistic pony) also worked.
try something other than euler and karras
Paint a picture of sunset and lone ducks flying, autumn water together in the sky
Children's picture book, in the style of Hayao Miyazaki, forest, small animals, little boy, night, starry sky
Here is the image you requested.
Here is the image you requested.
Quantum physics good morning coffee
Another firefighter gone rogue
which model was this generated with?
boys lets say i wana train a pony lora on the ben 10 omnitrix badge
i want it to learn the logo and the body suit styles
and i got 2 folders one with 88 images and the other with 250 images
how many repeats should i use for the 88 images one?
like this
body suit and the badge itself
Photo op!
"Now, if she could only turn those into water cannons!!!" 🥳
The two spires I mean! 😉
Last date
quoth the raven, "never more!"
Do not open the box!
Flux and Silhuflowart2 LoRA
which sorcery is this? which model can handle text so well? 3.5?
Flux ...
nice
3.5 should be able, too. But I am more used to Flux
i am still stuck on xl and lightning, i guess i should try some new models, but i am afraid it will be too slow on my mac
I have no idea about Mac, but for Flux there are already a lot of smaller and faster versions ...
how much longer flux iteration takes compared to xl with same sampler?
if for example you have 10it/s with xl and 5it/s with flux, i can guess what i could expect
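A rough sketch of the estimate being asked for, using the 10 it/s and 5 it/s figures from the example above. The 30-step count and the `seconds_per_image` helper are assumptions for illustration, and this ignores model loading, VAE decode, and any upscaling.

```python
def seconds_per_image(steps, iterations_per_second):
    """Sampling time only: number of steps divided by iterations per second."""
    if iterations_per_second <= 0:
        raise ValueError("iterations_per_second must be positive")
    return steps / iterations_per_second


STEPS = 30  # hypothetical step count, same sampler for both models

xl_seconds = seconds_per_image(STEPS, 10.0)    # example XL speed from the chat
flux_seconds = seconds_per_image(STEPS, 5.0)   # example Flux speed from the chat

print(f"XL:   {xl_seconds:.1f} s per image")
print(f"Flux: {flux_seconds:.1f} s per image")
print(f"Flux takes {flux_seconds / xl_seconds:.1f}x as long")
```

If a tool reports seconds per iteration instead (as with the 3-5 s/it mentioned for the Mac), multiply by the step count rather than dividing.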
I never really cared ... I'm going for quality. But I am sure you can find info about that using google
i will try anyway, i just wondered if you know
when your mac costs like 2 decent gaming pcs, and you have 3-5 seconds per iteration you want to break it 😂
My Workflows are damn crazy using Super Flux with Detail Daemon and my own Lora ... it's not really fast here ...
Just give it a try ... it's not that much work.
just say no to macs?
i need mac for work
your mac got nvidia gpu?
there is no mac with nvidia
I don't know the M4 ... I wonder how they can share only 16GB ....
i have m3 pro with 18GB of unified ram, and it is slow
what you need, is an nvidia GPU with 16 gig VRAM
system ram shouldn't be used for this
and an integrated GPU is always going to be slow
Macs are also used for video cut and so on and they can handle that .... I think they have a special architecture ...
that's video. but the current AI technology is all written for nvidia, python, and cuda
macs/apples were always the leaders in graphics, but this is totally different
you're just getting graphics as the end result
I have never tried that, so I have no idea whether there aren't good solutions for Mac, too
the funny thing is, you can run stable on an iPhone, but not an android. yet you don't want an apple product for your desktop for it
Even the models converted for the apple ANE (neural cores) are not faster. But if you have a notebook which can work 8-10 hours without a power cord compared to 400w consumption there must be a difference 🙂
yes, but that's hardware, not software
But the shared memory is quite nice for the larger llms with 96 gbyte shared "(V)RAM".
m2 ultra with 192GB of unified RAM is insane
but slow
its faster than my m3 pro with 18GB
Yes it is, but still, even with the current torch version with support for Apple's mps backend, you won't be fast. Even if you use it for parallel processing (8 images at the same time...) the GPU cores are limiting the performance
more unified ram is like having more vram on GPU, so it works faster
since 2/3 are reserved for vram
2/3 of 192 is 128
so 128GB for VRAM
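The arithmetic above, written out as a small sketch. The two-thirds share is the figure claimed in this chat, not a verified Apple specification, and `gpu_usable_gb` is a hypothetical helper name.

```python
from fractions import Fraction

# Share of unified memory assumed to be usable by the GPU.
# This 2/3 figure is the claim made in the chat, not a verified spec.
GPU_SHARE = Fraction(2, 3)


def gpu_usable_gb(unified_ram_gb, share=GPU_SHARE):
    """Unified RAM (GB) multiplied by the assumed GPU share."""
    return float(unified_ram_gb * share)


print(gpu_usable_gb(192))  # the M2 Ultra configuration mentioned above
print(gpu_usable_gb(18))   # the M3 Pro configuration mentioned above
```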
but it's not an nvidia gpu, so ...
you're just going to be slow
"An Nvidia GPU is preferred for AI image generation because of its specialized "Tensor Cores" which significantly accelerate the complex mathematical calculations needed to run deep learning models, allowing for much faster image generation compared to a standard CPU, making it ideal for generating high-quality AI images quickly; essentially, Nvidia GPUs are designed with parallel processing capabilities that handle the massive data processing involved in AI image generation very efficiently. "
as opposed to normal graphics work
... just not long strings of text?!
depends on the text
Llanfairpwllgwyngyllgogerychwyrndrobwll-Llantysiliogogogoch
wtf is this?
going to scotland?
i cant click on this
where'd you get that from?
i found it under announcements
but click on notification does not take you there (as it should)
What's the best civitai model for monster design?
I'm looking for one that pertains to the designs of silent hill
Yum This Pumpkin Spice syrup is just what I need in my egg-flip!!!
most of these are just images of people walking through a spooky fog
it tried
why do you need a lora? all the AI base models do monsters just fine with just a prompt
Its the name of a town in North Wales
sorry, i did know that. not sure why i thought scotland
and what about non english characters?
СРБИЈА
ČĆŽŠĐčćžšđ
예쁜 여자
漂亮的女孩
The people there just call it: Llanfair ... that's what a commercial says here
go run some through sd3.5 and see
okay well go run the non-english characters through flux and see?
AFAIK Asian Characters it's more hallucinating ...
thanks, any idea about non english Latin characters? or Cyrillic ones? same as for Asian?
since the AIs are trained on english text, probably no non-english is going to work well. you'd really need loras for those and i wish you lots of success
I just prompted an asian city, no specific characters. I can't read them but some asian friends told me they don't exist
Maybe you have more luck prompting a specific text, but have to try ...
i am just curious, i have no specific need
There are some asian models, too ....
Made for prompting in an asian language so they should replicate that ....
Hunyuan-DiT
I mean there's different types of monster styles I guess, I figured I might need a lora that captures a specific artstyle
you might have to train one for that
might not also
I'll just use ponydiffusion
A cute girl with white flower in hairs.
read the information in this channel: #artisan-faq
What happens if I use the same seed but change the prompt a little?
a realistic HD image of inside of the store full of prams, cribs, and bicycles
it changes the image. that's the reason for locking the seed down, so you can tweak the prompt or settings.
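A toy illustration of why locking the seed helps: the seed fixes the starting noise, so only the prompt or settings you tweak can change the result. This sketch uses Python's standard `random` module as a stand-in for the latent noise; real pipelines draw noise from their own generators, and `initial_noise` is a hypothetical helper.

```python
import random


def initial_noise(seed, size=8):
    """Draw a small list of Gaussian values from a generator seeded with `seed`."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(size)]


same_a = initial_noise(42)
same_b = initial_noise(42)     # same seed: identical starting noise
different = initial_noise(43)  # new seed: different starting noise

print(same_a == same_b)
print(same_a == different)
```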
you can't generate in this channel. read the information in #artisan-faq
Here are your images...
How you make em throw up?
Schrödingers Coffee
We lie down together amid the cacti. The succulent leaves closest to the ground are a marbled grey, as if turned to stone, and we become absorbed by them, feeling our way around their rounded contours with our fingers. As we gaze up, following their odd tear-shaped forms bundled together, the sprawling double cypress tree – two trunks locked in an embrace – claims our attention with its swaying branches splitting into ever finer branches and twigs ending, here and there, in clusters of cones. Initially, we can’t really tell if it’s the wind that’s causing the canopy to stir and sway that way. It forms a dark shifting frame that we enter and get lost in as one does in a forest.
It’s just accidental. I’m retrying these prompts I had done months ago with Invoke, SDXL, and the nightmare prompt node, and every so often I find one that gives wild, varied results.
Flux Turbo LoRA w/f with 2 x KSamplers, 2 x Upscale and Sharpen
"The players are literally melting in this heat!"
It was at this moment in time that The Joker learned why Batman was known as a sore loser...
Batman: "I'm the one that wears the cape...!"
I remember when I spammed batman images because it was the only human I could get the original SD3 medium to make accurately
I have a 64GB M3 Max and use Flux. It’s a bit hard to describe the generation speed for you: one, because I use an adaptive ODE sampler for quality, but it doesn’t have a fixed number of samples and the time per sample is slower than normal samplers, and, two, because it’s been fluctuating up and down as I update PyTorch with new nightly versions. But it’s running as fast as I’ve ever seen it right now at about 6.5-7 minutes for a 1MP generation with bosh3. If I were to use DPM++ 2M, I’d estimate it would take about 5.5 minutes for 50 steps. I’m running the full-fat dev, not a quantization.
yeah speed comparisons can't really work because there are way too many variables
CUDA and torch version matter a lot
and then whichever combination of TensorRT, TorchAO, Torch.Compile, FP8/INT8 Matmul people are using for acceleration
or the opposite, whichever combination of GGUF formats is slowing them down
batspam... yummy
Made a fake "Planet Earth" style doc using Midjourney, Runway, Suno and Elevenlabs... Check it out if ya like. I think it turned out kind of cool. Everything is AI except the text and the editing. https://vimeo.com/998739582?share=copy
This is "Planet Unknown" by Shelby Meinzer on Vimeo, the home for high quality videos and the people who love them.
"A promotional image for a tea shop located on a bypass road, named 'Milestone Cafe.' The cafe has a cozy and inviting interior, with large windows that provide a view of passing vehicles and greenery outside. A prominent 'Milestone Cafe' sign is visible, showcasing the logo and inviting customers from the bypass. Inside, warm brown tones, wooden furniture, and soft lighting create a relaxing ambiance. Tables and chairs are arranged for comfort, and shelves display various tea selections. The overall mood is welcoming,
I want a apple image
Are all these images made by AI?
Are you asking yourself?
They look bad enough to not be AI, unless a layering workflow was used.
Urban camo
I feel like the model messed up here in term of logic but I like it more
This is quite an old SD generated image. SD has changed quite a bit since then and it is hard to replicate this type of camera angle I find and a figure so far away...
nice colours
just looks like an oil painting.
I mean the character style, not so much the medium. Is there an artist known for painting characters in that style
not as far as i know. to describe it, it's just an oil painting of a girl with brown hair. a fairly common oil painted look in fact
For example WLOP has a particular character style
I agree this looks super common, which is why it's hard to figure out, but I could've sworn there was a relatively popular digital artist that made characters/eyes shaped like that, and people were making loras styled after it. Maybe I'm hallucinating tho
you might be thinking of Margaret Keane, but hers were big eyed kids
there is some term for non-anime graphical digital novel or comic art but I forget the name
there are civit loras that use the term
ah I found it
on civit its called Western Comic or Western Animation
Samdoesarts is your answer

Thank you, that was in fact the name I was thinking of, but couldn't remember

It's actually not far off the prompt. She's transforming into data.
ah that makes so much sense
okay the model was right then
I thought you prompted woman in front of city
Draw a puppy
Good morning coffee
"Today, we are adding new high-resolution capabilities to FLUX1.1 [pro], extending its functionality to support 4x higher image resolutions (up to 4MP) while maintaining an impressive generation time of only 10 seconds per sample. Higher Resolution, No Compromise in Speed FLUX1.1 [pro] – ultra mode: This option enables image generation at four times the resolution… "
what is the easiest way to extend an image sideways?
make it from portrait to widescreen for example
Outpainting AFAIK
how do you do outpainting?
There are webservices for that. I don't really do that often. I'd search at youtube ...
Local tools like invokes canvas or comfyui support outpainting.
👋
SD3.5 medium. prompt: a white man wearing a white tee-shirt and a black man wearing a green jacket running away from a small sports car. the car that is fairly far in the distance, in the background, is on fire.
no mutation ?strange
mutation?
no, none of that. SD3.5 large and medium do a very nice job of not doing that.
when it first came out and I checked I had problems with it
you are sure you're thinking of sd3.5 rather than sd3-2b-medium?
i use 3.5 large
maybe you're using different settings than i am. the sampler, scheduler, number of steps, and cfg will have a huge effect on whether you get those mutations or not
you use it local?
yes. for that image, i'm also using skip layers. it's sd3.5 medium. sampler: euler_ancestral, scheduler: linear_quadratic, skip layers: 22,23,24 scale: 1 steps: 36 cfg 3.7 a man and woman gazing into each other's eyes. She is stroking his cheek, his hand rests on her shoulder, sunset.
low cfg, in sd 1.5 my cfg is 7
i never ever go that high with cfg. ever. maybe 5 but that really is rare. yes, low cfg
and low steps but not terribly low
you like creative ?
you restrict too much, you get issues
you might want to experiment. lock the seed down. lock everything down, then do a series of gens where you just change cfg by one decimal place at a time, starting with cfg 3
and working up to 7
and then study them
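A sketch of that sweep as a list of settings, assuming everything except cfg stays locked. The sampler, scheduler, and step count are taken from the settings quoted earlier in the chat; the seed, the prompt placeholder, and the dict layout are hypothetical.

```python
# Locked settings: sampler, scheduler and steps come from the chat above;
# the seed and prompt are placeholders.
LOCKED = {
    "prompt": "your prompt here",
    "seed": 1234,
    "sampler": "euler_ancestral",
    "scheduler": "linear_quadratic",
    "steps": 36,
}

# cfg 3.0, 3.1, ..., 7.0. Counting in tenths with integers avoids the drift
# you get from repeatedly adding 0.1 to a float.
cfg_values = [tenths / 10 for tenths in range(30, 71)]

sweep = [{**LOCKED, "cfg": cfg} for cfg in cfg_values]

print(len(sweep), "generations, cfg from", sweep[0]["cfg"], "to", sweep[-1]["cfg"])
```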
I don't understand what was updated in my SD. now I can't use IP adapter = out of memory. before I used it without a problem
are you sure you're using the IP adapter that was trained for the model of stable diffusion you're trying to use?
how many ip adapter version?
ipadapter is a model. and just like all other models, like loras, it will only work with the base model it was trained on. you can not use IPadapter for SD 1.5 with anything other than SD1.5 base or SD1.5 checkpoints. if you want to use SDXL, you would need an IPadapter trained for SDXL. same with SD3.5 - you would need an ipadapter trained for SD3.5 and it's not out yet
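The rule above as a tiny sketch: an adapter only matches checkpoints built on the base family it was trained for. The checkpoint names, the `CHECKPOINT_BASE` table, and the `adapter_matches` helper are all hypothetical; no real tool exposes this exact lookup.

```python
# Hypothetical table: which base family each checkpoint was built on.
CHECKPOINT_BASE = {
    "my-sd15-finetune": "sd1.5",
    "my-sdxl-finetune": "sdxl",
}


def adapter_matches(adapter_base, checkpoint_name):
    """True only when the adapter's base family equals the checkpoint's base family."""
    base = CHECKPOINT_BASE.get(checkpoint_name)
    return base is not None and base == adapter_base


print(adapter_matches("sd1.5", "my-sd15-finetune"))  # adapter and checkpoint share a base
print(adapter_matches("sd1.5", "my-sdxl-finetune"))  # mismatched base families
```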
you have to have enough diskspace on your system drive for the system to run as well as quite a few other things, even if you have stable running on some other drive mostly
check your disk space, check your swap space, and if those are fine, post in the #🤝|tech-support channel and give the exact errors
i hope it help
i will clean my harddrive
Does anyone know if there is an image-to-relief/3D converter model?
draw a paperwall with a hello kitty
Here is the image you requested.
Their last selfie ...
alien robo evolution?
Henry
Looks like ... time traveller 🙂
I hope the method of reproduction has also evolved🙂
They just eat the earth and reproduce themselves
What kind of coffee do you drink, instant or ground?
capsules?
at one time I had a 0.5 liter mug and I drank one such mug of coffee every day in the morning🙂
No capsule made of plastic or alu ... coffee in a paper filter
I’m designing my room in SD , it’s beautiful and interesting, but probably very expensive🙂
Denoising strength =50 .
if more then this is no longer my room
ip adapter still doesn't work
sad(
jewelry store on the face?
pretty expensive 😄 Old prompt translated for Flux
light on the nose not pretty ...
Using a special Flux Version only the trainer and I have and a technology to improve ...
Mojo
Ohhh.... forgot my LoRa
I recently listened to several songs made in suno, I really liked it
I have created some, too
i waifu-ized my elden ring character lol (using Sakimi-Chan flux LoRA and some inpainting, before and after)
how long does it take to learn to do something good?
https://www.youtube.com/channel/UCaIN2kn0zofdQlHPeg15HIQ You also need luck 3 days maybe?
some song like real
If you wanna use them commercially you need a subscription ... but you can let ChatGPT analyse songs that fit the style you want and use that ...
subnautica
no need ,just for fun
Give it a try with the free credits then ... should be enough to create 4-5 good ones.
i cant use it every day>?
Then you will have a good idea how to use it
If I remember right you start with 500 credits that will be renewed every month ...
👍
So sure you can focus on one song every day ...
what do you know about voice cloning, can i find something like elevenlabs for free?
Yes ... Search for RVC on youtube ... but didn't try that combination yet
I have made a model of my voice and used it with a vnese song. It worked pretty good ...
but if the voice in the song you wanna change has been manipulated it doesn't work good
RVC
i read about it little bit
good tattoo for night drive on bike
mass effect in jewelry store🙂
;-P
Ukiyo-e painting with Gustav Klimt influences. Beautiful fairytale princess discovers a big shiny, golden compass in dense, dark forest, gnarly trees, lush green vines and colorful flowers, full of magic and mystery. Dwarves watch in amazement. Sparkling light floats in the air, adding a sense of mystery and fantasy.
Trying out the use of generating landscape wallpapers in different styles.
Welcome! Nice picture!
@languid pebble Thanks, bro....🥹
bark is pretty bad now, there are far far better alternatives
voicecraft - best voice cloning ability but no streaming support and a bit slow(around 1x realtime on gpu)
xttsv2 - ok voice cloning ability, very very fast streaming(0.1 sec) and very fast(6x realtime on gpu)
styletts2 - ok voice cloning ability, no streaming support, but incredibly fast(8x realtime?)
fish-speech - good voice cloning ability, very fast streaming(0.15 sec) and very fast(5x realtime on gpu)
Also, you can look at glm4 voice, it doesn't support voice cloning but it can generate speech with multi-emotion(angry, sad, whisper) and different speech rate like gpt4o.
@scenic yew Hey, bro..... I am new to discord . Can you give me link to any user manual to how to generate images in here ??? I am not going to lie.....this discord thing and chatbot staff are going over my head 🫨
Service car of RoboCop in pinkish neon light 😇
@languid pebble A thank you gift for you, bro..... 
@languid pebble Don't take it otherwise....but your name reminds me of a character, a cartoon character 
thx can i use something in your list online ?
There are many hf spaces for the models, just search them, you can try here: https://huggingface.co/spaces
best quality is usually locally run but its decent in spaces too
I tried several spaces but the result was not very close
can you give the voice file? which voice do you want to clone?
I tried to clone different presidents😎
You need a voice without any background sounds ...
not close like elevenlabs😎
can you give some voice file that you tested with xtts
i delete it , maybe you show example
how?
it works now with images?
Yes ...
with donald trump
let me try with xttsv2
xtts default settings are pretty bad, you need to change them. You can make xtts even laugh and do sounds like ugh by changing settings. I'm not sure if this is the right channel to discuss xtts though.
I am a nobody here ... but as long as there's no other deep discussion ... just do it!
not worry there we ask about food😆
image or video or sound are parts of one whole
All about A.I.
Why did you use exactly? I didn't really understand.
fish speech
hf space: https://huggingface.co/spaces/fishaudio/fish-speech-1
model: https://huggingface.co/fishaudio/fish-speech-1.4
oh is very fast
but why on korean lol
oh this better and not support my lang((
I tested it with a sample of my voice, but the result wasn't convincing
you usually need 10+ sec for fish-speech, experimenting with the params can help too, Voicecraft is better for shorter samples but the demo has an error right now.
Do you have some settings to recommend?
depends on the sample, but usually
set iterative prompt length to 0
set top p, temp to low if its a very long sample, and opposite if its shorter, should be 10 sec at least
try to keep repetition penalty 1.0 except if you get large spaces
Ok. Thanks.
Xtts is really good yeah, I developed this with it (partially):
Tried running an animation with mochi. Stretches the capabilities of the 3060.
hi..any1 know which artist that has this art style? found this on pinterest and description said it used nijijourney
Try to search on civitai ...
😅 how do i do an image search on civitai? does it has feature like google image?
Black horse man fusion with anime effect
hello there, I'm new and I have just one question for now. I have made this character in Forge UI with a PonyXL model and a bit of inpainting, and now I want to make a character sheet or repose her for a custom LoRA. I watched over 10 hours of tutorials but nothing worked for me. does anyone have an idea or a working workflow for ComfyUI (yes, I have that too)?
#Black horse man fusion with anime effect
Black horse man fusion with anime effect#|
how do i make the whole body instead of only the head to face the camera? what prompt should i add?
It sounds like the name of a model. You could search for that
the sweatshirt came out great on this one
This one too
the lady looks the same too
Same description 🤷🏻♂️
"blonde lady, sonic sweatshirt, masterpiece, absurdres, greg rutkowski"
don't use greg unless you are doing sword and sorcery paintings. that's all he paints
it was a joke 😉
What model and loras do you use for cars? I've been looking into how to do cars properly
Flux with bio-ink lora for the glowing patterns.
Bio ink on civit AI? Also one more question how is the prompting formula for cars with Flux
Yes, on civit. Just prompt for the car you want, it's pretty simple.
You don't need the lora for cars though.
Ok thx, can I specify certain things on the car that I want to be different?
yeah, just use the specific make and model in your prompt
Thanks, I've struggled so much with AI and cars that I never tried doing it on flux
I will try now when I get home
A car with tuning modifications and sleek body-kit was in my prompt.
👍
I’ll try specific things on the car, and if that doesn’t work out all too well I’ll keep it simple like yours
True
Is it Bulldozer AMD architecture?
flings rotten tomatoes
The headlights still annoy me how odd it looks
they don't look nearly as odd as some of the ones i see on cars on the road
:( post the errors in #🤝|tech-support and maybe @dry crow can figure out what's going on
i don't know what the error is - it's only out of memory
i was already in tech support with this question
it didn't help
what prompt should i add to make the person stand straight, as if she's taking a passport photo?
you could try adding passport photo to the prompt
i'm trying to make this pose but the body refuses to face forward 😅
heres another 1
Abdalfattah el sisi 3d character
#🏞|general-with-images Abdalfattah el sisi 3d character
#artisan-1 cat
so this happened to me when i tried to generate a medieval warrior with "plate armor"
well - apparently he thought it was dinner time ;)
thanks pony
king
king god
King Charles III, formerly known as The Prince of Wales, became King on the death of his mother Queen Elizabeth II on 8 September 2022.
In addition to his official and ceremonial duties in the United Kingdom and overseas as The Prince of Wales, His Majesty has taken a keen and active interest in all areas of public life for decades. The King has been instrumental in establishing more than 20 charities over 40 years, including The Prince's Trust, The Prince's Foundation and The Prince of Wales’s Charitable Fund (PWCF).
His Majesty has worked closely with many organisations, publicly supporting a wide variety of causes relating to the environment, rural communities, the built environment, the arts, healthcare and education.
Above post is from a 6 year old, or a meaningless bot.
I've tried new IP-Adapter for Flux, it's really great https://x.com/ShakkerAI_Team/status/1855962063165891069
IP-Adapter for FLUX.1 is here on https://t.co/YcvIezXSsP. A new generation of AI creative tools is coming! Helping you generate images in any style you want. #shakkerai #ipadapter #flux
For some reason the prompts I’ve been using consistently for a while have drastically changed the art style they generate, for the worse. Any idea why that could be?
New model? Different settings?
You must have changed something, give us a clue.
what causes the small glowing specks to appear?
try using a different VAE
the only option in VAE is auto or none
you can get other ones from civitai / huggingface / etc
VAE quick guide:
1/ What is a VAE?
It's the part of the stable-diffusion pipeline that encodes images into latent space and decodes latents back into pixels. In other words, it turns the math into pictures.
2/ Where do I put my VAE?
- VAEs with .vae.pt, .vae.ckpt, or .vae.safetensors extensions go into the models\Stable-diffusion folder
- VAEs with .pt, .ckpt, or .safetensors extensions go into models\VAE
3/ How do I use my VAE? Three possibilities:
- Name it after one of your models (e.g. Anything-V3.0.safetensors + Anything-V3.0.vae.pt); it should then load automatically when you load the associated model.
- Load it manually by going to Settings -> Stable-Diffusion -> sd_vae and selecting your VAE.
- Add an easily accessible VAE dropdown at the top of your page for quick switching by adding sd_vae to your Settings -> User Interface -> Quicksettings list.
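A minimal sketch of the file-placement and naming rules from the guide above. The function names are made up for illustration, and the folder names are copied from the guide; check them against your own install.

```python
from pathlib import PurePath

# Suffixes the guide lists for VAEs that sit next to a checkpoint.
PAIRED_SUFFIXES = (".vae.pt", ".vae.ckpt", ".vae.safetensors")
# Suffixes the guide lists for standalone VAEs.
STANDALONE_SUFFIXES = (".pt", ".ckpt", ".safetensors")


def vae_folder(filename: str) -> str:
    """Return the folder the guide suggests for a VAE file (hypothetical helper)."""
    name = filename.lower()
    # Check the longer ".vae.*" suffixes first, since they also end in ".pt" etc.
    if name.endswith(PAIRED_SUFFIXES):
        return r"models\Stable-diffusion"
    if name.endswith(STANDALONE_SUFFIXES):
        return r"models\VAE"
    raise ValueError(f"not a recognised VAE file name: {filename}")


def paired_vae_name(checkpoint: str, vae_ext: str = ".vae.pt") -> str:
    """Build the VAE file name that should auto-load with a checkpoint."""
    stem = PurePath(checkpoint).stem  # drops only the last suffix
    return stem + vae_ext


print(vae_folder("Anything-V3.0.vae.pt"))
print(paired_vae_name("Anything-V3.0.safetensors"))
```

So a VAE meant to auto-load with Anything-V3.0.safetensors would be named Anything-V3.0.vae.pt and sit next to the checkpoint.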
In civitai you can filter VAE
thanks! that fixed the problem 😁
3.5L Turbo
@languid pebble after digging it appears this is the problem
My new code is using this endpoint to generate the art: https://api.stability.ai/v1/generation/stable-diffusion-xl-1024-v1-0/text-to-image
The endpoint that generated my previous art is: https://api.stability.ai/v2beta/stable-image/generate/sd3
I’m attempting to fix it now, will report back
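A quick way to see the mismatch is to compare which model each URL points at. This is only a sketch: `model_family` is a made-up helper that does plain string checks on the two URLs quoted above, and it makes no network calls.

```python
# The two endpoints quoted in the messages above.
NEW_APP_URL = (
    "https://api.stability.ai/v1/generation/"
    "stable-diffusion-xl-1024-v1-0/text-to-image"
)
OLD_APP_URL = "https://api.stability.ai/v2beta/stable-image/generate/sd3"


def model_family(url: str) -> str:
    """Guess the model family from an endpoint URL (illustrative heuristic only)."""
    path = url.lower().rstrip("/")
    if "stable-diffusion-xl" in path:
        return "SDXL"
    if path.endswith("/sd3"):
        return "SD3"
    return "unknown"


for label, url in (("new app", NEW_APP_URL), ("old app", OLD_APP_URL)):
    print(f"{label}: {model_family(url)}")
```

The same prompt sent to two different model families will usually come back in a different style, which matches what was described.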
Yes ... different models need a different type of prompting
With the latest models you can prompt more in natural language and more complex
because the newer models use t5xxl - but it can also cause you issues and run off down rabbit holes more easily
Works fine here 🙂
Yeah, it works fine with my other app, but my new app is having so many issues getting it to fetch the API. Had to take a break from trying lmao
hi, I have a question since I'm not familiar
I'm using illustro and my image is blurry. Here's my setup; unsure if you guys would know how to fix this?
I don't use A1111, but make sure you have a valid VAE selected.
Change the width and height to 1024x1024 (presuming it's an SDXL model?).
Sampling steps to 25.
Hires steps to 20.
Denoise to 0.4.
CFG to 5.
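If you drive A1111 through its web API instead of the UI, the settings above could be collected roughly like this. It's a sketch that assumes the `/sdapi/v1/txt2img` field names as I understand them (double-check them on your install's `/docs` page); the prompt is a placeholder and nothing is actually sent.

```python
import json


def build_txt2img_payload(prompt: str) -> dict:
    """Collect the suggested settings into one request body (assumed field names)."""
    return {
        "prompt": prompt,
        "width": 1024,               # SDXL-sized canvas
        "height": 1024,
        "steps": 25,                 # sampling steps
        "cfg_scale": 5,              # CFG
        "enable_hr": True,           # hires fix on
        "hr_second_pass_steps": 20,  # hires steps
        "denoising_strength": 0.4,   # denoise
    }


payload = build_txt2img_payload("a placeholder prompt")  # hypothetical prompt
print(json.dumps(payload, indent=2))
```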
Hangover coffee 😄
Cheeky robot ^^
So lazy that I prompt my signature 😄
good way to invite forgeries
They don't have my A.I. Model ...
they don't need it, forgers just need your signature
Nobody willing to do that with me 😄
trimmed it short to keep things sfw
sometimes the hands are not ok sometimes it works perfect ; ) its custom i2v
A.I. and hands ... even harder with video ....
i will try and retrain on bigger models that are based on flux , but it takes 6 a100 to train, thats 14$ / h
probably 80-100$ per lora.
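Back-of-the-envelope check on those numbers: at the quoted $14/h, an $80-100 budget implies roughly six to seven hours of training per lora. A tiny sketch (the helper name is made up):

```python
HOURLY_RATE_USD = 14.0  # quoted price for the 6x A100 setup


def implied_hours(cost_usd: float, rate_usd_per_hour: float = HOURLY_RATE_USD) -> float:
    """Hours of rented compute that a given budget buys at the quoted rate."""
    return cost_usd / rate_usd_per_hour


low, high = implied_hours(80), implied_hours(100)
print(f"{low:.1f} to {high:.1f} hours per lora")  # 5.7 to 7.1 hours per lora
```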
Pretty expensive ...
Im betting on the fact that there are thirty people out there
Do you need them to get that cash back?
thats why im doing it in first place. $
Ahhh... a friend has made a Flux based model ... good luck for me nobody else has it local ...
lora or model?
nice
yeah flux lora can be done on 24gb vram
I am not sure exactly what you trained
did you fine tune cog
Mochi-1 is surprisingly good with hands especially for open source even with anime which its not trained for but obviously not near perfect
Pretty good!
This is probably a better view of realistic hands with mochi
the background is supposed to be blurry - that's how you can tell it's in the background, not right behind the subject
the blurry part is because of vae tiling. You can turn it off, but it will use massive extra amounts of vram. There are quite good gens, some I made and some from other people, but yeah, it's limited to 5 seconds sadly, and no image2vid yet. The devs are saying it should come before the end of this year though.
Can you tweak it reducing the FPS?
Yeah I think so, but that will just make it slower or faster.
With SVD you can create at 3 FPS and render the missing frames later ....
If you are happy you can also SloMo it by 2 times
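Rough frame math for that low-FPS-then-interpolate trick, as a sketch. The 25-frame clip and the 8x interpolation factor are assumed example numbers, not anything stated above, and the helper names are made up.

```python
def duration_s(frames: int, fps: float) -> float:
    """Playback length of a clip in seconds."""
    return frames / fps


def interpolated_frames(frames: int, factor: int) -> int:
    """Frame count after filling each gap so there are `factor` times as many steps."""
    return (frames - 1) * factor + 1


base_frames = 25  # assumed clip length, for illustration only
base_fps = 3      # generate at 3 FPS as suggested above
factor = 8        # assumed interpolation factor (3 FPS -> 24 FPS)

smooth = interpolated_frames(base_frames, factor)
print(duration_s(base_frames, base_fps))       # length of the raw 3 FPS clip
print(duration_s(smooth, base_fps * factor))   # about the same length, but smooth
print(duration_s(smooth, base_fps * factor / 2))  # 2x slow motion
```

Interpolation keeps the clip about the same length while raising the frame rate; playing the interpolated frames at half that rate gives the 2x slow-mo.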
#🏞|general-with-images message Tweaked SVD
good night
Oh you mean like that, I don't think so right now. But you could probably use some other tools, and when img2vid comes, you could probably extend videos.
and yeah you can slowmo it too
good night coffee
Nini dicordos
We'll see
Anyone have adobe illustrator and can just quickly resize some things in an image for me for 10 bucks?
@languid pebble Good evening coffee
Good morning coffee 🙂
three-dimensional ball made of sheep fat jade in Hotan, Xinjiang, with a white and round png picture in the background.
read the information here #artisan-faq
I think I have a good img2img setup now
I can use wd14 tagger, florence2 and ollama.
A digital illustration of a cute, anime-style girl with long, wavy brown hair and large, expressive purple eyes, smiling with a playful expression. the girl is positioned in the middle of the image, inside a cube-shaped light box, which is placed on a wooden floor in a dimly lit room. the background features a cozy living room with a dark couch and a rug, creating a warm and inviting atmosphere. the lighting is soft and warm, casting gentle shadows that accentuate the girl's features. the box itself is square in shape and has a cartoonish, kawaii style with a soft, pastel color palette, adding a touch of whimsy to the scene.
not a bad description. The original prompt, however, was a little shorter: a photographic 3D cube with an anime face painted on one side
That prompt was auto-generated by your image.
using Florence2 Promptgen 2.0.
Then fed at 0.5 denoise.
right. i understood that and I think it did a really good job
want a couple more images to test it with?
Sure. It probably won't be perfect with all of them.
This is with Pixelwave Flux finetune btw
And the Vit-L improved clip.
these should be somewhat complex for it. let's see how well it does
A minimalist rhino head in a faceted, geometric style with sharp angles and a prominent horn. The design uses bold, black shapes without a background to create an abstract yet recognizable silhouette.
A cozy bedroom in morning sunlight, warm sunbeams through white curtains, unmade bed with cream colored bedding, a sleepy cat stretching, coffee cup on bedside table, minimalist interior design, soft lighting, lifestyle photography style, warm color palette, high-end interior magazine aesthetic, 4k, detailed, --ar 4:5 --style raw --v 6
Good morning coffee
add realism
A digital illustration of a cute, anime-style girl with long, wavy brown hair and large, expressive purple eyes, smiling with a playful expression. the girl is positioned in the middle of the image, inside a cube-shaped light box, which is placed on a wooden floor in a dimly lit room. the background features a cozy living room with a dark couch and a rug, creating a warm and inviting atmosphere. the lighting is soft and warm, casting gentle shadows that accentuate the girl's features. the box itself is square in shape and has a cartoonish, kawaii style with a soft, pastel color palette, adding a touch of whimsy to the scene.
read the information in #artisan-faq
mass effect
Good morning coffee
Good morning coffee
Fortnite... Flux model is awesome.
what's fortnite
Good morning green tea
a childs game, there is a skin for arcane jinx.
Author: @gleaming fox
MotionPrompt: $freefire_michael_jakson_emote
Command: </random:1272485059353640963>
Background: from video
Model: v2-turbo
Explore more features at viggle ai
I did this when Flux came out (that's me in the photo, looking chubby)
I like how Kling makes it the most
5 Seconds ... ... ...
before, kling could do 10 seconds
Just a fun thing ...
kling does 10 second videos
5 Seconds is more SVD .... yeah it can follow a prompt .... but 5 seconds isn't enough
not now on a free account
that's because you have a free account. you have a demo account, you only get a taste - they're not going to change that
we have no choice, models with this quality are not available locally
sometimes mochi does ok
Runway can do video to video
https://bento.me/guillaumedagens
https://www.instagram.com/guillaumedgns/
GTA IV, where Rockstar's cult game comes to life like never before! Thanks to artificial intelligence and the Runway Gen-3 Video to Video model, this gameplay has been transformed to resemble a scene filmed with a digital camera from the 2010s.
🎥 What you'll see:
Ul...
runway?
meta
what did you use: prompt, video, or image?
prompted an image and then told it to animate it
How do you use this type of open pose with an already predetermined shape? What type of controlnet should I use? Directly using Openpose_full results in a black image
Hello! I'm new to this server, although not new to Stable Diffusion. Would like to ask for advice or ideas if anyone has any. I'll attach a couple of pics here to start with. This is an OC I have created and keep using for many different gens. I've trained a Lora (then re-trained, and trained again and again) and by this point I'm achieving really good consistency with her features etc. --> I don't necessarily need to prompt anything about her appearance, the lora usually handles it all. Or I can just prompt something about hairstyle or expression etc.
I've kinda accepted that the results are somewhere between anime and photorealism. Like... quite realistic, but you can tell it's not photorealistic. It might be that my lora is just creating a sort of bottleneck here: since the images used to train the lora were not photorealistic, the outcome with the lora can never quite get there. Right? I include things like "photorealistic" in my prompts. I am not complaining, just wondering. Would you guys easily come up with any ideas on how to increase the level of realism, textures etc.?
I use flux and tend to use rather simple and short prompts. I can include one example below. Should I go for more complex prompting to achieve realism?
Photorealistic picture, cinematic style, a beautiful mature British woman. She's wearing casual outfit with white ribbed tank top, oversized flannel shirt worn open, stonewashed blue jeans, red converse trainers, large wooden beads necklace, bracelets, round sunglasses. perfect hand,HDR, intricate details
That third one has the tiniest tie I've ever seen 😂
Good morning coffee
Haha yeah. I guess if/when ladies wear neckties, they're not supposed to be very long. But still. 😄
a kitty
generate a dog
good morning, generate a dog
She wears a tie? Maybe the tie just appears smaller due to the rest of the composition 🙂
you should probably remove the "photorealistic picture, perfect hand" part and maybe the intricate details if you are fine with it and maybe add "shot from iPhone, shot in 2004"
should help a bit, or you can just apply a lora like this one: https://civitai.com/models/689192/aesthetic-amateur-photo-flux-dev
Guys i need help please
do you guys know how to change an anime art style in AUTOMATIC1111 without changing everything else? I mean not changing the face, hair, or costume, just the art style
like this art : https://www.instagram.com/animazing.art/
I tried a couple of methods but they didn't work
and do you guys have recommended settings, like adding a VAE tab or a clipSkip tab? Thanks btw
use different models, loras, artist tags in prompts, etc.
#1237460438229450772 a car
Thanks for the tip! I tried modifying the prompt but it didn't really make much of a difference. I think the lora is the reason; it's been trained with not-photorealistic images so it affects the outcome. Ah well. I'm okay with that.
😉
I keep getting OOM when I try to inpaint on a1111 with SDXL. What's the alternative? I just need to be able to use grounding DINO with it
switch to swarmUI rather than a1111?
Ok I'll check it out
stable foundation
looks like it's taking a leak
2 mouths to feed 🙂
thats Schlonky Bong, the forgotten nintendo character
I see the reasons 😄
a cute girl'
I need various cartoon shapes of an animal soft rocking horse