#✨|sdxl
1 messages · Page 116 of 1
Show me. 🙂
I used it to make taylor z'dar swift
that workflow only uses one ipadaptor though. prompted taylor swift. had the image of mr z'dar
I haven't bleed in a good two weeks so I dont' plan to change that if I don't have to
😄
steven z'dar seagal
just like look up an image of someone skinning a buck in their garage
or I guess there's a war right now probably tons of vids on liveleak
I use it to make quality things
I'd strongly suggest you check out the ipadaptor stuff. if you're trying to be a purist with the nodes make your own I guess
Not XL but since we're doing cursed images, I merged conditioning for Shrek and Rainbow Dash
left one is probably my favorite SD 1.5 image lol
lovely
it's perfect ✨
yes, cursed is a good word
I have a more cursed one
druggie shrek hit another ogre with his truck while driving under influence
thx it's the image that made Dreamshaper my defacto standard 1.5 model
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. - GitHub - tencent-ailab/IP-Adapter: The image prompt adapter is des...
It's like a witch saw a girl with that big eyes and lips snapchat filter then cursed her so her face contorted to form it in real life
so im not the only one making cursed pony hybrids
I should write that conditioning interpeter node and make a new one with SDXL
just made that with ipadaptor
nah that's just shrek as a basement brony boy
Very sexy images.
looks like he's been huffing paint
What better then a coffee and some cursed images on Sunday. 🙂
Cursed pony 1.5.
if I have two timestep ranges would it be a concat or a combine so they work together?
combine
concat is a fairly new concept to me
combine works like you'd expect
XL decided to PS the head on, lol
so what's the deal with ipadaptor outputting black images? what needs to be adjusted?
ogre ponies kinda working now
cool thing with comfy is you can overlap the timesteps
Hmm. Haven't messed with that before
that's what I want to automate with some interpretation of keywords/brackets from the prompts
tedious as hell to do with nodes
3 nodes for every split
or 5 for timestep
Yeah. Needs to be more nodes that make things efficient
you need separate prompts, conditionings, and combines for every equivalent BREAK, + two timesteps for every equivalent square brackets
turns into spaghet real fast
its ok AMD i didn't want to use my vram anyways
look at you doing actual research into the cyclops bias situation meanwhile I'm just making a Sonic the Hedgehog shaped Rainbow Dash
What's using up all that vram in that setup?
amd
it uses like 2.5x the vram do do anything because it's libraries are incomplete
thought I think that specific time was it just not unloading any models cause I technically didnt exceed the limit
1024 batch size 4
some of them almost don't look schizo
Ahh. That's a significant difference. Sucks they haven't caught up
it depends what it is. over in the LLM world where flash attention and all that fancy shit isn't core it gives the 3090 a run for its money
but on Stable Diffusion yea it turns my 24 gigs into 10
Do the models load multiple instances in that situation?
?
Never mind. You explained it a couple responses later
yea 4090 bois can do batch size like 36 or 49 on XL without hitting memory limits while I can barely squeeze in 16. For LLMs though I compiled a patched fork of AutoGPTQ and my results were almost identical to Sytan's 3090 in our initial tests. Mine was actually using a little less memory cause lightweight desktop environment
need to learn more about what sucks up vram
depends
How does anyone do a BS that big? not in training and I don't have anything that goes that high, though comfy I never go above 4.
sd 1.5 resolution absolutely bombs your vram, so running img2img at like 4k is impossible for me
sdxl resolution doesn't bomb it as much but the VAE spikes it harder
just like set it to 36 see what happens
do science
commit crimes
I use AIT 4 is it
yeah, what's up with the vae thing? I've notice that
I'm not fucking with the AIT nodes till they're officially added in comfyui
Wow
that kid has some tooth decay going
Yeah, that seems to be his problem. Everything else is fine.
you can notice it on 1.5 too if you hit really large resolutions
all of a sudden it takes like 30 seconds to decode
what is the different between -O and -P before the path?
idk check wget -h to see what the flags do
I just made some vae nodes. still can't say I completely understand what they do though
but I get sick of having to run vae's to each encode and decode
you get used to it...
and don't want to download a bunch of nodes just for that. I know others have done it
I added pixel scaling to my Principled node and had to include the vae so it could decode then immediately re-encode
I think these latest cursed images are too cursed
hey hello i am reinstalling my stable diffusion due to some errors can you tell me from where i need to install for latest version(github)
just saw my gpu hit 380 watts on "silent" mode can you not pls
can you tell?
.
so you mean a1111
idk why you're pinging my message about watts for that
also it just hit 401 watts why
thought you were talkinig to him
speaking of watts, I tried to get chatgpt to give me instructions on putting together a dual a100 laptop
it seemed to tihnk it wasn't a reasonable idea
I know
unless you used external enclosures
am I dreaming? is comfy history now in reverse order?
yes
its so you can quickly load last
he finally caved and changed it after being asked about it a bunch of times lol
oh thank god, been wanting that for so long 😍
are you guys using webui or local installation?
you'd need to carry around a 30 lb lithium battery to power it
sonic the ogre
those are the same thing?
well I mean, there are several programs that can run stable diffusion
we'll need a whoooole lot more to go off of than that
like OS, what interface you're using and how it was installed, gpu, what do the errors say, etc
download forom here again
i am using rtx 3060
but thats for laptop
just want minor stuffs to be done
use the 1-click installer auto has. beyond that not sure what else to do.
lot of windows users in here have been having problems with nvidia drivers so maybe update that too
i updated that
did the problems start right after?
also cause it's Auto disable all extensions see if that helps
i want that
update the extension or disable it
I like the where's waldo collage lol
like this?
ComfyUI or Auto1111 1.6
either one will work
is auto1111 and sdxl same?
my friend

you should read about what stable diffusion actually is
ay kid you got any of dat speed
yes. what he said
sry iam from unreal community i donot know much about
i just use stable deforum
hope ai in future creates game also
which would help us
long ways off from a whole game
already lots of games with AI generated assets though
mostly textures, sprites, backgrounds
I think AI will pretty quickly improve npcs though
npc doing weird things would be nice
yezz that would be cool
@willow bane
automatic1111 install: https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-NVidia-GPUs
automatic1111 sdxl after install: https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#sd-xl
totally unscripted game. Nobody knows what will happen
I mean you can kinda do that with an LLM already.
thanks
yeah, but with explosions
write a daemon that reads the output log and just flashes a .png explosion on your screen every time someone dies
i use kobold oobabooga and such but usually cant find any use for local ran llm
@visual glade FYI another one I've noticed since yesterdays conversation.
Using git checkout 0e3b64117218c50a554b492269f5f35779839695 this Tiny Terra Nodes Debug works as expected ie prints to condole plus in its own window (1st Image)
Using anything newer (currently on ComfyUI Revision: 1397 [a74c5dbf]) the Node in the workflow no longer displays the debug info although srill in console
ooglyboogly is good for dicking around with chatbots
or just like writing smut i guess
noice
my native language is not english (should be evident), maybe that is the reason i dont enjoy llm that much
in other languages it doesnt work as well as english
not even close
and all the fine tunes will be even more english biased
cause the tuners are all english
yeah
damn brits colonizing everything
or even if they are not, they will do it in englsih because it is the damn standard
i demand all llm to be written in basque!
so is Air TRaffic Control
basque sounds like a made up language in a homebrew DnD campaign
we really need to just revert back to latin
what I dind annoying is they default to bloody American English ;o)
I used latin to bypass bing's guardrails. didn't work the next time I tried it
american english is the more proper english
britain is the one who developed an accent post-colonial era
america is more old fashioned
I realized it was doing web searches for the latin I was sending it
American English is the result of one mans laziness compiling a dictionary :p
rather than just translating it
modern basque is made up in some sense. It is a mix of amny local basque variants made in the 19th century. It is called batua.
minus the colour -> color and defence -> defense shennanigans
and apparently by the time it came back with the web search results it was down for whatever
I will concede that from an histroical perspective aluminum is more correct thatn aluminium 🙂
not the first time the british have conceded 🎆
didn't know they spelled defence like that. 😐
did you use inbreed family in the prompt?
so fence defence
defenzee
then the dictionary guy changed it to defense cause its easier to spell
in the Caribbean this is de fence
just used a couple of my existential bean nightmare images with ipadaptor
it worked
inbreeding worked..?
it worked to generate a nightmarish image
that from a prompt i found in civitai
tomorrow I'll have to learn this ipadapter thing everyone's hype about
feed it that image of Pepe the frog giving anal birth to Hank Hill and see what magic it creates
I need to learn more about ipadaptor myself
feed it the pepe Hank image
maximum learning
why torch is not reconizing my gpu
the first time i use it was using
bruh
but now this is not
one moment
post it on Twitter or something saying "artist's rendition of Trump at the trial" get lots of internet points
using @ionic gulch inpaint on the upcoming NightVisionXL 🙂 gotta love it
the super soaker assassin
generated
loading curse [#### ]
lol
I had to change it up. the images were so cursed they kept coming out as black squares
need to figure out why that happens
is it black in the whole preview or just when saving
use the fully qualified fp32 vae if it's saving
so they'll only do that when finishing sometimes?
if it's a vae precision issue then it'll turn black when saving
the SDXL vae turns black if autocast to fp16. I know there's a "patched" one on hugging face but even that turns black on img2imf or goofy resolutions
ahh. well I don't even know which vae I have. "sdxl_vae"
gotta use the full 0.9 vae and leave it on default settings for comfyui
in auto you probably need to manually set full precision vae
I'm assuming that's not the 0.9 then?
does it give rainbow barf lines on some textures? that's the 1.0 vae
yeah, not sure why you'd know. but no, I haven't noticed that at all
I forget where I downloaded it though. probably huggingface
zoom in on eyes n stuff
but it's weird' several in a row might be good then several black ones
of the same everything too
other than seed number
the black squares are actually the reason I started thinking about the conditioning node strength node
seems like messing with those sorts of parameters would help
should i install CUDA?
if you want to use the program
so yes ?
yea VAE probs
it looks fine in the live preview right?
download the SDXL 0.9 vae from stability'a hugging face and see if that fixes the borken seeds
downloads
?
lol, meant "downloading" as in the 0.9 vae
nice texture there
very heavy impasto, inspired by Thaevil1
Cool that the smoke comes out of the glass, he was a magician also hahaha
maybe he is drinking chlorhydric acid
This is spreading colors by knife 🙂
Yes.
Try something like: A hyper abstract macro close up shot of ((a lady standing on a mountain)) facing the camera, using very heavy brush strikes and pallet knife phat thick paint, dripping paint effect, using only the colors red yellow and black.
Force colors also.
Maybe some abstract, surreal theme? 😉
will try something thank you for suggestions!
And tailor your - only for what is needed.
txt, signature, multiple people, dupes
Is what I like when doing abstract and people.
Also cracking paint is nice to add stacks nice with the knife.
i am just testing and playing 🙂 Going test.
@stone fossil do you think it is posible to achieve droste effect? I mean picture in picture in picture......
Not sure what you mean by that.
But I think that is kinda what I have been doing with people wearing hats with worlds in them.
Like this one.
We create a subject in a subject.
Anyway sky is the limit. 🙂
One thing I have also noticed is AI loves effects that are not real.
i will show you link on wiki
So for example:
Electric galaxy blur
Diamond prism glow
Rainbow smoke trail
Neon watercolor burst
Pixelated firestorm
Rustic kaleidoscope swirl
Glitchy lightning burst
Icy water reflection
Cosmic flower bloom
Grunge confetti explosion
Luminous dust cloud
Frozen crystal mist
And you can make up anything lol.
@stone fossil i think it loves not real things but not as photo imo.
🙂
anyone plz solve it
😄 noice.
thats that droste effect but not generated just link to it
@willow bane better to ask in #🤝|tech-support
asked
damn, it got pokemon right
pathfinder?
Look at his fingers, does Steven have Parkinson's?
Try first to add --xformers in your webui-users.bat and then check
@low hill like that?
What GPU u try to run it on?
Guess RTX 4090 ;D
@stone fossil i think gpu is in his nick 😄
xD
U do not need them in ur nick but under ur belt. 😉
Here started to judgle with multiple GPU's in 1 rig a bit back.
It can be useful.
damn, your 3090 consumes 412W
Can do 420.
I love that number.
Also ive caught it misbehaving peaking above but its fine.
Temps are fine fo sho.
Is that a pregnancy test? If so whats the outcome? 😄
Very Nice, reminds me of something I did in 1.5 a few months back
does anyone else have this (mildly) annoying issue with ComfyUI ?
nope
@stone fossil where do get innovative prompts from?
You recognize this one?
hey guys
Had a quesiton
I'm looking to img2img with sdxl, but I need the "denoise" option for the advanced Ksampler
Can't seem to find it
Damn, dude...what did Tom Hanks do to warrant this?
Advanced sampler uses the “start at step” as the denoise in a way.
yes I've read up on it now
denoise = (steps - start_at_step) / steps
Ok yeah thats better thanks
What I didnt understand is that is start at step=0 %100 denoise?
sure doenst seem like it
How many steps do you have going into the inpu?
It's 25, at 20 refiner kicks in until the end
here 40 / + 10 or 50 / + 20.
Do you use the refiner?
Ofc.
I've made some loras with good results, but they are soooooo big. 1.7 GB. do you know why?
Lol, I mostly don’t
Well I check the base img that pops.
That’s a fine tune question, I’m not good with training
If it sucks I wont even let it hit the refiner.
Cancel it nothing is saved.
Even with a batch.
Same
these are samples from 3 different loras. They are very good and Lora works great. But they are so big...
These look really good
thanks
Warning: Dark humor ahead.
Gotta head down to the store and get some stuff.
Also available in a box.
...or you can pick one up from the generic aisle:
Looks like this one is already defect
That's what you get for buying generics.
But at least it's sealed.
Do they also have sealed kittens?
why when I generate some images in the bot, it suddenly appears as a BIN file to download?
Found that. Do you think its still good?
??????
I found this one at the local store for ya.
It's a multipack, so probably saves a few bucks.
I think most of the users in this channel are generating locally. Did you asked in the bot support?
Remember bonsai kitten?
I'll go see
@hardy cipherCan you believe that we were all gushing over dall-e and dall-e 2 last year. Now we can make better and faster results than both MJ and dall-e 3.
Dall-e 3 is a thing in closed invite access.
Bing does have a great image creator though.
I can't think of anything that's progressed faster than text to image ai technology
It uses dall-e but also is just better than dall-e 2 in all ways.
The moment the concept turned the corner into something that could be developed further, it was bound to take off like a rocket. I keep trying to tell people that they need to get on-board immediately or be left behind.
Pretty much all generative AI is going to be on that path for the near future.
parabolic
love playing around with multiple img2img techniques. with multiple steps you can do nice things with img2img, ip adapter and inpainting. copy & paste (clipspace) makes it easy to move images between output and input to refine the image in multiple steps.
the llms for me are hard to get into. I really want to check stuff out but when I do it takes up a huge amount of storage and can easily be installed wrong. @hardy cipher
I need to figure out how to code something for ComfyUI that allows for an addition to the main menu. I've got this idea that I think everyone would love.
Visions Of Chaos has a neat download structure though.
I rage quit and deleted my attempts several times
but I hate giving up
I actually think that Visions Of chaos should've popped off like all the other UIs
For me I did the latest Lama. It was simple to set up. I got it running but knew that I did something wrong when it all it did was answer with dumb shit.
be careful with that or else the mods will take it down like they did with my "slight nipple" post
good way to use my workflow:
- generate some images with all four options (base, base+refiner, both upscalers) turned on.
- if you know if refiner helps for the style you want to do turn off upscalers and base or refiner
- create images fast until you find a good one
- use the recycle button on the seed node to get that seed back and generate an upscaled version
- use copy & paste (clipspace) to copy the image to genimage input
- use inpainting to fix details or if you want to have different images in the same style use img2img (test different strength settings) or ip adpater with a text prompt that describes the most important elements of the image
inpainting workflow?
woolen stuff?
crochet
yes, added inpainting 1-2 days ago:
https://github.com/JPS-GER/JPS-ComfyUI-Workflows
IMO my solution gives much better results (no / much less artefacts) than the example workflow provided with the model. didn't see any other workflow that does that much to improve inpainting yet.
sweet. I'll check that out a little later
when I have the time to do such things
with automatic1111 what is a good number of steps? 40 ? 60? 80?
......
in all seriousness, it just depends for me
the more intricate something is, the more steps I'll tend to want
longer prompts, in my experience, seem to do better with more steps
thanks. And wich sampler do you prefer?
Is this with ComfyUI?
always go for the one with the words/letters. that means "best"
I actually don't know. was always a euler a guy
some of the change the image with more steps, others more refine it rather than change structure
some of them are definitely faster than others, or slower. heun takes like twice as long as the others per step
but it's good for things like landscapes I believe
I've doubt between 2M and 3M Karras
i'm using a lot of steps. 60-80 for euler a, 80-100 for dpm 2M, because images get better and that part of the process is quite fast on my 4090. vae and other things take more than half of the time generating images.
JPS, you know about ipa adaptor. why would I render a few images fine, then all the sudden with the same configuration get 5 or 10 completely black images?
you should check out my vae encoder/decoder nodes when I upload them. maybe not breaking new ground with them
but they load the vae in the same node, and singleton so only one instance of it
never had black images on my workflow/system. biggest problem i have with it are extreme aspect ratios like 21:9 or 9:21 - those move the main content to the side of the image, because the interal calcualtions of ip adpater somehow use square aspect ratio all the time.
16:9 works, but also moves the content of center most of the time
it's one of those things that I believe has a solution, but I can't find any info on
I mean, all software issues technically have solutions
but you know what I mean
was thinking of making some custom conditioning modification nodes just to mess with the input of things to the sampler to see if that's the issue
because no errors so hard to really pinpoint what's going on
accidentally flipped the positive and negative last night. it's better than what I was actually trying to make
Is there any way to fix hands like we can do with faces ?
@cyan crown you can use inpainting and just redraw until you get good ones for example
in past experience the most efficient route was to just use photoshop or even paint to add proper hands
control net pose also has a hand feature, but no yet available for sdxl afaik.
and then use inpainting or something to integrate them
controlnet!! I'm also waiting for qrcode monster for sdxl
or you can add different hands in photoshop or some other app and use that as base for a soft img2img to merge them with your image
I find it pretty interesting that it flat out refuses to do cyclops unless really finely groomed to do so. but it creates fingers like they're just texture or something. just keeps adding more if the noise looks like it wants them
for photo yes, but for drawing the only solution seems inpaint
16 dpmpp 2m karras
7 steps
No, but loved how it came out.
be a mad lad
16 gives bad results
low steps work with lots of bokeh in my experience
but if you want the whole image to pop and be in focus you need more
if you finetune strength settings you can merge different styles with img2img, canny or depth control net, but if you only want to change the hands and keeps everything else the same inpainting is the best, if you can't draw the hands yourself.
7 steps
I'm starting again using Automatic1111. But hires fix seems working bad with sdxl
the highres fix is basically just a predefined img2img + upscale process. you could do img2img yourself and have more control over the strength and other settings. this also has the advantage that you can only do it for images you like and really fine tune them, instead of having that highres fix option turned on all the time (as i did myself in the beginning).
Yes i was used to do with img2img, but scaling 2x with automatic1111 takes half an hour, while with comfy oly 1 min
I love my new crappy newspaper photo lora
sounds like a vram problem. 2x shouldn't take 30 minutes unless you run into shared vram memory. maybe try the sd.next fork of a1111 - that one does some things different. maybe it works better for you with your amount of vram.
12GB 4070
100 steps 4x upscale?
80 2x
with comfyUI 1 min or 2
with auto1111 20/30 min
also generating images is slower
why do you want to use a1111 if comfyui works better for you?
Somehow I prefer results coming out from Auto1111
there are some nodes that try to replicate the prompt handling of a1111 - maybe that helps. but i don't know for sure - haven't had time to check those nodes or prompts in detail myself.
why make your workflow so complex? why not use comfybox?
Yes.
haven't had time to check the various comfy frontends yet and if one of them could replace my menu system without giving up features or usability. how does it handle moving images between different workflows? would i be able to use my current workflows and just replace the menu system or are there limitations?
Love it ... Cool stuff. Your own workflow?
Yes and lora.
you can send to img to other load images in other workflows in comfybox easily
@mossy canopy and i can add an unlimted number of own smaller workflows and move outputs from one to the other easily and turn every kind of input to a web based menu option?
yes.. you can check their default workflow.. and you will know...
might be interesting. will have a look at it if i have a little bit of extra time.
how does it compare to similar solutions? afaik there are 2-3 of those frontends availabe for comfy.
I was gonna releaase them hehe, till some smart ass told me lora's are over rated.
😄
Now they are for ever locked up.
You can release it to me... I dont tink loras are obselete... I love the use of them.
ahmm... overrated? no 😄
an integrale part of fine-tuning and expanding capabilities
I like the challenge of trying to get a certain outcome without them, but they are still necessary in many cases.
i just tried comfybox.. dont know about others.. ill have to check
yeah of course it's cool to see how far you can get with prompting, exploring this is really great. but LoRAs can introduce new content that just isn't there - not just guidance.
those different solutions were one of the reasons why i haven't check any of them yet, because it looked like a lot of work to find the best one, that will succeed in the long run.
Absolutely. It's just a matter of preference and needs, though.
exactly. I do sessions with just a model and sometimes I go all-in and stack LoRAs to re-configure the latent space.
I see LoRAs as ingredients
I always shake my head when someone starts asking things like "What's the best <insert x>?", or "How can I get this image to do <insert x>?" Because the answer almost literally always involves a massive amount of random chance combined with a selection of choices. It doesn't get "easy" or "simple". LORAs can help, but they're not a silver bullet unless one's need/preference is very broad or unless the LORA is very specific.
i got nada.. which frontends are you talking about other than comfybox?
Who asked that in this conversation?
Where did I say someone did?
yeah. they are tools (addons with AI super powers) that are like blueprints and can give some guidance to the neurons in the neural network. or used as creative ingredients to explore and create interesting things
doens't stable foundation have their "own" solution now: https://github.com/Stability-AI/StableSwarmUI
and i think there is at least one other i don't remember the name
"I always shake my head when someone starts asking things like "
You left off your own context of "in this conversation".
I didn't say anyone did in this conversation.
Oh boy do i make garbage ingredients
it's fine
So don't generalize... al lot of people don't know where to start and they ask...
you got your style going on. always interesting to look at 🙂
there's beauty in many different things
don't generalize
lol, what?
My comments weren't aimed at you; I wasn't talking to you until you replied to me.
some of my favorite art is garbage themed. like trash can kids
I'm refering to to everyone that comes here. Not all is as clued up as you seem to be... Just drop your snide remarks.
yep found them.. i played stable swarm just once. when it was first lauched.. you can try my comfybox space with linked comfyui .. if you want to try.. i can dm you the link..
he was actually anti generalizing. wishing people would be more specific when asking questions like "whats the best way to do <x>? " then there is no best way. They need better specific phrasing about what they're looking to accomplish
don't have much time at the moment, but sure you can dm it to me or i'll come back to you later, if i have more time to check it out.
What are you even talking about, dude? You literally replied to me. I wasn't even being snide.
the really good one to not like is "can i ask a question?" there's even a domain for that gem! https://dontasktoask.com/
it's linked to my free colab.. so.. :V
Good stuff. I really need it. So many people don't know how to ask
public schools teach kids that they have to get permission to ask a question first. then they enter the real world and annoy everybody
dont just ask .. ask gpt first
i love this sentiment, but as it stands, chatgpt is going to just confidently make shit up 3/5 times
People that don't know how to ask questions are going to have a bad time with ChatGPT
using chatgpt as an expert is a no go. Use it as an assitance for subjects you have expertise in, and can spot it's bullshit
do that second one
that is one of the single most annoying things ever (closely followed by getting offended on behalf of someone you don't know) .
Someone pings you a message saying "can I ask you a question"
FFS you've already distrurbed my peace, just aski the bloody question will you!!!
What's even better is when people just ping you "Hi"
And then say nothing for the next 30 minutes
and that
Microsoft even have a quick link for it
I get in work with Teams as well
or just add you as a friend out of the blue. no message prompting it. so you search their name on the server and they've never said anything. so check to see if its a scammer to report the scam message, but they actually just want help with lora training and saw me in the channel a month ago
I don't mind if someone messages and ask " are you free for a quck call"
That I can live with
(more on Teams than here)
Nailed it!!
how to use inpainting to remove some random thing on the frame?
like a stick or a feather?
Instruct pix2pix is probly best. "Remove the x". I'd personally just edit it out and cover it up then regenerate to finish up if pix2pix doesn't work.
Chat etiquette i don't think was ever well defined as it came up.
Take phones for instance. Nobody ever said "Hello" before. It was "ahoy" or "good day!" and other context specific greetings. So when the phone showed up, people were like "how the hell do we greet people on this box??!" and were all confused by the etiquette around it. So Bell made an advertising campaign for this new fangled "hello!" idea. Hello would be how to greet people over the phone. They even commissioned a song. Hello my baby hello my honey hello my rag time gal!
fun facts about etiquette
damn, didnt prompt anything, and it removed it
Keeping someone on the hook or leaving them on hold, while you form your question, is really inconvienient for the person you're approaching. considered bad form in a lot of ways
use bing chat or other internet connected ai chat assistant then.. less chances of hallucination
do you even personalise negative prompt depending on the prompt?
or is the negative prompt the same across all generations?
a willingness to learn and fact check is important when using any of the LLM's as a teacher. Bing chat will make shit up so often too. It's just chatgpt afterall
when you expect something like the first 2 images but from the same prompt & style settings you get a random car.
This Is Stable Diffusuion. This is The Way
Bard is pretty neutard. I think google is trying to control it to send people to alpha advertisements plastered pages. They are the worlds largest advertising compnay after all
No comments or reactions on my Garbage Pail card above... damn, son...thought I hit it with that. Got the text I wanted after coming up with a good name and everything.
Just asking an LLM for information will get made up nonsense most of the time. It's the new 3dtv
i didn't even notice it hahaha i love that
at this point people should already have a sixth sense about what is made up and whats not with this llm models..
Your comment inspired me to make it. I was messing w/something else about Neo at the time and combined the thoughts.
It looks like one of the new style GPK cards. i love that they cameback
doesn't that depend on your definition of "advertising company" ;o)
it's a perfect GBK pun. GJ
Diff each time depending on the + prompt.
Their revenue comes from selling ad space
they make the most of any ad company. The best around. Noones ever gonna bring them down
"some" of tehir revenue ;o)
But does that make them an "adverstising company" like WPP or Omnicrom (oh ok they are advertising agencies that actually create the adverts companies like Google display)
but Im rambling again
The most of their revenue. Like a huge most. And they make the most of all the ad companies around the world. It's their primary business.
Their business model is the reason why so much SEO ad soaked garbage exists on the web today. it's a direct consequence
A big par of their success comes from favoring search results that tie into their advertising tracking more. They've adjusted those algorithms to suit their business model already, so it's expected that Bard is doing the same.
Google Service acconts for approx 90% of Alphabets revenue
https://www.investopedia.com/articles/investing/020515/business-google.asp
services == ads. on that same page " One of the primary ways Alphabet generates revenue through advertising is through its Google Ads program."
Trust me when i say google does not dominate the enterprise space
Their services are often a joke in it departments
If you've got the budget for adobe, why would you build a tool chain around corel, kind of jokes
gmail/cal/docs/sheets is a legitimate competitor to the MS suite. I kinda hate gsuite, but the collaboration features are really best in class and that's worth a lot
but yeah, that's not what's making them money. at all.
It seems like iceycold and Winston Woof are having a discussion about Google and whether it should primarily be considered an advertising company.
The key points appear to be:
-
Google makes the majority of its revenue from advertising through Google Ads and selling ad space on its platforms.
-
iceycold argues that this makes Google fundamentally an advertising company, since ads are its primary business and main source of revenue.
-
Winston points out that Google also provides various services like search, cloud computing, etc. So while ads may be the largest revenue source, perhaps it shouldn't be simply labeled an "advertising company".
-
iceycold counters that Google's services exist to support its ad model. Things like search algorithms favoring ads, cloud services not being as competitive, etc.
-
The debate centers around whether ads being the main revenue source defines Google as an "advertising company", or if its various non-ad services still make it more diverse.
So in summary, they are debating the extent to which Google should be viewed as an advertising company vs a more general tech/services company, based on the outsized role advertising plays in its business model and revenue.
I like how that summary is basically longer than the original arguments 🙃
They're the worlds largest advertising company, despite all their diversification
They're going to leverage that position with Bard. It won't be an unbiased algorithm
gives me hope for the future. Chatrooms will come stock with a sidebar AI summarizer, bullet pointing the conversations and organizing notes about them
Here are some thoughts on this discussion:
-
They both make reasonable points - Google clearly makes most of its revenue from ads, but also provides many non-ad services. There's no definitive "right" categorization.
-
I think it comes down to emphasis - you could accurately describe Google both ways depending on what element you want to highlight.
-
Categorizing any large company with diverse operations under a single label is inherently reductive. "Advertising company" captures Google's core business, but ignores its other efforts.
-
Google has incentivized prioritizing ads in its products, so it's fair to be critical of over-reliance on advertising. But it's also an innovative tech company in many regards.
-
"Advertising company" has somewhat negative connotations today, so resistance to that categorization is understandable. But ads do fund many free services.
-
Google wants to be seen as more than just ads, but its ad model is entrenched. Its attempts to diversify revenue so far haven't fundamentally changed this.
-
Overall I'd lean towards calling it an advertising/tech/media company - recognizing advertising as the core revenue engine, but also its broader ecosystem. The emphasis can shift depending on the context.
In summary, I don't think there's a black-and-white answer here. Google has elements of both perspectives represented in the discussion. Its large size and complexity resist a simple categorization.
dont most media service use ads as a main source of revenue? Nbc, abc, radio, etc....
You raise a good point. Reliance on advertising revenue is very common in the media industry:
-
Traditional media like television networks (NBC, ABC etc), radio stations, newspapers and magazines have historically relied heavily on selling ad spots or space to fund their content and operations.
-
Many digital media and tech companies also rely on ads to monetize their free services - examples beyond Google include Facebook, Twitter, free news sites, mobile apps, etc.
-
So in that sense, having an ad-supported model is not unique to Google in the media landscape. Advertising could be seen as the dominant revenue engine across much of the industry.
You're right that by that standard, many media/tech companies could be characterized as "advertising companies" to some extent.
A few additional thoughts on Google vs other ad-driven media:
-
Google is arguably more ad-dominant than many older media companies, with ads making up 80-90% of revenue. Traditional media tended to have more balanced revenue sources.
-
Google's ad products are seen as particularly advanced and dominant in the digital landscape, giving it significant influence over online advertising.
-
Diversification efforts by companies like NBC into new media/tech have changed their revenue profiles somewhat as well.
-
But you make a fair point - many major media firms are driven by advertising in some capacity. Singling out Google may ignore that broader context.
So in summary, while Google is particularly ad-dependent, relying on advertising revenue is indeed a very common media industry practice. You raise a good perspective to consider in this debate.
Well part of the drive to streaming is that those companies would like to move to subscription services, which are generally preferred to wall street to advertising
But there's also a difference in that those media companies create content, and then attempt to monetize it
Google doesn't create content, they run display ads against other people's content. So the dynamic does shift
Excellent points! You're absolutely right:
-
Media companies are indeed trying to shift towards subscription/direct revenue models rather than pure ad reliance. This transition is still ongoing, but streaming opens up that possibility.
-
There is a fundamental difference in that traditional media companies invest in creating original content, while Google simply provides a platform for others' content.
-
This changes the dynamic, as Google is not bearing the cost of production. Its role is ad enablement, targeting and display - monetizing others' work rather than creating its own.
That's a key distinction I failed to make earlier. The traditional media companies rely on ads to recoup self-funded content investments, while Google facilitates ads without content creation costs.
Given that core difference in content costs/risks, it makes sense that "advertising company" feels like a more apt categorization for Google's role than for a traditional media firm. You highlighted a very important nuance in the business models - thanks for clarifying! Really great points that improve my understanding here.
- type in + prompt.
- add style as needed.
- add negs.
- press ctrl + enter.
@mossy canopy are you by any chance feeding our meaningless inconsequential ramblings into som form of LLM based AI to generate bullshit summaries?
Ads fund a lot yeah. The problem with google is they revolutionized advertising with massive amounts of tracking data. Big data science. advanced algorithms that give the user the ad that will have the most impact. They've turned psychological manipulation into a very technical science.
Given how they've developed their search algorithms and tracking data over the years, why wouldn't they build Bard the same way? It's going to be geared for manipulating people into looking at ads more than it will be a useful tool.
I don't trust a lot of these LLM's but Bard especially i'm sure will have some engineered ulterior motives
You raise very thoughtful concerns about Google's approach to advertising and how that could influence Bard. Some key points I take away:
-
Google has pioneered new levels of personalized/targeted ads using extensive data collection and predictive algorithms. This enables more effective psychological manipulation.
-
There are reasonable concerns that Google could take a similar approach with Bard - optimizing it to serve ads and steer users towards commercial interests rather than neutral information.
-
Given Google's history and core business model, it seems likely Bard will be designed with advertising ROI as a priority rather than just user benefit.
-
The depth of personal data Google has amassed raises suspicions about how that could shape Bard's behavior - perhaps promoting certain viewpoints, interests, etc.
-
More broadly, many LLMs like Bard lack transparency about their training objectives and optimization, making it hard to trust their neutrality.
I think you make very fair points. Google has demonstrated their adeptness at using technology and data for persuasive ends. It's prudent to be cautious about how that could manifest in Bard as well, rather than assume pure altruistic motives. Maintaining a critical perspective on these systems and their creators is important. I appreciate you sharing these thoughtful concerns.
damn right i make fair points llm summarizer
Excellent.
@soft zealot @mossy canopy we are training an llm to generate sd prompts, so can we please get access to your prompts so that we can add them to our training dataset
Sure. My prompts are usually in the images I post here. I don’t hide them. Knock yourself out transcribing them
ok tq, will download your images and use your prompts
Are you for real??
yes
So you don’t realise when someone is extracting the urine?
huh?
thats just clip interrogate https://huggingface.co/spaces/fffiloni/CLIP-Interrogator-2
Do you just take everything literally
Read what I wrote carefully
I was politely telling you to eff off by pointing out the bledding obvious
oh
mb
i am not too proficient in english so I didnt understand
so you drop images and get the prompt?
damn
it does beg an interesting rhetorical question (ie you don't actually have to answer it)
If you as a human can't differentiate between a "serious" and a non serious" response how do you expoect to succesfully train an LLM ?
it depends on the other person whom I am talking to, if I am talking to to you I always read seriously
Clip interrogate usually provide "wrong" result compared to what I want
In which case I recommend you reset your base expectations
i'll try
thats why people use blip to caption for training clip .. but blip doesn't recognize artists styles..
I also used blip but after all, I stick to use wd14 for lora training.
i use both..
which lora this time?
😄
blip has its own advantage for the tag out of wd14. Both of them have limitation. I really want to have a tool reduce manual work for the training. A custom llm should be the final answer.
if bard or bing didn't censor their image captions . i would've used them..
I want to caption in a specific way that bard and bing might not my answer
ok, done with mechanical dragons, on to next! they be cool tho
i mean, i recognize the character -> that's a win in my book
i think i can even hear him whistling
It is a poor lonesome cowboy 🙂
https://huggingface.co/liuhaotian try his models..
img2txt?
hmmm... 100% free reigns might be bit too much lol
for chating with images.. getting your captions in a centain format then just prepend wd 1.4 tags for accuracy..
That clip is too good
Do you know how to install that clip model locally?
found the pot of gold
clown dog?
I just sorted through a fantasy prompt session and found this - I think he's not amused 😄
shouldn't have put it in plain sight 😄
Clown Dogs.
any dogs = best dogs
that link i gave u .. its for llava models.. you can try them in oogabooga .. to chat with images... for batch captioning using llava i recommend using oogabooga api to write something get your captions in a certain way .. or you can copy their code here.. https://huggingface.co/spaces/LinkSoul/Chinese-LLaVa swap the model out..
(don't inspect my image prompts from this point on, this is my full random+song lyrics random verser at max. people might discover i'm a madman)
Nice raccoons bro
Raccoon power!!!
Raccoon thread?
my kids has one!
all good man
wicked
i need more gpu's XD
sure
... fuu... company a100's are not in use
do i really have to update my a1111 install and get my fav model installed there? XD
F A11.
Isn't clip ViT-H-14 better than Llava?
I tried 3 images from MJ and I paste the prompt into SD and the images were 80% similar that was great
keep it running .. no one will know.. 🤣
yes. but its not a caption model .. its a image chat model .. but if you want your captions in a certain way.. u can prompt here. it can get pretty creative..
Damn,
Lucky boi
Make the GPUs go brrrr......
make the boss pay for my waifu's instead of me
Yeah I get it, but I will use clip vit,do you know to install and use locally
Exactly
Your boss got the shit out of you and now it's ur turn
yes.. just git clone the repo.. run it locally.. ;V
or just use comfy clip interrogate nodes or a1111s clip interrogate
Where should I run them?
No, this clip is way better than them, and I am planning to generate 1mil prompts
me and my slightly drunk ass should not be on $100k+ hardware lol
what's the worst thing that could happen
@mossy canopy
dont you feed them.. they look so thin..
self sustain
loving some of the randomness I'm getting from this prompt
@mossy canopy try bronze rubbed metal, or any other kind of metal. chrome ❤️ or ...
bismuth!
bismuth dragon!
pcb lora 😄
😉
@mossy canopy
sd 1.5 is soooooo oldskool 😄
aight. got it working again. now. i just need to feed it. things. nope, something broke with the update
creepy
for some reason, restart sampler is not in the new update on the docker container
fml
device?
add git pull in docker commands..
perhaps i'm just using a stupid repo
good day
nah, it's a repo someone made that acts as a wrapper for a1111 and some other uis
yeah, can't be assed to do that now 😦
Hmmm hmmmm.
It looks good but also is extremely sparse with details.
i know this prompt 😄
this is op.. great for small details..
didnt upscale. ..
No I mean visual clutter. Mine is cramped with details.
Yep Eface is a good rl mate of mine who once ever tought me how to use SD.
@mossy canopysee? bismuth dragon is where it's at
Thanks for that for ever lol. 🙂
i just cost you $2000 of hardware or so...
god these are so awful i love them
Hey I only have hub bucks. 😄
reminds me of that old series 'dinosaurs'
yes.. the inconsistency
"hr giger has entered the chat"
🤣
bismuth dragons be shiny
Whoppa nice mates. 🙂
not working
which model?
i'm using protovision
nightvisionxl
might be a model issue, if nightvision doesn't know what bismuth it (what it looks like to me), then it won't respond to it
thats the closest..
pretty damn good though
i really dig this one tho
used different prompt.. a close up of a metal dragon on a reflective surface, digital art, trending on polycount, digital art, colorful muscular robot, she has iridescent membranes, steampunk horse, blue scaled dragon, 8k hd wallpaperjpeg artifact, crystal dragon, intricate detailed 4 k, chrome armor, holographic design
that's what clip/blip/vit made of it?
:v
just clip
still, very nice ^^
i just have ~30k more old prompts to dig through to see if anything cool shows up 😄
Hello! I would like suggestions for producing this "style" of art (e.g. checkpoint, lora, prompts, etc.)...
oh shit. these old prompts are gold!
sd 1.5 could not handle them. AT ALL
sdxl brings them to life. for real lol (and still fails to capture their true essence)
exactly..
did you clip it again?
yes..
How on earth did you do that...? Checkpoint? lora? prompts? Nothing I've tried creates that look.
i think clip interrogate, and then feed into sdxl model
this prompt 😄
i mean, the first one, goat horns, viper scales, tiger fangs/face, and pointed ears on a human body?!
as @floral island said.. prompt:
a painting of a crane in a forest, a detailed painting, by William Morris, tumblr, arts and crafts movement, deer, painting of samarkand, golden inlays, 1 9 2 2, philip taaffe, black velvet painting, artemis, heraldry, in a forest at night, not cropped, vintage - w 1 0 2 4, symbolic mural
did clip really put forth -w 1024 as a token set?!
lol
i haven't clip interrogated in like... uhh... yeah
What is the significance of "1 9 2 2" in that prompt?
somebody probably trained their interrogator with scraped midjourney / bot servers
years are often referenced for aesthetic styles
and I guess there's a benefit to seperating the numbers like that?
oh wow that looks sick
i'd have no idea there. that might just be a meta that developed, or somebody's scraping engine doing it. maybe just a poorly formed regex that creates the training data
dont know why somebody would space out numbers
so. what's this creature called except sd weirdass?
wtf
i mean, it's fckn awesome for sure
but i have no idea wtf kind of creature my sd created
goat facial structure, snake skin, kobold ears, goat horns, tiger nose and fangs, dragon eyebrows, human build. oh yeah, blue skin too... sdxl sniffed too much greek mythology i think....
a type of chimera
vader got nothing on these crusaders
not sure yet how sdxl based models act around that 75 token limit
can't get a grip on it yet
ENTER THE FRAYWarhaven’s dynamic melee combat rewards new recruits and veteran players alike. Choose from a roster of unique soldiers, each with their own motivations for fighting beneath the banner of Warhaven.Coordinate with your squad before each battle and outwit your foes with tactical maneuvers. Fight to control footholds and the powerful ...
Sep 20, 2023
ooh, got my first couple conditioning nodes working
Obviously I'm a beginner. Now that you alerted me to "Interrogate CLIP", I gave it a go on the same image and got this:
a painting of a deer in a forest with trees and foliages on a wall with a mirror reflecting it, leesha hannigan, Edward Burne-Jones, an art deco painting, arts and crafts movement
Since this is different from your result...I presume it is controlled by the currently selected "checkpoint"?
clip is "an interpretation"
only use danbooru interrogators
so dpeending on exact input image, sometimes it might differ
nope .. two different clip models.. i used ViT-H-14 OpenCLIP
Er...my automic1111 has a "Interrogate DeepBooru"...it does not have a "danbooru". Just a typo, or is there really two versions of that?
probably. are you using comfy? if so there are lots of interrogators. tbh danbooru is some weird stuff. but I use it when I want to get very fast results. if you ever look it up it's a rabbit hole
Comfy ❤️
Worse, just now when I tried "DeepBooru" on that image...it gave me something COMPLETELY non-sensical:
long hair, multiple boys, multiple girls, short hair, weapon
booru is plain worthless unless you want pure anime weeb shit
i tried it for 2 prompts -> it's a prompting style that's VERY restrictive
I've used it in workflows when I wanted to create a quick prompt to then use it to modify the original image or something. but yeah, I was being moderately sarcastic
I'm using automatic1111...not "comfy". Er, am I right in thinking "comfy" is a reference to a different UI?
after war
😉
thiccc neck
Move ya body.
Sexy pix.