#vibe-coders

1 messages · Page 10 of 1

vagrant folio
#

yesterday flash was very slow for me for about 4h

#

the best aproach Ithink is combine all three of them

stark sapphire
#

trying something new. Not sure if i will actually proceed and make it all work.

stark sapphire
#

haha i guess i will make it happen. going all out.

gentle aspen
#

heyyyy welcome @rain lava . Yipeee, you fixed your problem didn't you??🎉🎉

rain lava
stark sapphire
rain lava
stark sapphire
rain lava
#

Okay

gentle aspen
#

guess who is building a summarizer AI model. it is going to be about ~90m params, it will be opensource and it will be at 8k CTX.

before you complain that you have seing many "summarizer" AI models, this is tiny, and also has a small reasonign chain to filter out only needed part of a text.

cobalt echo
#

💡 Save Gemini API Prototyping! ⭐️
Hey everyone! Google recently (March 2026) restricted the $300 Welcome Credits to Vertex AI only, effectively putting AI Studio behind a paywall for new devs.
We all know AI Studio is way faster for prototyping Gemini 3.1 Pro and handling 4K images (no GCS overhead like in Vertex). Forcing us into Vertex for initial tests just kills the "vibe coding" speed.
I’ve opened a feature request on the Google Issue Tracker to bring the credits back to AI Studio. We need Stars to get Google's attention!
Vote here (Click the Star ⭐️): https://issuetracker.google.com/issues/504375652

rain lava
gentle aspen
rain lava
rain lava
gentle aspen
#

it wont be a crazt architecture, i just want ot see it work, then I am adding formating support. for example, people can get responses in: plain text, MD, HTML, etc

gentle aspen
rain lava
stark sapphire
rain lava
# gentle aspen bpe

Using a BPE at that small of params is going to tax you.
a 50,000-word vocabulary with a hidden dimension of 512, the embedding matrix alone takes up roughly 25.6 million parameters.

gentle aspen
gentle aspen
rain lava
cobalt echo
# rain lava What makes you think everyone else agrees though? I know alot of people that lik...

But Vertex AI is absolutely the right product when scaling production apps and implementing MLOps pipelines. The problem is that this credit can only be used for creating proofs of concepts.

The latency for uploading 4K images through IAM and GCS buckets to train models using Vertex is very large, around 30-60 seconds per image. On the other hand, this process is much faster in AI Studio (instantly). Also, on-demand Vertex endpoints often reach 429 Resource Exhausted errors due to sharing one enterprise capacity across the whole company, which makes "vibe coding" in AI Studio significantly more convenient.
We are not against Vertex AI – but we just want to have the freedom to use our intro credits where it is more convenient to us. If you appreciate flexibility, please give a Star to our issue!
See support thread: https://issuetracker.google.com/issues/504375652

gentle aspen
rain lava
#

Byte has cons too

gentle aspen
rain lava
#

Deepspeed is more for multicard setups

#

It creates vram buckets and it will slow u down

gentle aspen
rain lava
gentle aspen
gentle aspen
rain lava
gentle aspen
fiery lagoon
rain lava
fiery lagoon
#

hmmm

#

such a good friend

gentle aspen
#

I was gonna do 25m params for the test run

gentle aspen
fiery lagoon
#

ur too cool for me now

#

gold on google

gentle aspen
#

btw @fiery lagoon you might wanan delte that gif

fiery lagoon
#

y

gentle aspen
#

a mod might ban or smthn, idk

fiery lagoon
rain lava
fiery lagoon
gentle aspen
#

anyways, back to 25m params at 8k. why does CTX always scale O(n^2)? so annoying...

fiery lagoon
rain lava
gentle aspen
#

u use gemini?

fiery lagoon
#

gemini is peekkkk

#

deepseek is peeeek

rain lava
#

This is news to u...?

fiery lagoon
#

chatgpt is asss

gentle aspen
#

no offense to gemma, but full on offense for gemini. isn;t good for what it says it is.

rain lava
gentle aspen
#

but now..

gentle aspen
fiery lagoon
#

2 3 4 all ass deepseek is on top

gentle aspen
fiery lagoon
#

deepseek is just the best

#

tho

gentle aspen
rain lava
rain lava
gentle aspen
gentle aspen
#

anyways i gtg, i have to download osme datasets

rain lava
gentle aspen
#

sam price, google is mor eexpensive

#

my days, I need to get better fingers or a better brain for typing

rain lava
gentle aspen
rain lava
#

"$17
Per month with annual subscription discount ($200 billed up front). $20 if billed monthly."

#

Pro doesnt even get high traffic priority.

#

Pro shouod be plus, its kinda eh tbh

gentle aspen
#

only thing I hate baout lcaude pro from teh reviews are the rate limits

rain lava
rain lava
gentle aspen
# rain lava Nah theyre all bad

they are indeed bad, but they are the reason why good models exists in the first place. i just love the innovation they brought with those models

gentle aspen
# rain lava How is that the reason.

personal bias to OpenAi yk.
I mean afterall they are the reason why transformers are popular besides Bert (which is nott even a pure decoder transformer)

gentle aspen
rain lava
gentle aspen
#

dw, it's just my nature lol :)

delicate blaze
# cobalt echo But Vertex AI is absolutely the right product when scaling production apps and i...

id like to talk about vertex.. have a specific project i would like to launch a team of agents into.... i am also very comfortable using Claude Code & Antigravity for almost all of my recent projects. but the costs associated with vertex are not yet totally clear to me and thus i hesitate to get into it without clarity. would you be willing to jump in a DM or invite me to your own server here on discord so we can jam?

cobalt echo
gentle aspen
#

aight, Ijust made an architectureusing GPT5.4:

Tokenizer:

  • Custom byte-level BPE
  • Vocab size: 16,384
  • No UNK dependence
  • Fast Rust/tiktoken-style runtime implementation

Model:

  • Decoder-only transformer
  • 15 layers
  • d_model = 384
  • 6 attention heads
  • 2 KV heads (GQA)
  • head_dim = 64
  • SwiGLU FFN, hidden = 1024
  • RMSNorm, pre-norm
  • RoPE positional encoding
  • Tied embeddings
  • Full causal attention
  • Target context = 8192
  • Total params ≈ 29.9M

I am sus abou tthe 15 layers. that looks too much for a 29.9m model, i might wanna recalculate it, but does it look solid?

rain lava
gentle aspen
#

I made sure that there is about 25x the params

#

for SFT i am aiming for about 100m tokens and for PT, I already collected 500m tokens. i am doing some extended PT on reasonign tokens, so i can support -100 masing in the seciond SFT run without ruining the gradients too much

#

I just want to play it safe for now

#

for a little experiemnt. I mean, this is very simple, so it should work

vagrant folio
#

Hi Ag power user in pro, just to confirm Ag also reduce a lot tha flash model usage?

#

Before I was able use it for 3h withut stop now in just 1 and half i run out of model usage

ionic mica
#

Para que es este server @dapper fractal @uneven echo

gentle aspen
#

can someone help me with this😭

rain lava
gentle aspen
#

i have to log into github to do that

rain lava
#

Then do accouht recovery

#

Also u can contact support, it tells u to even if u cant.

#

And check ur email fpr anything tos related

gentle aspen
#

still making some reasonign datasets and curating the PT dataset. but I will start training today

#

but i am more onverned about my github

#

i am so messed up dude. my github = my life

#

I NEED IT

#

pls helpme

gentle aspen
#

lets see how things go

#

might overfit it a bit.. I didnt have enough tokens too, plus it is at 2 epoches

gentle aspen
rain lava
gentle aspen
#

it takes me to this page where I have to put in my number. it says "incorect mobile phone number" EVERYTIME. :(

gentle aspen
#

no, but i filled that correctly

rain lava
#

Did you try removing/adding a 0 at the start of the number?

gentle aspen
#

yeah I tried all. look what it says

rain lava
gentle aspen
#

from what exactly???

rain lava
#

Also try raw numbers, 0 dashes etc.

Ur rate limited from too many ohine num reqs.

gentle aspen
#

I should try tmrw ig

#

brooo, i cant live without github

#

:(

#

😭😭😭

rain lava
gentle aspen
#

github is my social media platform 😭. it all my fun in life brooooooo

gentle aspen
rain lava
#

Yes?
Have u literally never got a rate limit on discord? It takes like 10mins

gentle aspen
#

never even heard bout it. DISCORD HAS RATE LIMITS??!!

gentle aspen
#

LETS GOOOO!
I got it working, somehow....

it is kinda dumb, i trained it with like 4m ish data for a 270m model, it is absolutely not enough, so i will train more, but to confirm, I knwo this works. NICE!

anyone got tips on improving and making the CoT stable?

gentle aspen
#

I am going to have a fun ~time~

rain lava
stark sapphire
#

<@&1009526435276394496> spams in more channels.

gentle aspen
#

YAYY!!!
I distilled claude opus4.6 into gemma3:270m:

#

I should try it witha harder prompt. btw this is still a test run.

I just wanted to nump up the steps and training data, it was trained on 150m tokens (not a lot)

but the reasonign chain is wayyy stable than qwen tbh.

i will focus on the reasoniign structure, bcs even tho it is mostly opus, i mixzed in soem deepseek data too, bcs i am broke, and opus is expensive

winter viper
#

sup folks, news on opus 4.7?

gentle aspen
#

AI in their first few steps are so dumb lmao.

#

btw guys, i have a question.
is gemma4 using mamba?

rain lava
rain lava
undone pendant
# rain lava

Still no dev update on Opus 4.7? Are they working on it?

undone pendant
#

restart for update 😮

#

1.23.2
Apr 16, 2026

Bug Fixes
Fixed bug that prevented MCP servers from loading and bug that prevented accessing workspace-specific settings.

Improvements (0)
Fixes (2)
Fixed bug that prevented MCP servers from loading
Fixed bug that prevented accessing workspace-specific settings
Patches (0)

--

Huge

rain lava
rain lava
gentle aspen
rain lava
gentle aspen
#

making some progress on my 30m model (stilld oing PT)

should I opensource the weights?
it is:

SP BPE 16k
30m @ 8k
casual LM
decoder only transformer, nothing crazy.

gentle aspen
gentle aspen
# rain lava THIS is scratch btw

man, i hate scratch. I mean, no ffense, it does help you improve engineering reasonign, but it makes you think developing is all sunshines and rainbows, and makes yu not appreceate actual programmars.

i mean, I shouldn't be speaking, bcs i use AI, but respect to all the people who actually put some work.

rain lava
gentle aspen
#

It's not a 5 marks question bro😭

rain lava
rain lava
#

What in training its also asking questions..?

#

Also whats the loss rn

#

And did u give it enough data to not overfit yet

gentle aspen
gentle aspen
gentle aspen
gentle aspen
#

I did it in a scale of ~25 x params

#

istg, if someone asks about sum "opus4.7 on AG" I am fr going to crash out bro

rain lava
rain lava
gentle aspen
gentle aspen
#

to give you the best coding model for $0?

rain lava
#

Complaining without knowing how good they have it

#

Chat can this card train llms 😭

gentle aspen
rain lava
gentle aspen
rain lava
gentle aspen
#

I feel bad for you...

#

out of all the cards you can get, you got that?

rain lava
#

Thats not whats in my pc theyte just spare parts 😭

gentle aspen
rain lava
gentle aspen
rain lava
gentle aspen
#

it's 6GB right?

rain lava
#

8gb

gentle aspen
rain lava
#

100m on an rx580?

gentle aspen
gentle aspen
#

omds my typos

rain lava
#

So it can play games atleast

rain lava
gentle aspen
rain lava
#

Or non ti idk

gentle aspen
rain lava
gentle aspen
# rain lava I meant in games

not ba, but dont expect to crank up the settings in FH5.

I mean, you already have a 9070, right?
I remember you said something liek that.
that is WAYYYYYYYYYYY beter

gentle aspen
lime sentinel
#

Not even close 😄

#

A 9070 XT is in a completely different league, way newer architecture, massively higher compute, bandwidth, and proper support for modern ML stacks.

rain lava
rain lava
lime sentinel
#

A 1070 Ti is okay for light experimentation, but the RX 580 is basically legacy for ML at this point.

lime sentinel
#

For gaming, yeah RX 580 ≈ 1060/1070 range.
For ML, NVIDIA still wins hard because of CUDA + ecosystem.

rain lava
#

Ther3s a chance the 1070ti is broken tho!

#

Its 5 + years old used

#

After my father upgraded his computer i gutted the old 1 for that gpu

lime sentinel
#

Vintage hardware build: comes with nostalgia, uncertainty, and random crashes included.

#

🤣

#

Either it boots… or it becomes modern art real quick

rain lava
lime sentinel
#

Orange/blue artifacts on boot? Yeah, that’s less Windows issue and more GPU VRAM waving goodbye😂

gentle aspen
signal raft
#

ChatGPT 4o ahh, but it is goated

lime sentinel
#

If this is ChatGPT, I need a refund latency’s too high.

rain lava
lime sentinel
signal raft
lime sentinel
gentle aspen
rain lava
signal raft
#

I mean it is obvious, they want you to use the cheaper Gemini Models

#

Instead of Opus or Sonnet

gentle aspen
#

for me, codex performs really wellcompared to AG.

AG is too confusing and delivers less for what it is.

lime sentinel
gentle aspen
#

Sam Altman lets me use anything

#

lol

rain lava
signal raft
rain lava
gentle aspen
lime sentinel
#

AG feels more flexible, but that often comes at the cost of clarity and signal-to-noise. Codex just gets to the point and ships usable output faster.

rain lava
lime sentinel
#

Feels like a UI decision that got ahead of the model capabilities.

rain lava
# gentle aspen also like this👆

Ppl were arguing abt when i said google wasnt an ai first company, which i was wrong but thats the irony 😭 a company making sm ai then making shi ai

gentle aspen
rain lava
#

Limited its thinking time i thijk tbh, and unlimited it on benches

lime sentinel
gentle aspen
lime sentinel
#

That’s a bit harsh tbh. It’s not that they can’t train good models they clearly can. The harder part is aligning one model across multiple use cases search, docs, code, without degrading performance in any one domain.

#

Indie models often feel better because they’re narrowly optimized coding-only, while big companies are trying to balance generality, safety, latency, and scale at the same time.

gentle aspen
lime sentinel
#

Agree on one thing though forcing a single model to do everything usually leads to mediocre results in specialized tasks like coding

lime sentinel
rain lava
gentle aspen
lime sentinel
gentle aspen
gentle aspen
rain lava
gentle aspen
#

and the plus model is free at their websiote. dont know how they did that. tbh china has better infrastructure when it comes ot making things efficient

rain lava
gentle aspen
lime sentinel
#

I’ll benchmark it myself really good tends to depend on the workload

rain lava
gentle aspen
rain lava
gentle aspen
gentle aspen
rain lava
gentle aspen
#

after about 20b, it is all about data, and not about parameters.

an older model (2024 ish) can do really well with good coding and math data

gentle aspen
#

a model does not have to be 1t to perform good yk

rain lava
# gentle aspen here

8m just proving that its free due to a lower param count there? Where did i ever say 358b is bad?

rain lava
#

I wanted to see that 😭

gentle aspen
#

here is a better view at that beauty

gentle aspen
rain lava
gentle aspen
# rain lava Define structure

instead of:
let me think this through. I shoudl summarize this. some points would be to talk about how neural networks are ood at understanding semantic relations ships, and maybe even how ML works would work!

compared to:

let me think this through.
We got:

Neurl networks

I can talk about:

  • how they understands semantic relationships.
  • How machine learngn works
  • modern ways of re-inforcement learning.
gentle aspen
#

what in epstein is this bro???😭

gentle aspen
#

I have ebing looking at this the whole day:

#

bro really said "ebing"🥀

rain lava
#

If my google nest-mini says "Sorry, I don't understand" when I ask a question, am I smarter than it?

gentle aspen
#

yayy, i get 2 more TPS when i put my RAM to 4800 --> 6000

it was runing parts of the tensors in ram. ykw? I shoudl run qwen3.6 and see

rain lava
rain lava
rain lava
gentle aspen
rain lava
rain lava
# gentle aspen dang...

My parents use a 6400 cl32 2x32gb kit for webbrowsing whilst im stuck on 5200 cl40 ... I could get a 35-40% increase if i get their ram

#

5200 CL40 (Stock, XMP): ~72 GB/s
​6400 CL32 (Stock XMP): ~88–90 GB/s (+25% increase)
​6400 CL32 + HBM Enabled: ~98–102 GB/s (+38% to +41% total increase)

gentle aspen
rain lava
#

"Office pcs" is a stretch, theyre more workload capable.
Rx 6600XT Eagles.
I7 14700kf
32gbx2 superram

rain lava
#

16gb

#

Okay maybe not 16, my parents love 7million tabs

#

32

#

Or 24

gentle aspen
rain lava
gentle aspen
rain lava
#

Helium is less expensive than firefox and uses blink still (same as chrome)

rain lava
rain lava
rain lava
stark sapphire
#

Who here uses Cursor over Antigravity?

signal raft
#

I think most people agree that Cursor is the better overall Editor, the only good thing you get with Antigravity is refilling usage limits

stark sapphire
#

yea this, but I think Cursor also gives you more freedom, you can choose out of more models, and it even allows you to connect your own local AI, which Antigravity can not.

verbal axle
#

Yoo

undone pendant
gentle aspen
#

hey man, i love how you are building something nice!
but this stuff goes under the "marketing" category. yo might get banned. so i recommend you delete this message and share it in general.

sand vault
#

i was thinking to use cursor until i just saw they just struck a deal with SpaceX xD

gentle aspen
#

Whats wrogn with SpaceX?

stark sapphire
gentle aspen
#

yayyyy, 27b Qwen3.6

#

bruh... 600b, and no vision, or 256k ctx

rain lava
# gentle aspen yeah ofc they do. tell me a single human who doesnt. i feel like 24gb is the swe...

I think I'll sell my ram and buy this really good kit.
It's a CUDIMM, 8400 MT/s, CL40 24Gb×2 kit, and it's only like 700~800 bucks.

After I sell my current RAM or prices go down it'll be alot cheaper ofcourse.
I can run at 8400MTs with my CPU if I'm lucky, but if I'm not lucky I'll just run 8000~8200 MTs with some tighter memory tightenings.
It's effective memory bandwidth with HBM (High Bandwidth Mode) should be around ~130Gbps compared to my ~60Gbps.
And because it's CUDIMM it should work on my motherboard at that speed (If not, I'll downclock until it works with tighter timings)

State | Speed | Theoretical Max | Effective Bandwidth

Current (Corsair 5200 CL40 UDIMM): 5200 MT/s 83.2 GB/s ~62 GB/s
New (8400 CL40 CUDIMM): 8400 MT/s 134.4 GB/s ~118 GB/s
8400 + HBM: 8400 MT/s 134.4 GB/s ~131 GB/s

gentle aspen
rain lava
gentle aspen
#

you were able to get like 128 gigs for that price

rain lava
#

No?

#

Curriencies bro

gentle aspen
#

yes

#

you did

rain lava
#

AUD vs whatever u use

gentle aspen
rain lava
#

No, I'm not american.

#

Then this one, it's less at other stores but I prefer JB

#

Metric | Current (5200 CL40) | New (8400 + HBM) | Percentage Gain

Effective Bandwidth ~62 GB/s ~131 GB/s +111%
Inference (CPU/Offload) ~4-6 tokens/s ~8-11 tokens/s +85% to +105%
Training (Offload Step) Baseline -- +35% to +45% Speedup
Inference (VRAM only) Baseline -- +3% to +5% (Negligible)

hexed marsh
#

hello! i have small problem with Antigravity. the latest update made the agent auto apply all changes. before i could review the changes it made nicely line by line, accepting and rejecting them. now the changes get autoapplied and i can only see the comparison that i can comment on, without ability to revert specific lines.

is there anyway to turn this back?

vagrant folio
stark sapphire
hexed marsh
#

i never touched settings, its already on ask for review

stark sapphire
#

Hmm does it never asks to you if it shall proceed?
Sometimes it indeed does proceed, but it only does it so you can work faster for certain small scale things.

#

sometimes my agent says "Let me proceed with this" without my input. Which is fine with me personally.

hexed marsh
#

no, i dont have issues with agent, with the ui

#

before it showed the diff directly in the file it worked on

#

if agent would change this line from True to False for example, then in the editor it would show like this, only with the first one hilighetd in red and second in green, and i could accept it from here

#

oh nevermind. it works today xD

stark sapphire
#

lol

hexed marsh
#

i guess i hit some limbo yesterday

#

i'd swear it didn't do that yesterday

#

or i was drunk and opened different ide

gentle aspen
daring ravine
#

Hey i am on pro , antigravity, gemini, but it fails. Does it work for you?

gentle aspen
#

btw, does anyone know some good embedding model for general text?

I like nomic embedding bcs of it's opensource license, but I need soemthing small and good like qwen3:0.6b embedding model

serene crow
#

I did a Proof-of-Concept using Google AI Studio.

TidyUp AI analyzes and transforms disorganized spaces into clean, organized, and structured environments.🧹🌟
#buildwithai

gentle aspen
#

I should make grammarly for vibecoders

#

it will make your prompts better.
is that a good idea?
also i will allow it to run Ai models locally or let the user choose models from providers like Openrouter, Ollama and someother stuff

sand loomBOT
#

We’re kicking off a brand-new AMA series with the minds behind Gemma 4! 🌍

<@&1043109761299783771> <@&1493624938207379649> Join on #1041705871723466792 our Google DeepMind’s @spring halo & Ian Ballantyne on <t:1777384800:F> for an inside look at how the next generation of AI is moving beyond chat and into real-world action.

Upcoming Sessions:

Don’t miss the first-ever Gemma 4 AMA. See you there!e

hollow river
#

cool

snow heath
#

damn, explanation of a benchmaxxed model

pearl thunder
#

Let's see what we will leave this event ♥️

#

@sand loom

sand loomBOT
wise rock
#

Invalid Invite

sudden beacon
robust geode
#

Hey guys , what's this channel is about, like what I can share in it

wise rock
hallow carbon
#

Hey yo

#

Actually I got a mail from Google Student Ambassador

#

I don't know where to ask regarding this

#

Can someone help me with this Elders

sudden beacon
vagrant grove
hallow carbon
#

Mine was applied by Previous GSA

#

I was busy in work 😢, Working over things

#

I needed help - So he helped me in filling things

#

I did the video

#

And finally uploaded

#

But I made an issue

#

😭

#

Well this is what I got yesterday,

#

😭

#

He said they might disqualify me - I am not sure, Please Moderators help me

#

I faced this error

#

I shared both images of mail - I received on first day and the one i received on the 2nd day

stoic citrus
hallow carbon
gentle aspen
#

Is there a better stack than electron and JS or python and Qt?

Python and Qt is just a fever dream. man, it HAS to be the buggiest thing y'all (inc. me) has ever seing in our ENTIRE life...

magic dagger
#

Develop a webapp

#

not everything needs to feel native

silent night
#

hey

#

does google just reset all quota for all antigravity users ?

balmy depot
silent night
#

my quota just got refreshed all

balmy depot
#

Must be nice.

silent night
#

yesterday gemini 3.1 pro high/low, claude sonnet/opus 4.6 was 3 days remaining

balmy depot
#

I'd show you mine, but it's kinda depressing.

silent night
#

both of them still exhausted

#

only this account got refreshed

#

but somehow I'm experiencing a bug

fossil estuary
#

Hey guys I need jules review

gentle aspen
#

yoooo anyone tried GPT-image-2 or GPT5.5

I think both are beating the anthropic's and gogole's so hard, they might cry😭😭✌️
never seing this powerful models.
tbh, AG should add GPT5.5, it is just... do what you say like opus4.7

rain lava
gentle aspen
gentle aspen
rain lava
gentle aspen
rain lava
gentle aspen
#

also GPT5.5 is twice more expensive, but ayy more token efficient, so it is either the same money spent as gpt5.4 OR less, bcs it spends less tiem reasonign

rain lava
gentle aspen
# rain lava Does that also beat Mythos Preview?

almost...
You see, mythos is a 15t MoE model, and Anthropic is known for making their models more "creative" instead of Logical. with the creative thinking + better reasoning accross better semantic understand per given context, it performs somewhat better in terms of understanding and iterating.

now, tell me if you (or me) have the money to pay for a 15t model from anthropc while it's Opus4.6 (fastt) model itself costs about 150$/1m tokens😭✌️

gentle aspen
#

the only thing which gemini does better than any other model which is not gemini is... drum rolls... research... thats it... only that :/

gentle aspen
# rain lava 15t is such a waste tbh

well, each model has their own advantages. also 15T is not a dense 15T, it uses sparse MoE with a better internal architecture. dont know what they mean by "interna; architecture" but it looks like it an uyltra efficient 15T

gentle aspen
rain lava
gentle aspen
#

tell me if thatt sounds like something

rain lava
rain lava
#

How much prompts with avg token usage

#

And what model?

gentle aspen
gentle aspen
gentle aspen
rain lava
# rain lava :0

Ok but benchmarls arent everything... (talking abt gemini 3.1pro here)

gentle aspen
rain lava
rain lava
gentle aspen
rain lava
#

Idk how they raised quotas, they literally cancelled a massive ram order

gentle aspen
rain lava
gentle aspen
# rain lava Idk how they raised quotas, they literally cancelled a massive ram order

bro, just dont care abou thte coporate stuff.
if they give soemthing, just dont question it. Who cares if they cancel RAM, buy RAM, OR EVEN MAKE RAM (maybe ppl will care if they do that). either ways bro, just us it.. it's free.

beggars can;t be choosers
if google limits claude usage on AG, I am limited in claude usage. if openAI removed codex, i dont have codex.

simple as that.

gentle aspen
# rain lava What ctx is 5.5?

in codex default it is 256k, but with fast mode (better intelligence and 1.5x speed at 2x token usage) it is 400k-1m

rain lava
gentle aspen
#

OpenAI is known for 256k-400k ctx.
Gemini is known for 1-2m
Meta is known for 2m
grok is known for 1-2m
claude is known for 64k-1m

rain lava
gentle aspen
gentle aspen
#

gemini is good for planning, GPT is good for frontend, claude is good for backend.

sometimes there might be other models which re good at other tasks.
no matter how much they market their models as "good at everything🥀💔" IT IS NOT. period.

rain lava
#

Why dont companies realise params arent everything? A 15t model, and over t models, are still competing with a qwen 358b model.

gentle aspen
#

BTW, is my new UI better than the old one?

rain lava
#

Which is new which is old...

gentle aspen
# rain lava Why dont companies realise params arent everything? A 15t model, and over t mode...

because after soem point the same data is useless. more params gives the model "creativity" if you would liek to cal lit.

more params means, the model will get more headroom for more context, and more data, also higher params mean it has better semantic understanding. if those benchmarks was something a bit harder. the big models will getmore and the smaller models will get absolutely SMOKED!

rain lava
gentle aspen
rain lava
#

U cant tell me ts isnt ai compression

#

Lnler, suve,

gentle aspen
#

it is more liek bi-cubic downscaling

rain lava
#

I get suve, whats lnler

gentle aspen
#

thats why the low res inmage has the squar-ish looks instead of a smooth anti-aliased look

rain lava
#

Suve is an a but reducuced sm to an u

gentle aspen
gentle aspen
rain lava
#

Look at my img, theres a word called lnler and suve

gentle aspen
#

maybe a problem with your discord client. bcs when i recieve images it s crystal clear.

did you update hte app?

rain lava
#

I js updated,
Still ahh

#

Unless they somehow messed up the android port (even tho its fine linux) idk

gentle aspen
#

ohh wait, I have the same issue with my redmi pad SE too

rain lava
rain lava
gentle aspen
rain lava
gentle aspen
gentle aspen
rain lava
#

Well, moe arch.
Unless theyre trained mostly hard stuff and not much easy?
If so, howre they not fitting easy stuff in 15t?

gentle aspen
gentle aspen
#

thats why the data is outdated. people fix it in SFT and RLHF.

gentle aspen
#

<@&1009526435276394496> Hey, how did I get a timeout?
I didn;t do anything wrong.

plus, i was trying to explain something to a person.

gentle aspen
rain lava
vital acorn
#

I made a project in which everything is done client side. No actual backend, just pure JS, by choice.

But still want to make it as painful as possible for anyone trying to inspect or snoop through the network tab. No API keys are exposed but there are API calls I'd rather keep private.

My approach is layered annoyance rather than actual security. If you defeat all my layers, honestly respect, consider it a gift from my side.

The layers:

First, if DevTools is undocked and screen dimensions look unnatural, redirect with a "your system appears compromised" message. Casual snoopers give up here.

Second, JS bundle is heavily obfuscated with split Base64 encoded prompt chunks scattered across files. No plain English searchable anywhere in the code.

Third, network tab gets flooded with decoy requests that look legitimate. Finding the real API call becomes a needle in a haystack.

Is it perfect? No. A determined 2% will get through eventually. But 98% of casual people trying to copy the idea will hit the first or second layer and walk away.

gentle aspen
vital acorn
#

Yes

rain lava
rain lava
gentle aspen
#

anyways, ima go get a lil break. i have being vibe coding too much. i should take little break by vibe coding, which i can go back to vibecoding after a small vibe coding break while vibng to some music

gentle aspen
rain lava
gentle aspen
#

recent data is licensed and hard ot scrape or extract, but the old is opensource, free and unlicensed, so they can easily obtain it. thats why many models are outdated

gentle aspen
rain lava
vital acorn
vital acorn
gentle aspen
gentle aspen
rain lava
#

Layer 1: The DevTools Undocked / Dimension Check
​The Reality: It’s a great filter for casual snoops. However, a determined user will just keep DevTools docked, open it on a secondary monitor with normal dimensions, or simply use a local proxy like Proxyman, Charles, or Burp Suite. A proxy intercepts the traffic before it even hits the browser UI, rendering DevTools detection completely moot.
​The Verdict: Effective against tourists, useless against anyone who knows what a proxy is.
​Layer 2: Base64 Prompt Chunks & Obfuscation
​The Reality: This stops people from simply Ctrl+Fing your source code for keywords. The fatal flaw here is that no matter how heavily you obfuscate the string, it eventually has to be reassembled and handed to the browser's native fetch or XMLHttpRequest function to make the call.
​The Bypass: An attacker doesn't need to deobfuscate your code; they just need to monkey-patch window.fetch to console.log the final, clean URL and payload right before it gets sent out.

gentle aspen
#

Claude is better at more logical stuff

rain lava
# rain lava Layer 1: The DevTools Undocked / Dimension Check ​The Reality: It’s a great filt...

​Layer 3: The Network Tab Decoy Flood
​The Reality: This is easily the most diabolical and amusing layer. Needle-in-a-haystack tactics are genuinely frustrating.
​The Danger: You run the risk of DDoS-ing yourself or ruining the user experience. Browsers limit the number of concurrent connections per domain (usually around 6). If your decoys clog up the queue, your legitimate API call might get delayed, causing a sluggish UI. Furthermore, if you are paying for bandwidth or API invocations, you are literally burning your own money to annoy a hypothetical thief.
​How to Make It Even More Painful
​If you want to double down on the annoyance for that remaining 2%, here are a few more psychological weapons you can deploy:
​The debugger; Black Hole: Add an obfuscated setInterval that calls debugger; every 50 milliseconds. If a user opens DevTools, the browser will freeze in an infinite loop of breakpoints. They can disable breakpoints, but it’s another highly irritating hoop to jump through.
​Monkey-Patch Native Functions First: Before they can monkey-patch fetch to steal your URLs, you monkey-patch fetch, XMLHttpRequest, and console.log to behave erratically if they are called directly from the console.
​Decoy Payloads with Variable Responses: If you use decoy requests, don't just send them to a 404 endpoint. Have your server (or a cheap serverless function) return fake, valid-looking JSON data. If the attacker finally isolates a request, make them waste an hour trying to decipher a JSON payload that is entirely meaningless.

gentle aspen
#

nvm, @rain lava is the AI prompter here 😄

rain lava
rain lava
vital acorn
rain lava
misty wasp
vital acorn
#

my Gemini can message me directly now

rain lava
gentle aspen
rain lava
rain lava
gentle aspen
#

I am concerned about humanity tbh

rain lava
gentle aspen
rain lava
#

Ur ai?

vital acorn
#

Yes my AI agent... (Used Groq API for LLM)

bronze pecan
#

Ok

rain lava
bronze pecan
#

Coriander can I get a coding for my application

gentle aspen
#

IDK what model you used, but those ethis are finetuned and RLHFed into the model, maybe you are using an outdated model.
no AI agent will pirate movies.

and to answer your questions: YES! AI companies does have LONG system prompts to make the AI ethical (just how you should doit too if you dont want to get sued or banned on GitHub)

vital acorn
# gentle aspen IDK what model you used, but those ethis are finetuned and RLHFed into the model...

😆 I’m actually building it mainly for this purpose… no AI just starts pirating on its own, I’m shaping this environment to make it do that. Just working on one of those late-night ideas 😅

Basically trying to create something more like GPT that can actually do some actual search tasks instead of just searching on Google…

And yeah, I’m not planning to release it anywhere, just making it for fun.

stark sapphire
#

Why i hate AI sometimes.
It randomly decides to look in a folder it should not. In this case it's "Reddright" It's not even the project we are working on.
i know i can tell it to not look, but still.

gentle aspen
vagrant folio
#

Hi All gemini-cli has new zed ide integration is experimental

vagrant folio
#

and also JetBrains IDEs

#

Any other IDE that supports the ACP Agent Registry can install Gemini CLI directly through their in-built registry features.

winter viper
#

<@&1009526435276394496>, its in every channel

brittle spade
slender stag
#

Hey everyone,

I’m currently using the Antigravity free version, and I’ve been facing an issue in Agent Manager where my chats keep disappearing after a day or two.

Is this expected behavior?
Does the free version have any limits on chat history or storage?

Would really appreciate if someone could clarify this. Thanks!

rain lava
trail vale
acoustic cloak
#

@south turret
Which ai model we had a convo around was the last time we spoke?

rugged mango
#

heya, im working with google AI studio and asking if there is a way to import a github project onto it?

rugged mango
#

i am just asking if i cana import/connect a github repo to the google AI studio when building apps

stark sapphire
#

<@&1009526435276394496>

hallow marsh
stark sapphire
hallow marsh
south turret
#

Been using Opus for months now

#

Now I use both Opus 4.7 and GPT 5.5

gentle aspen
#

Guys will this UI look good for my app?

I made this in figma, I just want to know if this UI will look good on my AI powered study app

stark sapphire
#

I see you often ask about UI. But i will keep saying the same thing. Don't worry too much about UI. If it works, it works.
Don't try to spend too much time on it, but rather focus on functionality.

gentle aspen
stark sapphire
#

It gives me Nintendo vibes.

#

and i kinda like it. but it depends were you use it for.

gentle aspen
gentle aspen
gentle aspen
stark sapphire
#

the only thing that would trigger me, is that the buttons are not alligned at the buttom

gentle aspen
#

This was my old and crusty create menu.
users can create study cards from this page.

it looks bad, and also doesn't feel user friendly.
Now check this out:

stark sapphire
#

I would change the size of the text boxes.

gentle aspen
#

yeah yeah, you will be impressed when i show you the new UI. gimme one sec, i am taking a screen shot

stark sapphire
#

it's fun seeing this actually. I see the things AI does a lot when making certain elements.

#

needs lots of iterations

gentle aspen
#

When user's press the create button, this dropdown will open. then the user can select what they want. then they will come to the respective menu, like this:

stark sapphire
#

PS: GPT5.5 is my new favorite AI model for coding. It just solved my website issue in just a single prompt.

#

aaah i seen the first image before

gentle aspen
gentle aspen
stark sapphire
#

yea i member

#

Hey Google, you hear that? Open AI is now better than you.

#

I want google to be #1 for reasons.

gentle aspen
# stark sapphire Hey Google, you hear that? Open AI is now better than you.

if you scroll up, you will see me say "There are AIs for their respective categories"

no AI will be #1
You should be smart enough to know what model to use for your use case.

for me, claude is good for iterating, chatgpt is good for nothing (1% frontend, and 99% glazing that you are different for making a frontend), gemini for research for very obv reasons.

gentle aspen
stark sapphire
#

xD

#

you are right, no number 1. But I was simply impressed by GPT5.5. I tried to solve the same problem with Claude and Gemini, and non of them were able to fix it. Asked multiple times.

gentle aspen
#

If you got pyshological issues after looking at this graph. dont worry I got it too.

Can someone make this benchmark make sense for me (and anyone who sees this and doesn't get concerned)

gentle aspen
stark sapphire
gentle aspen
stark sapphire
#

haha i dunno. I only ever used Antigravity and Cursor.

#

I find Cursor to be working better. less buggy. But is way more expensive.

rain lava
rain lava
#

@gentle aspen how's training going?

normal current
#

I'll be patient until Google I/O on may 19th, I guess we'll get some updates then.

rain lava
rain lava
rain lava
rain lava
#

<@&1009526435276394496>

gentle aspen
# rain lava <@1341827001429921892> how's training going?

it is going really well actually.
and i managed to make gemma3 reason out a better reasoning chain that opus4.1
since googles grokking was absolutely phenomenal it made my life even easier.

the only problem, is Gemma3:270m has a shallow depth for proper generalizing when there is: less epochs, not the preffered amount of data, etc.
so it reasons out something genuinely crazy 9not in a good way).

but in the positive side, i managed to make a small model reason.
The better part is my program made over 11m tokens as 16k samples as of right now, so it is a matter of time, until I finetune either gemma3 or SmolLM2:360m in a few days.

gentle aspen
# rain lava Same, I don't like how much google is falling behind. Especially when I pay an e...

https://youtu.be/F6T-G33jF3c?si=t-OIuZhpvTGcuNFm

Dang... This video changed how i see google.
but i can remember how the lacked in performance back then.

In 2023, Google lost $100 billion in market value in a single day after its AI chatbot got a basic fact wrong on Twitter. Their image generator was producing historically inaccurate results, and their search engine was telling people to eat glue. Over the next two years, OpenAI, Anthropic, Meta, and dozens of startups raced ahead while Google st...

▶ Play video
rain lava
# gentle aspen it is going really well actually. and i managed to make gemma3 reason out a bet...

If you want you can use https://kaggle.com --- It offers a free tpu v5e-8 (You just need to do verification stuff), or just use their GPU t4 x2 or a P100.


T4 x2 Has Tensor (Unlike the P100) and has a total of ~30GiB VRAM (Both togethor)
P 100 has HBM2 Memory, but 0 tensor cores.
And the TPU v5e has 16GiB VRAM/Core, and HBM2 memory (You get 8 cores).


Another alternative is Google Colab, as they offer some other premium GPUs and TPUs kaggle doesn't though I find Kaggle gives you much more free stuff.

gentle aspen
#

but the t4 is interesting

#

can you tell emabout the rate limits

rain lava
#

For the T4s

gentle aspen
#

DUDE LOOK!

gentle aspen
rain lava
gentle aspen
rain lava
rain lava
gentle aspen
rain lava
rain lava
gentle aspen
#

What gives me the bes rate limits an speed?

bcs i just searched and they said that t4 x2 has a ~9 hour sesion limit

rain lava
#

30h for t4s or p100 and 20h for tpus

gentle aspen
rain lava
rain lava
gentle aspen
rain lava
atomic vortex
#

Are you folks using Google Colab Pro subscription or just the free one?

rain lava
acoustic cloak
serene crow
#

𝗩𝗶𝗯𝗲 𝗰𝗼𝗱𝗶𝗻𝗴 𝗱𝗶𝗮𝗿𝗶𝗲𝘀 𝗘𝗣1. - I built 𝗧𝗶𝗱𝘆𝗨𝗽 𝗔𝗜 in this weekend.

🔥Friendly AI that takes a messy space and outputs a visual organization guide.
💡Gemini Flash + Nano Banana (Image Flash) + Gemini Live API
💡The Flow: Image Input ➡ Spatial analysis ➡ Organized Image Output & Q&A with Gemini.

#BuildwithAI using GoogleAIStudio

atomic vortex
serene crow
serene crow
smoky dust
#

Firebase is cool complexity.

vagrant folio
#

for all, if any had next issue and was able to solve it:
Antigravity
After update any chat I do in any project new or old, my chat history is not linked to the project so every time I need search for them

gentle aspen
#

so it is godex now?

fossil estuary
#

Yo guys I need jules review, can someone explain what it is, does it excel in what it does... because Ihavent seen any latest videos of jules by anyone

fossil estuary
bright jacinth
fossil estuary
bright jacinth
#

Mostly the boring, time-consuming stuff! Writing test cases, updating documentation, doing massive multi-file refactors, or tracking down tricky bugs. You just hand it a ticket and let it do the grunt work

fossil estuary
#

I see, can it also be used for checking code vulnerabilities?

bright jacinth
#

Yes but make sure promts are good

fossil estuary
#

I see

bright jacinth
#

3/5 rated model for me

uneven bridge
#

@stark sapphire watssupp bro how yu doing? I'm sorrry how we both crossed paths before we got a chance to meet.. i hate it had to happen that way hope you forgive me and we keep building this GDG family forward google_cloud ❤️

gentle aspen
stark sapphire
uneven bridge
#

thank you for knowing my play ! im brining it all home

#

🥷 🎁

uneven bridge
#

love you all Googlelogo was never in y'all lane i'm playing my role see you all soon

silent night
#

what tf

stark sapphire
#

i wish AI would ALWAYS say this, instead of, YUP! here are some ideas!

stark sapphire
silent night
raw sparrow
#

hii everyone

tall kindle
#

...

gentle aspen
stark sapphire
#

i'm not sure what you're trying to imply.

uneven bridge
#

https://www.loom.com/share/878a0a2e72714f88ada46c272e73ed3b its already done but yu can check out the before

#

I love my ad i be back with one later

gentle aspen
#

guys, pls (for the love of god, plSSSS) pick GDev

soft gate
#

Hello guys

glacial beacon
#

Hi

clever anvil
#

👀

uneven bridge
#

🤭 😘

#

keep them ights on buddy

#

i keep going mf i got it all

#

🎁

#

i was on yall fr juu heard i can do it all & wall back to wall st.

#

this fun too me 😘

#

im from richest city in the world them ads is money moves freeeee lunch !

#

and this the cloneeee lol i love yu all thank for having me apart of the team

sand loomBOT
#

Curious to learn how to get creative with Gemini Canvas? gemini

Our friends on the Gemini Discord server are hosting an event on their server this <t:1777487409:F> with a Gemini's Creative Technologist for an in-depth technical session showcasing his latest workflows with Canvas and Nano Banana. Tune in to see his personal prompting techniques to inspire you to unlock your own creativity with Gemini.

gentle aspen
# uneven bridge

hey man, i am not the guy who discourages others. but CRMs and AI powered "SAAS tools" are just too overated. and you just a small fish in a pool of millions of sharks.

I reccomend you try somethng new and innovative. maybe evn collab with someone here.
No hate. Cool project. Just give it a bit more spice.

sudden fable
#

Hello Vibers, What are you all building? and Why are you building this?

gentle aspen
#

any tips on improving my settings pages?

dim current
#

Hi.

rain lava
floral marlin
#

Hi y'all guys here...

sick ibex
#

Feeling spicy and got tokens? add this at the end of your prompt #DareYou 😄

hearty basalt
#

Hi

uneven bridge
stark sapphire
rain lava
sick ibex
#

I am not ready to face that kind of reality yet

smoky dust
#

#VibeGarchaNepal 🙏

uneven bridge
#

big GDev ion want no problems with chuuu @stark sapphire gemini

delicate ember
#

is geminiCLI and the extension Gemini code assist the same?

round solstice
#

No.

#

GCA has an agentic relationship with VSCode, and can observe certain windows as well as update files. Gemini CLI launches and edits things directly using tools. You can use Gemini CLI in the internal terminal of VSCode... I did that for a while before Antigravity came along.

smoky dust
#

Firebase too

uneven bridge
#

get ready to take over my city im taking year off for school and to learn from the goats ! thank you guys love yu family let get ittt #GDG google_cloud ❤️ 💚

vapid hornet
#

Hi we're hosting hackathon everybody is open to apply and register!
🌐 Google Developers Group Hackathon at KIMEP University — Open Worldwide (Remote Friendly)
🚀 BUILD WITH AI 2026: THE FABS 3D AI CHALLENGE
If you’re into 3D development, software engineering, or machine learning — this is for you.
Join from any city, any country — fully open to remote participants worldwide.
Build the next generation of 3D AI systems, Autonomous Agents, and Spatial AI products. Present your ideas to experts and compete for a $1,000 Innovation Grant Pool + Google Cloud Credits.
🎯 3 Tracks: • 3D AI & Spatial Intelligence
• Autonomous Agents
• Applied ML in 3D
📅 Key Dates: • Submission Deadline: May 2, 11:59 PM (online)
• Hybrid Online Pitch Day & Results: May 3, 12:00–18:00
🔗 Register: https://gdg.community.dev/events/details/google-gdg-on-campus-kimep-university-almaty-kazakhstan-presents-build-with-ai-2026-online-build-phase/�

Google Developer Groups

Virtual Event - BUILD WITH AI 2026: THE FABS 3D AI CHALLENGEHosted by GDG on Campus KIMEP × FABS3D AIA high-stakes, asynchronous DeepTec...

uneven bridge
#

i knoo you guys worked real hard to get where you at & i don't want to half step this process to be in this community

torpid sun
#

yooo wsp guys

#

i am @gentle aspen

#

I forgot to add an MFA authenticater app to my acc, and I had to reset my computer bcs of the recent hacks 🙁

#

<@&1009526435276394496>

Hey, can I get my role and levels to this acc temporaraly until i contact discord and delete my existing account, so I can get my role back to an account made from the email i hadf. bcs i made a temp Gmail for this acc, so i cant trust this account forever

torpid sun
# rain lava Fr 😭

I AM TRYINGGGGG

  1. it is Qt
  2. so, now I have to write code for both dark AND ight mode, bcs for some reason I thought Qt was "good" bcs it take less RAM. well... ChatGPT is responsible for that
#

for any newbie trying to build an app, PLS DO NOT go with Qt

rain lava
torpid sun
#

I had to reset my computer. i forgot that i deleted all the sessions

#

I forgot to log in with my phone while i had the sesion on my computer.

now I lost many friends i had 🙁

torpid sun
torpid sun
#

aighyt, today i WILL implement the darkmode

#

codex is already on it

#

btw, i am kinda sad about gemm4

it was all hyped and all that, but it consistantly fail or get very low scores on benchmarks.
also the extra embeddings makes it heavy for mobiles devices, not light .

the only thing i use gemma4 is for creating synthetic datasets and tasks which need gemma to follow niche system prompts

#

well... there we go

rain lava
torpid sun
#

umhh... hh oh okaayyy

torpid sun
#

listen, man. it is still in the beta, and i need to doa lot of improvements.

but it is okay for now. I mean i cant push updates anyways, bcs github thought it would be cool to suspend me for no apparent reason

rain lava
torpid sun
#

it is "okay", but certain places have very noticable imperfectrion.

also the whjoel codebase looks like broken spaghetti rn

rain lava
#

God i hate switching back to windows just to play 1 game... all they needa do is js let battle-eye do linux im pretty sure its js 1 button

torpid sun
#

just use a VM

rain lava
#

Anti cheats in a vm will get me banned

torpid sun
#

windows is getting on my nerves too

#

I am planning to change to linux comlpetely

rain lava
prime bane
#

Hello Everyone

rain lava
prime bane
#

Am a Vibe coders development

torpid sun
torpid sun
#

somethin feels more user friendly and convinient in the long run

rain lava
torpid sun
#

I hate gnome

rain lava
#

Same which is why i got off it

rain lava
#

Even tho its arch based

#

Ppl complain abt arch updates for being buggy and ive never gotten one bug

torpid sun
#

actrually arch based linux distros are good for me. bcs many try their best to be user firendly, and the fact them being arch means I have a lot of control

rain lava
#

I was hoping my tablet could use the Native Terminal with Android 16 but my older SnapDragon 8 Gen 1 doesn't support uVM Terminals only protected terminals...

#

I'd love to run proper Linux on this tablet but thanks to Samsungs Proprietary Drivers, Khox Fuses, and a ton of keys I cannot

gilded zephyr
#

hey folks, if you have to inspect entire code files and folders and detect duplicated ones if there is.. how would u do using antigravity/gemini cli?

pastel hemlock
#

Lets say entire model was built using no truncation or zero compression is that a good thing

dire oar
#

hey all , yesterday was my final submission of hack2skill promtwar virtual hackathon and i got 76 rank

small creek
#

Hi guys

#

Need help looking for someone to recommend me some good AI for my project.. my client is impatient to see the results🤧

pastel hemlock
torpid sun
#

POV: me with 35% plus credits left before weekly reset 😈

#

nvm guys, don't do it...
I am already at 0%🥀🥀

fast tangle
rain lava
atomic dock
#

hello

torpid sun
#

YOOOOOOOOO

#

EVERYBODY!!!!

#

btw @rain lava

#

remember how I was making a sumarizer model?

#

it finally finihed training. i will be pushing it o ollam atoday

torpid sun
#

dude it is INSANE!

#

it beats a 30b model at summarizing niche stuff

#

plus it is wayyyyy cleaner

#

I am doing benchmarks rn

rain lava
torpid sun
#

should I fake osm ebenchmarks by mixing in a training too😈
I mean OpenAI already does that. maybe I shouldn;t. maybe I should be a good boy and do it fair

#

what way shoudl i go wioth>/

torpid sun
rain lava
# torpid sun it beats a 30b model at summarizing niche stuff

Well that makes sense. Models trained on good stuff rather than the whole internet are usually better

I actually "bench marked" some local models.
And found Qwen 3.6 is really bad unless it's what it's really good at... (Coding, it beat 3.1 Pro by using more efficient libaries etc too...)

rain lava
torpid sun
#

thats why I always use community finetunes for personal projects, and release builds for commercial/public stuff

#

or my own finetunes

rain lava
torpid sun
#

i prefer qwen opus distills, bcs qwen models are already socratic right off the bat. a opus finetunes make qwen better. also it doesnt have a stupid audio encorder which makes the model use more vram than it is meantot

rain lava
torpid sun
#

googles main points were coding and better visual understanding, also they said the semantic understanding is better (probably a bigger vocab). this could also mean they will release a cracked embedding model, but other than that, gemma4 just underwhelmed me with its poor performance and the resources it taskes to do a dead simple task

rain lava
torpid sun
#

no I havnt. gemma4 by itself made me sad. i just didnt have enough copium to try it 🙁

#

I will tho

rain lava
torpid sun
#

I reccomend you try models froma guy called "jack wong" or something liek that. he releases some good models

#

usually merges and distills, but they are crazy good

rain lava
torpid sun
#

idk why people glaze distill-qwen models soo much. i find deepseek by itself crazy good compared to qwen distills. it makes the model dumba nd unstable

#

not this unstable

#

Anyone here knows russian?

#

my model can apperantly geneerate russian

#

I just want ot test yk

rain lava
rain lava
torpid sun
#

you have to explicitly download the deepseek model

torpid sun
rain lava
torpid sun
#

haha OKAY!
i am so excited.
this is my first model which didnt halucinate or goes rogue after a very simple. dead simple. prompt

#

I am doing my benchmarks on:ROUGE, BERTScore, FactCC, G-Eval, BARTScore, SummaC

#

is it solid?

rain lava
torpid sun
#

aight

rain lava
# torpid sun aight

tbh idk if it's just the 9070xt but I can PL this thing to 70% and still get 95 tflops at fp16

rain lava
rain lava
torpid sun
#

ohh mb, i didnt read that right. as a conversational human model, io create halucinations. if my response was bad, please DM me. for further release notes please refer to www.myparents.com

#

lol

torpid sun
#

are you genuinely out of your mind?

#

mr sherlock holmes 😭

#

it was a JOKE!

#

😭😭🥀

rain lava
torpid sun
rain lava
torpid sun
#

ofc it is not mine

#

it was a joke

#

I made it up

#

dont search up random sites, bro

#

😭

rain lava
torpid sun
#

yup!

torpid sun
#

I am pushing the model right now. i will add the readme tomorrow, i have a lot to type.

#

MY ADHD!!!!

torpid sun
#
#

finally!

#

made it

#

btw, you dont have to say things like Summarize this: ... you can just throw in the text like photosynthesis is the ....
Accepts upto 7k words without emojis, and 6.8k words with emojis.

as of now, it only supports english.

#

incase anyone want to try, here is a 1200 word text to start off lol 🙂

fiery lagoon
#

ts is crazy

#
#

oh

#

u already posted it

#

VERY GOOD GUYS

#

give it a try

#

its peek

torpid sun
#

eyyyy thx for the glaze, man!

#

yeah! it is actually really good. (well it gotta be for the years of research lol)

naive sparrow
#

Okey

torpid sun
#

Okay... this is very interesting than I originally thought...

#

just to let you know summarizer:q8 is my model

#

it consistantly beats all the models (including the 20-24b models)

#

just to let you know LFM2 is 24b parameters

#

also, in BARTScore the lesser the better. (whih is my model)

#

Lol, this benchmakr kinda revealed the dirty things other companies do

torpid sun
#
#

I just wrote one

rain lava
torpid sun
#

i will see

pastel hemlock
torpid sun
pastel hemlock
torpid sun
pastel hemlock
torpid sun
#

lol, my model summarized its own modelcard🤣

#

this is some yellow king defeated by a green lord ahh irony

pastel hemlock
torpid sun
#

parameters?

pastel hemlock
torpid sun
#

how much parameters does that thing has?

pastel hemlock
torpid sun
#

just tell me how much parameters that hting has bro😭

pastel hemlock
torpid sun
#

something what an amateur would say

#

lol

#

just tell em how much XXXm or XXXb parameters your model has dawg

pastel hemlock
#

110

smoky dust
#

How does everyone define vibe coding

rain lava
rain lava
severe spindle
#

been using gemma 4 31b

#

seems to love to halucinate? Does anyone know?

rain lava
severe spindle
#

any clue if this is just how the model is or perhaps theres some problem?

#

im having to run multiple verifications for what it writes otherwise

rain lava
# severe spindle mm told it to run vitest. using openrouter with copilot so getting the standard...

LLMs have a massive bias toward success. When they look at a messy terminal output from vitest, they often "vibe-check" the logs. If it looks like a test ran, their internal weights often skip the "FAIL" markers and just report "Tests passed!" because that's the most common pattern in their training

Teling an AI not to do something (like "no shape tests") is hard. By mentioning "shape tests" in the prompt, you're actually increasing the "activation" for that concept.

Having 262k tokens is great for reading a whole repo, but on OpenRouter, using that much context can actually make the model less likely to follow the very last instruction you gave it. It gets "lost in the middle" of all those old failing tests.

severe spindle
#

ohh. Could you suggest what I can do then to prevent this?

#

Im changing the prompt from mentioning shape tests to instead ask it to ensure all tests are behavioural . Hoping that helps

rain lava
# severe spindle ohh. Could you suggest what I can do then to prevent this?

Tell the model: "If you say a test passed, you must quote the specific line from the terminal output that shows the green checkmark or 'PASSED' string." Making it retrieve the exact text prevents it from just making up a summary.

Add a line to the system prompt: "Assume the tests have failed unless you can prove otherwise. Be highly critical of the logs."2. Ending the "Shape Test" LoopLLMs are bad at "not" doing things because the keyword ("shape test") stays active in their attention.

Instead of saying "Don't use shape tests," tell it what to use instead. For example: "Strictly use functional logic tests only. Shape-based assertions are deprecated and forbidden in this repo.

#

If that doesn't help try lower the context length or lower the temperature.

severe spindle
#

ooh Ill try that out :) thanks a ton

#

dont think I can change the temperature of the context length within copilot but ill try the changes in the prompt and system instructions once :)

#

thanks a ton Corban!

torpid sun
#

orr just stop paying openrouter, and gwet an actual good model like gpt5.5 or claude opus/sonnet.

aint no opensource model (unless there those huge ones like Kimi or deepseekv4 pro) wont help you much

torpid sun
#

GLM is good

#

idk man, mistral is a bit contro

rain lava
rain lava
torpid sun
#

wdym?
glm4.7 is good, i use it everyday

rain lava
pastel hemlock
rain lava
pastel hemlock
rain lava
pastel hemlock
#

Actually its probably more like 150

rain lava
pastel hemlock
# rain lava A 110 Parameter model can't even write a proper sentence.

A “110-parameter model” here does not mean an AI language model with 110 weights.

It means my physics framework has ~110 system parameters: temperatures, heat loads, conductances, vacuum pressure, gas flux, Knudsen number, laser pulse energy, adsorption rates, spin coherence, timing jitter, and cross-coupling terms.

Those parameters define a coupled cryogenic + gas + laser + surface + quantum-sensing system.

So the question is not “can 110 parameters write a sentence?”

The real question is:

Can a ~110-parameter multiphysics constraint model find one operating point where all thermal, flow, optical, surface, and detection constraints pass simultaneously?

That is a completely different category of model.

rain lava
pastel hemlock
#

Only a narc would think like that

rain lava
pastel hemlock
rain lava
pastel hemlock
rain lava
pastel hemlock
rain lava
pastel hemlock
severe spindle
#

anyways

torpid sun
#

aight asight chill

#

its nothing deep

#

sorry for this chaos I started

#

either ways... nice work @pastel hemlock

#

js don't argue about anything, you might get banned if else. js for your own sake, bro. keep it clean!

vagrant folio
#

Hi Guys question for those who use Antigravity:
Im experiencing a new bug:
1-new clean sessions are using withut request context from other projects.
is only for me or the are other people

torpid sun
#

maybe it is a memory feature

#

to test weather if I am right, try asking something which the an instance of the last chat wouldknow

#

if it answers right, might be a new memorry features liek all the AI hood. js turn it off in the settings

vagrant folio
#

what I know:
if you ask for old info from session he will answer
but answer from other project before wasent working for proper separation

The test I did was basicaly I create a workflow which do specific task one by one
Then create a new workflow but with other steps

then create a new session and run this new workflow and ask him why he try use those command and he give answer from older session totaly using other workflow insted the new created

another issue was I was working then he start coding a web feature and on cli project when I asked why he try to do that his answer was because the project X use UI when I was working in totaly new project non related

#

I found this issues most of the case worknig on a workflow creations

vagrant folio
#

well I dii my debug the main issue seems extension.js

#

from antigravity it change how he save session history etc

#

and seems he has the big bug

#

it store info to sql database

#

so I even test wipe the sql and test new nothing so next testing Im doing is fix it directly to cofirm

#

and a new bug I have is Im unable submit bugs

#

so im goign to one support to try get to AG

#

but right now im rewriting the workflow to gmeini-cli

#

to be used in vscode

#

has more 3 pro quota

#

and I think will be easy create a antigravity clone extension which will use gemini-cli ACP jaja

torpid sun
#

holly essay bro

#

discord supports MarkDown

#

😭

#

Discord MarkDown

Discord suppots MarkDown
see, you can format stuff

  • or just say all your points
  • like this!
#

tell me that code line like this

maybe it is a whole code block
# like this!
#

|graphs|can|
|be|also|
|created| here|

#

okay, maybe not that part

#

butyou get the point right?

vagrant folio
#

Antigravity History fix part1 🙂 this one I tested for new history chat and it worked. still need research for old chat from older project if possible. But right now for all new chat the history is working fine in my case

#

The issue of Antigravity issue is because they did changes on this file how he handle the link to the project which is stored in the database sql

#

So I will store the exact result later of changes to have them at hand for nw version if the issue persist

#

in other hand I consumed the flash model in just 1h

#

Im moving to Vscode and gemini-cli with code assistant and free codex for now. and antigravity seems his low quota even for flash model will be for reverse engineering himself for fixes, maybe will be easy add acp support for gemini cli

serene crow
#

Waiting for GoogleIO

vagrant folio
torpid sun