#stax・🥞


oblique pewter
#

Introducing Stax: a new tool for AI evaluation. 🥞 google_labs

Stop guessing if your AI is working. Stax gives you the hard data and flexible tools to see what’s really working in your AI, so you can build breakthrough products with confidence 🔥

labs_love Try the product for free at stax.withgoogle.com.
💡 Learn more about Stax at https://developers.google.com/stax/overview 🥞
💬 The Stax team would love to hear your thoughts right here in this channel! Whether you’ve found a bug, want to share what you’re evaling, or learn about best practices, this is the place to be labs_love look out for folks with the Googlelogo next to their names!

Google for Developers

Stax documentation - Overview

little peak
#

when I try to login/signup I get this

rare egret
#

@little peak Thanks for your feedback. Are you trying to sign in with a Workspace account or Gmail? Also note that Stax is currently only available in the US

dense python
#

first

patent sun
stable gale
vapid dagger
crimson star
#

Explain it to me like I'm 5: what is this AI tool for?

dawn lily
oak path
#

or where can we see the documentation, so we can nicely ask NotebookLM to answer with videos

halcyon sphinx
#

whoaaaa

rare egret
stark quarry
#

Only US

#

🙁

restive canyon
#

Yooo

oak wren
#

hmmm

woven spear
#

Getting the XML error, be kind make an HTML page that says only available in the US

weary hazel
#

but still doesn't work on firefox

limber summit
final meadow
#

Works with VPN or no?

echo palm
#

Hey guys, VPN please, outside of US please

limber summit
#

If it's geoblocked then VPN also doesn't help cos it needs login and the region is set in my Google account

wary latch
#

Is it possible to use it with best answers as reference?

limber summit
#

Was/Is the same with other geoblocked Services/Labs Projects.

patent sun
shell valve
#

Cool

shut phoenix
#

Same as all the other non us sign ups! Looking out for an update

wary latch
shut phoenix
#

Is this intentionally US only or an error?

#

Have been using langtail so excited to try this

little peak
patent sun
patent sun
glacial vortex
#

Is it possible to add custom models via OpenRouter API key?

patent sun
shut phoenix
#

That's gutting, any non us workaround or exceptions. I consider myself a citizen of the world if it helps!

glacial vortex
weak bloom
#

could this be used to perhaps automatically tune a system prompt for example to fit my own criteria?

patent sun
wary latch
#

GitHub models is also good for evaluations. Easy way to test many models easily.

patent sun
restive canyon
#

Uk pls

quick musk
#

on release, broken

#

"POST" is not supported

#

"Unexpected Error: Something went wrong. Details: Request method 'POST' is not supported"

patent sun
quick musk
amber cedar
#

This XML file does not appear to have any style information associated with it. The document tree is shown below.
<Error>
<Code>AccessDenied</Code>
<Message>Access denied.</Message>
</Error>

patent sun
amber cedar
#

ohhh thank you!!!

#

will there be any update on when it'll reach other countries (👀 new zealand)

patent sun
#

For all folks who are blocked because of the country restrictions, I'd love to hear what you're looking to use Stax for!

amber cedar
#

i would be keen in seeing how it works just generally and especially am interested in comparing models at different types of tasks (especially general knowledge comparisons, and data manipulation)

sonic nexus
#

@patent sun I cannot access it. How can I access it outside of the US?

patent sun
cosmic mulch
#

I can't go in.

#

Ok, I got it

sonic nexus
#

@patent sun hmmm any plans labs and real agent mode for pro users or do you have any suggestions? Cli updates?

#

Maybe it out of your knowledge base.

#

Is out*😅

patent sun
patent sun
sonic nexus
patent sun
quick musk
#

Status?

cosmic mulch
#

And what else could be the problem? If anything, I'm comparing two models (Google and openai) And there are API keys for both

quick musk
#

Supposed to be fixed coming out

cosmic mulch
patent sun
brittle parcel
#

I clicked that pancake emoji yet getting access denied error on web

#

Is that unavailable in India?

vestal linden
#

It sounds so interesting, it would be great to have access to it and try a few features,
I have access denied now.

lusty holly
#

If anyone needs help, they already know where to find it.

stone turtle
#

Would love to know when support in Canada is coming out

patent sun
lusty holly
#

I'll show you two prompts, which were refined in my Opal and tested on Stax

lucid estuary
#

woo

lusty holly
# lucid estuary woo

These two prompts basically have nearly 16 processes, not even counting the OPaL refinement steps.

lusty holly
#

👀

noble talon
deep dragon
#

Please open for South Korea too!

lusty holly
#

of 81 refinement flows in one step, all failed

#

It didn't work for me, I lost it

last pivot
#

@patent sun I'm from India, and I am unable to access Stax. Any update?

quick musk
#

also, I still got nothing

last pivot
last pivot
lusty holly
#

there are 187 tests, and I didn't get past 32.6

patent sun
lusty holly
lusty holly
#

I didn't think it would be so difficult to refine text

brisk heart
#

@patent sun Hello! I'm in Korea, but I can't use Stax. Could you please do something to make my account available?

patent sun
junior coral
#

I would like to access stax from India tho

patent sun
junior coral
light flume
#

Is it possible to add images to the input prompts to be evaluated?

patent sun
patent sun
light flume
# patent sun Not yet, but we are working on it!

I use Gemini to extract information from construction blueprints, however, because these have very complex and diverse layouts, I am trying to test which prompts provide the most effective extraction from many drawings.

#

some examples @patent sun

patent sun
light flume
#

Originally, yes; but not when I shared them with Gemini. I converted PDFs to PNGs, then I share PNGs only.

brazen shoal
#

Hello. Thanks for Stax, it's an amazing tool ! I think you really nailed the workflow for evaluating Gen AI applications 🙂

Do you plan to support Vertex AI in the coming months (e.g., with Service Account key file) or propose Stax from GCP ? This would help us use Stax in our organization.

mint laurel
#

is it possible to evaluate generative images also in stax ? would stax be going in the image segment also or would it be more into the text only evaluation segment ?

royal lance
#

@patent sun Hello Swilt, is it possible to add another AI provider to the list? If yes, could you explain how? Thanks.

cosmic mulch
#

What does it mean?

cosmic mulch
livid summit
#

Pls can someone answer my question

I'm in Nigeria, and when I tap to use Stax in my Chrome browser, it says it's not available in your country. But it still redirects me to this site

And it says "if you want to use Stax you must tell us here"

Pls explain

weak zinc
lusty holly
cosmic mulch
lusty holly
#

session is leaving on its own, and excluding the training

#

?

patent sun
patent sun
patent sun
cosmic mulch
sweet wigeon
#

how to use?

sweet wigeon
# sweet wigeon

hello <@&1366831844502274180> could you please make this available for me

#

I am from India

cerulean ruin
#

hello, this is not a moderation issue. please only ping mods if there is an active issue that goes against server rules. Thank you!

royal lance
patent sun
sweet wigeon
royal lance
#

@patent sun Thank you. How would you prefer to move this forward? Would it be alright if they contact you directly to discuss what’s needed, or is there a specific channel you’d like them to use?

lusty holly
#

mine basically ends the session, and even deletes the process done, everything simply disappears

#

This is the third time I've been doing refinement and the session simply ends and everything disappears.

#

then you need to clear the entire browser cache

#

the result of this refinement simply disappeared 😩

#

basically the session ended and took the whole process ☹️

#

@patent sun

weary hazel
lusty holly
weary hazel
#

thanks for explaining 😅

lusty holly
weary hazel
royal lance
# patent sun What country are they in?

Shanghai, China, they develop and operate their own llm models across text, video, and multimodal. you might already be familiar with their image and video models under Hailuo ai.
https://www.minimax.io

Build with next-gen AGI: MiniMax offers Foundation AI models (text/audio/image/video). Power products like Talkie, Hailuo AI & enterprise APIs. Scalable, secure AI solutions for 50K+ global businesses.

weary hazel
rare egret
cosmic mulch
dense folio
#

Pls add Germany for this pray

lusty holly
#

вы поняли? (Do you understand?)

dense folio
#

i can't speak russian

quick musk
#

so...

#

stax seems to be still broken sad_marx

patent sun
quick musk
#

yeah, just "Unexpected Error: Something went wrong. Details: Request method 'POST' is not supported"

patent sun
quick musk
lusty holly
dense folio
#

english/german/spanish

#

NO russian

patent sun
crystal vortex
#

Can anyone explain to me like a 5 year old what Stax actually is and how it works?

patent sun
# crystal vortex Can anyone explain to me like a 5 year old what Stax actually is and how it work...

Made this with the help of Gemini 🙂

Imagine you have a super smart robot that can write stories for you. But how do you know if the stories are any good? Are they exciting? Do they make sense?

Stax is like a special helper that reads the robot's stories and gives them a grade.

Here's how it works:

  • You give Stax the robot's story. This is like turning in your homework.
  • You tell Stax what makes a good story. Maybe a good story needs to be funny, or have a happy ending. These are your rules.
  • Stax reads the story and checks your rules. It's like a teacher with a checklist, making sure the story has everything it needs to be great.
  • Stax gives the story a score. This helps the person who built the robot understand if it's a good storyteller, or if it needs more practice.

So, Stax is a tool that helps people check if their smart computer programs (AI) are doing a good job. It makes sure the AI is working the way it's supposed to.
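
The checklist idea above can be sketched in a few lines of Python. This is only a toy illustration and not Stax's API: the `Rule` type, the rule names, and the keyword checks are all made up for the example, and a real evaluator would typically ask an LLM to judge each rule rather than rely on string checks.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    """One checklist item: a name plus a function that returns True on pass."""
    name: str
    check: Callable[[str], bool]


def grade(story: str, rules: list[Rule]) -> dict:
    """Run every rule against the story; return per-rule results and a 0..1 score."""
    results = {rule.name: rule.check(story) for rule in rules}
    score = sum(results.values()) / len(rules) if rules else 0.0
    return {"results": results, "score": score}


# Hypothetical rules for illustration; an LLM judge would replace these lambdas.
rules = [
    Rule("has_happy_ending", lambda s: "happily ever after" in s.lower()),
    Rule("is_short", lambda s: len(s.split()) <= 50),
    Rule("mentions_robot", lambda s: "robot" in s.lower()),
]

story = "A robot learned to bake pancakes and lived happily ever after."
report = grade(story, rules)
print(report["results"])
print(report["score"])
```

The score is just the fraction of rules that passed, so it is easy to see which rule dragged a story down.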

quick musk
lusty holly
patent sun
quick musk
#

i do most of the work myself

royal lance
#

@patent sun Any updates on this matter? Thank you

quick musk
royal lance
patent sun
quick musk
lusty holly
royal lance
royal lance
silver glacier
#

How can I get access to Stax?

quick musk
#

it's US only (currently! expanding soon...)

silver glacier
quick musk
quick musk
#

also, i noticed something kinda funky here;
curling Gemini does use POST. in the literal code that it gives you

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash:generateContent" \
  -H 'Content-Type: application/json' \
  -H 'X-goog-api-key: GEMINI_API_KEY' \
  -X **POST** \
  -d '{
    "contents": [
      {
        "parts": [
          {
            "text": "Explain how AI works in a few words"
          }
        ]
      }
    ]
  }'

the bolded. POST. that seems... funky.

#

i'm not gonna question it, but i'm just more or so directly pointing it out for y'alls

ionic condor
#

I can't access stax in Nigeria

please, resolve this asap

subtle vector
#

Lovely new product, i'd love to try stax, we are using langfuse and it's quite confusing!

ionic condor
#

Alright
thanks

fading arrow
#

Yes, I tried and everything went well.

subtle vector
patent sun
subtle vector
#

🤞 can't wait to try it

patent sun
#

the POST error is fixed!

oblique pewter
snow flicker
#

Can we use stax in UK

lusty holly
#

one more try

patent sun
lusty holly
#

That I took these two seeds, that I generated this

#

I went to change tabs

#

@patent sun ?

patent sun
lusty holly
patent sun
lusty holly
lusty holly
patent sun
empty ocean
#

Hi I’m in Taiwan

#

Would love to try Stax. I’ve been using braintrust

quick musk
patent sun
quick musk
#

Crying laughing 😂
Perplexity can't even get it right

#

Gemma is a local beast, bro
That's gonna be the AI that's controlling my HA setup soon

small hollow
#

@gritty dome thanks, I'll check it out 👍

mint laurel
finite schooner
#

Heyy When global access?

#

I am from Taiwan

spiral musk
#

hey everyone

pine crown
#

hello to all

#

when will Stax be available in Europe?

spiral musk
#

i was wondering whether i can actually use that eval dataset (or part of it) sitting in Stax to automatically improve my original prompt with good/bad examples?

pine crown
#

seems a great product!

#

how can I access it from EU ?

quick musk
#

Us only

placid notch
#

I would like to try Stax, but it seems it is not available in Japan yet

pine crown
frail herald
#

For the automatic evaluator, can we change what model is doing the eval? Or is it a custom built model to test the output in a certain way ?

patent sun
#

Did the classic "What is bigger? 9.9 or

patent sun
carmine vale
#

Hey, can i use stax for AGI- abstract reasoning benchmarking ? (Like solving abstract problems) or is it for LLM for specific uses only ?

(I've never tried Stax, but I'm considering it)

patent sun
sand trail
#

South Africa google_labs

opal raven
#

Hi y'all! Is anyone exploring improving spatial reasoning for LLM-based evaluators? Super excited to test out Stax but I do wonder about the quality of LLM-as-a-judge for spatial correctness checks

drowsy cosmos
#

Hey! My free API key allows me to sign in momentarily and then I get the sign that my session has expired. Is there a certain amount of money I should be expecting to spend to mess around in the sandbox a little bit?

patent sun
drowsy cosmos
#

okay, I assumed it was me

#

glad to be helping identify stuff. Chat in 20 minutes

patent sun
drowsy cosmos
#

totally, it flashes pretty quick but I can do that

carmine vale
opal raven
#

if LMs aren't performant I'll be thinking abt a way to do this with rule-based checks 🤔

patent sun
#

yep images and videos! Either on the

patent sun
patent sun
main grotto
#

Hi Stax team! I wanted to try Stax, but unfortunately it is not yet allowed in my country. I just submitted the form and joined the Discord to get to know more about the tool 🙂 thx!

elder tapir
#

Not available in Argentina or Peru 😩 . When will I be able to try it? Thanks 🙂

ruby flare
#

Any way we could have native json importing?

jaunty pagoda
#

Very impressive and seems very user friendly. I look forward to learning more with Gemini.

patent sun
patent sun
ruby flare
patent sun
ruby flare
# patent sun If you want to send me a DM of a sample of your data I can help make sure it wor...

So I had initially grabbed a sample of the R1 reasoning dataset just to play around with it. Which was a bit of a process to move from their json to csv. I haven’t put a lot of time into it but the last checkpoint was the evaluations screen timing out trying to load the 10k examples.

My actual goal requires moving about 10ish long form reports I’ve made via a very defined system prompt that can produce 10k word reports into a usable csv. Then using Stax to improve my “deeper research” prompt itself using existing report creation planning and output as a dataset.

TLDR: stax doesn’t like the R1 reasoning dataset and I’m still in the process of coming up with the right way to make the dataset itself for the deeper research prompt refinement project.

I’ll still circle back on this once I’m back at my desk.

patent sun
ruby flare
# patent sun If you are willing to DM me a sample of this data, we can try it on our side to ...

How I missed the chat toggle is beyond me. Ha. That answers a lot of questions I had. Doing some digging this was the initial dataset I was trying to work with. Note: it was more a test to understand Stax. I have no real understanding if this is the right tool or workflow.

https://huggingface.co/datasets/a-m-team/AM-DeepSeek-R1-Distilled-1.4M/blob/29d2baec029f630403382b1db2e3bd2172315c73/am_0.9M_sample_1k.jsonl
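
For anyone facing the same JSON-to-CSV step, here is a minimal sketch using only the Python standard library. It is written under assumptions: the `prompt` and `response` keys are hypothetical (the real dataset's field names may differ), and the `input` / `expected_output` column names are taken from the sample CSV header quoted elsewhere in this channel. The `limit` argument is there so a small slice can be tried before loading thousands of rows.

```python
import csv
import io
import json
from typing import Optional


def jsonl_to_csv(jsonl_text: str, input_key: str, output_key: str,
                 limit: Optional[int] = None) -> str:
    """Convert JSON Lines text to CSV text with `input` and `expected_output` columns."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["input", "expected_output"])
    written = 0
    for line in jsonl_text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if limit is not None and written >= limit:
            break
        record = json.loads(line)
        # csv.writer quotes fields that contain commas, quotes, or newlines.
        writer.writerow([record[input_key], record[output_key]])
        written += 1
    return buffer.getvalue()


# Hypothetical records; replace the keys with the ones your dataset actually uses.
sample = "\n".join([
    json.dumps({"prompt": "What is 2 + 2?", "response": "4"}),
    json.dumps({"prompt": "Say hi, politely", "response": "Hello!\nNice to meet you."}),
    json.dumps({"prompt": "Third", "response": "row"}),
])

csv_text = jsonl_to_csv(sample, "prompt", "response", limit=2)
print(csv_text)
```

For a real file, reading it line by line with `open(path, encoding="utf-8")` instead of holding the whole text in memory would be the natural next step.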

ruby flare
patent sun
#

What I was initially confused on(which

dawn lily
#

hey i am non US

#

i would like to use this service

dull nexus
#

Hi Team, I am from India
Could you help me understand how can I get access to stax?

shell coral
#

Hello Team , I am from India

#

Could you help me understand how I can get access to Stax?

fringe bobcat
#

I would love to use this from Singapore! I've had to use a US VPN and another US Google Account for now

This is shaping up nicely, I'm using Promptmetheus as my prompt engineering IDE and there's really not much competition. It's good to see Google getting into this critical space for us AI developers.

patent sun
fringe bobcat
humble wave
#

Which stax region is available

#

?

#

?

patent sun
lusty holly
#

Hello my friends
anyone need help?

humble wave
#

Bro, I'm from Brazil using a VPN and it didn't work.

#

It worked with VPN

#

Thanks

patent sun
patent sun
lusty holly
lusty holly
#

I came back in the best shape

#

I built a complete flow in Opal with 15 agents, where they refine ASMR prompts that are already prepared. After that process, the result is a single prompt reviewed by 16 agents. Then comes the STAx stage, where I can further refine this result.

So, I created two TERTs and refined them for other models, generating a sequence of 200 prompts. From those two that I first redefined in Opal, I used the evolution process and produced more than 200 prompts based on those initial two I made in Opal.

lusty holly
median mural
#

Please open up to Australia

obsidian ravine
#

PLS OPEN UP FOR MALAYSIA

chilly wing
#

Please open for the Netherlands 🙂

vestal patrol
#

Please open up to Singapore🤗

storm lotus
#

How do I get access to Stax? On this page it says you can apply here

hearty dust
#

Australia please

lofty osprey
#

Please make it available in BRAZIL

brittle schooner
#

is there a possibility to make stax available in germany ?

oblique pewter
#

📣 Interested in gaining access to Stax outside of the U.S.?

Please reply in this thread with the country you'd like to be added! If you've previously shared, your response has been recorded.

storm lotus
patent sun
wheat rampart
#

Any idea when STAX will be available in Belgium?

shut phoenix
#

Or updates on UK release

red roost
#

how is stax different than other benchmarking/evals platforms out there?

rare egret
# red roost how is stax different than other benchmarking/evals platforms out there?

Thanks for your interest. Stax moves beyond generic benchmarks by letting you evaluate AI models using your own specific data and criteria. Its key feature is the ability to build custom "autoraters" (LLM-as-a-judge) to grade model outputs on your unique quality standards. Stax is free, although you have to bring your own API keys. We are constantly working to make this tool more useful and are open to hearing your thoughts for improvements.

mint depot
#

I'm trying to use a custom model that's hosted on Amazon Sagemaker (URL is like "https://xxxxx.execute-api.us-west-2.amazonaws.com/prod/models/test/invoke"). It uses the OpenAI response format. It works when I call it from my command line. However, in Stax it fails and I'm getting a "403 forbidden" error (see image). Is there a way to debug the request that Stax is making so that I can try to figure out what's wrong with the request?

patent sun
analog ruin
#

hello

urban cloud
#

hello everyone

unique cave
#

When will stax available in India?

urban cloud
rare egret
weary hazel
patent sun
#

<@&1410311843674849300> - exciting news!

Stax is now available in 10 new countries/territories:
🇧🇷 Brazil
🇨🇦 Canada
🇭🇰 Hong Kong
🇮🇳 India
🇯🇵 Japan
🇲🇽 Mexico
🇳🇿 New Zealand
🇵🇰 Pakistan
🇰🇷 South Korea
🇹🇼 Taiwan

We're working on getting access to more countries. So stay tuned!

terse hornet
#

Vietnam please

neat ferry
#

Hi! When would Stax be available in Singapore?

mossy dragon
queen monolith
#

Hi! I am a developer working on a LLM-based chatbot for my company. Can I ask how Stax ensures consistency across multiple runs of the LLM-as-judge evaluation? Based on our experience using other LLM-as-judge tools, even if the dataset and LLM is the same, the evaluation ratings might differ because of the LLM judge.
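
One generic way to put a number on that run-to-run variation is to score each example several times and look at how often the runs agree. The sketch below does not describe how Stax handles this; the ratings are invented, and `agreement_rate` and `majority_vote` are hypothetical helper names.

```python
from collections import Counter


def agreement_rate(ratings: list[int]) -> float:
    """Fraction of runs that match the most common rating for one example."""
    if not ratings:
        raise ValueError("need at least one rating")
    _, top_count = Counter(ratings).most_common(1)[0]
    return top_count / len(ratings)


def majority_vote(ratings: list[int]) -> int:
    """Most common rating; on a tie, the rating that was seen first wins."""
    if not ratings:
        raise ValueError("need at least one rating")
    return Counter(ratings).most_common(1)[0][0]


# Hypothetical data: the same judge scored each example in five separate runs.
runs_per_example = {
    "example_1": [5, 5, 5, 5, 5],
    "example_2": [4, 5, 4, 4, 3],
    "example_3": [1, 2, 1, 2, 1],
}

for name, ratings in runs_per_example.items():
    print(name, majority_vote(ratings), agreement_rate(ratings))
```

Examples with a low agreement rate are the ones worth inspecting by hand, and reporting the majority vote instead of a single run smooths out some of the judge's noise.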

raw hamlet
#

Is there any ETA on when Stax will be available in the UK?

mossy dragon
bronze rover
#

Demo looks cool but still waiting for availability in Germany.

  1. Is it possible to evaluate Vertex AI Search results? So instead of evaluating LLMs I wish to evaluate an app endpoint.
  2. Would I be able to collect feedback via Stax API from my app frontend? Currently Vertex AI Search has this feature only via the widget.
restive juniper
#

Hi,
I have been trying out Stax a bit but have a few questions:

  • Is it possible to change the thinking budget on models, especially flash 2.5?
  • The sample csv for import is locked. Can you share it with me?
  • Will it be possible to try more than 2 Prompts/ models at the same time?
  • Will there be a way to share projects with other people in the future?
  • Is this tool expected to be locked behind a subscription in the future or will there be a free tier?
mossy dragon
night notch
#

Is it possible to use Stax to test my custom agent that uses the Llama 3.1 8B Instant model via the Groq API?

mossy dragon
covert cloud
#

Hey, how can I share my projects with my team so that they can take a look?

mossy dragon
covert cloud
mossy dragon
raw hamlet
#

Hello there, I keep on getting the following error when I try to generate an output. Is anyone else experiencing this:
"inference_reason": "Error while sending request to Gemini API: Retry failed after 3 attempts" - Can someone help me troubleshoot?

mossy dragon
raw hamlet
#

Hey guys something really odd just happened, I was creating my own evaluator and after making the changes in the editor but before I could click on save ,all the content got replaced with content from another evaluator.

mossy dragon
covert cloud
#

Hey is there a maximum character limit for output column?

minor thunder
#

Hello guys, I think Stax is now available in South Korea! Does anyone know how to add documents like PDF files as user input data? Does Stax only support text-based data at the moment? 🧐

mossy dragon
mossy dragon
minor thunder
mossy dragon
opaque pivot
mossy dragon
spiral skiff
#

Hi! Really looking forward to have access to Stax in Malaysia

mossy dragon
wintry surge
#

I'm hoping to try and add an Anthropic model from Vertex. Anyone done that?

vague galleon
mossy dragon
plain hound
#

Would love to start using stax in BE 🇧🇪

mossy dragon
little gorge
#

amazing

hushed furnace
#

When will stax be available in Germany?

mossy dragon
little gorge
#

ok

#

what is that

solemn totem
#

we need stax in POLAND 🇵🇱

mossy dragon
oblique pewter
#

Regions Thread 🗺️

Thanks everyone for sharing where you're interested in seeing Stax expand to. To help keep conversations organized, and to help keep track of these requests, please reply to this thread with the countries/regions you'd like to see Stax become available in.

If you've already made a request in this channel, you do not need to also post in the thread. Thank you! labs_heart

mint panther
#

hello, there is still a limitation in Gemini

rigid sun
#

If wanting to add a Custom Model (which includes some further orchestration), and choosing the OpenAI Model Provider, does that call assume the OpenAI create "Chat Completions" api request and response body format? (in contrast to the new Responses api). If so, will it make the call with stream: true and accept server-sent events?

sullen crown
#

Hello,

I am new to stax. I am trying to upload a dataset via CSV file. When I do so, the system loads forever without showing a success or error message.

Even when I upload the sample csv provided in the documentation (pasted in next message for reference), the system loads forever in the same manner.

Any help is appreciated. Thank you

#

Sample csv from documentation:

input,system_instruction,output,expected_output,model_nickname,human_evaluation,tags,metadata
Write a poem about {{metadata.poemType}},Respond using iambic pentameter,"At dawn, the ducks glide smooth upon the pond, Their feathers glisten in the early light. They quack as if to greet the waking day, And splash with joy the calm of mirrored sky. Their life unfolds in graceful arcs and streams, A lively dance upon the water's stage. Each ripple speaks of nature's humble art, Bestowing peace upon the human heart.","Along the quiet pond, the ducks do glide, Their feathered coats as soft as morning dew. They quack a tune beneath the rising sun, While ripples dance in water’s gentle run.",Gemini 2.5 Flash,1,poetry,"{""poemType"":""ducks""}"
Write a poem about {{metadata.poemType}},Respond using iambic pentameter,"At dawn, the iron beast begins its run,
Its steadfast wheels embrace the morning sun.
Along the tracks it whispers ancient tales,
Through mountain mists and over verdant vales.
Each rhythmic clang recalls a bygone art,
A journey forged by both the pulse and heart.
The engines song resounds with strength and grace,
Uniting lands with steady, measured pace.
In heated nights or chill of autumn's rain,
The train endures, a bridge o'er time's domain.","The iron serpent glides along the track,
Its beating heart revives the steel and black.
It weaves through fields and towns at break of day,
A moving dream that never fades away. ",Gemini 2.5 Flash,-1,poetry,"{""poemType"":""trains""}"
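
A quick local check can at least rule out a malformed file before uploading. The sketch below uses only the Python standard library and does not explain the endless loading itself; `check_csv` and `EXPECTED_HEADER` are hypothetical names, and the short sample stands in for the documentation CSV above. The point it shows is that line breaks inside quoted fields are valid CSV, so the second poem row above is one record and not ten.

```python
import csv
import io
import json

EXPECTED_HEADER = ["input", "system_instruction", "output", "expected_output",
                   "model_nickname", "human_evaluation", "tags", "metadata"]


def check_csv(csv_text: str) -> list[dict]:
    """Parse CSV text, verify the header, and decode the JSON in `metadata`."""
    # newline="" leaves line breaks untouched so the csv module can handle
    # the ones that sit inside quoted fields.
    reader = csv.DictReader(io.StringIO(csv_text, newline=""))
    if reader.fieldnames != EXPECTED_HEADER:
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    rows = []
    for row in reader:
        # An empty or missing metadata cell becomes an empty dict.
        row["metadata"] = json.loads(row["metadata"]) if row["metadata"] else {}
        rows.append(row)
    return rows


# Shortened stand-in for the sample above, with a line break inside quoted fields.
sample = (
    "input,system_instruction,output,expected_output,"
    "model_nickname,human_evaluation,tags,metadata\n"
    'Write a poem about {{metadata.poemType}},Use rhyme,"Line one,\nline two",'
    '"Expected,\ntext",Gemini 2.5 Flash,1,poetry,"{""poemType"":""ducks""}"\n'
)

rows = check_csv(sample)
print(len(rows), rows[0]["metadata"]["poemType"])
```

If this parses cleanly and the row count matches what you expect, the file itself is probably not the problem.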

rigid sun
#

I've been unable to login to Stax this afternoon. When I try, it reports "Your Stax account is pending deletion." before kicking me out; however, I did not request account deletion. Is there an issue right now?

mossy dragon
mossy dragon
rigid sun
weary hazel
#

Where can I find the introduction to Stax? I wanna know what it's about and try it. Last time I was really confused when I tried it

mossy dragon
weary hazel
#

I see now. So it's like testing different models to find out which one has the best fit for my use case

#

and like creating our own benchmarks

#

the outputs kinda feel a bit unnatural but understand what the product is for

mossy dragon
weary hazel
mossy dragon
weary hazel
#

I'll take a look on how to do it

#

and provide feedback

mossy dragon
#

That's great, looking forward to hearing from you. Thanks.

weary hazel
#

i'll try it with a fine tuned gemini model

#

or maybe some open source model and call it from the api within stax

queen dagger
#

Is Stax available in India?

mossy dragon
spring cliff
#

#stax Can I please get approved to use Stax

mossy dragon
meager siren
#

Hello

mossy dragon
bronze rover
#

I would like to evaluate Vertex AI Search endpoint (discoveryengineapi) from Germany/Austria using Stax. Would this be possible?

mossy dragon
mossy dragon
muted bay
mossy dragon
bronze rover
# mossy dragon Yes , You can evaluate custom endpoints using Stax here https://stax.withgoogle...

Thanks for the response and hoping to see the service available in EU.
I have a feeling this won't work, as I won't be able to specify Vertex AI Search in place of models. The result will be that Stax won't be able to interpret the output.
Stax should have an option for me to provide a JSON example and learn the schema from it.

Model Provider: Select a provider (e.g., OpenAI, Anthropic) whose output format your model's responses match. This is required for compatibility.

mossy dragon
onyx cliff
#

Would love to use stax in Bangladesh. 🇧🇩

cobalt matrix
#

Hey team! Love the concept of this product. Can't use it in Australia however. Any chance this could be rectified? Alternatively is there going to be any issues using this with a VPN?

deep dragon
#

In the sample project, when I hit Generate Output,

I get Path not found: inference/projects/chat error. Is this known issue? (I googled, and searched in this Discord server for it but nothing found)

mossy dragon
# deep dragon

Hi, this is due to some internal error and will be cleared soon. Thank you for reporting the issue.

real zephyr
mossy dragon
dire canyon
#

@earnest smelt As stated in #start-here・📍; currently, our team can support engagement in English; the discussion in this server will largely remain in English for now.

long canyon
#

👋 are there any plans to support more integrated methods of uploading data?

I'd love to be able to programmatically upload CSV data, pull in from BigQuery, etc. Something more along the lines of Opik or DataDog online evals where they pull in all the data from the traces

mossy dragon
mossy dragon
unkempt void
#

when access from kazakhstan?

#

what is the point of gating access geographically?

#

especially for an eval service

jagged wadi
#

my fellow students and I would really like to have it here in Morocco

mossy dragon
mossy dragon
sleek garden
#

isn't Stax a bit costly if it is an AI-powered app builder? (aren't the apps going to be costly to run, I mean resource-wise)

honest cypress
#

why isn't Stax available in Germany/Europe, or some other locations? like what's the reason?

oblique pewter
# honest cypress why isnt stax available in germany/europe, or some other locations? like whats t...

hey Elias! each market has a variety of considerations (legal, for example) that we need to address before we're able to launch any Labs tools or experiments - especially experiments, which, are experimental in nature 🙂 but in every case the ambition of the teams working on these projects include shipping to every market we can as quickly as possible. we'll ping the roles here in the server when we have updates in regards to expansion. appreciate the feedback!! google_labs

honest cypress
true thunder
#

Hi Hi,

Stax in South Africa would be a great plus.

vague kindle
#

Hey team! Just discovering Stax, it looks promising and addresses one of our core issues, which is evaluating our results.
I run a finance workflow automation company and I am wondering how we could use Stax to quickly judge our results

#

I am based in Sweden, thought this is important since certain solutions are geo locked

mossy dragon
#

Hi @vague kindle, that's great to hear. You can use Stax to compare model results side by side, create your own datasets, upload CSV files (for larger data) and evaluate them. You can also create custom evaluators to suit your own criteria.
Could you tell us which areas or use cases you are focused on?
Thank you.

real arch
#

Were

tribal carbon
#

use a vpn

cerulean ruin
tepid barn
#

Hi

rigid frigate
stoic kraken
#

Hello, Stax in Bulgaria would be great 🍀

hardy verge
tribal carbon
#

oh

proven valley
#

I wish I had a Stax-like interface to evaluate HuggingFace-downloadable models.

frozen trellis
#

Hi guys, I stumbled upon Stax today, and it would be great to have it working in Italy and CH. labs_love

mossy dragon
lucid isle
mossy dragon
lucid isle
#

Like, for example, if I made an evaluator to check how it identifies and organizes priorities, since my work with Opal requires that. Opal is an n8n tool

mossy dragon
hoary vale
rain knot
#

Don't get me wrong, I totally see your point. My take is that while we can automate these steps, the real challenge is keeping it precise. General automation can easily lose its way without very sharp system prompts and output constraints to keep things on track. Doing it manually for now at least gives you that granular control over the logic

lucid isle
eternal thistle
#

When will Stax be available in Aus, and can a solid WireGuard VPN bypass this?

#

Very unethical to charge the same rate for a subscription for a lesser version of the product and features

#

Also. Why does Google not make these tools more innately intuitive, even their descriptions are so simple and useless.

Google Labs has tools that can make tutorial-type videos. Why not make them for each of the Google Labs tools, with specific use-case video variants?

My team pages ultra because many tools are simply not well explained and have outdated info

#

Is this by design? So people use the tools more frequently and for longer amounts of time to benefit training data?

cold swift
drowsy dune
#

Sweden please 🙂 as rest of the tools

surreal badge
#

hello guys

ivory carbon
#

Hi Google Stax team,

I’m currently conducting my Master’s thesis in Germany, focused on evaluating LLM responses and classifications on PKM datasets. I’d love to use Google Stax, but it appears the Labs platform access is currently restricted for German users.

Would it be possible to unlock/enable Stax for users in Germany? It would let me avoid a lot of manual preparation, coding, and evaluation setup by using capabilities Stax already provides, making a major difference for my thesis work.

Thanks a lot for considering this, and I’m happy to share more details about my research needs if helpful.

Best regards,

cedar zenith
#

Hi bro

ivory carbon
sturdy quartz
#

Will work in most cases ig

hushed python
hushed python
paper bluff
#

Can we integrate Stax with any monitoring tool like Dynatrace, which is configured to capture the prompt input, tool call response and final response [agentic AI], use that as input to Stax to run evaluations (in production), and generate metrics/scores such as faithfulness or correctness so we can gauge hallucination?

paper bluff
#

anyone?

faint nymph
#

Any chance to have this product in more countries(Lithuania)?

lapis willow
#

Any chance to have stax in Hungary?

cold hull
#

what is stax

polar hill
#

Hey 👋 new updates on Stax Labs here?

waxen onyx
#

Stax in South Africa please.🇿🇦

vernal cove
#

Will STAX be available in APAC soon?

celest fox
#

Hey, can we have Stax in Ukraine? 🇺🇦

outer turtle
#

Hey, bring stax to Kenya please