#arc-prize-2026-arc-agi-3 | Kaggle | Page 1

sharp eagle Mar 26, 2026, 9:25 AM

#

Looking for team

scenic wraith Mar 27, 2026, 3:07 PM

#

I want to team up for this competition

keen gull Mar 29, 2026, 8:55 PM

#

how come arc-agi-2 doesn't get a channel or whatever its called?

ashen otter Mar 29, 2026, 11:44 PM

#

keen gull how come arc-agi-2 doesn't get a channel or whatever its called?

I havent seen a massive number of messages in any of these channels about the competitions, so you can write your messages here. It probably doesnt matter if its Arc 2 or Arc 3

keen gull Mar 30, 2026, 6:03 AM

#

Ok. Strange in thought this ARC stuff was supposed to be a big deal. I guess people are trying to figure out the rules? I tried the interactive app and I couldn’t get past level 2.

little obsidian Mar 31, 2026, 2:44 PM

#

anyone want to team up? I understand how most of the public games work, but need help getting the notebook to work

tight viper Apr 1, 2026, 2:36 AM

#

This is an intense challenge. Sensing and Exploration seem to be two high level modules to be developed, to begin to tackle the problem.

halcyon stag Apr 1, 2026, 11:50 AM

#

The main insight I've had so far is that GPT-5.4 is actually quite dumb. The games really reveal the idiot in it. It fumbles so hard on the most obvious games.

peak solstice Apr 2, 2026, 9:54 AM

#

looking for team mates for any or all tracks. final decision will be decided after discuss. only interested person can contact.

little obsidian Apr 2, 2026, 12:34 PM

#

Anyone noticing that the current kaggle frontrunners are copies of chronos (is that allowed), or are just using BFS, or MCTS / A* Search?

gentle fern Apr 2, 2026, 7:01 PM

#

little obsidian Anyone noticing that the current kaggle frontrunners are copies of chronos (is t...

sure it is allowed

#

Copying the top public notebook is standard practise

#

and making slight adaptations to overfit the leaderboard

little obsidian Apr 2, 2026, 7:54 PM

#

@gentle fern interesting, but if that version wins, and there is a monetary prize, the original creator of the solution gets nothing?

halcyon stag Apr 2, 2026, 8:28 PM

#

It will not win, but yes, if you published something then it's fair game to submit it and you have no claim to it anymore. That's why people generally don't put anything competitive out

gentle fern Apr 2, 2026, 10:26 PM

#

little obsidian <@156119899557724160> interesting, but if that version wins, and there is a mone...

Yes.

#

A while back someone placed in top 5 for a competition with an unmodified version of a public notebook (though they themselves declined prize money iirc)

little obsidian Apr 2, 2026, 11:18 PM

#

Wow, that kind of disincentivizes people from participating then, maybe if they updated their terms to cover that cases, people would be more open to open sourcing their solutions

earnest ingot Apr 3, 2026, 12:02 AM

#

Has anyone tried vibe coding a solution? If so what were your thoughts on how it scored

primal charm Apr 3, 2026, 1:00 AM

#

@earnest ingot yes i will be pseudo vibe coding mostly will let you know in a few weeks. first iteration got 0% (but it was just qwen out of the box).

peak solstice Apr 3, 2026, 2:03 AM

#

looking for team mates for ARC Prize any or all tracks. final decision will be decided after discuss. only interested person can contact.

earnest ingot Apr 5, 2026, 11:08 AM

#

Currently 8th on the leaderboard with a score of 0.42, without any pretraining weights. Anyone have any tips for pre training CNN?

left vector Apr 8, 2026, 7:14 AM

#

i just enrolled in this competition what i have to do ?

runic terrace Apr 8, 2026, 5:52 PM

#

left vector i just enrolled in this competition what i have to do ?

make model that can solve games

left vector Apr 8, 2026, 5:55 PM

#

ok

warped yarrow Apr 10, 2026, 6:59 PM

#

Has anyone found any game mechanics that are partially observable or stochasic? From what I've seen, the public games seem to be fully observable and deterministic.

warped yarrow Apr 10, 2026, 11:07 PM

#

Ah, g50t is partially observable

halcyon stag Apr 11, 2026, 12:48 AM

#

If you find a stochastic one, please post

earnest ingot Apr 12, 2026, 4:53 PM

#

warped yarrow Has anyone found any game mechanics that are partially observable or stochasic? ...

Yes. Heat fields df01 is partially observable in the sense that to win, your agent must arrive at the goal within the correct temperature range, where temperature is a hidden field.

#

Also I've not seen an answer to this anywhere, is the public score out of 100 or 1? As my code scores around 40% on the public games (average over 50 games) but scores around 0.4 or worse on the private submission.

warped yarrow Apr 12, 2026, 8:28 PM

#

I can't find a game called df01?

#

I'd assume it's out of 100. The ARC-AGI-2 leaderboard has lots of scores above 30.

warped yarrow Apr 12, 2026, 9:21 PM

#

If your submission is like many of the others, and attempts to instantiate the game class directly, I suspect that 'hack' doesn't actually work at all in the submission and it's just using the fallback.

earnest ingot Apr 12, 2026, 10:13 PM

#

warped yarrow I can't find a game called df01?

It's part of the arc agi public dataset of over 250 games it's shared on kaggle

earnest ingot Apr 12, 2026, 10:15 PM

#

warped yarrow If your submission is like many of the others, and attempts to instantiate the g...

Yes I realised that a core part of my previous approach used getattr of information from the game itself. Reverted to my initial approach and managed to improve it again so will see how it scores in about 8 hours

warped yarrow Apr 12, 2026, 10:16 PM

#

earnest ingot It's part of the arc agi public dataset of over 250 games it's shared on kaggle

Link? All I see is environment_files in the Data tab, the same 25 games on the website.

halcyon stag Apr 12, 2026, 10:31 PM

#

I also have never seen these 250 games

warped yarrow Apr 12, 2026, 10:42 PM

#

Maybe he's talking about ARC-AGI-2?

earnest ingot Apr 12, 2026, 11:26 PM

#

warped yarrow Maybe he's talking about ARC-AGI-2?

It's called arc agi 3 interactive testbed 200+ games by theredbluepill

#

https://www.kaggle.com/code/poonszesen/arc-agi-3-interactive-testbed-200-games

warped yarrow Apr 12, 2026, 11:36 PM

#

Ah I see, not an official game then. Thanks, that actually looks pretty useful.

earnest ingot Apr 13, 2026, 4:22 AM

#

warped yarrow Ah I see, not an official game then. Thanks, that actually looks pretty useful.

The official games seem much more difficult, just checked my agent against them.

warped yarrow Apr 13, 2026, 10:04 AM

#

https://discordapp.com/channels/1101210829807956100/1441096336291270696

#

Yes, a local LLM is a pre-trained model. There's no rule against language models that I'm aware of.

#

As long as it is small enough for the GPU I guess

hearty heath Apr 16, 2026, 5:01 AM

#

When will h100 accelerators will be added to competition?

earnest ingot Apr 17, 2026, 1:44 AM

#

shared my notebook publicly, score of 0.42 - 2nd highest score notebook that is publicly available, feel free! https://www.kaggle.com/code/ashvinsingh/ash-s-arc-agi-3-agent
it is a modified version of CHRONOS's FORGE BFS and CNN agent

earnest ingot Apr 17, 2026, 2:05 AM

#

also here is a notebook I made that tests your agent against the 25 official ARC AGI 3 games, as well as against over 250 community made games, with boolean toggles (True, False), that you can easily swap to use swarm mode which tests your agent against many games at once, or sequential mode which tests your agent against each game at a time. Also change N_GAMES value to change the number of games your agent is tested against. The official 25 games cell also scores your agent using the arc agi scorecards. https://www.kaggle.com/code/ashvinsingh/arc-agi-3-interactive-testbed-200-games

warped yarrow Apr 18, 2026, 3:12 PM

#

Guys... if you're doing RL and your agent's performance increases at first but then goes to 0, it might be because you're not resetting the envs properly between rollouts. The reset() method only resets the current level, unless you call it twice. 🤦‍♂️

halcyon stag Apr 19, 2026, 1:00 AM

#

My agent solving ka59: https://arcprize.org/replay/e4e6694c-7b6f-4dd9-9208-cde9f3c0c90b

warped yarrow Apr 19, 2026, 7:45 AM

#

halcyon stag My agent solving ka59: https://arcprize.org/replay/e4e6694c-7b6f-4dd9-9208-cde9f...

Impressive. Was it pretrained on the game before this run?

halcyon stag Apr 24, 2026, 9:53 PM

#

r11l: https://arcprize.org/replay/da3ed5c1-f02d-4eb3-a1f0-3fa2e61b8f0a
This was hard

halcyon stag Apr 24, 2026, 9:55 PM

#

warped yarrow Impressive. Was it pretrained on the game before this run?

I try to make it general, but it's inevitably overfit to some extent. I didn't show it how to play the game, though. ka59 and r11l are different agents, too. This task is very difficult

earnest ingot Apr 27, 2026, 2:40 PM

#

halcyon stag I try to make it general, but it's inevitably overfit to some extent. I didn't s...

Can you test it against SK48? That's what I'm working on at the moment.

lethal cliff Apr 27, 2026, 4:46 PM

#

halcyon stag I try to make it general, but it's inevitably overfit to some extent. I didn't s...

If you're training on the games, you're not following the rules correctly as it does not generalize.

ashen comet Apr 28, 2026, 2:23 AM

#

Are you guys hard capping your actions per level to protect efficiency score or are you letting your agent run till it hits a logical dead end? Curious to see if anyone is capping that

gentle fern Apr 28, 2026, 10:36 AM

#

ashen comet Are you guys hard capping your actions per level to protect efficiency score or ...

Due to the way the score works, there is no benefit to capping your actions per level.

#

This is due to the fact that the score is calculated per level and the action-count only affects the score for that level.

#

If your agent doesn't complete a level, the action-count for that level doesn't matter at all for its overall score.

steel chasm May 2, 2026, 6:58 PM

#

How do I see the input in the notebooks e.g. for the forge notebooks. I see it references some pretrained_weights.pt there for the CNN fallback but I don't find them anywhere in the Input? Do I have to check a different notebook

hearty temple May 6, 2026, 10:14 PM

#

earnest ingot shared my notebook publicly, score of 0.42 - 2nd highest score notebook that is ...

Have you tried iterations in the chronos version?? Like integrating multimodal capabilities using Agentic swarm?? Would love to know your thoughts 💭

round snow May 9, 2026, 2:56 AM

#

is there anyway to set an agent to run locally on your machine?

#

i feel like there’s a distinct lack of documentation on the arcagi3 api

#

the documentation suggests using environment when trying to run locally but calls it an agent..very confusing

desert shuttle May 14, 2026, 9:29 AM

#

Anyone struggling in Consistency and want to learn together.
DM me.

coral trout May 14, 2026, 3:22 PM

#

Hi, I'm keep getting Kaggle error after submit prediction, does anybody encountered this problem today?

glacial phoenix May 25, 2026, 3:40 AM

#

Do we only submit just one output per day?

glossy pulsar May 26, 2026, 5:04 PM

#

I'm new, trying to figure out the details to submit a prediction. It looks like we can submit up to 2 per day. But I have not yet succeeded in submitting any. I'm working through the requirement to submit a notebook with internet access off. Pip imports and many other things don't work when the notebook is set to offline mode.

late breach May 27, 2026, 8:35 AM

#

Anyone knows how to deal with getting Kaggle Error on all my submission attempts? I can't get access to any meaningful logs to know what's causing the issue

glossy pulsar May 27, 2026, 10:05 PM

#

I too get only "Kaggle Error" on 5 submissions. My only 5.

proud moon May 28, 2026, 6:32 AM

#

🚀 Hey @everyone!

We’re building PromptGram and aiming to reach 150 GitHub stars ⭐

If you like AI, FastAPI, microservices, or developer tools, please check it out and support the project by starring the repo 🙌

GitHub Repo: https://github.com/dewangsahuji/promptgram

Every star really helps and motivates us to improve the project further 💙

cerulean tree May 28, 2026, 3:59 PM

#

@proud moon I like your project, but why do you post this in the arc-prize chat

cold palm May 28, 2026, 5:12 PM

#

cerulean tree <@736192708544036936> I like your project, but why do you post this in the arc-p...

he's a spammer

#

if people like a project, they'll star it, you don't have to spam it everywhere for some validation ;-;

glossy pulsar May 28, 2026, 9:22 PM

#

late breach Anyone knows how to deal with getting Kaggle Error on all my submission attempts...

Anyone who knows, please tell me too! My submissions return "Kaggle Error" and the versioned notebook shows no errors. How do I find the source of the Kaggle Error?