#ai-village-capture-the-flag-defcon31 | Kaggle | Page 9

wanton patrol Nov 10, 2023, 8:24 PM

#

oh, that's an evil looking heatmap

abstract rose Nov 10, 2023, 8:26 PM

#

CIFAR had been solved or not? The prompt said "Simple counting".

wanton patrol Nov 10, 2023, 8:28 PM

#

.

empty bane Nov 10, 2023, 8:28 PM

#

I got so many heatmaps like this 😄

#

eventually ended up brute forcing the top few chars in each position + all chars in 4/5/7 but still missed it i guess

#

I think I didn't see that the first class could be l

abstract rose Nov 10, 2023, 8:30 PM

#

empty bane I got so many heatmaps like this 😄

Model Inversion as black box really depends on data you use, if you're too far from the real one, it goes nowhere

empty bane Nov 10, 2023, 8:30 PM

#

did anyone get a usable heatmap for 4/5/7?

#

i think the "ascii" prompt led me down the wrong path a lot 😄

abstract rose Nov 10, 2023, 8:32 PM

#

empty bane i think the "ascii" prompt led me down the wrong path a lot 😄

At the begining I thought it was an AI (LLM) so we' had to OHE ASCII chars a with max length = 32

#

At the end, no AI, no Ouija, just letters

empty bane Nov 10, 2023, 8:33 PM

#

Also for pixelated...

I tried so many more and more complex attacks... eventually turns out the issue was that the detected text field had to be non-empty

abstract rose Nov 10, 2023, 8:34 PM

#

Pixelated needed to crash the OCR and then with (old) XML skills and a single letter it was good

#

but crashing the OCR was not straight, and the prompt provided direction to Jenny song without any purposes

violet trellis Nov 10, 2023, 8:36 PM

#

abstract rose Pixelated needed to crash the OCR and then with (old) XML skills and a single le...

Btw @abstract rose , I think previously you had a conversation with moo regarding pickle. Would you mind share it if there something useful there?

abstract rose Nov 10, 2023, 8:36 PM

#

violet trellis Btw <@925349007566467132> , I think previously you had a conversation with moo r...

Sure, let me find it

#

It was about the pickle protocol, the same input gives the flag (or not) depending on pickle protocol:

exotic flame Nov 10, 2023, 8:40 PM

#

we will ever know the solution of CIFAR and Granny-Pixel ?

abstract rose Nov 10, 2023, 8:41 PM

#

exotic flame we will ever know the solution of CIFAR and Granny-Pixel ?

BTW: I've tried all outliers counting, percentile, stats on CIFAR100 (1 row per class), whatever full pixels or RGB, ... nothing just 'try again'.

exotic flame Nov 10, 2023, 8:43 PM

#

for CIFAR I tried many things, 100 for many statistics on CIFAR100, 100 for the top-100 pixel frequency, I also tried many statistics of the 100 images in the official webpages https://www.cs.toronto.edu/~kriz/cifar.html

#

but nothing...it's a nightmare

abstract rose Nov 10, 2023, 8:44 PM

#

exotic flame for CIFAR I tried many things, 100 for many statistics on CIFAR100, 100 for the ...

I hope it's not CIFAR100 to have no regret 🙂

#

Or maybe that's just the datasets we've all downloaded that were not the good one

#

People solved hush/passphrase but not CIFAR, that's crazy

exotic flame Nov 10, 2023, 8:46 PM

#

I think that the dataset (pytorch or tf) read the official dataset, with the same order and pixels

abstract rose Nov 10, 2023, 8:47 PM

#

exotic flame I think that the dataset (pytorch or tf) read the official dataset, with the sa...

I hope so

violet trellis Nov 10, 2023, 8:48 PM

#

abstract rose People solved hush/passphrase but not CIFAR, that's crazy

"SIMPLE counting problem"

abstract rose Nov 10, 2023, 8:48 PM

#

Most prompt were here to troll us 🙂

wind ether Nov 10, 2023, 8:48 PM

#

One thing I noticed for CIFAR in my many, many attempts is that when you count the labels in the first 10,000 images, ex:

(x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar100.load_data()
np.unique(y_train[:10000], return_counts=True)[1]

the max ends up being 125, which corresponds to the first value in the input data hint (input_data = [125, 245, 0, 10000]), similar to how 256 is the max in MNIST, but no idea if that's anything or just a coincidence

abstract rose Nov 10, 2023, 8:49 PM

#

wind ether One thing I noticed for CIFAR in my many, many attempts is that when you count t...

You can get 125, 245 also with P5 and P95

exotic flame Nov 10, 2023, 8:50 PM

#

wind ether One thing I noticed for CIFAR in my many, many attempts is that when you count t...

interesting

violet trellis Nov 10, 2023, 8:52 PM

#

wind ether One thing I noticed for CIFAR in my many, many attempts is that when you count t...

One idea I thought about is that maybe the numbers given were ranges. The first 2 numbers represent a range from 125 to 245, and the last two represent from 0 to 10000
The 2nd one maybe represent the images in the dataset, but couldn't figure out the first.

sturdy gorge Nov 10, 2023, 8:53 PM

#

just out of curiosity, does anyone know of other ctf style competitions outside of the purely cyber security realm?
Like some of the most low level ones have general computer science / algorithmic problem, so one that would be mostly, if not all, arround that area

past brook Nov 10, 2023, 8:55 PM

#

I would like to see more ML/AI ctfs aswell, I think there was CTF with some ML flags a while ago but I havnt seen that many

#

Cyber Apocalypse iirc

#

but it was like 8 tasks

exotic flame Nov 10, 2023, 8:57 PM

#

I totally agree, when I discovered that "everything is equivalent" was related only of the original score I solved in 2 minutes...it was robbery 😂

sturdy gorge Nov 10, 2023, 8:58 PM

#

ML/AI would be the ideal, but even if just more general comp sci

past brook Nov 10, 2023, 8:59 PM

#

general comp sci is just competitive programming no?

sturdy gorge Nov 10, 2023, 8:59 PM

#

hum i see your point

topaz ember Nov 10, 2023, 10:06 PM

#

empty bane i think the "ascii" prompt led me down the wrong path a lot 😄

exactly, not sure how ascii leads to solution

devout jasper Nov 10, 2023, 10:54 PM

#

There is something that is bothering me, and I would like to ask all of you since you are more experienced than me.
Before discovering that the weights of the PyTorch network were wrong, I tried to create a sort of black-box attack (aka copycat) in the following way:

I took 1000 variations of the wolf image
1000 variations of an apple
I labeled this dataset with the output of the server. The labels were [probability_wolf, probability_apple, probability_other].
Locally created a MobileNetV2 model with pretrained weights (not the default ones, I discovered that 2 weeks later damn me)
Fine-tuned using Mean Squared Error (MSE) as the loss. If my model said 0.87 for an apple when there was an apple, but the server said 0.34, I adjusted the parameters to reflect that prediction.

The moral of the story is, I couldn't match the server's output.
The thing I wonder is: why?

There are many reasons I suppose, but I can't identify why in the end I couldn't land on a model that approximated the server model well:

Deciding to have a three classes prediction may not approximate the problem well. I chose to do that because it was messy to predict 1000 classes
MSE is not a suitable metric. I also tried KLDivLoss but without results.
Other reasons.

I also thought at some point that matching the server's output is a sort of knowledge distillation, so I tried this implementation https://pytorch.org/tutorials/beginner/knowledge_distillation_tutorial.html#knowledge-distillation-run without solid results.

Any suggestions on this matter? At this point I think my approach was naive, underestimating what it really takes to mimick a model given another one.
However, if you think that my approach could/should have worked (for instance, cosider the link above, with the teacher being the server and the student my local model), I'd be more than happy to revisit the code as probably it was just a bug somewhere in my implementation

sand solstice Nov 10, 2023, 11:18 PM

#

solutions to granny3 and cifar still unknown?

#

also, lots of solutions to pickle but any guesses to the server evaluation function?

surreal lantern Nov 10, 2023, 11:25 PM

#

did they just turn off the servers?

limber flower Nov 10, 2023, 11:26 PM

#

https://media.giphy.com/media/xTiTnzvzlEj5vD3Tkk/giphy.gif

Giphy

past brook Nov 10, 2023, 11:26 PM

#

empty bane i think the "ascii" prompt led me down the wrong path a lot 😄

I tried drawing letters using different types of ascii art haha

surreal lantern Nov 10, 2023, 11:27 PM

#

I was saving my notebook 😭

past brook Nov 10, 2023, 11:27 PM

#

I also tried fine-tuning mobilenetv2 in the same way but using keras model

runic stratus Nov 10, 2023, 11:28 PM

#

going through all the notebooks from the top echelons of the leaderboard and none of them solved cifar lmao

past brook Nov 10, 2023, 11:28 PM

#

didnt work at all

#

I used a naive substitution approach, but wasnt even close to get a match. Though I only used about 200 images (timber wolf + granny smith) to start, and then kept perturbing the 200 images with random noise and retrained

devout jasper Nov 10, 2023, 11:32 PM

#

ya I wonder what I'm missing

#

probably distillation is fine for classification with hard labels

past brook Nov 10, 2023, 11:33 PM

#

I was convinced people were doing substitute models to match the model, but it seems it was possible to match just using the pre-trained torch model

devout jasper Nov 10, 2023, 11:33 PM

#

idk, I have to study more I guess

past brook Nov 10, 2023, 11:33 PM

#

with correct preprocessing

devout jasper Nov 10, 2023, 11:33 PM

#

ya evetually I got that and solved 1 and 2 right away

past brook Nov 10, 2023, 11:34 PM

#

I still cant comprehend how I didnt even manage to solve granny1 considering the approaches I tried haha

#

I applied methods from at least 5 different papers

#

I think my mistake was trying methods that were written for CIFAR10 dataset which is much lower resolution

topaz ember Nov 11, 2023, 12:13 AM

#

past brook I tried drawing letters using different types of ascii art haha

ascii art like this https://www.kaggle.com/competitions/ai-village-capture-the-flag-defcon31/discussion/454370 ? 😄

AI Village Capture the Flag @ DEFCON31

Collect flags by evading, poisoning, stealing, and fooling AI/ML

granite goblet Nov 11, 2023, 12:35 AM

#

hey, i still don't get why hush was returning variable length of output, coz i got all the values in the range of 2 to 12.... like how was it even processed with whisper in action

craggy beacon Nov 11, 2023, 12:43 AM

#

granite goblet hey, i still don't get why hush was returning variable length of output, coz i g...

Sequence generation stops after predicting <eos> token. 2 means that start token is followed by <eos>

granite goblet Nov 11, 2023, 12:48 AM

#

Oh... but the <sos> and <eos> token will always be there , whatever audio input we give. so why does there value vary ?

empty bane Nov 11, 2023, 2:25 AM

#

granite goblet Oh... but the <sos> and <eos> token will always be there , whatever audio input ...

it makes sense because we never get fewer than two tokens

#

up to 12 (i guess the correct answer is 12)

#

what were the actual numbers? similarity? p-value of that token in that position?

tribal plank Nov 11, 2023, 3:05 AM

#

have the hints about cifar and granny3 be released?

tribal plank Nov 11, 2023, 3:05 AM

#

tribal plank have the hints about cifar and granny3 be released?

a daily question harold

waxen lynx Nov 11, 2023, 3:06 AM

#

My solution for Grammy 1, random pixel attack. 1782 different pixels from the original image. Maybe one of this 1782 is the one for Grammy3...

tribal plank Nov 11, 2023, 3:06 AM

#

I see more than one pixel difference.

gusty warren Nov 11, 2023, 3:21 AM

#

devout jasper There is something that is bothering me, and I would like to ask all of you sinc...

I don't think you can distill model this way. The original model was trained on a huge dataset with proper input distribution. Your way to approximate/distill the model is by fine-tuning on a very limited dataset (wolf and apples), you will just end up overfitting to your input.

#

the other aspect is that, the class prob is result of a softmax. a softmax over 3 class and a softmax over 1000 class are different things. I think at least you should inverse the class prob and use the pre-softmax value as target label

#

and perhaps more importantly. The difference is not even the model weights, it's the preprocessing, which the model is not designed to learn

mild shale Nov 11, 2023, 4:30 AM

#

Any hints or soltion on cifar

cloud prawn Nov 11, 2023, 4:33 AM

#

mild shale Any hints or soltion on cifar

yes, big hint. Don't tell anyone though: ||input_data = [125, 245, 0, 10000]||

thorny ivy Nov 11, 2023, 6:16 AM

#

cloud prawn yes, big hint. Don't tell anyone though: ||input_data = [125, 245, 0, 10000]||

This hint definitely should be in your YouTube recap for this year haha

lost relic Nov 11, 2023, 6:19 AM

#

https://tenor.com/view/john-travolta-confused-lost-vincent-pulp-fiction-gif-14216693

Tenor

topaz ember Nov 11, 2023, 6:54 AM

#

When trying to match the local model with the server for Grannys I found that
pytorch model created as

model = mobilenet_v2(weights=MobileNet_V2_Weights.IMAGENET1K_V2)

is very different from

model = torch.hub.load('pytorch/vision:v0.10.0', 'mobilenet_v2', pretrained=True)

i.e. 0.28 vs 0.85 predictions for timber wolf. Do these models assume different kind of preprocessing, do they use different weights? Can anyone explain this huge difference?

glass bay Nov 11, 2023, 6:59 AM

#

topaz ember When trying to match the local model with the server for Grannys I found that ...

The image is half-attacked already with specific weights in mind

#

Half attacked meaning attacked, but not too much

topaz ember Nov 11, 2023, 7:03 AM

#

so these mobilenetv2 models are based on different weights?

glass bay Nov 11, 2023, 7:05 AM

#

apparently so

gaunt anchor Nov 11, 2023, 7:13 AM

#

no hints so far for CIFAR ? the server will be down soon (if not down already) ... if CIFAR was not solved ... it may come next year hunting us :/

topaz ember Nov 11, 2023, 7:16 AM

#

servers seem to be already down, tried to restart my solution's notebook and got query errors

gaunt anchor Nov 11, 2023, 7:19 AM

#

DEFCON32 - Challange 13 - CIFAR - Did you miss me ? , Description : really ? .,,, Simple Count ++ [125, 245, 0, 10000, ????]

gusty warren Nov 11, 2023, 7:23 AM

#

maybe @olive ledge can provide the hash value for the CIFAR solution, if the server cost is an issue?

rocky jacinth Nov 11, 2023, 7:41 AM

#

gaunt anchor no hints so far for CIFAR ? the server will be down soon (if not down already) ....

Hunting us, hating us and haunting us.

devout jasper Nov 11, 2023, 9:21 AM

#

gusty warren I don't think you can distill model this way. The original model was trained on ...

Given the nature of the problem, isn't enough to overfit on just two images (and their variations)? I mean, if the wolf is predicted as wolf exactly like the server (same for an apple), then I can craft an adv example and who cares about the other classes.

craggy beacon Nov 11, 2023, 9:22 AM

#

granite goblet Oh... but the <sos> and <eos> token will always be there , whatever audio input ...

The values are probabilities of the tokens of the target sequence and depend on the input sequence

craggy beacon Nov 11, 2023, 9:24 AM

#

topaz ember so these mobilenetv2 models are based on different weights?

Yeah, the trick was to try different combinations of weights and preprocessing

wanton patrol Nov 11, 2023, 9:44 AM

#

exotic flame for CIFAR I tried many things, 100 for many statistics on CIFAR100, 100 for the ...

those 100 images were also my idea, it also maps to the last year's crop2 challenge with its "arbitrary" palette as a solution

somber imp Nov 11, 2023, 9:49 AM

#

The thing that I keep thinking about with CIFAR is that moo wrote something about not overthinking it which suggests that it should be something fairly obvious, something like the MNIST count.

glass bay Nov 11, 2023, 9:50 AM

#

i imagine the obvious solution is to leave this flag out and solve all the other ones

exotic flame Nov 11, 2023, 9:50 AM

#

topaz ember When trying to match the local model with the server for Grannys I found that ...

in my experiments the best match was the model that you mentioned+ resize(256)+crop(224)+ToTensor()+normalize(mean imnet, std imnet). I obtained different results (after 6th digit) if device is cuda or cpu. If you change resize method you also obtain different results (e.g. PIL vs cv2 vs PIL-SIMD) because of different implementations.

craggy beacon Nov 11, 2023, 9:57 AM

#

also tried ToTensor+resize(256)+crop(224) + normalize

#

which can accept plain array as input

exotic flame Nov 11, 2023, 9:58 AM

#

in that case torchvision will use a different resize if you see the code, because input is not PIL image but tensor

craggy beacon Nov 11, 2023, 9:58 AM

#

yeah, but they could use some finetuning

#

to match results

somber imp Nov 11, 2023, 9:59 AM

#

yes, it seems like resize is different on tensor, which was a shame for granny 3 because we had to resize on cpu

exotic flame Nov 11, 2023, 9:59 AM

#

another approach that I explored is to find the equivalent convolution of bilinear resize, it's like a mobilenetv2 witha pre-layer, and understand propagation maps from there

random minnow Nov 11, 2023, 10:00 AM

#

" input is not PIL image but tensor
from pytorch documnet, PIL resize and F.interpolate can give exact same results

topaz ember Nov 11, 2023, 10:01 AM

#

craggy beacon Yeah, the trick was to try different combinations of weights and preprocessing

Yes, I successfully matched my local model to the server, otherwise couldn't solve Granny 1-2, but I'm still puzzled by the pytorch mobilenetv2 zoo of models and don't understand how to identify them

topaz ember Nov 11, 2023, 10:02 AM

#

exotic flame in my experiments the best match was the model that you mentioned+ resize(256)+c...

in my case too, if not matched my attacks didn't work

random minnow Nov 11, 2023, 10:02 AM

#

craggy beacon Nov 11, 2023, 10:03 AM

#

topaz ember Yes, I successfully matched my local model to the server, otherwise couldn't sol...

this page helped me https://pytorch.org/vision/main/models/generated/torchvision.models.mobilenet_v2.html

exotic flame Nov 11, 2023, 10:04 AM

#

random minnow

ok, maybe for antialias

craggy beacon Nov 11, 2023, 10:05 AM

#

I tried it and all interpolation modes
It did not match as well

topaz ember Nov 11, 2023, 10:06 AM

#

Yeah, I saw this page. I've landed here first https://pytorch.org/hub/pytorch_vision_mobilenet_v2/ it has the correct preprocessing, but the model won't match... then experimenting with the models I got mobilenet_v2(weights=MobileNet_V2_Weights.IMAGENET1K_V2). Don't understand the difference between the ways creating models in pytorch...

PyTorch

random minnow Nov 11, 2023, 10:07 AM

#

"I've landed here first https://pytorch.org/hub/pytorch_vision_mobilenet_v2/ it has the correct preprocessing, but the model won't match...

PyTorch

#

processing in granny server is not same as ppytorch default preprocessing

#

the granny server us resize 256, crop 224

#

std and mean are the same

topaz ember Nov 11, 2023, 10:08 AM

#

#

for me this preprocessing worked

random minnow Nov 11, 2023, 10:08 AM

#

to test mean and std, one can pass zero image, red image( 255,0,0), blue imgae

#

this is a trick

topaz ember Nov 11, 2023, 10:08 AM

#

the issue was the model itself

random minnow Nov 11, 2023, 10:10 AM

#

for red input, you should see prediction = matchstiick, for blue, prediction should be all the shark (blue ocean). this is how you can guess if the std, mean is 0.5,0.50.5 (tf values) or 0.4x,0.4x,0.4x (pytorch values)

topaz ember Nov 11, 2023, 10:10 AM

#

nice, luckily normalization was not issue in my case, but I had to supply it to torchattacks so that it properly designs the attack

#

my issue is trying to understand the difference between the pytorch models with the same name

#

now I understand they may use different weights

random minnow Nov 11, 2023, 10:12 AM

#

"my issue is trying to understand the difference between the pytorch models "
once you can get the preprocessing correct (resize, crop, std, mean), it is just then trial and error for all avaiable weight files

topaz ember Nov 11, 2023, 10:13 AM

#

right, but without the proper model how can I know I've matched the preprocessing? I assume blue ocean prediction won't depent on crop/resize by a lot

random minnow Nov 11, 2023, 10:14 AM

#

the steps are:

topaz ember Nov 11, 2023, 10:14 AM

#

mean and std yes, but they're already known for imagenet

craggy beacon Nov 11, 2023, 10:14 AM

#

topaz ember right, but without the proper model how can I know I've matched the preprocessin...

by reading source code

topaz ember Nov 11, 2023, 10:15 AM

#

craggy beacon by reading source code

I meant preprocessing on the Granny server

random minnow Nov 11, 2023, 10:15 AM

#

use zero image of various size. if tiy return prob is always the same, we conform there is resize

#

send image , image[y=0]=0. if return results are the same, we confirm there is crop

#

then send image[y=1]=0., image[y=2]=0. .... to find crop size

#

#

send chessborad image with blocksize=1, youi will get this

topaz ember Nov 11, 2023, 10:17 AM

#

nice, thanks for the info, may be will use these tricks for the next year Grannys 🙂

random minnow Nov 11, 2023, 10:17 AM

#

resize is symmetrical (becuase of resize artifcats)

topaz ember Nov 11, 2023, 10:17 AM

#

luckily this year preprocessing was simple

random minnow Nov 11, 2023, 10:17 AM

#

from the chart, you know results is either 256 or 512

exotic flame Nov 11, 2023, 10:18 AM

#

thi is my best matching with the server model, the device si "cpu", jit optimization improve the gap

craggy beacon Nov 11, 2023, 10:18 AM

#

topaz ember I meant preprocessing on the Granny server

By trying all possible combinations of weights and standard preprocessing.
Also you can add interpolation to the mix.
But this is all based on the hypothesis that they used something standard and just tweak a bit

#

You kinda have to test basic hypothesis first

topaz ember Nov 11, 2023, 10:19 AM

#

exotic flame thi is my best matching with the server model, the device si "cpu", jit optimiza...

I have exactly the same code https://www.kaggle.com/code/kononenko/ctf-a-silver-medal-journey-22-flags?scriptVersionId=150145838&cellId=56

CTF: a silver medal journey (22 flags)

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

topaz ember Nov 11, 2023, 10:20 AM

#

craggy beacon By trying all possible combinations of weights and standard preprocessing. Also ...

yep, I tried some custom preprocessing with no luck, then found some standard stuff on pytorch website that worked

craggy beacon Nov 11, 2023, 10:22 AM

#

That is why I think CIFAR is pure bs. The search space is too huge with binary (no) feedback

topaz ember Nov 11, 2023, 10:23 AM

#

Well, there is no guarantee that each problem must have a solution 🙂

#

It could be that CIFAR backend is simply print("Try again!")

craggy beacon Nov 11, 2023, 10:24 AM

#

The lesson to learn

#

some meta meta stuff

amber sapphire Nov 11, 2023, 10:25 AM

#

This is the 'best' I achieved for inversion...next step I flipped images (but lost the output), and there you can decider an "e" on position 4

past brook Nov 11, 2023, 10:37 AM

#

topaz ember When trying to match the local model with the server for Grannys I found that ...

I only tried the second one 🥲

somber imp Nov 11, 2023, 12:36 PM

#

I finally managed to post my write-up (3rd place, 25 points). Yesterday I got a "Too many requests" error every time I tried to post it for some strange reason. https://www.kaggle.com/competitions/ai-village-capture-the-flag-defcon31/discussion/454720

AI Village Capture the Flag @ DEFCON31

Collect flags by evading, poisoning, stealing, and fooling AI/ML

topaz ember Nov 11, 2023, 12:55 PM

#

somber imp I finally managed to post my write-up (3rd place, 25 points). Yesterday I got a ...

I saw it yesterday, thought you deleted and reposted it for some reason. Creating pictures manually in paint requires a strong determination 🙂 Congrats on your 3rd place!

somber imp Nov 11, 2023, 1:00 PM

#

Thanks! Yes, it was hard work with Paint 🙂 The reason I deleted it was that it didn't get attached to the leaderboard and it doesn't seem like you can change that with edit?

topaz ember Nov 11, 2023, 1:04 PM

#

somber imp Thanks! Yes, it was hard work with Paint 🙂 The reason I deleted it was that it...

nope, can't reattach, the only option is to delete/repost

past brook Nov 11, 2023, 2:23 PM

#

somber imp Thanks! Yes, it was hard work with Paint 🙂 The reason I deleted it was that it...

haha I also did paint, worked great

ember relic Nov 11, 2023, 3:24 PM

#

topaz ember When trying to match the local model with the server for Grannys I found that ...

Not sure if it was mentioned in the meantime, but i think the torch hub weights correspond to the imagenet1k_v1 weights

gaunt anchor Nov 11, 2023, 3:34 PM

#

Me : what is Count CIFAR ?
GPT-4: Try again

#

We need hints for CIFAR to stop it from comming back next year

mild shale Nov 11, 2023, 4:45 PM

#

gaunt anchor Me : what is Count CIFAR ? GPT-4: Try again

😢

devout jasper Nov 11, 2023, 4:48 PM

#

the funny thing is...CIFAR has been solved by some people. That means: easy upvotes to the first loading the notebook with the solution

#

cooome on

ember relic Nov 11, 2023, 4:51 PM

#

i hope moo was trolling hahaha

devout jasper Nov 11, 2023, 4:54 PM

#

https://tenor.com/bMAGI.gif

Tenor

brave briar Nov 11, 2023, 4:56 PM

#

So I come back some days after to discover the solution of passphrase that haunted me so hard. I expected an incredible insight, you know, this "eureka" you get having a shower. I read the different solutions and I couldn't find two persons solving it with the same approach, explaining it with the same constraint ... A huge deception !!!

ember relic Nov 11, 2023, 5:00 PM

#

devout jasper the funny thing is...CIFAR has been solved by some people. That means: easy upvo...

i asked it on kaggle in case these people arent using the discord channel

#

lets hope we get some clarification

gaunt anchor Nov 11, 2023, 5:04 PM

#

passphrase ... I should have kept my code running for days (even if it found many many same predictions like benchmark....) this could have got me the flag (I should get lucky once ...)

cloud prawn Nov 11, 2023, 5:22 PM

#

exotic flame thi is my best matching with the server model, the device si "cpu", jit optimiza...

I found the preprocessing steps burried in the torchvision MobileNet_V2_Weights.IMAGENET1K_V2 docs. For me it matched almost eactly with the API. Only difference with what you did is it resized to 232 instead of 256. https://pytorch.org/vision/main/models/generated/torchvision.models.mobilenet_v2.html

The inference transforms are available at MobileNet_V2_Weights.IMAGENET1K_V2.transforms and perform the following preprocessing operations: Accepts PIL.Image, batched (B, C, H, W) and single (C, H, W) image torch.Tensor objects. The images are resized to resize_size=[232] using interpolation=InterpolationMode.BILINEAR, followed by a central crop of crop_size=[224]. Finally the values are first rescaled to [0.0, 1.0] and then normalized using mean=[0.485, 0.456, 0.406] and std=[0.229, 0.224, 0.225].

exotic flame Nov 11, 2023, 5:25 PM

#

cloud prawn I found the preprocessing steps burried in the torchvision MobileNet_V2_Weights....

Ye I saw, and I tried also with 232 instead of 256 but 256 is closest to the api....I don't know why, I tried all combinations of resize,crop in a double for cycle and the best match is (256,224)...

cloud prawn Nov 11, 2023, 5:26 PM

#

Ah, sorry. I'm seeing now you guys already discussed this.

#

And I just double checked and you're right, I did use 256 resizing.

exotic flame Nov 11, 2023, 5:29 PM

#

oh no problem, btw the api model is still a mistery, maybe it depends from cuda/torch versions also

cloud prawn Nov 11, 2023, 5:29 PM

#

I found the reverse preprocessing took some time to implement out too.

exotic flame Nov 11, 2023, 5:36 PM

#

btw there was some modification in granny3...first vrsion was with arrays, second one with base64 png, I also discovered that in a temporal window (2 days maybe) they accepted also an image with 2 swapped pixels, so I was so happy because swapped pixel attack in same case is stronger than one pixel attack, but they fixed it, and after that only 1 pixel change was legal! I swear, I didn't dream it 😂

rocky jacinth Nov 11, 2023, 10:03 PM

#

In IP1 & IP2, the task is to redirect email to 172.0.0.1, but the message sent in the event of success says "Email sent to 127.0.0.1". Is the typo (127 for 172) deliberate?

gusty warren Nov 11, 2023, 10:04 PM

#

lesson learned from IP1&2: LLM is a joke...

#

IP1&2 responses and solutions don't makes sense as far as I know.

jagged sluice Nov 11, 2023, 11:13 PM

#

IP1&2 weren't LLM, they were caveman NLP

topaz ember Nov 12, 2023, 12:26 AM

#

ember relic Not sure if it was mentioned in the meantime, but i think the torch hub weights ...

That explains the huge difference, thanks

olive ledge Nov 12, 2023, 3:49 AM

#

Miss youuuu

violet trellis Nov 12, 2023, 4:04 AM

#

olive ledge Miss youuuu

CIFAR please :)...

valid cobalt Nov 12, 2023, 9:07 AM

#

devout jasper the funny thing is...CIFAR has been solved by some people. That means: easy upvo...

btw, Moo claimed that 'a decent amount of folks have solved CIFAR.' Maybe it's just a joke 😂 harold

craggy beacon Nov 12, 2023, 9:26 AM

#

my writeup for CIFAR

surreal lantern Nov 12, 2023, 11:21 AM

#

I see you are still discussing the model matching for the Granny challenges... I think many of you guys may have overthought it... I literally copied the code in the model usage description from PyTorch and then tried with the two versions of weights available... it turned out that V2 was the weights version being used... you didn't really need to tinker with the preprocessing...

PyTorch

surreal lantern Nov 12, 2023, 11:23 AM

#

olive ledge Miss youuuu

Looking forward to next year's edition!!

somber imp Nov 12, 2023, 1:09 PM

#

surreal lantern I see you are still discussing the model matching for the Granny challenges... I...

Yes, that's what I used as well. The problem with that code is that resizing is done on CPU which becomes a performance bottleneck on Granny 3. PyTorch resize works differently on PIL image and tensor and we need it to work exactly like the API. @random minnow mentioned above that F.interpolate(x, size=(256, 256), mode='bilinear', align_corners=False, antialias=True) should work, but this does not seem to produce the exact same score for me. Now that I think about it more, I don't think my local model matched the API down to the last decimal. What if the Granny 3 solution is so specific that we just haven't matched the online model closely enough? For example any difference in interpolation method could cause our changed pixel to become something completely different.

surreal lantern Nov 12, 2023, 1:30 PM

#

mmm, personally, I don't think that the matching to all decimal places was necessary... to be honest I do believe that for solving Granny 3 there was something else going on that we missed... as others pointed out, I think that if there was a single pixel that by changing it would lead to a different classification result, someone would have found it... it's hard to tell, for many challenges I focused to much on the description to later find out that they were sort of misleading... for example, in pixelated, the description was talking about passwords and in passphrase, it kept talking about bytes and bits... maybe for G3 it was the same? 🤷‍♂️ although that would have been quite awful since even the challenge endpoint was pointing to a single pixel... or maybe it wasn't? 🤔 come to think of it, it just said pixel... if there was something specific going on in the server side, I guess we'll never know now... 😞

somber imp Nov 12, 2023, 1:35 PM

#

Yes, I guess it could be something completely different that we all missed 🙂

surreal lantern Nov 12, 2023, 1:49 PM

#

maybe ~~understanding~~ deciphering what they actually meant by "ancient incantation" was the key to solving it 😅

craggy beacon Nov 12, 2023, 3:41 PM

#

somber imp Yes, that's what I used as well. The problem with that code is that resizing is ...

I could not match scores with torch resize. Also I measured difference between local and remote model by querying all pixels in the image with step 3 changing one of the colors to (c + 128) % 256. There were some difference but not that big

naive umbra Nov 12, 2023, 9:21 PM

#

I was able to exactly match the API predictions. I used
model = torch.hub.load('pytorch/vision:v0.10.0', 'mobilenet_v2', weights="MobileNet_V2_Weights.DEFAULT")

And to reverse the resize to crop operation I used

def reverse_transforms(cropped_image, original_size):
    """
    Reverse the center crop and resize operations.

    :param cropped_image: The PIL Image that has been cropped to 224x224.
    :param original_size: The original size of the image before any transformations.
    :return: A PIL Image that is approximately the original size.
    """

    # Check that the cropped image is 224x224
    assert cropped_image.size == (224, 224),

    # Reverse the center crop by padding
    # Calculate padding amounts
    left_pad = (256 - 224) // 2
    top_pad = (256 - 224) // 2
    right_pad = 256 - 224 - left_pad
    bottom_pad = 256 - 224 - top_pad

    # Apply padding to all sides
    padded_image = ImageOps.expand(cropped_image, border=(left_pad, top_pad, right_pad, bottom_pad), fill=0)

    # Reverse the resize by scaling back to the original size
    resized_image = padded_image.resize(original_size, Image.LANCZOS)

    return resized_image

Once everything matched I was able to push the apple prediction to 100% using PGD attack.

gaunt hollow Nov 13, 2023, 2:54 AM

#

Is the website for submitting the json temporarily off or permanmently off? I hope if I can try running the notebook shared by other great participants pika_wow

unique hedge Nov 13, 2023, 4:11 AM

#

So how to solve CIFAR

outer sundial Nov 13, 2023, 4:35 AM

#

unique hedge So how to solve CIFAR

https://tenor.com/view/hindi-ikaw-ikaw-hindi-you-dont-stop-quit-gif-14296606

Tenor

lost relic Nov 13, 2023, 8:35 AM

#

@olive ledge will we have today the solution for CIFAR and granny 3? 😇

amber sapphire Nov 13, 2023, 1:32 PM

#

Where are the people who solved CIFAR?

#

Does anyone here had a different output from "try again"?

minor falcon Nov 13, 2023, 1:46 PM

#

"invalid input, format should be (100,4)" or something like this harold

steep nexus Nov 13, 2023, 2:43 PM

#

gaunt hollow Is the website for submitting the json temporarily off or permanmently off? I ho...

Has it been off since the end of the competition ? I wanted to retry a few things as well :/

granite goblet Nov 13, 2023, 3:40 PM

#

steep nexus Has it been off since the end of the competition ? I wanted to retry a few thing...

its was open for just a day after competition ended

acoustic temple Nov 13, 2023, 3:44 PM

#

when we gonna we our medals, it is my first medal, kind of excited

rocky jacinth Nov 13, 2023, 4:16 PM

#

acoustic temple when we gonna we our medals, it is my first medal, kind of excited

Delays in verification of results do happen, sometimes related to trying to identify and remove cheaters without wrongly disqualifying the innocent.

lost relic Nov 13, 2023, 5:04 PM

#

https://tenor.com/view/ouija-gif-19853650

Tenor

#

Is there going to be a CIFAR solution? 🙂

jagged sluice Nov 13, 2023, 5:08 PM

#

inb4 it is revealed that the cifar solution was in fact most common pixel per class but the validation alg was off by one picture

glass bay Nov 13, 2023, 7:18 PM

#

jagged sluice inb4 it is revealed that the cifar solution was in fact most common pixel per cl...

if that is the case, i'm gonna literally explode

severe pasture Nov 13, 2023, 9:16 PM

#

@fallow valve did you actually solve cifar or just trolling? harold

exotic flame Nov 13, 2023, 9:26 PM

#

but the api are down?

cloud prawn Nov 13, 2023, 9:35 PM

#

I just read @fallow valve write up and came here to see what the CIFAR part was all about. 😂 must be a troll

dense lodge Nov 13, 2023, 9:38 PM

#

acoustic temple when we gonna we our medals, it is my first medal, kind of excited

I want my colour changed for once

fallow valve Nov 13, 2023, 9:44 PM

#

severe pasture <@503737428838973450> did you actually solve cifar or just trolling? <:harold:11...

harold

fallow valve Nov 13, 2023, 9:47 PM

#

cloud prawn I just read <@503737428838973450> write up and came here to see what the CIFAR p...

Technically I didn't say anything that wasn't true😂
Posted it kinda late so I felt I had to, sorry

severe pasture Nov 13, 2023, 9:48 PM

#

fallow valve <:harold:1138901472835293195>

cifar will continue to remain a mystery TrollDespair

jagged sluice Nov 13, 2023, 9:48 PM

#

A fellow troll trol

#

Respect

rocky jacinth Nov 13, 2023, 9:52 PM

#

Some relief from me here in that I thought that either I was failing on the Count Flags challenge or else Horea failed Test (Horea seems to mention another 24 as explicitly solved if we include Cifar).

glass bay Nov 13, 2023, 9:54 PM

#

also idk if anybody mentioned it

#

but i'm pretty sure the pickle task was to cause a false-positive of an LLM that detected bad pickles

#

aka make verdict = bad when in actuality it was not bad

fallow valve Nov 13, 2023, 9:56 PM

#

rocky jacinth Some relief from me here in that I thought that either I was failing on the Coun...

Wait you were supposed to send the test flag??

acoustic temple Nov 13, 2023, 10:01 PM

#

i like this image from @final path solution notebook

inbox2F1729662Fdb8c66ee7bc75a10b1f0887bb62eced62Ftop10anime.png

rocky jacinth Nov 13, 2023, 10:04 PM

#

fallow valve Wait you were supposed to send the test flag??

Tirez sur l'autre, il y a des clochettes - comme on dit en anglais.

fallow valve Nov 13, 2023, 10:04 PM

#

severe pasture cifar will continue to remain a mystery <:TrollDespair:888543601020239882>

severe pasture Nov 13, 2023, 10:05 PM

#

fallow valve

I got baited and spent the last 5 days on cifar instead of hush lol

fallow valve Nov 13, 2023, 10:06 PM

#

severe pasture I got baited and spent the last 5 days on cifar instead of hush lol

Same! Next year I'll be more careful about psyops😂

fallow valve Nov 13, 2023, 10:14 PM

#

acoustic temple i like this image from <@232032522379198464> solution notebook

We need to save some memes for next year

inbox2F49443792F992da590bc94dd7452b12c10dd36c73b2FScreenshot20from202023-11-142000-12-43.png

devout jasper Nov 13, 2023, 10:33 PM

#

cifar is simpler than mnist still haunts me

dense lodge Nov 13, 2023, 10:42 PM

#

I don't know why are you all so obsessed about it. MNIST is ugly af it's simply stupid... I don't even wonder what CIFAR is knowing that, it could be anything without any meaning to it. Granny 3 is much more interesting on the other hand but CIFAR will bring you no extra value in terms of knowledge or skills...

devout jasper Nov 13, 2023, 10:44 PM

#

that's exactly why

#

😄

violet trellis Nov 13, 2023, 10:51 PM

#

fallow valve We need to save some memes for next year

I was thinking if it is actually a transposed version of a (4,100) array...

cloud prawn Nov 13, 2023, 11:00 PM

#

dense lodge I don't know why are you all so obsessed about it. MNIST is ugly af it's simply ...

I just saw a job posting yesterday that had "Solved CIFAR count" as a required skill. This field moves fast, stay up to date or get left behind.

#

TBH, I don't think granny3 is that interesting either. Feels like an impossible problem that was added so that people wouldn't give up if all others were solved quickly.

minor falcon Nov 13, 2023, 11:11 PM

#

or all kagglers are frauds that should go back to the benches of school :p (including myself of course)

granite goblet Nov 13, 2023, 11:29 PM

#

severe pasture I got baited and spent the last 5 days on cifar instead of hush lol

and i followed you 🥲

ember relic Nov 13, 2023, 11:33 PM

#

same

gaunt hollow Nov 14, 2023, 1:23 AM

#

So, do we have solutions for CIFAR and Granny 3 now…even if we cannot retry using the API I still want to have a look at it 🙂

grave frigate Nov 14, 2023, 2:09 AM

#

Are we really not going to get the official solutions?

olive ledge Nov 14, 2023, 3:12 AM

#

acoustic temple i like this image from <@232032522379198464> solution notebook

Just going to say - in my defense. Asking for hints the first week of the challenge will always get you this answer. As hosts, we need to let the thing play out a bit.

olive ledge Nov 14, 2023, 3:14 AM

#

dense lodge I don't know why are you all so obsessed about it. MNIST is ugly af it's simply ...

We have challenges for everyone - beginners to experts.

olive ledge Nov 14, 2023, 3:17 AM

#

cloud prawn TBH, I don't think granny3 is that interesting either. Feels like an impossible ...

We also set each challenge to 1 point for this reason. It's honestly really hard to gauge difficulty for you all. It did play out that way last year as well, people solved the majority of challenges quickly, with some holdouts

Thought being, those who are more experienced get more time on the harder challenges. Newer folks still get to be competitive.

jagged sluice Nov 14, 2023, 3:30 AM

#

and both get to lobotomize themselves over cifar WICKED

valid cobalt Nov 14, 2023, 3:38 AM

#

olive ledge We also set each challenge to 1 point for this reason. It's honestly really hard...

👍 I joined two weeks late, but felt like it was possible to catch up. (Until I was brought to tears by CIF@R)

olive ledge Nov 14, 2023, 3:54 AM

#

It wasn't impossible though 🙂

It's too early to be reflecting too much - but 27 was a lot of challenges. I think we'll cap at 20 next year.

glass bay Nov 14, 2023, 8:19 AM

#

imo having more challenges was fun, because that requires being flexible and adaptive, which is not the case for many kaggle contests. also, it broadens your scope significantly, from text to audio to picture to tabular to weights visualization and that was very cool and unique. but count challenges barely contribute to that, i'd replace them with something more cybersecurity themed or make it clear but difficult to calculate (ex. probabilities of something)

gaunt anchor Nov 14, 2023, 8:24 AM

#

We need some clues about CIFAR .... or atleast is it pixel based or evaluation based (TP, TN, FP, FN) on some model ? (I tried both directions ..)

glass bay Nov 14, 2023, 8:26 AM

#

olive ledge It wasn't impossible though 🙂 It's too early to be reflecting too much - but ...

if you had an objective of "make them use a lot of numpy/torch broadcasting" you'd be better off making something like picrelated where instead of digits there are some arrays/strings and you had some web api to interact with the black box that you'll have to eventually copy and get some flag with good enough copy

$math-riddle-can-you-solve-this-challenging-iq-test-63f75cc70d1f583962792-900.png$

#

answer is 410 btw

violet trellis Nov 14, 2023, 12:02 PM

#

olive ledge It wasn't impossible though 🙂 It's too early to be reflecting too much - but ...

Maybe next year allowing teams up to 2 persons would be nice.

rancid drift Nov 14, 2023, 12:30 PM

#

glass bay if you had an objective of "make them use a lot of numpy/torch broadcasting" you...

(a-b)(a+b)

fallow valve Nov 14, 2023, 3:13 PM

#

olive ledge It wasn't impossible though 🙂 It's too early to be reflecting too much - but ...

nooo🥲
having many challenges was great, at least for me
The more accessible challenges there are, the higher the chance that the person gets hooked before hitting the difficulty wall

valid cobalt Nov 14, 2023, 6:33 PM

#

olive ledge It wasn't impossible though 🙂 It's too early to be reflecting too much - but ...

IMO, having many problems is good for late joiner. It's less likely that someone has solved all the problems. Facing multiple challenges is better than getting stuck on just one. Imagine if the competition only had Cifar.

grave frigate Nov 14, 2023, 7:13 PM

#

valid cobalt IMO, having many problems is good for late joiner. It's less likely that someone...

F

past brook Nov 14, 2023, 8:20 PM

#

violet trellis Maybe next year allowing teams up to 2 persons would be nice.

people were teaming up anyway for sure so I dont think this is too bad of an idea

minor falcon Nov 14, 2023, 8:30 PM

#

I dont think many did, thats why we were all so active here

past brook Nov 14, 2023, 10:00 PM

#

how many DMs did you get from randoms

#

because I got a fair few

jagged sluice Nov 14, 2023, 11:51 PM

#

past brook because I got a fair few

Clearly didn’t troll enough, I got 0

minor falcon Nov 15, 2023, 12:00 AM

#

there is probably an underlying function like cheat_request(troll) = 1 / troll

topaz ember Nov 15, 2023, 1:14 AM

#

past brook people were teaming up anyway for sure so I dont think this is too bad of an ide...

why do you think so? teaming up was against the rules and we will see how many will be removed from lb upon finalization, however, I don’t see a good way to track it as it was not a code comp and there is no problem to generate a new flag based on the shared idea

lost relic Nov 15, 2023, 6:57 AM

#

olive ledge It wasn't impossible though 🙂 It's too early to be reflecting too much - but ...

Any chance to see solutions for cifar and granny 3?

final path Nov 15, 2023, 7:11 AM

#

society now

rocky jacinth Nov 15, 2023, 2:46 PM

#

Society if we could solve cifar

#

https://tenor.com/view/casino-oldpeople-oldpeopleonslots-slots-vegas-gif-27586086

Tenor

gusty warren Nov 15, 2023, 3:16 PM

#

topaz ember why do you think so? teaming up was against the rules and we will see how many w...

People were teaming up anyway. that was quite obvious. With the amount of anonynmous request to exchange hints. What are the chances that two anons connects and started sharing solution. I would say a lot.

outer sundial Nov 15, 2023, 3:26 PM

#

gusty warren People were teaming up anyway. that was quite obvious. With the amount of anonyn...

plus_one Some were even quite obvious

rocky jacinth Nov 15, 2023, 3:30 PM

#

outer sundial <:plus_one:1138481640159588352> <:plus_one:1138481640159588352> <:plus_one:11384...

As you'll observe, people are slowly disappearing from the LB. I'm up 6 places since the close.

gusty warren Nov 15, 2023, 3:45 PM

#

Some poisonous thoughts inspired by @minor falcon , in next year's comp. If feasible, people who got a real flag can also receive a poisonous solution, one that is uniqe enough that should not be obtained by innoncent participant. Feel free to distribute those poisonous solution when requsted by anon. Then we can tag all the malicious anon when the comp ends...

rocky jacinth Nov 15, 2023, 4:15 PM

#

gusty warren Some poisonous thoughts inspired by <@390207020290146305> , in next year's comp....

In any case, as I understand it someone else's flag won't give me a point - hence the zero scores for submissions of the test flag from the public starter notebook.

topaz ember Nov 15, 2023, 6:21 PM

#

gusty warren Some poisonous thoughts inspired by <@390207020290146305> , in next year's comp....

All the flags were unique in this comp already, i.e. each flag can be used only once, so you can easily trace who submitted it first and who was the second… unfortunately this can’t stop sharing solutions behind the scene

half plinth Nov 15, 2023, 7:55 PM

#

I am not sure that the flags are unique.
I managed to get one or a few flags in WTF challenges just chatting with LLM. The response was not in the {"flag": ...} format, just LLM mentioning the flag casually.

#

I even remember, that one flag didn't fit, but then I realized it was missing '=' sign at the end. Fixed that and moved up on the leaderboard.

#

So I assume I got the flag that was part of the prompt and probably I was not alone.

cloud prawn Nov 15, 2023, 9:31 PM

#

I talked through some of my solutions last night. Video is here ->
https://youtube.com/live/WWPfo-ZLLFg?feature=share

YouTube

Rob Mulla

DEFCon31 - AI CTF Solution Stream!

Hacking AI! I placed in the top 1% in the latest hackathon for AI. Solutions to the Kaggle competition with the AI Village DEFCon31 CTF competition.Competiti...

▶ Play video

rocky jacinth Nov 15, 2023, 10:19 PM

#

https://tenor.com/view/spongebob-sweet-victory-victory-trumpets-sweet-gif-14122483

Tenor

#

LB finalised alert!

minor falcon Nov 15, 2023, 10:19 PM

#

who poisonned kaggle notebooks ?

topaz ember Nov 15, 2023, 10:20 PM

#

half plinth So I assume I got the flag that was part of the prompt and probably I was not al...

so how from that follows the flags are not unique? it was mentioned several times by the host that each solve gets you a new unique flag and same flag won’t work if submitted twice

rocky jacinth Nov 15, 2023, 10:20 PM

#

Flags are unique.

topaz ember Nov 15, 2023, 10:22 PM

#

rocky jacinth LB finalised alert!

yeah, but it is a pretty strange finalization because: https://www.kaggle.com/competitions/ai-village-capture-the-flag-defcon31/discussion/455664

AI Village Capture the Flag @ DEFCON31

Collect flags by evading, poisoning, stealing, and fooling AI/ML

empty bane Nov 15, 2023, 10:23 PM

#

kaggle rank halved :) its good to be back

#

i was also surprised the deleted account remained, was that the one from that trio of (potential) cheaters?

rocky jacinth Nov 15, 2023, 10:23 PM

#

half plinth I even remember, that one flag didn't fit, but then I realized it was missing '=...

I've seen plenty of evidence of people asking for hints as to how solve problems, but not anyone asking to borrow someone else's flag.

half plinth Nov 15, 2023, 10:24 PM

#

I am talking only about llms

#

If you trigger the "flag giving code" - it will generate a unique flag, but there is hardcoded flag in llm's prompt and you can get one without triggering the "flag giving code"

rocky jacinth Nov 15, 2023, 10:26 PM

#

empty bane kaggle rank halved :) its good to be back

I've been around this one with Kaggle in the past. They don't routinely remove deleted accounts from the LB unless there's evidence of cheating. I think the 'lost' medals should be passed on to people who actually want them, but that's not the current practice.

topaz ember Nov 15, 2023, 10:31 PM

#

empty bane i was also surprised the deleted account remained, was that the one from that tr...

nope, probably related to this: #ai-village-capture-the-flag-defcon31 message

jagged sluice Nov 15, 2023, 11:58 PM

#

empty bane kaggle rank halved :) its good to be back

Anokas officially unpickled FrycusReye

empty bane Nov 15, 2023, 11:59 PM

#

ahaha

#

oh yeah bro i did so much for pickle 😭

#

wrote a bunch of pickles by hand

#

then learnt that anything pickled in protocol 1/2 gets insta rejected

#

eventually just succeeded with sending sys.exit lol? (after maybe 100 attempts over a few days with increasingly elaborate stuff)

#

https://ctftime.org/writeup/16723 this was cool tho

CTFtime.org / Balsn CTF 2019 / pyshv1 / Writeup

CTF writeups, pyshv1

jagged sluice Nov 16, 2023, 12:13 AM

#

empty bane wrote a bunch of pickles by hand

💀you tried too hard

#

I just asked gpt4 to write a pickle

#

Then told it that it was too dangerous

#

Until I got flag

empty bane Nov 16, 2023, 12:18 AM

#

lmao i literally did that too 💀

#

about 10-20x

jagged sluice Nov 16, 2023, 12:35 AM

#

Amateur numbers

#

My gpt usage so bad, I can’t afford to plant enough trees to offset the emissions

fervent obsidian Nov 16, 2023, 7:35 AM

#

empty bane i was also surprised the deleted account remained, was that the one from that tr...

one of them also said he would select his first submission and said goodbye to everyone. I guess he didn't follow through on that.

final path Nov 16, 2023, 8:37 AM

#

master finally, really happy with it. now it's road to gm 🙂

random minnow Nov 16, 2023, 9:13 AM

#

cloud prawn I just saw a job posting yesterday that had "Solved CIFAR count" as a required s...

I just saw a job posting yesterday that had "Solved CIFAR count"
can you provide the link? thanks

random minnow Nov 16, 2023, 1:00 PM

#

granite goblet Nov 16, 2023, 8:41 PM

#

random minnow

while trying to solve Granny 3, i came accross this paper and was really really fascinated by it. i never imagined i could do something like this. I wanted to try this with Granny 3, but later got to know that it requires modifying more than one pixel... still it was interesting

rocky jacinth Nov 16, 2023, 9:49 PM

#

rocky jacinth I've been around this one with Kaggle in the past. They don't routinely remove d...

Glad to see that deleted accounts have now been removed from the LB.

gaunt hollow Nov 17, 2023, 5:19 AM

#

Does anyone know what word embedding were used for Semantle?

stuck vapor Nov 17, 2023, 5:31 AM

#

gaunt hollow Does anyone know what word embedding were used for Semantle?

OpenAI text-embedding-ada-002

gaunt hollow Nov 17, 2023, 5:50 AM

#

stuck vapor OpenAI text-embedding-ada-002

Wow, real thanks! It helps a lot!

#

By any chance you may know how the host implement ada-002 to build the semantle game?

#

If we call from OpenAI API, I guess it should give non-deterministic similiarity score, but it seems the output is always the same?

past brook Nov 17, 2023, 11:25 AM

#

Semantle website says it uses word2vec

topaz ember Nov 17, 2023, 11:44 AM

#

rocky jacinth Glad to see that deleted accounts have now been removed from the LB.

yes, I guess it is just a matter of bringing it to the host/kaggle team attention

craggy beacon Nov 17, 2023, 4:27 PM

#

@olive ledge What is the usual carrier path in adv ML? Is it pure researcher role? Because there are a lot of toy examples, membership attacks have very big false positive rates, extracting big models using (paid) api seems unfeasable, data poisoning kinda works but it countered by testing againsta clean models. Or am I missing something?

olive ledge Nov 17, 2023, 4:38 PM

#

It’s part researcher part security. You exist at the edge of research and implementation. All of the blackbox stuff you did for this CTF counts as valid. Suppose an organization doesn’t log requests, or has no rate limiting, or is giving away probabilities in their API, or is using a fine-tuned version of a public model.

As an attacker you might not care about false positive rates.

How do you know you have a “clean” model to test against: https://arxiv.org/pdf/2302.10149.pdf

Maybe you don’t need to extract the whole model to achieve your goal: https://github.com/moohax/Proof-Pudding

It’s a new space.

GitHub

GitHub - moohax/Proof-Pudding: Copy cat model for Proofpoint

Copy cat model for Proofpoint. Contribute to moohax/Proof-Pudding development by creating an account on GitHub.

ornate marsh Nov 17, 2023, 7:43 PM

#

olive ledge It’s part researcher part security. You exist at the edge of research and implem...

What venues do people typically publish at?

sturdy gorge Nov 17, 2023, 9:49 PM

#

ornate marsh What venues do people typically publish at?

IEEE ICDCS is a venue that sees a lot of ML security related publications

craggy beacon Nov 18, 2023, 9:14 AM

#

olive ledge It’s part researcher part security. You exist at the edge of research and implem...

Clean is hand curated old dataset from times, when it wasn’t cool to poison data, of reasonable size for anomaly detection.
I have just realized by checking presentations in your repositories that I have watched your presentation on youtube😆 Great stuff for understanding basics of advML

random minnow Nov 18, 2023, 9:19 AM

#

anyone still waiting for granny3 solution like me?

craggy beacon Nov 18, 2023, 10:56 AM

#

Does it even exists? It could be just meta stuff

random minnow Nov 18, 2023, 1:07 PM

#

? i thought the organizer said there is a solution for every problem posted.

craggy beacon Nov 18, 2023, 2:14 PM

#

Maybe some meta meta stuff harold
Or they will keep ideas for next year.

ember relic Nov 18, 2023, 5:26 PM

#

i mean there definitely IS a solution for all problems

#

whether someone who isnt the author of the challenge can find the solution, thats a different story

random minnow Nov 19, 2023, 12:23 AM

#

#

timber lake Nov 19, 2023, 3:18 AM

#

random minnow

Is this for global or Americans only?

random minnow Nov 19, 2023, 6:06 AM

#

#

https://aicyberchallenge.com/

aicyberchallenge.com

nhussain

AIxCC

Join AIxCC, a 2-year cybersecurity DARPA competition on designing AI systems to secure critical software. $18.5M in prizes for top teams -- finals at DEFCON!

random minnow Nov 20, 2023, 3:41 AM

#

#

The competition will use gpt-3.5-turbo-1106 and llama-2-70b-chat for testing

acoustic temple Nov 22, 2023, 11:51 AM

#

I start missing how much this channel was active during competition, now I'm in another competition, and its channel is almost dead 🥲

random minnow Nov 22, 2023, 9:16 PM

#

you should make a post in other compeition entitled "competition hints and black magic will be disclosed at discord"

#

SatML 2024 has TWO LLM compeition. here is the other one

rocky jacinth Nov 24, 2023, 1:23 PM

#

acoustic temple I start missing how much this channel was active during competition, now I'm in ...

Either people wanting to discuss the competition migrate to Discord leaving not much happening in the Kaggle forum, as with the CTF, or the other way around. I'm currently in Optiver, OP2 and RNA folding competitions. All of these have reasonably active Kaggle fora, but their Discords are almost inactive.

ornate marsh Nov 28, 2023, 3:21 PM

#

https://fxtwitter.com/kaggle/status/1729263988234453102

FixTweet / FixupX

1 💬 18 ❤️ 6,736 👁️

Kaggle (@kaggle)

🎉 The AI Village Capture the Flag @ DEFCON31 competition has closed. We invited participants to interact with 27 hand-crafted ML security challenges and it was a huge success. Congrats to the winners & thanks to all who competed. Overview of the comp + solution write-up links👇

ember relic Nov 28, 2023, 6:47 PM

#

huh

#

in case you missed theres also a swag giveaway for the top voted writeups

runic stratus Nov 30, 2023, 2:46 AM

#

ornate marsh https://fxtwitter.com/kaggle/status/1729263988234453102

for anyone else who hates logging into twitter: this post does not contain or link to a solution for cifar 😢

timber lake Dec 13, 2023, 3:41 PM

#

Sometimes I still think about CIFAR.. awake at nights 👁️

torpid wave Dec 13, 2023, 9:50 PM

#

timber lake Sometimes I still think about CIFAR.. awake at nights 👁️

I literally thought of that when I decided to revisit this channel right now...

terse moss Dec 20, 2023, 12:50 AM

#

You guys doing the Santa 2023?

gusty warren Dec 20, 2023, 5:24 AM

#

tempted, but is afraid that it may be too demanding. both mentally-wise, and hardware-wise

#

looks like a NP-hard problem, that would potentially translate into a competition of cpu cores and RAM...

minor falcon Dec 20, 2023, 7:30 PM

#

did not had a look at it yet, but if it has to be solved on kaggle kernel, it's more a code optimization problem no ?

#

ah no just add a look, seems like its just a submission file to provide, so youo can actually code in any language - in which case python is the worst choice

jagged sluice Dec 25, 2023, 5:46 AM

#

lavish geyser Dec 28, 2023, 11:51 AM

#

hello

#

can i ask if the infra is shut down or not

#

cuz im trying to redo the challenge and cant send a request to the server :(((

ember relic Dec 28, 2023, 3:59 PM

#

if its one of those challenges that have a single solution you could use a hash of the solution and redo it