#neurips-2023-machine-unlearning | Kaggle | Page 1

graceful sand Sep 11, 2023, 8:24 PM

#

This is a very interesting task

#

Any idea how to do it? Anyone?

sage hinge Sep 11, 2023, 8:35 PM

#

they have a started notebook in the nips competition page : https://unlearning-challenge.github.io/

NeurIPS 2023 Machine Unlearning Challenge

Website for the NeurIPS 2023 Machine Unlearning Challenge.

outer hill Sep 11, 2023, 11:21 PM

#

Interesting challenge, looking forward to what everyone tries

deft elk Sep 15, 2023, 4:36 AM

#

Interesting one!! wanted teammates for this. if Interested please DM me

candid flicker Sep 16, 2023, 5:40 PM

#

Interested to team up for this.

undone kindle Sep 18, 2023, 10:56 AM

#

looks like the top-1 entry is just +0.01 above the standard solution. Anyone is fighting with this challenge, and finds it difficult?

cobalt juniper Sep 18, 2023, 12:33 PM

#

undone kindle looks like the top-1 entry is just +0.01 above the standard solution. Anyone is ...

It is very challenging

#

maybe during this week organizers will share more details

undone kindle Sep 18, 2023, 3:04 PM

#

is it normal that if you put more than 1 epoch in the standard notebook you get lower performance???

mellow oracle Sep 18, 2023, 6:25 PM

#

Hello, I am looking for teammates for this competition. A little bit about myself, I have good experience in Machine and Deep Learning. I have done multiple internships in the same. This is my first Kaggle competition and I plan to experiment and learn a lot throughout this competition. Please DM me if you wanna team up.

undone kindle Sep 18, 2023, 8:24 PM

#

Apropos erasing memories

https://www.youtube.com/watch?v=FQ9l4v7zB3I

YouTube

neoknowstic

Non-Euclidean Therapy for AI Trauma [Analog Archives] #SoME3

PATIENT ALICE: An Artificial Intelligence suffering from hallucinations of a lost puppet show. These hallucinations need to be erased.

GENERATIVE MODEL TYPE: Diffusion-based.

PRESCRIBED TREATMENT: A Latent Space Editing method that involves the Pullback, the Jacobian Matrix, Eigenfaces and SVD.

NOTE: read the Erra...

▶ Play video

round raptor Sep 23, 2023, 10:23 PM

#

It seems that cifar10 is much easier too solve than the hidden dataset. I’m new to cv. Any tips for other adequate benchmark datasets?

full echo Sep 24, 2023, 8:47 PM

#

undone kindle is it normal that if you put more than 1 epoch in the standard notebook you get ...

i suppose, the original resnet is sooooooooooooooooooo finely trained that adding epochs overfits

#

so does like 4 separate approaches i've tried and tested on cifar10 that have failed to pass 0.05 on the comp data

#

the thing i haven't done is GAN-ing the whole dataset anew, and besides that i'm compeletely out of ideas

undone kindle Sep 24, 2023, 8:49 PM

#

How van you gan-ing?

#

Can*

#

But you want to extract them?

#

Because I thought you wanted to gan the dset and save it

#

Anyway looks bugged this competition

#

There is no way that adding 1 si gle epoch destroys the metric

#

Either the metric is broken or they finetuned the whole process and crafted too perfectly that you can broke everything with a small change

full echo Sep 24, 2023, 8:53 PM

#

undone kindle How van you gan-ing?

as in try to come up with a fitting augmentation or whatever with something that generates images similar to that of forget set instead of them that won't trigger under MIA and won't mess with the score too much

undone kindle Sep 24, 2023, 8:53 PM

#

Ah OK OK write a paper man who cares about the competition lol

#

Lol

full echo Sep 24, 2023, 8:55 PM

#

undone kindle Either the metric is broken or they finetuned the whole process and crafted too ...

its both imo. they stated the first thing in desc, thus why no medals for it, and the second is somewhat self explanatory from their accuracy being 98% train 96% test on the pre-forget model

#

infortunately the cifar10 notebook gives 99.8 and 88 or smth which is waaaaaay bigger window for imperfect algorithms to work just fine while failing on the competition leaderboard

undone kindle Sep 24, 2023, 8:56 PM

#

Yeah but man....+1epoch and you break all the thing?

#

Or just a small change in the lr

full echo Sep 24, 2023, 8:58 PM

#

well +1 epoch when added to 1 epoch sounds like massive overfitting waiting to happen

#

tbh what would fix like 90% of the frustration is so that kaggle could show disassembled metric

#

aka show separately the forget score the retain score the test score

#

like you could at least get where to tune hyperparameters to

#

would be cool if authors could address some of our concerns but oh well

nova moth Sep 24, 2023, 11:23 PM

#

has anyone tried generating "anti samples" and putting them into the model? I saw that concept in some machine unlearning conference.

void carbon Sep 25, 2023, 10:00 PM

#

I feel like the competition turned into finding out how to interact with the hidden dataset and not about unlearning anymore. I was deeply invested in the earlier CIFAR notebooks when they announced, but I've been slamming my head against a wall with this new one

full echo Sep 26, 2023, 12:41 PM

#

void carbon I feel like the competition turned into finding out how to interact with the hid...

tbh same. The problem is that CIFAR10 notebook (w/ 99.8% and 88% on train/test) does not represent the model in the contest (w/ 98.98% and 96.43%) good enough, and also that its hyperparameters are way to fine to afford any change off of the default solution, thus there are 1 submission people in top50. Either there is a complete breakthrough, guaranteeing 0.06+ consistently (NO idea what the top2 people did with their 0.08+), either you pray on RNG. IMO even interactions with the dataset are not really in play, since there are many discussions on class weights being seemingly pointless. Unless you mean just straight up finding a completely similar dataset with faces, finding ways(and recourses) to train ResNet on it until it reaches approximately the same accuracy and only then you get somewhat representable testing environment that you can actually tune your parameters and approaches in

full echo Sep 26, 2023, 12:42 PM

#

full echo aka show separately the forget score the retain score the test score

i guess the quickest solution on the staff part may be this

flint bridge Sep 26, 2023, 1:29 PM

#

Is there a way to get hypothetical data, of which the structure is similar to hidden data. It would be useful at least to check the code, if it is working or not.

full echo Sep 26, 2023, 2:00 PM

#

flint bridge Is there a way to get hypothetical data, of which the structure is similar to hi...

there are examples of notebooks in which CIFAR10 dataset is imported and loaded and run

#

haven't checked this one personally but here https://www.kaggle.com/code/asarvazyan/unlearn-faces-or-cifar10-submit-w-o-exceptions

Unlearn Faces or CIFAR10 - Submit w/o exceptions!

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

flint bridge Sep 26, 2023, 2:29 PM

#

full echo haven't checked this one personally but here https://www.kaggle.com/code/asarvaz...

Thanks

cloud seal Sep 26, 2023, 7:45 PM

#

Hello,
Quick question, is changing the batch size allowed? I have a doubt here. If yes do you know the max batch size possible on P100?

nova moth Sep 30, 2023, 2:27 AM

#

is there any information on the data? like what are its dimensions

full echo Sep 30, 2023, 7:30 PM

#

nova moth is there any information on the data? like what are its dimensions

3x32x32 (cifar10-like), presumably faces of people

nova moth Sep 30, 2023, 7:40 PM

#

full echo 3x32x32 (cifar10-like), presumably faces of people

got it, thanks

muted yew Oct 3, 2023, 10:20 AM

#

I just couldn't access Kaggle site, did anyone have the same accident?

terse cargo Oct 3, 2023, 6:35 PM

#

muted yew I just couldn't access Kaggle site, did anyone have the same accident?

yep, had it today. Seems to be fixed rn

full echo Oct 4, 2023, 1:21 PM

#

Is there any chance for the competition to be remade with different metric in presence of this discussion https://www.kaggle.com/competitions/neurips-2023-machine-unlearning/discussion/442582

NeurIPS 2023 - Machine Unlearning

Erase the influence of requested samples without hurting accuracy

inner geyser Oct 5, 2023, 9:06 PM

#

Hi, I'm not able to submit any notebook successfully. It just keeps running forever

nova moth Oct 6, 2023, 1:00 AM

#

inner geyser Hi, I'm not able to submit any notebook successfully. It just keeps running fore...

he does not know 💀

#

the submissions take 4 hours + because of the 512 model checkpoints

inner geyser Oct 6, 2023, 3:08 AM

#

Ah thanks

#

Is there any faster way to measure metrics before submitting?

full echo Oct 6, 2023, 7:25 AM

#

...no

full echo Oct 6, 2023, 7:29 AM

#

inner geyser Is there any faster way to measure metrics before submitting?

no, except locally recreating the whole setup they describe in the paper. they outline some of it in code at neurips challenge notebook here https://github.com/unlearning-challenge/starting-kit , but i personally couldn't recreate it compeletely

GitHub

GitHub - unlearning-challenge/starting-kit: Starting kit for the Ne...

Starting kit for the NeurIPS 2023 unlearning challenge - GitHub - unlearning-challenge/starting-kit: Starting kit for the NeurIPS 2023 unlearning challenge

sleek gull Oct 8, 2023, 4:00 PM

#

Is there any other dataset with pre-trained and re-trained models available?

fallow basin Oct 8, 2023, 4:14 PM

#

interesting paper about unlearning - https://browse.arxiv.org/pdf/2310.02238.pdf

unique venture Oct 11, 2023, 5:22 AM

#

Hi, I am getting notebook timeout but the same code works with starter kit just fine even for larger epochs.

#

Can someone point me the possible issues?

peak sapphire Oct 16, 2023, 2:05 PM

#

can any one review this https://www.kaggle.com/competitions/neurips-2023-machine-unlearning/discussion/447573 ?

NeurIPS 2023 - Machine Unlearning

Erase the influence of requested samples without hurting accuracy

deft elk Oct 18, 2023, 2:34 PM

#

Hi, is this topic something that I can pursue as an undergraduate student? Or perhaps work on it for my final thesis?

full echo Oct 18, 2023, 2:40 PM

#

deft elk Hi, is this topic something that I can pursue as an undergraduate student? Or pe...

Maybe, but not within this competition, since the metric used is extremely controversial

gilded tartan Oct 18, 2023, 11:31 PM

#

Hey there, I haven't checked on this competition in two weeks

#

Can anyone give me a TL;DR of the developments that occurred while I was away?

tardy blade Oct 20, 2023, 1:06 PM

#

Hi, i am new to kaggle, i was trying to join this competition but couldnt find input data for this competition. I trued running the pinned starter notebook but getting filenotfounderror for csv files. Is there any beginners guide? Please guide me

gilded tartan Oct 20, 2023, 4:24 PM

#

tardy blade Hi, i am new to kaggle, i was trying to join this competition but couldnt find i...

This competition is a little special, as all the data is hidden and can only be accessed when your notebook is submitted

#

So the goal is to develop an algorithm to perform the required task without access to the data, and then submit your algorithm to be evaluated using the hidden test set

terse cargo Oct 21, 2023, 6:21 PM

#

deft elk Hi, is this topic something that I can pursue as an undergraduate student? Or pe...

I've took this topic as my final thesis. There are truly many problems there if you'll start digging into the topic. Different metrics, different scenarios, different model architectures etc... So pretty easy to write something unique even if you'll do some comparison across methods for even the same model but different data scenarios.

terse cargo Oct 21, 2023, 6:25 PM

#

unique venture Hi, I am getting notebook timeout but the same code works with starter kit just ...

There are 512 models untraining on notebook evaluation. + Time for zipping. Maybe thats why its failing.

unique venture Oct 22, 2023, 10:41 AM

#

terse cargo There are 512 models untraining on notebook evaluation. + Time for zipping. Mayb...

Yeah i did notice that we are running it 512 times.
So what i tried was to add dropout layers to the model and then finetune it. It timed-out even for one epoch.
Is adding dropout layer increasing the computation cost so much that it fails?!

unique venture Oct 22, 2023, 11:07 AM

#

terse cargo I've took this topic as my final thesis. There are truly many problems there if ...

Even I am doing this as one of my course project. Can you suggest some good metrics to use (separately offline). The starter kit used simple MIA but they mention they use whole bunch to attacks for testing.

static flare Oct 26, 2023, 3:24 PM

#

SDG;

shadow patio Oct 26, 2023, 5:47 PM

#

Anyone looking for an additional team member? I did last year's Multimodal scATAC/scRNA prediction competition, but looking to step up my game for this time around 💪

fallen spoke Oct 27, 2023, 1:56 AM

#

How the hell do people get ~0.09??? Any change degrades the metric like crazy lolol

merry cliff Oct 30, 2023, 3:26 PM

#

Hey all! Has anyone changed the model architecture or we are supposed to go with ResNet18 only?

stray zephyr Oct 30, 2023, 8:11 PM

#

Hello, I just joined the machine unlearning competition and I am new to Kaggle competitions as well. What kind of data are they using for training the target model and can we have access to this dataset ?

terse cargo Oct 31, 2023, 8:28 PM

#

unique venture Yeah i did notice that we are running it 512 times. So what i tried was to add d...

hmmmm, hard to say, the default model that is proposed in starting kit runs about 4-5 hours. so it might be it. Or it might be some bug as well

terse cargo Oct 31, 2023, 8:31 PM

#

unique venture Even I am doing this as one of my course project. Can you suggest some good metr...

I cannot as I didnt have time to read all those papers suggested in kaggle forum. I can give you a link to them from where you can take them. I will though be doing deep dive from now on, so I can send you some interesting stuff in about a 2-3 weeks that I will find.

terse cargo Oct 31, 2023, 8:33 PM

#

stray zephyr Hello, I just joined the machine unlearning competition and I am new to Kaggle c...

The type of data is described in their paper attached in data on kaggle. We cannot access the dataset.

covert pollen Nov 17, 2023, 7:03 PM

#

Hi everyone, is supoosed to use only the resnet18 model? Or is supposed to find another model? Thank you!