#deep-past-initiative-machine-translation
1 messages · Page 1 of 1 (latest)
just run %pip install xxx in a cell
The rules of the competition are that we can’t have Internet connected notebooks doesn’t that stop this from being a permitted approach?
you can clone the repo into your notebook env (as in, set it as a 'input data source' and install from there)
@hasty cape - wondering about the use of tools like Claude Code to help generate code. The rules for this competition say to not share competition data with any entities not participating in the competition. Does that preclude me from using Claude Code in a local jupyter notebook? Thank you in advance, from a Kaggle newbie!
Hey all, how long are people’s successful submissions taking to run?
In 'normal usage' you are not really sending any competition data to anthropic, except maybe one or two examples, in which case I would assume shouldn't be problematic. you can make sure to not share any input prompts/data for training as per https://code.claude.com/docs/en/data-usage#data-policies (I hope you are not trying to get claude code to read the whole dataset and generate some code to train a model... ) disclaimer, I'm not one of the competition organisers, I'm just another participant..., so if you are really worried, you might need to ask on the kaggle discussion forum
I'm having the same issue. it's been 1hr+ and it's still haven't done scoring the submission. Did yours work ?
thank you Wendy.
I’ve gotten one successful run and it was ready about 2 hours after submission. But maybe it’s a queu thing? Not quite sure m
I need help submitting my successful run. I have successfully completed "Save & Run All" multiple times, but my notebook does not appear as a selectable option when I click "Submit Prediction" on the Submissions tab.
My notebook runs offline (Internet OFF) and produces a submission.csv in the output folder. Has anyone else encountered this issue or is there a specific step I'm missing to make a version "eligible" for scoring?
Mine runs for about 3 minutes... But I'm not able to submit. I feel like I'm either blind or looking in the wrong place. 😛
Of course the two aren't mutually exclusive. 😉
I found that I had used up a 'successful notebook' slot by submitting a new notebook before uploading the actual code. And now my successful submissions has been running for 1+ hours.
Here's why the spinner is still going:
The Scale: While the "Commit" only processed 4 records, the real competition test set likely has thousands of rows (8,000+ indicated in the challenge details).
The Math: If your notebook takes ~3 seconds per record (based on your logs), then:
1,000 records = 50 minutes.
2,000 records = 1 hour 40 minutes.
The Hidden Run: Kaggle does not show the logs for the active Submission run to keep the test data private. You will only see the "Success" and the final score once it has translated every single record in the hidden file.
For me it normally takes 15-20 minutes. I basically followed the shared inference notebook and with P100 it shouldn't take an hour to do the inference. Maybe you forgot to change the runtime
It's in session options -- accelerator -- choose whichever you want, probably GPU P100
Hi guys from where the dataset is to be downloaded
Hello all, I need 1-2 team mate for 2 competition. Interested individuals should have deep interest and expertise in modern deep learning architecture specially NLP and transformers. Must have access to GPU machine or cloud. Interested individuals fee free to dm me or mention me here.
Hello, everyone. This is my first competition. Have a wonderful day
hey everyone, this is my first competition
Hey is anyone onlin ei need some idea on something
my question is regarding the analysis of the training dataset in itself
what's up
so as of now to implement a model for this hackhathon i am working on this given train dataset but it is very big and manually working to make inferences, get domain knowledge, and build up paramets to input whne training the model seems like a hefty approach, is there any alternative you are taking up? like maybe to infer data on the train dataset better or using an entirely different source of data to train the model on for testing?
i have curated a manual dictionary and rules that i have inferred uptil now but working with 1561 rows and going through each row manually, translating it word by word manually, in this day and age with the speed we have, this seems like a very slow approach
any inputs?
@dawn terrace
i'm using LLMs for a lot of my work
you can maybe pass your dictionary as context
I'm using LightGBM+CatBoost+XGBoost regression+RecSys for this task
Hi, this is my second. Nice to see someone else new to the environment
Hii guys can anybody help . I am new in kaggle competition and unable to make submission . I am getting this red flag
Cannot submit
Your Notebook cannot use internet access in this competition. Please disable internet in the Notebook editor and save a new version.
disable internet acces in your session options before submit
and if you have to install packages use the package manager or offline wheels
im running a submission file and i have tried submitting it 5 times on different days with multiple tweaks but it is running into an error that says notebook threw an exception and there is no specific error output to work on , any help?
hey guys any one want to team up for this i got 21.7 score in it on first try without any extra data now i have collected a data worth of 35000 rows which took me 1 and half week
if any one have good knowledge of dl and had made model regrading this problem lets team up and score high rank together
i will be able to work on it from next month cuz of exam right now
hey can i join
DMed
Looser