#commonlit-evaluate-student-summaries
1 messages Β· Page 1 of 1 (latest)
Hi @vernal nova , I am. I'm kind of beginner to competitive data science. Hoping to learn a lot through this competition no matter the outcomeπ Any suggestions for a newbie like me?
That's great to hear! Welcome to competing π Hmm if you're just getting started, I'd recommend copying 1 or 2 of the most popular notebooks and read through their documentation and try to understand their output. Then, I'd tweak some of the methods and weightings myself to gain an understanding of how these affect the output.
Thanks for the @vernal nova , I'm also a newbie, will give give it a try
Iβm also on this. Have you made any submissions yet?
I'm also taking part in this competition, submitted a few models, but I'm struggling in doing better, and still at the bottom of the leaderboard. I'm still using old school machine learning techniques by trying to extract text features. I'm considering now using transformers π€·ββοΈ
Can we use BERT in this competition? Is it against the terms of this competition or not?
you can use BERT
I have a very strange bug where the submission file has only 4 rows, do anyone knows how to fix?
https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries/discussion/431446
Automatically assess summaries written by students in grades 3-12
I am facing the "Submission csv not found" error after submission. Can anybody help in how to resolve the issue?
Assuming "Submission csv not found" was not a typo on your part: file names usually don't have spaces in them.
Yes they don't, it was a typo. "Submission.csv not found" this was the issue. Apologies from my end.
This is not a bug. In code competitions, the real test set is hidden and once you submit your notebook -- it runs on the actual test set which is much larger.
Try to run your notebook on a sample of the train as a proxy for the real test set and see what output it generates in the Kaggle env. That will help with debug
In the Output folder of the Kaggle environment it is storing the "submission.csv" file. But after submitting, the error keeps popping up. This is the first time I am participating in a competition, so I don't have a lot of knowledge in the Kaggle Environment.
It probably errors out at some point when it runs the actual test set and so no submission.csv is generated. Try to run the same code on train like it is test and see what happens
It saved the submission file in the interactive Kaggle notebook without any error.
Provided, this time I tested on the whole train data, just to be sure if this is happening due to size of the data or not.
Hi guys, do you have any ideas how to imporve performance of training? At the moment my 1 epoch is around 3 min, it's long, and model consist only of backbone + linear
hi every one in competiton can we submit mutliple submissions files with different names or with same name
Only Submission.csv will get evaluated.
I think again you are inaccurate with names. It is "submission.csv" rather than "Submission.csv" as that single capitalized letter makes a difference.
Yes, again apologies on my part.
Hello Everyone,
I have started this competition. I wanna learn about Roberta and everything basics in NLP that reqiures to learn Roberta. Can you provide me with resources or roadmap on where to start and what to learn? I couldn't find any good tutorials on Roberta. Can you suggest me resources on where to start and good platforms I can use to learn?
Hi @uneven reef , Just search for BERT variants , you can find several practical resources, such as:
https://github.com/facebookresearch/fairseq/tree/main/examples/roberta
or original paper:
https://arxiv.org/pdf/1907.11692.pdf
Thank you
Anyone wanna work together for this competition?
use deberta v3
Hi, I am Riya Saxena, 4th year UnderGrad @IITRoorkee. I am looking for teammates to participate with. Anyone in? (I have good experience in NLP.)
CAN I JOIN
IAM ALSO A 4TH YEAR UNGERGRAD AT IIT DHANBAD
interested
Can we use RNN for this competition?
Hi, I am new to kaggle competitions and have some questions. Are we allowed to use a pretrained model like llama2 in this competition? How do I decide if a model is allowed based on the license? Can I train a model and upload to huggingface and use it? Thanks
Hi there,
Yes, you are allowed to use a pretrained model like llama2 in most Kaggle competitions. However, you should always check the competition rules carefully to make sure. The rules will usually specify which models are allowed and which are not.
To decide if a model is allowed based on the license, you need to read the license carefully. Most licenses will allow you to use the model for competitions, but there may be some restrictions. For example, some licenses may require you to give credit to the original authors of the model.
Yes, you can train a model and upload it to huggingface and use it in a Kaggle competition. However, you should make sure that the model is not violating any licenses. You should also make sure that the model is not plagiarized.
self.model_name = "username/debertav3large"
self.model_dir = "/path/to/local/directory"
can anyone help me in getting username of debertav3large and its repository id
microsoft/deberta-v3-large
Did anyone use BERT for this competition?
deberta?
I think many people use variants/improvements of bert, like distilbert and roberta
because native Bert performs worse than the new variants iirc
Hey everyone, I recently made a discussion post on how to get started for beginners! You can check it out here: https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries/discussion/436739
Let me know if you have any feedback or if I should add/change anything to the post!
Automatically assess summaries written by students in grades 3-12
Thanks
Zoom is end:)
Video: https://youtu.be/L6OQmXk1Am4
Presentation: https://docs.google.com/presentation/d/101mmtjEvIwKKP2klTwJZ_yIGydXyorQkKn19SrMoZnE/edit?usp=sharing
Please @everyone come to Zoom: βοΈ Thurday 7 September, 19.00 (CET time e.g. Paris time)
"Hands on Hugging Face Transformers and Kaggle CommonLit challenge"
π¨βπ¬ Alexander Chervov "Hands on Hugging Face Transformers and Kaggle CommonLit challenge (https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries)"
βοΈ Thursday 7 September
Educational webinar for beginners: Examples of using DeBERTa-like models from the Hugging Face Transformers collection to solve NLP tasks. Using the Kaggle ...
Hands on Hugging Face Transformers and Kaggle CommonLit challenge (https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries) Alexander Chervov SciBerloga 07 September 2023 Video: https://youtu.be/L6OQmXk1Am4 1
hi folks, I recently posted a question here about the deberta v3 model,
https://www.kaggle.com/code/tsunotsuno/debertav3-baseline-content-and-wording-models/comments
please feel free to drop by and give it a look
too late to join?
I got 0.22 for content and 0.46 wording mse is this good or I need to improve ?
I have just completed implementing MobileBERT.
There are still some modifications to be made in preprocessing, and the result is not that great, but I believe putting this model on mobiles will be crucial for the real-life application of this service.
There are still many areas that has poor or no internet connections, and the children in such area shouldn't be neglected.
Please feel free to take a look!
https://www.kaggle.com/code/jasonheesanglee/updated-mobilebert-implementation
this looks really good, just make sure it is properly cross-validated and does not over fit