#commonlit-evaluate-student-summaries | Kaggle | Page 1

vernal nova Aug 8, 2023, 7:49 PM

#

Anyone competing in Commonlit? 2 months to go!

stiff turret Aug 10, 2023, 9:18 AM

#

Hi @vernal nova , I am. I'm kind of beginner to competitive data science. Hoping to learn a lot through this competition no matter the outcome😌 Any suggestions for a newbie like me?

vernal nova Aug 11, 2023, 5:10 PM

#

stiff turret Hi <@1110314527108636762> , I am. I'm kind of beginner to competitive data scien...

That's great to hear! Welcome to competing 🙂 Hmm if you're just getting started, I'd recommend copying 1 or 2 of the most popular notebooks and read through their documentation and try to understand their output. Then, I'd tweak some of the methods and weightings myself to gain an understanding of how these affect the output.

pallid bone Aug 11, 2023, 5:15 PM

#

vernal nova That's great to hear! Welcome to competing 🙂 Hmm if you're just getting started...

Thanks for the @vernal nova , I'm also a newbie, will give give it a try

ornate oracle Aug 12, 2023, 4:47 PM

#

vernal nova That's great to hear! Welcome to competing 🙂 Hmm if you're just getting started...

I’m also on this. Have you made any submissions yet?

wanton musk Aug 12, 2023, 4:57 PM

#

I'm also taking part in this competition, submitted a few models, but I'm struggling in doing better, and still at the bottom of the leaderboard. I'm still using old school machine learning techniques by trying to extract text features. I'm considering now using transformers 🤷‍♂️

orchid harness Aug 14, 2023, 9:04 AM

#

Can we use BERT in this competition? Is it against the terms of this competition or not?

hard wren Aug 14, 2023, 9:13 AM

#

you can use BERT

#

I have a very strange bug where the submission file has only 4 rows, do anyone knows how to fix?

https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries/discussion/431446

CommonLit - Evaluate Student Summaries

Automatically assess summaries written by students in grades 3-12

jolly zinc Aug 16, 2023, 2:03 PM

#

I am facing the "Submission csv not found" error after submission. Can anybody help in how to resolve the issue?

faint sequoia Aug 16, 2023, 5:15 PM

#

jolly zinc I am facing the "Submission csv not found" error after submission. Can anybody h...

Assuming "Submission csv not found" was not a typo on your part: file names usually don't have spaces in them.

jolly zinc Aug 16, 2023, 5:16 PM

#

Yes they don't, it was a typo. "Submission.csv not found" this was the issue. Apologies from my end.

alpine briar Aug 17, 2023, 3:49 AM

#

hard wren I have a very strange bug where the submission file has only 4 rows, do anyone k...

This is not a bug. In code competitions, the real test set is hidden and once you submit your notebook -- it runs on the actual test set which is much larger.

alpine briar Aug 17, 2023, 3:50 AM

#

jolly zinc Yes they don't, it was a typo. "Submission.csv not found" this was the issue. Ap...

Try to run your notebook on a sample of the train as a proxy for the real test set and see what output it generates in the Kaggle env. That will help with debug

jolly zinc Aug 17, 2023, 4:51 AM

#

alpine briar Try to run your notebook on a sample of the train as a proxy for the real test s...

In the Output folder of the Kaggle environment it is storing the "submission.csv" file. But after submitting, the error keeps popping up. This is the first time I am participating in a competition, so I don't have a lot of knowledge in the Kaggle Environment.

alpine briar Aug 17, 2023, 4:54 AM

#

jolly zinc In the Output folder of the Kaggle environment it is storing the "submission.csv...

It probably errors out at some point when it runs the actual test set and so no submission.csv is generated. Try to run the same code on train like it is test and see what happens

jolly zinc Aug 17, 2023, 5:03 AM

#

alpine briar It probably errors out at some point when it runs the actual test set and so no ...

It saved the submission file in the interactive Kaggle notebook without any error.
Provided, this time I tested on the whole train data, just to be sure if this is happening due to size of the data or not.

tawny spear Aug 17, 2023, 12:50 PM

#

Hi guys, do you have any ideas how to imporve performance of training? At the moment my 1 epoch is around 3 min, it's long, and model consist only of backbone + linear

lyric plaza Aug 20, 2023, 9:31 AM

#

hi every one in competiton can we submit mutliple submissions files with different names or with same name

jolly zinc Aug 20, 2023, 3:19 PM

#

lyric plaza hi every one in competiton can we submit mutliple submissions files with differ...

Only Submission.csv will get evaluated.

faint sequoia Aug 20, 2023, 8:10 PM

#

jolly zinc Only Submission.csv will get evaluated.

I think again you are inaccurate with names. It is "submission.csv" rather than "Submission.csv" as that single capitalized letter makes a difference.

jolly zinc Aug 21, 2023, 1:17 PM

#

faint sequoia I think again you are inaccurate with names. It is "submission.csv" rather than ...

Yes, again apologies on my part.

uneven reef Aug 21, 2023, 5:49 PM

#

Hello Everyone,

I have started this competition. I wanna learn about Roberta and everything basics in NLP that reqiures to learn Roberta. Can you provide me with resources or roadmap on where to start and what to learn? I couldn't find any good tutorials on Roberta. Can you suggest me resources on where to start and good platforms I can use to learn?

gleaming kindle Aug 22, 2023, 8:06 PM

#

uneven reef Hello Everyone, I have started this competition. I wanna learn about Roberta an...

Hi @uneven reef , Just search for BERT variants , you can find several practical resources, such as:
https://github.com/facebookresearch/fairseq/tree/main/examples/roberta

GitHub

fairseq/examples/roberta at main · facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python. - facebookresearch/fairseq

#

or original paper:
https://arxiv.org/pdf/1907.11692.pdf

uneven reef Aug 23, 2023, 6:25 AM

#

gleaming kindle Hi <@460870401812070400> , Just search for BERT variants , you can find several...

Thank you

uneven reef Aug 23, 2023, 8:53 AM

#

Anyone wanna work together for this competition?

pale blade Aug 23, 2023, 11:12 AM

#

gleaming kindle Hi <@460870401812070400> , Just search for BERT variants , you can find several...

use deberta v3

round sundial Aug 24, 2023, 7:54 AM

#

Hi, I am Riya Saxena, 4th year UnderGrad @IITRoorkee. I am looking for teammates to participate with. Anyone in? (I have good experience in NLP.)

lyric plaza Aug 26, 2023, 7:07 AM

#

round sundial Hi, I am Riya Saxena, 4th year UnderGrad @IITRoorkee. I am looking for teammates...

CAN I JOIN

#

IAM ALSO A 4TH YEAR UNGERGRAD AT IIT DHANBAD

dire onyx Aug 26, 2023, 3:00 PM

#

round sundial Hi, I am Riya Saxena, 4th year UnderGrad @IITRoorkee. I am looking for teammates...

interested

high heron Aug 27, 2023, 7:24 AM

#

Is this competition very similar to LLM Science Exam ?

#

🤔

uneven reef Aug 28, 2023, 4:32 PM

#

Can we use RNN for this competition?

undone dagger Aug 31, 2023, 3:56 PM

#

Hi, I am new to kaggle competitions and have some questions. Are we allowed to use a pretrained model like llama2 in this competition? How do I decide if a model is allowed based on the license? Can I train a model and upload to huggingface and use it? Thanks

ember mortar Sep 1, 2023, 7:41 AM

#

undone dagger Hi, I am new to kaggle competitions and have some questions. Are we allowed to u...

Hi there,
Yes, you are allowed to use a pretrained model like llama2 in most Kaggle competitions. However, you should always check the competition rules carefully to make sure. The rules will usually specify which models are allowed and which are not.
To decide if a model is allowed based on the license, you need to read the license carefully. Most licenses will allow you to use the model for competitions, but there may be some restrictions. For example, some licenses may require you to give credit to the original authors of the model.
Yes, you can train a model and upload it to huggingface and use it in a Kaggle competition. However, you should make sure that the model is not violating any licenses. You should also make sure that the model is not plagiarized.

lyric plaza Sep 1, 2023, 2:31 PM

#

self.model_name = "username/debertav3large"
self.model_dir = "/path/to/local/directory"

#

can anyone help me in getting username of debertav3large and its repository id

tawny spear Sep 2, 2023, 7:53 AM

#

lyric plaza can anyone help me in getting username of debertav3large and its repository id

microsoft/deberta-v3-large

ornate oracle Sep 2, 2023, 8:03 AM

#

Did anyone use BERT for this competition?

bold cobalt Sep 2, 2023, 1:10 PM

#

ornate oracle Did anyone use BERT for this competition?

deberta?

simple field Sep 2, 2023, 2:04 PM

#

ornate oracle Did anyone use BERT for this competition?

I think many people use variants/improvements of bert, like distilbert and roberta

#

because native Bert performs worse than the new variants iirc

simple field Sep 3, 2023, 11:46 PM

#

Hey everyone, I recently made a discussion post on how to get started for beginners! You can check it out here: https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries/discussion/436739

Let me know if you have any feedback or if I should add/change anything to the post!

CommonLit - Evaluate Student Summaries

Automatically assess summaries written by students in grades 3-12

ornate oracle Sep 4, 2023, 7:45 AM

#

simple field I think many people use variants/improvements of bert, like distilbert and rober...

Thanks

vocal moon Sep 7, 2023, 9:15 AM

#

Zoom is end:)
Video: https://youtu.be/L6OQmXk1Am4

Presentation: https://docs.google.com/presentation/d/101mmtjEvIwKKP2klTwJZ_yIGydXyorQkKn19SrMoZnE/edit?usp=sharing

Please @everyone come to Zoom: ⌚️ Thurday 7 September, 19.00 (CET time e.g. Paris time)
"Hands on Hugging Face Transformers and Kaggle CommonLit challenge"

YouTube

SciBerloga

👨‍🔬 Alexander Chervov "Hands on Hugging Face Transformers and Kagg...

👨‍🔬 Alexander Chervov "Hands on Hugging Face Transformers and Kaggle CommonLit challenge (https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries)"
⌚️ Thursday 7 September

Educational webinar for beginners: Examples of using DeBERTa-like models from the Hugging Face Transformers collection to solve NLP tasks. Using the Kaggle ...

▶ Play video

Google Docs

CommonLit and Hands on HF

Hands on Hugging Face Transformers and Kaggle CommonLit challenge (https://www.kaggle.com/competitions/commonlit-evaluate-student-summaries) Alexander Chervov SciBerloga 07 September 2023 Video: https://youtu.be/L6OQmXk1Am4 1

wise lynx Sep 12, 2023, 8:22 AM

#

hi folks, I recently posted a question here about the deberta v3 model,
https://www.kaggle.com/code/tsunotsuno/debertav3-baseline-content-and-wording-models/comments

please feel free to drop by and give it a look

Debertav3 baseline (content and wording models)

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

long drift Sep 13, 2023, 7:29 PM

#

round sundial Hi, I am Riya Saxena, 4th year UnderGrad @IITRoorkee. I am looking for teammates...

too late to join?

north hare Sep 17, 2023, 12:46 PM

#

I got 0.22 for content and 0.46 wording mse is this good or I need to improve ?

hushed gull Sep 20, 2023, 4:08 AM

#

I have just completed implementing MobileBERT.

There are still some modifications to be made in preprocessing, and the result is not that great, but I believe putting this model on mobiles will be crucial for the real-life application of this service.

There are still many areas that has poor or no internet connections, and the children in such area shouldn't be neglected.
Please feel free to take a look!
https://www.kaggle.com/code/jasonheesanglee/updated-mobilebert-implementation

[Updated] MobileBERT implementation

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

prisma arch Oct 10, 2023, 3:32 AM

#

north hare I got 0.22 for content and 0.46 wording mse is this good or I need to improve ?

this looks really good, just make sure it is properly cross-validated and does not over fit