#playground-series-s5e3

1 messages · Page 1 of 1 (latest)

glossy crescent
#

Hi I am doing this kaggle contest

subtle river
#

me too,!

hazy moss
#

Me too

maiden nova
#

mee too

quartz fable
#

hello everyone... can anyone please help !!??

#

how can i handle this imbalance data ?

hazy moss
nimble briar
mint burrow
#

I also doing it...

hazy moss
#

Hmmm, two people has 1.0000 in leaderboard...

quartz fable
haughty hatch
#

Anyone want to teamup for this competition? Rainfall prediction

wheat aurora
lethal fox
vital oak
#

outside of leaderboard probing , what do we think is the "best" score so far?

wheat aurora
wheat aurora
vital oak
wheat aurora
vital oak
#

yes

wheat aurora
#

For the missing point what did you do for that one?

vital oak
#

I didn't do anything yet with that. I was trying to get a baseline but it seems weird that I'm getting such a big difference with only 1 missibng point

wheat aurora
#

It does add quite a bit since huge outliers can wreck some common metrics. Try fixing that first and see what you get.

#

Especially since the LB dataset is only about what? 700 entries?

cold otter
#

The public LB dataset is right now only 146 rows of data

#

I have been getting these huge differences too, 0.88-0.89CV - 0.84-0.85LB..

glossy wind
#

Hello, I am a beginner in Data Science and very new to Kaggle competitions, and I've been pretty stuck on how to improve my score in this contest as most of my attempts have seen no improvements.

Currently, my approach has been engineering features, adding temporal features, selecting the best features, and then using an XGBoost model to make my predictions.

However, I cannot seem to increase my public LB score past .82 (which I got by just throwing the unprocessed dataframes into my XGB), and many of my attempts I do to improve my public LB score end up making my score worse. For example, I tried engineering more features but that decreased my public LB score from ~.82 to ~.81. I tried doing forward feature selection, but that also decreased my LB score by .01

I'm pretty stuck here because I don't know what I'm doing wrong or if I'm unknowingly doing a common beginner's mistake. I don't really understand how other people's XGB models are getting public LB scores of above 0.85. I'm not sure if my feature engineering is lacking, if I'm using the wrong model, if other submissions are overfitting the public LB, or something else. Any advice helps!

My current notebook: https://www.kaggle.com/code/michael927/rainfall-pred

whole furnace
#

you can check the OOF method (out of fold) brought up in the discussion, should be helpful but be careful on not overfitting your model

wheat aurora
#

Pretty much what I expected. 0.003-0.009 was the difference between 500th and 1st

#

Spent an hour and got about 0.006 below first place. Can't complain about that

ember thunder
#

Hello.

cold otter
#

Big shake up, from 2.5k to 222
Damn, can't complain either though

ashen ruin
#

Will we be getting an access to the winning solution of this competition? If yes, where? In the discussion or here?

PS: I’m new here.

wheat aurora