#playground-series-s6e3

1 messages · Page 1 of 1 (latest)

merry spruce
#

Hii! Soo the competition just began, and I wanted to do an EDA on the data, cuz that's something which I haven't done a lot in the past and wanted to try my hand there. I think it also might be beneficial to know the features and how they interact in the data.
Here's the notebook for that: https://www.kaggle.com/code/rupankarmajumdar/predict-customer-churn-s6e3-eda-baseline
Please do let me know if I need to add anything for making the EDA more impactful. Also gave a baseline with LightGBM.

twin ruin
analog bramble
rare folio
# analog bramble Any feedback, please? Thanks!! https://www.kaggle.com/code/sangeetha007/trial-3-...

I'd consider using CatBoost given the dataset is dominated by category values.

You may also consider using the SVM but if you do use my HVRT repository to reduce samples down to 50k to make it feasible https://github.com/jpeaceau/hvrt. Also, determine irreducible error, investigate the residuals of the SVM, logistical regression model and CatBoost to evaluate a potential ensemble approach.

Do not use random sampling, with notable class imbalance of features, it will likely hurt performance.

#

Also hyperparameter optimization (HPO) with cross-validation for logistical regression and CatBoost is appropriate. You can for the SVM as well, though I'd use multiprocessing as even while reducing down to 50k samples to make it usable, the SVM is quite slow

coarse trout
idle terrace
#

Hi, I'm Doing RealMLP + CatBoost + XGB, is that a good approach? I've gotten a score of only 0.91702.

#

can someone please provide feedback?

zenith mesa
zenith mesa
idle terrace
idle terrace
#

That Shake up was crazy, I climbed almost 200 spots.

grand vessel
rotund briar
#

check my bio 😁

grand ginkgo
#

check my bio 😁

queen vapor
#

Check my bio 😁

shell ridge
#

No!