#💬┊general
@torpid flower what is the highest score you achieved in the tutorial
for titanic
0.79
Should I go through all the python guides before trying titanic?
Hi, I want to get into deep learning. Please is there any documentation, courses or yt channel that you guys can recommend? Thanks
Hi, I learn about machine learning and data science
@arctic plume go for it. Learn as you build the model. @last finch I’m building a deep learning series in Nov/Dec
no it is not necessary. you should have enough knowledge though. mostly loops and conditionals should suffice. If you already had knowledge on python there is no need to go through the guides.
I'm here to discuss interesting problems in the competition
Hello community....let's make this a win-win🤝
hi i would like to learn about machine learning
Hi guys i want to learn machine learning with kaggle competition
Is it possible to shut down the computer and keep the Kaggle model training?
yea
you can also save the model locally
Hi everyone! I am new to the world of data science, i am overwhelmed by the things going on kaggle, i have tried to do activities here and there but i would highly appreciate if any experienced learner can help out or take me in a team, so i can grasp on concept and other necessary stuff and become efficient on kaggle. Thanks
I passed a JSON doc into a model using LangChain, and when running inference with the RAG model it does not seem to respond based on the JSON data specifically, even for questions that should have a good answer. Do I need to restructure my JSON data from the nested dict style to something more condensed?
if the notebook is active it will continue to run
Okay. Thank you
If you save and commit, the notebook continues to run in the background.
Hi My name is Angel and I am new to data science. I look forward to learning all that I can so that I may do this full time. Any advice is greatly appreciated and cant wait to meet you all!
Start with titanic and house prices
Trying my first competition after them and it helped a ton
Hello, I'm a career switcher and I'm here to learn machine learning, EDA, and everything else about data science
Hi everyone, I'm just getting started with competitions.
Hi guys , just finished working on New York taxi trip duration prediction. Here is my notebook, please check it out. https://www.kaggle.com/code/nishchay331/pc-1-new-york-city-taxi-trip-duration . If you have any suggestions on improvement or any better idea , feel free to let me know . Thank you.
welcome to kaggle community !
I'm Andazi, new to data science and eager to excel. Please check out my recent project at this link: https://www.kaggle.com/code/andazi/economic-analysis-of-nigeria/ I'd greatly appreciate any suggestions or corrections you can provide. Thanks a bunch
can anyone point me to a dark mode setting for the site? I have cataracts 😦
I have the dark reader extension, and usually it does well. Unfortunately it seems to affect some of the elements on the page in a negative way I've experienced so far
It's better to download the app on your phone or laptop, which has a dark setting by default
pp
how to ensure that the usability factor of a dataset is good while uploading a dataset?
I need a satellite dataset in a hurry for a segmentation graduation project!
Hello everybody, I want to ask: can we increase the amount of augmented data generated from the given dataset in TensorFlow?
Heyy folks, any suggestions like tutorials to start working with tensorflow???
I had this same question! They actually give you little tips like describing your columns, data, etc
How I can join various groups here?
Check out id:browse to see all the channels available.
guys i have a question
how do i import all of the jupyter notebooks from github into my machine?
for the fastai course
If all the notebooks are in a single repository you can just clone the repo to your local machine either by downloading the zip file or by using the git clone command
thank you so much
I installed Jupyter Notebook on my Mac with all the fastai modules
but it gives a ModuleNotFoundError
could someone please help?
I'm creating a text-to-emotional-speech converter but don't know how much it will cost or what GPU is required
Can anyone help me on this?
Hi Everyone! 👋
I'm a Kaggle noob who likes drinking chai 🍵
hi

Ahoy,
Here for fun and learning from others!
Is kaggle site working for you? Since yesterday I am facing lots of problems on the site
It frequently crashes
I am not able to create datasets from notebook outputs
Notebook outputs are not properly visible
It is ok for me
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=DfWpXuwSFc8
For this report, our community wrote hundreds of essays covering a broad array of machine learning topics, and then experts from our community selected the best. The result is a collective perspective on the rapid advancements of AI, shedding light on the most salient topics in modern machine learning. The Kaggle AI report 2023 includes 7 winnin...
Hi to everybody. I have studied a little the basics of ML and would like to start building models to make a portfolio and/or participate in competitions. Have a good weekend.
Hello, just a former football player from Mississippi interested in the Big Data Bowl
hey guys
i am currently doing b-tech cse with ibm specialization
and i am in 3rd year
what courses do you prefer me for good opportunities
@here
Hello everyone, I am a fifth-year student in vocational training in the field of big data and statistics, delighted to be here among you
Hey, anyone taking part in HackSquad 2023?
Hello everyone, I'm Nitish Pal. I'm learning machine learning and exploring this exciting world of AI/ML with Kaggle.
Hi all This is Umesh Ramanathan!! Feels great meeting a lot of people of my domain
hey guys i am a data scientist student and looking forward to getting into kaggle competitions and expanding my knowledge in machine learning
Hi there! I'm here to compete in the "Open Problems – Single-Cell Perturbations" competition.
Hello everyone, I am Toheeb Kayode. I am a data scientist in training, and I am here to build on my existing professional network and take part in competitions.
Hi, if I have ideas for Kaggle, where can I write to them?
heyyy!!!
You can either write them on the feature submission forum on the website, or on the Kaggle feedback thread on the discord server, for something less formal.
hi
Can I write it to someone in DM? I prefer talking to person
I'm Grace, an information project management student, and my passion for technology has led me to love data analysis too. I'm delighted to be part of the Kaggle adventure.
is there any available sources for high accuracy deepfake image (binary) classifier?
Hi I'm Rocky. I currently work in data analytics but looking to build my skills and hopefully move from what I currently to sports analytics.
Hi, I am new here
how do you get a bronze in datasets, is that like uploading data to https://www.kaggle.com/datasets ? I guess you have to collect some data from somewhere and clean it up right, I guess that's probably good to practice
Hello, I'm Noah....finished my masters in Statistics then raised twins....now trying to re-learn and update for job search
yes, you need to put out a dataset on kaggle then get a few upvotes and you will have a bronze medal
Hello, I am Ridhima, a recent graduate in data science, looking to build my skills and trying to land up with a new job
Hi i am a software developer in the field of web development using technologies like angular javascript and .net etc. i am MCA passout in 2020. now i want to explore my knowledge in the area of ML . i am a beginner so thats why i am using kaggle. Thanks.
Anyone here familiar with yolo and darknet?
Hello! Good to see you here. Already looking forward to your weekend reading list post haha
I will do segmentation on the xBD dataset, which contains images and masks; each sample has a pre-disaster and a post-disaster image or mask. How do I preprocess this data and make it ready for my model, and what should the model look like so I can give it the before and after images and their labels and get the final mask?
Hey, did you check this link: https://paperswithcode.com/dataset/xbd ? There are a few papers actually implementing models
it even has the GitHub repo
Hi everyone ! 
Hello~ I'm here to learn all about kaggle 🙂 🙂
Hi, I am Dhruv. I have started learning and making projects in ML and I am new to Kaggle and looking to learn from you all
Hello everyone
Hello, I'm a data scientist from greece and I like to tackle on new challenges!
Hi all, I am Mayuri from India working as a freelance data analyst. I write blogs and notebook on data science for helping beginners to understand the concepts easily. Happy to connect with you all.😊
Hello everyone, My name is Hritwij and I'm from Maharashtra, India. I love in depth Machine Learning and to learn the whys and hows. I'm here to ask a lot of questions and learn with you all. Also I'll share whatever opinions that I find reasonable. Happy to connect with you all 😄
Hi, I am a master's student in statistics, undergraduate civil engineering. Good luck to everyone for the private score.
Hi all, love for technology has led me here. Looking forward to learning with you.
Happy to meet you
Same
hey, is there something wrong with Kaggle notebooks today. seem to be taking awfully long to start up?
Yes,facing the same issue. Can't open the site.
i found it was a problem on my end. and things r fine now
hi
hey, are there any datasets related to Tax laws in India or US in Kaggle??
Hi, I am a beginner. How do you tune your model to improve it, for example when to add a dense layer or a dropout layer? Is there a course or book for that?
anyone of you currently working in the tech industry?
Hi all, very junior newbie here. Quick question, what do computer science use for note taking apps? Thx
Usually when I take notes I use Obsidian.
Hello, I'm a master's student interested in ML, looking forward to learning from you all
Hello everyone, I'm a software engineer and full-stack developer who majored in mathematics.
Yes, performance is same in both training and test
I improved more but can't go past that
Maybe more dataset or hyperparameter tuning should do something
Hi everyone! I'm MaryKat_ -- I work as a parapro in a special needs classroom. I am learning Python, SQL, JavaScript, and HTML.
My current loss (MSE) has decreased to 0.1812 on the Stanford RNA project; I want to decrease it more
This is the model. What else can I do?
I studied NLP basics, then learned about the transformer and implemented it. Now I don't know what to do next
What do I have to learn and what should I do?
Deploy them
Learn to fine tune them
For specific tasks and try to implement them , or deploy and test it
I'm in the process of completing a data science course on Coursera. Now I want to start competing in Kaggle competitions. If anyone can guide me in competitions and help me get a data science job without a degree, please reply

I tried to solve it both ways, using a sequence model and an artificial neural network, but both models get the same mean absolute error. I did cross-validation and feature engineering and was able to improve slightly, but no luck getting past 0.18 MAE. Does anybody know what else I can do, for example from a domain knowledge perspective? Stanford Ribonanza RNA project
Hi everyone, hope all are ok. Does anyone recommend a good, complete and free (lol) guide for timeseries analysis?
Anyone else unable to click "submit predictions" to submit competition files?
Chrome and Edge both have this issue
what's a nice follow up to the Titanic challenge?
I like the banner
yeah I do use that button, did not face any issue, if that's a temporary issue I did not submit for the last 2 days (I use chrome)
Hello, this is my first time here.
🚬
How fast are kaggle notebook GPU compared to Google Colab?
Which course are you completing?
I am getting different values when i try to check shape of train data in colab. can anyone help me?
Hi this is my first time here i am an engineering student at CMU studying AI
Hi, I'm new here. Would anyone like to team up with me for the competition?
Hi, I would like to join your team. I am new here as well
heyy..... I want to make a team on kaggle. hit me up if you want to join my team!
I would like to join your team. I am new here as well
yoo yoo hey i am new
That would be great
hii...i would like to join
i wanna join a team too
I would like to join. do you wanna join too ? @old umbra
anyone who uses rx 6650 xt i need some help?
Uh, hi guys I'm new to using Kaggle. Where can I find an introduction/guide to using Kaggle. I'm a student and would like to participate in competitions as well to include it in my ECs for college applications. It would be really helpful if someone could show me the ropes.
did you checkout https://www.kaggle.com/competitions/titanic/overview ?
Start here! Predict survival on the Titanic and get familiar with ML basics
Yeah, I saw an ad for that and joined Kaggle
But idk what to do at all
I have no idea how all this works
If it helps, I know how to code in python
i am also kind of on the same boat...am a beginner
this is the tutorial for that
aight, ill check it out
im reading the overview and all rn
how long do these competitions go on for?
wait, how do I accept the rules and join the competition?
oh it's ongoing, my bad
@tough copper checkout the video in this one too
Hello, I have found Kaggle's GitHub organization. Does anyone know how I can join it?
Isn't there a request join option there ?
I seriously recommend getting any degree if you are in india
Hi I am Venkat Sai. I am a Sophomore from India. Love to be here...looking forward to learning a lot from all of you!!!
Hello! kagglers I'm glad to be part of you!
Depends on your data, and what you want to do
so i am doing the titanic competition...you must have done it already ?
Hello everyone! I am glad to become part of this community! 👍😀
Hi everyone
Feature and model selection
how to do that...am a beginner
Do you know EDA and logistic regression?
Titanic model can be made using those things
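To make the logistic regression suggestion concrete, here is a minimal sketch. The rows below are made-up toy values that only borrow a few Titanic column names (`Pclass`, `Sex`, `Age`, `Survived`); with the real competition data you would load `train.csv` instead. The feature choices are assumptions for illustration, not a recommended final model.

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Made-up toy rows (NOT real Titanic data), using real Titanic column names.
train = pd.DataFrame({
    "Pclass": [3, 1, 3, 1, 3, 2, 3, 1],
    "Sex": ["male", "female", "female", "female", "male", "male", "female", "male"],
    "Age": [22.0, 38.0, 26.0, 35.0, None, 54.0, 2.0, 40.0],
    "Survived": [0, 1, 1, 1, 0, 0, 1, 0],
})

# Simple feature preparation: encode Sex as 0/1 and fill missing ages with the median.
features = pd.DataFrame({
    "Pclass": train["Pclass"],
    "is_female": (train["Sex"] == "female").astype(int),
    "Age": train["Age"].fillna(train["Age"].median()),
})

model = LogisticRegression(max_iter=1000)
model.fit(features, train["Survived"])

# Accuracy on the training rows only; a real workflow would hold out validation data.
train_accuracy = model.score(features, train["Survived"])
print(f"training accuracy: {train_accuracy:.2f}")
```

EDA would come before this step: looking at how survival varies with each column is what tells you which features are worth encoding.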
Welcome to my data science journey through the Kaggle Titanic - Machine Learning from Disaster Project!
In this video, we'll dive deep into the world of data analysis, feature engineering, and machine learning to predict passenger survival rates on the Titanic.
As Kaggle states: "The competition is simple: use machine learning to create a mode...
I followed and did his work. You will get to know many things.
@jovial gulch
thanks a lot for this...let me go through
hello everyone
https://www.kaggle.com/tamsquare/code my competition submissions and notebooks here if you upvote i will be appreciated thanks!
@admins @moderators can we have a voice channel please
Feature engineering might help
how to do that
check kaggle notebooks, google, yt tutorials, tonnes of content available online
It is a whole concept, so you should be researching on it and apply the applicable feature engineering techniques for your data.
hello guys I am new to machine learning but I find it interesting hoping to get to know interesting things
Hello all. I am in my last semester of an applied math degree, and interested in ML/modelling, finance, and this year a lot of LLM stuff (prompt optimization and agent architecture mostly)
Hello @surreal wing
What do you think about learning solutions from past competitions? I mean, I have opened this thread (https://www.kaggle.com/discussions/questions-and-answers/451632#2508772)
Summary of every past competition.
Hello!
Hello everyone
Hi Everyone
Hi everyone, I just open sourced Colab2PDF and I'm trying to promote it.
Colab2PDF: Convert your Colab notebook to a PDF. No configuration necessary.
- The novel thing about this conversion method is that it doesn't require you to type in any filepaths or filenames.
- While configurable, it doesn't require any configuration to run (just copy-paste-run).
- It's also built on top of Quarto (which is built on top of Pandoc).
- It has TinyTex built in, so you don't need to apt install large LaTeX packages.
- It also auto-installs any necessary LaTeX packages on-the-fly.
- Less than 100 lines of code.
Hi everyone I'm a developer with a business background that is in the process to learn more about Data Science
hi
Hi every one I am a researcher and a student
I hope you have all seen this https://chat.lmsys.org/?utm_medium=email&utm_source=gamma&utm_campaign=-lmsys-2023
Would love to discuss such topics in a dedicated channel for LLMs, but the NLP channels are also cool for it. What do you all think?
tbh I'm not so excited by new models anymore. There's so many of them all claiming to be better than the others... I am actually more interested in work that make these models useful in a production context, e.g. more efficient serving, running on resource limited hardware, how to link with internal document sources etc
Interesting! I'll check it out later
Hello all, I have recently joined my first Kaggle competition, but the rules of that competition say "Internet access disabled". Does that mean I can't import external libraries?
thanks
thanks
Im excited
Generally how much time and effort does it take to become a competition expert ?
I have a personal technical blog, where I write article and tutorials whenever I get time. You can take a look and search for your need https://dhirajpatra.blogspot.com
If any related post you want me to write kindly let me know. Thank you.
Think different is our daily technological stories of Artificial Intelligence, Data Science, Machine Learning, cloud, open-source, Python, management
You don't. Rather than dropping points or features, or imputing values, there are some techniques that let you train ML models with NaNs in the feature matrix. XGBoost for example works really well with missing values. If you want to train other ML models (like linear models, SVMs, neural networks, etc), you could use the reduced features approach. https://www.jmlr.org/papers/v8/saar-tsechansky07a.html
@wary dawn so what does XGBoost put in for missing values?
It leaves them as is. When it's time to split a feature that contains missing values during training, XGBoost checks whether it's more advantageous to put samples with missing values to the left or the right leaf based on the gain. This is pretty neat because if missingness in a feature correlates with the target variable, XGBoost learns it and makes your predictions more accurate. If you were to impute or drop points/features, you might miss out on this and the model performance would suffer. Check out Section 3.4 of the XGB paper for more details. https://arxiv.org/pdf/1603.02754v3.pdf
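The "left or right based on the gain" idea can be sketched in a few lines of plain Python. This is a simplified illustration of choosing a default direction for missing values at one candidate split, not XGBoost's actual implementation; the function names, the toy gradients, and the regularization value `lam=1.0` are all assumptions made for the example.

```python
def split_gain(g_left, h_left, g_right, h_right, lam=1.0):
    """Regularized split gain from summed gradients (g) and Hessians (h)."""
    def score(g, h):
        return g * g / (h + lam)
    return 0.5 * (score(g_left, h_left) + score(g_right, h_right)
                  - score(g_left + g_right, h_left + h_right))


def best_default_direction(values, grads, hess, threshold, lam=1.0):
    """Try sending missing values (None) left, then right; keep the better gain."""
    gl = hl = gr = hr = gm = hm = 0.0
    for v, g, h in zip(values, grads, hess):
        if v is None:            # missing: decide its side later
            gm += g
            hm += h
        elif v < threshold:      # present and below the split point
            gl += g
            hl += h
        else:                    # present and at or above the split point
            gr += g
            hr += h
    gain_left = split_gain(gl + gm, hl + hm, gr, hr, lam)
    gain_right = split_gain(gl, hl, gr + gm, hr + hm, lam)
    if gain_left >= gain_right:
        return "left", gain_left
    return "right", gain_right


# Toy example with squared-error loss: gradient = prediction - target, Hessian = 1.
values = [1.0, 2.0, None, 8.0, 9.0, None]
targets = [0, 0, 1, 1, 1, 1]
grads = [0.5 - t for t in targets]   # initial prediction of 0.5 for every row
hess = [1.0] * len(values)

direction, gain = best_default_direction(values, grads, hess, threshold=5.0)
print(direction, round(gain, 4))
```

In this toy data the rows with missing values have the same target as the high-value rows, so sending them right gives the larger gain. That is the sense in which missingness that correlates with the target gets learned rather than imputed away.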
@wary dawn But if we leave the NaN values, it will be difficult to normalize and to convert categorical values into numerical ones. I don't know how to proceed with that
Sklearn's standard scaler normalizes continuous features with NaNs based on the means and stds of the non-missing values so that's not a problem. If you have missing values in a categorical feature, replace the NaN with the string 'missing' or something similar and use the one-hot encoder. The missing values will be treated as another category and a corresponding dummy feature is created. If you have missing values in an ordinal feature, replace NaNs with the string missing, decide where the missing category falls in the ordered list of categories, and then apply the ordinal encoder.
Once you are done with preprocessing, there should only be NaNs in continuous features.
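A minimal sketch of the preprocessing described above, assuming pandas and scikit-learn are installed. The column names and values are made up for illustration. It shows `StandardScaler` fitting on the non-missing values while leaving the NaN in place, and a categorical NaN turned into its own `'missing'` category before one-hot encoding.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Toy data (made up): one continuous and one categorical feature, each with a NaN.
df = pd.DataFrame({
    "age": [20.0, 30.0, np.nan, 40.0],
    "embarked": ["S", np.nan, "C", "S"],
})

# StandardScaler disregards NaNs when fitting and keeps them as NaN when transforming.
scaler = StandardScaler()
age_scaled = scaler.fit_transform(df[["age"]])

# Treat missingness in the categorical feature as its own category.
embarked = df[["embarked"]].fillna("missing")
encoder = OneHotEncoder(handle_unknown="ignore")
embarked_onehot = encoder.fit_transform(embarked).toarray()
categories = list(encoder.categories_[0])

print(scaler.mean_[0])      # mean computed from the three non-missing ages
print(age_scaled.ravel())   # the third entry is still NaN
print(categories)           # includes a 'missing' category
```

After this step the only NaNs left are in the continuous column, which matches the state described in the message above.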
Hi!!
I need to find articles or a competition about finding your ideal payment by changing the loan amount, interest rate, and term and seeing the effect on the payment amount.
Do you know of any?
@wary dawn thanks for this much explanation
Hello,
does anyone have access to a healthcare dataset related to dengue?
I wanted to make an AI model for predicting if a person is suffering from Dengue...
Something like this one https://www.kaggle.com/discussions/general/91461 Preferably something with less null values for accurate predictions
Typhoid and Dengue Fever Symptoms Dataset.
Also, I was trying out a social-impact solution, something dealing with the SDGs. The project idea shouldn't already exist, or it should develop an existing project idea into something better
your suggestions/advices would be of great help!
also
I'm working on an AI project and there seem to be many null values in the dataset
would you advise me to go with fillna or dropna?
Also, if I use fillna and fill in average values, wouldn't it affect the dataset?
And since the project deals with healthcare, would there be a huge effect if I add in average values?
One more thing: sklearn is moving towards estimators that handle NaNs. https://scikit-learn.org/stable/modules/impute.html#estimators-that-handle-nan-values
@wary dawn one less thing to do, but it will increase the model's training time
Why do you think that?
@wary dawn the model has to put in some value that best fits the null values
If you have more than 50% nan values just drop the column
If it’s less than 20% use either mean or mode depending on if it’s a categorical feature or not
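The rule of thumb above can be sketched with pandas as follows. The data and column names are made up, and the rule does not say what to do between 20% and 50% missing, so this sketch simply imputes every column that is not dropped; that choice is an assumption.

```python
import numpy as np
import pandas as pd

# Toy data (made up for illustration).
df = pd.DataFrame({
    "mostly_missing": [np.nan, np.nan, np.nan, 1.0, np.nan, 2.0],  # about 67% NaN
    "temperature": [38.5, np.nan, 39.0, 37.5, 38.0, 38.25],         # about 17% NaN, numeric
    "city": ["A", "B", None, "A", "A", "B"],                         # about 17% NaN, categorical
})

# Share of missing values per column.
missing_share = df.isna().mean()

# Drop columns with more than 50% missing values.
to_drop = missing_share[missing_share > 0.5].index
cleaned = df.drop(columns=to_drop)

# Impute the rest: mean for numeric columns, mode for categorical ones.
for col in cleaned.columns:
    if pd.api.types.is_numeric_dtype(cleaned[col]):
        cleaned[col] = cleaned[col].fillna(cleaned[col].mean())
    else:
        cleaned[col] = cleaned[col].fillna(cleaned[col].mode().iloc[0])

print(missing_share.round(2))
print(cleaned)
```

Whether a mostly empty column is safe to drop still depends on the feature and the dataset, so treat the 50% cutoff as a starting point, not a fixed rule.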
Hi yall, anyone know active servers/communities to learn prompt engineering?
Nope. Again, XGBoost does not impute, no value is 'put' to 'best fit the null values'
More than 50% but If I drop that column it would affect prediction
so how do I go about it?
Usually if nan values are greater than 50% it’s not important
But then again depends on the feature and particular dataset you’re working with
cool thanks!
Hi guys is anyone also unable to select any GPU on the accelerator tab?
Which dataset were you working on though?
I can give a better answer then
Hi, I believe I had this problem, I believe I solved it by doing the phone verification of my account (in your settings)
Hi! @gritty salmon thanks for your reply! In my settings under the phone verification section it says "verified". I'm also pretty sure I did the verification a while ago. 😦
Thebloke has an active community
Hi !
I'm new here !
Can anyone tell me how to do projects on Kaggle ?
take a look at the titanic
and the guides
Its not really a public dataset but its a dengue dataset
https://www.kaggle.com/discussions/general/91461
OK, so I can't help you more. I hope you find an answer to your question; maybe you will have more luck if you ask in #❓┊ask-a-question (if you haven't already asked there)
Cool will do! Thanks 🙂
:)
hello there
hi
Does anybody know if Kaggle has any university or college datasets that has 2 or more tables (like Student table, Class table, other tables) so I could design a relational database on SQL
Hi all, created a notebook on Time Series clustering using a statistics method called functional data analysis (FDA). Please take a look and let me know what you think if you're interested: https://www.kaggle.com/code/yuqizheng/time-series-clustering-with-fda
are all the competitions in kaggle machine learning based?
Let's connect on Kaggle! https://www.kaggle.com/akshitsharma1
can anyone who has completed the python course please dm me?
is there anyone who knows how to take multiple inputs in tf model i am having trouble preparing data
As for your second question, the choice between fillna and dropna is determined by the nature of your dataset and the significance of the missing values. If the number of missing values is small and dropping them will not significantly reduce the size of your dataset, you may choose to do so. If dropping them means losing valuable information, filling them with appropriate values (such as the mean) is a good alternative.
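One way to make that choice less of a guess is to measure how many rows `dropna` would actually remove before deciding. A minimal sketch with pandas, using made-up values and hypothetical column names:

```python
import numpy as np
import pandas as pd

# Toy data (made up); the column names are hypothetical.
df = pd.DataFrame({
    "platelets": [150.0, np.nan, 90.0, 120.0, 60.0],
    "fever_days": [3.0, 5.0, np.nan, 2.0, 4.0],
})

# How much data would dropna cost?
rows_kept = len(df.dropna())
share_lost = 1 - rows_kept / len(df)
print(f"dropna keeps {rows_kept} of {len(df)} rows ({share_lost:.0%} lost)")

# Alternative: keep every row and fill each column with its own mean.
filled = df.fillna(df.mean())
print(filled)
```

Here two missing cells in different rows cost 40% of the data, which is the kind of loss where filling is worth considering. For medical data in particular, mean-filled values are estimates, so it is worth checking how sensitive the model is to them.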
i can help with that
but before getting started need to have a perfect dataset which relates to ur desire
yea dataset The right one I am not finding...
The AI Model Part and deployment with flask I can work on that but dataset I am not finding the right one
wait
btw why flask ??
oh deployment , ok
@halcyon leaf I cannot find any datasets. What if I publish my own dataset? Are you willing to use it?
Also, I need to grow my tier progress in datasets.
well I need accurate results...Like idt we can make a dataset like dengue one
As its medical report
and its going to be a social enabled solution so it requires accurate results
i have sample results..
i don't think i have real life data with accurate medical reports
let me see....
@halcyon leaf btw you can take a look at this dataset .. just made and published few seconds ago
https://www.kaggle.com/datasets/dipayancodes/dengue
Note - it's just a sample
Yea its good but I would want accurate stuff...
@halcyon leaf heyyy check here
I just found this dataset on a government website. I guess it will work for you; it contains accurate data.
just click on export and then .. export to csv
thats number of cases in an area
Does anyone have experience generating MCQ questions using NLP? I mean, how should I approach the problem?
need to research well then lol
Hello,
How do I deal with an overfitting model?
Mitigate through various strategies like cross-validation and hyperparameter tuning ..etc
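As a rough illustration of how cross-validation exposes overfitting and how one hyperparameter can reduce it, here is a sketch using scikit-learn. The dataset is synthetic, and the decision tree and the `max_depth=3` setting are arbitrary choices for the example, not a recommendation.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Synthetic, noisy classification data (20% of labels are flipped at random).
X, y = make_classification(
    n_samples=300, n_features=20, n_informative=5, flip_y=0.2, random_state=0
)

results = {}
for depth in (None, 3):  # None = unlimited depth, 3 = constrained tree
    model = DecisionTreeClassifier(max_depth=depth, random_state=0)
    cv_acc = cross_val_score(model, X, y, cv=5).mean()   # accuracy on held-out folds
    train_acc = model.fit(X, y).score(X, y)              # accuracy on the training data
    results[depth] = (train_acc, cv_acc)
    print(f"max_depth={depth}: train={train_acc:.2f}, cv={cv_acc:.2f}")
```

A large gap between training accuracy and cross-validated accuracy is the signature of overfitting. Tuning hyperparameters such as the tree depth is one way to narrow it.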
Cool thanks!
Hi everyone!
I'm just starting to work on Kaggle. Would you advise where to start and what should be solved? Preferably with links. Thank you in advance!
ok
Failed to submit. What happened?
Reached the Kaggle Expert tier, learned a lot. The data science community grows by sharing ideas
Thanks a lot, I'm really loving this journey, participating in competitions and making new friends.
Hi, I am new here. https://www.kaggle.com/chinzorigtganbat
It was a great experience learning and researching on Kaggle. Today I am thrilled to say that I reached the Contributor tier :))
https://www.kaggle.com/dipayancodes
Hi, i joined on the lux challenge and now also looking into the Enefit
Hi, when awarding medals, does Kaggle also consider how old a notebook is? My notebook is 5 months old with 27 upvotes (22 non-novice), yet it hasn't got a silver medal
Hi Everyone. Im data scientist trying to improve my skill and knowledge.
hi i was wondering if i could use ngrok to open a tunnel in order to remotely collect training data, i am taking part in the UBC Ovarian Cancer Subtype Classification and Outlier Detection competition
im passing this data to another computer with wandb...
I have to make a chatbot for education purposes like I have a json file as a dataset. I have to make a chatbot that answers from the dataset. Does anyone have any idea how should I approach the task
do you have experience with NLP and chatGPT?
@runic trout I have some experience with NLP but not with gpts
dont worry! do you have experience training of any model ?
u can start with searching what is qna models, how to make closed domain qna models
Hi everyone! I'm Maria, new to the data science field and also to Kaggle. I'm here to learn and practice!
Does Kaggle have any plans to improve the editor?
I'm really missing PyLint and Black integration.
I sure hope so. Just customizable keybinds with editor emulation so my emacs muscle memory stops screwing me up would be a godsend
Hi this Harsha K , I am new to data science field and also to kaggle . I'm here to learn and practice !
Is Kaggle down for anyone else?
it was, now its working
Did they solve the issues from last night?
Hey everyone 🙂 I just made a notebook on powerlifting (squat) and i was wondering if you could give me some feedbacks since im new to python and pandas !
Heres the link : https://www.kaggle.com/code/sebastienmotionstats/pandas-in-depth-squat-guide-and-data-2023-11-16
Hope you like it and please comment the notebook so i can see the feedbacks 🙂
kaggle is down for me right now 😦
Hi everyone, I'm Leo, a biomedical engineering student, but I'm very interested in machine learning. I hope to learn a lot here.
great!
hi everyone! i wanted to see if there's folks who'd be interested in a voice chat discussion about open versus proprietary models and how you use either / or / both. if you're interested, react to this post or reply and i'll try to find a time for next week. thaaanks! #llms message
great!
is there any model that is particularly good for sentiment analysis? I would want to go from a string (a tweet for example) to a float between -1 and 1, just to know if the tweet is saying something positive or negative and how good/bad it is, what model would you use to train this type of data?
Hi Everyone. I'm undergraduate student trying to improve my skill and knowledge
So many members but the server seems inactive 
^-- 👍
🙂
@wicked berry I think people are busy writing code, and have all notifications turned off
This is true for me haha
the server is pretty inactive in general
For the mohs-hardness regression data set, where can I find what the acronyms of the features actually are, e.g. what is "el_neg_chi_Average"? Am I just supposed to google this, or is there somewhere I can find this for future competitions also?
https://www.kaggle.com/competitions/playground-series-s3e25/data?select=train.csv
Playground Series - Season 3, Episode 25
I am new to competitions and I just saw the impact EDA could have on performance metrics, any tips for a rookie like me?
This is going to be a chemistry term, I think el is electronegative
Its an electronegativity measure if I am not wrong
this is as far as my chemistry knowledge goes
general q but does anyone know how to solve this azure error?
How does one disable internet on a notebook
On the right hand side panel of your notebook there will be an option to do so.
hi y'all just curious about the environment, does everyone have their own gpu? or do you guys pay colab pro to run the models?
Thank you
Hi all, how are we?
Hi, does anybody know the SOTA AI for generating a talking head video with picture and audio as input?
@torn oak we use GPUs connected to GCP instances. They require some setup but make it easier to manage to get a project started.
is this better than paying for something like lambda cloud? here are their prices for reference https://lambdalabs.com/service/gpu-cloud#pricing
Maybe.
Regarding local vs. cloud based GPUs, it’s easier to launch a VM with GPUs and tune/scale as needed. If you need bare metal or have on-prem requirements, then go local.
I'm using Paperspace - they have two options, either running a dedicated VM or using notebooks in a similar way to Kaggle/Google Colab, with an option to connect to the Jupyter server remotely from your own editor
When a job asks for at least 1 year of experience in data analytics, could creating your own personal projects count as experience
If a public community has seen your personal projects and commented or collaborated on them, then it would be worth asking the employer to count them as qualifying experience. Just make sure it’s a year’s worth of projects 🙂
That’s what I've been doing 👀
Give it a shot. Stand by your hard work 🙂
Heyy there
I guess so
hello
Any pandas active learner out there ? I would like to share some code and also learn from yours!!! 
Hey all I just joined the channel looking forwards to connecting with all of you through this journey
Hello! Just starting on my Kaggle journey and studying machine learning outside of work. Looking forward to making connections 😄.
I'm new to Kaggle too. I want to learn data science with Kaggle, and I'd like to know which topics I need to prioritize.
test
Hi. My name is Marco. I'm new in Kaggle, but very excited already for I hope to improve my ML knowledge.
@fair cliff Could I DM you?
Sure you can, how would I be able to help you?
Thanks, I just need a simple guide about AI and ML
visit my NLP Project
https://medium.com/@smn.acm/bigbasket-products-query-engine-bert-qdrant-718bee72143a
To achieve the goal of creating an NLP Query Engine capable of responding to product-related inquiries on BigBasket, we’ll leverage a…
Kaggle does not allow what they call "self-promotion," which includes links to your own work in discussions. In the absence of being able to share links to notebooks and datasets, how do people get noticed on Kaggle? Competitions are one way, but what about work done not related to competitions?
You can post links to your work on #🔗┊sharing-projects. And also promote it on other Data Science-related channels on Discord.
I assume this is commonly done. Why does Kaggle frown on this type of sharing? It seems that it is incredibly hard to share a notebook or dataset otherwise.
@ivory basin The issue is that the forums get flooded with people spamming their work. We'd rather you focus on making great work and let the hotness algorithm do the work of promoting for you. As mentioned, it's fine to share your work in the #🔗┊sharing-projects channel, but we've had to be strict about stopping people sharing work everywhere since it overwhelms the forums and makes them useless to everyone.
I could definitely see the forums being overrun with spam.
in titanic dataset how is the name column given? is it with place and name?
Hey, I've just started with machine learning courses and will work with the Titanic data. I'd love to talk to a peer, or someone who completed it already 🙂
Hello, all
I wanna build a sign language interpreter. I got the idea from kaggle isolated sign language competition. Now, I don't know where to start, what to learn and where to learn it. I don't understand the code of the notebook in that competition. Do you know any roadmap I could follow on where to start, what to learn and where can I find the resources?
Has anyone ever tried to use a LLM for chunking?
test
Do you guys know any helpful resources for communicating better as a data analyst?
I'm also curious about that. Also, do you have any system/method for recording your process during data analysis and presenting it to others (including non-technical co-workers)? 🙂
The Storytelling with Data podcast is really good. https://podcasts.google.com/feed/aHR0cHM6Ly9zdG9yeXRlbGxpbmd3aXRoZGF0YS5saWJzeW4uY29tL3Jzcw?ep=14
Rid your world of ineffective graphs and mediocre presentations, one exploding 3D pie chart at a time! The storytelling with data podcast from bestselling author, speaker, and workshop guru, Cole Nussbaumer Knaflic and the storytelling with data team covers topics related to better business communications, data storytelling, and knockout present...
I also just saw this in Nature.
Exciting News for Badminton Enthusiasts! And Data science people 🥳
I've compiled a comprehensive Badminton World Federation (BWF) Rankings Dataset featuring Men's and Women's Singles, Doubles, and Mixed Doubles.
Dive into the world of shuttle supremacy!
Check out the dataset on Kaggle:
https://www.kaggle.com/datasets/mayuriawati/bwf-badminton-rankings-singles-and-doubles
Let's explore the stats and trends together!
Feel free to share and discuss.
Happy analyzing 😎
test
Hey my Kaggle buddies, just want to ask if anybody has joined the following competition: https://www.kaggle.com/competitions/playground-series-s3e24/discussion/450325 Maybe we can exchange ideas and improve our models, thank you 🙂
Playground Series - Season 3, Episode 24
No, I feel like presenting will be most difficult part of my job cuz I’m introvert
Thank you 🙏
Any data analyst here
same 😦
is kaggle down?
Is kaggle down?
yes
Oh
Kaggle down..
It's back online
Probably down again?
noo, it is active. I run a notebook now
i find it's slow
I’m also running notebooks but I can’t browse kaggle, some buttons in notebook aren’t working as well
Sorry for the outage/slowdown everyone, we had a new code change which had some unexpected consequences, it should be all fine now and hopefully we can figure out what went wrong!
Hi everyone, can someone recommend a good dataset/contest for a person that is just getting started on kaggle
can anyone tell me how I can tell if a dataset is beginner friendly or not?
Titanic, space titanic are great. Also I love the playground series once you get good at the basics
Between AWS and Azure, which of the two would you guys say has the most valuable Certifications in Data Science & AI?
Getting Started Competitions are probably your best options.
Thank you just finished the tutorial! It was helpful to at least know where all the buttons are
tyvm
The one you’re trying to get a job at
space titanic was more fun for me because there's more data, and you can't cheat (knowing who survived in the original titanic crash)
Excellent, I have my next target now. Can you please share a few more for me
I want to make a list and go through them one at a time
1. Titanic, 2. Space Titanic, 3. Playground series
I'm a big fan of the playground series because they're beginner friendly and are more competitive
you can earn swag and learn a lot
and I find the community in the playground series is really focused on learning and teaching
Thank you, I saved them
Hello, I'm just joining as member. I want to learn ML and DL. 
Hello, I just joined to the server
What does earning "swag" mean?
Like merch. Check em out
Hello Shane Simon, I am new to Kaggle. I see that you are an expert; how much time did it take you? What do you recommend I do to become one?
Also, what else do you think would help me get a job in data science? Many people recommended Kaggle to me as a place to start and practice my ML abilities, but they also say that because of the way Kaggle works I'm not doing many real-world things. So what do you think would help me?
Hi, it's Great to see you all, I just joined Kaggle and would love to learn ML from you all.
hey everyone, I was wondering is it possible to fine tune llava on kaggle 24 gb gpu
Hello, just joined kaggle and looking forward to join teams for Competitions.
Hello
Hello, just joined kaggle and basically for learning
currently trying to learn web scraping so I can start my data analytics journey, but I'm having trouble getting the href from a table. Any help would be appreciated. Here's the code:
from bs4 import BeautifulSoup
import requests
import pandas as pd

base_url = 'https://www.basketball-reference.com'
response = requests.get(base_url)
soup = BeautifulSoup(response.content, 'html.parser')
tables = soup.find_all('table')
if len(tables) > 1:
    first_table = tables[0]  # First table
    second_table = tables[1]  # Second table
    # Now you can process these tables

# Assuming team names and URLs are within <a> tags inside a specific table
# You would need to update 'your_table_id' with the actual ID of the table on the website
east_table = soup.find('table', {'id': 'all_confs_standings_E '})
west_table = soup.find('table', {'id': 'all_confs_standings_W '})

# Dictionary to hold teams
teams = {'East_Conference': [], 'West_Conference': []}
for link in east_table.find_all('a', href=True):
    team_name = link.text
    team_url = link['href']
    teams['East_Conference'].append({'name': team_name, 'url': team_url})
for link in west_table.find_all('a', href=True):
    team_name = link.text
    team_url = link['href']
    teams['West_Conference'].append({'name': team_name, 'url': team_url})

# Now you have a dictionary with lists of team names and URLs which you can iterate over
print('West_Conference Teams:')
for team in teams['West_Conference']:
    print(team['name'], team['url'])
error is 'NoneType' object has no attribute 'find_all'
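For anyone hitting the same error: `'NoneType' object has no attribute 'find_all'` means one of the `soup.find(...)` calls returned `None`, i.e. no table with that exact id was found. The trailing space inside the id strings is a likely reason, since the id has to match exactly. Below is a minimal sketch of the idea: normalize the id, and return an empty result instead of crashing when the table is absent. It uses only the standard library's `html.parser` rather than BeautifulSoup so it runs without installs. The sample markup, table ids, and team URLs are made up for illustration and are not taken from the real page.

```python
from html.parser import HTMLParser

# Hypothetical sample markup standing in for the real page.
SAMPLE_HTML = """
<table id="confs_standings_E">
  <tr><td><a href="/teams/BOS/2024.html">Boston Celtics</a></td></tr>
  <tr><td><a href="/teams/MIL/2024.html">Milwaukee Bucks</a></td></tr>
</table>
<table id="confs_standings_W">
  <tr><td><a href="/teams/DEN/2024.html">Denver Nuggets</a></td></tr>
</table>
"""


class TableLinkParser(HTMLParser):
    """Collect name/url pairs for <a> tags inside the table with a given id."""

    def __init__(self, table_id):
        super().__init__()
        self.table_id = table_id.strip()  # a stray space would never match
        self.in_table = False
        self.current_href = None
        self.links = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "table" and attrs.get("id") == self.table_id:
            self.in_table = True
        elif tag == "a" and self.in_table and "href" in attrs:
            self.current_href = attrs["href"]

    def handle_endtag(self, tag):
        if tag == "table":
            self.in_table = False
        elif tag == "a":
            self.current_href = None

    def handle_data(self, data):
        # Only keep text that sits inside a link we are currently tracking.
        if self.current_href and data.strip():
            self.links.append({"name": data.strip(), "url": self.current_href})


def team_links(html, table_id):
    """Return the links found in the table, or an empty list if it is missing."""
    parser = TableLinkParser(table_id)
    parser.feed(html)
    return parser.links


east = team_links(SAMPLE_HTML, "confs_standings_E ")  # trailing space is stripped
missing = team_links(SAMPLE_HTML, "no_such_table")    # empty list, no crash
print(east)
print(missing)
```

With BeautifulSoup the equivalent guard is an `if east_table is None:` check before calling `.find_all`, plus printing the ids of the tables that `soup.find_all('table')` actually returned to see what is there.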
hey, does anyone here know how to fix the nltk (urlopen) error?
Me too, but I'm practicing my speaking skills... it gets easier this way.
It's very difficult and a bother, but not impossible.
help me
Hello everyone, Good afternoon
hi
Hi Everyone!
Can anyone tell me how to get started on Kaggle ?
and start doing projects ?
HI guys!
My college is conducting a hackathon in February and I'm in the marketing OC!
If anyone can help me out to get any potential collaborator please help me out!
If you guys have any question please DM me:)
Does kaggle sponsor by any chance?
Hi guys, I am a university student who is interested in learning data science
Hello all ! I am a 2nd year college student pursuing data Science
Hello, everyone! My name is Rene and I am a recent master's graduate in Applied Statistics & Data Science. I'm looking forward to collaborating and working on projects within this community. Have a great day!
Hi guys, my name is Joi and I'm here to learn about data science.
Hi, My name is Milena and I would like to learn about machine learning models.
Hi everyone. I'm looking forward to finding potent people who want to learn Machine learning and want to make a good team.
Anyone up?
hello, everyone. My name is Ifull and i'm here to learn about data science and its fields
Hi, My name is Valentina and I would like to learn about machine learning models.
I'm also looking forward to collaborating and working on projects within this community. Have a great day!
Hi all, my name is Jake Knotek and I'm looking to practice and apply my data science coursework to problems and learn more about machine learning, as well as learn more about the career opportunities out there for me to pivot to.
Holla everyone, Dumebi here. i'm looking to learn about Data Science to pass classes and as a possible career field, thank you.
Maybe the staff or mods could create a machine learning channel here, so you guys can communicate and share with each others projects
However I can’t ping mods or staff cuz it’s against the rules
Does anyone know a ml model that we can use to count the number of people in a photo? Like a model from hugging face to demonstrate the impact of ml
Hi I am learning about Machine Learning. I would like join a team.
Hi, everyone. I'd like to learn more about AI.
Hi everyone, I'm a CS undergraduate student, and I'm here to learn more about ML with you guys. Hope we can have fun and sign up for some fun ML competitions together in the future.
Hey everyone! I'm a CS student, super keen to explore the exciting world of ML together. Let's have some fun learning and maybe even join some cool ML competitions down the road!
what about yolov5? its pretrained models can already do simple detections, like detecting the people in a photo. all you need to do is write a simple program that counts the number of people in its output
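The counting step on its own can be quite small. Here is a minimal sketch, assuming the detector's output has already been converted to a list of dicts with `label` and `confidence` keys. That format, the function name, and the 0.5 threshold are hypothetical; real detectors each have their own output structure that you would map into something like this.

```python
def count_people(detections, min_confidence=0.5, label="person"):
    """Count detections of the given label at or above a confidence threshold."""
    return sum(
        1
        for det in detections
        if det["label"] == label and det["confidence"] >= min_confidence
    )


# Made-up detections for one photo.
detections = [
    {"label": "person", "confidence": 0.91},
    {"label": "person", "confidence": 0.34},  # below the default threshold
    {"label": "dog", "confidence": 0.88},     # not a person
    {"label": "person", "confidence": 0.77},
]

print(count_people(detections))
```

Lowering `min_confidence` counts more of the uncertain boxes, so it is worth checking a few photos by eye before trusting the number.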
is anyone else checking every day if santa 2023 is here
Ohh thanks I am just a new learner😅
It's okay, I’m also a new learner in ML, let’s make progress together!
Yeah that's great, all the best buddy
anyone having trouble saving their gpu notebooks? is it for everyone ? i mean the notebooks run fine with gpu i just can't save them for submissions...
its just the P100, t4 is fine
i can live with that for now XD
Hey welcome i would recommend you to try the Titanic dataset competition if you know all the basics
Hi Everyone. TidyMind from UK, England. I'm working on the Titanic Competition on Kaggle. Anyone else currently working on it?
Hi. From korea. Studying Data analysis
Hi everyone. I would like to join a team for a competition.
Hello, I’m a chemist looking to learn more about machine learning and possibly make a career transition in the future and I’m excited to be here and be a student! I just started working on the Spaceship Titanic competition.
Hi, I'm a career transitioning millennial, recent software development BAS grad, and prior longtime bartender. I'm here following the progression to learn more about ML etc.. and whatever else I find. Starting with the Titanic competition. Thanks for having me.
Hi everyone! I'm a data analyst currently but trying to take the next step. Looking forward to learning after just finishing my undergrad in Information Systems and Quantitative Methods last week at the ripe age of 31!
hey @everyone, I got an RMSE score of 0.4 using this notebook. Quick question: how can I improve it?
https://www.kaggle.com/code/ayeshairshadcoder/house-price-prediction-competition/
Is it the case that the lower your score, the better your model is on Kaggle? Because on the leaderboard, people with lower scores are ranked higher.
yea
i assume you're new to this
the score, in general, is basically the value of a loss function, which you can think of as a metric for how much the model's predictions deviate from the actual data
you may wonder why other metrics aren't used, like accuracy for classification or R^2 for continuous predictions
but i think it comes down to the fact that a loss function is more fine-grained: it measures how far off each prediction is, not just whether it was right
Hi, newbie here. nice to meet you all.
Hi all, Kamal here. Trying my hand at Kaggle competitions after a long hiatus
Hi all! just starting with the Kaggle community!
Is there any YouTube videos/channels showing a "real world" analytics project from beginning to end?
Hey everyone check out my new blog post on towards data science community.
Exploring the Realms of Textual and Visual AI with Google’s Latest Offerings
looking for 3 members to work on few competitions ❤️
Hello guys! I'm Barney and I am a novice in the Data Science community. I can't wait to interact and learn.
ya i realised different competitions use different evaluation metrics, some use a loss function, some use accuracy n stuff
yea, so whether a lower score means a higher rank on the leaderboard depends on what kind of evaluation metric is used in the competition
Hello, I am new here, I don't know if it's the right channel to ask this, but does anyone know if there is a database that matches DNA with faces?
Some even use MedAE (median absolute error), which only looks at the median of the absolute errors.
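To make the "lower is better vs. higher is better" point concrete, here is a small sketch of three common metrics in plain Python. RMSE and median absolute error are error metrics (lower is better), while accuracy is a hit rate (higher is better). The function names and sample numbers are made up for illustration; each competition's evaluation page states which metric it actually uses.

```python
import math
from statistics import median


def rmse(y_true, y_pred):
    """Root mean squared error: lower is better, large misses are penalized more."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def median_absolute_error(y_true, y_pred):
    """Median of the absolute errors: lower is better, robust to a few big misses."""
    return median(abs(t - p) for t, p in zip(y_true, y_pred))


def accuracy(y_true, y_pred):
    """Fraction of exactly correct labels: higher is better."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Regression example (made-up values).
prices_true = [3.0, 5.0, 2.0, 7.0]
prices_pred = [2.0, 5.0, 4.0, 7.0]
print("RMSE:", rmse(prices_true, prices_pred))
print("MedAE:", median_absolute_error(prices_true, prices_pred))

# Classification example (made-up labels).
labels_true = [1, 0, 1, 1]
labels_pred = [1, 0, 0, 1]
print("Accuracy:", accuracy(labels_true, labels_pred))
```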
Hello I'm trying to load some R packages to work on my kaggle notebook but it tells me the following:
Why is this? How can I fix it?
I've tried to do install.packages("StatMatch") but I guess I can't download packages
hey everyone check out my new blog post. Feel Free to share your feedback.
Has anyone done a project related to question generation?
Kaggle is the world’s largest data science community with powerful tools and resources to help you achieve your data science goals.
i am trying to open this competition on my pc
but i keep getting an error
but all my friends can open it normally
Hey any signals processing experts ?
Exciting news, everyone! I've just shared a new blog post on my Medium for the data science community. Please read it and share your feedback.
https://jillanisofttech.medium.com/mastering-pandas-24-essential-functions-for-data-science-mastery-89c50a9f94ab
Unlocking the Power of Pandas 🐼: A Comprehensive Guide for Data Science, Machine Learning, and Advanced Data Analysis Techniques
If you are into kaggle and chess, you may appreciate the kaggle chess pieces I have recently designed. If you are only into kaggle, then just take it as a piece of modern art 🙂 https://www.kaggle.com/discussions/general/462252
Kaggle inspired chess pieces.
Hi I am Dani. I have recently decided to learn about data science.
I don't know exactly how this works
do you have any suggestions related with linear regression projects?
I'm Keerthi, diving into the awesome world of data science! Super excited to learn and explore more.
Hi I am basit , I would like to connect with new people
Hello, everyone. I have a question for you. I want to develop a marketing prediction and analysis chatbot using IBM Watson. For this, I want to integrate an LSTM model with the IBM Watson chatbot. Is it possible? If so, could you let me know how? Thanks for your attention.
Listen, I'm just a beginner, but what I did is just predicting a linear function with added noise, then an exponential function with added noise, both with Pytorch. That's a cool start. Also, the titanic competition could be a good start if you intend to study generally ML and not just regressions.
Hey Dani! Nice to meet you. It's awesome that you're diving into data science. For a beginner in linear regression, I'd recommend starting with a simple project. You could try predicting something like house prices based on features like square footage, number of bedrooms, etc. There are plenty of datasets available online for this. It'll give you hands-on experience and a better understanding of how linear regression works. Feel free to ask if you need more guidance!
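As a concrete starting point for a first linear regression project, here is a minimal sketch that generates a noisy line and recovers its slope and intercept. It uses closed-form least squares in plain Python instead of PyTorch or a real housing dataset, so the data, the true coefficients (2.0 and 1.0), and the noise level are all assumptions made up for the example.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares fit for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Synthetic data: a known line plus Gaussian noise (fixed seed for repeatability).
rng = random.Random(42)
xs = [i / 10 for i in range(100)]
ys = [2.0 * x + 1.0 + rng.gauss(0, 0.1) for x in xs]

slope, intercept = fit_line(xs, ys)
print(f"estimated slope={slope:.3f}, intercept={intercept:.3f}")
```

Once this works, swapping the synthetic `xs` and `ys` for one real feature and target (for example square footage and price) is a natural next step.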
Hi guys, pls check my new blog post on CTEs and sub queries and comment on it https://medium.com/@i.raymon57/cte-vs-subqueries-28ee34104611
hey I was wondering if anyone had good services to do the following :
I want to have a server I can use with ssh. I want it to be rather powerful with at least 8 cores, and if possible a gpu and 32gb of ram. The main thing I also want is for it to be a "pay as you go" service as the ones I find are very expensive. I would only use it a couple of hours a day at most.
I saw AWS EC2 had a similar thing but you have to manually do snapshots of your session to get the data back which I don't like.
Any options ?
oh budget would be like 20-30 dollars per month, not sure if it's feasible.
Hey everyone! From India. I'm here to learn Machine Learning and Data science to use that in healthcare field.
Thank you @umbral belfry and @proper depot for your advice ! I am really excited to learn!
Hi Everyone, very basic question that's not specific to my learning competition, "Spaceship Titanic".
I have tested and played around with the results and I am ready to submit
Do I merely write my predictions (the ones I would normally compare against the actual results) to a CSV in the same format as the sample and hit submit?
Is my code itself reviewed at all? Do I need to print the accuracy and show comments?
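For CSV-based competitions, the submission is just a file with the same header and row ids as the sample submission. Here is a minimal sketch of writing one with the standard library. The column names `PassengerId` and `Transported` are assumed to match the Spaceship Titanic sample file, and the ids and predictions are made up; check the competition's own sample submission for the exact format.

```python
import csv
import io


def write_submission(rows, out, id_column="PassengerId", target_column="Transported"):
    """Write (id, prediction) pairs as a two-column CSV with a header row."""
    writer = csv.writer(out)
    writer.writerow([id_column, target_column])
    for row_id, prediction in rows:
        writer.writerow([row_id, prediction])


# Made-up ids and predictions standing in for a model's output on the test set.
predictions = [("0013_01", True), ("0018_01", False)]

buffer = io.StringIO()  # in-memory file so the example has no side effects
write_submission(predictions, buffer)
print(buffer.getvalue())
```

To write a real file, pass `open("submission.csv", "w", newline="")` as `out`. Printing accuracy or adding comments is for your own benefit and for readers of your notebook.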
Greetings
hello
hello, i am preparing portfolio for data analyst. I want my skill set to be Excel, SQL, Python and Tableau. I am looking for project ideas that can actually land me a job
Hello I'm doing a project related to evolutionary computing and Im practicing with santa-2023
Hey everyone, new to the discord. How's everyone doing?
Hi everyone, does anyone have any suggestions for choosing the most relevant features for the Titanic dataset? What I did was plot several histograms of the different variables. Then I decided that the relevant variables were the ones that had the largest difference.
I have not submitted yet.
Since I am new, I don't know whether the effectiveness depends on the model or the variables.
Thank you
And have a happy Christmas!
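One way to put a number on "the variables with the largest difference" is to compare the survival rate across the groups of each feature. The sketch below does that in plain Python on a tiny made-up sample; the rows, the `rate_by_group` and `spread` helpers, and the idea of ranking by max-minus-min are illustrative assumptions, not an established feature selection method.

```python
from collections import defaultdict


def rate_by_group(rows, feature, target="Survived"):
    """Mean of the 0/1 target within each value of the feature."""
    totals = defaultdict(int)
    positives = defaultdict(int)
    for row in rows:
        totals[row[feature]] += 1
        positives[row[feature]] += row[target]
    return {group: positives[group] / totals[group] for group in totals}


def spread(rates):
    """Gap between the best and worst group; a bigger gap suggests more signal."""
    return max(rates.values()) - min(rates.values())


# Tiny made-up sample in the shape of the Titanic training data.
rows = [
    {"Sex": "female", "Embarked": "S", "Survived": 1},
    {"Sex": "female", "Embarked": "C", "Survived": 1},
    {"Sex": "male", "Embarked": "S", "Survived": 0},
    {"Sex": "male", "Embarked": "C", "Survived": 0},
    {"Sex": "female", "Embarked": "S", "Survived": 0},
    {"Sex": "male", "Embarked": "S", "Survived": 1},
]

for feature in ("Sex", "Embarked"):
    rates = rate_by_group(rows, feature)
    print(feature, rates, "spread:", spread(rates))
```

On real data both the features and the model matter, so it is worth comparing a couple of feature sets with the same model before drawing conclusions.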
Hey everyone, new to discord. Hope everyone's doing great.
Hey guys, I have a question. Does anybody have a good source for learning about the variables that go into a model ( stuff like learning rate, number of estimators...etc), and how they affect the model exactly?
I learned that stuff from Andrew Ng's Stanford machine learning course, though you can always google stuff
Thanks for the answer !
Hey can anyone suggest good resources to learn numpy assuming I have already learnt python?
hello everyone
I have a question about whether I should perform exploratory data analysis (EDA) solely on the training data or on the entire dataset
Hello! You should perform exploratory data analysis (EDA) on your training data rather than on the held-out test data ¹². EDA is the process of analyzing and summarizing the main characteristics of a dataset, often with visual methods ². It is important to analyze the training data thoroughly to form a complete understanding of it and avoid missing any important information ¹².
Please note that EDA is part of the training process and should not be allowed access to test data ¹. Instead, you can split your training data into two and use one part for training and the other to test and tweak your model(s) ¹.
I hope this helps!
Source: Conversation with Bing, 24/12/2023
(1) Is it better to do exploratory data analysis on the training dataset only?. https://stats.stackexchange.com/questions/189678/is-it-better-to-do-exploratory-data-analysis-on-the-training-dataset-only.
(2) Should exploratory data analysis include validation set?. https://stats.stackexchange.com/questions/424263/should-exploratory-data-analysis-include-validation-set.
(3) Exploratory Data Analysis in Python Course | DataCamp. https://www.datacamp.com/courses/exploratory-data-analysis-in-python.
(4) undefined. http://statweb.stanford.edu/~tibs/ElemStatLearn/printings/ESLII_print10.pdf.
(5) en.wikipedia.org. https://en.wikipedia.org/wiki/Exploratory_data_analysis.
I'm doing exploratory data analysis (EDA) on a dataset. Then I will select some features to predict a dependent variable.
The question is:
Should I do the EDA on my training dataset only? Or sh...
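The "split your training data into two" step can be sketched in a few lines. This is a minimal hold-out split in plain Python; the function name, the 20% validation fraction, and the fixed seed are assumptions chosen for the example, and libraries such as scikit-learn offer their own helpers for the same job.

```python
import random


def train_validation_split(rows, validation_fraction=0.2, seed=0):
    """Shuffle a copy of the rows and hold out a fraction for validation."""
    shuffled = list(rows)                  # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)  # seeded for a repeatable split
    n_validation = int(len(shuffled) * validation_fraction)
    return shuffled[n_validation:], shuffled[:n_validation]


rows = list(range(10))  # stand-in for ten training examples
train, validation = train_validation_split(rows)
print(len(train), "training rows,", len(validation), "validation rows")
```

EDA and model tuning then look only at `train` and `validation`, and the competition's test set stays untouched until submission.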
And is there a way to run a basic cpu only session 24/7
Are the channels under Getting Started set for auto-deletion or it's just that dead
hello everyone, I just finished spaceship titanic, any other competitions with a similar skill level or slightly harder would be appreciated. I have a bit of a hard time navigating the site and finding suitable contests
im in the same boat rn
Try out any of the Getting Started or Playground category of competitions - depending on what skills you want to focus on.
Hello! I'm new to Kaggle and ML. How should I start building this titanic project?
Hello!! I am new to Kaggle .how should I start building a project
tyvm, thats helpful
is it safe to assume that they're all more difficult than spaceship titanic?
is this one considered beginner friendly?
https://www.kaggle.com/competitions/store-sales-time-series-forecasting/
Use machine learning to predict grocery sales
Yep! All the competitions in the getting started section are beginner friendly - that's a great next step.
damn, it seemed quite a bit harder than spaceship titanic, but maybe I just need to organize data a bit
Would anyone be interested in doing a deep learning project with me it will be a self dataset building DL ai dm me if interested 😁
Hi Everyone, My name is Sanket. I am from India. Looking forward to contribute in energy domain.
I would love to team up with someone to do the Spaceship Titanic competition.
I am in
Hello!
I am a BTech student in AI at Kathmandu University in Nepal. I am only in my first semester for now, but I have been following different Kaggle events for about a year. I have been waiting for a good time to start on Kaggle and now I think it is the time. If there is anybody I can assist in a Kaggle competition, please DM me.
Hello everyone!
I’m passionate about all things data and eager to dive deeper into the world of analytics. I have a background in business management, with a rich blend of experience in areas such as Data Analytics, Product Management, and Marketing. I’ve joined the Kaggle community to learn, share, and collaborate with like-minded individuals.
I’m particularly excited about exploring diverse datasets, engaging in thoughtful discussions, and improving my skills through challenges and collaborative projects. I’m here to learn and grow, so please feel free to reach out for collaborations or just a chat about data.
Looking forward to interacting with you all!
wsg [Your Name]
Hello
Happy New Year! Here is small gift from Santa: Kaggle holiday puzzle https://www.kaggle.com/general/464218 🙂
Holiday Kaggle puzzle.
Good day to everyone, I need help please. I find it difficult to comprehend new topics in data science. Please is this normal?
I find passion in data but I get lost when handling new knowledge.
What can I do? I'm a beginner data scientist.
Having the same issue. I'm learning the fundamentals of coding in Python (variables, functions and stuff) and I'm still confused about where I should go next
Hello all,
I'm just a beginner in this field. I'm facing a problem or, in simpler words, I'm stuck in a loop.
I'm pretty well aware of the theory and conceptual knowledge required for Python, Kaggle, maths, ML and all, but I'm not able to put things together to build my FIRST ML MODEL. Can any of you help me out with this?
bro kernels are down again or what? long queue wait on all(gpu/cpu/tpu) i was trying to run a sweep .. : (
Hello, everyone
Hello everyone!
Hey 👋
Hi, I'm a masters in cs student with a focus on machine learning. I'm excited to get started on some kaggle competitions to learn more about the field and hopefully solve some interesting problems.
What do you folks use for data scraping? I'm using Beautifulsoup and I'm finding it difficult to find sites which don't use JS to load the styling of the websites
You can try Selenium.
hello everyone, newbie here tyring to get into the world of DS. happy to help out with any projects or collaborations, very eager to learn and upgrade myself.
Hello everyone! 🌟 I’m thrilled to announce that I’m broadening my online horizons by launching new content on both YouTube and TikTok. My focus will be on unraveling the fascinating world of Machine Learning and Data Science. I’d be absolutely delighted to have your support and engagement on this new journey. Expect a blend of insightful tutorials, deep dives into complex topics, and a bit of fun along the way! So, come join me and let’s explore these amazing fields together.
YouTube: https://youtube.com/@KarnikaKapoor?si=I4pEWFG8UyM2iomL
TikTok: https://www.tiktok.com/@karnikakapoor?_t=8imzU31Ddju&_r=1
Can’t wait to connect with you all on these platforms! 🚀 #DataScienceCommunity #MachineLearningExplorations
Embark on an AI and Data Science odyssey with a Kaggle Grandmaster! 🚀 My channel is a vibrant hub where curiosity drives learning. Expect to unravel AI mysteries through bite-sized, insightful videos, dive deep with comprehensive Python and ML tutorials, and witness data science transforming the real world. 🧠💡 Whether you're beginning your journ...
hey thats really cool
been looking to get into competitions too but dont think im at that level yet
i know it looks intimidating and it is but
start
you will learn along the way
the study-first path is outdated and absurd, like maybe not stupid, but it's not optimal for sure
one recommendation i'd give you is to start looking at other people's notebooks
im studying rn
but first making a presentation cuz i forgot its due tmrw
and try to approach it yourself
from where if you dont mind me asking
in kaggle inside the competition
there are different sections
each competition has a forum for discussions
and notebooks section
where kind people share the way they've approached it
really? alr ill check it out
i've learned the basic models
i'll see what others have done
Howdy, I'm new to Kaggle and DA. I'm a self taught developer that managed to change careers into web dev through self study alone. I have become interested in DA and am currently studying to obtain the Power BI Analyst certification. I'm currently looking out for datasets that I can practice cleaning in excel and Power BI.
I'm also really interested in sports analysis and would love to get into predictive analytics in the future. Anyway, glad to be here!
Hello
Hi! I’m new to Kaggle, and don’t know how to begin. This is super confusing. 🫤 I can't even understand what to do for the Titanic prediction. Do I grab an image from a website and copy and paste? Or do I put the word life vest in as code? The only reason I joined is because I want to create an app for kids and adults, and learn something new… Any recommendations? Thanks
Hey there, fellow data enthusiasts! 👋
I'm a budding data science enthusiast who's absolutely passionate about diving into datasets and uncovering insights. I'm eager to learn and grow alongside like-minded individuals who share my curiosity for all things data-related. I believe that together, we can embark on an exciting journey of exploration and discovery. Let's collaborate and make our data science journey an amazing one! 🌟
Look at other people's notebooks and understand the code. If they don't have a perfect score, keep studying the problem, compare other notebooks and try for a better score.
Learning MySQL and Data Science with python rn
Long term goal is Deep Learning and maybe a little bit knowledge in business world
Currently know about stocks and do a little bit of trading
Any Advice for me?
PS I am really looking for like-minded peeps with some motivation and maybe looking to start-up a business in future?
keep up the good work, stay disciplined and try to focus on not too many things
Read "A Random Walk Down Wall Street" if you haven't already and also don't make a Quant start-up because there's heavy market saturation with ~70% of all market volume being done by bots.
Hi everyone, I'm new to the ds's adventures)
Have a nice new year to everyone!
Hi there! I can highly recommend the learning section on kaggle. Here's the link: https://www.kaggle.com/learn. If you select the grid view (right hand side of the screen) it will show you the relations between courses (i.e. which course the shown one builds on).
If you have no experience at all with either ML or programming, I'd recommend youtube as a good source for all levels of expertise, as well as the book 'How To Design Programs' available for free here: https://htdp.org/ (hope it's okay to post the link). But keep in mind, the book does not use python but it's a great start to learn all concepts of programming so far.
are there no voice channels?
No voice channels I'm afraid, as we aren't able to moderate them. Harder to get approval for something like that in the corporate world - thanks for understanding.
Hello there. I have been away from the data science space for a while and I am currently making my way back. I hope to engage with the Kaggle community, grow as a competent data scientist, and make some friends along the way.
Are there any practice problems/contests on kaggle (beginner friendly) where u can submit as many times as you want? I was having fun experimenting with Titanic contest but was hit with the 10 daily submission limit... 😢
no problem thank you for reaching out. just wanted to make sure I wasnt being blind and missing one
Thank you for posting that site for the programming book, its very helpful!
You're welcome, glad I could help! I like the book very much, it helped me more than most of the other books. I can also recommend the 'Head First' books from O'Reilly. The concept is to have fun while learning, and it works! There is also a Python one (and DS and statistics and so forth...): https://www.oreilly.com/library/view/head-first-python/9781492051282/
Thank you, I will check those out also, every little bit helps.
You're welcome! Have fun!
👋 Hi Kaggle Community!
I'm Harry, deeply interested in sports analytics, especially in soccer/football. 🥅⚽ I'm currently developing an ML model using xgboost to predict the number of goals in a match.
📈 So far in my model:
Implemented mean target encoding for teams.
Used rolling averages of each team's goals scored and conceded.
Extracted date-based features.
I'm eager to connect with others who are passionate about sports analytics and machine learning. I am keen to hear suggestions that could enhance the accuracy of my model.
If you're into data-driven sports predictions or have experience in this arena, I'd love to chat!
https://www.kaggle.com/code/harrycarson11/predicting-home-goals-in-epl-soccer-football/notebook
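For readers new to these feature types, here is a small sketch of two of them: a rolling average over a team's previous matches and a few date-based features. It is not taken from the linked notebook. The helper names, the window of 3, and the choice to exclude the current match from its own average (so the feature cannot leak the result being predicted) are assumptions made for illustration.

```python
from datetime import date


def rolling_mean_before(values, window):
    """For each match, average the previous `window` values (current one excluded)."""
    result = []
    for i in range(len(values)):
        previous = values[max(0, i - window):i]
        result.append(sum(previous) / len(previous) if previous else None)
    return result


def date_features(match_date):
    """Simple calendar features derived from the match date."""
    return {
        "month": match_date.month,
        "weekday": match_date.weekday(),      # Monday is 0, Sunday is 6
        "is_weekend": match_date.weekday() >= 5,
    }


# Made-up goals scored by one team in five consecutive matches.
goals_scored = [2, 0, 1, 3, 1]
print(rolling_mean_before(goals_scored, window=3))
print(date_features(date(2023, 12, 30)))
```

The first match has no history, so its rolling value is `None`; how to fill that gap (league average, previous season, dropping the row) is a modelling decision.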
Hello everyone, im new to data science
Any good lightweight object detection model, that detects humans
https://paperswithcode.com/task/real-time-object-detection
Also whatever the flavor of the month YOLO is, so far as I know YOLO-NAS is latest.
Real-Time Object Detection is a computer vision task that involves identifying and locating objects of interest in real-time video sequences with fast inference while maintaining a base level of accuracy.
This is typically solved using algorithms that combine object detection and tracking techniques to accurately detect and track objects in...
Hi all, needed a bit of advice regarding training DCGAN on a custom limited dataset, please could someone guide me?
Hi guys, I'm new to data science. Hope I can learn a lot with u.
I'm working on a project to query & visualize free text from natural language datasets (query text, visualize topic distributions, etc). Basically, make working with free text as easy as working with tabular data in a traditional DB.
If you want to query free text, how would you want to query?
(A) A search bar (e.g. "search all text that talks about dinosaurs")
(B) A sql query by vibes (e.g. "select sentiment from chats where topic = dinosaurs")
(C) Other
Both, and sounds like a good project! If you can only pick one, I'd prefer the query; that way you can enumerate options in a dropdown.
Thanks!
I'd prefer a query, but maybe we're looking at it from a programmer's perspective, what would someone who's just accessing this prefer?
My notebook is getting stuck during compilation and the CPU usage is showing 100%
Then after some time the page becomes unresponsive
Pls help!
🌈✨ Excited to announce my FIRST Udemy course, now FREE for a short time! It's not just any course—it's your tech toolkit, complete with revision cheat sheets to make learning stick! 📈
🔗 Enroll for FREE: https://www.udemy.com/course/beyond-coding-tools-practices-for-coders-and-data-analysts/?referralCode=26978D8EEFB45414316A
Why join?
🖥️ Command the command line!
🤝 Master Git & GitHub for seamless collaboration.
🚀 Utilize GitHub Codespaces with ease.
💼 Foster essential coding project teamwork skills.
📝 Filled with handy revision cheat sheets & more!
Make the leap! Enroll, review, and DM me your valuable feedback! 🌟
🔗 Begin your learning journey: https://www.udemy.com/course/beyond-coding-tools-practices-for-coders-and-data-analysts/?referralCode=26978D8EEFB45414316A 🎓
Hi guys, I am an aspiring data scientist, I look forward to learning from you guys
I'm searching for load datasheet of electrical power systems.
Can someone help me with that?
hello there! i'm a visual effects artist who is trying to shift toward programming and data science
All the best!
hello everyone, looking forward to talking and engaging with you
😅
Hello everyone I have just completed data science course and wish I could learn and practice with you guys.!!!
Hello everyone, transitioning into data science from marketing.
This is a Q&A excerpt on the topic of AI from a lecture by Richard Feynman from September 26th, 1985.
This is a clip on the Lex Clips channel that I mostly use to post video clips from the Artificial Intelligence podcast, but occasionally I post favorite clips from lectures given by others. Hope you find these interesting, thought-provoking, an...
Funny how Feynman doubts what is now a well-known computer vision problem
Hi everyone, looking forward to learning from everyone! 🙂
Ever dreamed of having your own driverless autonomous car? Now you can, even on your Windows PC! Dive into this detailed guide on training an autonomous car locally using AWS DeepRacer, reinforcement learning PPO, and your Windows PC. we will create an AWS DeepRacing training environment that can be deployed in the cloud, or locally on Ubuntu Li...
it will be nice to try
Yo guys, I'm new to data science, all I know is that Python is involved

hi
Hi, is anyone from Ags., Mx. up for creating a competition team and learning together?
Hey. We are working on a project where we need to identify the early stages of pests on a leaf through a phone camera. Any suggestions as to what we should do and how?
Hey all,
Just wrapped up my work on the Books Dataset, where I conducted a thorough Exploratory Data Analysis (EDA) and developed a collaborative recommender model. Check out the notebook here: https://www.kaggle.com/code/faseeh001/book-recommender-system/notebook
Would appreciate any suggestions for improvement or any fresh ideas you might have. Thanks a bunch!
Hello, I am a newbie on discord and kaggle.
If anybody can reply to this..... Would be helpful
Hi everyone,
A lot of people struggle with messy folders and files on their laptop.
pip install leandesk
You can use this module to clean any directory on your laptop
with just one command, and it will cluster all your files into different folders based on their extensions:
clean <folder_path>
Feel free to contribute :
github.com/ibrahim-string/leandesk
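For anyone curious what "cluster files by extension" can look like, here is a minimal standard-library sketch of the general idea. It is not leandesk's actual implementation; the function name `group_by_extension` and the `no_extension` folder name are made up for illustration.

```python
import shutil
from pathlib import Path


def group_by_extension(folder):
    """Move each file in `folder` into a subfolder named after its extension."""
    folder = Path(folder)
    moved = {}
    # Snapshot the listing first, because the loop creates new subfolders.
    for path in list(folder.iterdir()):
        if not path.is_file():
            continue  # leave existing subfolders alone
        # "report.PDF" -> "pdf"; files without a suffix go to "no_extension".
        ext = path.suffix.lower().lstrip(".") or "no_extension"
        target_dir = folder / ext
        target_dir.mkdir(exist_ok=True)
        shutil.move(str(path), str(target_dir / path.name))
        moved.setdefault(ext, []).append(path.name)
    return moved
```

Calling `group_by_extension("some/folder")` returns a dict mapping each extension to the file names that were moved. Try it on a throwaway directory first, since it moves files in place.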
Hello guys, I am new to kaggle. I have experience with basic ML and worked on neural networks. Let's form a team and hop into some competition. Feel free to DM
Hey guys,
Just wrote my first post on the difference between statistics and machine learning! Check out the post here: https://www.kaggle.com/discussions/general/468530
Any suggestions for improvements are welcome!
📈 Why Statistics? The difference between Traditional Statistics and Machine Learning 📉 (Est. 5 min read) ⏳.
hey, does anyone know how I can convert my streamlit app to a react web app?
how about this?
https://stackoverflow.com/questions/76770854/embedding-a-streamlit-app-into-an-existing-react-js-application-through-an-ifram
Thanks to all of you
I ascend to the global champion of the discussion tier today
Hi, are there data analyst projects on GitHub that have been done in a professional setting?
There are all different types of Github projects. I think you need to clarify what you're asking.
I know there are personal projects on GitHub, I’m wondering if anybody has posted a real-life work example
I think that would be kind of rare unless the data is freely available. However, you will find projects that have been applied to real-world scenarios. For example, I have a few projects that are applied to marketing. However, there are companies that have github projects posted. Ex: https://github.com/airbnb
Ok, is it possible for me to take a look at your projects
Oooooh sweet thanx
Are you a marketing data analyst?
I'm a data scientist for a healthcare company but I used to work as a marketing analyst and I still do freelance marketing analytics.
check your message requests. I don't want to blow up this channel lol
Oh sorry
Excuse me, may I ask: what's a recommended book for learning time series with Python, e.g. for finance or econometrics?
- Hands-on Time Series Analysis with Python by B V Vishwas and Ashish Patel: This book covers the basics to the bleeding-edge techniques of time series analysis, using practical examples and data sets. It also introduces the latest packages like fbprophet and pmdarima. It is suitable for beginners and intermediate learners who want to apply time series analysis in various domains.
- The Analysis of Time Series: An Introduction by Chris Chatfield: This book is a classic text that provides a broad overview of time series analysis, including methods, forecasting models, systems, and ARIMA probability models. It also includes examples and exercises, and a free online appendix. It is ideal for students and researchers who want to learn the fundamentals and theories of time series analysis.
- Time Series Analysis by James Douglas Hamilton: This book is a comprehensive and rigorous guide to the concepts and methods of time series analysis. It covers topics such as trend analysis, forecasting, spectral analysis, state-space models, nonlinear models, and Bayesian methods. It is a reference book for graduate students and researchers who want to deepen their knowledge and skills in time series analysis.
I hope this helps you find the right book for your learning needs. If you have any other questions, please feel free to ask me. 😊
Source: Conversation with Bing, 18/01/2024
(1) Hands-on Time Series Analysis with Python - Springer. https://link.springer.com/book/10.1007/978-1-4842-5992-4.
(2) The 7 Best Books About Time Series Analysis | Tableau. https://www.tableau.com/learn/articles/time-series-analysis-books.
(3) Time Series Analysis in Python - Machine Learning Plus. https://www.machinelearningplus.com/time-series/time-series-analysis-python/.
Hi everyone! I did the Python course on Kaggle and I really love that way of learning. I wonder if any of you guys know something similar for Flutter or React or if there's any way to ask for that on Kaggle. Thanks a lot in advance!
Excited to share my latest project on #DataScience! 📊 Leveraging #MachineLearning for meaningful insights. As a #TechEnthusiast, I'm thrilled to unveil my Kaggle notebook focusing on Netflix's Best 🌟: Movie & Series Recommendations! 🎬 #AI
In this project I have shown :
📚 Import Relevant Library
📊 Basic Understanding of Data
🔍 Exploratory Data Analysis
🛠️ Feature Engineering
🧹 Data Preprocessing or Cleaning
❓ Dealing with Missing Values
🏷️ Feature Encoding
📈 Outlier Detection
🎯 Feature Selection
⚖️ Feature Scaling
🤖 Building ML Model
🔄 Automate ML Model
📊 Model Performance Comparison
🎯 Hyperparameter Tuning of Different Models
🔄 Making Stacking Model
📊 Printing Stacking Model Accuracy on Training and Test Data
Check out the Kaggle notebook for this project[https://www.kaggle.com/code/mehedithedreamer/netflix-s-best-movie-series-recommendation] 📗, and please provide your feedback and support! 🚀
#MMM
Please, can anyone recommend a comprehensive beginner book on data science for a friend?
Hi 😊. I am a student pursuing a data analytics course. In the future I would like to become a data scientist
hello there, I'm himu. A CS student.
hello everyone i am mouch, I am learning python on kaggle, I want to develop a skillset in programming along with AI engineering. I would love to meet new people with the same goals as me
Hey there! I am a Cloud Engineer and transitioning my role to ML and Data Science
Nice to have you here
That’s great
Hi! I'm a third year university student & my Business Analytics Course instructor directed me to kaggle! hoping to learn a lot (:
I applied for a job I really like on LinkedIn and I even spent an hour typing a message to the hiring manager, then the job posting got taken down and the hiring manager's profile as well 😂
can somebody tell me when will the next competition begin?
does anyone know how I can start learning GANs?
I have a question
Don't you all feel that just solving questions on kaggle feels like boring work
It's not like u r getting jobs instantly
In data science and machine learning
nah I don't, at least I can explore more there
ofc 🗿
so we need to grind a lot, and that's okay
You need to use the skills you learned thru answering questions and apply it to a real dataset. With that you can show employers what you can do
Hi I am Mohil Wankar, I am a student of AI/ML from India. I would love to connect with you guys on socials
https://bento.me/mohil-wankar
hello
Hi I'm Ferdinand, I'm into AI/ML. I'm a student from Nigeria
It's good to be in the community
hi, i'm kareem https://www.kaggle.com/kareem79. He/him
hi! my name is gaby (she/her) https://www.kaggle.com/mgurango
Has anyone here with no degree or diploma got into data analytics?
hi everybody
Bonjour my friends, I'm a student from China. This is my first time joining such a group, glad to meet you!!!
Hey! I am Ricardo from Chile. I am learning everything about ML and I hope I could receive your help when needed 🙂
Hello everyone ! I'm happy to join the Kaggle community ! 🙂
Hello everyone...I am Glen and I'm new to data analysis, and happy to join the Kaggle community
Hello everyone...I am a student from india, hope we get along 🙂
Hello everyone, I'm Sai, and I have just started learning ML.
Hi everyone, I am arujjwal and am here to learn ML and meet you all
Hello nice pfp
Nice to meet you all 🙂
Nice to meet you too
Hello all 👋
Hi everyone! I hope you're doing well. I have been publishing data science and machine learning projects in Kaggle for some time and have recently attempted to create a portfolio website for these projects. I have published a first version containing my first six projects. Your feedback and advice (in DM or here, wherever you're comfortable) will be much appreciated.
https://sugatagh.github.io/dsml/
Note that the earlier four projects are reported with fewer code snippets, whereas the 5th (Site Energy Usage Intensity Prediction) and 6th (Electron Energy Flux Prediction) projects are reported with almost all code snippets. It's an attempt to showcase the technical side. It would be great if you let me know which approach is more appropriate according to you (between the less code snippets approach and the more code snippets approach) in the context of a portfolio website. Thanks!
Statistics | Data Science | Machine Learning | Artificial Intelligence
Hello all! My name is Arpit and I am here to learn more about business/ data analyst work.
Hi
Hello All, I'm new to here!
Hey, I'm Ahmed, a researcher. I'm happy to join the Kaggle Community. I'm a coder in R and Python. I love ML and data science.
Hi, I'm Michal my interests in computers are very broad but to be more specific, I really like anything that relates to AI.
Hi all,
I'm Deepanshu. Happy to join Kaggle Community.
Hi all,
I'm Rutvik. Interested in AI/Data Science to drive growth in Engineering fields. Happy to join Kaggle Community.
hi
Hello Folks, happy to join Kaggle community.
Nice to meet you 🙂
Could I post my resume here? It won't have my personal information
Hello all, I am new to this community but have been using Kaggle (now a contributor) for a while now.
https://www.kaggle.com/discussions/general/471731
So here is a new notebook I created on Twitch streamers, go have a look!!! Also don't forget to drop your feedback and suggestions for me to improve. Thank you!!!
🎮Analysis on Twitch Streamers dataset!!!.
What final year project ideas could be there for a bachelor's in AI student?
It completely depends on your interests, NLP, CV, RL, GNN, Fairness in AI, etc
Just identify your interests, go through a few papers on those lines in Core A* ranked conferences and see if you can do slight advancements in those
Hello, guys. I started learning ml recently. Here's my last nb. Could you gimme some advice about nb design? https://www.kaggle.com/code/yuriikretov/logistic-regression-heart-disease-risk-prediction
I just finished a python dev course, i went on Kaggle to practice everything i learned. I had a hard time for a lot of stuff but i read a lot of documentation and corrected my code.
https://www.kaggle.com/code/sebastienmotionstats/coviddataanalyzer-class-and-methods/notebook
If you all can just check it out and tell me what you think.
Hi I am Barış. I previously checked some tutorials and worked with openNN before. You can follow me on twitter too https://twitter.com/brs_ai
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=MRcI0G2dWr4
We recently launched a deep integration with Keras! Check out this short video walking through the details with François Chollet (Keras founder) and Meg Risdal (Kaggle Models PM).
About Kaggle:
Kaggle is the world's large...
Hi would anyone be willing to teach me ML using python 😄
hello everyone, i have a question
why doesn't kaggle have excel courses? and when would you realistically be using excel in data analysis and machine learning vs python or sql?
Hi, everyone.
I trained a model to classify real and false news articles. Please, take a look at the notebook and share your thoughts.
https://www.kaggle.com/code/rhythmsage/fake-news-random-forest-99-naive-bayes-93-8?rvi=1
Hi, My name is Darshan. I am new to Data Science
Is ETL or SSIS part of a business intelligence platform?
there is a job I'm interested in and it asks "Formulates relevant queries within BI platforms". it doesn't say anywhere that ETL or SSIS is required
Server suggestion: Move currently active competitions to the top in the channel list and move old ones down? as is you have to really search to find the active competitions
🚀 Unveiling Advanced NLP Techniques! 🧠
Dive into the future of NLP with my latest work on Advanced Semi-Supervised Learning techniques. 🌐✨ Ready to elevate your understanding? Check it out here: [https://www.kaggle.com/code/hunter0007/dataopptracker-semi-supervised-learning] 🚀📈 #nlp #TechInnovation
Also feel free to share and leave your comments - https://www.kaggle.com/discussions/general/472660
Thanks for the feedback Patrick, we have a system that should do this automatically, but I think we've got a bug since competitions aren't moving down to the closed category quickly enough.
i have a general question about feature extraction from biological cell images
is there a good resource for information on how to extract / describe / quantify features such as morphology / shape / relative signal distribution
because I can do segmentation and classification quite robustly but am looking for a more descriptive way to explain my findings to biologists
I built a query engine for filtering free text with SQL semantically. Feel free to use it! I filtered for results related to gardening haha
Hi, My name is Ars. I am new to Data Science ... What is going on here?)
Can you guys check my kaggle EDA and tell me if what i did for the correlation matrix is good ? https://www.kaggle.com/code/sebastienmotionstats/eda-dementia-analysis-motionstats
I feel like everything is good but maybe I'm just clueless and missing something important
hello
everyone
I want to start studying NLP. Are there any good courses or resources for that?
Hi All! My name is Arslan, I am a neuroscience PhD student, excited about HMS competition!
Hello everyone. I'm new to kaggle and data science. I'm glad to be here. I just finished Python. What is next? Titanic or Intro to Machine Learning?
Guys please checkout my new notebook and upvote please, thank you
Hello everyone o/
Could reading business books be beneficial for becoming a data analyst?
hello, for the next 3 months I don't have much time to study machine learning and the other subjects presented on kaggle, but I would really like to start practicing now. If it's not too much trouble, could you tell me what I need to study so that I can start? I quite like solving Codewars as a hobby, and I would like this to become one too. But English is not my first language, so I cannot read kaggle lessons as fast as I would like (many kaggle pages are not translated). Therefore, I would like to know about possible alternatives or a way to translate kaggle pages.
Thank you very much in advance.
Have you tried using google chrome's page translate feature? It's far from perfect but should help a lot
I noticed sometimes that option doesn't always present itself
Unless there's a plugin for it you are referring to
Hello
I have the translation function itself, but let's say when I want to take a lesson in kaggle, the text of the lesson is not translated by Google translator.
Hey guys, @left rock has a question for you all regarding some work, I know he wanted to ping kaggle staff and grandmaster so I will leave him to it!
@left rock you can ask the question you DM'd me about here!
yo lads
does anyone have some good sources for data engineering / data preprocessing learning materials/courses?
Just a general question: a feature in the train set has one fewer label than in the test set (say train has only yes and no, but test has yes, no and maybe, which is present in very few data entries). Can I copy the rows with the extra label to my train set and use domain knowledge to predict it? Its coeff on my target variable is around 0.2, so is it worth doing so much, or can I leave that feature as it is, since my dataset already has around 15-16 features and in the end it doesn't matter that much?
Hello @rare inlet
Check out sklearn's OneHotEncoder, specifically the handle_unknown parameter. Usually it's best to set it to 'ignore'. You should also consider retraining the same model a couple of times with a couple of different random states to measure the uncertainty of the test score due to the randomness of splitting, model training, etc. That way, the rare label might appear in the training set too sometimes. https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html
The rare label does not appear at all in the train set (it doesn't exist) but is there in the test set
Then you cannot use it to train a model. You should only use info which is available in the training set.
The data would be tested more though, only 20% of it was exposed...but I do get your point
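A minimal sketch of the `handle_unknown='ignore'` suggestion above, on toy data (the values are hypothetical and only mirror the yes/no/maybe example from the question). The encoder is fitted on the training rows only, and a category it never saw is encoded as an all-zero row instead of raising an error.

```python
from sklearn.preprocessing import OneHotEncoder

# Toy data (hypothetical): "maybe" appears only in the test split.
train = [["yes"], ["no"], ["yes"]]
test = [["no"], ["maybe"]]

# Fit on the training data only; unseen categories are ignored at transform time.
encoder = OneHotEncoder(handle_unknown="ignore")
encoder.fit(train)

encoded_test = encoder.transform(test).toarray()

print(encoder.categories_[0])  # learned categories, in sorted order
print(encoded_test)            # the "maybe" row contains only zeros
```

With this setting the model simply treats the rare test-only label as "none of the known categories", so nothing from the test set leaks into training.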
Hello y'all. I have recently built a model compression package https://github.com/satabios/sconce. Smack the star button for your boy, your stars go a long way!!! Thanks in advance.
Hi guys, I wanted to know what courses you took to study ML? How did you go from 0 experience to pretty knowledgeable?
and any good recs for courses?
Coursera course from Andrew Ng, still the goat
You have some specialty courses on Kaggle for basic stuff
I wrote a page on Loss Functions! come check it out !
https://www.kaggle.com/discussions/general/475921
⚙️ Loss Function: Easy read for beginners 👀 (Est. 10 mins).
thats rlly expensive tho
Anyone looking to join a team for https://www.kaggle.com/competitions/home-credit-credit-risk-model-stability/ please dm me
Create a model measured against feature stability over time
Can't you audit courses for free on Coursera? Also, if the monthly subscription fee is too much, they offer financial aid I think.
Hey guys !! which projects are the best to add in Kaggle profile for Data Analyst job
i need a team!
I'm a learner of machine learning
For...?
For all you Data Analysts out there, what do you think is the most challenging task of your job
Hi, I'm new here
If you want to learn about hypothesis testing come check out my new post!
https://www.kaggle.com/discussions/general/476391
🕵️🧪 Null & Alternative Hypothesis with example!: Easy read for beginners 👀 (Est. 12 mins).
The question is in what field or domain do you want to work as a data analyst? I.e., finance, biomed, healthcare, tech, etc? That determines what projects are the best to add. Hiring managers are looking for experiences (internships, research and course projects) that are relevant for the specific job you are applying for.
I'm Siva Prasad from Visakhapatnam, Andhra Pradesh. Currently working as a key account manager, but my dream is to become a data scientist.
Am I the only one who finds libraries like optuna annoying? In my opinion Kaggle leaderboards are meant for people who have used eda and feature engineering to the best of their abilities, not for people who have gotten lucky using things like Optuna
Hi, my name is Perry. I'm new in Data Science. Happy to join.
hi
hi
Is the Time Series course a read-only course, or does it have video lectures? https://www.kaggle.com/learn/time-series
Apply machine learning to real-world forecasting tasks.
Kaggle courses are reading and exercises in notebooks - no video lectures.
They can be a great way to dive in or practice skills, but IMO are best complemented with other learning resources like other online courses if you want to go deeper!
👋
Got it. Thank you for the information Myles.
What's the best roadmap to learn data science?
afternoon, I'm looking at getting into data science within the blockchain field
any advice?
Hello everyone,
I'm a developer with 5 years of coding experience, proficient in Web, Blockchain, and AI. Currently, I'm seeking a partner in the US to collaborate with.
What are you working on?
evening
anyone got some advice on my question?
Hi! Here's to a start of a very long journey to become a data scientist!
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=gtOk3PmAqHc
About the project: The main aim of this project is to check the accuracy of different models in differentiating between cats and dogs images.
About Aayushi: Aayushi is a machine learning enthusiast with a demonstrated history of learning about the computer software industry. Skilled in Python (Programming Language), C++, communication, English,...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=kLE-XL-68hw
About the Project: The project aims to give an in-depth understanding of how countries contribute to the global cumulative human impact on climate at the country level. The purpose of this study is to provide a comprehensive analysis of global greenhouse gas emissions from fossil fuels and to correlate these emissions. Comparing CO2 emission tre...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=i67vHwVZPQc
About the project:
The project involved audio processing and deep learning for music genre classification specifically for Indian genres. Urvi explored different methods and libraries for audio processing and compiled their knowledge in this notebook. The project makes use of librosa library for audio processing and feature extraction and later ...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=pVFA1Qp0fLw
About the project:
Rhowena's project is a time series forecasting using different machine learning and deep learning methods including CNN, LSTM and CNN-LSTM. The study evaluates the different approaches available for time series forecasting using metric scores and other evaluation parameters.
About Rhowena: Rhowena is a graduate student in Dat...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=5OSYh5G2BSA
About the project:
The project is on data pre-processing + EDA + RandomForestRegressor.
About Wei: Wei is a current Data Scientist in government.
Connect with Wei: https://www.kaggle.com/emmaweizhang/account?isEditing=False&verifyPhone=False and https://www.linkedin.com/in/emma-zhang-b09855a9/.
SUBSCRIBE: https://www.youtube.com/c/kaggle?sub_...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=br_LowpTpMU
About the project:
This project studies the seasonal trends of the gasoline price in the US and predicts the prices for the period of March 2023 to March 2024.
About Vannia: Vannia's passion for coding took her to the Data Science field and Web Development. She is a Women Techmaker Ambassador and GDG Cloud Edmonton organizer, where ...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=BNLDyEVjsmA
About the project:
This project predicts the outcome of the 2023 Nigerian presidential election by using Twitter data to evaluate the sentiments (negative positive and neutral emotions) of Nigerians towards the election.
A total of 20,000 tweets, 5,000 tweets for each specific hashtag; #Obidatti, #Tinubu, #Atiku and #kwankwaso was gathered fo...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=uJ61SS7bzH8
About the project:
Zilin worked with a large dataset sourced from Glassdoor, containing user reviews and rankings on different positions from various industries.
Their aim was to explore interesting company-wise and role-wise patterns and trends in the job market, predict ratings based on reviews using natural language processing techniques, a...
Hi does anyone have their kaggle competition and projects in their resumes?
I am struggling to see what employers are looking for from these projects
Need help with an internship resume, if that helps
hey
curious: as a beginner, would it be possible to build a small AI as a personal learning project?
Just realized the Kaggle profile UI changed today. I need some time to adapt to it. But overall, it looks modern & much better than the previous one. Thank you Kaggle team 🙌 !
🙂
If you are an expert(just joined today, don’t know who is who), could you please tell me what books or courses you might’ve taken to add to your knowledge.
Hi folks, I'm running a notebook called stable-diffusion-webui-kaggle. It's looking for a tunnel password. I've given it my public IP (I'm on a residential provider's network) and it keeps telling me the endpoint is not connected -- can someone explain?
morning
good morning everyone ^^
That's a good request, I also would like to know, thanks! Maybe there's a list of sources somewhere?
Hey, I am a student pursuing AI in my UG. Can I get some sample resumes that actually pass ATS and have a real chance of landing internships or job offers, so that I can see what to focus on?
hey
Someone answered this question in a different channel. A book @wary dawn gave was “Understanding Machine Learning: From Theory to Algorithms.”
That book is pretty advanced and very math-heavy. If you want to start with something lighter, I'd recommend "An Introduction to Statistical Learning".
Hi all! 🙂
morning
this is gonna be a long but interesting journey
I'm new, is it ok to upload datasets while learning?
Anybody know the best place to learn Microsoft Azure?
I know the Microsoft website has some training, but maybe there are better resources
I recently passed the AI 102 exam. MS Learn is great to study the most important concepts. You can also practice with exam questions on this website www.examtopics.com
Ok, thanx
Can anyone please suggest a good book on Machine Learning for absolute beginners?
I have enrolled in few courses on coursera and also completed the Intro to ML on Kaggle.
But a book would help greatly.
Hello all,
I trust everyone is in good health.
I'm Diksha Aswal, a graduate student at SUNY Binghamton. With approximately three years of experience in the field of Data Science, I'm currently delving into Natural Language Processing. I'm eager to create a personal project in this area and am seeking a collaborator to join me. Together, we can exchange insights and ideas. If you're passionate about NLP and interested in working on a meaningful project together, please reach out to me.
Thank you and Regards.
hello
Has anyone used runpod.io for GPU compute before ?
I'm stuck with an issue regarding CUDA not being detected and it has had me scratching my head for hours. It is with respect to the newest CUDA version released about 3 months ago (yes, it is a new issue)
https://www.kaggle.com/competitions/pii-detection-removal-from-educational-data/discussion/477851
- There is no RunPod discussion about this or about anyone facing the same problem.
- The solutions in public stackoverflow and NVIDIA posts are scarce and do not work at all. I have tried everything there, and the replies to the above thread.
- ChatGPT hallucinates a response, and even if I prompt it to research, it could not give a workaround that makes sense. Neither can Gemini Advanced solve this.
Develop automated techniques to detect and remove PII from educational data.
French, for example?
hi
Hi ML girls and boys. This is my first step in the ML world. I wish all of you good luck!
Hey
Hi,
I started down this path last year. A former C/C++/Java developer ... with a small amount of prior Python exposure was my starting point. I attended an AI course at Oxford University in 2020. Then last year I decided I needed to immerse myself in the domain.
To get me going, I found the Kaggle courses excellent. Getting engaged hands-on was best for me.
If you know Python already, great job! If not, it is worth spending time there in my opinion. Some aspects you can pick up as you go along, but I needed to invest time in lists and arrays (numpy).
There is also a lot of helpful material at: https://www.tensorflow.org/tutorials
For books I liked the following:
- Deep Learning with Python by Francois Chollet
- AI and Machine Learning for Coders by Laurence Moroney
Some might disagree with me on whether they are beginner books - but I found them useful.
Good luck on your learning journey!
-Michael
there's no dark mode? in kaggle obv
Hello, I'm Safni, an undergraduate student and tech enthusiast from Sri Lanka. I'm thrilled to be a part of this community!
Still a work in progress, and I would appreciate feedback and tips:
https://www.kaggle.com/code/ehabessam/innovative-eda-feature-engineering-selection
I just finished an EDA that I want to continue once I finish my ML course. I'd love to get feedback on my coding, steps and plots if you don't mind 🙂
Link: https://www.kaggle.com/code/sebastienmotionstats/eda-heart-disease-analysis-motionstats
Thanks 🙂
Hi, everybody.
I have a quick question.
I uploaded an Excel file (.xlsx) to Kaggle to use as a part of a notebook.
I have set it as private. How do I find the path or call that excel file in my notebook?
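import os
for dirname, _, filenames in os.walk('/kaggle/input'):
    for filename in filenames:
        print(os.path.join(dirname, filename))
Once you run this, you will see the path to every file attached to your notebook; then you can read the one you need with pandas:
import pandas as pd
# file_name is one of the paths printed above; sheet_name=None returns a dict of DataFrames, one per sheet
dfs = pd.read_excel(file_name, sheet_name=None)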
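The two steps above can be combined into one small script. This is a sketch, not the only way to do it: `find_excel_files` and `load_all_sheets` are hypothetical helper names, and reading `.xlsx` with pandas assumes the `openpyxl` package is available in the notebook.

```python
from pathlib import Path


def find_excel_files(root="/kaggle/input"):
    """Return sorted paths of all .xlsx files under root (empty list if root is missing)."""
    root_path = Path(root)
    if not root_path.is_dir():
        return []
    return sorted(str(p) for p in root_path.rglob("*.xlsx"))


def load_all_sheets(path):
    """Read every sheet of one workbook into a dict {sheet name: DataFrame}."""
    import pandas as pd  # imported lazily; .xlsx support assumes openpyxl is installed

    return pd.read_excel(path, sheet_name=None)


if __name__ == "__main__":
    for path in find_excel_files():
        for sheet_name, df in load_all_sheets(path).items():
            print(path, sheet_name, df.shape)
```

Private datasets attached to a notebook are mounted under `/kaggle/input` like public ones, so the same search should find the file.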
Thank you.
If anybody uses Mistral, how can you upload an XLSX file to Mistral?
It keeps telling me that it does not have access, even though I have uploaded it on Kaggle as input to my notebook and have used pandas to read it.
Thanks, yes I do know a bit of python, enough that I work professionally on it lol
And yes, I did finish the intro to ML course on kaggle, and would probably do some more
Finally, thanks for the book suggestion 🙂
hey, I've been using the Kaggle API to download output from my kernel, but there's no output in my terminal for some reason?
so in the first line I tried downloading from some other kernel and it worked, but when I use mine there is no output??
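One way to see what the CLI is actually doing is to run it from Python and print the exit code and both output streams. The sketch below assumes the official `kaggle` command-line tool is installed and authenticated; `build_output_command` and `run_and_show` are hypothetical helper names, and the kernel reference in the usage comment is a placeholder.

```python
import subprocess


def build_output_command(kernel_ref, dest="."):
    """Build the CLI call that downloads a kernel's output files into dest."""
    return ["kaggle", "kernels", "output", kernel_ref, "-p", dest]


def run_and_show(command):
    """Run a command and print exit code, stdout and stderr so nothing is lost silently."""
    result = subprocess.run(command, capture_output=True, text=True)
    print("exit code:", result.returncode)
    print("stdout:", result.stdout.strip() or "<empty>")
    print("stderr:", result.stderr.strip() or "<empty>")
    return result.returncode


# Usage (placeholder reference, replace with your own "<username>/<kernel-slug>"):
# run_and_show(build_output_command("your-username/your-kernel-slug", "output"))
```

An exit code of 0 with empty streams would suggest the kernel simply has no saved output files, for example because the latest version has not finished running.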
Hi
does anyone here know an API to find the weather using latitude, longitude, date and time?
I know about OpenWeatherMap API. One of my friends used it but it's paid. Might be worth looking into.
Morning all! o/
@wicked berry you sent a friend request, you need anything?
Hello, I have only attended lectures and learned the theory. How do I start on the practical side? If I start from regression, will it be difficult or easy for me?
Hello, I hope you're all doing well. I have a favour to ask: I searched on Kaggle but couldn't find the multi-label antenna selection dataset. It's for research I'm trying to do in massive MIMO. If anyone has an idea of where I can get it without generating it myself, could tell me how to generate it, or has one ready, please let me know.
Greetings, I have a college assignment which requires me to interview a DBA/Data Scientist or someone in a similar profession. I am looking for anyone who might be interested in participating. This assignment isn't due for a while but I felt that I should reach out beforehand to see if anyone is interested. Feel free to let me know!
I'm new to data science and I'm very excited!
Kaggle competitions are a good place to start
🌐 Machine Learning Partner Wanted 🌟
I'm diving deep into the world of advanced neural networks and looking for a collaborative partner. Here's what I'm looking for:
- 🧠 Advanced knowledge in neural networks like Resnet18, AttnGAN, StackGAN, and more.
- 🇪🇺 Based in Europe, ideally in Germany or Austria, for potential in-person collaborations.
- 🗣️ Fluent in English, Russian, or Ukrainian - proficiency in any of these languages is a must.
- ⏰ Located within +/- 1-2 hours of the Vienna timezone to facilitate smooth collaboration.
- 🛠️ Important: Must be proficient in using the PyTorch framework. Please consider this before reaching out.
Ready to innovate? 💡 Let's join forces to explore uncharted AI territories. DM me to get started! 🤝
Hi Everyone, I started studying Data Analytics and I am glad to be here to build the practical side of my study.
Excuse me, what's the best book for learning NLP?
I'm Henrique. I'm from Brazil and I've just started to study data science, so I have little knowledge but a great desire to learn.
I am Cardinal. I have some knowledge of data science and am here to sharpen it and gain more experience. I am open to projects and job opportunities here.
Is DS oversupplied?
I'm Leo, I'm from South Korea and I'm studying machine learning. I took a deep learning course at uni, but I need to learn more; I want to participate in many projects and I hope to go into the ML sector.
woah kaggle dark mode
you should thank me for this
Hi, I'm Emily and enjoy data science and Kaggle!
Hello, I am Sapna Saw and I am new to the data science world. I have knowledge of programming and I just started machine learning.
Hey, I'm new here, just learning around
Entrepreneur who is trying to find ways to implement different ways of data science in future companies
hi
I'm working as a Business Analyst and now want to move into the Data Science domain. I hope to land a DS job that will really satisfy my curiosity about working with data and more.
hi guys, how are you doing? I'm new, who can help with ML?
Greetings fellow learners and data enthusiasts!
I'm Saumya Nishi from India and I'm thrilled to announce my recent entry into the Kaggle community. Having transitioned from a role as a Technology Quality professional in the life sciences industry, I am now embarking on a journey into the realm of data science.
Recognising the challenges that come with being a beginner in the field of data science, I am eager to contribute and share my evolving knowledge with all of you. In my initial steps, I have crafted several notebooks focusing on supervised machine learning. These resources are designed to provide beginners with a solid foundation in creating ML models to tackle real-world business cases.
I invite you to explore these notebooks and share your valuable thoughts and feedback. I am open to discussions and brainstorming sessions on new ideas and perspectives. If you find any of the notebooks helpful, your support through upvoting would mean a lot to me. It not only helps me gauge the usefulness of the content but also encourages me to continue sharing knowledge along similar lines.
You can access my notebooks here: https://www.kaggle.com/saumyanishi/code
Thank you for your time and consideration. Wishing you all a fantastic day and the best of luck on your data science journey!
any info for bootcamp scholarship 100% fully funded for data role?
Hi there,
I'm seeking inspiration for my master's thesis and I'd like to ask for your advice on what you would recommend to a younger version of yourself to explore, given the knowledge and experience you have now, so that such a thesis would be both engaging and marketable. Currently, I work in ML (Computer Vision), and I've been contemplating some way to combine neural networks with cloud computing and carbon footprint reduction. I'm open to any topic suggestions, opinions, constructive criticism, etc., because as I mentioned, I'm in search of inspiration, so even loose ideas could shift my focus to a certain area. Nonetheless, I still have over a year until my defense, so there's no rush, but I'll be grateful for all suggestions. Thanks in advance!
Hello @torpid solar
It's fantastic that you're exploring a topic that integrates your expertise in machine learning with broader themes such as cloud computing and environmental sustainability.
Here are some potential ideas to kickstart your master's thesis.
You can delve into how leveraging distributed machine learning across cloud resources can contribute to minimizing the carbon footprint associated with training large-scale models.
Hello @smoky plank
Nice to meet you
Hey, I am kaggler for fun!
I am excited to learn fascinating things in a competitive environment.
Hi. My name is Kristen. I am a single mother of a 16 year old boy, and 6 year old girl. I live in SC in the US right now. I am currently working to earn my Google Data Analysis Certificate, and I am very new to programming. I did try some html once a LONG time ago. I would love to connect with people. I am entering the Titanic competition, if anyone wants to create a team HMU!
#1101210830688751626 I am Kristen, Join me on the Titanic competition??
Hello Kristen! I am a new member of the Kaggle Community. I have a keen interest in data science and machine learning. Would it be possible for me to join your group?
Hi, I am Muhammad Abdullah from Pakistan. I am a BSCs student at Virtual University. I am a Junior Data Scientist.
Here is my Kaggle account link: https://www.kaggle.com/abdmental01
Code link: https://www.kaggle.com/abdmental01/code
If you like my work, do upvote; it will help a lot.
Let's Connect !
Hey, I'm Muhammad Abdullah , a Junior Data Analyst and Machine Learning enthusiast. I love diving into data to uncover insights and build predictive models that make a real impact.
Hello, I am Vinit. I recently made my first Kaggle notebook project and am also submitting it to the Google – AI Assistants for Data Tasks with Gemma competition.
Checkout my project: https://www.kaggle.com/code/vinit0714/exploratory-data-analysis-eda-using-gemma-llm
It's a tutorial on Exploratory Data Analysis (EDA) using Gemma LLM. If you find my work valuable, I would be grateful for your support through upvotes.
Feel free to connect with me on LinkedIn: https://www.linkedin.com/in/vinit-mehra-a3b66718b/
Very Happy to be part of the community🙂 🙂
I'm a data science student at Zeal Education Society in Pune, India
Hello guys, I am Anish, about to graduate. Looking to connect with new people and have fun doing Kaggle contests.
So, guys, I just completed a research study on a specific nanofluid's thermal conductivity, collecting a dataset with various parameters such as nanofluid composition, temperature, and conductivity values. I want to use an Artificial Neural Network (ANN) to predict thermal conductivity from the parameters I collected; I was inspired by several research papers on a similar topic. I have questions about starting a feed-forward ANN, like which software I should use and what skills I need. So please, if anyone is experienced in developing ANN models like the one in this research paper:
https://sci-hub.se/10.1016/j.csite.2021.101055
If you could give me a general understanding of how to start a feed-forward ANN to predict thermal conductivity from the parameters I collected, please write a message on how to start.
Hello, I am Dan Drai from Jaffa (Tel-Aviv), and I am entering my first competition, "Prompt Recovery". I find the proposed challenge captivating, as it offers an occasion to dive very deep into the fundamentals of LLMs.
Hello, I am Emmanuel Oyem from Lagos, Nigeria. I want to start my first competition on Kaggle. Are there any tips I need to know about? I will be really grateful for any tips. Thanks.
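As a starting point, a feed-forward regression network can be sketched in plain NumPy. Everything below is an assumption for illustration: the data is synthetic (a stand-in for the real nanofluid measurements), the network has one tanh hidden layer, and `train_mlp` and `predict` are hypothetical helper names. It is not the architecture from the linked paper; in practice a library such as scikit-learn, Keras or PyTorch would handle the training loop.

```python
import numpy as np


def train_mlp(X, y, hidden=16, lr=0.05, epochs=2000, seed=0):
    """Train a one-hidden-layer network with full-batch gradient descent on MSE."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0.0, 0.5, size=(X.shape[1], hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, size=(hidden, 1))
    b2 = np.zeros(1)
    target = np.asarray(y, dtype=float).reshape(-1, 1)
    n = len(X)
    for _ in range(epochs):
        h = np.tanh(X @ W1 + b1)           # hidden activations
        pred = h @ W2 + b2                 # linear output for regression
        grad_pred = 2.0 * (pred - target) / n  # gradient of mean squared error
        grad_W2 = h.T @ grad_pred
        grad_b2 = grad_pred.sum(axis=0)
        grad_z = (grad_pred @ W2.T) * (1.0 - h ** 2)  # back through tanh
        grad_W1 = X.T @ grad_z
        grad_b1 = grad_z.sum(axis=0)
        W1 -= lr * grad_W1
        b1 -= lr * grad_b1
        W2 -= lr * grad_W2
        b2 -= lr * grad_b2
    return W1, b1, W2, b2


def predict(params, X):
    """Apply the trained network to new inputs and return a 1-D array."""
    W1, b1, W2, b2 = params
    return (np.tanh(X @ W1 + b1) @ W2 + b2).ravel()


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Synthetic stand-in: two scaled inputs (think composition and temperature).
    X = rng.uniform(-1.0, 1.0, size=(200, 2))
    y = 0.5 * X[:, 0] - 0.3 * X[:, 1] + 0.1
    params = train_mlp(X, y)
    print("training MSE:", float(np.mean((predict(params, X) - y) ** 2)))
```

With real measurements, the inputs would need scaling to a similar range first, and part of the data should be held out to check that the model generalises.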
The titanic competition is usually the first competition people do. I am working on it right now with suboptimal results thus far.
Nice to meet everyone
Hello all! Just very new to all these things, looking forward to learn, and contribute towards this community in the near future!
Any self taught ppl pursuing a career in Data
Hello guys, I am Chunlin, a PhD student about to graduate. Looking to connect with new people and have fun doing Kaggle contests. : )
Hello everyone. Social Science likes machine learning too lol 🙂
When is the perfect time to enter the prized competitions?
Like I have achieved some good percentages in 2 of the getting-started competitions and I don't know if I should enroll in the March Machine Learning Mania.
Hello! Is there a conversation for the competition "Multi-Class Prediction of Obesity Risk" that closed recently?:)
Hi, everyone! I would like to share my article, a step-by-step guide on building a virtual assistant for any business. Maybe you will find it valuable, or would like to give some input or a comment 😄 In the article I pick HSBC UK Bank as a target and build a chatbot for them that greatly outperforms their own.
https://medium.com/@vovakuzmenkov/building-a-fullstack-rag-solution-with-private-llm-a-step-by-step-guide-48a0a4467efc
hello!
Prized competitions are all different, first, you need to go through the ones currently available and see if there is any match with your interests and experience. As for timing, you can check the interactive timeline of active competitions to make a proper choice: https://www.kaggle.com/code/kononenko/interactive-tmeline-of-active-kaggle-competitions
!!!! URGENT
Is this the normal or the right format for the images to go to the model after doing normalization to them ??
We don't know what the colors mean, you need to add a color bar. Also, what do you mean by "format"?
I am doing road segmentation and this is one sample of the data that I have, when I do normalization on the data it becomes like the above images, so I ask if this is normal or if this is the correct output after normalization on the data!
I don't know because I cannot tell what the colors represent on the normalized images. 🙂 Please add a color bar.
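The color bar request above can be sketched as follows. This assumes min-max normalization on a single-channel image, since the thread does not say which normalization was used; `min_max_normalize` and `show_with_colorbar` are hypothetical helper names, and the random array is only a stand-in for a real road tile.

```python
import numpy as np


def min_max_normalize(image):
    """Scale an array to the [0, 1] range; a constant image maps to all zeros."""
    image = np.asarray(image, dtype=np.float32)
    lo, hi = image.min(), image.max()
    if hi == lo:
        return np.zeros_like(image)
    return (image - lo) / (hi - lo)


def show_with_colorbar(image, title="normalized"):
    """Plot a single-channel image with a color bar so values can be read off."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for plotting

    fig, ax = plt.subplots()
    im = ax.imshow(image, cmap="viridis")
    fig.colorbar(im, ax=ax, label="pixel value")
    ax.set_title(title)
    plt.show()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fake_tile = rng.integers(0, 256, size=(64, 64))  # stand-in for a grayscale tile
    normalized = min_max_normalize(fake_tile)
    print("min:", float(normalized.min()), "max:", float(normalized.max()))
    # show_with_colorbar(normalized)  # uncomment in a notebook to view the plot
```

If the normalized images look strangely colored, one common cause is plotting a single channel with a default colormap, or plotting values outside the range the plotting function expects; the color bar makes that visible.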
Can anyone suggest some resources for TensorFlow, especially for NLP?
Hey everyone! Check out my latest notebook on Weed Detection, where I achieved an accuracy of 100%! Feel free to explore the code and let me know your thoughts.
Link:https://www.kaggle.com/code/ayushtiwari2323/weed-detection-using-cnn
I'm Ahmed ... I like to join a group working on a ML project.
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=Frbhwgg28EU
About the project: This project aims to develop and implement a machine learning-driven diagnostic tool for the early and accurate detection of chest infections such as COVID-19 (Influenza) and pneumonia using chest radiography images. The tool will play a crucial role in identifying potential cases of these infections, thereby facilitating tim...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=aIUc8Y9h51k
About the project: In this project I built and deployed a book recommender system. The goal of this project was to build a general understanding of the tools and processes used to produce machine learning models. I explored both serving and scaling a prediction service, as well as creating a pipeline for continuous training. I focused on underst...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=_qCgiotc0yo
About the project: Power consumption analysis for the Non Interconnected Zones of Colombia using data Science tools and generating a report through an interactive dashboard.
About Andrea Franco: A passionate Mechatronics Engineer with a strong interest in the world of Artificial Intelligence.
SUBSCRIBE: https://www.youtube.com/c/kaggle?sub_con...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=kRFyavXs5Fo
About the project: This project is a binary classification with the goal to predict whether the customers have defaulted. I trained and evaluated balanced_accuracy using XGBoost, LightGBM and HistGradientBoostingClassifier. For the categorical features, I used ordinal and one-hot encoding and compared the performance. I also modeled the time-series...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=Gso8ulRW0NM
About the project: This project is designed for answering complex scientific questions through deployment of a LLM pipeline.
About Josué Huamán: I'm a passionate Electronics Engineer and a graduate in Artificial Intelligence with over four years of experience as a developer, specializing in Machine Learning. My career has been focused on creati...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=CnEnLZyzxms
About the project: The project involves an analysis of air pollution data at various geographic levels, including global countries, USA states, and USA counties. It assesses trends and variations in air quality from 2019 to 2022, highlighting regions with the best and worst air quality, outliers, and changes over time. The project utilizes geospa...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=VWDM_VfWH8A
About the project: The project is about building a Machine Learning model that takes an image of a bird as input and then gives the bird species and details/information about the bird as output.
About Purity Nyagweth: I am a junior Data Scientist and ML Engineer with 3 years of experience in Data Science and Machine Learning. I am an active lea...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=fxeyVy_DoxE
About the project: The goal of this project is to develop an innovative solution that leverages computer vision technology to revolutionize the way farmers detect and manage plant diseases. Our primary objective is to empower farmers with a timely and accurate tool for identifying plant diseases in their crops, thereby facilitating early interve...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=n3W5V104jDc
About this project: This project focuses on the availability of affordable housing in the United States; in particular, the affordability of already-available housing stock. My research question was: What factors affect the percentage of available homes households making median income can afford? I created a measure of listed home affordability ...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=32isuqHEz4s
About the project: Using 2011-2021 GitHub data obtained from Kaggle to learn what the programming language I should focus on as an aspiring data analyst should be. The data was cleaned and restructured using SQL and Python. The new data was then used to visualize and create Machine Learning Model.
About Anabelle Capois Espinal: First generati...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=EvyKonBd5es
About this project: The Yoruba-RAG project focuses on enhancing the performance of large language models, like GPT-3, when handling questions in low-resource languages like Yoruba. The project involves web scraping from a Yoruba blog using Beautiful Soup, storing the data in a text file, and dividing it into smaller chunks. To effectively proces...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=dZHlhNHQ-qQ
About the project: The project focuses on leveraging machine learning and deep learning techniques to identify and classify arrhythmias within ECG data. By using the MIT-BIH Arrhythmia Database, we have analyzed a collection of ECG recordings and categorized them into five classes, including normal rhythms and various arrhythmias. Our work invol...
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=NaWqNesUDCY
About the project: Regression and classifier machine learning models are used to predict the next day’s rainfall. Neural network regression, random forest, and XGBoost classifiers are trained on South Florida 2015 data with 44 features including the current day’s total rainfall, temperature, wind, surface pressure, wind, vegetation levels, and m...
Hello, new guy here from Hungary. At the age of 51, I start a new career. I study Power BI intensively, and want to supplement it with some medium-level of data science knowledge.
Hey everyone, I'm Abdelrahman from Egypt. Looking forward to learning with you all!
Hello, everyone.
I am glad to be here.
I am a machine learning/computer vision developer and I am truly enthusiastic about the opportunity to contribute my extensive expertise to your projects.
hello big family,
I'm new here, I'm a web designer. and I am training in machine learning. I am here to benefit from your experience through project work and other work.
Hello everyone
I have very recently begun my data science journey. Currently I only know Python and understand some ML models/concepts.
I wish to learn a lot from here.
Hello world!
Really excited to learn a lot and get insights on ML and AI
hello everyone, I am new to data science and eager to learn more about ML and AI
Hi! Beginner in python and data science. Working on the Titanic dataset.
hello everyone, a beginner and machine learning enthusiast here!
Hi everybody! Just joined this server, hope to find great people and learning opportunities here 🙂 Have a nice day ahead 
Hi!! recently started learning data science, hope will learn a lot here!
oh me too, I recently started in machine learning
Hi everyone, I'm new here. I am a student learning machine learning and have a little experience in Python. I hope this community motivates me more in my dream career.
Hello everyone. I am a rookie in machine learning. My undergraduate degree is in computer science, so I have some programming foundation. I have just finished Andrew Ng's machine learning course and I am currently reading the Lizard Book (Machine Learning in Action), but I feel dizzy after reading it. So I would like to ask the group if you can give me some advice on learning ML. Thank you so much!
Print welcome to Hyderabad
I wanted to challenge myself to complete as many challenges as possible using C++ instead of Python, coding everything from scratch.
However, it appears that the notebooks are Python and R only.
Is there still a way to compete in challenges that require a notebook submission while using C++?
Hi all. I am new here. I was on Kaggle during my undergrad days and did a couple of projects, followed by a long break from it. I would like to pursue my master's in Data Science, and have decided to brush up my knowledge and hands-on skills in the data science domain using Kaggle. Happy learning!
Kaggle just posted a video! Go check it out!
https://www.youtube.com/watch?v=O6OYPeQtW8o
✍️ On Kaggle, you can now suggest edits to public datasets!
We hope this tool will empower the community to improve each other's datasets and make Kaggle better together. Watch the video to learn more about how it works.
...
Any AML Analyst here
Devin.
Hello. I am new here. Have zero experience in ai and ds, but looking forward to learning from you all.
Hello my name is success
whats devin discord name
Kind of a occupier software
oh
Devin is the AI software engineer released by Cognition Labs recently.

