#🎛┊applied-ml | Kaggle | Page 1

honest niche Aug 6, 2023, 3:04 PM

#

Has anyone tried spaCy's "curated transformers"? https://github.com/explosion/curated-transformers https://twitter.com/spacy_io/status/1687103018628722688?s=20

GitHub

GitHub - explosion/curated-transformers: 🤖 A PyTorch library of cur...

🤖 A PyTorch library of curated Transformer models and their composable components - GitHub - explosion/curated-transformers: 🤖 A PyTorch library of curated Transformer models and their composable c...

spaCy (@spacy_io)

Out now: version 1.0.0 of Curated Transformers 🎉 State-of-the-art Transformers, brick by brick. With support for Llama 2, torch.compile, ALiBi, TorchScript tracing, and many other improvements ✨

https://t.co/C4w6moAbD8

#

This event from Weights & Biases on LLM evaluation might be of interest to folks: https://www.eventbrite.com/e/deep-dive-into-llm-evaluation-with-weights-biases-tickets-689536541357?aff=WBS

Eventbrite

Deep Dive into LLM Evaluation with Weights & Biases

We're going to dive into how we can effectively evaluate LLM systems, with a focus on Retrieval Augmented Generation (RAG) systems.

copper acorn Aug 7, 2023, 6:57 AM

#

Recommender Systems in the Era of LLMs. Awesome-LLM4RS-Papers:
https://github.com/nancheng58/Awesome-LLM4RS-Papers

GitHub

GitHub - nancheng58/Awesome-LLM4RS-Papers: Large Language Model-enh...

Large Language Model-enhanced Recommender System Papers - GitHub - nancheng58/Awesome-LLM4RS-Papers: Large Language Model-enhanced Recommender System Papers

honest niche Aug 8, 2023, 3:18 PM

#

Hi everyone. I'm pleased to share we've scheduled our first #1130788094753394719 event with Sayak Paul from Hugging Face talking about diffusion models & Diffusers! Hope you'll be able to attend. 🙂 https://discord.gg/kaggle?event=1138491012562554900

#

If you have suggestions for other speakers / events in applied ML or research areas, let me know!

honest niche Aug 8, 2023, 9:32 PM

#

Follow-up from the Discord event @graceful temple just held -- @fresh bluff if you or anyone at H2O would like to give a talk about H2O or your team's involvement in competitions or anything, hit me up 🙂

fresh bluff Aug 8, 2023, 9:34 PM

#

honest niche Follow-up from the Discord event <@1101209061871067309> just held -- <@742537000...

Sure thing, Meg! Thanks for the opportunity!

honest niche Aug 8, 2023, 9:34 PM

#

Just send me an email: meg@kaggle.com and we could coordinate there

honest niche Aug 14, 2023, 3:30 PM

#

Has anyone played around with Jupyter AI? https://github.com/jupyterlab/jupyter-ai

GitHub

GitHub - jupyterlab/jupyter-ai: A generative AI extension for Jupyt...

A generative AI extension for JupyterLab. Contribute to jupyterlab/jupyter-ai development by creating an account on GitHub.

honest niche Aug 16, 2023, 12:56 AM

#

A reading list for practical applications of LLMs from the inestimable Vicki Boykis https://gist.github.com/veekaybee/be375ab33085102f9027853128dc5f0e

Gist

Normcore LLM Reads

Normcore LLM Reads. GitHub Gist: instantly share code, notes, and snippets.

lunar ether Aug 21, 2023, 2:27 PM

#

honest niche A reading list for practical applications of LLMs from the inestimable Vicki Boy...

Vicki is amazing! Such a down-to-earth practitioner. I read as much of her work as I can.

#

That reminds me - speaking of Vicki - her "What Are Embeddings" guide is astounding.

What are embeddings?

A deep-dive into machine learning embeddings.

queen fern Aug 23, 2023, 7:55 AM

#

Hi, I am currently taking the machine learning course on Google skills. I am a bit confused about the training, testing and validation sets. After the training set, do we use the test set or the validation set?

frigid yoke Aug 23, 2023, 12:47 PM

#

queen fern Hi, I am currently taking the machine learning course on Google skills. I am a b...

You train your model on the train set, and then you evaluate its performance on the validation set. You use the performance metrics from the validation set to improve your training (for example, to tune hyperparameters). Then, to make sure you haven't accidentally overfit to the validation set (it can happen), you use your test set once at the end to confirm that your metrics are indeed good.
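A minimal sketch of the three-way split described above, in plain Python. The 60/20/20 proportions and the helper name `three_way_split` are assumptions for illustration, not something the course prescribes.

```python
import random

def three_way_split(rows, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle rows, then cut them into train / validation / test lists."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_test = int(len(rows) * test_frac)
    n_val = int(len(rows) * val_frac)
    test = rows[:n_test]                  # touched once, at the very end
    val = rows[n_test:n_test + n_val]     # used repeatedly while tuning
    train = rows[n_test + n_val:]         # used to fit the model
    return train, val, test

train, val, test = three_way_split(range(100))
print(len(train), len(val), len(test))  # 60 20 20
```

The model is fit on `train`, compared and tuned using `val`, and scored on `test` only after all tuning decisions are final.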

queen fern Aug 23, 2023, 12:52 PM

#

frigid yoke You are training your model using the **train** set, and then you evaluate its p...

Thank you.

fleet palm Aug 29, 2023, 4:51 PM

#

Here's a visual:

noble fable Aug 30, 2023, 8:17 PM

#

Hey, my team is working on a real estate project and I am new to ML. I want to build a recommendation model for this project. Can you please suggest what I should do?

#

Or should the model be trained or developed once the project gets deployed?

#

I want to implement my learning

deft pecan Sep 1, 2023, 2:21 AM

#

noble fable Hey, my team is working on real estate project and I am new to ml, I want to mak...

Is your dataset similar to the Boston house price dataset? Is it a regression problem? My suggestion is to start with visualization and think about what kind of preprocessing your data needs before you feed it into a model. If it is a small dataset, I recommend starting with a simple model such as linear regression or an SVM and doing k-fold cross-validation.
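As a rough illustration of the suggestion above, here is k-fold cross-validation around a one-feature linear regression in plain Python. The toy area/price numbers and the helper names `fit_line` and `k_fold_mae` are made up for the sketch; with real data a library implementation would be the more usual choice.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a * x + b with a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, mean_y - a * mean_x

def k_fold_mae(xs, ys, k=5):
    """Mean absolute error averaged over k held-out folds."""
    n = len(xs)
    fold_errors = []
    for fold in range(k):
        held_out = set(range(fold, n, k))  # every k-th row goes into this fold
        train_x = [xs[i] for i in range(n) if i not in held_out]
        train_y = [ys[i] for i in range(n) if i not in held_out]
        a, b = fit_line(train_x, train_y)
        errors = [abs(ys[i] - (a * xs[i] + b)) for i in held_out]
        fold_errors.append(sum(errors) / len(errors))
    return sum(fold_errors) / k

# Made-up toy data: floor area vs. price, exactly linear so the error is ~0.
areas = [50, 60, 70, 80, 90, 100, 110, 120, 130, 140]
prices = [2 * a + 10 for a in areas]
print(k_fold_mae(areas, prices, k=5))
```

Each row is held out exactly once, so the averaged error uses all the data without ever scoring a row the model was fit on.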

mild ice Sep 9, 2023, 7:40 AM

#

Guys, please help me find resources for: Analysis of News Articles and videos for regional languages

I want to make a Media News Monitoring and Feedback System that can handle multiple regional languages, categorize news stories, and notify me about negative coverage of news in the media.
You may suggest some good resources related to sentiment analysis from news articles and video transcripts

noble fable Sep 9, 2023, 7:12 PM

#

deft pecan is your dataset similar to the boston house price dataset? Is it a regression pr...

Thanks,
I already made a KNN model but I don't know how to go ahead.
I mean, how does a recommendation model work on a big company's website?
I just applied what I learned by creating an API for the KNN algorithm, building a feature matrix from some important attributes.
What should I do next?

velvet lotus Oct 3, 2023, 12:37 AM

#

Check out how you can use the ChatGPT API to solve various Natural Language Processing tasks in a very short amount of time with minimum coding experience in my latest video!

If you know how to work with python lists, dictionaries, for loops, if statements, while loops and functions, you can now easily solve NLP tasks such as text summarization, sentiment analysis, topic modeling, text transformations (like translation, grammar correction, style adjustments), chatbot development, and so much more!

On a personal note, it was a lot of fun to design the thumbnail for this one! 😁

https://youtu.be/e8z886SP-fU

YouTube

Data Science Cross-Validated

Create Your Own ChatBot in 30 Minutes!

The materials covered in this video are available in the GitHub repository below.
https://github.com/azsom/DSCV_ChatGPT_API

Let me know in the comment section down below if you have any questions and what you use the ChatGPT API for!

Follow me on LinkedIn! https://www.linkedin.com/in/andraszsom/

00:00 - 00:54: Intro
00:54 - 01:55: GitHub rep...


finite moth Oct 29, 2023, 9:35 AM

#

Hi everyone 👋 I'm new to the object detection and tracking domain and I'm currently working on a project. I want to fine-tune a YOLO-NAS model on my dataset. Currently I have 4-6 videos with annotations. I have extracted the annotations into separate frame-wise text files for each video.
Text format is: class-number x_center_normalized y_center_normalized width_normalized height_normalized
E.g. sample_1.txt:

5 0.102 0.32 0.63 0.21

sample_2.txt:

5 0.102 0.32 0.63 0.21
2 0.402 0.82 0.33 0.11

How do I fine-tune YOLO-NAS? I initially had videos and 1 annotation file containing details about the bounding box and object class. Now how do I proceed? I tried following: https://colab.research.google.com/drive/1q0RmeVRzLwRXW-h9dPFSOchwJkThUy6d
But I don't know which dataloaders to use or how to format my dataset.
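Before choosing a dataloader, it can help to sanity-check the label files. The sketch below parses the normalized `class x_center y_center width height` format described above and checks whether a box stays inside the image. The function names are hypothetical and this is not part of any YOLO-NAS tooling.

```python
def parse_yolo_line(line):
    """Parse 'class xc yc w h' with coordinates normalized to [0, 1]."""
    parts = line.split()
    if len(parts) != 5:
        raise ValueError(f"expected 5 fields, got {len(parts)}: {line!r}")
    class_id = int(parts[0])
    xc, yc, w, h = (float(p) for p in parts[1:])
    return class_id, xc, yc, w, h

def to_pixel_box(xc, yc, w, h, img_w, img_h):
    """Convert a normalized centre/size box to (x_min, y_min, x_max, y_max) in pixels."""
    return ((xc - w / 2) * img_w, (yc - h / 2) * img_h,
            (xc + w / 2) * img_w, (yc + h / 2) * img_h)

def box_inside_image(xc, yc, w, h):
    """True when the whole box lies within the normalized image bounds."""
    return (xc - w / 2 >= 0 and yc - h / 2 >= 0
            and xc + w / 2 <= 1 and yc + h / 2 <= 1)

for line in ["5 0.102 0.32 0.63 0.21", "2 0.402 0.82 0.33 0.11"]:
    class_id, *box = parse_yolo_line(line)
    print(class_id, box_inside_image(*box))
```

With the sample lines from the message, the first box has `x_center - width / 2 < 0`, so it extends past the left edge of the image. That is worth checking against the original annotation file before training.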

Google Colaboratory

velvet lotus Dec 14, 2023, 1:17 PM

#

If you want to learn more about Generative AI and how you can use it in your life and at work, check out my latest video! 🙂

https://youtu.be/uki2DndUfZE

YouTube

Data Science Cross-Validated

Professor Explains: Generative AI (non-technical)

If you want to learn more about Generative AI and how you can use it in your life and at work , check out the course below!
https://www.deeplearning.ai/courses/generative-ai-for-everyone/


mild ice Jan 14, 2024, 3:57 PM

#

https://youtu.be/_g3d9_rblLI

YouTube

Ansh Tanwar

Train a Self-Driving Race Car on Your PC! (AWS DeepRacer DRfC)

Ever dreamed of having your own driverless autonomous car? Now you can, even on your Windows PC! Dive into this detailed guide on training an autonomous car locally using AWS DeepRacer, reinforcement learning PPO, and your Windows PC. we will create an AWS DeepRacing training environment that can be deployed in the cloud, or locally on Ubuntu Li...


shy spade Jan 24, 2024, 1:35 PM

#

Hello everyone! I am new to machine learning and have been using XGBoost via Python in a Jupyter Notebook to try to build a model that can predict customer churn for a SaaS company, using a dataset with various features from each customer's usage over the 6 months prior to their churn.

My dataset has 85 rows and is heavily imbalanced: roughly 70 customers did not churn and 15 did.

Does anyone have any suggestions on how I can alter my model to better understand the minority class? Thanks!!

tepid flame Jan 25, 2024, 10:03 PM

#

shy spade Hello everyone! I am new to machine learning and have been working on leveraging...

before starting on any modelling I suggest you gather more data. 85 rows is nowhere near enough to build any ML model that would perform well in the real world

shy spade Jan 30, 2024, 2:07 AM

#

tepid flame before starting on any modelling I suggest you gather more data. 85 rows is nowh...

Hi @tepid flame - Thank you so much for the advice 🙏

Quick follow up question - Is there a general guideline to the minimum amount of data I should have to gather any meaningful insight?

I figured it was a low amount of data but wasn't sure how much would be enough (or not enough).

Thank you again Wendy!

surreal needle Jan 30, 2024, 3:20 AM

#

shy spade Hi <@534659493439733761> - Thank you so much for the advice 🙏 Quick follow up...

There is no general answer to that question - it is data-dependent. Maybe there is a single feature that explains a given outcome, in which case a relatively small amount of data may be enough. In real life for most outcomes there are multiple contributing factors, which is why more data points are needed. This is especially the case for imbalanced datasets, even though 70:15 ratio is not too bad. As to meaningful insights, people have different definitions. You may be able to get 85% accuracy from your dataset as it is right now because >82% of data is in the dominant class. That would still mean that you probably wouldn't be able to predict the low-abundance class with high accuracy.

fossil patrol Jan 31, 2024, 6:04 AM

#

shy spade Hi <@534659493439733761> - Thank you so much for the advice 🙏 Quick follow up...

I don't know how complicated a model you have been planning, but a simple experiment could be dropping features at random and checking the accuracies.

If you try every feature subset exhaustively it becomes a very complicated task: with n features there are 2^n - 1 candidate models, which is computationally infeasible. But if you have any information about the features, you can use it to narrow things down.

By the way, it is recommended to use standard tools like the correlation coefficient to eliminate redundant features.
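One way to read the correlation suggestion above is as a simple filter: keep a feature only if it is not highly correlated with a feature already kept. The sketch below is plain Python; the 0.95 threshold, the feature names and the toy values are all assumptions for illustration.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def drop_redundant(features, threshold=0.95):
    """Keep features in order, skipping any that correlate strongly with a kept one."""
    kept = []
    for name, values in features.items():
        if all(abs(pearson(values, features[k])) < threshold for k in kept):
            kept.append(name)
    return kept

# Hypothetical usage features: 'sessions' is just 2 x 'logins'.
features = {
    "logins": [1, 2, 3, 4, 5],
    "sessions": [2, 4, 6, 8, 10],
    "tickets": [5, 1, 4, 2, 3],
}
print(drop_redundant(features))  # ['logins', 'tickets']
```

This only catches pairwise linear redundancy, and it does not look at the target at all, so it is a first pass rather than full feature selection.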

fossil patrol Jan 31, 2024, 6:05 AM

#

surreal needle There is no general answer to that question - it is data-dependent. Maybe there ...

That's why we use balanced accuracy or F1 scores to evaluate the model.
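To make the point concrete, here is a small sketch using the 70/15 split from the question and a model that always predicts "no churn". The helper `evaluate` is written for this example and treats churn (label 1) as the positive class.

```python
def evaluate(y_true, y_pred, positive=1):
    """Return accuracy, balanced accuracy and F1 for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)

    accuracy = (tp + tn) / len(pairs)
    recall_pos = tp / (tp + fn) if tp + fn else 0.0
    recall_neg = tn / (tn + fp) if tn + fp else 0.0
    balanced_accuracy = (recall_pos + recall_neg) / 2  # mean of per-class recall
    f1 = 2 * tp / (2 * tp + fp + fn) if tp else 0.0
    return accuracy, balanced_accuracy, f1

y_true = [0] * 70 + [1] * 15   # 70 stayed, 15 churned
y_pred = [0] * 85              # always predict "did not churn"
acc, bal_acc, f1 = evaluate(y_true, y_pred)
print(f"accuracy={acc:.3f} balanced_accuracy={bal_acc:.3f} f1={f1:.3f}")
# accuracy=0.824 balanced_accuracy=0.500 f1=0.000
```

Plain accuracy looks respectable at about 82%, while balanced accuracy (0.5) and F1 (0.0) show that the minority class is never found.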

bright tendon Feb 1, 2024, 10:39 PM

#

Hi, has anyone ever used GANs before to generate synthetic data to train financial models? I have seen articles about the use of GANs in fraud for imbalanced models, I want to try it but am not sure exactly how I would apply a GAN in theory and any potential risks I should be aware of when building a model off the synthetic data it generates.

sudden hearth Feb 27, 2024, 7:12 PM

#

Hi! When putting a forecasting model in production, what metrics/KPIs do you suggest as good indicators of whether the model is doing well or badly? Besides MAPE.

velvet lotus Feb 29, 2024, 4:08 PM

#

sudden hearth Hi! When putting a forecasting model in production, what metrics/kpi do you guys...

You should monitor the same evaluation metric you used in cross-validation and to calculate the test score (the generalization error) during model development. If the metric during deployment starts to deviate from the test score, it might be time to retrain the model.
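A minimal sketch of that monitoring idea, using MAPE only because it is the metric mentioned in the question; any metric used during development would slot in the same way. The `drifted` rule and its 1.5x tolerance are hypothetical and would need tuning per use case.

```python
def mape(actual, forecast):
    """Mean absolute percentage error, in percent. Assumes no actual value is zero."""
    return 100 * sum(abs((a - f) / a) for a, f in zip(actual, forecast)) / len(actual)

def drifted(live_score, test_score, tolerance=1.5):
    """Hypothetical rule: flag the model when live error exceeds tolerance x test error."""
    return live_score > tolerance * test_score

test_mape = 5.0  # assumed test score recorded during model development
live_mape = mape([100, 200, 400], [110, 180, 400])
print(round(live_mape, 2), drifted(live_mape, test_mape))  # 6.67 False
```

In practice the live score would be computed over a rolling window of recent forecasts once their actuals are known.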

south spoke Mar 3, 2024, 8:11 PM

#

shy spade Hello everyone! I am new to machine learning and have been working on leveraging...

For models with small datasets, Naive Bayes gives decent results: https://scikit-learn.org/stable/modules/naive_bayes.html

scikit-learn

1.9. Naive Bayes

Naive Bayes methods are a set of supervised learning algorithms based on applying Bayes’ theorem with the “naive” assumption of conditional independence between every pair of features given the val...

south spoke Mar 3, 2024, 8:32 PM

#

Hi guys, I need somebody with decent #reinforcement-learning experience to review my attempt at #✖┊connect-x competition to train an agent able to beat provided negamax agent. Tonight, I will reach 10+ mil. training steps and results are still disappointing (at least only 1 illegal move in the last iteration, but still not able to beat negamax and win only 2 out of 3 games against the dumb random agent). I have published my cry for help in Kaggle discussions: https://www.kaggle.com/discussions/questions-and-answers/480250, but no reaction so far. So, maybe discord will be better?

sage apex Mar 8, 2024, 1:42 AM

#

Hi, guys
We can combine STT, ChatGPT and TTS to make a voice chatbot.
If we use streaming mode for STT and ChatGPT, users can get a response within 2 seconds, which makes it useful.
I have developed an AI call center using Twilio and a voice chatbot.
I think it can take on the role of human callers.

dark hatch Mar 25, 2024, 9:43 AM

#

Hey All! Would you like to run an LLM quickly on your laptop or tiny devices? Quantization is the answer. Here is a code walkthrough for using llama.cpp for GGUF quantization in 10 mins. Hope it's useful: https://youtu.be/j7ahltwlFH0?si=Waa35po2YwqrQC3_

YouTube

AI Bites

GGUF quantization of LLM (Gemma) with LLAMA.cpp and @HuggingFace on...

Would you like to run LLMs on your laptop and tiny devices like mobile phones and watches? If so, you will need to quantize LLMs. LLAMA.cpp is an open-source library written in C and C++. It allows us to quantize a given model and run LLMs without GPUs.
In this video, I demonstrate how we can quantize a fine-tuned LLM on a Macbook and run it on...


visual wharf Apr 17, 2024, 9:33 AM

#

I am running a notebook for RF and it is taking over 16 hours with a sample size of 95,000, using BayesSearchCV with 50 iterations. On the same data, KNN took 2 hours, and DT and LR took 40 mins. Do you think there is an issue with the code or is it just taking this long?

quartz wind Apr 23, 2024, 3:19 PM

#

visual wharf I am running a notebook for RF and is taking over 16 hours with a sample size of...

I can understand that is the algorithm itself. Do you know the differences between KNN and Bayes Search? It is a good starting point 😃
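A back-of-the-envelope count can help judge whether a run time like this is plausible. The sketch assumes 5-fold cross-validation and 100 trees per forest; neither number is stated in the question, so substitute the values from the actual notebook.

```python
def search_cost(n_iter, cv_folds, n_trees):
    """Count model fits and individual tree fits in a hyperparameter search."""
    model_fits = n_iter * cv_folds      # one forest per (candidate, fold) pair
    tree_fits = model_fits * n_trees    # each forest trains n_trees trees
    return model_fits, tree_fits

model_fits, tree_fits = search_cost(n_iter=50, cv_folds=5, n_trees=100)
print(model_fits, tree_fits)  # 250 25000
```

Under those assumptions the search trains 250 forests, or 25,000 trees, each on most of the 95,000 rows, so a long run does not by itself indicate a bug. If the search space also includes the number of trees or the tree depth, some candidates can cost far more than others.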

grand ether Jun 6, 2024, 6:31 PM

#

Is there a Machine Learning equivalent of the Python course we follow from the University of Helsinki? That course is very helpful, concise, and comprehensive. It would be great to have something similar for Machine Learning.

molten sand Jun 7, 2024, 3:59 PM

#

grand ether Are there any Machine Learning alternative for the course we follow for learning...

What do you mean? Kaggle has many courses on Machine Learning

mild ice Jun 13, 2024, 6:57 AM

#

Hello, how can I fine-tune Llama 3 for multi-label classification? Do I need to follow the same prompt format for each row as mentioned here: https://www.kaggle.com/code/danielhanchen/kaggle-llama-3-8b-unsloth-notebook/notebook
Or is there another method, like the one we use for BERT?

prime bolt Jun 28, 2024, 7:10 AM

#

How to get room direction in indoor photo/image.

quiet shore Jul 14, 2024, 1:59 PM

#

Hellooo, hi everyone. Can you recommend a book or something else for learning NeuralProphet?

leaden dew Sep 2, 2024, 5:15 AM

#

Hi, everybody. I have a question.
I want to develop a method for designing a neural network architecture for a given real problem.
Is this possible?
I mean, can we derive a specific network architecture from neuroscience?
Please help me with an overview of this and the methods.
Where can I find proper references?

distant violet Sep 5, 2024, 7:45 PM

#

Hi All,

I am working with a client who is expecting me to do commodity price forecasting on a monthly basis. But they will only be able to provide monthly data for the past 5 years (60 data points).

I have tried the Holt-Winters model, ARIMA, SARIMAX, LSTM, LR, and Prophet. But the accuracy is not up to the mark.

What is the minimum number of data points required for monthly forecasting?

Can I please have help with the correct approach here?

split canyon Sep 9, 2024, 10:52 PM

#

leaden dew Hi, everybody. I have a question. I want to make a method to architecture the ne...

Halloooo, what architecture do you mean? The current MLP is based on neuroscience. Try searching for this ✨. KAN is new and kind of hyped. I haven't tested KAN yet but will do it in my studies.

[attached image]

leaden dew Sep 9, 2024, 10:53 PM

#

Thanks, @split canyon

split canyon Sep 9, 2024, 10:54 PM

#

leaden dew Thanks, <@443325778780880916>

Anytime 🫶

split canyon Sep 9, 2024, 10:57 PM

#

distant violet Hi All, I am working with a client who is expecting me to do a commodity price ...

Try to augment the data if possible, or generate new features with windowing. If it overfits, just apply regularization techniques. The thing is, sometimes it's not the model's problem. Sometimes it's a data problem. DM me if you want to know more about time series forecasting 👋
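"Windowing" here can be read as turning the series into supervised rows of lagged values. A small sketch, assuming three lags plus a rolling mean; the helper name, the feature names and the toy prices are made up for illustration.

```python
def make_lag_features(series, n_lags=3):
    """Turn a series into (features, target) rows built from the previous n_lags values."""
    rows = []
    for t in range(n_lags, len(series)):
        window = series[t - n_lags:t]
        features = {f"lag_{k}": series[t - k] for k in range(1, n_lags + 1)}
        features["rolling_mean"] = sum(window) / n_lags
        rows.append((features, series[t]))  # target is the value at time t
    return rows

monthly_prices = [10, 12, 11, 13, 15, 14]  # made-up values
rows = make_lag_features(monthly_prices, n_lags=3)
print(len(rows), rows[0])
```

Every feature for month `t` uses only values before `t`, which avoids leaking the target. Note that with 60 monthly points, each extra lag removes one usable training row.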

ripe gulch Oct 17, 2024, 11:26 PM

#

Which ML framework should I learn as a beginner so that I can find a job quickly??? I mean TF or PyTorch

tawdry tusk Oct 23, 2024, 7:17 AM

#

ripe gulch Which ML framework should I learn as a beginner so that I can find job quickly??...

TensorFlow imo

vague panther Oct 23, 2024, 8:11 AM

#

ripe gulch Which ML framework should I learn as a beginner so that I can find job quickly??...

It's quite debatable, PyTorch or TF. Learn both, it will be more helpful.

neat acorn Nov 23, 2024, 4:00 PM

#

ripe gulch Which ML framework should I learn as a beginner so that I can find job quickly??...

If you're more interested in the industry side then go with PyTorch; if academia, then go with TensorFlow.

naive wadi Nov 23, 2024, 5:43 PM

#

I have a question for the folks using ML at an enterprise level. Has Kaggle helped you with skill building and have you brought any Kaggle ideas into production?

merry nymph Dec 4, 2024, 8:05 PM

#

If anyone is publishing papers in either LLMs or computer vision, please do include me. I am very interested in working on this, but due to a lack of network I am unable to do the things I want to do.
Please, please do include me if anyone is publishing papers 🙏🙏🙏🙏

fading garden Dec 4, 2024, 10:09 PM

#

naive wadi I have a question for the folks using ml at an enterprise level. Has a kaggle he...

I see a lot of enterprise level kagglers on BlueSky

naive wadi Dec 4, 2024, 11:07 PM

#

fading garden I see a lot of enterprise level kagglers on BlueSky

Oh yeah? Anyone I should follow? Or recs?

leaden dew Jan 7, 2025, 6:34 PM

#

Hi, everybody.
I'm looking for an AR-related project that can detect a plane and augment objects on it.
If anybody knows of one, please tell me.
Thanks.

leaden dew Jan 9, 2025, 4:05 AM

#

Hi, everybody.
We're building news analysis models and need to collect 20 years of news data.
Does anybody know a good news data service?
Please tell me.

fathom olive Jan 11, 2025, 5:34 PM

#

Did anyone try creating a text translation model, say from English to some unique language?
Let's say we want to create a model that translates English to LangX.
Any ideas?

scarlet pewter Jan 12, 2025, 6:46 AM

#

fathom olive did anyone try creating a text translation model, like from english to some uniq...

I think this is indirectly implemented by LLM enthusiasts

That is, they convert English text to vector embeddings that are understood by the model; except in this case we would be assigning LangX words to the vector embeddings.

The idea seems cool, you can try that way.

#

Oops, I meant query embeddings* or whatever an appropriate term be

mild orchid Jan 13, 2025, 5:55 PM

#

Hey everyone! 👋 I wanted to share that I'm organizing the Data & AI Blogathon. What’s it all about? You’ll be writing blog posts on topics like Data Science, AI, Machine Learning, and more. We’ll have a variety of categories, including topics like Data Science and Data Engineering, as well as different formats like case studies, tutorials, how-to guides, and more. With over 7 categories to choose from, you’ll have plenty of chances to get noticed and win!

What’s in it for you?

Get featured in big newsletters
Mentorship from experts in the field
Connect with top mentors, ambassadors and other AI professionals.
Get your work shared with over 500,000 followers

If you’re looking to grow your network, get advice, and get your work noticed, this is for you! 👉 Register here: https://forms.gle/FD9FfKJMYp6QCYEE7
Feel free to connect with me on Linkedin as well! https://www.linkedin.com/in/ginacostag/

Google Docs

Global Data and AI Blogathon: Participant Registration

Welcome to the Data & AI Blogathon! This is your chance to show your skills, share your knowledge, and get noticed by top professionals, mentors, and judges from companies like Google, Amazon, Microsoft, and more.
Important Notes:
Submission Guidelines: You need to submit at least one blog post during the event. We recommend submitting one post ...

forest moth Feb 28, 2025, 3:50 PM

#

Hey, i need help

We have a long conveyor, about 5 km long, with around 4,000 to 5,000 idlers.
We divided the conveyor into sections with an imaginary line, let's say every 20 m; each section has around 10-12 idlers of the same dimensions.

We have normal data, recorded after idler replacement, and abnormal data, recorded before replacement. The dataset is in the form of real positive FFTs: each row contains a list of 5k integers.

If I train a 1D-CNN-based autoencoder or VAE, it works section-wise: I can see a higher reconstruction error on abnormal data. But it is impractical to create a model for every section, as it would be computationally very expensive. I want a single model that works for the entire conveyor, but when I combine all the data and train, it doesn't generalise well.

I also tried extracting statistical features like kurtosis, skewness, etc. and trained a dense VAE, but no luck. What can I do?

Note: I can see abnormality in the normal data too. Even after cleaning, the model becomes more sensitive to normal data as well. Please tell me a better approach if you have any experience with a similar problem.

wind cliff Mar 19, 2025, 6:44 PM

#

https://myhistory.co.ke .. I need you to test this website and give reviews, I will be happy 😁. Guys, I need your reviews

myHistory - Your Digital Health Journey

Track, manage, and share your medical journey—all in one secure place.

marsh matrix Mar 30, 2025, 7:21 AM

#

hello everyone, i am beginner - intermediate in ML

#

i learnt all the basic algorithms and neural network

#

built some models too

#

i am wondering what should i do now ?

#

what options do i have to explore ?

high token Mar 30, 2025, 1:32 PM

#

mild orchid Hey everyone! 👋 I wanted to share that I'm organizing the Data & AI Blogathon. ...

This is interesting, a Blogathon... this is the first time I've heard of it. So, do you have a site or something?

grave hornet Apr 1, 2025, 3:26 AM

#

marsh matrix what options do i have to explore ?

job maybe!!

desert silo Apr 1, 2025, 2:26 PM

#

Which companies specifically hire freshers with an AI/ML specialisation?? List for India!

clever cove Apr 7, 2025, 9:19 AM

#

😂

small sandal Apr 11, 2025, 5:50 PM

#

hi can any one help me in learning ml

vague panther Apr 12, 2025, 7:25 AM

#

small sandal hi can any one help me in learning ml

-# [¯\_(ツ)_/¯](https://cneuralnets.netlify.app/guideblogs)

small sandal Apr 12, 2025, 11:33 AM

#

vague panther -# [¯⁠\⁠_⁠(⁠ツ⁠)⁠_⁠/⁠¯](https://cneuralnets.netlify.app/guideblogs)

thank you

winged crypt Apr 16, 2025, 3:59 PM

#

Hi, I have a labeled dataset of floor plans (general floor plans, not only architectural) and I am using YOLOv11 instance segmentation, trained on it, to get the room, door and window information. However, the results are not very accurate; in particular, some rooms are identified as weird polygons. Any advice on how to prepare high-quality data, or on a better machine learning / AI method to recognize the room information? Thanks in advance.

rich anvil May 17, 2025, 2:22 PM

#

🚀 Looking for Teammates – Join AgriIntel! 🌱

Hi Everyone,

I’m putting together a high-impact AI project — AgriIntel — a Smart Farming Assistant built to solve real problems faced by small-scale farmers like my own family.

👉 This isn’t just a side project or a college assignment.
AgriIntel is being developed as a serious portfolio product to help each team member showcase real-world impact, demonstrate ML/Data Science skills, and land strong remote roles.

🧠 What We're Building:

🌾 Crop Recommendation Engine — based on soil, pH, rainfall, etc.
📈 Crop Price Forecasting — using Time Series (Prophet/LSTM)
🍂 Leaf Disease Detection — with Computer Vision (YOLOv5 / MobileNet)
🗣️ Hindi Voice Assistant — powered by Whisper + gTTS
📊 Insightful Dashboard — Streamlit or React

✅ Why You Should Join:

📌 Build something recruiters will ask about
💼 Boost your GitHub + Resume with real-world work
🧩 Collaborate in a sprint-style, outcome-driven team
🌍 Contribute to a product that impacts real lives

🔗 Full Pitch + Roadmap:
👉 Click to View

📩 Interested in joining? DM me directly or connect on LinkedIn:
💼 LinkedIn – Dinesh Kumar

Let’s build AgriIntel together — and create something that truly matters! 🌾💡

—
Dinesh Kumar

Google Docs

AgriIntel – AI-Powered Smart Farming Assistant

AgriIntel – AI-Powered Smart Farming Assistant 1. Vision Empower small-scale and marginal farmers with AI tools to make smarter decisions about what to grow, when to sell, and how to protect their crops. We're building a focused, product-level portfolio project to get hired in...

fleet pilot May 25, 2025, 5:11 AM

#

marsh matrix built some models too

Try a no-code visualization tool like Orange 3.x. You can consolidate various concepts easily.

grave aurora May 28, 2025, 10:20 PM

#

Hi! Wondering if anyone has experience on ML applied to People Analytics? I’m researching the topic but can’t find realistic value adding project ideas

manic trench May 31, 2025, 2:58 PM

#

What is the best way to self learn ML?

fleet pilot Jun 6, 2025, 9:19 PM

#

manic trench What is the best way to self learn ML?

If you are just a beginner, the YouTube videos by StatQuest (Josh Starmer) and by Luis Serrano can give you a head start. They use simple visualizations and short videos for understanding the subject in the shortest possible time, in my opinion. If you need a "no code" visual workflow tool to experiment with data and various models, you can use the open-source Orange 3.8x version along with the various add-ons provided. It is very easy to learn, with a number of video tutorials. Other open-source tools I have experimented with are Weka and KNIME.

manic trench Jun 7, 2025, 5:32 PM

#

Thank you so much i will look into it

viral torrent Jun 11, 2025, 6:17 PM

#

my friends, does anybody have an idea for my graduation project? it should be AI related of course, should be innovative, no one has ever made it and it should solve a real world problem

#

appreciate it anyways

white yew Jun 27, 2025, 11:32 AM

#

viral torrent my friends, does anybody have an idea for my graduation project? it should be AI...

man, that's up to you. Scroll through the kaggle, look into datasets, read, watch youtube videos you like and come back to datasets again. Works with me, sooner or later i'll find a problem i want to solve for myself.
I think it's like open source contributions - you do it for yourself, coz you're interested in it, you want to know the answer, and so you have the greatest motivation of achieving it

manic steeple Jul 5, 2025, 12:46 AM

#

Hey guys,
It's been a long time since I processed datasets for ML projects. Can anyone here point me to some sort of guide that could help me with developing projects and stuff??

cyan elbow Jul 13, 2025, 10:58 AM

#

Hey guys, I am seeking recommendations for an impactful AI/ML project that would strongly appeal to product-based companies when they are hiring. The goal is to maximize my chances of securing a job.
Please do suggest something asap : )

gaunt jackal Jul 17, 2025, 3:03 PM

#

cyan elbow Hey Guys,I am seeeking recommendations for an impactful AI/ML project that would...

I know lead generation seems to be all the rage lately

sour light Jul 18, 2025, 5:31 AM

#

I want to participate in a Kaggle competition. I am new, so I want to join a team for first-hand experience.

humble delta Jul 21, 2025, 7:34 PM

#

sour light I want to participate in kaggle competition , I am new so I want to join a team...

Let's go i am also new.

cinder lagoon Jul 22, 2025, 9:37 AM

#

👋 I just built a free tool that turns any PDF, image, or Word doc into a clean dataset using just a prompt — kinda like ChatGPT but for messy files.

Want to give it a quick try and tell me what’s broken or missing? Takes 2 mins. Would love your feedback 🙏
👉 https://pdf2dataset.streamlit.app

trim stream Jul 24, 2025, 12:23 AM

#

cinder lagoon 👋 I just built a free tool that turns any PDF, image, or Word doc into a clean ...

It doesn’t work when I click on the link

loud siren Jul 26, 2025, 5:02 AM

#

Any advice on how I should get started learning ML in a properly structured way?

earnest gorge Jul 26, 2025, 2:51 PM

#

loud siren any advise on how I should get started learning ML and in a proper structure?

I build via LLM collaboration. I work with different AI models to run research, hone my understanding, and apply it to see what works. Just shoot for the dream, then refit to an MVP version and iterate. You'll get it, or get something at least haha. You need to understand ML fundamentals, but you could essentially just let AI teach you about AI. Edit: it's not a replacement for coding yourself, but if the learning curve is too steep just have fun! Whatever works!

fleet pilot Aug 3, 2025, 9:52 PM

#

loud siren any advise on how I should get started learning ML and in a proper structure?

Can you describe your background, whether directly or indirectly relevant to the subject matter?

craggy oracle Aug 6, 2025, 11:24 AM

#

https://www.historyofdatascience.com/dartmouth-summer-research-project-the-birth-of-artificial-intelligence/

History of Data Science

Dartmouth Summer Research Project: The Birth of Artificial Intellig...

Held in the summer of 1956, the Dartmouth Summer Research Project on Artificial Intelligence brought together some of the brightest minds in computing and cognitive science — and is considered to have founded artificial intelligence (AI) as a field.

prime bolt Aug 25, 2025, 3:28 PM

#

humble delta Let's go i am also new.

Let's goooo

acoustic basalt Aug 28, 2025, 6:21 AM

#

You can consider me too 😅

left hull Sep 1, 2025, 6:35 AM

#

hi all

acoustic basalt Sep 2, 2025, 8:47 AM

#

Hello

stark vigil Sep 15, 2025, 6:57 PM

#

Hi, @everybody
I have one question. I'm training ML models for a 3-class classification problem where the number of samples per class is similar, but the predictions are skewed.
The first and second classes are predicted, though with low precision, and the third class is never predicted. What's the reason? I can't find it.
Earlier, when I applied reinforcement learning with the three classes assigned to three actions, one action was never selected either.
Actually, this is a prediction model for forex EUR/USD.
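
One way to make the "third class is never predicted" symptom concrete is to count predictions per class and tabulate a confusion matrix. The following is only a minimal sketch in plain Python: the label names (`down`, `flat`, `up`), the toy data, and the helper names `confusion_matrix` and `never_predicted` are all made up for illustration and are not taken from the original model.

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Rows are true classes, columns are predicted classes."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for t, p in zip(y_true, y_pred):
        matrix[index[t]][index[p]] += 1
    return matrix


def never_predicted(y_pred, labels):
    """Return the labels that do not appear in the predictions at all."""
    counts = Counter(y_pred)
    return [label for label in labels if counts[label] == 0]


# Hypothetical labels and toy data, balanced across the three classes.
labels = ["down", "flat", "up"]
y_true = ["down", "flat", "up", "up", "down", "flat", "up", "down", "flat"]
y_pred = ["down", "down", "flat", "down", "flat", "flat", "down", "down", "flat"]

matrix = confusion_matrix(y_true, y_pred, labels)
missing = never_predicted(y_pred, labels)

for label, row in zip(labels, matrix):
    print(label, row)
print("never predicted:", missing)
```

A column of zeros in the matrix shows that a class is never chosen, and the row for that class shows which classes its samples are assigned to instead.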

void wraith Sep 21, 2025, 7:44 AM

#

Hello guys, I am doing a small project on texture classification. Can anyone help me understand what the wavelet technique is?

stark vigil Oct 8, 2025, 10:28 PM

#

Hi, @everyone
Has anyone joined the Radical AI Founders' Masterclass?
I didn't have an opportunity to apply for it.
Please share the meeting URLs for it.

red jacinth Oct 17, 2025, 9:59 AM

#

stark vigil Hi, @everybody I have one question, I'm training ml models for the prediction, w...

Did you use a stratified split, i.e. the same class proportions in both the train and test data?
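
For reference, a stratified split can be sketched in plain Python as below. This is only an illustration under assumptions: the function name `stratified_split`, the 25% test fraction, and the toy labels are hypothetical. scikit-learn's `train_test_split` offers the same idea through its `stratify` argument.

```python
import random
from collections import Counter, defaultdict


def stratified_split(labels, test_fraction=0.25, seed=0):
    """Return (train_idx, test_idx) so each class keeps its share in both parts."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, label in enumerate(labels):
        by_class[label].append(i)

    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)  # shuffle within each class only
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Toy labels: three classes with 40 samples each (made-up data).
labels = ["down"] * 40 + ["flat"] * 40 + ["up"] * 40
train_idx, test_idx = stratified_split(labels)

train_counts = Counter(labels[i] for i in train_idx)
test_counts = Counter(labels[i] for i in test_idx)
print(train_counts)
print(test_counts)
```

For time-ordered data such as price series, a shuffled split like this one can leak future information into training, so a chronological split may be the safer choice there.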

stark vigil Oct 17, 2025, 3:03 PM

#

Yes, same

drowsy raptor Oct 21, 2025, 10:30 PM

#

Hi everyone, I'm doing the Intro to ML course and I'm a bit confused on step 3 of the lesson 4 exercise (https://www.kaggle.com/learn/intro-to-machine-learning)

question: Inspect your predictions and actual values from validation data.
code:

```
print(val_predictions[:5])
# print the top few actual prices from validation data
print(y.head())
```
The bit I'm tripped up on is:
```
# print the top few validation predictions --> this bit of the question confuses me
print(val_predictions.head())
```

I was wondering what the correct way to do what the question asks is. I did
```print(val_predictions[:5])```
but I'm not sure if that's what the exercise was asking.

slim yew Nov 7, 2025, 2:05 AM

#

drowsy raptor Hi everyone, im doing the intro to ML course and im a bit confused on step 3 of ...

If it's just for inspection, then I think both of them are correct, but I prefer the y.head() method. If you want to check accuracy or precision you can go for precision_score or accuracy_score, and there are many more like fbeta_score, f1_score, etc. Just my opinion, I could be wrong.
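
A small sketch of the difference between the two calls, under some assumptions: `val_predictions` is taken to be a NumPy array (which is what a scikit-learn `predict` call returns), the validation targets are taken to be a pandas Series called `val_y` (an assumed name), and all the numbers are made up. NumPy arrays have no `.head()` method, so slicing is the way to peek at them, while `.head()` works on a pandas Series.

```python
import numpy as np
import pandas as pd

# Stand-ins for the exercise's variables (all values are made up).
val_predictions = np.array([186500.0, 184000.0, 130000.0, 92000.0, 164500.0, 220000.0])
val_y = pd.Series([231500, 179500, 122000, 84500, 142000, 325624])

# NumPy arrays are inspected by slicing.
first_predictions = val_predictions[:5]
print(first_predictions)

# pandas objects provide .head() for the same purpose.
first_actuals = val_y.head()
print(first_actuals)

# Calling val_predictions.head() would raise AttributeError,
# because ndarray does not define that method.
print(hasattr(val_predictions, "head"))
```

Comparing predictions against the validation targets rather than the full `y` keeps the rows being compared consistent.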

warm bough Nov 8, 2025, 2:43 PM

#

Scam

neat acorn Nov 16, 2025, 2:55 AM

#

Hello all, I'm Muhammad Yousif, a BS IT student with a focus on data science and AI. I'm here looking for possible collaboration in research or work. If you're eager to collaborate, just DM me with your ideas.

hasty pilot Nov 17, 2025, 4:21 PM

#

Hi guys, I'm into everything programming, ML, and agents, and I'm looking to team up ASAP

glass pagoda Nov 23, 2025, 6:26 AM

#

Hi everyone, I'm predicting the next F1 race winners, if you're interested you can check out the dataset and the notebooks here: https://www.kaggle.com/datasets/rockyt07/formula-1-championships-1950-2025. Feel free to tinker with it.

regal bronze Nov 24, 2025, 7:39 AM

#

🎥 New Video Released: Epistemic World Model vs Baseline – Full Generalization Test
I’m thrilled to share the full demo of my Epistemic World Model in action:
https://www.youtube.com/watch?v=Sw57PKee__w

In this video, I walk through the architecture, training curve, and generalization results in a high-entropy combinatorial environment (Brazilian Lotto history). While the baseline model remains stuck at ~10% hit-rate for ≥1/6 events, my model climbs to 81.6% across 100 epochs.

✔ Core highlights:

Structured “Q1 (aleatoric) / Q2 (epistemic)” gating for belief management

Stable pyramidal state vector: Memory / Pain / Choice / Exploration

Continuous online learning & domain adaptation

Full comparison between baseline world model and epistemic variant

Thought-provoking implications for future cognitive-agent architectures

Whether you’re working in ML, world models, reinforcement learning, or cognitive systems — this architecture might spark ideas for new directions in generalization.

Would love to hear your feedback, questions or ideas for collaboration.

#MachineLearning #WorldModels #EpistemicAI #Generalization #KaggleCommunity #AIResearch

fleet palm Dec 16, 2025, 10:21 AM

#

hasty pilot Hi guys, I'm into everything programming, ML, and agents, and I'm looking to team up ASAP

Same here @hasty pilot and @neat acorn, maybe we can help one another by working together and learning alongside each other

neat acorn Dec 16, 2025, 12:44 PM

#

fleet palm Same here @hasty pilot and @neat acorn maybe we can help o...

Dm me

hasty pilot Dec 19, 2025, 7:24 AM

#

neat acorn Dm me

what are we working on

neat acorn Dec 19, 2025, 7:27 AM

#

hasty pilot what are we working on

That seems like a black box for now; we should discuss it in DM

hasty pilot Dec 19, 2025, 8:06 AM

#

neat acorn That seems like a black box for now; we should discuss it in DM

cool, any hackathon on?

neat acorn Dec 19, 2025, 10:38 AM

#

hasty pilot cool, any hackathon on?

Maybe. Actually, I'm actively participating in multiple things: hackathons, competitions, research, projects, etc., so I can discuss things around these

#

For further dm me

trim pebble Dec 21, 2025, 2:12 PM

#

heyy! anyone with some experience in AI x cognitive science/computational neuroscience here? please hmu if so :)

sinful ermine Jan 28, 2026, 2:19 AM

#

Hi @everyone
📘 Python Loops & Strings – Kaggle Notebook 🐍
This notebook explains Python loops (for, while) and strings in a detailed and easy-to-understand way, with clear examples.
It’s especially helpful for beginners 🚀

Please check it out and leave a vote ⭐ and a comment 💬 — your feedback is highly appreciated! 🙌
https://www.kaggle.com/code/dastgeerjutt/3-loops-and-strings-detailed

acoustic prawn Jan 31, 2026, 6:02 PM

#

I'm training an MtG AI player. Here are my assets:

I have a functional rules engine and a complete graph-based world model. This world model is completely accurate and encodes relationships of arbitrary distance. I can easily implement a spider or walker to do traversal. GNNs or an RNN that walks the graph could be applied here.

I have access to human-played game logs which, presumably, could be translated to resimulations of those games for observation. I can have a flagship LLM play against itself and have the AI observe. And, once the AI is halfway competent, I have self play.

And I have a clear goal. Given the state of the game world, multiple objectives, and a set of possible actions, how do I select the best possible action(s) when they're presented?

rustic dune Feb 4, 2026, 1:09 PM

#

🚗⚡ Just dropped a 🔥 Kaggle Masterpiece: Analyzed 271K Washington State EVs with INTERACTIVE MAPS, XGBoost Ensembles, & 2027 Forecasts!

Key Insights:
✅ Tesla dominates 60% – but Chevy Bolt crushes on range/price
✅ Urban Heatmaps reveal Seattle hotspots (download HTML map!)
✅ ML Beast: R²=0.95 predicting range, 94% CAFV eligibility
✅ Forecast: +50K new EVs by 2027 – infrastructure crisis ahead?

Built with GeoPandas, Folium, StackingRegressor (XGB+LGBM+RF). Perfect for policy makers & energy pros!

🔗 Dive in & upvote: [https://www.kaggle.com/code/hammadansari7/electric-vehicle-population-analysis]

What’s YOUR take on EV adoption? Rural lag or tech hype?

#DataScience #Kaggle #MachineLearning #GeospatialAnalysis #ElectricVehicles #EV #Forecasting #XGBoost #Sustainability #AI

@Kaggle @Tesla @robikscube @towardsdatascience @everyone

worldly jewel Mar 10, 2026, 5:40 PM

#

rustic dune 🚗⚡ Just dropped a 🔥 Kaggle Masterpiece: Analyzed 271K Washington State EVs...

Just a reminder that our server rules prohibit requesting upvotes and tagging in a broad group of users - although we love seeing the projects folks are working on.

crisp nimbus Mar 13, 2026, 12:33 PM

#

Hello everyone! 👋

If you want to upgrade your IT skills and learn more about the Microsoft ecosystem (Azure, AI, Cloud, etc.), come join the Microsoft Elevate Training Center! 🚀

This program is great for those who want to prepare for official certifications or simply stay updated with the latest technologies together with Dicoding.

Register for free through this link: https://www.dicoding.com/elevate/registration?referrer_id=5510036

Let’s go while the opportunity is still there!

versed hemlock Apr 4, 2026, 8:42 PM

#

Can anyone provide the best dataset download link for deepfake detection, with good-quality videos of various different types? It would be a great help to me.