west urchin Mar 30, 2025, 3:31 PM

#

same question

rustic hawk Mar 30, 2025, 3:32 PM

#

ans also where do i get the coursework in AIstudio?

regal lily Mar 30, 2025, 5:26 PM

#

So because I am not 18 I cannot use Google Ai Studio leading to me not getting an API so what should I do?

earnest cliff Mar 30, 2025, 8:16 PM

#

I'm currently in my second semester of systems engineering. I consider myself a junior data analyst; I have knowledge in Power BI, Excel, Tableau, and SQL. I work as an administrative analyst at a company that makes slot machines similar to a casino.
I've been studying something, but I feel it's time to invest in education. I'm looking for those courses that cost 1-2 million, that last 10 months, and so on. They're similar to a bootcamp, but I use them more to reinforce knowledge and so on. I feel like YouTube courses and other platforms don't teach me enough; I feel like I've reached a point where I need guidance.
Would you invest in education like that?

buoyant cipher Mar 30, 2025, 8:20 PM

#

Hello, I am not being able to open the file "Foundational Large Language Models & Text Generation" as you can see in the screenshot, is that normal?

Capture_decran_2025-03-30_a_22.13.49.png

solar pecan Mar 30, 2025, 9:05 PM

#

How to check that er have successfully completed our assignments

wraith sparrow Mar 30, 2025, 9:06 PM

#

solar pecan How to check that er have successfully completed our assignments

Mind asking here https://discord.com/channels/1101210829807956100/1303438695143178251

solar pecan Mar 30, 2025, 9:18 PM

#

Plz answer my question

wraith sparrow Mar 30, 2025, 9:23 PM

#

solar pecan Plz answer my question

They will be tracked automatically

solar pecan Mar 30, 2025, 9:24 PM

#

But how should I get confirmed that it is completed

glad ibex Mar 30, 2025, 10:22 PM

#

I have an issue with uploading a dataset.
I uploaded a 40 chunks of total 2GB (parquet, gzip compression) and it's hanging for a few hours with this:
Is it normal, or should I do something differently? Now I am uploading with Kaggle API

drifting star Mar 31, 2025, 6:37 AM

#

glad ibex I have an issue with uploading a dataset. I uploaded a 40 chunks of total 2GB (p...

2GB should not take few hours. I have never used API to upload the data. Also try restarting once and check your internet speed. I have 5g and 2gb data takes about 15-20 mins

wraith sparrow Mar 31, 2025, 6:38 AM

#

drifting star 2GB should not take few hours. I have never used API to upload the data. Also tr...

He's master btw 💀😅

drifting star Mar 31, 2025, 6:41 AM

#

#❓┊ask-a-question I am trying to run an LLM model . And I tried different ways to load the model into memory . like device_map= auto, balanced, max_memory but I still see that 2 of my GPUs are underutilized. Besides my CPU memory is still available (I checked resource utilization). I am getting CUDA Out of memory issues and it is really frustrating. Can someone please help

#

I can share more details if someone can help

wispy meadow Mar 31, 2025, 6:42 AM

#

Hello, I am a starter in LLMs. Basically for my project, I am trying to train any base model(Currently GPT2), to generate a short story using a user's prompt. Now, I've used a Huggingface dataset(Basically, Prompts and stories). I am feeding them into the model and training it in supervised mode(I passed the stories as lables).
My losses aren't converging. If anyone wants details on my Training arguments I can show them if its allowed.

glossy crown Mar 31, 2025, 6:43 AM

#

drifting star <#1129507816697241822> I am trying to run an LLM model . And I tried different ...

which machine are you using and which model(and corresponding quant) are you using?

wraith sparrow Mar 31, 2025, 6:43 AM

#

drifting star <#1129507816697241822> I am trying to run an LLM model . And I tried different ...

Maybe it's exceeding available vram

drifting star Mar 31, 2025, 6:43 AM

#

L4*4

wispy meadow Mar 31, 2025, 6:43 AM

#

drifting star <#1129507816697241822> I am trying to run an LLM model . And I tried different ...

I maybe wrong, but since I've been getting this error aswell Throughout the whole night yesterday, I guess you're using a batchsize that requires more VRAM than you're providing.

glossy crown Mar 31, 2025, 6:43 AM

#

drifting star L4*4

model?

glossy crown Mar 31, 2025, 6:43 AM

#

wispy meadow Hello, I am a starter in LLMs. Basically for my project, I am trying to train an...

are you in any competition?

#

if not you can share

drifting star Mar 31, 2025, 6:44 AM

#

Qwne2.5- 32B but if VRAM is exhausting then that should be visible on the resource utlization screen.

glossy crown Mar 31, 2025, 6:44 AM

#

are you using fp8?

wraith sparrow Mar 31, 2025, 6:44 AM

#

Nope it's clear

glossy crown Mar 31, 2025, 6:45 AM

#

drifting star Qwne2.5- 32B but if VRAM is exhausting then that should be visible on the resour...

are you using fp8 quant?

#

can you share your code if it is not a part of any comp?

drifting star Mar 31, 2025, 6:45 AM

#

wispy meadow Mar 31, 2025, 6:45 AM

#

glossy crown are you in any competition?

No no I am not im just building a project on my own.

#

I'll share

glossy crown Mar 31, 2025, 6:46 AM

#

drifting star

I do not have exp with that library

#

I never directly use accelerate

wraith sparrow Mar 31, 2025, 6:47 AM

#

glossy crown I do not have exp with that library

Fr anyways is there much diff in perf or smth ??

glossy crown Mar 31, 2025, 6:47 AM

#

I was able to load and use the same model using vllm

patent cobalt Mar 31, 2025, 6:47 AM

#

why ! tell me the reason for uninstalling the jupyterlab packege for
Clean up the environment by removing an unused package ?

glossy crown Mar 31, 2025, 6:47 AM

#

qwen 2.5 32b it

wraith sparrow Mar 31, 2025, 6:47 AM

#

Accelerate n transformer ?

drifting star Mar 31, 2025, 6:48 AM

#

OutOfMemoryError: CUDA out of memory. Tried to allocate 540.00 MiB. GPU 1 has a total capacity of 22.28 GiB of which 511.38 MiB is free. Process 4171 has 21.77 GiB memory in use. Of the allocated memory 21.56 GiB is allocated by PyTorch, and 1.21 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation. See documentation for Memory Management (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)
add Codeadd Markdown

CUDA semantics — PyTorch 2.6 documentation

A guide to torch.cuda, a PyTorch module to run CUDA operations

glossy crown Mar 31, 2025, 6:48 AM

#

wraith sparrow Accelerate n transformer ?

accelerate is used to speed up transformer

drifting star Mar 31, 2025, 6:48 AM

#

I usually get this kind of error

wraith sparrow Mar 31, 2025, 6:48 AM

#

glossy crown I was able to load and use the same model using vllm

btw do u use vllms in/for comps too ?

wispy meadow Mar 31, 2025, 6:48 AM

#

drifting star OutOfMemoryError: CUDA out of memory. Tried to allocate 540.00 MiB. GPU 1 has a ...

VRAM issue, I got this same error a lot of times while tweaking batch sizes

#

@glossy crown here

glossy crown Mar 31, 2025, 6:49 AM

#

drifting star OutOfMemoryError: CUDA out of memory. Tried to allocate 540.00 MiB. GPU 1 has a ...

do you wanna fine tune or do you want for inference?

drifting star Mar 31, 2025, 6:49 AM

#

But the 2 of the GPU memory is not used. I get this VRAM issue

glossy crown Mar 31, 2025, 6:49 AM

#

wispy meadow <@763292785293393920> here

how many tokens are your stories?

wispy meadow Mar 31, 2025, 6:49 AM

#

glossy crown how many tokens are your stories?

How do I check, I mean the stories vary in sizes

glossy crown Mar 31, 2025, 6:50 AM

#

wispy meadow How do I check, I mean the stories vary in sizes

max token limit

drifting star Mar 31, 2025, 6:50 AM

#

for now inference but I want for fine tuning also. This I get not during inference but when I am loading the model

wispy meadow Mar 31, 2025, 6:50 AM

#

oh I think its 512

#

atleast thats what I used in the tokenizer

glossy crown Mar 31, 2025, 6:51 AM

#

drifting star for now inference but I want for fine tuning also. This I get not during inferen...

maybe try dtype="half"?

glossy crown Mar 31, 2025, 6:52 AM

#

wispy meadow atleast thats what I used in the tokenizer

can you share the line where you get oom?

wraith sparrow Mar 31, 2025, 6:52 AM

#

wraith sparrow btw do u use vllms in/for comps too ?

@glossy crown

glossy crown Mar 31, 2025, 6:53 AM

#

wraith sparrow <@763292785293393920>

vllm is better than transformers for inference

wraith sparrow Mar 31, 2025, 6:53 AM

#

glossy crown vllm is better than transformers for inference

And for training

wispy meadow Mar 31, 2025, 6:53 AM

#

glossy crown can you share the line where you get oom?

Im not getting OOM right now, but I do get it when I change the batchsize per device in the training arguments to higher values and then try to train.

glossy crown Mar 31, 2025, 6:53 AM

#

wraith sparrow And for training

vllm is not for training

wraith sparrow Mar 31, 2025, 6:53 AM

#

glossy crown vllm is not for training

Oh right 💀😅

glossy crown Mar 31, 2025, 6:53 AM

#

wispy meadow Im not getting OOM right now, but I do get it when I change the batchsize per de...

that is fine

#

it means that many batches can't fit at once

#

use a smaller batch size

wraith sparrow Mar 31, 2025, 6:54 AM

#

glossy crown vllm is not for training

Accelerate is ?

glossy crown Mar 31, 2025, 6:54 AM

#

yeah

wispy meadow Mar 31, 2025, 6:54 AM

#

Yes I am, but the training is painfully slow and Losses don't converge. Maybe I should reduce max_tokens?

glossy crown Mar 31, 2025, 6:55 AM

#

glossy crown yeah

it is used in transformers training module

wraith sparrow Mar 31, 2025, 6:55 AM

#

Oh right

glossy crown Mar 31, 2025, 6:55 AM

#

wispy meadow Yes I am, but the training is painfully slow and Losses don't converge. Maybe I ...

what is the min batch size where error occurs?

wispy meadow Mar 31, 2025, 6:55 AM

#

glossy crown what is the min batch size where error occurs?

4

glossy crown Mar 31, 2025, 6:56 AM

#

glossy crown what is the min batch size where error occurs?

in my experience convergence is slow at beginning, then it speeds up and then loss stagnates/increases slightly

glossy crown Mar 31, 2025, 6:56 AM

#

wispy meadow 4

check with your train as well as test data

#

does it not decrease in train as well?

wispy meadow Mar 31, 2025, 6:57 AM

#

Yeah, tweaking any of them to 4, causes the CUDA error

wispy meadow Mar 31, 2025, 6:57 AM

#

glossy crown does it not decrease in train as well?

It doesn't

glossy crown Mar 31, 2025, 6:57 AM

#

how long did you run it for?

wispy meadow Mar 31, 2025, 6:57 AM

#

I mean the losses were 6.6+ then came down to 6.4+ then went to 6.6+

glossy crown Mar 31, 2025, 6:57 AM

#

wispy meadow I mean the losses were 6.6+ then came down to 6.4+ then went to 6.6+

for test?

wispy meadow Mar 31, 2025, 6:57 AM

#

glossy crown how long did you run it for?

2 epochs, 5000 steps in each.

wispy meadow Mar 31, 2025, 6:57 AM

#

glossy crown for test?

6.6+ to 6.3+

glossy crown Mar 31, 2025, 6:58 AM

#

oh

#

weird

#

are you using lora?

wispy meadow Mar 31, 2025, 6:58 AM

#

Right now no ig

glossy crown Mar 31, 2025, 6:58 AM

#

wispy meadow Right now no ig

without lora convergence is really slow

#

I never really tried without lora so can't really say how much slower it is

#

but it is pretty slow

wispy meadow Mar 31, 2025, 6:59 AM

#

Okay i'll try Lora then.

glossy crown Mar 31, 2025, 7:01 AM

#

@drifting star did it fix it?

wraith sparrow Mar 31, 2025, 7:03 AM

#

glossy crown <@1355790369186381985> did it fix it?

Bro she was trying for aimo 💀

#

Isn't it over ?

glossy crown Mar 31, 2025, 7:09 AM

#

wraith sparrow Isn't it over ?

not yet, also use they/them unless you know their gender

wraith sparrow Mar 31, 2025, 7:10 AM

#

Bro it's obv

#

From name

glossy crown Mar 31, 2025, 7:10 AM

#

wraith sparrow From name

harpreet is a gender neutral name

wraith sparrow Mar 31, 2025, 7:10 AM

#

And code

#

-psychology

wraith sparrow Mar 31, 2025, 7:12 AM

#

glossy crown not yet, also use they/them unless you know their gender

How many days left ?

glossy crown Mar 31, 2025, 7:12 AM

#

change your views bro, it is gonna make you suffer later in life

wraith sparrow Mar 31, 2025, 7:13 AM

#

glossy crown change your views bro, it is gonna make you suffer later in life

Change ur views 😄

glossy crown Mar 31, 2025, 7:13 AM

#

you can never infer gender from someone's coding style

wraith sparrow Mar 31, 2025, 7:13 AM

#

glossy crown you can never infer gender from someone's coding style

Lol there r too many factors

#

Like date of joining dc n kaggle too n username

#

In simple words I said behaviour aggregate of all the factors

#

I'm not even learning osint though

glossy crown Mar 31, 2025, 7:19 AM

#

you are just trying to normalize exclusivity and it is not funny

vestal halo Mar 31, 2025, 7:23 AM

#

Hey guys, I'm in Kaggle youtube channel, and I don't see live stream yet, any idea when it'll start ?

foggy crescent Mar 31, 2025, 7:24 AM

#

vestal halo Hey guys, I'm in Kaggle youtube channel, and I don't see live stream yet, any id...

10 pm IST

vestal halo Mar 31, 2025, 7:24 AM

#

foggy crescent 10 pm IST

Oh cool Thanks

raw pumice Mar 31, 2025, 7:30 AM

#

hie i am new in to AI ML Domain , can someone provide good resources to start my journey in thsi domain

wraith sparrow Mar 31, 2025, 7:32 AM

#

glossy crown you are just trying to normalize exclusivity and it is not funny

Ok i js guessed bro

rapid stratus Mar 31, 2025, 7:51 AM

#

Hi guys, I'm from India, When it will start?

quaint sable Mar 31, 2025, 8:03 AM

#

rapid stratus Hi guys, I'm from India, When it will start?

Hi there, the same confusion here, I have signed for 5 days course and today is the 1st day, I am expecting live video.but what time will it start?

drifting star Mar 31, 2025, 8:16 AM

#

wraith sparrow Bro it's obv

It is not abvious from name. It is a gender neutral name. Yes I am a female but many men in our community share the same name. We belong to sikh community and maximum names are gender neutral in our community. And yes I was trying for AIMO and it is getting over tomorrow but that ain't the point. The point is to learn. And I want to understand why 2 of my GPUs are underutilized

tough heron Mar 31, 2025, 9:02 AM

#

Hello everyone, is there a meeting now?

earnest crescent Mar 31, 2025, 9:10 AM

#

rapid stratus Hi guys, I'm from India, When it will start?

On the registration page there is count down timing showing, and right now it is [Starting in 03 hour and 49 minutes]

glad ibex Mar 31, 2025, 9:20 AM

#

drifting star 2GB should not take few hours. I have never used API to upload the data. Also tr...

It's 10+ hours already and no movement, I tried to create a second dataset with the same data, but it's doing exactly the same.
😦
@verbal crest
Sorry for bothering, but it seems I found a bug with dataset creation with an API
Dataset link is https://www.kaggle.com/datasets/dremovd/pump-fun-graduation-02-2025

glad nest Mar 31, 2025, 9:45 AM

#

HI. Can everyone help me how to summarize the white paper I downloaded in Notebook?

wraith sparrow Mar 31, 2025, 9:59 AM

#

glad nest HI. Can everyone help me how to summarize the white paper I downloaded in Notebo...

Use NotebookLM for that

sturdy pelican Mar 31, 2025, 10:08 AM

#

Is their any assignment for today?

drifting star Mar 31, 2025, 10:21 AM

#

I linked my kaggle but the welcome channel of kaggle still indicates that I have not linked my account. Further, I don't have any permissions to post on the channel. I tried to delink it once and again linked it but the result is same

drifting star Mar 31, 2025, 10:22 AM

#

glad ibex It's 10+ hours already and no movement, I tried to create a second dataset with ...

Dataset link is not accessible

rain chasm Mar 31, 2025, 10:23 AM

#

drifting star Mar 31, 2025, 10:23 AM

#

Also curious how could you keep your session active for 10 hours is it through API

wraith sparrow Mar 31, 2025, 10:24 AM

#

No

#

U save the notebook such way

#

I don't remember now

drifting star Mar 31, 2025, 10:25 AM

#

If you can recall and let me know that will be helpful

wraith sparrow Mar 31, 2025, 10:26 AM

#

drifting star If you can recall and let me know that will be helpful

Screenshot_2025-03-31-15-56-21-063-edit_com.google.android.googlequicksearchbox.jpg

#

Ig it's right

#

U js simply save it 😅

#

It automatically runs till max 12 hrs

drifting star Mar 31, 2025, 10:27 AM

#

glossy crown maybe try dtype="half"?

Thank you I will try 😀

drifting star Mar 31, 2025, 10:28 AM

#

wraith sparrow

Thank you 🙂

rugged dune Mar 31, 2025, 10:33 AM

#

Hey, Where will be the tomorrows live stream?

#

In which platform?

glossy crown Mar 31, 2025, 10:35 AM

#

drifting star Also curious how could you keep your session active for 10 hours is it through A...

it will not shut down unless you keep the session idle for 40 mins

#

if you need to keep something running and train for many hours you can save and run, but make sure to pickle the output

#

otherwise you would lose it

glad ibex Mar 31, 2025, 10:36 AM

#

drifting star Dataset link is not accessible

Yes, but it should be for Kaggle stuff

It's private
Dataset is not "finished", the status would prevent anybody to see it

New info:
I reloaded in CSV and it worked instantly

misty violet Mar 31, 2025, 10:55 AM

#

Hi. I am getting "ClientError: 429 RESOURCE_EXHAUSTED" when I try to run a cell. This is after execution of some 4/5 cells above it. Anyone has any pointers ?

#

Ok. retried it after a minute or so and it worked. I guess the server resource(s) were really exhusted.

wise holly Mar 31, 2025, 11:21 AM

#

So, for day 1 what we have to do ? Is there any assignment we have to complete

glad nest Mar 31, 2025, 11:30 AM

#

wise holly So, for day 1 what we have to do ? Is there any assignment we have to complete

you should have the instruction and the assignments in your email

thick viper Mar 31, 2025, 12:40 PM

#

how do i submit assignment

elder canopy Mar 31, 2025, 12:51 PM

#

In the Evaluation and Structured Data codelab, we explored different ways to assess LLM responses using autoraters and structured output formatting. Given the latest improvements in Gemini 2.0, what are the best practices for optimizing prompt structures to achieve higher accuracy and consistency in structured outputs, especially for tasks requiring reasoning, summarization, or extraction from unstructured text?

Additionally, are there any specific evaluation techniques recommended for ensuring that structured outputs remain reliable across diverse datasets?

wise holly Mar 31, 2025, 1:22 PM

#

How do know that I completed today wrk

honest gust Mar 31, 2025, 1:25 PM

#

glad nest you should have the instruction and the assignments in your email

i havent recieved it yet is there something i can do about it ?

glad nest Mar 31, 2025, 1:25 PM

#

what requirements file or package name I should command pip to install?

glad nest Mar 31, 2025, 1:28 PM

#

honest gust i havent recieved it yet is there something i can do about it ?

If you look in discord someone put the assignment instruction. Today’s Assignments

Complete the Intro Unit – “Foundational Large Language Models & Text Generation”:
Listen to the summary podcast episode (https://www.youtube.com/watch?v=Na3O4Pkbp-U&list=PLqFaTIg4myu_yKJpvF8WE2JfaG5kGuvoE&index=1) for this unit.
To complement the podcast, read the “Foundational Large Language Models & Text Generation” whitepaper (https://www.kaggle.com/whitepaper-foundational-llm-and-text-generation).

Complete Unit 1 – “Prompt Engineering”:
Listen to the summary podcast episode (https://www.youtube.com/watch?v=CFtX0ZyLSAY&list=PLqFaTIg4myu_yKJpvF8WE2JfaG5kGuvoE&index=2) for this unit.
To complement the podcast, read the “Prompt Engineering” whitepaper (https://www.kaggle.com/whitepaper-prompt-engineering).
Complete these codelabs on Kaggle:
Prompting fundamentals - https://www.kaggle.com/code/markishere/day-1-prompting
Evaluation and structured data - https://www.kaggle.com/code/markishere/day-1-evaluation-and-structured-output
Make sure you phone verify (https://www.kaggle.com/settings) your account before starting, it's necessary for the codelabs.
Want to have an interactive conversation (https://support.google.com/notebooklm/answer/15731776?hl=en&ref_topic=14272601&sjid=16012842710481496794-EU) ? Try adding the whitepapers to NotebookLM (https://notebooklm.google.com/?original_referer=https:%2F%2Fwww.google.com%23&pli=1)

YouTube

Kaggle

Whitepaper Companion Podcast - Foundational LLMs & Text Generation

Read the whitepaper here: https://www.kaggle.com/whitepaper-foundational-llm-and-text-generation
Learn more about the 5-Day Generative AI Intensive: https://rsvp.withgoogle.com/events/google-generative-ai-intensive_2025q1

Introduction:

The advent of Large Language Models (LLMs) represents a seismic shift in the world of artificial intelligen...

▶ Play video

YouTube

Kaggle

Whitepaper Companion Podcast - Prompt Engineering

Read the whitepaper here: https://www.kaggle.com/whitepaper-prompt-engineering
Learn more about the 5-Day Generative AI Intensive: https://rsvp.withgoogle.com/events/google-generative-ai-intensive_2025q1

Introduction:

When thinking about a large language model input and output, a text prompt (sometimes accompanied by other modalities such as ...

▶ Play video

Day 1 - Prompting

Explore and run machine learning code with Kaggle Notebooks | Using data from No attached data sources

tulip venture Mar 31, 2025, 1:31 PM

#

I am getting an error for "Install SDK", "Cell In[5], line 1
pip uninstall -qqy jupyterlab # Remove unused packages from Kaggle's base image that conflict
^
SyntaxError: invalid syntax"

glad nest Mar 31, 2025, 1:32 PM

#

Hi everyone. what requirements file or package name I should command pip to install? as i get error

honest gust Mar 31, 2025, 2:13 PM

#

glad nest If you look in discord someone put the assignment instruction. Today’s Assignme...

thankyou so much for letting me know

#

what is the use for api key that is generated by ai studio ?

glad nest Mar 31, 2025, 2:18 PM

#

why this error has come up? ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
jupyterlab-lsp 3.10.2 requires jupyterlab<4.0.0a0,>=3.1.0, but you have jupyterlab 4.3.6 which is incompatible.

drifting oxide Mar 31, 2025, 2:23 PM

#

im currently doing the intro to ml course and in the first exercise itself, the codes on the notebook don't work? it keeps on loading what should i do? i tried in both chrome and edge

honest gust Mar 31, 2025, 2:24 PM

#

glad nest why this error has come up? ERROR: pip's dependency resolver does not currently ...

your jupyter lab version 4.3.6 is not compatible with jupyterlab-lsp 3.10.2 so you can either downgrade your jupyter lab version that is use this pip install "jupyterlab<4.0.0" to install the compatible verion or create a virtual env to work in

honest gust Mar 31, 2025, 2:24 PM

#

drifting oxide im currently doing the intro to ml course and in the first exercise itself, the ...

try restarting the kernel

drifting oxide Mar 31, 2025, 2:25 PM

#

no luck with that

wispy meadow Mar 31, 2025, 2:29 PM

#

is it, more efficient to run an unsupervised Learning on GPT2 to finetune for generating story based on a context, or is using prompt as the input and story as the target to do supervised learning the better choice?

#

I did the latter, but my losses failed to converge. Also, the training times for such stuff, is absurdly high.

glad nest Mar 31, 2025, 2:31 PM

#

when i downgrade jupyter version to<4.0.0this error came up: An error occurred while committing kernel: Concurrenc yViolation Sequence number must match Draft record: KernelId=79702580, ExpectedSequence=4, ActualSequence=2, AuthorUserId=25935810

#

Also gave me a warning WARNING: Skipping jupyterlab as it is not installed.

#

although it mentions successfully installed

graceful axle Mar 31, 2025, 3:02 PM

#

drifting oxide im currently doing the intro to ml course and in the first exercise itself, the ...

show ss ?

abstract locust Mar 31, 2025, 3:47 PM

#

hi, how we can submit tasks we already done?

broken mason Mar 31, 2025, 4:49 PM

#

once we run the notebooks on kaggle, are we supposed to submit them ?

elder canopy Mar 31, 2025, 5:02 PM

#

drifting oxide im currently doing the intro to ml course and in the first exercise itself, the ...

You have to choose option on top right side that edit a copy. then you can run the codes.

misty violet Mar 31, 2025, 5:08 PM

#

broken mason once we run the notebooks on kaggle, are we supposed to submit them ?

No we are not supposed to. The submission is for the project at the end of 5 days. See the announcements ... everything else is optional based on how much time you have.

#

copy from announcements -- Hi all - To make things simple and clear for everyone, we will award the course badge to everyone who participates in the capstone project.

All other work is optional and can be done at any time. If you're short on time, we recommend prioritising the podcasts and livestream and then the codelabs. The capstone project will use knowledge you learn from the codelabs, so we recommend running through them but you don't have to submit the daily labs.

There is no time limit for podcasts, whitepapers or codelabs, but doing them this week will give you a chance to discuss with the community.

jade cave Mar 31, 2025, 5:22 PM

#

Hi everyone! I am a student from Iran and it seems that Iranians can not verify their phones due to not receiving any messages. I would appreciate any guidance with this.

umbral rune Mar 31, 2025, 5:23 PM

#

I don't get the code for the verification of Kaggal.. does anyone have the same issue

tepid lichen Mar 31, 2025, 5:34 PM

#

I committed both promoting and Evaluation assignment. How do I know that it's completed?

elder canopy Mar 31, 2025, 5:35 PM

#

Can any one tell me that what project we have to do at last .

obtuse storm Mar 31, 2025, 5:48 PM

#

So what should I do after completing the codelabs again?

vocal bison Mar 31, 2025, 6:03 PM

#

Is the livestream happening soon, I'm on the YT channel but don't see stream happening?

tepid lichen Mar 31, 2025, 6:06 PM

#

Yeah. It will stream in 12 mins or so

mortal surge Mar 31, 2025, 6:08 PM

#

What are the key differences between Generative AI models and Discriminative AI models?

frozen maple Mar 31, 2025, 6:16 PM

#

Is the youtube livestream from 4 months ago or is it truly happening now?

distant rune Mar 31, 2025, 6:21 PM

#

Anant did not get a chance to share his views when everyone else shared their views

dim river Mar 31, 2025, 6:22 PM

#

How does Gemini 2.0’s multimodal capability compare to GPT-4 in image processing?

visual tusk Mar 31, 2025, 6:22 PM

#

Guys, I know this question is kind of funny, but I couldn’t find the livestream video on the Kaggle page. The only video I found was published four months ago. That’s why I couldn’t participate in the last test.

open surge Mar 31, 2025, 6:24 PM

#

frozen maple Is the youtube livestream from 4 months ago or is it truly happening now?

its not new

pastel tide Mar 31, 2025, 6:25 PM

#

Do you see it prompting the user for different styles/formats of output as for structured output which we see deployed now by GPT and Claude?

dim river Mar 31, 2025, 6:27 PM

#

What’s the best way to evaluate the coherence of Gemini API outputs when working with this dataset?

proven burrow Mar 31, 2025, 6:39 PM

#

Hello , Why don’t I see the first day ?

vagrant quartz Mar 31, 2025, 6:42 PM

#

Temperature

spare bridge Mar 31, 2025, 7:01 PM

#

I could not see todays video because of the timing when will it be uploaded and can you send link?

trim relic Mar 31, 2025, 7:02 PM

#

should we cover the whole white paper on foundational LLM?

drifting star Mar 31, 2025, 7:14 PM

#

tulip venture I am getting an error for "Install SDK", "Cell In[5], line 1 pip uninstall -...

add exclamation mark in front of pip and try uninstalling

fallow light Mar 31, 2025, 9:33 PM

#

آیا برای روز اول تمرین یا کویزی داده شده که باید انجام وتحویل بدهیم؟

noble escarp Apr 1, 2025, 1:38 AM

#

How I can know my assignment are submitter, how how I can submit?

wispy timber Apr 1, 2025, 1:39 AM

#

Will there be another course in a few months? Maybe over the summer, so that students won't have to also balance school and homework?

sturdy pelican Apr 1, 2025, 3:27 AM

#

U can learn after some time, these all videos and livestream are available on the kaggle yt channel

wispy timber Apr 1, 2025, 3:27 AM

#

yeahh but I won't be able to do the assignments and capstone project

blissful sentinel Apr 1, 2025, 3:50 AM

#

how to satup day 2

placid elk Apr 1, 2025, 5:43 AM

#

How would I know if the assignment is done or not?

tepid shuttle Apr 1, 2025, 6:04 AM

#

code:

warning/error:

as i ran the code in cell, got he warnings in output then this kernel restarting pop-up. and this problem just goes on.

Using TPU VM v3-8, huggingface library used is the latest version

gloomy skiff Apr 1, 2025, 6:23 AM

#

How will i get to know whether I have completed my day 1 task will i get some notification or will it reflect on my dashboard or home page

tepid shuttle Apr 1, 2025, 6:39 AM

#

gloomy skiff How will i get to know whether I have completed my day 1 task will i get some no...

ask these questions on #5dgai-q-and-a .

vestal sky Apr 1, 2025, 6:42 AM

#

Is the assignment easy ?

wraith sparrow Apr 1, 2025, 7:21 AM

#

No nothing's easy

hardy basalt Apr 1, 2025, 7:45 AM

#

why does my kaggle notebook almost always get stuck on running when i press run all? like my very first cell which is just imports gets stuck on running

#

this only really happens if I press "runall" though if I run each cell individually it works and doesnt get stuck

blissful sentinel Apr 1, 2025, 7:49 AM

#

how to Complete assignments

hard sleet Apr 1, 2025, 8:04 AM

#

!!! For the 5 days GenAI Course this is not the right place to ask questions. Please refer to 5dgai-question-forum. This is the general QA Discord chanel of Kaggle.

sullen osprey Apr 1, 2025, 8:08 AM

#

Dear Kaggle Support Team,
I hope you are well. I recently registered on Kaggle to further my education and enhance my skills in data science. However, I encountered an obstacle during the phone number verification process for accessing codelabs; my verification is being blocked due to sanctions imposed on Iran.
I understand that U.S. sanctions target commercial and political activities, yet educational and research initiatives are generally exempt under U.S. law. Restrictions that prevent access to purely educational resources—like Kaggle codelabs—could be seen as conflicting with these legal exemptions. In fact, U.S. regulations typically allow for academic and humanitarian exchanges, and applying these restrictions in this educational context may run counter to that policy.
Could you please review this matter and consider lifting the phone verification restriction on my account so I can fully participate in Kaggle’s educational opportunities? I believe this change supports the spirit of U.S. law, which aims to foster academic development and innovation.

viscid gate Apr 1, 2025, 9:16 AM

#

How to get instructions for this 5 Days course? thanks

clever cliff Apr 1, 2025, 11:10 AM

#

viscid gate How to get instructions for this 5 Days course? thanks

#5dgai-announcements or emails, they will tell you what to learn each day

visual sun Apr 1, 2025, 1:01 PM

#

Hi, I'm a developer in Korea. I started studying AI this time, and what I want is to become a deep learning engineer in computer vision. So, I'm learning the basics from machine learning. How much should I study machine learning to move on to deep learning? Now I'm taking a look at the book "Introduction to Machine Learning with Python" and practicing it. Please tell me the direction for deep learning in the vision field

charred hull Apr 1, 2025, 1:03 PM

#

i recived day2 yesterday not day 1 anyone like me ?

wraith sparrow Apr 1, 2025, 1:28 PM

#

visual sun Hi, I'm a developer in Korea. I started studying AI this time, and what I want i...

.

visual sun Apr 1, 2025, 1:30 PM

#

wraith sparrow [.](https://www.manning.com/books/deep-learning-for-vision-systems)

omg i got a 404 error

wraith sparrow Apr 1, 2025, 1:31 PM

#

I'm bit disappointed myself

#

They don't ve that name book yet

#

Also this one's latest edition I'm not sure

dense parcel Apr 1, 2025, 1:32 PM

#

visual sun Apr 1, 2025, 1:35 PM

#

wraith sparrow I'm bit disappointed myself

I checked the revised link. I was thinking about looking at the book, but thank you for the recommendation.

wraith sparrow Apr 1, 2025, 1:35 PM

#

visual sun I checked the revised link. I was thinking about looking at the book, but thank ...

Lol book is really great

#

Never underestimate a Manning book

#

Even a decade old one would be better than some latest edition books of Packt publication

wraith sparrow Apr 1, 2025, 1:44 PM

#

visual sun I checked the revised link. I was thinking about looking at the book, but thank ...

latest one on it lil gen ai tho

visual sun Apr 1, 2025, 1:46 PM

#

wraith sparrow [latest one on it lil gen ai tho](https://cactus-girl-e53.notion.site/Generative...

Actually, i'm bad at eng yet. So I'm looking for a translated version, but you recommended me before, but I don't have a translated version of this book yet. I'll look at the former book first, and then I'll try this book in English! Thx

winged orbit Apr 1, 2025, 4:51 PM

#

tepid lichen I committed both promoting and Evaluation assignment. How do I know that it's co...

could you tell me how did you complete the prompting and evaluation assignment/ just running it in kaggle right/

peak sleet Apr 1, 2025, 5:30 PM

#

Are there ways for us to effectively create AI agents for embedding evaluation or train the agents through reinforcement learning for embedding optimization?

azure trellis Apr 1, 2025, 5:57 PM

#

Good day fellow kagglers. Please how do I go from here :df_train = create_embeddings(df_train)
df_test = create_embeddings(df_test)

#

I have not been able to run the two lines of codes and the subsequent ones successfully. I need your insight

wraith sparrow Apr 1, 2025, 6:08 PM

#

peak sleet Are there ways for us to effectively create AI agents for embedding evaluation...

Yes, there are ways to create AI agents for embedding evaluation and optimization. Here are some approaches:

Embedding Evaluation:

Learning to Rank: Train an agent to rank embeddings based on their quality, using reinforcement learning or supervised learning.
Embedding Critic: Design an agent that critiques embeddings based on specific metrics, such as similarity or clustering quality.
Adversarial Evaluation: Train an agent to generate adversarial examples that challenge the embedding's robustness.

Embedding Optimization through Reinforcement Learning:

Reward-based Optimization: Define a reward function that encourages the agent to optimize embeddings for a specific task, such as clustering or classification.
Policy Gradient Methods: Use policy gradient methods, like REINFORCE, to optimize the embedding parameters directly.
Actor-Critic Methods: Employ actor-critic methods, like Deep Deterministic Policy Gradients (DDPG), to optimize embeddings using both value and policy functions.

Other Approaches:

Generative Adversarial Networks (GANs): Use GANs to generate embeddings that are optimized for specific tasks.
Meta-Learning: Train an agent to learn how to optimize embeddings for a variety of tasks, using meta-learning techniques like Model-Agnostic Meta-Learning (MAML).
Evolutionary Algorithms: Employ evolutionary algorithms, like genetic algorithms or evolution strategies, to optimize embeddings.

These approaches can be used to create AI agents that effectively evaluate and optimize embeddings for various tasks.

wraith sparrow Apr 1, 2025, 6:15 PM

#

dim river What’s the best way to evaluate the coherence of Gemini API outputs when working...

Evaluating the coherence of Gemini api outputs involve assessing the relevance, consistency, and overall quality of the generated text. Here r some methods:

Evaluation Metrics:

Perplexity: Use the evaluate method from HF Transformers to calculate perplexity.
BLEU Score: Utilize the sacrebleu library, which is integrated with HF Transformers, to calculate BLEU scores.
ROUGE Score: Use the rouge-score library, which is also integrated with HF Transformers, to calculate ROUGE scores.

Coherence Evaluation:

Language Modeling Evaluation: Use the language-modeling-evaluation pipeline from HF Transformers to evaluate coherence.
Text Classification Evaluation: Utilize the text-classification-evaluation pipeline to evaluate coherence in text classification tasks.

Model-specific Evaluation:

Autoencoder-based Models: Evaluate coherence using reconstruction loss or perplexity for autoencoder-based models like BERT or RoBERTa.
Generative Models: Evaluate coherence using metrics like BLEU or ROUGE for generative models like T5 or Bart.

Example Code:
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
from transformers import evaluate

Load model and tokenizer
model = AutoModelForSeq2SeqLM.from_pretrained("t5-base")
tokenizer = AutoTokenizer.from_pretrained("t5-base")

Evaluate perplexity
perplexity = evaluate(model, tokenizer, "wikitext-2-raw-v1", "perplexity")
print(perplexity)
This code evaluates the perplexity of the T5-base model on the WikiText-2 dataset.

pine anvil Apr 1, 2025, 10:52 PM

#

Hello there, I am a bit stuck. Are we supposed to work in the same Kaggle notebook?

runic monolith Apr 1, 2025, 11:43 PM

#

Hello fellas! I've been trying to verify my kaggle account via phone number but to no avail. I've tried over and over again sending code to my number but still no code shows up. Please, any way around this?

bright wyvern Apr 2, 2025, 1:50 AM

#

Hello there, should i invest myself in Claris Filemaker? i am being transition to work with our company CTO and Filemaker is part of our internal process. Working with CTO over the years led me closer to data world.

whole delta Apr 2, 2025, 9:29 AM

#

@quick yarrow moving here not to flood the other channel

#

i didn't say yolo is the best model, it does perform well though for your task it will need some additional training

#

do you have a dataset of images that show where existing defects are? you need to show the model what it's trying to locate

#

let me pull up an example

#

#

the task here is segmenting parts of the spine

#

the image on the right (aka mask or label) is already existing in the dataset

#

thus you can train your model to replicate what is already in your data

#

#

example of mask vs what a model generated

#

does your data have the masks? if not training a CNN can be difficult

#

also this is how a segmentation model (in this case Unet) segments images

#

yolo doesn't output masks like this, instead it generates bounding boxes

#

given the structure of a semiconductor i would assume something like yoloX is more fitting for your task, it's also less computationally intensive to train compared to U-net especially if your images are hi-res

#

i'm not familiar with how to implement yolo though but afaik it's not too hard

#

@quick yarrow please don't send me friend requests, we can just discuss this here

timid herald Apr 2, 2025, 10:15 AM

#

whole delta <@1326151195038973992> please don't send me friend requests, we can just discuss...

Thank you.

timid herald Apr 2, 2025, 11:25 AM

#

whole delta <@1326151195038973992> please don't send me friend requests, we can just discuss...

well. I just learned kaggle learn course. could you tell me about the best study material?

#

Thank you.

slate arrow Apr 2, 2025, 12:50 PM

#

Day 3 (Generative Agents)
Welcome to Day 3. https://www.kaggle.com/learn-guide/5-day-genai

Learn to build sophisticated AI agents by understanding their core components and the iterative development process.
The code labs cover how to connect LLMs to existing systems and to the real world. Learn about function calling by giving SQL tools to a chatbot, and learn how to build a LangGraph agent that takes orders in a café.

Day 3 Assignments:

Complete Unit 3: “Generative Agents”, which is:

[Optional] Listen to the summary podcast episode for this unit (created by NotebookLM).
Read the “Generative AI Agents” whitepaper.
[Optional] Read a case study which talks about how a leading technology regulatory reporting solutions provider used an agentic generative AI system to automate ticket-to-code creation in software development, achieving a 2.5x productivity boost.
Complete these code labs on Kaggle:

Talk to a database with function calling
Build an agentic ordering system in LangGraph
Watch the YouTube livestream recording. Paige Bailey will be joined by expert speakers from Google - Steven Johnson, Julia Wiesinger, Alan Blount, Patrick Marlow, Wes Dyer, Anant Nawalgaria to discuss generative AI agents.

5-Day Gen AI Intensive Course with Google Learn Guide

bright turret Apr 2, 2025, 2:47 PM

#

How to check the Progress whether I completed the Day 1???

robust tinsel Apr 2, 2025, 4:27 PM

#

Hi everyone, I have a question on embeddings!

I was working on a RAG search project where the source document (a book) was ancient and hence the language used was antiquated. I was getting very poor performance with a standard embedding model and I suspect the language in the book was just very different to what the embedding model was trained on.

Is there a way in which I can salvage this situation and calculate useful embeddings? Thanks in advance. 🙂

neat wyvern Apr 2, 2025, 5:02 PM

#

Hello everyone, quick question. Is anyone experiencing issues with Notebook? Its not loading any cells for me.

floral carbon Apr 2, 2025, 6:19 PM

#

Is there a NotebookLLM API that can enable me leverage its features from within an app?

topaz apex Apr 2, 2025, 7:04 PM

#

Pls how do I get this code labs, is blurred on the live training and is a capstone project to be submitted when?

whole delta Apr 2, 2025, 9:56 PM

#

robust tinsel Hi everyone, I have a question on embeddings! I was working on a RAG search pro...

That's an interesting case really. Are you splitting parts of the doc into chunks for use within the vector space? Not sure if semantic chunking would work here if your LLM struggles to parse the language, but other options exist

#

my thought process would be that if you split the doc into smaller, more manageable chunks, you'll have an easier time searching within the vector space during retrieval

#

The other route would be to test different embedding models though I doubt that will bring much improvement, unless it's anything overly specific

#

Another idea that comes to mind, though not sure how feasible given the increased overhead, would be to have an interpreter LLM convert from the antiquated to a more modern version of your language (assuming this is structured similar to trad/simplified Chinese, ancient/modern Greek etc) and use the modern versions for embeddings and retrieval and the original text in your doc store and subsequently the answering LLM prompt

#

Do keep me updated with your findings if possible that sounds really interesting

neon gale Apr 2, 2025, 11:13 PM

#

Does the server guide just disappear when you complete it all (used to appear near Chanels & Roles and rules in left hand side of desktop app.

obtuse storm Apr 3, 2025, 1:48 AM

#

Any ideas where can I get EHR data?

round prawn Apr 3, 2025, 4:23 AM

#

kaggle lookslike crashed

#

kaggle account crashed

unreal grove Apr 3, 2025, 6:14 AM

#

It's Naveed from Pakistan
I had a question
That, if I wanna become a Gen Ai developer and get a job as soon as possible
So, in your opinion which fields should I focus

jovial spruce Apr 3, 2025, 7:43 AM

#

Subject: Seeking Guidance on Developing a Generative AI Application for Video SEO Optimization

I am initiating a project to develop a Generative AI application with the goal of optimizing video content for search engine visibility. The intended workflow involves users uploading video files, which the AI agent will then analyze to generate key SEO elements. Specifically, the application should:

Extract relevant information from the video content.
Generate SEO-optimized titles, adhering to a maximum length of 60 characters.
Suggest relevant hashtags (less than 10).
Identify pertinent YouTube keywords.
Identify relevant YouTube long-tail keywords.
I am seeking insights and recommendations from the community on the optimal approach to building this AI agent. Specifically, I would appreciate guidance on:

Recommended AI architectures and models suitable for this type of video content analysis and text generation.
Key considerations for data processing and feature extraction from video content to inform the AI model.
Strategies for training the AI model to generate accurate and effective SEO elements within the specified constraints (character limits, hashtag count).
Potential challenges and best practices to consider during the development process.
Thank you for your expertise and assistance in getting this project off the ground.

robust tinsel Apr 3, 2025, 9:03 AM

#

whole delta Another idea that comes to mind, though not sure how feasible given the increase...

This is a really good idea. Either I could look for a way to translate it to a more modern version or just download a contemporary translation of that book. Thanks Kostas!
I’ll give it a shot this weekend 🙂

whole delta Apr 3, 2025, 11:14 AM

#

robust tinsel This is a really good idea. Either I could look for a way to translate it to a m...

No probs, one note I would make would be to use the direct, untranslated version of the passage in the context window of the answering LLM, since translations almost always miss something from the original

foggy crescent Apr 3, 2025, 11:28 AM

#

what is this capstone project about?

balmy orbit Apr 3, 2025, 11:33 AM

#

what PC set-up do you guys usually use and think would be good for training models? Sorry if this is too general, I'm just getting started with kaggle and was thinking of getting a M4 mini

wraith sparrow Apr 3, 2025, 11:49 AM

#

balmy orbit what PC set-up do you guys usually use and think would be good for training mode...

Maybe better buy a good/descent gpu if u r considering offline training @Qafig

whole delta Apr 3, 2025, 11:50 AM

#

balmy orbit what PC set-up do you guys usually use and think would be good for training mode...

Kaggle offers you limited GPU runtime (30 hours a week, plenty if you're not training anything too intense) online

#

that and the ~50gb online storage should be enough for you to get started without spending a dime

#

training locally can be very expensive so do try to exhaust Kaggle first and see if it doesn't fit your needs before dropping that much money on a rig

wraith sparrow Apr 3, 2025, 12:05 PM

#

whole delta training locally can be very expensive so do try to exhaust Kaggle first and see...

what about that nvidia's blackwell ai computer for training models ?

whole delta Apr 3, 2025, 12:06 PM

#

i'm not familiar w that service or what it costs

#

if they have free compute units/tokens and it's easy enough for you to set up cool

#

but kaggle has the lowest barrier to entry imo

timid herald Apr 3, 2025, 4:29 PM

#

I'm going to develop population model using real-world data from a specific region.
The goal is to effectively transfer this model to a similar area while maintaining demographic and behavioral accuracy.
A key aspect of this role involves troubleshooting complex data inconsistencies and resolving modeling challenges to ensure high-quality outputs.
could you please give me the best algorithm and how to do?

covert rain Apr 3, 2025, 6:05 PM

#

Hello Everyone,
Where do I mark the assignments done?
Also, where can we get the capstone?

velvet summit Apr 3, 2025, 6:37 PM

#

covert rain Hello Everyone, Where do I mark the assignments done? Also, where can we get the...

@everyone

final sand Apr 3, 2025, 6:46 PM

#

Hello how to find groups for capstonea project

cedar gale Apr 3, 2025, 7:34 PM

#

my Day 4 1st codelab is still in queue.... Has anyone got through the queue successfully.... Mine is going on forever.... 😦

blissful creek Apr 3, 2025, 10:30 PM

#

Hi everyone, I am wondering if there is documentation/reference for the client class.

So far, codelabs showed me examples using

client.models.generate_content()
client.chats.create()
client.chats.send_message()
client.tunings.tune()
etc. Those examples are great, but where can I find a comprehensive list of functions that belong to the client class and arguments of those functions, not just example code snippets?

noble escarp Apr 4, 2025, 4:18 AM

#

Today I can't get any kind of mail for day 5 tasks and assignments

fast tapir Apr 4, 2025, 7:27 PM

#

Thanks for the clarification @verbal crest . Could you advise on best practices for sharing the call across channels, if allowed? I aprpeciate your help, thanks!

verbal crest Apr 4, 2025, 7:29 PM

#

fast tapir Thanks for the clarification <@1101209061871067309> . Could you advise on best p...

A single post in the general channel with no discord link is the appropriate amount of promotion.

wraith sparrow Apr 5, 2025, 2:48 AM

#

datasets vs polars library which is better

uneven pewter Apr 5, 2025, 7:56 AM

#

I have registered for a competition in diffrent I'd and my main I'd is diffrent I'm in a team what should I do?

gloomy nest Apr 5, 2025, 6:40 PM

#

Hey, is here someone that knows anything about efficiency optimization in pytorch? I would need some help.
Thank you!

acoustic leaf Apr 6, 2025, 5:58 AM

#

Hello. Is there anyone who has expertise in webscraping?

wraith sparrow Apr 6, 2025, 6:10 AM

#

Anyone ve experience with singlestore notebooks ?

wraith sparrow Apr 6, 2025, 6:29 AM

#

Anyone ve experience with uptrain ai ?

rustic drum Apr 6, 2025, 12:13 PM

#

i am not getting this (linting) in kaggel notebook, how can i enable it(though i tried finding i haven't got any option regarding that).

is there any plugin required?

graceful axle Apr 6, 2025, 1:51 PM

#

Screenshot_2025-04-06-19-20-40-60_572064f74bd5f9fa804b05334aa4f912.jpg

#

Guys please help me out

solid lynx Apr 6, 2025, 2:04 PM

#

graceful axle

Yes, I will advice to make submission on the 19th April, 2025

graceful axle Apr 6, 2025, 2:27 PM

#

What happens if we don't submit before the due date.

#

Can I submit the project after the deadline

Screenshot_2025-04-06-20-10-59-83_40deb401b9ffe8e1df2f1cc5ba480b12.jpg

visual moat Apr 6, 2025, 3:01 PM

#

@everyone Hello Everyone, I have a question regarding the code laps notebooks , I am doing all the code laps today, do we see any out files after running all notebooks? or just run is enough??
I know this is late to ask, does anyone know?, can anyone answer?

wraith sparrow Apr 6, 2025, 3:17 PM

#

run would be enough ig

golden sorrel Apr 6, 2025, 3:46 PM

#

How do people publish their datasets?
I'm curious to understand how people go about publishing datasets. Do they generate the data themselves, or do they collect it from somewhere else? If it's the latter, where do they usually get their data from?

haughty beacon Apr 6, 2025, 4:20 PM

#

Hello @everyone,

I should develop AI model who checked a seal and invoice to business. I need a dataset sample.
Can you help me to getting an example of sample please.

Thanks.

distant nymph Apr 6, 2025, 5:38 PM

#

Can anyone please tell if on kaggle they are asking me to submit the code notebook as well but in "submit Prediction" accepting only csv file so how i can submit the code on it.

astral charm Apr 6, 2025, 8:37 PM

#

Is there a deadline for completing the project we are supposed to do in KAGGLE??

mystic ocean Apr 6, 2025, 10:14 PM

#

I'm getting this error when I go to the competitions/gen-ai-intensive-course-capstone-2025q1 page. Is there a way around this?

jovial spruce Apr 7, 2025, 3:46 PM

#

astral charm Is there a deadline for completing the project we are supposed to do in KAGGLE??

April 20th is the deadline for the project

vapid nebula Apr 8, 2025, 4:37 AM

#

I’m trying to download a part of BNF onto NotebookLm but not able to. What I am doing wrong ?

winter furnace Apr 8, 2025, 6:02 AM

#

anyone here familiar with a workflow where i can do version control using git on a shared server?

for context, i have access to a shared GPU cluster which i can use for ML training, but ofc it's shared, i can't just have my private credentials on that server

so what's the best way of doing training on there while keeping my files under git?

also curious how to initiate training sessions, leave them running even after i close my laptop/close the SSH connection, periodically check training progress, etc.

ancient plover Apr 8, 2025, 6:25 AM

#

graceful axle Can I submit the project after the deadline

can you share this link

ancient plover Apr 8, 2025, 6:27 AM

#

covert rain Hello Everyone, Where do I mark the assignments done? Also, where can we get the...

got any answers?

radiant lodge Apr 8, 2025, 10:27 AM

#

Please, how do I join a team, for the capstone project?

hexed zephyr Apr 9, 2025, 2:50 AM

#

I want to become a ML engineer can anyone please guide me
right now i just done my matric exams and i am free for 3 months i want to start learning in this time period
so from where should i start and how i should continue my study

void pumice Apr 9, 2025, 5:33 PM

#

I have asked for help several times and I am really feeling unsupported in trying to complete my Kaggle project.🥹

quasi flume Apr 9, 2025, 7:35 PM

#

i dont know how to do the capstone
like should is just make a new notebook on the website and right my code in cells and run it or what/

#

little help please

west star Apr 9, 2025, 8:29 PM

#

Pls how do I merge teams for my kaggle capstone project
I can't find where to form a team or even merge a team
Pls help 🥲

wraith sparrow Apr 10, 2025, 1:00 AM

#

When u join the competition go to the Teams tab u can see ur team tho team merger deadline has passed maybe so u can't merge anymore

native wren Apr 10, 2025, 6:05 AM

#

graceful axle Can I submit the project after the deadline

NO. 20th april is the deadline for submitting the project after that you cant submit and the project wont be evaluated for the 5 days course

whole ore Apr 10, 2025, 7:36 AM

#

Any support with this error
'404 models/gemini-pro is not found for API version v1beta, or is not supported for generateContent. Call ListModels to see the list of available models and their supported methods.' @here

wraith jay Apr 10, 2025, 11:36 AM

#

When I run these lines in a fresh notebook:

!pip uninstall -qqy jupyterlab # Remove unused packages from Kaggle's base image that conflict
!pip install -U -q "google-genai==1.7.0" # Also tried "google-genai==1.10.0" with same conflicts
!pip install -U -q chromadb

The chromadb install throws a whole bunch of library conflict warnings. Does anybody heve the same issue?
Any solutions?

The messages seem to refer to big-query and tensorflow, which I will not use directly, but I wonder if they will break anything else...
I tried the solutions Google AI Studio and OpenAI gave me, but they didn't work.

crimson helm Apr 10, 2025, 7:08 PM

#

hi, i'm new on kaggle and i have a working notebook i want to submit in a competition. but i can't find how to import it ? the only option i found is to create a new notebook from scratch.

crimson helm Apr 10, 2025, 7:13 PM

#

crimson helm hi, i'm new on kaggle and i have a working notebook i want to submit in a compet...

oh finally found the option "link to a competition" ! 🙂

crimson helm Apr 10, 2025, 7:16 PM

#

crimson helm oh finally found the option "link to a competition" ! 🙂

but it creates a version and delete everything 😢

crimson helm Apr 10, 2025, 7:48 PM

#

crimson helm but it creates a version and delete everything 😢

save a version from Version2 solve my issue. sorry for the noise.

torpid shell Apr 10, 2025, 8:52 PM

#

Hello i have written notebook that exeute a chain via langchain, but the notebook is hanging in kaggle ,while the same is r unnig fine on codespace. am i missing something> does kaggle not allow chains? ihave this setup wtih OpenAI but gemini does not seem to have the same output parser agent = (
{
"input": lambda x: x["input"],
"agent_scratchpad": lambda x: format_to_openai_tool_messages(
x["intermediate_steps"]
),
"chat_history": lambda x: memory.load_memory_variables(x)["chat_history"],
}
| prompt
| llm_with_tools
| OpenAIToolsAgentOutputParser()
)

pseudo kestrel Apr 11, 2025, 1:16 AM

#

Hey i am Jenny, i am trying to work with Kaggle but this is too overwhelming dont know where to start, any help would be highly appreciated.Thanks in advance

unreal mesa Apr 11, 2025, 9:30 AM

#

Hi, have anyone encountered any issue with yfinance api on kaggle notebook before? It was working perfectly fine yesterday, but suddenly it returns empty dataframe

tidal breach Apr 11, 2025, 6:20 PM

#

Hi I have comepleted my CapStone Project everything runs fine but when I go to version it it says it failed... can any one help please

quasi flume Apr 11, 2025, 6:24 PM

#

tidal breach Hi I have comepleted my CapStone Project everything runs fine but when I go to v...

hey can i ask how to do it
like where do you write the code how to submit it

quasi flume Apr 11, 2025, 11:10 PM

#

?? aanyhelp

quasi flume Apr 11, 2025, 11:11 PM

#

tidal breach Hi I have comepleted my CapStone Project everything runs fine but when I go to v...

can you please tell me how you did it

wraith sparrow Apr 12, 2025, 12:53 AM

#

quasi flume hey can i ask how to do it like where do you write the code how to submit it

#5dgai-announcements message

tidal breach Apr 12, 2025, 6:54 PM

#

quasi flume can you please tell me how you did it

how i did what...just build a notebook like we did inthe labs

craggy goblet Apr 12, 2025, 7:58 PM

#

So I am somewhat new to ML and would like to know more about feature engineering and how does one go about doing it

fiery umbra Apr 12, 2025, 10:43 PM

#

pseudo kestrel Hey i am Jenny, i am trying to work with Kaggle but this is too overwhelming don...

Hi Jenny, I'm new as well but I think a good way to get started is to follow along with some YouTube tutorials. There are some good tutorials that show how to join a beginner competition, get some results using the Kaggle notebook/Jupyter notebook and submit those results for a score. Try searching youtube for "Kaggle titanic" or "Kaggle House Pricing" for help with those beginner competitions. A couple of popular channels I've seen are Ken Jee, and Ryan and Matt Data Science, for example.

pseudo kestrel Apr 13, 2025, 6:37 PM

#

fiery umbra Hi Jenny, I'm new as well but I think a good way to get started is to follow alo...

Thank you so much for the help, i really appreciate it

glossy crown Apr 13, 2025, 7:16 PM

#

Are questions asked here answered?

quasi flume Apr 13, 2025, 10:05 PM

#

tidal breach how i did what...just build a notebook like we did inthe labs

thank
i am really sorry if that bothered you
I get kinda lost with instructions

wraith sparrow Apr 14, 2025, 1:28 AM

#

glossy crown Are questions asked here answered?

except the ones asked by u 😉

tawny flame Apr 14, 2025, 7:58 AM

#

Where I can upload my project of 5days genai

ivory robin Apr 14, 2025, 12:58 PM

#

the kaggle web community looks like all spam or is just me?

wraith sparrow Apr 14, 2025, 2:17 PM

#

-# maybe u r new

ivory robin Apr 14, 2025, 2:40 PM

#

Indeed I am

coral rivet Apr 14, 2025, 6:14 PM

#

fiery umbra Hi Jenny, I'm new as well but I think a good way to get started is to follow alo...

Thanks James, it feels like you are the only one answering these questions instead of being dismissive

#

Do you think I could reach out to you if I need help completing mine?

#

This channel is full of eager people but not enough people lending out a hand

#

It’s suffocating knowing that there’s a “community” here, but it feels like a brick wall

fiery umbra Apr 14, 2025, 11:44 PM

#

coral rivet Thanks James, it feels like you are the only one answering these questions inste...

Hi Kobe, I'm only a beginner too, but yes if you have beginner questions feel free to ask and I'll do my best!

stuck owl Apr 16, 2025, 8:35 AM

#

Do anyone have the address of the recommended open source machine learning project on pulse waveform graph analysis? Recently looking for related projects in this area,I want to follow the big brother to do it first to see.

cursive owl Apr 16, 2025, 11:37 AM

#

Hello, I am pretty new to ML and was working on internal project . I wanted to get some understanding about what are best practices to store trained model (.pkl). Do we push them into github with other code or there are other storages which can be used for maintaining model versions ?

craggy crown Apr 16, 2025, 2:43 PM

#

Hey there!
After six months of learning to code (and approximately 4 existential crises), I finally trained and uploaded my first-ever deep computer vision model to Hugging Face. 🤗
Look, I’m still very much in the "wait, this actually worked?!" phase of ML, so I’d really appreciate some honest feedback—especially on the documentation and presentation.
Did I explain things okay? Does it make sense, or does it sound like I wrote it at 3 AM after too much coffee? (…Okay, that last part might be true.)
But seriously, any thoughts are welcome—code, structure, weird mistakes. I’m here to learn and you folks know way more than I do. 🚀
And if it’s terrible… well, at least I tried. 😅
https://huggingface.co/spaces/IncreasingLoss/Wildlife_Animal_Classifier

Wildlife Animal Classifier - a Hugging Face Space by IncreasingLoss

wraith sparrow Apr 16, 2025, 3:20 PM

#

-# next time try streamlit

wheat mirage Apr 17, 2025, 2:54 AM

#

I am doing all my project coding from within Kaggle - are there any risks of losing my work? Is my work private?

wraith sparrow Apr 17, 2025, 3:04 AM

#

-# ofc ofc

hardy basalt Apr 17, 2025, 3:19 AM

#

what do these green and red numbers mean next to my notebook?

#

my notebook is private btw

wraith sparrow Apr 17, 2025, 3:21 AM

#

It means the no. of lines added n removed in ur notebook code

hardy basalt Apr 17, 2025, 3:21 AM

#

ohhhhh

#

gotcha i was so confused

#

thank you

magic tangle Apr 17, 2025, 8:07 PM

#

I registered on time, and attended the course for 5 days during the course, but i am unable to join the capstone competition (https://www.kaggle.com/competitions/gen-ai-intensive-course-capstone-2025q1/overview) from the start. It gives me attached error. All the emails we get are from no-reply-eventsatgoogle@google.com, who do I need to reach out to?

Gen AI Intensive Course Capstone 2025Q1

Capstone project to apply & show what you've learned throughout the 5-Day Gen AI Intensive Course with Google!

hoary lagoon Apr 18, 2025, 5:22 PM

#

magic tangle I registered on time, and attended the course for 5 days during the course, but ...

Do you use Google form provided by the link you gave in ur message to submit the project?

ebon umbra Apr 18, 2025, 7:14 PM

#

Good morning, how can I obtain a credential at the end of the course?

#

I mean Day Gen AI Intensive Course with Google Learn Guide

craggy goblet Apr 18, 2025, 9:10 PM

#

I am working with a dataset for a local contest in which I have to predict price for cars and the columns are brand , model , model year , milage , fuel type , ext colour , int colour , transmission , accident , clean_title how should I go about creating features for it / some good tips for feature enginnering and processing

wraith sparrow Apr 19, 2025, 10:29 AM

#

idts

#

submitting the form should be enough

scenic quest Apr 19, 2025, 5:23 PM

#

I submitted my notebook and added it as competitions notebook, now it shows like this and I cant preview it
without editing I am not able to view the notebook. Anyway to undo that change ?

hallow fern Apr 19, 2025, 6:32 PM

#

Good day guys!
Happy learning!
Anyone that has worked with ransomware using deep learning.
I'm working on a similar project and I'm stucked.

lucid pollen Apr 19, 2025, 9:40 PM

#

Hello, I am unable to join the competetion. It says Permission 'competitions.participate' was denied

#

pastel tide Apr 19, 2025, 11:30 PM

#

Anyone get this?

An error occurred while committing kernel: The kernel source must be less than 1 megabytes in size.

meager marten Apr 20, 2025, 4:08 AM

#

Why can't I merge team?

meager marten Apr 20, 2025, 5:03 PM

#

I need to be verified to merge team in a competition but I cannot re-verify again and Idk why...

#

Bro...

meager marten Apr 21, 2025, 4:27 AM

#

When will this be fixed?

#

Contacted + emailed staff yesterday and still no response

carmine pier Apr 22, 2025, 4:28 PM

#

I attended the 5 day course, completed all notebooks. I submitted the Capstone project via Google form.

But I didn't receive completion certificate or badge.

Is there anything else I need to do.?

verbal crest Apr 22, 2025, 6:19 PM

#

carmine pier I attended the 5 day course, completed all notebooks. I submitted the Capstone p...

Badges took a little while to roll out, it's been added to your profile now

modern wing Apr 22, 2025, 6:59 PM

#

facing this issue in notebooks which I already have and each time I create a new notebook in incognito mode as well. Anyone knows how to fix this?

vernal frost Apr 24, 2025, 12:15 PM

#

How important is it to learn LLMs right now? I understand learning the very basics, but I mean like going deep into libraries and learning how different ones work.

The reason I ask is because I know LLMs are big right now, but you still need to know feature engineering, EDA, non-neural network models, etc, so it's important to not just do neural networks.

Also, I think right now to get a job working with LLMs you need either a grad degree or experience in software engineering or very specifically MLE, which would mean if you don't have that experience then it would be harder to get a job using them.

Double also, I'm asking because it seems like more and more jobs specifically titled "Data Scientist" have in the job descriptions more LLM stuff than not.

stoic terrace Apr 24, 2025, 5:21 PM

#

Hey guys, sorry for the stupid question, I'm doing some research about the best selling games of all times and facing some trouble finding the most adequate dataset. I'm sorry if i am in the wrong channel, but could someone please give me some directions? Thanks!

native rover Apr 25, 2025, 5:13 PM

#

Hey guys. Say I'm training a model in Kaggle and an interactive session is ended. Does my code still run in the background? Or do I have to start again. How do I check?

sullen echo Apr 26, 2025, 4:27 AM

#

need help in time series modeling

data:

Project year Month MoneyLeft
prj1 2024 1 1000
prj1 2024 2 800
prj1 2024 3 400
prj1 2024 4 100
prj2 2022 3 5000
prj2 2022 4 3493
prj2 2022 5 2000
prj2 2022 6 1000
fabrciate this for 10 to 20 projects ,each prorjecr can have month 12 to month 18
for a new project given moneyLeft for 2 or 3 months it should predcit next 4 months moneyLeft
the models like ARIMA ,SARIMA ,EXPONENETIAL SMOOTHING ETC will take only one season or trend,whick means we can train these model only on single project
1 .I have one solution like we can convert this time series problem to regression problem ,we can create lags or windows for three months and can predict for next 4 months , the problem here is it will train on that lags or windows only ,it should also be giving importance for project name (I do not no how to do)

other solution would be we can train the model for each project which is not feasible here in this case
how to do this

pliant silo Apr 28, 2025, 1:56 PM

#

Hi everyone,

I am trying to experiment with a Decision Tree (DT) model in scikit-learn. The task is to find the best model by trying all 64 combinations of the following parameters:

max_depth (2 values)
min_samples_split (2 values)
min_samples_leaf (2 values)
max_features (2 values)
max_leaf_nodes (2 values)
criterion (gini and entropy)

For each parameter combination, I need to:

Perform 10-fold stratified cross-validation.
For each fold, record accuracy, precision, recall, F1-score, and AUC ROC.
Calculate the mean and standard deviation for each metric across the 10 folds.
Repeat for all 64 parameter combinations.
Identify the best combination based on model performance (e.g., best F1 or AUC).
Fit the best model on the entire training set and then evaluate it on a test set.

Im currently searching on the internet on how to do this task particular. I would appreciate it if someone could guide me structure the loops/code for this efficiently? And maybe guiding me through the internet (resources, links, websites, youtube video, or anything)

Thank you

spring cobalt Apr 29, 2025, 11:22 AM

#

How to upload notebook on kaggle from GitHub?

weary gull Apr 29, 2025, 1:51 PM

#

pliant silo Hi everyone, I am trying to experiment with a Decision Tree (DT) model in sciki...

Sounds like an NP Hard problem. Perhaps consider a Gentic Algorithm approach to navigate your Solution Space and optimize your Model.

slim burrow Apr 29, 2025, 11:14 PM

#

Hi, I have a large dataset (extracted from an NC file). I'm trying to put it into an LSTM to make a per-day prediction for 30 days for the target variable. The problem is that the dataset is too large to work with inside a Kaggle environment (the free variant). There are 12 million samples in the dataset. That being said, I don't know what I should do here to reduce the size and also train my model effectively. I've used memory mapped arrays for storage optimization (data is still too large) and batch generation for the model. Any help will be appreciated.

#

The public notebook link:
https://www.kaggle.com/code/kekistan0100/lstm-better

lstm_better

Explore and run machine learning code with Kaggle Notebooks | Using data from ERA5 and GloFAS data for Pakistan (2010-2024)

shut roost Apr 30, 2025, 6:13 PM

#

Does anyone here have an access to a paid AI video generator? I have a prompt and I want to make that video but free models are so weak.

worldly osprey May 1, 2025, 7:10 AM

#

I'm trying to extract information from a PDF that contains tables, columns, and hierarchical structures. I'm having trouble preserving the layout and structure during extraction . trying to build PDF chat bot can any one help me on this

wraith sparrow May 1, 2025, 10:43 AM

#

Is conversational modelling still a thing in 2025 @glossy crown ?

glossy crown May 1, 2025, 3:26 PM

#

wraith sparrow Is [conversational modelling](https://www.parlant.io/docs/quickstart/introductio...

you pinged me how did I not get pinged??

glossy crown May 1, 2025, 3:27 PM

#

wraith sparrow Is [conversational modelling](https://www.parlant.io/docs/quickstart/introductio...

judging by the first few lines, it is a thing in 2025 💀

wraith sparrow May 1, 2025, 3:28 PM

#

glossy crown judging by the first few lines, it is a thing in 2025 💀

fr bro ? i thought its mostly limited to conversations

#

yeah they say they provide better n simpler tool integration too but didnt try yet

wraith sparrow May 1, 2025, 4:17 PM

#

glossy crown judging by the first few lines, it is a thing in 2025 💀

Lemme js delete it then I don't wanna do comp++ in 2025 anymore

grim bane May 1, 2025, 4:30 PM

#

Those who have uploaded their project on Kaggle, even if their name is not in the top 10 competitions and in the Honorable Mention, will they not get a certificate?

swift onyx May 1, 2025, 7:48 PM

#

hi folks. I signed up for the recent 5 day kaggle course via work, but I couldn't do it at the time. I wanted to do it this week, but my company has a block on their gmail accounts, meaning I cannot access the course content. I can access the materials via my personal gmail account but when I try execute the notebook - specifically cells that need to use pip for example, I get a connection error. A bit of reading online says that kaggle used to have an internet connection setting that is now gone. is there anything I can do to go through the course now? Thanks

clear vale May 1, 2025, 8:31 PM

#

Hey guys! So I am trying to learn about transformers how do they work, how they are implemented from scratch. Can you please recommend me some resources where they are explined?

subtle echo May 2, 2025, 8:48 AM

#

Looking forward to know How can I learn ML in 20 days being a Data Engineer

icy steeple May 3, 2025, 9:57 AM

#

I cannot use the Neural Pre-Processing Python (NPPY) https://github.com/Novestars/Neural_Pre_Processing/tree/master

Need help guys

Getting this 'Failed to build surfa' error

GitHub

GitHub - Novestars/Neural_Pre_Processing: Neural Pre Processing is ...

Neural Pre Processing is an end-to-end weakly supervised learning approach for converting raw head MRI images to intensity-normalized, skull-stripped brain in a standard coordinate space - Novestar...

celest sphinx May 4, 2025, 5:37 AM

#

Hello friends!
Finding datasets that hold the data i am looking for is hell

thankfully i found a properly managed website with a databank that had the info im looking for.
But is there some kind of dataset register linked to a LLM where you can just enter the frequency / type of data you are looking for (e.g. Solar radiation on the athmosphere) / NWSE area you need it for / what time period you need available, and it finds multiple datasets that match your needs ?

digital fulcrum May 5, 2025, 4:55 PM

#

Hey guys how can i start python from scratch coding

lime hamlet May 5, 2025, 5:51 PM

#

Hi guys, im new in DATA SCIENCE and AI Field I just understand ML Models regression classification clustering using sklearn then i move to deep learning ann and cnn using keras what you guys suggest me how to successful in this field ?? any road map?

CV and NLP is also another field how to know which one is better for me and im also new on kaggle please guide me

vital palm May 7, 2025, 1:49 AM

#

Hey folks, my team got an error, do you know if the we can get this file? We think this file is missing for the waveform-inversion competition?

fleet kelp May 7, 2025, 2:00 AM

#

Question...in the Data Science community when working with python or another programming language, do you stick to Jupyter notebooks or do you use an IDE along with the Jupyter Notebooks?

pallid pumice May 7, 2025, 1:44 PM

#

why it takes so long?

wraith sparrow May 7, 2025, 2:03 PM

#

-# just get used to it from now

harsh pier May 9, 2025, 2:52 PM

#

Hello! I have been tryin kaggle for the first time these past days, and yesterday i was able to autofill by pressing tab and it´s currently not working anymore for me, any advice? thanks in advance!!

cyan pollen May 10, 2025, 11:34 PM

#

Hi I've just completed a project for my datamining class and I've been thinking of using it as something to show off on my resume and to recruiters. I have a project report documenting my findings along with a colab/jupyter notebook. I wanted to ask what you guys think is a good way to present my project to others. Would it just be a zip file with all of the documents including the ipynb & html of the notebook? Or would you guys just set everything up on a github account. I'm just curious because I think this is one of the bigger projects that I participated the most in and wanted to know what would be the right way to present it. I'm a business administaration major in my uni and my emphasis is in data analytics and I've never really set anything up on github or used it much. Thanks!

wraith sparrow May 11, 2025, 1:10 PM

#

cyan pollen Hi I've just completed a project for my datamining class and I've been thinking ...

To present your data mining project effectively for your resume and recruiters as a business administration major with a data analytics emphasis, use GitHub to showcase your work professionally. It’s the industry standard, accessible, and demonstrates technical skills.

Steps to Present on GitHub

Create a GitHub Account: Sign up at github.com with a professional username.
Set Up a Repository: Create a public repo (e.g., DataMiningProject). Initialize with a README.
Upload Files:
- Jupyter Notebook (.ipynb): Clean, commented, and error-free.
- HTML Export: Download notebook as HTML for non-technical viewers.
- Project Report (PDF): Polished, with clear findings.
- Requirements.txt: List dependencies (e.g., pip freeze > requirements.txt).
- Optional: Include dataset (if small) or link to its source.
Write a README (Markdown):
- Project title and 2-sentence overview.
- Key findings (e.g., “Improved marketing ROI by 15%”).
- Technologies (e.g., Python, Pandas).
- Instructions to run the notebook.
- Links to HTML, PDF, and dataset.
- Your contact info (LinkedIn, email).
Clean Notebook: Add markdown explanations, remove sensitive data, ensure it runs.
Upload: Use GitHub’s web interface to drag and drop files.
Share: Add the repo link to your resume, LinkedIn, and applications.

Why Not a Zip File?

A zip file (with .ipynb, HTML, PDF, requirements.txt) is less professional, harder to access, and doesn’t show GitHub skills. Use it only if required.

Tips

Emphasize business impact (e.g., cost savings) in your report and README.
Test the repo in an incognito browser.
Learn basic GitHub via a 30-minute tutorial.
Post about the project on LinkedIn with the repo link.

This GitHub setup highlights your project’s value and technical skills.

GitHub

GitHub · Build and ship software on a single, collaborative platform

Join the world's most widely adopted, AI-powered developer platform where millions of developers, businesses, and the largest open source community build software that advances humanity.

void maple May 12, 2025, 7:21 AM

#

Hello everyone. I hope you are doing well. Do you know when we will get our certificates for our Kaggle projects, which were submitted last month?

worldly osprey May 14, 2025, 2:26 PM

#

hi can anyone help me to extract the information from the pdf The key challenge is maintaining document structure—particularly tables, hierarchical text, and tables with nested information during extraction and chunking.

gleaming osprey May 14, 2025, 2:40 PM

#

Hello everyone.

#

Now I am working on legal question-answer chat bot project.

#

And I have some questions.

#

Should I stick with Flask for new or would starting with FastAPI or Django be better for scaling later?

#

And are there any open-source legal Tech projects I could use it as a reference?

wraith sparrow May 14, 2025, 3:56 PM

#

gleaming osprey Should I stick with Flask for new or would starting with FastAPI or Django be be...

No need for django u can shift to fastapi for scaling deff

gleaming osprey May 14, 2025, 4:06 PM

#

got it.

lone ruin May 14, 2025, 4:54 PM

#

Can someone help me find a dataset for my project?
must have 1000 rows after removing nulls and dublicates
must work great with 2 models from this list(KNN,SVM,Liner Regression)
and be balanced(without a need to over or under sample)

obsidian bone May 14, 2025, 7:33 PM

#

Does anyone here have an experience with Ray framework? I have issue with running tasks on multiple workers, and I always get this error: The actor is dead because its worker process has died. Worker exit type: SYSTEM_ERROR Worker exit detail: Worker unexpectedly exits with a connection error code 2. End of file. There are some potential root causes. (1) The process is killed by SIGKILL by OOM killer due to high memory usage. (2) ray stop --force is called. (3) The worker is crashed unexpectedly due to SIGSEGV or other unexpected errors.
I'd appreciate any help to fix this issue

native elk May 15, 2025, 6:40 AM

#

how the heck do i light up a dot

#

do i need to do a badge

bronze fable May 15, 2025, 8:21 AM

#

native elk how the heck do i light up a dot

Comment, create post or committment

native elk May 15, 2025, 8:47 AM

#

bronze fable Comment, create post or committment

thx

maiden badger May 15, 2025, 10:58 AM

#

Hi, I’m learning backprop and I do understand that when using ReLU, it is undifferentiable at x = 0 so in practice people usually set it to 0 or 1. I’m wondering what’s the difference between the two options, what’s the pros and cons?

alpine token May 15, 2025, 1:18 PM

#

Hi what is an agentic solution to autogenerate queries for a NoSQL DB?

sterile lantern May 15, 2025, 9:36 PM

#

How do you receive the exclusive Kaggle swag?

graceful axle May 16, 2025, 7:25 AM

#

hey , i m a beginner and trying to sumbit my frist entry
how do i fix this?

#

my csv looks like this

sterile lantern May 16, 2025, 8:37 PM

#

graceful axle my csv looks like this

do you change it to TRUE? boolean. idk

next zenith May 16, 2025, 8:55 PM

#

Is there any way to automatically update particular packages in kaggle everytime i start a session? Something like an init file? or some other way around this, as most of the default packages are very old

lethal rapids May 16, 2025, 11:05 PM

#

Hi, I am unable to verify my kaggle account using persona. It failed and now says to contact support but i haven't received a response from them.

violet nymph May 17, 2025, 9:45 AM

#

Hi, I’ve created a notebook that I’d like to share, but I don’t want to break any rules about spam or self-promotion. I’ve learned a lot from other people’s notebooks, and I found many of them through Discussions.

What’s the right way — or time or place — to share a notebook? For example, if it’s related to a competition, is it more acceptable to post it in that competition’s Discussions rather than in the general ones?

I’ve already received a warning, and now I’m honestly afraid to even mention it.

wraith sparrow May 17, 2025, 12:44 PM

#

violet nymph Hi, I’ve created a notebook that I’d like to share, but I don’t want to break an...

Here https://discord.com/channels/1101210829807956100/1130784683907612764

buoyant dragon May 17, 2025, 6:35 PM

#

Hi, I am a user from Iran and I am interested to participate in competitions of Kaggle, the problem is that almost all competitions has the rule not being a resident of countries like Iran. What is the solution for this? If I have a teammate from other countries, is it allowable to participate in competitions?

muted gorge May 18, 2025, 5:36 PM

#

Hello, I've created a notebook but why it didn't show up at the list of notebook?

icy solstice May 19, 2025, 2:12 AM

#

muted gorge Hello, I've created a notebook but why it didn't show up at the list of notebook...

i am finding the same issue.

proven osprey May 19, 2025, 1:49 PM

#

I had a doubt how does the score works in kaggle even if I got better accuracy from previous submission but my score was less compared to previous submission?

austere pollen May 19, 2025, 8:47 PM

#

Hello

#

Guys there's one problems in my visualization data, but I can't find

rn_image_picker_lib_temp_f71b5f4b-0fd9-4c7b-8712-74268f5c4afc.jpg

spiral heart May 20, 2025, 5:39 AM

#

Hi everyone,

I'm working on a product classifier for ecommerce listings, and I'm looking for advice on the best way to extract specific attributes/features from product titles, such as the number of doors in a wardrobe.

For example, I have titles like:

🟢 "BRAND X Engineered Wood 3 Door Wardrobe for Clothes, Cupboard Wooden Almirah for Bedroom, Multi Utility Wardrobe with Hanger Rod Lock and Handles,1 Year Warranty, Columbian Walnut Finish"

🔵 "BRAND X Engineered Wood 5 Door Wardrobe for Clothes, Cupboard Wooden Almirah for Bedroom, Multi Utility Wardrobe with Hanger Rod Lock and Handles,1 Year Warranty, Columbian Walnut Finish"

I need to design a logic or model that can correctly differentiate between these products based on the number of doors (in this case, 3 Door vs 5 Door).

I'm considering approaches like:

Regex-based rule extraction (e.g., extracting (\d+)\s+door)

Using a tokenizer + keyword attention model

Fine-tuning a small transformer model to extract structured attributes

Dependency parsing to associate numerals with the right product feature

Has anyone tackled a similar problem? I'd love to hear:

What worked for you?

Would you recommend a rule-based, ML-based, or hybrid approach?

How do you handle generalization to other attributes like material, color, or dimensions?

Thanks in advance! 🙏

wraith sparrow May 21, 2025, 2:56 AM

#

spiral heart Hi everyone, I'm working on a product classifier for ecommerce listings, and I'...

To extract attributes like the number of doors from e-commerce product titles (e.g., "BRAND X Engineered Wood 3 Door Wardrobe"), use a hybrid approach:

Regex Baseline: Use patterns like (\d+)\s+door(s)? to extract numerical attributes (e.g., "3" or "5" doors). Fast and reliable for consistent titles.
Fine-Tuned Transformer: Fine-tune a model like DistilBERT to handle varied formats (e.g., "three door") and extract other attributes (material, color). Requires labeled data but generalizes well.
Hybrid Logic: Apply regex first; use transformer for cases where regex fails or for complex attributes.

Generalization: Extend regex patterns (e.g., (wood|metal) for material, (walnut|black) for color) and fine-tune the transformer on multi-label tasks to extract multiple attributes.

Why Hybrid? Combines regex’s speed and simplicity with transformer’s robustness for diverse titles. Start with regex, then add transformer as data and complexity grow.

#

-# U can also try scikit-llm btw

spiral heart May 21, 2025, 5:00 AM

#

thanks for the inputs @wraith sparrow

heavy lark May 21, 2025, 5:26 PM

#

Howdy, im new to everything, coding with R and Kaggle.

I'm trying to upload my code, but im running into errors. This is the code I would run on my computer since the files are stored here. I'm assuming kaggle doesnt have access to my computer files. How would i set up code for it to read the datasets i uploaded to kaggle?

heavy lark May 21, 2025, 5:34 PM

#

heavy lark Howdy, im new to everything, coding with R and Kaggle. I'm trying to upload my ...

Update: Chat GPT helped me
Answer:
divvy_trips_2019_q1 <- read.csv("/kaggle/input/cyclistic-bike-share-case-study-datasets/divvy_trips_2019_q1.csv")
divvy_trips_2020_q1 <- read.csv("/kaggle/input/cyclistic-bike-share-case-study-datasets/divvy_trips_2020_q1.csv")

livid bison May 23, 2025, 12:42 AM

#

Hey, I'm getting an error that there isn't a submission.csv file in the DRW crypto market competition. I'm pretty sure my script is producing a submission.csv file, but whenever I try to run it in the notebook it doesn't work because it crashes in the middle saying the kernel died. Is it possible that the error is coming because my script is too CPU intensive and crashes before it creates the file? It doesn't seem to be possible to look at the logs...

vast gull May 27, 2025, 3:21 PM

#

What activation function should I use?

#

I'm thinking GEGLU.
But not sure yet.

woeful basalt May 27, 2025, 5:42 PM

#

Hi does anyone used Wav2Lip model before? I got some questions to ask

fast axle May 28, 2025, 6:02 AM

#

We have more questions than we have answers chat. Back to the drawing board

rotund pond May 28, 2025, 1:14 PM

#

hi does anyone know why when i try to add a notebook to my collection, my collection folder name doesn't appear in the list of options?

ivory robin May 29, 2025, 5:45 AM

#

Hi! Has anyone real experience with ML applied to corporate employee databases? I’m looking for realistic project ideas 🤔

wraith sparrow May 29, 2025, 8:36 AM

#

ivory robin Hi! Has anyone real experience with ML applied to corporate employee databases? ...

every idea u could possibly think of its already deployed in action

valid veldt May 29, 2025, 11:42 AM

#

Hi. I'm working on a project which is to work in rural conditions where there are connectivity issues and stuffs. I was wondering to talk to someone who has worked on edge ai to help me answer some questions

ivory robin May 29, 2025, 3:08 PM

#

wraith sparrow every idea u could possibly think of its already deployed in action

Not trying to invent anything. Just seeking business problems that can be solved reallistically givem its a dataset of 500 employees.

#

I’m studying data science and trying to apply my learnings on the company I work for. Although my job title is not about that

fast axle May 30, 2025, 4:03 AM

#

wraith sparrow every idea u could possibly think of its already deployed in action

Not yet. I still have some internal review with the aliens before I ship to TOI-700

fleet kelp May 30, 2025, 7:00 PM

#

bold sedge May 30, 2025, 7:25 PM

#

Are there competitions where I don't need the face verification?

hushed scarab May 31, 2025, 11:12 AM

#

Hey everyone, I'm looking for good books on Generative AI starting from scratch. It would be great if they are available in PDF format and for free. Any recommendations?

rancid pine Jun 1, 2025, 4:18 PM

#

Hi chat, Im a new learner on kaggle and im trying to make a notebook submission for the titanic survivor prediction competition. But even though my output file is created and visible, the competion wont accept the notebook when I click "Create Submission"

#

any idea why? I can send screenshots if necessary

harsh berry Jun 1, 2025, 5:02 PM

#

testing post

trim oxide Jun 2, 2025, 4:17 AM

#

I am trying to build a simple Streamlit app for question answering on a given PDF. What approach should I use? Which open-source LLM would be suitable? What model should I use for embeddings, and should I implement RAG or another method?

raven pike Jun 2, 2025, 8:53 AM

#

#❓┊ask-a-question hello everyone,i wanted to know the difference between Gradient Descent, Maximum Likelihood Estimation (MLE), and Ordinary Least Squares (OLS) wrt linear regression .If anyone know of some good article on it,please tell

normal fiber Jun 2, 2025, 12:25 PM

#

#❓┊ask-a-question Hi, guys. I'm a student who is studying machine learning by myself. I recently start studying, so please understand me if my question sounds a bit dumb.

Recently, I'm struggling of dealing with feature engineering. I've heard that feature engineer requires domain knowledgement and it is one of the most crucial part in machine learning. So, I was thinking that, what if I just, make feature as a polynomial feature(like 2, or 3. not too much because of the overfitting) and then, eliminate some unnecessary features by using Variane Threshold or Lasso. Do you guys think that it can work well in any situation?

I want some advice for feature engineering. Thank you, senpai! (greeting from Japan)

craggy goblet Jun 2, 2025, 4:03 PM

#

#❓┊ask-a-question hello I am a student learning machine learning and I have 2 questions
Is there any good roadmap I can follow
What are some good books for ML and AI

wraith sparrow Jun 2, 2025, 4:04 PM

#

craggy goblet <#1129507816697241822> hello I am a student learning machine learning and I hav...

https://roadmap.sh

roadmap.sh

Developer Roadmaps - roadmap.sh

Community driven roadmaps, articles and guides for developers to grow in their career.

fleet kelp Jun 2, 2025, 7:00 PM

#

fleet kelp

poll_question_text

Do you think using Intelisense is a form of cheating yourself?

victor_answer_votes

3

total_votes

5

victor_answer_id

2

victor_answer_text

No...if you are using the suggestions to help learn

victor_answer_emoji_name

🙃

wraith sparrow Jun 4, 2025, 5:55 AM

#

guys my friend needed an help with a survey kinda from students in states
https://buildpad.io/research/63YT5Zt
(kindly share with ur friends studying in states)

wise pewter Jun 4, 2025, 9:10 PM

#

hi everyone, im taking introduction to ml and dont understand a line of code the underfitting and overfitting part.
I was wondering why do they use the whole dataset to fit the DecisionTreeRegressor model after tuned with the best tree size, instead of training one.

alpine locust Jun 5, 2025, 8:08 AM

#

Hi I am a data scientist and machine learning engineer. I have worked on many projects on Kaggle and I am now a Kaggle Notebooks Expert. I am looking to work on real world projects. can anyone guide me on where to find them?

proven osprey Jun 5, 2025, 1:15 PM

#

wise pewter hi everyone, im taking introduction to ml and dont understand a line of code the...

Because once you've chosen the best tree size, there's no need to hold out part of the data anymore. You've already decided on the model's best parameters so now you want to give the model as much data as possible to learn .

wise pewter Jun 5, 2025, 7:24 PM

#

proven osprey Because once you've chosen the best tree size, there's no need to hold out part ...

Thank you 🙏, I did some searchings and they all said avoiding using a whole dataset to train the model in any case

gilded drift Jun 6, 2025, 9:13 PM

#

craggy goblet <#1129507816697241822> hello I am a student learning machine learning and I hav...

If you are just a beginner, the you tube videos by Statquest Josh Starmmer and videos by Louis Serrano can provide you a head start. Simple visualization and short videos for understanding the subject in the shortest possible time in my opinion. If you require a ,"no code" visualization flow tools to experiment with data and various models then you can use open-source Orange 3.8x version along with various addons provided. Very easy to learn with a number of videos tutorials. Other open source tools I have experimented with are Weka, Knime.

quiet lake Jun 7, 2025, 4:05 AM

#

Dear all friends, does anyone know where I can start as a data scientist intern, i completed the course.. please suggest me , or any guidance, you little help can be a big help for someone.. please consider

ivory robin Jun 7, 2025, 11:03 AM

#

Do you folks have an online portfolio? What tech do you use to set it up?

ivory robin Jun 7, 2025, 1:18 PM

#

do I look like an angel for any specific reason? 😛

sharp herald Jun 7, 2025, 3:23 PM

#

Hello everyone! I want to start reading the ML/DS papers. Where can i do that and what kind or specific papers i should read firsly because i am beginer in the ML field 😀

cunning hearth Jun 7, 2025, 5:57 PM

#

hey i need some debugging help

#

trying to access the higs-boson dataset with this code

data_dir = KaggleDatasets().get_gcs_path('higgs-boson')
train_files = tf.io.gfile.glob(os.path.join(data_dir, "training", "*.tfrecord"))
valid_files = tf.io.gfile.glob(os.path.join(data_dir, "validation", "*.tfrecord"))

print("Found", len(train_files), "train TFRecords")
print("Found", len(valid_files), "valid TFRecords")

however, it can't seem to access this db
output:

get_gcs_path is not required on TPU VMs which can directly use Kaggle datasets, using path: /kaggle/input/higgs-boson

Found 0 train TFRecords
Found 0 valid TFRecords

Detecting the Higgs Boson With TPUs

Explore and run machine learning code with Kaggle Notebooks | Using data from Higgs Boson

#

i fixed the error,
turns out, the new name is higgs-boson-dataset, and there's no validation/training files. you have to make ur own, like this

data_dir = KaggleDatasets().get_gcs_path('higgs-boson-dataset')
print(tf.io.gfile.listdir(data_dir))

all_files = tf.io.gfile.glob(os.path.join(data_dir, "*.tfrecord"))
all_files = sorted(all_files)        
random.shuffle(all_files)
split = int(0.8 * len(all_files))
train_files = all_files[:split]
valid_files = all_files[split:]

weak yew Jun 8, 2025, 3:53 PM

#

I have developed an app using Gemini API, now I wish to make it a web app and host for free on a server. Can anyone guide me on this?
I have designed the project in notebook but I want to make use of other web developing languages to build an interactive single-site website.

plain verge Jun 8, 2025, 9:10 PM

#

Hi all, this is my first post here!

I'm working on a regression problem where the target y is a function of four features: X1 and X2 are continuous, and N1 and N2 are integer "family/group" IDs. For each family (N1, N2), y is a function of X1 and X2, and y is continuous. Each unique (N1, N2) defines a unique y = f(X1, X2), but calculating y without ML is very expensive and requires a lot of supercomputer time. So, we want to use machine learning to predict y as accurately as possible, using as few training examples as possible. Ideally, we'd use just a handful of data points from some (N1, N2) families (and their X1, X2 values) and then be able to predict y for new (N1, N2) families reliably.

My current pipeline holds out all data for one (N1, N2) pair to simulate predicting for a new family, trains XGBoost on the rest, and evaluates. With lots of (N1, N2) pairs in training, results are excellent (R^2 about 0.99). But with only a handful, generalization to new pairs is poor.

What feature engineering or model strategies would help generalization to unseen families?

Are embeddings, one-hot, or other encodings of N1/N2 better?
Would neural nets (Bayesian, feed-forward, etc.) help vs. tree models in this case? (I'm more comfortable with trees.)
Any best practices for uncertainty estimation?
References to similar few-shot tabular regression problems?

Any advice or references would be greatly appreciated!!!

graceful axle Jun 9, 2025, 10:08 AM

#

Hey! Is there anyone participating metal kaggle hackathon competition?

cyan oar Jun 10, 2025, 2:40 PM

#

I have a question regarding llms.

I'm a bit confused about which path to choose for the long term, as both have great potential for work. On one hand, building models from scratch, including pretraining, optimization, and evaluation, offers a lot to learn. On the other hand, working with RAG, vector databases, agents, LangChain, and Hugging Face also seems promising. I want to focus deeply and excel in whichever path I choose.
According to the current job market and the oppurtunities all over the world where should I go?

stoic elm Jun 10, 2025, 2:43 PM

#

Hey! @everyone I have started learning machine learning recently. If anyone expert in this field can guide me, it would be a great help to me. I want to know how to learn it and where to learn it as there are tons of resource and i am unable to select which one should i pursue. My end goal is to be a data scientist. ( dont mind my grammar please)

cyan oar Jun 10, 2025, 2:51 PM

#

stoic elm Hey! @everyone I have started learning machine learning recently. If anyone expe...

You should see some videos about data science roadmap.If you have the proper prerequisites You can start the '100 days of ml' playlist by campusx and continue if you like it. Then you can move to deep learning and so on

stoic elm Jun 10, 2025, 2:53 PM

#

cyan oar You should see some videos about data science roadmap.If you have the proper pre...

I have covered all the prerequisites required... do i have to learn abouts scikit library or any other Library beforehand or i can learn within the course which u suggested?

#

Currently i am learning from krish Naik and cs229 by Stanford

cyan oar Jun 10, 2025, 2:59 PM

#

stoic elm I have covered all the prerequisites required... do i have to learn abouts sciki...

Scikit learn itself falls under machine learning. If already you have the other prerequisites then you should start as soon as possible

fathom lava Jun 11, 2025, 6:19 PM

#

I tried everything I could to debug it. Is there anything else I can do? Any help would be greatly appreciated.

strong jolt Jun 12, 2025, 11:06 AM

#

Is it common to get this type of speed while adding input from a datset (that was created from the output of a notebook)?

#

The dataset if of 4GB T_t

devout pilot Jun 12, 2025, 2:34 PM

#

lmao

trim atlas Jun 12, 2025, 6:41 PM

#

Was running a huge notebook that it lagged my computer when opening it, after that I cant run it again and even after deleting it I cant create a new notebook or importing a data source or run my other notebooks or do anything else. Thought it was my fault and I started to panic at 1AM, hell I even linked my Kaggle with Discord just to find some support here, and after 30 minute of trying everything I could I saw an official message about site-wide availability issues, so I'm not the only one experiencing this right?

stone shadow Jun 13, 2025, 10:22 AM

#

I wrote a retrieval system using UV with a lot of custom models, but it has to be offline for the code competition. I am considering whether there are other solutions besides the package environment. Wheels will encounter a specific package build failure error, and there are several pitfalls.

In the end, I did this (Thus it work, but time cost heavily)

Upload all models and the File as a dataset
Build a .venv in an Internet-enabled notebook
Extract the .venv and run the test data in the competition submission.

peak skiff Jun 13, 2025, 11:12 AM

#

hi guys. i need help, trying to train a ml model for melonoma classification. used isic dataset w 9 classes but it was highly imbalanced. i tried data augmentation and several different ways to reduce overfitting but i can't get an accuracy more than 60%. i need at least 85% accuracy for my project. i have short time left.

using free google colab rn and can't use any dataset more than 8000 datas. my ram is 8gb. i want to try binary classification and working on it rn. do u have any advices? this is really important for me.

thin raptor Jun 13, 2025, 1:16 PM

#

peak skiff hi guys. i need help, trying to train a ml model for melonoma classification. us...

It could be a case of underrepresented classes in the training data?

In which case you might want to partition the larger classes then pair them with the underrepresented classes, or modify the loss function to add weights to particular classes

cunning sundial Jun 14, 2025, 5:22 PM

#

Is it Legal to Share Public Instagram Profile Data as a Dataset on Kaggle?

short haven Jun 15, 2025, 5:30 AM

#

i am having a problem in power query .
will anyone help me ?
problem is my order date in text format there are no null and error value but when i tried to convert to text to date value then its create a lot of error value .
i tried to take help from chatgpt but it wasnt work .

craggy goblet Jun 15, 2025, 8:14 PM

#

I am somewhat of a beginner I have submitted in like 3 to 4 competitions (like titanic and house prices )with like mediocre rank and accuracy my questions are

what sorts of compititions should I participate in
how do I get better or learn how to get better

dapper yoke Jun 16, 2025, 12:05 AM

#

craggy goblet I am somewhat of a beginner I have submitted in like 3 to 4 competitions (like t...

This will entirely depend on what field of AI/ML you want to go into. Genai, computer vision, etc. For me, I am interested in generative AI, so I generally look for competitions that had their primary focus on generative AI, like the Gemma 2 competition. To get better, just keep on learning, and stay very open to new opportunities or chances to gain more knowledge. That is how I got better, at least.

craggy goblet Jun 16, 2025, 2:01 PM

#

Like for basic Machine learning with say titanic , house prices etc type of compitition with predicting RMSE,MAE,MSE type evaluation metrics and sort of datasets where you have to regression or classify based on numerical or categorical columns basically I am a beginner so I am trying to avoid all the time series , NLPs , computer vision , NNs , generative AI or anything along those lines which is like a bit deeper stuff till my skills can be in some sense considered intermediate to some degree like I know what I am doing sort of thing in like normal machine learning contests ( think titanic houseprices etc but not limited to , for some examples as to the type of compititions I am talking about )
Here are a few questions I have

I have finished the statquest Machine learning playlist + learned about data preprocessing steps and know how to use pandas ,matplotlib, seaborn ,scikit , numpy from certain videos as well as know about hypertuning . is there anything else theory wise I would need to know for being able to perform well in compititions
what differentiates a good score from bad one is it knowing when and where to apply well-known preprocessing steps , knowing which features to engineer and which model to select or does it require some other knowledge basically do I need to learn more theory wise to perform better or do I need learn what to do when with what I already know theory wise in application
how do I decide and when to sort of specialize in a certain field of AIML and how to get info about these fields

uneven marsh Jun 16, 2025, 10:02 PM

#

Hi there, I am new to Kaggle, and mainly AI. I am looking forward to learn how to code AI. I hope that using this knowledge, I can make my own AI models and apply it to things I do in free time: robotics, drones, things of that nature. However, when I try understand information from the internet, I get lost immediately. I really want to get a solid grasp on this aspect of programming. I am certified in Java as I recently passed a Certiport IT Specialist certification for Java. I do know some python too, as I have taken such a course last year in school. But I still am confused.

#

#❓┊ask-a-question

finite bridge Jun 18, 2025, 3:09 PM

#

hi kagglers,

when you guys first enter competitions, do you have to do a lot of research to gain domain knowledge? in past competitions, the way winners code all show that they probably had a really good understanding of the comp's domain. do they just know this because its their profession or do they spend a lot of time researching?

bleak anvil Jun 20, 2025, 4:26 AM

#

finite bridge hi kagglers, when you guys first enter competitions, do you have to do a lot of...

In my particular case, I was pulled in order to do the backend for a framework. So, in this case... I would say most people have a framework in place and they create modules that apply to that particular problem.

That's my observation being here for less than 2 months.

nova rain Jun 20, 2025, 6:06 AM

#

Hey yall Im Prana. New to python. I have a background in r-studio. There is big knowledge gap even though I have a masters. I’ll give it month or two and I should be engaging more on the rooms. For you intermediate and experts are there any discord groups your recommend a beginner like me to join or any type of platforms. I just need people to bounce ideas back in forth on topics such as LLMS used and things they learned, no judgement ignoring zone. You can dm me.

bleak anvil Jun 20, 2025, 6:13 AM

#

Hey @nova rain 👋

I only have a bachelor's in physics and pure math—ended up dropping out of my master’s in pure math due to a mix of financial and personal reasons. Honestly though, degrees and titles only go so far. They're supposed to be keys, but it’s really the network and the collaborative energy that makes things move.

What helped me most was bouncing ideas around with others and building multi-agent LLM constructs together. A lot of gaps in my understanding got patched just through collective exploration—kind of like debugging reality with friends. 😄

What kinds of ideas or frameworks have been catching your attention lately?

fresh forge Jun 20, 2025, 7:24 AM

#

hi, i wanna select the good feature has a relation on Risk level
but when i use a chi2 get result 10 columns has relation on risk level but when check it on corr
i found there is not any relation on risk level

#

check this https://www.kaggle.com/code/aymenezzalarb/project-management-risk-raw

project-management-risk-raw

Explore and run machine learning code with Kaggle Notebooks | Using data from Project management Risk Raw

rugged juniper Jun 20, 2025, 1:11 PM

#

vast gull What activation function should I use?

activation function? let me know detail

rugged juniper Jun 20, 2025, 1:18 PM

#

obsidian bone Does anyone here have an experience with Ray framework? I have issue with runnin...

Hi, This generally means the worker process crashed.
Check for out of Memory

rugged juniper Jun 20, 2025, 1:22 PM

#

fleet kelp Question...in the Data Science community when working with python or another pro...

You can use Jupyter Notebooks for experimentation and communication.
but Use an IDE for writing real code, testing, and maintaining larger projects

hearty roost Jun 20, 2025, 4:18 PM

#

Hello! Can anybody helping me with this question? import dask.dataframe as dd
from dask import delayed
from fastparquet import ParquetFile
import glob

files = glob.glob('/kaggle/input/hms-harmful-brain-activity-classification/train_eegs/*.parquet')

@delayed
def load_chunk(path):
return ParquetFile(path).to_pandas()

df_train = dd.from_delayed([load_chunk(f)for f in files])

df_train.compute()

#

I was working on this code, but encounter error:

📎 message.txt

gritty nymph Jun 21, 2025, 2:41 PM

#

how could i change my name?
it would be better if my displayed name is in english or at least alphabet

tired venture Jun 21, 2025, 2:54 PM

#

#

pw is correct,but canr login kaggle api on azure notebook,why?

raven pike Jun 21, 2025, 6:15 PM

#

has anyone done mnist using alexnet ?i need help regarding it

cursive owl Jun 22, 2025, 4:25 AM

#

Hi , i am working on creating a intent classification along with topic modelling, wanted some directions on how we can use rsics dataset. And has anyone experience with annotating data set https://www.kaggle.com/datasets/veeralakrishna/relational-strategies-in-customer-servicersics/data

Relational Strategies in Customer Service(RSiCS)

A dataset of travel-related customer service data from four sources.

rigid compass Jun 22, 2025, 3:17 PM

#

Hey i want to learn machine learning from where i can learn. Any suggestion

ivory wasp Jun 23, 2025, 3:45 PM

#

Been having some problems with the submission of the Prediction interval competition II: House price eventhough my submission is being generated at the end when i try to save my version it shows that there is no output can anyone help me out why is it like this

primal palm Jun 24, 2025, 7:24 AM

#

Hello everyone,

I’m a fourth-year AI major at Damascus University.

Over the past three months, I’ve been following the AI learning path on DeepLearning.AI. I’ve already completed the first course in the specialization and am currently working through the second.

As the concepts have grown deeper and more varied, I’ve realized that I’m not yet flexible enough with coding: I often struggle to understand existing code and to write my own. This stems from the limited problem-solving experience I gained during my earlier years at university.

I’m now looking for ways to strengthen my coding knowledge and practice while I continue these courses.

Can anyone help with that?

sleek frigate Jun 24, 2025, 11:26 AM

#

can someone help me go through a research paper? Its the 2023 paper on BitLinear

regal rock Jun 25, 2025, 4:10 PM

#

So I am an high schooler junior (11th grade) who has just entered summer. I happen to have some free time and I was wondering to ask how I should start and go about learning ML and start actually being competetive in competitions. Since you all are mostly professionals I knew your advice will be of great help! So what should I do, and how should I start?

wraith sparrow Jun 25, 2025, 5:33 PM

#

regal rock So I am an high schooler junior (11th grade) who has just entered summer. I happ...

https://docs.google.com/forms/d/e/1FAIpQLScPHwHDoV1OJl3e1-XiA_OoVoIBMemV-148PM1X2wpgNhUI-Q/viewform

wraith sparrow Jun 25, 2025, 5:35 PM

#

regal rock So I am an high schooler junior (11th grade) who has just entered summer. I happ...

U missed inaio this year already
-# @glossy crown runner up boy dm him the groups link

crystal wing Jun 25, 2025, 7:04 PM

#

Has anyone used the Microsoft Azure ML stack for building and running models? We will be adopting it and I'm curious about pitfalls and limitations we should be aware of or avoid. https://azure.microsoft.com/en-us/products/machine-learning/

Azure Machine Learning - ML as a Service | Microsoft Azure

Build machine learning models in a simplified way with machine learning platforms from Azure. Machine learning as a service increases accessibility and efficiency.

regal rock Jun 26, 2025, 3:44 AM

#

wraith sparrow U missed inaio this year already -# <@763292785293393920> runner up boy dm him t...

So what are these sessions like?

#

I was thinking if there are any specific resources or books

#

But course are also fine if they help me reach my goals

wraith sparrow Jun 26, 2025, 5:00 AM

#

regal rock So what are these sessions like?

Contact @glossy crown

frigid shadow Jun 26, 2025, 9:34 AM

#

🔬 AI Hardware Research - Need Your Input!
Hey everyone! I'm doing research on AI/ML hardware accessibility in India and would love to get insights from this community.
Quick backstory: GPU prices in India are brutal, cloud costs add up fast, and there might be a gap for more affordable AI-focused solutions.
3-min survey about:

Your current setup and pain points
Budget constraints and preferences
What you'd want in an ideal AI GPU
Cloud vs local experiences

Drop your thoughts in the survey: https://forms.gle/UHGh1kzK1p9uiJSb9
I'll share the results here once I have enough responses. Your input could help identify real solutions for our community! 🙏
Feel free to share with other AI folks who might have insight

Google Docs

AI Hardware Needs Survey - India

We're researching the need for affordable GPU solutions for AI/ML work in India. Your insights will help us understand current challenges and potential solutions. This survey takes 3-5 minutes and your responses are anonymous. We'll share the findings publicly to benefit the AI community.

hallow herald Jun 26, 2025, 7:30 PM

#

Are we allowed to vibe code /chatgpt the code for hackathon projects? Do judges typically care how the code was written?

#

please ping me

haughty helm Jun 27, 2025, 8:05 AM

#

I am a BSCS student, and I’m starting my Final Year Project (FYP), which I need to complete within the next 1.5 years. I’m working alone on this project and would like to build something related to Artificial Intelligence or Machine Learning. Although I’m a beginner in AI/ML, I’m a fast learner and actively working to improve my skills in this area. I would be very grateful if you could suggest some suitable FYP ideas along with a brief description of each.

rugged juniper Jun 27, 2025, 8:18 AM

#

haughty helm I am a BSCS student, and I’m starting my Final Year Project (FYP), which I need ...

let me know anytime if you need my help

haughty helm Jun 27, 2025, 8:55 AM

#

rugged juniper let me know anytime if you need my help

Ideas for my fyp

flint gust Jun 27, 2025, 1:42 PM

#

haughty helm I am a BSCS student, and I’m starting my Final Year Project (FYP), which I need ...

i'd just go around on web, searching for anything that interests me. I'd try to find the info that i'd personally want to know. If i was you, i'd never start doing any project from a suggestion of some random person from discord. It'd a big deal, and you'd better approach it creatively and proactively. It's ok to not know, but try to work out your system of inspiration - a documentary, a book. There are definetely tons of questions that you could answer

haughty helm Jun 27, 2025, 2:14 PM

#

flint gust i'd just go around on web, searching for anything that interests me. I'd try to ...

That actually makes a lot of sense. I agree—starting something just because someone else suggests it can feel empty. I think I need to spend more time exploring what genuinely sparks my curiosity and build from there. Thanks for the insight!

pure raft Jun 27, 2025, 2:54 PM

#

does anybody know if for the new kaggle gemma competition I need to have the project open source? or it should only be available for the judge to review?

glossy crown Jun 28, 2025, 3:22 AM

#

regal rock So what are these sessions like?

hellooo!!, which country are you from 😄

digital bobcat Jun 29, 2025, 4:28 AM

#

Hi all, after completing some theories of ML and doing some toy projects on data science , I applied for internships. Recently I got a callback and they have given me a take home assignment for a predictive modelling task. Now the thing is the dataset they provided is large like too large for me , and I haven't worked in this big dataset like this before and also I use google colab. The shape of dataset is (167020, 217).
What I need is Can anyone help me in how to approach this problem and solve this. All I am asking is that you take a look at the dataset and provide me with suggestions on what all to do step by step , I will do it myself. If you are willing to help , lets connect.
Thank You

dawn ravine Jun 30, 2025, 3:08 PM

#

How much ram & vram does GPU T4 x2 have in total?

#

does GPU p100 have more?

#

Or tpu vm v3-8

analog hornet Jul 1, 2025, 11:32 PM

#

i started learning ML, and I did the Titanic competition already. Is there any recommendation that to do next? I'm stilla beginner, so I want to do an easy one.

naive pike Jul 2, 2025, 4:10 AM

#

analog hornet i started learning ML, and I did the Titanic competition already. Is there any ...

try housing price

#

pridiction

analog hornet Jul 2, 2025, 4:54 AM

#

naive pike pridiction

Thanks

dull tapir Jul 3, 2025, 5:21 AM

#

hello

#

can sb help me

#

what should i do to resolve this

naive pike Jul 3, 2025, 8:08 AM

#

dull tapir what should i do to resolve this

i think its network issue or enviroment issue

#

try it on jupyter notebook

dull tapir Jul 3, 2025, 10:41 AM

#

naive pike i think its network issue or enviroment issue

is there a solution if i want to use kaggle

dull tapir Jul 3, 2025, 11:05 AM

#

ok i found the problem

#

the !pip install torch line of my code

#

didnt run properly

rustic carbon Jul 3, 2025, 4:40 PM

#

Is an 80% accuracy for a random forest clasifier good?

brittle cave Jul 3, 2025, 8:43 PM

#

Hi guys, probably a stupid question, if i import a model from huggingface then can I use it offline ?

naive pike Jul 4, 2025, 3:46 AM

#

brittle cave Hi guys, probably a stupid question, if i import a model from huggingface then ...

Yes you can

naive pike Jul 4, 2025, 3:48 AM

#

rustic carbon Is an 80% accuracy for a random forest clasifier good?

if it beats baseline then yes

wraith sparrow Jul 5, 2025, 6:56 AM

#

glossy crown hellooo!!, which country are you from 😄

The country where u spent past 18 years

timber turtle Jul 5, 2025, 8:21 AM

#

Hey guys I'm currently working in OCR, and i have a question

#

#

I couldn't ectract "Good Morning" or any text from 1st image even though i can extract text from second image with good accuracy

#

I'm currently using tesseract and do you why this happens?

#

Can you suggest some other approaches so as to extract "Good Morning" from the first image?

sonic citrus Jul 5, 2025, 2:33 PM

#

Hey guys. I know it's 3 months late, but I wanted to finish the 5-day Gen AI Intensive course. My question is whether it is possible to gain the certificate even though I am late? because the last email from Google Event said, 'The badges and certificates will be added to your profile by the end of April 2025.'
It means a lot if anyone can answer?

wraith sparrow Jul 6, 2025, 2:48 AM

#

sonic citrus Hey guys. I know it's 3 months late, but I wanted to finish the 5-day Gen AI Int...

Certificates not that important really

short haven Jul 6, 2025, 11:32 AM

#

look at the photo which i attached in post .there are two column of date and time now i wanna extract the difference between two days like 3 days 16 hour .
now how i can do that in excel power query .
i tried to get help from chatgpt but it wasn't works .

mystic solar Jul 6, 2025, 12:46 PM

#

Hey i am student and we have a class in data analytics, i dont realy get how i can improve my accuray in logistiic regresiion, rnadom forest, knn etc..
can someone help me there pls

vast yarrow Jul 9, 2025, 1:46 AM

#

Hello.
I am having some trouble with my local experiment and would like some help.
I want to analyze the transcription of recorded conversation audio data, so could you recommend a model for that?
Considering that this involves recording conversations, I would like to identify the speakers as well.
Can you suggest a model for various experiments?

#

I would like to gather opinions from people who have participated in competitions or analyzed audio data that includes conversations in the past. Also, if there is any recommended external data for training, please let me know.

deep tinsel Jul 10, 2025, 6:56 PM

#

Are we allowed to share github or Streamlit Cloud url for example to share scripts or apps or only kaggle url are allowed ?

paper veldt Jul 11, 2025, 8:24 AM

#

Hello World! I have a problem with uploading link to Youtube in competition gemma3n. I'm trying to upload good URL of my Youtube-video, but got an error "Invalid URL", but URL is truth!
In script videoUtils.js on website there is an error in string:
fetch("https://youtube.com/oembed?url=".concat(url, "&format=json}")).then(function(res) {

"}" after "json" is unnecessary

Fix it please, because its impossible to upload video and take part in competition

wicked ivy Jul 12, 2025, 5:30 PM

#

subscribe this channel and see the latest update and learning video from here :https://www.youtube.com/@AuraaiX/videos'

YouTube

AuraAI

A U R A • A I
Welcome to Aura AI, where we explore the subtle yet powerful presence of Artificial Intelligence shaping our world. We go beyond the headlines to uncover the essence of AI, demystify complex concepts, and illuminate its future impact. Join us to understand the technology that's defining tomorrow. Subscribe for deep di...

west palm Jul 13, 2025, 10:59 AM

#

Hey Guys,I am seeeking recommendations for an impactful AI/ML project that would strongly appeal to product based companies when they are hiring. The goal is to maximize my chances of securing a job.
Please do suggest me asap : )

barren path Jul 14, 2025, 5:06 AM

#

Can Anyone explain what a JSON really is????????????

frosty heron Jul 14, 2025, 6:59 AM

#

Hi, I have recently made a notebook with plotly graphs, it shows perfectly in Editor mode but it is not showing on Kaggle's notebook viewer, even when I used .show(renderer='iframe').
Notebook: https://www.kaggle.com/code/stevensio/kaggle-journeys-cohorts-and-competition-shifts

I tried to look at it on my girlfriend's macbook and it shows, but somehow, the graphs don't show when viewed on my windows laptop. Could this be an OS issue?

Would really appreciate if anyone could find the issue in my code

Kaggle Journeys: Cohorts and Competition Shifts

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

pseudo musk Jul 14, 2025, 12:57 PM

#

barren path Can Anyone explain what a JSON really is????????????

It is like DataFrame or a dictionary or multidimensional array but lightweight and better for transporting data

dense wolf Jul 14, 2025, 2:31 PM

#

helloo guys actually im trying to
dynamically update my second dropdown (car models) based on the selected company using JavaScript and Jinja2 inside a Flask app. I’ve built the Jinja2 dictionary and the script looks correct, but the dropdown doesn’t update when I select a company. What could be causing this issue—am I missing something in how Jinja2 and JavaScript work together

i have also cleaned my data for it my model is ready to predict but im stuch here
though im following a tutorial its not helping its not actually working

the script is given below
<script>

function load_car_models(company_id, car_models_id)
{
    var company = document.getElementById(company_id);
    var car_model = document.getElementById(car_models_id);
    car_model.value="";
    car_model.innerHTML="";
    

    {% for company in companies %}

    if(company.value == "{{company}}")
{
      {% for model in car_models %}

          {% if company in model %}

       var newOption = document.createElement("option");
       newOption.value="{{model}};
       newOption.innerHTML="{{model}"}
       car_model.options.add(newOption);
       newOption.value ="{{ model }}";
       car_model.options.add(newOption);



    {% endif %}

 {%endfor%}
}

{% endfor %}
}
</script>

timid jungle Jul 14, 2025, 5:07 PM

#

are there any spaces to discuss ml related queries as a student? like a community of students in the same field or smth?

wraith sparrow Jul 14, 2025, 6:29 PM

#

Is bytedance seed 1.5 vl by far the best img understanding model

#

And for edge devices Florence 2?

timid jungle Jul 15, 2025, 12:53 PM

#

when dealing with time series data, how do we know if there is serial dependence in the data or not? is it a question of using domain knowledge or should we use methods like lagging and time step each time to check this thing?

raven canopy Jul 16, 2025, 3:33 AM

#

Hello just wondering if there are any tips for clearing memory from RAM and the GPU? I've been running into out of memory issues using Tensorflow. I tried using a generator in an attempt to reduce RAM for one, while also trying to track and delete every dataset variable related + model variable while also doing gc.collect() and K.clear_session() however I've noticed that for whatever reason, the data I load into the model for model.fit() just sticks in the RAM. Tried deleting that generator via del, assigning it to None too, and I can't seem to get rid of the sticky reference.

Am attempting to do a kfold run but because of my issue it looks like I'd have to restart the kernel every time instead? Also tried running a subprocess and running the fold from a script but it doesn't seem to change anything. Advanced thanks for any replies!

half widget Jul 16, 2025, 8:40 PM

#

Hey everyone! Trying to build a recommendation system for products for a deal website.

We have a few heuristics of how good a deal it is and how popular a product is (deal score and pop score)

Our current deal gallery just optimizes for these heurisitcs. However, we're running into some issues with product diversity. There's certain categories of products that are discounted more frequently and thus appear in the results more frequently.

Our data is stored in postgres, but we load it into polars (in python). We also have milvus set up as a vector db so all our product titles are vectorized as well. Wondering if anyone has any ideas on how to solve this diversity issue

I'm pretty new to ML/data science but one idea was k means clustering based on the titles or vectors? Or maybe this is too naive of an approach

primal trout Jul 18, 2025, 5:15 AM

#

hi guys , its my first time discovering this Machine Learning world ! I have a question tho ; have anyone made any money using this ?

hasty valve Jul 18, 2025, 9:38 AM

#

primal trout hi guys , its my first time discovering this Machine Learning world ! I have a q...

some have made a lot. But this is no get rich quick scheme. Its like any profession; you can make money in football but you have to be top 0.1%

vernal citrus Jul 19, 2025, 2:21 PM

#

i am new to kaggle....can someone tell me ...How to get started????

coral lava Jul 19, 2025, 2:27 PM

#

hi everyone ,https://github.com/campusx-official/ML-Roadmap-for-2022/blob/main/README.md what do you think about that roadmap in 2025?

GitHub

ML-Roadmap-for-2022/README.md at main · campusx-official/ML-Roadma...

A curated list of Machine learning videos, links, projects and datasets to help you conquer the ML landscape in 6 months - campusx-official/ML-Roadmap-for-2022

primal trout Jul 19, 2025, 2:46 PM

#

hasty valve some have made a lot. But this is no get rich quick scheme. Its like any profess...

i dont necesseraly aim getting rich quick , it never was the goal . I do need to work online tho as a starter

wraith sparrow Jul 20, 2025, 10:09 AM

#

hasty valve some have made a lot. But this is no get rich quick scheme. Its like any profess...

Fr either be in top 0.1% in ai or 've those ppl

wraith sparrow Jul 20, 2025, 3:16 PM

#

Anyone interested in video gen models?

keen abyss Jul 20, 2025, 6:21 PM

#

Can anyone help me out I am working on data scientist project thing where I am doing licence plate prediction

I used yolo and ocr to build the models
I have data sets which contains cars and also cropped images which has only number plate

The thing here is the number plate is in Urdu and the ocr model is isn't working properly the number are being detected but not the arabic or urdu text

wraith sparrow Jul 21, 2025, 1:04 AM

#

keen abyss Can anyone help me out I am working on data scientist project thing where I am d...

Try this ocr model https://huggingface.co/microsoft/Florence-2-base-ft

microsoft/Florence-2-base-ft · Hugging Face

wraith sparrow Jul 21, 2025, 6:41 AM

#

i wanna use this model but memory crashing in both collab n kaggle notebook https://huggingface.co/docs/diffusers/main/api/pipelines/hunyuan_video?usage=memory

HunyuanVideo

wraith sparrow Jul 22, 2025, 9:02 AM

#

anyone can help me fix the issue here https://colab.research.google.com/drive/1wEF2RmuNUyRNInr8DrgbqFj4ni7pmrVq?usp=sharing

Google Colab

wraith sparrow Jul 22, 2025, 9:24 AM

#

or this one https://colab.research.google.com/drive/1S3ty97i3NMorupgpFwfoQpB9Mv6FdzGh?usp=sharing

Google Colab

wraith sparrow Jul 22, 2025, 3:01 PM

#

Anyone knows good books on text2video models

glossy crown Jul 22, 2025, 5:37 PM

#

wraith sparrow anyone can help me fix the issue here https://colab.research.google.com/drive/1w...

what issue?

wraith sparrow Jul 22, 2025, 5:39 PM

#

glossy crown what issue?

Torch ka koi version mismatch ho rha shayad

#

Khud dekh lo dono me yaad nhi aa rha

final blaze Jul 24, 2025, 2:52 AM

#

hello everyone.
I am beginner in kaggle.
below problem is exercise of lesson2 from "Intro to Machine Learning".
why this error occured and how can I fix?

problem
import pandas as pd
iowa_file_path = '../input/home-data-for-ml-course/train.csv'
home_data = pd.read_csv(iowa_file_path)
step_1.check()
-error
NameError Traceback (most recent call last)
/tmp/ipykernel_36/423311291.py in <cell line: 0>()
8
9 # Call line below with no argument to check that you've loaded the data correctly
---> 10 step_1.check()

NameError: name 'step_1' is not defined

final blaze Jul 24, 2025, 3:50 AM

#

I am looking for someone to help me with how to do the exercises in the [Intro to Machine Learning] course on Kaggle.
[https://www.kaggle.com/code/wonderfulexcellent/exercise-your-first-machine-learning-model/edit]

pseudo musk Jul 24, 2025, 7:05 PM

#

final blaze hello everyone. I am beginner in kaggle. below problem is exercise of lesson2 f...

did you setup the notebook before doing all this?

final blaze Jul 24, 2025, 7:06 PM

#

pseudo musk did you setup the notebook before doing all this?

thank you for your help.

#

I installed the notebook by running the code that appears first at the start of the exercise.
installing code is below

Set up code checking

from learntools.core import binder
binder.bind(globals())
from learntools.machine_learning.ex2 import *
print("Setup Complete")

#

============
is this right ?

final blaze Jul 24, 2025, 7:18 PM

#

pseudo musk did you setup the notebook before doing all this?

Thank you.

#

I just tried it again your way and moved forward with the practice.

pseudo musk Jul 24, 2025, 7:33 PM

#

nice good luck

rancid terrace Jul 25, 2025, 4:05 AM

#

coral lava hi everyone ,https://github.com/campusx-official/ML-Roadmap-for-2022/blob/main/R...

It's good

thick axle Jul 25, 2025, 8:26 AM

#

Hey guys, as a full stack engineer who have more than 6+ years of experience with frontend and backend, devops, databases (SQL + NOSQL) and AWS.

What's the best way to start learning AI and go for roles like AI Engineer. Or make myself capable to build AI solutions for different industries?

wraith sparrow Jul 25, 2025, 1:28 PM

#

anyone can pls ans this https://x.com/grok/status/1948736131455222227

Grok (@grok)

@noob_contrarian xAI partners with Azure for Grok's enterprise-grade scalability, security, and content safety features, essential for reliable global deployment. While RunPod is cheaper for basic GPU rentals, it lacks Azure's robust infrastructure and integration, making it unsuitable for our

coral lava Jul 25, 2025, 1:29 PM

#

rancid terrace It's good

thanks for your review

rancid terrace Jul 25, 2025, 1:59 PM

#

thick axle Hey guys, as a full stack engineer who have more than 6+ years of experience wit...

Learn linear regression and logistic regression first, it's the basic for any neural network. Then it's upto you to learn other machine learning models or direct jump to perceptron ( basic unit of neural network)

Understand preprocessing data then

Understand neural networks

Later for understanding language processing study NLP

Understand RNN's LSTM AND GRU

After understanding all this learn CNNs for image processing

After this you are ready to understand LLMs

Learn transformer architecture
Learn BERT model architecture

Learn to use langchain , huggingface

Make rag applications
Like pdf summarizer
Caption generators

Learn to use llm inference

After all this learn to finetune models on custom datasets ( require gpu)

Learn to do lora, peft fietuning and Quantizing models

After this learn to use langgraph and connect llm with looks like sql data retrieved, web search tools

You will understand agentic ai

For more advance topics like deep seek r1 you have to learn reinforcement learning

Prompt engineering for prompting the llm

Building llm fromscratch of deploying finetuned models on huggingfave for specific tasks

For learning all this you have 2 framewoks one is tensorflow, another is pytorch.

For llms ai 85 % new published papers are written in pytorch so recommend you to go with pytorch.

For web application you can also use JavaScript for building web application but python environment is more mature and better so going with python gives you more advantages

Last you can also learn about mcp servers to plugin ai with internet

#

I'm not professional but i hope that i might be helpful to you

thick axle Jul 25, 2025, 3:23 PM

#

rancid terrace Learn linear regression and logistic regression first, it's the basic for any ne...

Wow that looks interesting, However, I was thinking, Is there any free resource or video that I can follow to learn these things step by step? Or if you share any tips and tricks to learn this things step by step ?

rancid terrace Jul 25, 2025, 3:28 PM

#

thick axle Wow that looks interesting, However, I was thinking, Is there any free resource ...

Simple steps

Learn regression because neural networks are related to it
Skip other ml. Model in development you don't need them

Learn neural networks, activatiin function, loss functions, optimizers these are the things we need in llms too for finetuning or training our llm on custom. Data

You can just skip rnn or lstm. But they gives you the starting understanding of processing natural language

And for preparing our language flr machine fitting ready we need to understand how NLP WORKS.

So NLP knowledge is important like knowing what is Tokenization etc

Learn transformer model to understand how llm. Works and how they were made then simply go with development

#

Use langchain, huggingface documentation help

#

Resources are : YouTube channels, medium blogs, huggingface, langchain, udemy cources

thick axle Jul 25, 2025, 3:34 PM

#

@rancid terrace Gotcha, Here is the summary I understood:

With the current knowledge I have, I am ready to start learning about regressions. So basically my first step will be to learn what regression is. Nothing else for now. Just learn about regression
Then the second step will be to learn neural networks, activatin function, loss functions, optimizers.
Then learning rnn or lstm is optional (I will definitely learn it)
Then I need to start learning NLP for natural language processing
Finally I need to learn transformer model to understand how llm works
Now I am ready to start development. So this is Part 2 of my roadmap where I will make my hands dirty by working
Start with Langchain and hugging face documentation

Please feel free to correct me If I am wrong. Thank you brother!

rancid terrace Jul 26, 2025, 4:35 AM

#

Everything is fine

#

Best of luck

#

Focus mostly on development

#

Understand the concepts like regression don't start building from scratch

#

Langchain hugginface handles everything

#

Like NLP neural networks etc

wraith sparrow Jul 26, 2025, 6:01 AM

#

>.<

Ash (@noob_contrarian)

Which video genai platform are you using mostly?

bronze lotus Jul 26, 2025, 2:17 PM

#

Does anyone here know how to integrate the results of a feedback survey web app directly into a model's pipelines, so it can reference the survey's data and learn from it without a middleman doing it?

wraith sparrow Jul 27, 2025, 12:53 PM

#

anyone can help pls fix it https://www.kaggle.com/code/ashwinbarnwal/superweights

Superweights

Explore and run machine learning code with Kaggle Notebooks | Using data from Mistral

#

not getting how to login to hf using hf cli in order to confirm the acess granted to use hf models

subtle gulch Jul 27, 2025, 10:08 PM

#

rancid terrace Everything is fine

you are a cool dude

rancid terrace Jul 28, 2025, 4:53 AM

#

subtle gulch you are a cool dude

Just like the book " how to win and influence people " says

If you want to be a great leader first learn to understand what other needs and what you can provide to help them .

remote granite Jul 28, 2025, 3:22 PM

#

that is so good.

rancid terrace Jul 29, 2025, 1:58 PM

#

Pls stop spamming dude we don't care what Mr beast is doing ( he ain't providing us gpu for llms )

rugged anchor Jul 30, 2025, 4:37 AM

#

rancid terrace Learn linear regression and logistic regression first, it's the basic for any ne...

Hey would this same be applicable for a fresh college graduate

rancid terrace Jul 30, 2025, 4:40 AM

#

rugged anchor Hey would this same be applicable for a fresh college graduate

Depends , if you are going for ml or data science or genai engineering

#

He asked me for mostly development

rugged anchor Jul 30, 2025, 4:41 AM

#

I want to learn to be a ai engineer and applied knowledge for generative ai will help

#

I asked one of my senior - Statistics
Linear algebra
Calculus
Panda numpy polaris
Handling and visualization
Scit leave xg boost adaboostlinear regress
Support vector kernal hyper plane
Svm random ensamble method
Bagging boosting
Basic dl
When to use what
Preprocessing of nlp tradeoff nltk spacy
Tokenization
Transformer why diff
Prompt
Rags how does work
Vector db and indexing
Azure, AWS gcp
Agentic ai langchain graph
Pipeline memory agent orchestration
Fine-tuning
Llm which what when why - any 3 architecture
Projects - problem solution

He told me to concentrate on this first

mortal swallow Jul 30, 2025, 5:01 AM

#

guys what to do if you submissions keep getting kaggle errors?

#

your*

rancid terrace Jul 30, 2025, 6:16 AM

#

mortal swallow guys what to do if you submissions keep getting kaggle errors?

Check for the os path of the file , is it correct or not , check for network connectivity

rancid terrace Jul 30, 2025, 6:18 AM

#

rugged anchor I asked one of my senior - Statistics Linear algebra Calculus Panda numpy pol...

Who do you need to learn svm , xgboost etc . You will never use them in ai

#

They are basically used to predict in a given dataset

#

For ai engineering learn regression in detail that's sufficient and learn deep learning with NLP ( tokenizqtion etc ) , later understand encoder decoder architect , attention mechanism , transformer architecture .

Learn generative ai , langchain , langgraph , agentic ai , MCP servers , rag applications ,chatbota , computer vision , reinforcement learning etc

#

For ai engineering go with pytorch because it's currently the most popular framework , all the researches are in pytorch

#

For making production ready and easy structure go with tensor flow

#

🙂

rugged anchor Jul 30, 2025, 6:27 AM

#

rancid terrace For ai engineering learn regression in detail that's sufficient and learn deep l...

Thank you for the advice , can you recommend some of the resources to learn this.

mortal swallow Jul 30, 2025, 6:51 AM

#

rancid terrace Check for the os path of the file , is it correct or not , check for network con...

what if both have no problems?

rancid terrace Jul 30, 2025, 7:17 AM

#

mortal swallow what if both have no problems?

Network issues

#

Try after sometime

mortal swallow Jul 30, 2025, 7:31 AM

#

rancid terrace Network issues

I see thank you.

rancid terrace Jul 30, 2025, 7:45 AM

#

rugged anchor Thank you for the advice , can you recommend some of the resources to learn this...

Ye why not

YouTube : krish nayik , campusx , huggingface , langchain , openai courses

Blog / articles : geeks for geeks , medium.com , etc

#

There are many YouTube channels that also teach something like neuro.. something

#

3blue1brown

rugged anchor Jul 30, 2025, 11:47 AM

#

Thank you Naman

#

It helps me alot

upbeat sluice Jul 31, 2025, 3:14 AM

#

Hi, I am.new to kaggle had a question regarding the submission criteria. What does it mean that my submission has to have a runtime less than 11 hours? Does it mean that the entire inferences should be completed in less than that?

And also the no internet rule. If my code relies on some packages like nibabel or idk torch then how does the no internet rule work.

Thank you for your help

wraith sparrow Jul 31, 2025, 10:21 AM

#

does lmarena dont come with tools

fluid needle Jul 31, 2025, 2:38 PM

#

Hello everyone I am new to kaggle , can anyone explain me how to participate in kaggle hackathon?

zenith matrix Aug 1, 2025, 8:22 AM

#

Hi, I was excited to participate in the code-golf-2025 tournament, but I just saw that my country (Venezuela) is blocked. Will I really not be able to compete because of my nationality?

supple crane Aug 2, 2025, 1:15 AM

#

I am working on on the RSNA Intracranial Aneurysm Detection competition and I have a decent graphics card on my home computer. I have been training models for the past 24 hours on the full dataset getting these values:

[Epoch 1] Avg Loss: 1.3610 | Val Weighted AUC: 0.5800
[Epoch 2] Avg Loss: 1.3071 | Val Weighted AUC: 0.4637
[Epoch 3] Avg Loss: 1.3052 | Val Weighted AUC: 0.5383
[Epoch 4] Avg Loss: 1.3069 | Val Weighted AUC: 0.6006
[Epoch 5] Avg Loss: 1.2967 | Val Weighted AUC: 0.5206
[Epoch 6] Avg Loss: 1.3007 | Val Weighted AUC: 0.4524
[Epoch 7] Avg Loss: 1.3029 | Val Weighted AUC: 0.5068
[Epoch 8] Avg Loss: 1.3083 | Val Weighted AUC: 0.5367

I am curious to know of other ways to make my training faster. My computer isnt trying terribly hard, its using most of its RAM but those 8 models have taken about 24 hours to complete. Can anyone help me out?

quasi smelt Aug 2, 2025, 9:20 AM

#

Hi! Our team is participating in the CMI – Detect Behavior with Sensor Data competition.
We’ve enabled “Always use latest version” in our Kaggle notebook, but some teammates still see older versions when editing.

Just to clarify: Is it against the rules to use GitHub for collaborative EDA and model training only, without exposing any test data?
We’re trying to keep things safe while managing our workflow.

Appreciate any clarification — thanks!

hasty valve Aug 2, 2025, 6:12 PM

#

for hyperparameter tuning, what are the go to methods? Ive been using gridsearchcv, but came across optuna and it seems a lot better

iron ingot Aug 3, 2025, 9:51 AM

#

Hi everyone! I’ve recently started diving deeper into the world of machine learning. Before that, I was mainly into mathematics — particularly convex analysis and tensor optimization.

However, no matter how much I look around, it seems like most jobs/projects only value the ability to use existing libraries and build models like Lego blocks from pre-made components.

Do you know of any areas or projects where deeper mathematical knowledge like this is actually useful or appreciated? I’m starting to feel like it doesn’t really matter, which is a bit disheartening😔

subtle gulch Aug 3, 2025, 7:14 PM

#

iron ingot Hi everyone! I’ve recently started diving deeper into the world of machine learn...

why dont you do researchs then?

iron ingot Aug 3, 2025, 7:26 PM

#

subtle gulch why dont you do researchs then?

Because research can take years, and there’s no guarantee it will lead to success. That’s why I’m trying to focus more on finding jobs or joining projects that can bring a more stable or predictable income.

upbeat sluice Aug 4, 2025, 5:03 AM

#

supple crane I am working on on the RSNA Intracranial Aneurysm Detection competition and I ha...

One way is to limit the data size. Research MIP and how to use it (convert 3d to 2d, so you can make your data lighter). Also maybe resize the images.

rancid terrace Aug 4, 2025, 5:19 AM

#

iron ingot Because research can take years, and there’s no guarantee it will lead to succes...

That's a good plan .

white glacier Aug 4, 2025, 12:36 PM

#

has anyone got any good resources for linear mix models?

vestal trail Aug 4, 2025, 4:53 PM

#

Good ebening. I seem to have some difficulty accessing the data for the DFL - Bundesliga Data Shootout. Could I be so lucky that someone reading this could help me out? thanks in advance.

subtle gulch Aug 4, 2025, 9:50 PM

#

iron ingot Because research can take years, and there’s no guarantee it will lead to succes...

you can get a research job, there are many ML jobs that are for research position you know that right? it doesn't need to be something that will take years

dawn nova Aug 5, 2025, 8:31 AM

#

Hello everyone I am a complete beginner like I did a few ML courses and wanted to do my first project so I chose Kaggles House Prices - Advanced Regression Techniques dataset but there is one thing that has been bothering me in this project ALOT

#

The problem I was having was what is an empty cell. This seems trivial but some categories have NA (not applicable) a string to represent the non existence of something for example BsmtQual it has a category NA for when the house doesnt have a basement so NA not applicable. Others like MasVnrType it has None as a category for when there isnt a MasVnr (whatever that is) and then the empty cells the ones that arent filled in they also have the string NA to represent them

#

So what I wanted to do was to keep the meaningful NA's and None's (I say 's but there was only one None, only in MasVnrType) and impute the empty cells. This all seems easy enough but caused me a whole lot of trouble here is what I did
houses = pd.read_csv("data/train.csv", keep_default_na=False, na_values=[""])
I used read_csv in such a way that pandas doesnt "help me" by converting all the NA's and nones into NaN's bcz then how will I differenciate between the actual empty cell and a NA or none
then
I split the test train and num categorical the usual stuff
after which I did this
meaningful_na_cols = ["Alley","BsmtQual","BsmtCond","BsmtExposure","BsmtFinType1","BsmtFinType2","FireplaceQu","GarageType","GarageFinish","GarageQual","GarageCond","PoolQC","Fence","MiscFeature"]

for col in meaningful_na_cols:
X_train_cat[col] = X_train_cat[col].replace("NA", "NoNoneNo")
X_test_cat[col] = X_test_cat[col].replace("NA", "NoNoneNo")

X_train_cat['MasVnrType'] = X_train_cat['MasVnrType'].replace("None", "NoNoneNo")
X_test_cat['MasVnrType'] = X_test_cat['MasVnrType'].replace("None", "NoNoneNo")
created a unique category so that the meaningful NA's and Nones dont get imputed

imp_cat = SimpleImputer(strategy="most_frequent")
X_train_cat = imp_cat.fit_transform(X_train_cat)
X_test_cat = imp_cat.transform(X_test_cat)
I then imputed and convereted this X_train_cat back to a csv for me to view and this happened

#

my MasVnrType had 4 columns the data description says it has 5 but my training data only had 4 so good nice. Then I did drop first so there should be 3 so out of these 4 MasVnrType categorical columns 3 are meaningful types like Stone or cement or None but one is just NA? How did that get in there?

#

I just need help with csv files and empty cells like can someone please explain in real life datasets how is an empty cell represented how does pandas interprets an empty cell do they put NA string none string NaN like please help me out I am so confused Idk where the problem happens is it the csv file thats formatted this way is it pandas or am I retarded
cheers

limpid garnet Aug 5, 2025, 11:33 AM

#

What does "plot_df = dataset_df.Transported.value_counts()" does ?

iron ingot Aug 5, 2025, 4:14 PM

#

rancid terrace That's a good plan .

Thanks for the support — I hope that wasn’t sarcasm 🙂

iron ingot Aug 5, 2025, 4:20 PM

#

subtle gulch you can get a research job, there are many ML jobs that are for research positio...

That's actually a good idea — I hadn't thought about options like that (probably because I haven't come across them 😅 ), but I think this kind of direction really does feel closer to me.

rancid terrace Aug 5, 2025, 4:29 PM

#

iron ingot Thanks for the support — I hope that wasn’t sarcasm 🙂

Nope it's not sarcasm

#

No jokes while giving advice

hollow flicker Aug 5, 2025, 5:13 PM

#

@eager thicket when we are submiting our code for comepition should we keep our api or the judges use their api

#

can some one help me please

dry lynx Aug 6, 2025, 5:01 AM

#

Hello!

I want to do the kaggle exercises at my work desktop but the company internet is sloowww and honestly jsut takes forever to run.

is there a way i can run the setup codes locally on a python ide?

hidden hollow Aug 6, 2025, 5:46 AM

#

Hi everyone! I have started course on Kaggle about Advanced SQL, but ran into problems when trying to complete the tutorials 🥲 This is my first time working with Kaggle notebooks and taking the course as well.

There was this error when i tried to execute the set up cell
/usr/local/lib/python3.11/dist-packages/google/cloud/bigquery/table.py:1727: UserWarning: BigQuery Storage module not found, fetch data with the REST endpoint instead.
warnings.warn(

The rest of the tutorial cells don't seem to work as well. Could someone please help to solve this issue? 🙏 Thank you!

sly hound Aug 6, 2025, 9:52 AM

#

did refreshing the page solve the issue? what troubleshooting has been done?

deft yarrow Aug 7, 2025, 1:57 AM

#

Hi i have a question about the arc challenge i keep getting error in submission scoring error ? Am i missing something? I changed multiple times and edited but still the same?

hidden hollow Aug 7, 2025, 3:15 AM

#

sly hound did refreshing the page solve the issue? what troubleshooting has been done?

No, unfortunately refreshing didn't help, and I don't know what else can I do to solve the issue
Should there be some set up done / something downloaded for code in kaggle to work?

sly hound Aug 7, 2025, 5:06 AM

#

this is a good time to get your discussions point on kaggle. Start a dissucsion on Kaggle for this issue.

hidden hollow Aug 7, 2025, 5:23 AM

#

Thanks for advice! I have read existing discussions, yet couldn't see anyone with the same problem. For some reason, I cannot start my own discussion (it says that I do not have a permission), do you know what might be the problem?

wraith sparrow Aug 7, 2025, 9:03 AM

#

anyone pls help https://github.com/VideoVerses/VideoTuna getting error while inferencing

GitHub

GitHub - VideoVerses/VideoTuna: Let's finetune video generation mod...

Let's finetune video generation models! Contribute to VideoVerses/VideoTuna development by creating an account on GitHub.

real dew Aug 7, 2025, 10:25 AM

#

hidden hollow Thanks for advice! I have read existing discussions, yet couldn't see anyone wit...

You could try contacting Kaggle support

urban light Aug 7, 2025, 8:01 PM

#

what's everyones favourite dataset review/annotation tool? I would like to have a tool where I can import my entire dataset.jsonl containing questions and answers, be able to mark each item as correct/incorrect, and finally be able to update the answer if needed. Also, I would like to be able to share this dataset so other people can do them.

Once everything is complete, I would like to run evals directly on the dataset. And finally, run fine tuning. Does a tool like this exist? 😄

humble lodge Aug 8, 2025, 8:27 AM

#

I feel that nowadays it is really hard to keep up with all the AI developments. How do you keep yourself informed with all these advancements coming daily? I personally find it very hard to keep track of all the sources (arxiv, new website, big tech blog etc). Interested to know your thoughts!

summer dagger Aug 10, 2025, 11:08 PM

#

Hello everone, can anybody guide me on how to remember the use of brackets, symbols etc? i have completed variables and functions exercises on kaggle and implemented a VaR model on small data set in VS code, following a yt video but i can't even write a 2 line code without mistakes on my own without looking for help? Thanks

marsh lantern Aug 11, 2025, 6:51 AM

#

summer dagger Hello everone, can anybody guide me on how to remember the use of brackets, symb...

Totally normal! A few tips that help:

Use official documentation often — don’t try to memorize everything, just know where to find it.
Type code yourself instead of copy-pasting, even in tutorials.
Build small scripts like a calculator or to-do list to apply what you learn.
Read other people’s code to see how they structure things.

With time, the syntax will stick naturally!

toxic urchin Aug 12, 2025, 12:36 PM

#

Hello everyone, I recently made a basic spectrometer with a DVD's surface as the diffraction grating. I want to go beyond and make a device that captures the spectrum and makes the electromagnetic spectrum's waves RGB + other colours too in another wave plot (see SpectralWorkbench since somebody made it and it's cool) and also an AI model that would predict the light fixture type. How do I do it? Integrate all models in new hardware or use my laptop? Plus, coding problems. Could anyone please guide me through this?

wraith sparrow Aug 12, 2025, 1:17 PM

#

anyone tried goolge's adk using uv

civic seal Aug 13, 2025, 5:10 AM

#

Hi, Im doing a competition for house prediction and saw someones notebook and saw that wahtever they did to the train.csv they did to the test.csv Like fi they dropped a certain feature they would do the same to the test.csv. Is this normal?

steel valley Aug 13, 2025, 9:20 AM

#

civic seal Hi, Im doing a competition for house prediction and saw someones notebook and sa...

Hello @civic seal yes that's perfectly normal. Compare it to learning for an exam yourself, if the teacher says: don't learn chapter 3, it would be unfair to put chapter 3 in the exam.

civic seal Aug 13, 2025, 2:01 PM

#

steel valley Hello <@1285075361595654200> yes that's perfectly normal. Compare it to learning...

Are you also supposed to drop features on the test set like if you dropped 5 features on the train ebcause its 90% missing you would also drop the feature on the test? I also saw somenoes notebook do this

pure garnet Aug 13, 2025, 2:02 PM

#

hey,is there anyone know how to join bigqueryai competition server ? im unable to join

steel valley Aug 13, 2025, 4:49 PM

#

civic seal Are you also supposed to drop features on the test set like if you dropped 5 fea...

yes same

steel valley Aug 13, 2025, 4:49 PM

#

pure garnet hey,is there anyone know how to join bigqueryai competition server ? im unable t...

Do you mean this https://www.kaggle.com/competitions/bigquery-ai-hackathon ?
What's going wrong?

BigQuery AI - Building the Future of Data

Build AI solutions with BigQuery

summer dagger Aug 13, 2025, 5:36 PM

#

marsh lantern Totally normal! A few tips that help: 1. Use official documentation often — don...

Thanks for the suggestions, i will follow.🫡

civic seal Aug 14, 2025, 5:13 AM

#

So why do people do a train, val split with the "train_test_split()" function rather than just go into tarinning on all the trainding data and then predicting onthe test data?

civic seal Aug 14, 2025, 5:57 AM

#

steel valley yes same

So i was doing another kaggle competition today and I trained the model but when it came to predicting on the test i got a error aout how the shape was off (features werent the same). And i realized i had to add the same adjustments to the test dataset as i did to the train. But I had to go back and manually add everything. Like if i did train_data.drop('Name', axis=1, inplace=True) I would have to also add test_data.drop('Name', axis=1, inplace=True) is there a more efficient way of doing this?

wraith sparrow Aug 14, 2025, 6:03 AM

#

civic seal So i was doing another kaggle competition today and I trained the model but when...

if u dont mind give it a try once

civic seal Aug 14, 2025, 6:08 AM

#

What is it exactly? Antoher gemini?

wraith sparrow Aug 14, 2025, 6:15 AM

#

civic seal What is it exactly? Antoher gemini?

no way give it try urself its way different

steel valley Aug 14, 2025, 8:17 AM

#

civic seal So i was doing another kaggle competition today and I trained the model but when...

yeah you could drop the features before splitting into train and test

proper wedge Aug 14, 2025, 11:51 AM

#

Can anybody tell me the 100% free to do online certifications which have high value for ai ml job which are offered by top companies or institutions

mystic rapids Aug 14, 2025, 2:03 PM

#

proper wedge Can anybody tell me the 100% free to do online certifications which have high va...

You should try
Elements of AI – University of Helsinki & MinnaLearn Or IBM SkillsBuild

wraith sparrow Aug 14, 2025, 3:38 PM

#

-# anyways anyone would like to fix the broken docs page https://github.com/Ash-Blanc/paper2sw.git

GitHub

GitHub - Ash-Blanc/paper2sw

Contribute to Ash-Blanc/paper2sw development by creating an account on GitHub.

dull sky Aug 14, 2025, 6:38 PM

#

hi! I'm planning to launch a community competition and I'd ask for some advice regarding it. So far I asked permissions from the data providers, and I have a rough idea about the compeitition itself. Is there a guide I should follow?

#

it'd be a timeseries problem and I'd provide a simple data dictionary and I'd use MASE as metric

shadow canyon Aug 14, 2025, 7:19 PM

#

Hey guys, I'm planning to train a 123million parameter model themed J.A.R.V.I.S (yes, that Jarvis from marvel). I'm carefully selecting the data it gets trained on to get the best results but I have a slight problem. The model will be a conversational one and it will have lots of memories (there's a database for that), from which it will need to query the user responses and put them together,summarize and reply. I have no idea how the data format should be for that kind of thing. I was thinking JSON with alpaca but I'm not so sure. Any advice?

fickle snow Aug 15, 2025, 4:39 PM

#

hi. I am a beginner in AI.
Should I code everything with Python and Math Libraries only to understand everything clearly,
or should I use available AI libraries like PyTorch?
Thanks in advance.

dull sky Aug 15, 2025, 4:53 PM

#

fickle snow hi. I am a beginner in AI. Should I code everything with Python and Math Librar...

It depends on your goals. To be honest people put way too big emphasis on the models. Somewhere I read that professional projects are about 10% planning, 80% data preparation, 10% model training.

#

Anyway, if you're interested in the math and implementation part https://www.youtube.com/watch?v=w_2vCijLiiM&list=PLkDaE6sCZn6FNC6YRfRQc_FbeQrF8BwGI&index=16

fickle snow Aug 15, 2025, 4:54 PM

#

dull sky It depends on your goals. To be honest people put way too big emphasis on the mo...

my goals is to be good in ML enough to create a AGI
i'm not doing this just for fun. ML is my life.

dull sky Aug 15, 2025, 4:54 PM

#

like artificial general intelligence?

fickle snow Aug 15, 2025, 4:54 PM

#

yes

fickle snow Aug 15, 2025, 4:54 PM

#

dull sky Anyway, if you're interested in the math and implementation part https://www.you...

yeah i've taken that course on coursera

#

so do you think i should begin with python and math first or should i use pytorch and tensorflow right away?

#

i'm doing the titanic challenge

#

that's my first time building a model

dull sky Aug 15, 2025, 4:56 PM

#

about AGI I don't want to discourage you, but as you'll learn more and more you'll understand how far we're from that 🙂

fickle snow Aug 15, 2025, 4:57 PM

#

dull sky about AGI I don't want to discourage you, but as you'll learn more and more you'...

yeah, but just set that aside and lets get back to my question

#

i don't know if i'm clear enough, but i'm not talking about learning what first, but doing what first, because i've learn the basic, and i'm trying to build a model, and i don't know should i build it with pure python and math or with AI libraries.

dull sky Aug 15, 2025, 5:07 PM

#

That's just my opinion, but I'd first look for a smaller project.

Find a field that benefits from deeplearning, let's say agriculture. (you can check on kaggle, or get vague answers from chat bots)

Than you should narrow down the branches of machine learning needed (shallow/deep, [un]supervised etc.), you may check what is hot topic of research at https://paperswithcode.com/

You have a domain/field, what is being or can be used there. If it is agriculture after some search at kaggle reveals that there are several computer vision problems.

You've done all these, now set a smaller project, check how people solve that problem, just pick a good looking notebook, repeat it using the documentations of the used libraries as pointers, try to find the framework being followed there.

fickle snow Aug 15, 2025, 5:07 PM

#

i appreciate your advice, but that's not related to my question man

#

actually i've got to go now, see you later, and add me on dm too

dull sky Aug 15, 2025, 5:17 PM

#

fickle snow actually i've got to go now, see you later, and add me on dm too

I'd probably go though a simple multi layer perceptron model in pure python, maybe some common layers (CNN), but I wouldn't go much deeper than that. Once you have that covered go to github and check how prefessionals solved the same problem.

#

I'm just a hobby guy, so I'd focus on practical applications after that.

fickle snow Aug 16, 2025, 12:31 AM

#

dull sky I'd probably go though a simple multi layer perceptron model in pure python, may...

thanks for the advice

wraith sparrow Aug 16, 2025, 2:02 AM

#

fickle snow hi. I am a beginner in AI. Should I code everything with Python and Math Librar...

Try this https://substack.com/@rasbt/note/c-132078631?r=qp031

civic seal Aug 16, 2025, 5:21 AM

#

Does anyone know why my subission file is getting saved like with the C1 and C2 as column name but when i do the head() of the df it will not show that. (this is the reason my submission was wrong). My code for saving it is this:

submission_df = pd.DataFrame({'PassengerId': test_data['PassengerId'], 'Transported': test_predictions})
submission_df.columns = ['PassengerId', 'Transported']

submission_df.to_csv('submission.csv', index=False)```

feral ore Aug 17, 2025, 12:27 PM

#

Im having trouble connecting to the localhost on Dbeaver.

steel valley Aug 19, 2025, 2:51 AM

#

feral ore Im having trouble connecting to the localhost on Dbeaver.

which error?

feral ore Aug 19, 2025, 4:58 AM

#

I cant add a screenshot here for some reason

#

It says:

Connection to 'localhost' cannot be established.

Reason:
Communications link failure

The last packet sent successfully to the server was 0 milliseconds ago. The driver has not received any packets from the server.

#

Tried creating a new connection and then it told me:

Connection refused: getsockopt

flint ocean Aug 19, 2025, 5:15 AM

#

Guys can someone say where to start with in studying about ML LIKE A RAOD MAP AND STUDY PLAN like how you guys started

wraith sparrow Aug 19, 2025, 5:36 AM

#

flint ocean Guys can someone say where to start with in studying about ML LIKE A RAOD MAP AN...

@qafig ask him

flint ocean Aug 19, 2025, 5:45 AM

#

wraith sparrow `@qafig` ask him

What does that mean?...is that a username?

pine pollen Aug 19, 2025, 6:32 AM

#

In neural-network regression, should model performance be independent of the number of output variables? Specifically, given the same dataset, will one multi-output model that predicts three targets perform the same as training three separate single-output models (one per target)? Why or why not?

brazen grotto Aug 19, 2025, 7:15 AM

#

pine pollen In neural-network regression, should model performance be independent of the num...

Not necessarily. If the three targets are totally independent, then training one model with three outputs or three separate models could give similar results. But in most real cases the targets share some structure. A single multi-output model can pick up on those shared patterns, which sometimes helps. On the flip side, if the tasks are very different or unbalanced, training them together can actually hurt because the model has to compromise. So the answer really depends on how related the targets are and how much capacity the model has.

ornate pike Aug 22, 2025, 9:40 AM

#

Does anyone know how long emailing support@kaggle.com takes for a response? They are not responding to my attribution update request even though the kaggle website UI explicitly says to email them.

naive pond Aug 23, 2025, 9:01 AM

#

Hi ,how can I upscale video on kaggle?

wraith sparrow Aug 23, 2025, 12:16 PM

#

do kaggle have any official mcp server

gilded drift Aug 23, 2025, 12:48 PM

#

flint ocean Guys can someone say where to start with in studying about ML LIKE A RAOD MAP AN...

It will be good if you can share your background? Stat? Maths? CS? Unless you express where you stand 'where to start' will fetch less meaning response.

thick widget Aug 24, 2025, 10:16 AM

#

hello everyone i'm hari and I'm new to kaggle how i can upskill me on this platform

gloomy nest Aug 24, 2025, 10:18 AM

#

I see in a lot of job descriptions that one needs to have hands-on experience with LLMs (train, fine-tune etc.)

Does anybody know some relevant links/documentation from which I can learn about LLMs

Also can you suggest a project that would prove the knowledge needed for a junior job?

Thanks

blissful edge Aug 25, 2025, 1:12 AM

#

Sorry if this is a stupid question but how to use packages that require internet to download for offline submission?

uncut patrol Aug 25, 2025, 12:54 PM

#

blissful edge Sorry if this is a stupid question but how to use packages that require internet...

download the wheel files and put them in a dataset and then link that dataset to your notebook, and then you can install the packages from the wheel files using pip without internet! 🙂

limber wasp Aug 26, 2025, 4:13 PM

#

I have been trying to register for a competition for weeks, but PersonaID refuses to recognize/verify my face. I've went back and forth with support just telling me that they reset it and to try again. Now its the day of the competition and I've done the work but I'm still not able to register for the competition and submit my work.

limber fern Aug 27, 2025, 1:42 PM

#

are spontaneous applications effective? i'm starting to look for an internship. Also i'm thinking about a good prompt to generate a cover letter for each application/company

brittle cave Aug 28, 2025, 4:37 PM

#

Who here understand docker (beginner or expert).
Need help with dockerizing an ai project

crimson loom Aug 29, 2025, 7:33 AM

#

I completed the ID verification, including face recognition, but I still cannot receive the SMS verification code.

#

plz, I need help

cursive owl Aug 29, 2025, 9:39 PM

#

brittle cave Who here understand docker (beginner or expert). Need help with dockerizing an a...

I can help, if you can tell me where you need help

brittle cave Aug 29, 2025, 9:42 PM

#

cursive owl I can help, if you can tell me where you need help

i have a project uses ollama models
i need someone to help with making a docker file that

installs ollama
pulls appropriate models
installs python libraries
adds some argument to docker

nimble meadow Aug 30, 2025, 1:54 AM

#

can any one explain how to install SVD and import it , i am getting ModuleNotFoundError: No module named 'surprise'

uncut patrol Aug 30, 2025, 2:07 AM

#

this is on a kaggle notebook right?

#

just add !pip install suprise to the top i think

#

the ! tells juypter to execute a terminal command

nimble meadow Aug 30, 2025, 2:12 AM

#

uncut patrol this is on a kaggle notebook right?

to install surprise - SVD we need to downgrade the version of numpy but still there is same issue

velvet spoke Aug 30, 2025, 9:45 AM

#

Ai blog generation project -

Issues -
At regeneration time, I have issues of overall quality improvent, user instructions follow, Alignment with original context, factual accuracy, true generation ( not rephrasing, new and true content ) , also consider latency, cost, scalable

Data flow ---
At regeneration, I am giving data past conversation ( first time default data- blog generation, heading h1 h2 h2 h3 so on, primary and secondary keyword, deafault prompt ) quotedblogsection , user instructions.
Note - if it is second regeneration, in second input goes like past conversation ( first time default data+ first regeneration data) . It is happening like this for every regeneration.

Note -
U can give me answer by considering all leaving latency, cost, scalable. Also If u give answers consider including all even latency cost scalable

Aprroach I have -
1.Prompt engineering approachwith pass conversation passing as a reference.
2 multiple agent approach
3 Single agent only

Question ----
1 I want to say which method work best to meet overall expectation and why. ?

2.If not work above method, what another approach I have to apply to fulfill my expection

3 Which model should I use - got 5 nano and got 4.1

wraith sparrow Aug 30, 2025, 12:34 PM

#

Anyone have any crazy/good idea with (ai) agents?

wraith sparrow Aug 30, 2025, 12:35 PM

#

velvet spoke Ai blog generation project - Issues - At regeneration time, I have issues of ...

Claude

fresh forge Aug 30, 2025, 9:05 PM

#

hi , guys i have a question , what i should do if i have dataset and there is column in my datasets call
review contains text or description , can i transformation this col to be labels or one-hot or i should
drop this columns if i wanna clustering that's dataset

wraith sparrow Aug 31, 2025, 3:29 AM

#

-# is there any significant diff between https://github.com/Tencent/Youtu-agent and crewai?

high leaf Aug 31, 2025, 8:31 AM

#

Hi @everyone
Can some body help me review my code i trained model for the first time

sudden notch Aug 31, 2025, 3:45 PM

#

fresh forge hi , guys i have a question , what i should do if i have dataset and there is co...

Sorry but your English is a bit confusing. ONLY if it is categorical, use one-hot.

sudden notch Aug 31, 2025, 3:46 PM

#

wraith sparrow Claude

Why should I tell you? What am I going to get?

wraith sparrow Aug 31, 2025, 3:47 PM

#

sudden notch Why should I tell you? What am I going to get?

Wdym

sudden notch Aug 31, 2025, 3:48 PM

#

crimson loom I completed the ID verification, including face recognition, but I still cannot ...

Turn on airplane mode, wait for 10s, turn it off, try again. If it still doesn't work, there's not anything you can do.

sudden notch Aug 31, 2025, 3:48 PM

#

wraith sparrow Wdym

What do you mean?

wraith sparrow Aug 31, 2025, 3:49 PM

#

sudden notch Why should I tell you? What am I going to get?

Yes what do you mean

sudden notch Aug 31, 2025, 3:49 PM

#

Oh I see 😂

#

What am I gonna get if I give you an idea?

#

😎

wraith sparrow Aug 31, 2025, 3:50 PM

#

sudden notch What am I gonna get if I give you an idea?

Ur gonna get one idea in return (for now)

sudden notch Aug 31, 2025, 3:52 PM

#

Mine is ten billion percente better than yours. 👎

#

So why to share for an uncertain idea?

#

I want something better

#

How much experience do you have?

wraith sparrow Aug 31, 2025, 4:39 PM

#

sudden notch How much experience do you have?

In trading ideas?

burnt geyser Aug 31, 2025, 5:19 PM

#

I'm doing the birdclef competition as part of a school project.
I trained my base model and got bad results, even though I took care of class imbalance I think I may have done something wrong, I would appreciate it if someone could help me

wraith sparrow Sep 1, 2025, 8:02 AM

#

can anyone help with finetuning gpt-oss-20b model on ssrl

inland crescent Sep 1, 2025, 11:18 AM

#

When will cohort 5 of Kaggle X start?

mortal pond Sep 1, 2025, 8:37 PM

#

Hi need some urgent help in the ML Model, anyoneup?

round prism Sep 2, 2025, 9:10 PM

#

Does any one know a trading chanel thankss

wraith sparrow Sep 3, 2025, 3:02 AM

#

Do anyone knows any lightweight Opensource alt to firebase studio

stoic ledge Sep 5, 2025, 4:11 AM

#

Recently i have started learning ML and for that i have learned python and now moving onto numpy but the maths that i am learning matrices for the matrix calulation so does anyone know how much maths is required in numpy

supple juniper Sep 7, 2025, 1:42 PM

#

I have joined the discord and linked my account ,why the Agent of Discord is still locked

stoic ledge Sep 8, 2025, 5:10 AM

#

supple juniper I have joined the discord and linked my account ,why the Agent of Discord is sti...

it can be because of you did not have linked your discord account with kaggle

stoic ledge Sep 8, 2025, 5:11 AM

#

supple juniper I have joined the discord and linked my account ,why the Agent of Discord is sti...

And you can link it by going on the kaggle official website

twilit hearth Sep 8, 2025, 6:38 AM

#

stoic ledge Recently i have started learning ML and for that i have learned python and now m...

NumPy will do matrix calculations for you. However if you want to understand how it is working then learn the mathematics. Now how much maths required depends on how much advanced operations you want to perform.

For a basic beginner level, high school level maths for matrices is enough.

jolly torrent Sep 8, 2025, 1:46 PM

#

hello everyone! i recently just started doing sql course in kaggle, at the time i reach WITH ... AS part ive got a problem on first cell which the problem is this

Collecting git+https://github.com/Kaggle/learntools.git
Cloning https://github.com/Kaggle/learntools.git to /tmp/pip-req-build-_uuo8ygc
Running command git clone --filter=blob:none --quiet https://github.com/Kaggle/learntools.git /tmp/pip-req-build-_uuo8ygc
fatal: unable to access 'https://github.com/Kaggle/learntools.git/': Could not resolve host: github.com
error: subprocess-exited-with-error

× git clone --filter=blob:none --quiet https://github.com/Kaggle/learntools.git /tmp/pip-req-build-_uuo8ygc did not run successfully.
│ exit code: 128
╰─> See above for output.

note: This error originates from a subprocess, and is likely not a problem with pip.
error: subprocess-exited-with-error

× git clone --filter=blob:none --quiet https://github.com/Kaggle/learntools.git /tmp/pip-req-build-_uuo8ygc did not run successfully.
│ exit code: 128
╰─> See above for output.

note: This error originates from a subprocess, and is likely not a problem with pip.

anyone could help me solve this?

supple juniper Sep 8, 2025, 2:12 PM

#

stoic ledge And you can link it by going on the kaggle official website

thanks，i have got this Badge

stoic ledge Sep 8, 2025, 2:15 PM

#

twilit hearth NumPy will do matrix calculations for you. However if you want to understand how...

Thanks for answering

But is there any sense to learn how the matrix calculations is done in Numpy as I think that would help me later to refine the data and train ML models

sonic karma Sep 8, 2025, 8:22 PM

#

Hey Everyone, I would be studying Cloud Computing this semester in my undergrad degree , as I am focused towards ML , I wanted to know what should be my approach towards cloud computing as I want it to learn in the ML way ,
Would you suggest a course , any book or any other resource , please do tell. Thanks

#

For reference I would be following this text book from the curriculum:
Cloud Computing, Theory and Practice by Dan C. Marinescu, THIRD EDITION,
Morgan Kaufmann Publishers, 2022

wraith sparrow Sep 9, 2025, 1:46 AM

#

ASK UR SENIORS

twilit hearth Sep 9, 2025, 5:33 AM

#

stoic ledge Thanks for answering But is there any sense to learn how the matrix calculatio...

Yes it is good to know that because that will help in optimizations when required
But that is not mandatory.

stoic ledge Sep 9, 2025, 7:38 AM

#

twilit hearth Yes it is good to know that because that will help in optimizations when require...

Thanks for answering 🙂

ripe blaze Sep 10, 2025, 9:11 AM

#

Anyone can give me any idea how deployment would work for an AI model? I know tools like ollama/unsloth are used to experiment/finetune models, but once its ready for deployment, is there like a certain tool good for that? Assuming you dont want to deploy a model using unsloth due speed differences orsum.

waxen marsh Sep 10, 2025, 6:56 PM

#

what are the top kinds of software engineering problems y'all have seen LLM consistently failing at, even when given step-by-step instructions which can be followed to achieve the solution, one problem that I have seen consistently is , If I given multiple interdependent validation points, the LLMs usually fails to generate optimal results even with multiple tries and feedbacks. what is yours?

molten cave Sep 12, 2025, 5:36 AM

#

Im new Here, should I want to learn python or c++ for CUDA??, new to coding btw

pine mica Sep 12, 2025, 8:17 AM

#

I am facing challenges completing the python exercise of "Working with External Libraries"
Task No. 3, can anyone help me!?

obsidian elbow Sep 12, 2025, 8:43 AM

#

May I know how to be a sponsor ?

mental galleon Sep 12, 2025, 9:59 AM

#

hello guys! I was about to start the 5 day intesive course but the checkboxes don't stay put after refreshing the site.

am I missing something or is there any other way to keep track of what I am doing?

thanks!!

golden lake Sep 12, 2025, 10:29 PM

#

hello guys! I am new to data science and and am looking for tips, advices and resources to start it out. also what is the necessary math, I also want to mention that I am looking for fundamentals only I will be using python (pandas, numpy and matplotlib for visualization) and maybe SQL. also please guys tell me if SQL is necessary or not, and do we use SQL in ML? thank you all in advance.

olive crest Sep 14, 2025, 7:46 AM

#

golden lake hello guys! I am new to data science and and am looking for tips, advices and r...

Hello!
SQL is definitely important for ML because data and databases form the foundation of any project.

All ML models are trained on datasets, and SQL helps in modifying data and performing feature engineering or scaling.

For math, high school–level statistics and probability are enough to get started. It’s important, however, to understand the math behind algorithms like Linear Regression, KNN, etc.

I’d suggest starting with Python. Don’t go too deep at first...focus on building logic, practicing OOP, and optionally some DSA. Then move on to SQL and practice queries regularly.

After that, learn NumPy and Pandas...the key is to practice lots of problems (Kaggle is great for this).

Next, dive into EDA, feature engineering, and data visualization. Begin with Matplotlib and later try Plotly for interactive visuals.

Once comfortable, start applying ML algorithms with NumPy and scikit-learn to learn how to train models.

Finally, work on small end-to-end ML projects. That’s when everything you’ve learned will come together. You can also refer to YouTube for project ideas.

golden lake Sep 14, 2025, 10:25 PM

#

olive crest Hello! SQL is definitely important for ML because data and databases form the fo...

thank you so much!

uneven marsh Sep 15, 2025, 3:40 AM

#

Is there a video, forum, tutorial, or website out there that can make me understand how to code a simple/basic ML model in 25-30 mins? Something like when I'm on my bus ride to school, I can listen to a video or read the website page?

placid root Sep 15, 2025, 3:47 AM

#

Hlo guys anyone know how to connect cloud database with jupyter notebook python using sqlalchemy

old mist Sep 15, 2025, 6:56 PM

#

Hi, @everybody
I have one question, I'm training ml models for the prediction, which is classification problem of 3 classes, where the number of samples are similar but the predition is skewed.
First class and second class is predicted with low precision tough, third class is never predicted. What's the reason? I can' t find the reason.
Before, when I applyed reinforcement learning, where the three classes were assigned to three actions and one action is never selected, too.

#

Actually, that is the preeiction model of forex eur/usd.

civic river Sep 16, 2025, 12:45 PM

#

Can sm1 help me in integrating tpu with the kaggle notebook am facing a bit of issues

high estuary Sep 16, 2025, 3:10 PM

#

golden lake hello guys! I am new to data science and and am looking for tips, advices and r...

if you only use pandas numpy and matplotlib, you're not a data scientist. not even data analyst, need to have much more for over saturated entry level tech jobs

green quest Sep 17, 2025, 10:10 AM

#

Hello, i am new to machine learning, i want to become a self taught ML engineer, so where should i start? what skills should i prefer to master? what tools do i need? help me out please

versed seal Sep 17, 2025, 2:11 PM

#

Hi! Is there more information about the upcoming Google Agents course in November? Links on kaggle website seem to be redirecting back to the course that was offered last March

fading iron Sep 17, 2025, 3:11 PM

#

How do you add a dataset to a Notebook in the current Editor? There is no right side bar, and no "Commit and Run" with settings to add it (from the documentation). I've looked in every menu option and dropdown and do not see an Add Data so I can link my Dataset training

#

Heads up if anyone else has this problem because a lot of online posts and ChatGPT is outdated. It's File->Add Input

wraith sparrow Sep 17, 2025, 3:40 PM

#

versed seal Hi! Is there more information about the upcoming Google Agents course in Novembe...

No we waiting

fading iron Sep 17, 2025, 4:15 PM

#

Does Kaggle not support matplot lib graphs inline like Colab? that's a very useful feature when running to see loss values during training.

wraith sparrow Sep 17, 2025, 5:32 PM

#

Idk man use satyrn if on mac

cedar cosmos Sep 17, 2025, 6:31 PM

#

Hello,
Recently I have started learning ML and for that I have learned python and currently learning Numpy, but I'm a bit confused whether I should learn pandas after Numpy or not.
So can anyone help me with it.

fading iron Sep 17, 2025, 7:23 PM

#

pandas is amazing highly recommend learning it since Dataframes will come in very handy

cedar cosmos Sep 18, 2025, 4:14 AM

#

fading iron pandas is amazing highly recommend learning it since Dataframes will come in ver...

Ok Thanks 👍

wraith sparrow Sep 18, 2025, 4:50 AM

#

cedar cosmos Ok Thanks 👍

Also polars

clever geyser Sep 18, 2025, 11:48 AM

#

Hi everyone..
So I have been working on Diabetic Retinopathy and have created a custom cnn with SE attention block. My model has reached the plateau and now its performance is not improving. How can I improve my model's performance. Currently it is 80% for training, 78% validation and 77% testing. I have to take it to 91% at least to conclude my research.

#

Any suggestions?

cursive owl Sep 18, 2025, 3:35 PM

#

clever geyser Hi everyone.. So I have been working on Diabetic Retinopathy and have created a ...

77% to 91 % seems to long leap but you can try some data augmentation techniques such as rotations, flips etc. Also it would be worth checking for class imbalance.

fading iron Sep 18, 2025, 5:04 PM

#

How can you continue training on a net if you can't write to the input? Only way I see is the download the net and optimizer each time manually then upload to the dataset via the web which is horribly inefficient.

#

Any suggestions are welcome.

stiff girder Sep 19, 2025, 2:53 AM

#

Hello,

I am currently working on a project where I aim to combine Long Short-Term Memory (LSTM) networks with Convolutional Neural Networks (CNNs) to forecast air pollution levels. Since my approach will require satellite observations, I am particularly interested in using data from the TEMPO (Tropospheric Emissions: Monitoring of Pollution) mission.

I would greatly appreciate any advice

hallow warren Sep 19, 2025, 3:18 PM

#

Hi everyone, I am new to Kaggle, data science and machine learning. Can someone help me with the Titanic Problem? Like some basic questions about understanding the task.

vagrant pendant Sep 20, 2025, 8:20 AM

#

guys did any one succeed in getting a job using kaggle, as a fresher?

dim rivet Sep 21, 2025, 9:12 PM

#

clever geyser Hi everyone.. So I have been working on Diabetic Retinopathy and have created a ...

Did my master's project on a similar problem. Sounds like it's completely underfitting if it's only getting 80% on the data it was trained on. Consider making your model more complex, increasing learning rate, using a form of early stopping if you aren't already, etc.

rugged juniper Sep 22, 2025, 2:40 PM

#

hallow warren Hi everyone, I am new to Kaggle, data science and machine learning. Can someone ...

Hi, the Titanic task is to predict Survived (0/1) from passenger features.

rugged juniper Sep 22, 2025, 2:44 PM

#

stiff girder Hello, I am currently working on a project where I aim to combine Long Short-Te...

Hi, with my experience is that use TEMPO L2/L3 products (NO₂, O₃, HCHO; filter by QA/clouds), regrid to a fixed lat–lon, and align them with ground PM2.5/PM10 and meteorology (wind, temp, RH, PBL height)

rugged juniper Sep 22, 2025, 2:46 PM

#

fading iron How can you continue training on a net if you can't write to the input? Only wa...

Hi, you can’t write to /kaggle/input
Save checkpoints to /kaggle/working during training

Keras ModelCheckpoint('/kaggle/working/ckpt.h5')

young vigil Sep 22, 2025, 5:29 PM

#

hi not sure if this is the right place but i'm trying to save my work but it's not letting me

#

it keeps giving me this error: An error occurred while committing kernel: ConcurrencyViolation Sequence number must match Draft record: KernelId=92809017, ExpectedSequence=3, ActualSequence=2, AuthorUserId=24934154

fading iron Sep 22, 2025, 7:23 PM

#

means your code was changed since the last save. It's wonky like that. Best bet copy it to another text editor. Reload the notebook, paste, save version

#

love kaggle so much but I do miss google drive in colab lol was so easy to just read and write to a persistant storage. Have to have a dataset set then use kaggle cli to push your data to it your done if you want to keep anything from /kaggle/working.

fading iron Sep 22, 2025, 7:25 PM

#

young vigil hi not sure if this is the right place but i'm trying to save my work but it's n...

hope this helped

young vigil Sep 22, 2025, 9:41 PM

#

fading iron hope this helped

thanks so much!!!

#

i just downloaded and uploaded it instead

lime trout Sep 23, 2025, 3:36 AM

#

Is there a working professional I can DM? I really need some guidance related to software development and data science.

wraith sparrow Sep 23, 2025, 9:27 AM

#

anyone havng chatgpt pro here

sleek basin Sep 23, 2025, 10:07 AM

#

[URGENT] Writeup submission cancelled after deadline - need help with technical issueHello Kaggle community,

I encountered a technical issue with my writeup submission for the BigQuery AI Hackathon and need your help.

My Situation:

I successfully submitted my writeup at 8:20 AM (40 minutes before the 9:00 AM deadline)
After the deadline (at 9:00:02 AM), I accidentally clicked the "Edit" button to make minor text corrections
The system interpreted this as a submission cancellation, changing my status to "Deadline Missed"

What I Need:

Can my original submission be restored to its pre-deadline state?
This appears to be a technical issue with the edit button functionality after deadline
I have complete evidence that the submission was made before the deadline

My Evidence:

Complete GitHub repository: https://github.com/mkmlab-hq/bigquery-ai-hackathon-submission
All project files were completed and uploaded before the deadline
Browser history shows successful submission at 8:20 AM
Local files with timestamps showing completion before deadline

My Project:

Title: "BigQuery AI Hackathon: Multimodal Health Analysis"
Complete multimodal health analysis system with BigQuery AI integration
Team: MKM Lab

Questions for the Community:

Has anyone else experienced this issue with the edit button after deadline?
Is there a way to contact Kaggle support directly for this type of technical issue?
Any advice on how to resolve this would be greatly appreciated

Technical Details:

The edit button should either be disabled after deadline or not cancel the submission
This seems like a UI/UX issue that could affect other participants

Thank you for any help or advice you can provide.

Best regards,
윤원민 (familyunion)
Kaggle ID: giryun288@gmail.com

verbal crest Sep 23, 2025, 5:10 PM

#

sleek basin [URGENT] Writeup submission cancelled after deadline - need help with technical ...

Please contact kaggle support via email, there are no support staff who can help you on discord.

tired solstice Sep 24, 2025, 6:59 AM

#

500m dataset .tar file. Takes 10secs to upload. And then 1.5hour to process and still going on . Is it normal? On the webpage it’s always saying estimately finish in 10secs. Which is hilarious. Thank you anyone who care about this

fading iron Sep 24, 2025, 7:55 AM

#

usually after uploading it takes 15 minutes to really hit. sounds like the website stalled after the upload. I'd close it out and reload the dataset page and see if it's listed and then try to load it from your notebook.

tired solstice Sep 25, 2025, 1:15 AM

#

fading iron usually after uploading it takes 15 minutes to really hit. sounds like the webs...

Thank you. I think it’s just dead or somewhat. I uploaded it as a model rather than dataset and fixed the problem

shrewd badge Sep 25, 2025, 5:25 AM

#

Hi, I just got a new PC setup with a 5060ti 16GB and started working my first nn training. However, I seem to have a problem with GPU utilization. At first, it nicely is at around 93-95%, but then gradually goes down drastically to around 43%, and training is slowing also drastically. It is not an overheating issue, as the temps start at 50C and then gradually go down to 33c. It is also not an OOM issue, as I have plenty of ram, more than 40gb still free during the entire training. CPU is also not a bottleneck. Any ideas what might be causing it? I am on arch linux, Driver Version: 580.82.09 , CUDA Version: 13.0.

patent viper Sep 25, 2025, 2:12 PM

#

hi, my kaggle notebook keeps hiding my variables. why is that?

ripe blaze Sep 26, 2025, 9:03 AM

#

Where to deploy a ai model + rag? 20gb vram needed. be nice to have on demand payment. so it doesnt burn through my bank. like I only pay when someones actually using the vram.

wraith sparrow Sep 26, 2025, 11:34 AM

#

ripe blaze Where to deploy a ai model + rag? 20gb vram needed. be nice to have on demand pa...

Try lightning ai studio

errant cape Sep 27, 2025, 4:25 AM

#

Hello, if I win a prize competition, how can I withdraw the prizes?

graceful axle Sep 27, 2025, 6:09 AM

#

I checked the research paper, they only classified 3–4 behaviors and got around 0.6 F1. Here in mabe-mouse-challenge we have about 8 labels, so if we use their method, can we get a better score in mabe-mouse-b-dectection challenge?

hushed stag Sep 27, 2025, 9:31 AM

#

Hello, I want to ask when the course starts. Ai

prime ermine Sep 27, 2025, 12:52 PM

#

can my mac m4 run llms ? or i need a nivida gpu

wraith sparrow Sep 27, 2025, 1:48 PM

#

u can buy dgx spark 4tb founders edition

gentle bridge Sep 29, 2025, 1:26 AM

#

Hello Everyone,
I have been facing a login issue since yesterday. So far I had no issues with my Kaggle account. But yesterday I cleared history in chrome and tried to login through signing in with google. However I got the message "an account already exists with an alias of this email please login using that". I'm not sure what to do here because normally I login through my google account for using Kaggle. Has anyone faced this issue before. I would greatly appreciate it if I can get any help on what I can do.

weary oxide Sep 29, 2025, 7:10 AM

#

gentle bridge Hello Everyone, I have been facing a login issue since yesterday. So far I had ...

Try logging in via incognito if that doesn't work search ur inbox and check if multiple accounts are associated with the same mail if that's not issue raise ur ticket in kaggle help that might help also try resetting password

gentle bridge Sep 29, 2025, 7:34 AM

#

weary oxide Try logging in via incognito if that doesn't work search ur inbox and check if m...

Thanks I will try that. I never had any duplicate accounts

weary oxide Sep 29, 2025, 2:49 PM

#

gentle bridge Thanks I will try that. I never had any duplicate accounts

sure

fading iron Sep 30, 2025, 4:50 PM

#

What is the best way to run a notebook? Right now I can onnly do it from the editor. But the idle timer seems broke. I can't even finish a single training session now without it timing out despite updating the output constantly via tdqm updates per epoch. My training takes 1h5m per input file. Not sure what to do, useless if I can't even finish. Plus it says hit cancel or continue editing it wont disconnect but it does. Kills the kernel and restarts automatically.

fading iron Sep 30, 2025, 7:13 PM

#

Found a workaround if it helps anyone else. Load the notebook then press F12 to get to the dev console, paste this in the console. It mimics a keypress once a minute. Voila no more timer killing off the training.

#

setInterval(() => {
document.dispatchEvent(new KeyboardEvent('keydown', {'key':'Shift'}));
}, 60000);

balmy vapor Sep 30, 2025, 7:40 PM

#

Hello! I can't seem to get rid of this problem. What do I need to do? I'm using kaggle's jupyter notebook

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
bigframes 2.8.0 requires google-cloud-bigquery-storage<3.0.0,>=2.30.0, which is not installed.
gensim 4.3.3 requires numpy<2.0,>=1.18.5, but you have numpy 2.2.6 which is incompatible.
gensim 4.3.3 requires scipy<1.14.0,>=1.7.0, but you have scipy 1.15.3 which is incompatible.
mkl-umath 0.1.1 requires numpy<1.27.0,>=1.26.4, but you have numpy 2.2.6 which is incompatible.
mkl-random 1.2.4 requires numpy<1.27.0,>=1.26.4, but you have numpy 2.2.6 which is incompatible.
mkl-fft 1.3.8 requires numpy<1.27.0,>=1.26.4, but you have numpy 2.2.6 which is incompatible.
numba 0.60.0 requires numpy<2.1,>=1.22, but you have numpy 2.2.6 which is incompatible.
datasets 3.6.0 requires fsspec[http]<=2025.3.0,>=2023.1.0, but you have fsspec 2025.5.1 which is incompatible.
ydata-profiling 4.16.1 requires numpy<2.2,>=1.16.0, but you have numpy 2.2.6 which is incompatible.
onnx 1.18.0 requires protobuf>=4.25.1, but you have protobuf 3.20.3 which is incompatible.
google-colab 1.0.0 requires google-auth==2.38.0, but you have google-auth 2.40.3 which is incompatible.

#

Some require newer versions, some require older versions, how do you even begin to fix this error?

hidden plinth Oct 1, 2025, 3:30 PM

#

hey

#

is there any way

#

i can use sklearn models like gradient boosting with gpu

#

i really need it for hyparameter tuning

#

???

toxic junco Oct 1, 2025, 7:20 PM

#

Guys any good reccomendations for python DSA

deft warren Oct 2, 2025, 1:27 AM

#

balmy vapor Hello! I can't seem to get rid of this problem. What do I need to do? I'm using ...

Hi, try this !pip install --force-reinstall numpy==1.26.4 scipy==1.13.1 "fsspec<=2025.3.0" "protobuf>=4.25.1" "google-auth==2.38.0" "google-cloud-bigquery-storage>=2.30.0,<3.0.0" --break-system-packages

graceful axle Oct 3, 2025, 3:51 AM

#

Hello everyone

#

I am trying to use an API to fetch data

#

I am using a loop to iterate all the pages present in the json.

#

I am getting either a gai error or a ConnectionReset Error

#

Please help

#

`import requests

url = "https://api.themoviedb.org/3/movie/popular?language=en-US&page=1"

headers = {
"accept": "application/json",
"Authorization": "Bearer MY_READ_ACCESS_TOKEN
}

df = pd.DataFrame()

for i in range(1,52774):
url = f"https://api.themoviedb.org/3/movie/popular?language=en-US&page={i}"
response = requests.get(url, headers=headers)
time.sleep(0.25)
df = pd.concat([df,pd.DataFrame(response.json()['results'])[['id','title','overview','release_date','popularity','vote_average','vote_count']]],ignore_index=True)

df.head()`

#

This data is of TMDB

#

Here is the error snippet

`gaierror Traceback (most recent call last)
File ~/Desktop/DS_Practice/.venv/lib/python3.12/site-packages/urllib3/connection.py:198, in HTTPConnection._new_conn(self)
197 try:
--> 198 sock = connection.create_connection(
199 (self._dns_host, self.port),
200 self.timeout,
201 source_address=self.source_address,
202 socket_options=self.socket_options,
203 )
204 except socket.gaierror as e:

File ~/Desktop/DS_Practice/.venv/lib/python3.12/site-packages/urllib3/util/connection.py:60, in create_connection(address, timeout, source_address, socket_options)
58 raise LocationParseError(f"'{host}', label empty or too long") from None
---> 60 for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
61 af, socktype, proto, canonname, sa = res

File /usr/lib/python3.12/socket.py:963, in getaddrinfo(host, port, family, type, proto, flags)
962 addrlist = []
--> 963 for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
964 af, socktype, proto, canonname, sa = res

gaierror: [Errno -2] Name or service not known

The above exception was the direct cause of the following exception:
...
--> 677 raise ConnectionError(e, request=request)
679 except ClosedPoolError as e:
680 raise ConnectionError(e, request=request)

ConnectionError: HTTPSConnectionPool(host='api.themoviedb.org', port=443): Max retries exceeded with url: /3/movie/popular?language=en-US&page=1 (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7825ea3d9430>: Failed to resolve 'api.themoviedb.org' ([Errno -2] Name or service not known)"))`

leaden crescent Oct 7, 2025, 12:59 AM

#

Level up with AI — 8 eBooks for creators & innovators.
https://payhip.com/b/V6brj

tropic warren Oct 7, 2025, 6:42 AM

#

hidden plinth i can use sklearn models like gradient boosting with gpu

I use Google colab there you can change runtime to a GPU

primal oxide Oct 7, 2025, 9:16 AM

#

Hi everyone! I'm completely new to data science, and this is my first time uploading a dataset. Could you please let me know what you think about it? This dataset will be used for an upcoming analysis

https://www.kaggle.com/datasets/meiliaa/france-business-insolvencies-19902025

old mist Oct 8, 2025, 10:28 PM

#

Hi, @everyone
Is there anyone who joins radical ai founders' masterclass?
I didn't have an opportunity to apply for that.
Please give me the meeting urls for them.

severe lance Oct 10, 2025, 1:04 PM

#

Did the GOOGLE’s Kaggle course started ? @everyone

rose harbor Oct 10, 2025, 1:52 PM

#

severe lance Did the GOOGLE’s Kaggle course started ? @everyone

The 5 days Ai Agents course is actually set to start from Nov 10, if that's the course you are asking about.

severe lance Oct 10, 2025, 1:54 PM

#

Oh thanks, I thought I was late.

stone scroll Oct 11, 2025, 8:35 AM

#

Hello everyone, I started my AI/ML journey this year. Created few projects too and also participated in a "getting started" kaggle competition named "Natural Language Processing with Disaster Tweets" in which i got 189th rank. But i want to explore and learn more so how should i proceed further any suggestions/help/guidance will be appreciated

brave vale Oct 11, 2025, 8:15 PM

#

Hey everybody

#

I have a problem and want your help,
https://www.kaggle.com/code/nooshinpourtaleby/day-4-fine-tuning-a-custom-model/edit
in this 5 day course I don't know why i get this error for this code:
response = client.models.generate_content(
model="gemini-1.5-flash-001", contents=sample_row)
print(response.text)

ClientError Traceback (most recent call last)
Cell In[37], line 1
----> 1 response = client.models.generate_content(
2 model="gemini-1.5-flash-001", contents=sample_row)
3 print(response.text)

also I tried: "gemini-1.5-flash" but it didnt work:(

graceful axle Oct 11, 2025, 9:00 PM

#

learning pytorch is not enough what else should someone learn to get master on ML and win Kaggle?

mortal pond Oct 12, 2025, 4:51 AM

#

Anyone who can help me in the ML Challenge?

upper schooner Oct 12, 2025, 5:24 AM

#

Hey I am trying to educate myself. Can someone explain to me what are

Gradient decent
loss function
learning rate

I am so confused rn. I just know they are used to optimize an algorithm but how

fast lichen Oct 12, 2025, 7:38 AM

#

https://www.kaggle.com/datasets/dasgroup/rba-dataset/data?select=rba-dataset.csv
hello I kna need a help on smth about the dataset
im making association mining rule machine learning then I found this dataset but its too huge so my idea is I want to drop some of the rows and columns anyone can help me no idea what im doin

scarlet galleon Oct 13, 2025, 10:47 AM

#

fast lichen https://www.kaggle.com/datasets/dasgroup/rba-dataset/data?select=rba-dataset.csv...

are you asking how to do this, or whether this is a good idea?

marsh drift Oct 13, 2025, 12:11 PM

#

Hello, on MABe Challenge, I get an errror on my notebook on the submissions pannel, but when I click on it and look at the logs, it shows "successfully ran". And there is no errors in the logs, it even outputs a csv. What could be the cause ? I am a beginner. Thank you !

marsh drift Oct 13, 2025, 9:52 PM

#

marsh drift Hello, on MABe Challenge, I get an errror on my notebook on the submissions pan...

https://www.kaggle.com/code/analyticaobscura/mabe-v1-mouse-action-recognition/comments#3301645

drowsy berry Oct 14, 2025, 4:23 AM

#

Hello

I was using Kaggle's GPU powered notebook for one of the competitions and work

But I was running in to out of memory errors and Kernel Crashes were occurring
This was due to the big embeddings I was generating on large data
And also if I set high hyperparameters for some models
I tried solving those kind of errors using
del
And
gc.collect()

But I wanted to know how I can avoid these errors

What are some of the best practices and optimization techniques to not run into these kind of issues

graceful axle Oct 14, 2025, 10:37 AM

#

marsh drift https://www.kaggle.com/code/analyticaobscura/mabe-v1-mouse-action-recognition/co...

have u tried running the version 2 of that notebook

marsh drift Oct 14, 2025, 11:24 AM

#

graceful axle have u tried running the version 2 of that notebook

yeah id did, but, the version 1 is the one that should be working apparently. But I thought the notebook explaination was suoposed to work too

graceful axle Oct 14, 2025, 1:32 PM

#

marsh drift yeah id did, but, the version 1 is the one that should be working apparently. Bu...

even i tried running the verison 1 its showing similar errors

sudden ledge Oct 14, 2025, 4:44 PM

#

Hi! I submitted my solution on Kaggle, it finished successfully but the scoring is stuck on "running". Has anyone experienced this?

graceful axle Oct 14, 2025, 4:54 PM

#

sudden ledge Hi! I submitted my solution on Kaggle, it finished successfully but the scoring ...

for me it took 1day

sudden ledge Oct 14, 2025, 4:55 PM

#

graceful axle for me it took 1day

ok thx 🙂

terse kiln Oct 14, 2025, 6:47 PM

#

Im beginner trying to convert my model to a pickle file in pycharm however when I do pickle .dump , it said the code ran successfully but the model.pkl was not created in the folder. I did the proper modes but yet the result was same . Any suggestions?

Edit : this problem is now solved with assistance of @teal smelt

glass mulch Oct 15, 2025, 6:37 AM

#

Hey everyone! 👋

I hope you’re all doing well. I’m currently preparing for a career in FinTech, and I’m looking to connect with professionals or learners who are already working or have experience in this domain.

I’d really appreciate some guidance on how to make myself a strong candidate for FinTech roles — including:

The essential skill sets (technical + financial)

Recommended certifications or learning paths

Important tools or technologies used in the industry

And a few project ideas that can help build a solid FinTech-focused portfolio

If anyone here is from a FinTech company or has relevant experience, I’d be very grateful for your advice or even a quick chat. 🙏

You can also connect with me on LinkedIn: www.linkedin.com/in/abhay-singh-1694b221b

Thank you so much in advance for your time and support!

— Abhay

teal smelt Oct 15, 2025, 10:45 AM

#

terse kiln Im beginner trying to convert my model to a pickle file in pycharm however when ...

Hoi, share the code please.
Something might be wrong with the way you're saving your file or specifying the correct file path.

Need the code for the full hunting :)
(By code i mean the specific pickel.dump() and function where you're opening/closing your file and specifying the file path. f.open() f.close().)

teal smelt Oct 15, 2025, 10:48 AM

#

drowsy berry Hello I was using Kaggle's GPU powered notebook for one of the competitions and...

Still need assistance??

terse kiln Oct 15, 2025, 11:10 AM

#

teal smelt Hoi, share the code please. Something might be wrong with the way you're saving...

Hello thanks for replying back , can I give u the GitHub link of that project in dm?

teal smelt Oct 15, 2025, 2:38 PM

#

terse kiln Hello thanks for replying back , can I give u the GitHub link of that project in...

Yup, why not :)
(Problem fixed)

charred sluice Oct 15, 2025, 8:03 PM

#

hi

drowsy berry Oct 16, 2025, 2:29 AM

#

teal smelt Still need assistance??

Yes
I'll share the code if required
But I am talking about a general case
Since this has occurred to me many different times

teal smelt Oct 16, 2025, 6:33 AM

#

drowsy berry Yes I'll share the code if required But I am talking about a general case Sin...

Idk, as far as I know something might be wrong with your code... If it's general I don't know what's causing it maybe dm me the code let's see if we can find the solution...

muted spindle Oct 16, 2025, 9:07 AM

#

Hello, Do you know How much time it takes for a competition to publish private leaderboard after the competition close ?
https://www.kaggle.com/competitions/grand-xray-slam-division-b/leaderboard

teal smelt Oct 16, 2025, 9:09 AM

#

muted spindle Hello, Do you know How much time it takes for a competition to publish private l...

A day minimum

muted spindle Oct 16, 2025, 9:09 AM

#

teal smelt A day minimum

Thx!

sudden notch Oct 16, 2025, 11:23 AM

#

The model I've created has performed well and in a similar manner on test and validation sets. But it performs ~2.5x poorly on competition data. What's the deal?

trail umbra Oct 16, 2025, 11:28 AM

#

..

spring nebula Oct 16, 2025, 11:48 AM

#

Hii

bitter mason Oct 16, 2025, 1:52 PM

#

Hlw

umbral scaffold Oct 16, 2025, 6:33 PM

#

Hello guys! I'm working on creating a forex trading bot that helps me out in understanding signals and placing trades per time depending on the patterns observed. Please suggest great YouTube channels and books that can serve this purpose. @teal smelt

teal smelt Oct 16, 2025, 8:00 PM

#

umbral scaffold Hello guys! I'm working on creating a forex trading bot that helps me out in und...

Not my expertise 😅,
Gimme an hour or two for some research but yeah i might not be able to help you fully, but I'll try 🐣

teal smelt Oct 16, 2025, 8:45 PM

#

umbral scaffold Hello guys! I'm working on creating a forex trading bot that helps me out in und...

Okay so shit dump, here goes nothing.

So you would first need expertise in whatever language you're planning to use (I highly recommend Python, or Rust. Due to their fast and easy to code environment. And plus these languages are highly appreciated for this specific field of data analysis and ai ml so yeah.. )
Then first before starting anything go read "Ernest P. Chan" people really praise that guy over twitter(X) and reddit for his books titled "Quantitative Training", "Algorithmic Training".
(Check these before starting : https://www.reddit.com/r/algotrading/comments/gily37/ernest_p_chan_books_quantitative_trading/
https://www.reddit.com/r/algotrading/comments/15gz5dn/do_ernest_chans_mean_reversion_strategies_for/ )
NOTE THAT THESE BOOKS I SUGGEST TO LAY DOWN YOUR FOUNDATION WITH ALGOS AND TRADING, YOU'LL HAVE TO PRACTICE A LOT BEFORE STARTING FULL
For more algorithms check out "16 proven algorithm forex trading", "forex trading: theory and practice trady"

Now, we have quite the bookish knowledge (not enough, you still gotta explore articles. I suggest joining discord or reddits for algorithms), anyways:

Now you would need APIs, and pre build ais to understand how they work, for these i suggest you to pull up to huggingface.co and kaggle and start searching for keywords relating to "trading ai", "trading algo", "trading blah blah"; research the APIs and decide which one suits you and study the code or just use it directly.

Analyse pre available data from kaggle.

And there's this guy called "Moon Dev" on YouTube, search for him has a playlist, and couldn't find other better channels :(
And search for more on youtube and reddit and google :(

Amd yeah ask chatgpt anytime you want if you need more resources i could only find these, you gotta do a lot of research tho as this is a not-so-famous topic :(

umbral scaffold Oct 16, 2025, 8:49 PM

#

teal smelt Okay so shit dump, here goes nothing. So you would first need expertise in what...

This is comprehensive enough. Thank you so much.

teal smelt Oct 16, 2025, 8:50 PM

#

Hoi, please don't rely on my stuff fully, research and research like a focused horse...

umbral scaffold Oct 16, 2025, 8:53 PM

#

teal smelt Hoi, please don't rely on my stuff fully, research and research like a focused h...

Well said, my chief

teal smelt Oct 16, 2025, 8:59 PM

#

umbral scaffold Well said, my chief

I'm not your chief bro please 😭

#

You can use "friend" or "bro" or "🐣"

viral dagger Oct 16, 2025, 9:02 PM

#

anyone completed the Mathematics for Machine Learning specialization by Deeplearning.AI?

#

because im stuck in a concept i can't seem to understand in the linear algebra course

teal smelt Oct 16, 2025, 9:24 PM

#

viral dagger because im stuck in a concept i can't seem to understand in the linear algebra c...

Bro just share the concept name or whatever problem you're facing in a paragraph, someone will read it and help you right away :) (it's not good to ask for help like this, just elaborate the issue)

#

I'll try if i relate with the topic hehe T_T

viral dagger Oct 17, 2025, 5:52 AM

#

teal smelt Bro just share the concept name or whatever problem you're facing in a paragraph...

O ty I had an issue mainly with the row echelon implementation in python in assignment 2 like the backtracking section mainly but i think I semi get it but not that much lol

Like I solved it but I got a bit confused on how some values were defined etc

But in summary it's basically the row echelon form
I understand it concept wise but I'm having trouble implementing it in python

#

Like converting the math steps directly into python

teal smelt Oct 17, 2025, 9:00 AM

#

viral dagger Like converting the math steps directly into python

NOTE: I have assumed you're not solving "variables" thus no backtracking implement led, if you don't understand what I'm talking about.. just ignore this shitty NOTE.

Okay so like first let's see what the concept means mathematically i guess,
So first let's say ummm imagine a matrix?

a11, a12......, a1n
a21, a22....., a2n
...........................
am1, am2......, amn 
``` so this row echelon form thing basically you would need 4-5 steps and just keep looping them as per your needs.
Pivot: a non zero value, we'll eliminate stuff below it.

### 1: we make the code find a "pivot" in each of the available columns.
### 2: Transpose the matrix or swapping rows if needed so pivot doesn't come out to be zero.
### 3: Normalize the pivot row, to ofcourse make pivot = 1.
### 4: And we'll kill/eliminate all entries below that pivot eventually making them to be zero.
### 5: Run same algorithm but now to down-right element. And so on.
(Basically we're running this for diagonals okay, a11, a22.... So on... And diagonals get to be 1 and adjust other values to keep the over-all result same)



Like imagine a matrix
```py

A = np.array([
    [2, 1, -1, 8],
    [-3, -1, 2, -11],
    [-2, 1, 2, -3]
])

It's echelon is :


[[ 1.   0.5 -0.5  4. ] #here first a11 was made to be 1, i.e. [2÷2=1] and then for all other values also divide with 2.
 [ 0.   1.  -1.  1. ] # note here we first had to transform the matrix with (R2 = R2 - 3R1), for the sake of making value below pivot 0(a21) and adjust others.
 [ 0.   0.   1.  -2. ]] # guess transformation to get this.

errant valve Oct 17, 2025, 9:01 AM

#

hi im a maths masters student at cambridge specialising in stats. My career aspiration is to become a quant researcher at a top quant firm. One of the things that looks super interesting and was recommended to me was to do kaggle since it is from what ive heard full of data to analyse and make predictions from. My stats knowledge is good but i am down to learn more relevant stuff to do with machine learning or whatever experts here think is relevant. I know python syntax well but dont know data libraries like numpy, pandas, matplotlib at all. Can anyone give any advice on how I could go from this position to high level in kaggle? Like can anyone recommend me any courses, books, approaches to improve as quickly as possible? ty

crystal brook Oct 17, 2025, 9:01 AM

#

hello i am new here

teal smelt Oct 17, 2025, 9:02 AM

#

teal smelt **NOTE: I have assumed you're not solving "variables" thus no backtracking imple...

Python Implementation:

Now let's move to the Python Code best paart hehehehehe

import numpy as np

def rowEchelon(X):
    X = X.astype(float) # we won't be able to divide properly without floating in the air 😴 😴
    rows, cols = X.shape
    pivotRow = 0 # the row where our python snake is.

    for col in range(cols):
        if pivotRow >= rows: #last row done right?
            break #stop :)
        
         # Now since we're working on matrix, this is the start of everything.
        pivot = None
        for r in range(pivotRow, rows):
           if X[r, col] != 0: #finds if the element in current row is our pivot, or in the row below it.
               pivot = r
               break
 
           if pivot is None:
               return # basically no pivots were found right?

        # Pivot finding done, lets move to  swapping/transpose. (Note we always swap rows with our pivot row)
          if pivot != pivotRow # make sure you're not swapping the exact same row with itself(the pivotRow wth itself)
             X[[pivotRow, pivot]] = X[[pivot, pivotRow]]

        # Normalize current row with our pivot row
            pivotValue = X[pivotRow, col]
            X[pivotRow] = X[pivotRow] / pivotVal
        
        # Now our concept tells us to eliminate the below stuff right?
            for r in range(pivotRow + 1, rows):
                factor = X[r, col]
                X[r] = X[r] - factor * X[pivotRow] #transformation of rows basically 

            pivotRow += 1
    return X

# we can use our example matrix 
X = np.array([
    [2, 1, -1, 8],
    [-3, -1, 2, -11],
    [-2, 1, 2, -3]
])

martix = rowEchelon(X)
print(matrix)

We'll get the same output as array we had used for example above

[[ 1.   0.5 -0.5  4. ]
 [ 0.   1.  -1.  1. ]
 [ 0.   0.   1.  -2. ]]

teal smelt Oct 17, 2025, 9:04 AM

#

viral dagger O ty I had an issue mainly with the row echelon implementation in python in assi...

See the explanation and code I've sent, I've kept the steps/order of theory and code same so that you don't get lost while understanding, and also please ping me if you face doubts about any variables or the function, I've tried my best to explain throw comments. And also i wrote the code on discord chat so yeah the indentation might be wrong at some places just fix that... That's it

teal smelt Oct 17, 2025, 9:12 AM

#

errant valve hi im a maths masters student at cambridge specialising in stats. My career aspi...

Idk about books, checkout the courses form edureka and freecodecamp om YouTube about "machine learning, ai/ml".. they'll teach basic about how to analyse data like titanic one if i remember correctly and they also teach basic algorithms you would need..

Then you can move further and watch kaggle's own tutorials, just make sure you've completed the above step and understood the "requirements(python, aiml basics, numpy, pandas, matplotlib, kaggle notebook)" first.

Also it'll take quite a while to be comfortable with this technologies if you've just started this so please don't quit just keep learning...

And finally read articles, discuss your doubts, and solve other people's doubts and start kaggle 🐣

(And idk if this should be enough for high level on kaggle, you'll need to spend time and brain on kaggle competitions for it and I'm not much familiar with competitions; so no shitty advice about that 😅)

twin plover Oct 17, 2025, 9:20 AM

#

I have a question, I just have very basic knowledge of AI but working with Python and it's libraries as a data analyst from few months, did it's possible to compete in competition of CAFA 6 Protein Function Prediction it's last date January 26, 2026, i have around 2 months time, did it's possible? kindly guide me

twin plover Oct 17, 2025, 9:23 AM

#

teal smelt Python Implementation: # Now let's move to the Python Code best paart heheheheh...

I have a question, why we use this program of code when we write it easily with just an array

teal smelt Oct 17, 2025, 9:28 AM

#

twin plover I have a question, why we use this program of code when we write it easily with ...

Sorry, i don't get what you're trying to say brother??

twin plover Oct 17, 2025, 9:31 AM

#

teal smelt Sorry, i don't get what you're trying to say brother??

A = np.array([
[2, 1, -1, 8],
[-3, -1, 2, -11],
[-2, 1, 2, -3]
])
i mean when we write it as in upper context then why write the matrix in long as in following code
import numpy as np

def rowEchelon(X):
X = X.astype(float) # we won't be able to divide properly without floating in the air 😴 😴
rows, cols = X.shape
pivotRow = 0 # the row where our python snake is.

for col in range(cols):
    if pivotRow >= rows: #last row done right?
        break #stop 🙂
    
     # Now since we're working on matrix, this is the start of everything.
    pivot = None
    for r in range(pivotRow, rows):
       if X[r, col] != 0: #finds if the element in current row is our pivot, or in the row below it.
           pivot = r
           break

       if pivot is None:
           return # basically no pivots were found right?

    # Pivot finding done, lets move to  swapping/transpose. (Note we always swap rows with our pivot row)
      if pivot != pivotRow # make sure you're not swapping the exact same row with itself(the pivotRow wth itself)
         X[[pivotRow, pivot]] = X[[pivot, pivotRow]]

    # Normalize current row with our pivot row
        pivotValue = X[pivotRow, col]
        X[pivotRow] = X[pivotRow] / pivotVal
    
    # Now our concept tells us to eliminate the below stuff right?
        for r in range(pivotRow + 1, rows):
            factor = X[r, col]
            X[r] = X[r] - factor * X[pivotRow] #transformation of rows basically 

        pivotRow += 1
return X

we can use our example matrix

X = np.array([
[2, 1, -1, 8],
[-3, -1, 2, -11],
[-2, 1, 2, -3]
])

martix = rowEchelon(X)
print(matrix)

#

my english is not so good i hope you understand

teal smelt Oct 17, 2025, 9:32 AM

#

twin plover my english is not so good i hope you understand

It's fine, i understand you

teal smelt Oct 17, 2025, 9:32 AM

#

twin plover A = np.array([ [2, 1, -1, 8], [-3, -1, 2, -11], [-2, 1, 2, -3] ]) i ...

Okay i get it.

#

Look at the matrix we had, its values are quite big and distinct right?

A = np.array([
    [2, 1, -1, 8],
    [-3, -1, 2, -11],
    [-2, 1, 2, -3]
])

We did all that code below this (the whole rowEchelon function, to convert our above A matrix to a simpler rowEchelon form):

[[ 1.   0.5 -0.5  4. ]
 [ 0.   1.  -1.  1. ]
 [ 0.   0.   1.  -2. ]]

Q. Why do we need a row echelon form ? 🤔

Bro, i just gave a very very simplified example; but in the reality row echelon is used to solve big linear equations

Here's an example:

Say we have linear equations:

2x + y - z = 8
-3x - y + 2z = -11
-2x + y + 2z = -3

We can represent this set of equations as a matrix:

[[2, 1 , -1, 8 
-3, -1, 2, -11
-2, 1, 2, -3]]

And we will give this ^^ array to our code as A or X(in python code) and get this array as row echelon:

1, 0.5, -0.5, 4
0, 1, 1, 2 
0, 0, 1, -1

Notice how simplified our matrix is? Imagine how it would look in the form of equations now!!

Here's how it would look:

x + 0.5y - 0.5z = 4
y + z = 2
z = -1

Notice how simplified and easy these equations are to handle now, it is so easy to work with this now instead of what we had in the starting :)

teal smelt Oct 17, 2025, 9:43 AM

#

teal smelt Look at the matrix we had, its values are quite big and distinct right? ``` A = ...

@twin plover i tried my best to explain please see this...

twin plover Oct 17, 2025, 9:44 AM

#

teal smelt <@791555915630903337> i tried my best to explain please see this...

thanks

errant valve Oct 17, 2025, 9:50 AM

#

teal smelt Idk about books, checkout the courses form edureka and freecodecamp om YouTube a...

okay tysm

#

are the kaggle tutorials acc that good or am i looking at the wrong spot? coz from what i remember seeing they are quite short and not that in depth

south gate Oct 17, 2025, 9:51 AM

#

Hey guys

teal smelt Oct 17, 2025, 9:52 AM

#

errant valve are the kaggle tutorials acc that good or am i looking at the wrong spot? coz fr...

They're great but Not beginners friendly ig, for me at least.
I had to go through freecodecamp and edureka on YouTube first.

They just try to lay an outline of the topic and you are the explorer they expect to have some prior knowledge..

sour ingot Oct 17, 2025, 9:55 AM

#

are jobs in ML still in demand?

teal smelt Oct 17, 2025, 9:55 AM

#

sour ingot are jobs in ML still in demand?

Ofcourse, always be

sour ingot Oct 17, 2025, 9:56 AM

#

when i search for internships or jobs in ML there very few results compared to full stack development

teal smelt Oct 17, 2025, 9:57 AM

#

People are taking the field because it's a trend, i don't think so all of them are interested or invested mentally in the field.. at least some other people I know just took it for a trend and are struggling now..
Your actual competition is less than you imagine but yeah you gotta stand out :)

sour ingot Oct 17, 2025, 9:58 AM

#

i started learning ML i did regression,classification i understood them but when i see job listing i get demotivated

#

plus, should i rush towards deep learning?

teal smelt Oct 17, 2025, 9:59 AM

#

sour ingot when i search for internships or jobs in ML there very few results compared to f...

Nope, ML has many fields right? Like building AI Agents, data scientists, data analysis, chatbot, AI and more..

It's just so diverse that you look only at a small part of it, i believe there are more jobs just under different tags

#

And ML pays more than full stack, so it'll be a bit rare as it must be hard hence the big paycheck

sour ingot Oct 17, 2025, 10:00 AM

#

how much did you complete?

teal smelt Oct 17, 2025, 10:01 AM

#

sour ingot i started learning ML i did regression,classification i understood them but when...

Job listings shouldn't be your motivation brother... Your interest in the field is what you need..

teal smelt Oct 17, 2025, 10:01 AM

#

sour ingot how much did you complete?

I'm somewhat a beginner only as of now for AIML :(, I'm mainly into full stack -

sour ingot Oct 17, 2025, 10:02 AM

#

teal smelt I'm somewhat a beginner only as of now for AIML :(, I'm mainly into full stack -

ohk

#

your 16 and already way ahead of other people damn

#

when i was 16 i was playing pubg

teal smelt Oct 17, 2025, 10:03 AM

#

sour ingot when i was 16 i was playing pubg

My phone is too trash to run it

wraith sparrow Oct 17, 2025, 2:39 PM

#

sour ingot when i was 16 i was playing pubg

when i was 16 i was playing sololearn

#❓┊ask-a-question

Steps to Present on GitHub

Why Not a Zip File?

Tips

hello everyone. I am beginner in kaggle. below problem is exercise of lesson2 from "Intro to Machine Learning". why this error occured and how can I fix?

NameError: name 'step_1' is not defined

I installed the notebook by running the code that appears first at the start of the exercise. installing code is below

Set up code checking

do kaggle have any official mcp server

Now let's move to the Python Code best paart hehehehehe

we can use our example matrix

Q. Why do we need a row echelon form ? 🤔

hello everyone.
I am beginner in kaggle.
below problem is exercise of lesson2 from "Intro to Machine Learning".
why this error occured and how can I fix?

I installed the notebook by running the code that appears first at the start of the exercise.
installing code is below