#reinforcement-learning | Kaggle | Page 1

graceful widget Sep 2, 2023, 6:21 PM

#

Hello RL folks.
Seems like the space is not so active.

#

Anybody learning RL with Me

silent acorn Sep 2, 2023, 6:22 PM

#

graceful widget Hello RL folks. Seems like the space is not so active.

Hi I am new, can you learn with me?

graceful widget Sep 2, 2023, 6:24 PM

#

Sure .
I'd be glad to

graceful widget Sep 2, 2023, 6:25 PM

#

silent acorn Hi I am new, can you learn with me?

Are you reading any book for RL or taking courses

silent acorn Sep 2, 2023, 6:26 PM

#

graceful widget Are you reading any book for RL or taking courses

I learn from the course.

#

And you?

graceful widget Sep 2, 2023, 6:28 PM

#

Reading a book for it.
I feel like the courses do not cover enough theoretical ground

silent acorn Sep 3, 2023, 2:55 AM

#

graceful widget Reading a book for it. I feel like the courses do not cover enough theoretical g...

Yes they teach me only Q-learning.
Can you say book name?

icy tree Sep 4, 2023, 4:28 AM

#

silent acorn Yes they teach me only Q-learning. Can you say book name?

Sutton and Barto, Introduction to Reinforcement Learning is a classic text, and it's also very good. Both for mathematical and theoretical treatment of the concepts.

silent acorn Sep 4, 2023, 4:34 AM

#

icy tree Sutton and Barto, Introduction to Reinforcement Learning is a classic text, and ...

Can you say where I get those books?

icy tree Sep 4, 2023, 4:36 AM

#

silent acorn Can you say where I get those books?

https://inst.eecs.berkeley.edu/~cs188/sp20/assets/files/SuttonBartoIPRLBook2ndEd.pdf
from UC Berkeley

silent acorn Sep 4, 2023, 4:38 AM

#

Okay thanks

icy tree Sep 4, 2023, 4:48 AM

#

No problem!

graceful widget Sep 4, 2023, 8:29 AM

#

icy tree Sutton and Barto, Introduction to Reinforcement Learning is a classic text, and ...

Oh wow that's the same book I'm reading. It's a great text

icy tree Sep 4, 2023, 1:26 PM

#

Though if you want a video course, David Silver's famous lectures are pretty good, nice balance of math

silent acorn Sep 4, 2023, 3:11 PM

#

icy tree Though if you want a video course, David Silver's famous lectures are pretty goo...

Can you give me link?

icy tree Sep 4, 2023, 3:47 PM

#

silent acorn Can you give me link?

Sure! This is the playlist by Google Deepmind YT channel:
https://youtube.com/playlist?list=PLqYmG7hTraZDM-OYHWgPebj2MfCFzFObQ&feature=shared

YouTube

DeepMind x UCL | Introduction to Reinforcement Learning 2015

Watch the lectures from DeepMind research lead David Silver's course on reinforcement learning, taught at University College London. Access slides, assignmen...

sweet garden Sep 6, 2023, 10:06 AM

#

Hi,,iam Sharath ,,iam a beginner here in this field of data science, AI and machine learning,,I want to learn RL,,can anybody have time for group studies or guiding me??

wise magnet Sep 6, 2023, 5:23 PM

#

👋

graceful widget Sep 8, 2023, 7:46 PM

#

sweet garden Hi,,iam Sharath ,,iam a beginner here in this field of data science, AI and mach...

Yes. I do have some time for group study

#

@wise magnet nice to have you

sweet garden Sep 9, 2023, 11:48 AM

#

Thanks 👍 looking forward to learn RL

brazen mist Sep 12, 2023, 10:36 AM

#

Yes we can discuss here about deep reinforcement learning

#

As i finished a few topics and feeling confident but have some doubts as well

#

Any working professionals here ?

graceful widget Sep 12, 2023, 7:38 PM

#

brazen mist Any working professionals here ?

I'm not a working professional per se.
But I've got a good experience

brazen mist Sep 13, 2023, 4:17 AM

#

graceful widget I'm not a working professional per se. But I've got a good experience

Nice 🙂

icy tree Sep 14, 2023, 7:20 AM

#

What doubts do you have?

brazen mist Oct 21, 2023, 9:07 AM

#

icy tree What doubts do you have?

There are a bunch of algorithms but don't know where to start to implement it

#

Also i want to implement those algo in games such as gta 5

icy tree Oct 22, 2023, 6:27 PM

#

brazen mist There are a bunch of algorithms but don't know where to start to implement it

https://github.com/udacity/deep-reinforcement-learning
This might be useful
Good luck!

GitHub

GitHub - udacity/deep-reinforcement-learning: Repo for the Deep Rei...

Repo for the Deep Reinforcement Learning Nanodegree program - GitHub - udacity/deep-reinforcement-learning: Repo for the Deep Reinforcement Learning Nanodegree program

feral sequoia Dec 23, 2023, 10:18 PM

#

Hi,
Did anyone work on multi agent algorithm in python ?

fallow vigil Feb 19, 2024, 3:48 PM

#

do i ask rl related doubts here ?

wet girder Jun 2, 2024, 12:02 AM

#

How can I balance the actor and critic networks?

verbal latch Jun 17, 2024, 6:31 AM

#

brazen mist Also i want to implement those algo in games such as gta 5

Bhai pehle basics sikhle, uske bad gta and all ka bol

brazen mist Jun 17, 2024, 6:41 AM

#

verbal latch Bhai pehle basics sikhle, uske bad gta and all ka bol

Bhai kay bolt reinforcement learning cha king hai me

verbal latch Jun 17, 2024, 6:55 PM

#

I assume it's sarcastically said

brazen mist Jun 19, 2024, 4:40 AM

#

verbal latch I assume it's sarcastically said

No, I'm working professional

#

I have a speciality in model based deep reinforcement learning

verbal latch Jun 19, 2024, 9:29 AM

#

brazen mist I have a speciality in model based deep reinforcement learning

Even Sutton and Barto or Lattimore won't claim the statement that you did

brazen mist Jun 19, 2024, 9:57 AM

#

verbal latch Even Sutton and Barto or Lattimore won't claim the statement that you did

Yah i didn't say I'm superior, but I know how to solve problems using them , but yah still learning

verbal latch Jun 19, 2024, 10:11 AM

#

Man it's not just about blindly using the algo, I see that you talk about Deep RL and stuff, and aim to do something crazy which is absolutely great, however the only concern is the basics.

Before even starting with Deep RL, there's a hell lot of stuffs involved in the learning and the planning setting, whether it be standard Bandits in pure exploratory/regret setting or MDP with various versions of it distributed across different types.

marble sigil Jul 12, 2024, 1:08 PM

#

Hey there! I am a complete beginner to RL, I completed a beginner course on datacamp and solved the Frozen-Lake environment using what I learnt from that course, now I am trying to delve into Deep Q Learning and would like to know what resources I could potentially use, thanks!

fiery jolt Jul 22, 2024, 10:56 AM

#

Why RL is so hard to understand and learn? Am I the only one having this issue?

fiery jolt Jul 22, 2024, 12:36 PM

#

If so, can anyone tell me how to start with a simple explanation video or book or any other suggestions

solar geyser Jul 28, 2024, 2:08 AM

#

fiery jolt If so, can anyone tell me how to start with a simple explanation video or book o...

https://www.youtube.com/watch?v=nyjbcRQ-uQ8&list=PLZbbT5o_s2xoWNVdDudn51XM8lOuZ_Njv

YouTube

deeplizard

Reinforcement Learning Series Intro - Syllabus Overview

💡Enroll to gain access to the full course:
https://deeplizard.com/course/rlcpailzrd

Welcome to this series on reinforcement learning! We'll first start out by introducing the absolute basics to build a solid ground for us to run.

We'll then progress onto more advanced and sophisticated topics that integrate artificial neural networks and deep ...

▶ Play video

#

This is one of the best series I have seen on RL, she explains it really nicely

fiery jolt Jul 28, 2024, 8:36 AM

#

solar geyser This is one of the best series I have seen on RL, she explains it really nicely

Thank you

solar geyser Jul 28, 2024, 8:42 AM

#

fiery jolt Thank you

Yeah no problem man, hope this solves your problem

fiery jolt Jul 28, 2024, 8:47 AM

#

solar geyser Yeah no problem man, hope this solves your problem

Thanks for not sending david silver content or sutton book as I see the RL complicated even after seeing david silver lecture 2 and even with the book lol

#

But that is a video that I have never seen before so I know it is going to solve my problems for sure

solar geyser Jul 28, 2024, 8:48 AM

#

fiery jolt Thanks for not sending david silver content or sutton book as I see the RL compl...

Yeah sending an entire book seems like overkill especially if you just want a foundation

solar geyser Jul 28, 2024, 8:49 AM

#

fiery jolt But that is a video that I have never seen before so I know it is going to solve...

😂

fiery jolt Jul 28, 2024, 8:53 AM

#

solar geyser Yeah sending an entire book seems like overkill especially if you just want a fo...

Yeah, the book will overwhelm you with a ton of theories and equations that you don’t need but will just make you suffer and depressed on the RL

solar geyser Jul 28, 2024, 9:03 AM

#

Real

fiery jolt Aug 10, 2024, 11:34 PM

#

solar geyser Real

Thanks man for the amazing playlist you send above I finished 7 of the 15 videos on the playlist I feel like I know more than I could possibly imagine in reinforcement learning

#

And I will definitely watch the rest to make projects in RL and Deep RL

solar geyser Aug 10, 2024, 11:53 PM

#

fiery jolt Thanks man for the amazing playlist you send above I finished 7 of the 15 videos...

That’s great to hear bro, glad you learned a lot

#

😁

fiery jolt Aug 11, 2024, 11:31 PM

#

solar geyser That’s great to hear bro, glad you learned a lot

It’s all thanks to you bro❤️

solar geyser Aug 11, 2024, 11:40 PM

#

fiery jolt It’s all thanks to you bro❤️

Lmk if you need anything else man

fiery jolt Aug 12, 2024, 12:07 AM

#

solar geyser Lmk if you need anything else man

Let’s do some RL projects sometime and learn from each other

verbal latch Aug 27, 2024, 7:50 PM

#

Those interested in bandit literature can check our recent work out

http://arxiv.org/abs/2408.14195

TLDR: We analyse a clustered multi-armed bandit formulation, where the learning objective is to identify representative arms from each cluster, in a fixed confidence setting

@here

arXiv.org

Representative Arm Identification: A fixed confidence approach to i...

We study the representative arm identification (RAI) problem in the multi-armed bandits (MAB) framework, wherein we have a collection of arms, each associated with an unknown reward distribution. An underlying instance is defined by a partitioning of the arms into clusters of predefined sizes, such that for any $j > i$, all arms in cluster $i$ h...

silent siren Sep 1, 2024, 1:00 AM

#

fiery jolt Thanks man for the amazing playlist you send above I finished 7 of the 15 videos...

Huh can i see it too pls ?

fiery jolt Sep 1, 2024, 4:14 AM

#

silent siren Huh can i see it too pls ?

https://www.youtube.com/watch?v=nyjbcRQ-uQ8&list=PLZbbT5o_s2xoWNVdDudn51XM8lOuZ_Njv

YouTube

deeplizard

Reinforcement Learning Series Intro - Syllabus Overview

💡Enroll to gain access to the full course:
https://deeplizard.com/course/rlcpailzrd

Welcome to this series on reinforcement learning! We'll first start out by introducing the absolute basics to build a solid ground for us to run.

We'll then progress onto more advanced and sophisticated topics that integrate artificial neural networks and deep ...

▶ Play video

#

Here is the playlist

silent siren Sep 4, 2024, 5:31 AM

#

https://youtu.be/E0j680JoBko

YouTube

Rauf

Pandas : Renaming and Combining

so this be last video of pandas series , In this video i be showing how to renaming the columns and combining two df with pandas . If you'd like to see the resources or code, check the repository link below:

Repository: https://github.com/Raufjatoi/Pandas/tree/main/vid6

Inspired by the Kaggle Pandas course: https://www.kaggle.com/learn/pandas...

▶ Play video

teal folio Sep 5, 2024, 6:42 PM

#

fiery jolt Why RL is so hard to understand and learn? Am I the only one having this issue...

As compared to conventional ML and DL it is because it requires software development sort of coding techniques to be used. While ML and DL has simple libraries that you can just simply import and work with

silent siren Sep 9, 2024, 5:00 AM

#

https://youtu.be/7NWRnWZghGA

YouTube

Rauf

Data Visualization : Bar Chart and Heat Map

In this video, I will discuss bar charts and heat maps, explaining how they work and the trends they reveal in data, along with other related topics. If you're interested in seeing some code or resources, I have provided links below.

repo : https://github.com/Raufjatoi/Data-Visualization/tree/main/vid2

inspired by : https://www.kaggle.com/lear...

▶ Play video

silent siren Sep 11, 2024, 5:04 AM

#

https://youtu.be/pPeiwUAVl08?si=eidOoW7JFa_NknaE

YouTube

Rauf

Data Visualization : Scatter Plots

In this video, I will tell you about the scatter plots , explaining how it work and the trends it reveal in data, along with other related topics. If you're interested in seeing some code or resources, I have provided links below.

repo : https://github.com/Raufjatoi/Data-Visualization/tree/main/vid3

inspired by : https://www.kaggle.com/learn/d...

▶ Play video

loud harborBOT Sep 17, 2024, 3:56 PM

#

ruhaan10 has been warned

Reason: Bad word usage

dry widget Sep 17, 2024, 3:57 PM

#

Does anyone have best reinforcement learning playlist with maths

tired cove Oct 5, 2024, 12:36 AM

#

@dry widget Check out UCLxDeepmind RL series on youtube. You can also find a specialization from Uni of alberta on coursera. You can audit for free

dry widget Oct 5, 2024, 5:16 AM

#

tired cove <@787582133119483915> Check out UCLxDeepmind RL series on youtube. You can als...

Thanks 🌹

native kernel Nov 7, 2024, 1:51 PM

#

i dont know too much about reinforcement learning, but is it possible to make a program that takes cards from a trading card game (like pokemon or MTG) and figure out what a very good deck is by having it battle other decks?

#

i ask if it's possible because i dont know if it would take too long to run and if it would need like millions of battles to figure it out

normal siren Nov 8, 2024, 4:06 AM

#

native kernel i dont know too much about reinforcement learning, but is it possible to make a ...

Yes, you can have a look at https://rlcard.org/ as an example

normal siren Nov 8, 2024, 4:15 AM

#

native kernel i ask if it's possible because i dont know if it would take too long to run and ...

Depending of the game, here they report it could take ~500k steps to train the agent: https://doi.org/10.48550/arXiv.1910.04376

arXiv.org

RLCard: A Toolkit for Reinforcement Learning in Card Games

RLCard is an open-source toolkit for reinforcement learning research in card games. It supports various card environments with easy-to-use interfaces, including Blackjack, Leduc Hold'em, Texas Hold'em, UNO, Dou Dizhu and Mahjong. The goal of RLCard is to bridge reinforcement learning and imperfect information games, and push forward the research...

native kernel Nov 8, 2024, 1:51 PM

#

@normal siren oh wow thank you!! I'll check that out

tender seal Nov 11, 2024, 10:54 AM

#

HI, I am Abdullah I am an ML engineer want to join any team to particapte in kaggle competions

spice wren Nov 16, 2024, 10:42 AM

#

tender seal HI, I am Abdullah I am an ML engineer want to join any team to particapte in k...

i need help to take this test. can we google meet please to help me out?

loud palm Nov 17, 2024, 6:03 PM

#

Hi, Please help me.
I'm going to make a search engine based on customer behaviors.
Inputs: query embedding, history embedding (Metadata is stored with vector formats)
We use cosine similarity and train embedding models using multi armed bandits.(Is that possible?)
I have two questions about this.
First, how to get the gradient of embedding when use consime similarity?(Can that be estimated in torch?)
Second, for the search, we use two steps, updating the weights about historical embedding and query embedding at the same time, I think that can be noisy.
But I can't make sure. I attahced diagram. And if any questions, feel free to ask.
https://drive.google.com/file/d/1_vWxdasnHjCL6_momviQzcDYHAAgc-1M/view?usp=sharing

Google Docs

Capture.PNG

verbal otter Nov 19, 2024, 11:58 PM

#

I believe this channel is about Discussing Reinforcement learning kindly refrain to put any other content or stuff which is unrelated

merry pollen Dec 4, 2024, 3:06 PM

#

Hello everyone, kindly suggest me best courses on reinforcement learning and share reviews about reinforcement learning specialization on Coursera

normal siren Dec 11, 2024, 4:03 PM

#

merry pollen Hello everyone, kindly suggest me best courses on reinforcement learning and sha...

https://bsky.app/profile/sungkim.bsky.social/post/3lcvvmtmcoc2u

Sung Kim (@sungkim.bsky.social)

Some learning resources on Reinforcement Learning in no particular order.

UCL Course on RL ( www.davidsilver.uk/teaching/ )
UC Berkeley's Deep Reinforcement Learning ( rail.eecs.berkeley.edu/deeprlcourse/ )
Lil's Reinforcement-Learning ( lilianweng.github.io/tags/reinfor... )

merry pollen Dec 11, 2024, 4:06 PM

#

normal siren https://bsky.app/profile/sungkim.bsky.social/post/3lcvvmtmcoc2u

Thanks

normal siren Dec 13, 2024, 1:43 AM

#

merry pollen Hello everyone, kindly suggest me best courses on reinforcement learning and sha...

Some more material:

merry pollen Dec 13, 2024, 2:55 AM

#

normal siren Some more material: - http://incompleteideas.net/book/the-book-2nd.html - https:...

Thank you 💕

normal siren Dec 21, 2024, 4:42 PM

#

merry pollen Hello everyone, kindly suggest me best courses on reinforcement learning and sha...

Forgot this one: https://spinningup.openai.com/

spice cypress Feb 22, 2025, 9:02 PM

#

anyone interested in teaming up for lux AI ?

gloomy token Mar 29, 2025, 1:58 PM

#

How good is DQN method

#

For EHR'S

fallen cargo Apr 3, 2025, 9:52 AM

#

Anyone used NotebookLM how is it?

fallen cargo Apr 8, 2025, 6:47 AM

#

Hi I am new here I think my earlier message was not relevant to this thread please tell me in which thread I can ask such questions
Thank you

jagged oriole Apr 14, 2025, 3:55 AM

#

fallen cargo Hi I am new here I think my earlier message was not relevant to this thread plea...

https://discord.com/channels/1101210829807956100/1129507816697241822

clear mortar May 15, 2025, 7:25 PM

#

Hello, I am learning Reinforcement Learning and interested in Automation, robotics and automotive technologies. Looking for a peer or group to learn together. Is anyone interested?

verbal otter May 19, 2025, 12:47 AM

#

Hi Everyone, I am a 2nd Year PhD student in Computer Science at University of Maryland Baltimore County specializing in Machine Learning, Reinforcement Learning, and Mathematical Reasoning in LLMs. I was thinking to write a Review paper on the current Maths Reasoning in LLMs , so was looking for potential collabrators on it. Thanks

spice cypress May 21, 2025, 8:42 AM

#

clear mortar Hello, I am learning Reinforcement Learning and interested in Automation, roboti...

YES

ember blaze May 21, 2025, 10:49 AM

#

clear mortar Hello, I am learning Reinforcement Learning and interested in Automation, roboti...

Hi, I learn supervised learning and unsupervised learning, I Interesting to talk together

fiery jolt May 24, 2025, 12:33 PM

#

clear mortar Hello, I am learning Reinforcement Learning and interested in Automation, roboti...

Yes I am very interested

wise canopy May 26, 2025, 8:52 PM

#

clear mortar Hello, I am learning Reinforcement Learning and interested in Automation, roboti...

Hi I am also learning automation and robotics. I am interested to join!

past cloud May 29, 2025, 3:51 AM

#

clear mortar Hello, I am learning Reinforcement Learning and interested in Automation, roboti...

yes I am interested to join

white olive Jun 2, 2025, 12:25 PM

#

This Python class offers a multiprocessing-powered Pool for efficiently collecting and managing experience replay data in reinforcement learning.

https://github.com/NoteDance/Pool

GitHub

GitHub - NoteDance/Pool: reinforcement learning, deep reinforcement...

reinforcement learning, deep reinforcement learning - NoteDance/Pool

boreal dagger Aug 18, 2025, 12:48 PM

#

Hi. I'm starting with RL, namely PPO and by extension GRPO. Anyone has prior experience?!

final sphinx Aug 22, 2025, 12:25 PM

#

im working on a project involving RL and drone delivery optimization, could you guys help by sharing some resources for learning RL that actually helped you all in learning RL? Thanks!

sinful turtle Sep 5, 2025, 4:10 PM

#

Job Title: Part-Time Senior AI/ML Engineer (Remote)

We are seeking a skilled and experienced Senior AI/ML Engineer to join our remote team on a part-time basis. The ideal candidate will have a strong technical background, excellent communication skills, and the ability to work independently in a fast-paced environment.

Requirements:
-Minimum of 7–10 years of professional software development experience

-Proven experience working effectively in a remote environment

-Advanced English proficiency (C1 or higher); an American accent is preferred

-Availability to work 10–15 hours per week during EST or CST business hours

If you're a highly motivated engineer with a passion for building high-quality software and can commit to a flexible part-time schedule, we’d love to hear from you.
You can connect with me on WhatsApp: +1 (567) 469-5384

bold arch Sep 15, 2025, 6:57 PM

#

Hi, @everybody
I have one question, I'm training ml models for the prediction, which is classification problem of 3 classes, where the number of samples are similar but the predition is skewed.
First class and second class is predicted with low precision tough, third class is never predicted. What's the reason? I can' t find the reason.
Before, when I applyed reinforcement learning, where the three classes were assigned to three actions and one action is never selected, too.
Actually, that is the preeiction model of forex eur/usd.

tidal notch Nov 9, 2025, 12:42 PM

#

https://media.discordapp.net/attachments/1436719817624256534/1436719913518633010/1.JPG?ex=6910a130&is=690f4fb0&hm=6a48397700e40b701b7defba0bc73ccc590e83e58af09eb7035cae318e9fb319&=&format=webp&width=515&height=687
https://media.discordapp.net/attachments/1436719817624256534/1436719914034659408/2.jpg?ex=6910a130&is=690f4fb0&hm=5d3c01e3db0b2fe7135969c69c22cbf49db07bae5ed8cb9a98ac3e18d3c73ce5&=&format=webp&width=515&height=687
https://media.discordapp.net/attachments/1436719817624256534/1436719914512547951/3.jpg?ex=6910a130&is=690f4fb0&hm=59a326eaa4d74733a406431b5c2eb8ee07f6b78d95094102deb1153d2e261407&=&format=webp&width=515&height=687

weak jewel Nov 9, 2025, 6:58 PM

#

clear mortar Hello, I am learning Reinforcement Learning and interested in Automation, roboti...

me

tawdry yarrow Nov 10, 2025, 1:59 PM

#

bold arch Hi, @everybody I have one question, I'm training ml models for the prediction, w...

The reason might be a Class imbalance in the Training Data, make your Training data class balanced , that all three classes have the same amount of samples in the whole of the training data.

bold arch Nov 10, 2025, 5:51 PM

#

I'm finding a US developer for the collaboration. If anybody interested, please dm me.

gleaming gorge Mar 16, 2026, 6:10 PM

#

https://www.kaggle.com/code/petrumihaicraciun/buck-shot-roulette-reinforcement-learning

#

Quick little game thing

#

if anyone has played Buck Shot roullete this is a little sample enviroment I made that you can make a AI on

#

very similar to the video game

solar knoll May 31, 2026, 1:11 AM

#

@gleaming gorge This is really cool, love that you used Mesa for the multi-agent setup, makes it super easy to swap in different agent classes. The BaseAgent abstraction is clean too.

gleaming gorge May 31, 2026, 1:12 AM

#

solar knoll <@447376149044002827> This is really cool, love that you used Mesa for the multi...

Now that I'm playing around with my PhD more I also recommend looking at @dataclasses in python for large models

#

I've kind of started moving it to seperate files because after 500 lines of code mesa can get hard to track so moving it into a kind of variable holder can help

#

https://docs.python.org/3/library/dataclasses.html