#reinforcement-learning
1 messages · Page 1 of 1 (latest)
Hi I am new, can you learn with me?
Sure .
I'd be glad to
Are you reading any book for RL or taking courses
I learn from the course.
And you?
Reading a book for it.
I feel like the courses do not cover enough theoretical ground
Yes they teach me only Q-learning.
Can you say book name?
Sutton and Barto, Introduction to Reinforcement Learning is a classic text, and it's also very good. Both for mathematical and theoretical treatment of the concepts.
Can you say where I get those books?
Okay thanks
No problem!
Oh wow that's the same book I'm reading. It's a great text
Though if you want a video course, David Silver's famous lectures are pretty good, nice balance of math
Can you give me link?
Sure! This is the playlist by Google Deepmind YT channel:
https://youtube.com/playlist?list=PLqYmG7hTraZDM-OYHWgPebj2MfCFzFObQ&feature=shared
Hi,,iam Sharath ,,iam a beginner here in this field of data science, AI and machine learning,,I want to learn RL,,can anybody have time for group studies or guiding me??
👋
Yes. I do have some time for group study
@wise magnet nice to have you
Thanks 👍 looking forward to learn RL
Yes we can discuss here about deep reinforcement learning
As i finished a few topics and feeling confident but have some doubts as well
Any working professionals here ?
I'm not a working professional per se.
But I've got a good experience
Nice 🙂
What doubts do you have?
There are a bunch of algorithms but don't know where to start to implement it
Also i want to implement those algo in games such as gta 5
https://github.com/udacity/deep-reinforcement-learning
This might be useful
Good luck!
Hi,
Did anyone work on multi agent algorithm in python ?
do i ask rl related doubts here ?
How can I balance the actor and critic networks?
Bhai pehle basics sikhle, uske bad gta and all ka bol
Bhai kay bolt reinforcement learning cha king hai me
I assume it's sarcastically said
No, I'm working professional
I have a speciality in model based deep reinforcement learning
Even Sutton and Barto or Lattimore won't claim the statement that you did
Yah i didn't say I'm superior, but I know how to solve problems using them , but yah still learning
Man it's not just about blindly using the algo, I see that you talk about Deep RL and stuff, and aim to do something crazy which is absolutely great, however the only concern is the basics.
Before even starting with Deep RL, there's a hell lot of stuffs involved in the learning and the planning setting, whether it be standard Bandits in pure exploratory/regret setting or MDP with various versions of it distributed across different types.
Hey there! I am a complete beginner to RL, I completed a beginner course on datacamp and solved the Frozen-Lake environment using what I learnt from that course, now I am trying to delve into Deep Q Learning and would like to know what resources I could potentially use, thanks!
Why RL is so hard to understand and learn? Am I the only one having this issue?
If so, can anyone tell me how to start with a simple explanation video or book or any other suggestions
💡Enroll to gain access to the full course:
https://deeplizard.com/course/rlcpailzrd
Welcome to this series on reinforcement learning! We'll first start out by introducing the absolute basics to build a solid ground for us to run.
We'll then progress onto more advanced and sophisticated topics that integrate artificial neural networks and deep ...
This is one of the best series I have seen on RL, she explains it really nicely
Thank you
Yeah no problem man, hope this solves your problem
Thanks for not sending david silver content or sutton book as I see the RL complicated even after seeing david silver lecture 2 and even with the book lol
But that is a video that I have never seen before so I know it is going to solve my problems for sure
Yeah sending an entire book seems like overkill especially if you just want a foundation
😂
Yeah, the book will overwhelm you with a ton of theories and equations that you don’t need but will just make you suffer and depressed on the RL
Real
Thanks man for the amazing playlist you send above I finished 7 of the 15 videos on the playlist I feel like I know more than I could possibly imagine in reinforcement learning
And I will definitely watch the rest to make projects in RL and Deep RL
That’s great to hear bro, glad you learned a lot
😁
It’s all thanks to you bro❤️
Lmk if you need anything else man
Let’s do some RL projects sometime and learn from each other
Those interested in bandit literature can check our recent work out
http://arxiv.org/abs/2408.14195
TLDR: We analyse a clustered multi-armed bandit formulation, where the learning objective is to identify representative arms from each cluster, in a fixed confidence setting
@here
We study the representative arm identification (RAI) problem in the multi-armed bandits (MAB) framework, wherein we have a collection of arms, each associated with an unknown reward distribution. An underlying instance is defined by a partitioning of the arms into clusters of predefined sizes, such that for any $j > i$, all arms in cluster $i$ h...
Huh can i see it too pls ?
💡Enroll to gain access to the full course:
https://deeplizard.com/course/rlcpailzrd
Welcome to this series on reinforcement learning! We'll first start out by introducing the absolute basics to build a solid ground for us to run.
We'll then progress onto more advanced and sophisticated topics that integrate artificial neural networks and deep ...
Here is the playlist
so this be last video of pandas series , In this video i be showing how to renaming the columns and combining two df with pandas . If you'd like to see the resources or code, check the repository link below:
Repository: https://github.com/Raufjatoi/Pandas/tree/main/vid6
Inspired by the Kaggle Pandas course: https://www.kaggle.com/learn/pandas...
As compared to conventional ML and DL it is because it requires software development sort of coding techniques to be used. While ML and DL has simple libraries that you can just simply import and work with
In this video, I will discuss bar charts and heat maps, explaining how they work and the trends they reveal in data, along with other related topics. If you're interested in seeing some code or resources, I have provided links below.
repo : https://github.com/Raufjatoi/Data-Visualization/tree/main/vid2
inspired by : https://www.kaggle.com/lear...
In this video, I will tell you about the scatter plots , explaining how it work and the trends it reveal in data, along with other related topics. If you're interested in seeing some code or resources, I have provided links below.
repo : https://github.com/Raufjatoi/Data-Visualization/tree/main/vid3
inspired by : https://www.kaggle.com/learn/d...
Reason: Bad word usage
Does anyone have best reinforcement learning playlist with maths
@dry widget Check out UCLxDeepmind RL series on youtube. You can also find a specialization from Uni of alberta on coursera. You can audit for free
Thanks 🌹
i dont know too much about reinforcement learning, but is it possible to make a program that takes cards from a trading card game (like pokemon or MTG) and figure out what a very good deck is by having it battle other decks?
i ask if it's possible because i dont know if it would take too long to run and if it would need like millions of battles to figure it out
Yes, you can have a look at https://rlcard.org/ as an example
Depending of the game, here they report it could take ~500k steps to train the agent: https://doi.org/10.48550/arXiv.1910.04376
RLCard is an open-source toolkit for reinforcement learning research in card games. It supports various card environments with easy-to-use interfaces, including Blackjack, Leduc Hold'em, Texas Hold'em, UNO, Dou Dizhu and Mahjong. The goal of RLCard is to bridge reinforcement learning and imperfect information games, and push forward the research...
@normal siren oh wow thank you!! I'll check that out
HI, I am Abdullah I am an ML engineer want to join any team to particapte in kaggle competions
i need help to take this test. can we google meet please to help me out?
Hi, Please help me.
I'm going to make a search engine based on customer behaviors.
Inputs: query embedding, history embedding (Metadata is stored with vector formats)
We use cosine similarity and train embedding models using multi armed bandits.(Is that possible?)
I have two questions about this.
First, how to get the gradient of embedding when use consime similarity?(Can that be estimated in torch?)
Second, for the search, we use two steps, updating the weights about historical embedding and query embedding at the same time, I think that can be noisy.
But I can't make sure. I attahced diagram. And if any questions, feel free to ask.
https://drive.google.com/file/d/1_vWxdasnHjCL6_momviQzcDYHAAgc-1M/view?usp=sharing
I believe this channel is about Discussing Reinforcement learning kindly refrain to put any other content or stuff which is unrelated
Hello everyone, kindly suggest me best courses on reinforcement learning and share reviews about reinforcement learning specialization on Coursera
Some learning resources on Reinforcement Learning in no particular order.
- UCL Course on RL ( www.davidsilver.uk/teaching/ )
- UC Berkeley's Deep Reinforcement Learning ( rail.eecs.berkeley.edu/deeprlcourse/ )
- Lil's Reinforcement-Learning ( lilianweng.github.io/tags/reinfor... )
Thanks
Thank you 💕
Forgot this one: https://spinningup.openai.com/
anyone interested in teaming up for lux AI ?
Anyone used NotebookLM how is it?
Hi I am new here I think my earlier message was not relevant to this thread please tell me in which thread I can ask such questions
Thank you
Hello, I am learning Reinforcement Learning and interested in Automation, robotics and automotive technologies. Looking for a peer or group to learn together. Is anyone interested?
Hi Everyone, I am a 2nd Year PhD student in Computer Science at University of Maryland Baltimore County specializing in Machine Learning, Reinforcement Learning, and Mathematical Reasoning in LLMs. I was thinking to write a Review paper on the current Maths Reasoning in LLMs , so was looking for potential collabrators on it. Thanks
YES
Hi, I learn supervised learning and unsupervised learning, I Interesting to talk together
Yes I am very interested
Hi I am also learning automation and robotics. I am interested to join!
yes I am interested to join
This Python class offers a multiprocessing-powered Pool for efficiently collecting and managing experience replay data in reinforcement learning.
Hi. I'm starting with RL, namely PPO and by extension GRPO. Anyone has prior experience?!
im working on a project involving RL and drone delivery optimization, could you guys help by sharing some resources for learning RL that actually helped you all in learning RL? Thanks!
Job Title: Part-Time Senior AI/ML Engineer (Remote)
We are seeking a skilled and experienced Senior AI/ML Engineer to join our remote team on a part-time basis. The ideal candidate will have a strong technical background, excellent communication skills, and the ability to work independently in a fast-paced environment.
Requirements:
-Minimum of 7–10 years of professional software development experience
-Proven experience working effectively in a remote environment
-Advanced English proficiency (C1 or higher); an American accent is preferred
-Availability to work 10–15 hours per week during EST or CST business hours
If you're a highly motivated engineer with a passion for building high-quality software and can commit to a flexible part-time schedule, we’d love to hear from you.
You can connect with me on WhatsApp: +1 (567) 469-5384
Hi, @everybody
I have one question, I'm training ml models for the prediction, which is classification problem of 3 classes, where the number of samples are similar but the predition is skewed.
First class and second class is predicted with low precision tough, third class is never predicted. What's the reason? I can' t find the reason.
Before, when I applyed reinforcement learning, where the three classes were assigned to three actions and one action is never selected, too.
Actually, that is the preeiction model of forex eur/usd.
https://media.discordapp.net/attachments/1436719817624256534/1436719913518633010/1.JPG?ex=6910a130&is=690f4fb0&hm=6a48397700e40b701b7defba0bc73ccc590e83e58af09eb7035cae318e9fb319&=&format=webp&width=515&height=687
https://media.discordapp.net/attachments/1436719817624256534/1436719914034659408/2.jpg?ex=6910a130&is=690f4fb0&hm=5d3c01e3db0b2fe7135969c69c22cbf49db07bae5ed8cb9a98ac3e18d3c73ce5&=&format=webp&width=515&height=687
https://media.discordapp.net/attachments/1436719817624256534/1436719914512547951/3.jpg?ex=6910a130&is=690f4fb0&hm=59a326eaa4d74733a406431b5c2eb8ee07f6b78d95094102deb1153d2e261407&=&format=webp&width=515&height=687
me
The reason might be a Class imbalance in the Training Data, make your Training data class balanced , that all three classes have the same amount of samples in the whole of the training data.
I'm finding a US developer for the collaboration. If anybody interested, please dm me.
Quick little game thing
if anyone has played Buck Shot roullete this is a little sample enviroment I made that you can make a AI on
very similar to the video game
@gleaming gorge This is really cool, love that you used Mesa for the multi-agent setup, makes it super easy to swap in different agent classes. The BaseAgent abstraction is clean too.
Now that I'm playing around with my PhD more I also recommend looking at @dataclasses in python for large models
I've kind of started moving it to seperate files because after 500 lines of code mesa can get hard to track so moving it into a kind of variable holder can help