iron basalt Jan 16, 2025, 9:48 PM

#

You want an agent that can adapt immediately to what the player was doing in the previous round?

#

With very little data?

analog gust Jan 16, 2025, 10:07 PM

#

iron basalt You want an agent that can adapt immediately to what the player was doing in the...

that's exactly my problem. my original plan was to handle everything (the adaptations) in the c# code, but my professor insisted on machine learning, i guess because it is more interesting in the field nowadays. so i had to start from scratch learning the basics of machine learning, and the more i learn the less it even makes sense... there's only so much data i can get from one playthrough

flat token Jan 16, 2025, 10:20 PM

#

analog gust thats exactly my problem. my original plan was to handle everything in the c# co...

C lends itself very well to machine learning code, but you have to know the underlying concepts to write it correctly

iron basalt Jan 16, 2025, 10:20 PM

#

analog gust thats exactly my problem. my original plan was to handle everything in the c# co...

For the basic idea: https://www.youtube.com/watch?v=sw7UAZNgGg8

YouTube

Vsauce2

The Game That Learns

By the 1950s, science fiction was beginning to become reality: machines didn’t just calculate; they began to learn. Machine calculating was out. Machine learning was in. But we had to start small.

Donald Michie’s “Machine Educable Noughts And Crosses Engine” -- MENACE -- was composed of 304 separate matchboxes that each depicted a possible stat...

▶ Play video

flat token Jan 16, 2025, 10:20 PM

#

I've implemented a few SVMs already in C++ and they were incredibly fast

iron basalt Jan 16, 2025, 10:21 PM

#

You can use this for any game. You need some set of moves and conditions that make them valid to choose from; you can then do what is done in that video (directly).

#

The learning part here is that it starts out really bad, basically playing random moves.

analog gust Jan 16, 2025, 10:22 PM

#

so... this is basically without any python based learning algorithms then?

iron basalt Jan 16, 2025, 10:22 PM

#

analog gust so... this is basically without any python based learning algorithms then?

You can do this in any programming language, or as shown in the video, mechanically/IRL/by-hand.

#

If you have a computer (machine, not person, "computer" used to be a job title) do it, then it's machine learning.

analog gust Jan 16, 2025, 10:24 PM

#

my prof basically said if i use scikit learn i would avoid writing everything myself, since algorithms like that "already exist", but i'm kinda starting to doubt it... at least until now it's been a hell of a lot more work than just writing it myself

iron basalt Jan 16, 2025, 10:25 PM

#

It's very simple machine learning, but in terms of gameplay experience it does pretty much exactly what you are asking for.

iron basalt Jan 16, 2025, 10:26 PM

#

analog gust my prof basically said if i use scikit learn i would avoid writing everything my...

That does not apply here directly. You could, for example, take the game state and run some clustering on it. Each cluster roughly represents the current "situation" in the game, and if a unique situation happens, a new cluster can be formed with its own set of associated learned moves. Based on which cluster the current state is part of, the boss decides from a certain set of appropriate moves (randomly at first, but then learned over time).

#

In the video I linked, you don't need this because you basically just take the board state and map it directly to moves (like a hashmap lookup table).

#

The game state is simple.

#

And there are not too many possible states.

#

One way to extend this idea directly such that you can have similar game states map to the same "bucket" (as in a hashmap) of moves is called locality-sensitive hashing, which is an option (https://en.wikipedia.org/wiki/Locality-sensitive_hashing ).

Locality-sensitive hashing

In computer science, locality-sensitive hashing (LSH) is a fuzzy hashing technique that hashes similar input items into the same "buckets" with high probability. (The number of buckets is much smaller than the universe of possible input items.) Since similar items end up in the same buckets, this technique can be used for data clustering and nea...
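A rough sketch of one common LSH variant (random hyperplanes), in case it helps to see the bucket idea in code. The three-number state `(x, y, health)` and the function names are made up for illustration, and real features would usually be centered and scaled first, since this variant only looks at which side of each random plane a state falls on.

```python
import random

def make_hyperplanes(num_planes, dim, seed=0):
    """Draw random hyperplane normals; a fixed seed keeps buckets reproducible."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_planes)]

def lsh_bucket(state, hyperplanes):
    """Return a bit-string bucket key: one bit per hyperplane (which side the state is on)."""
    bits = []
    for plane in hyperplanes:
        dot = sum(p * x for p, x in zip(plane, state))
        bits.append("1" if dot >= 0 else "0")
    return "".join(bits)

planes = make_hyperplanes(num_planes=8, dim=3)

# Hypothetical (x, y, health) states, assumed for this example only.
a = lsh_bucket([10.0, 10.0, 0.8], planes)
b = lsh_bucket([10.0000001, 10.0, 0.8], planes)   # nearly identical to a
c = lsh_bucket([-10.0, -10.0, -0.8], planes)      # points the opposite way

# Bucket key -> moves the boss may try in any state that lands in that bucket.
moves_by_bucket = {a: ["swing", "jump_back"]}
print(moves_by_bucket.get(b, []))
```

Nearly identical states almost always share a bucket, so they share one set of learned moves. More hyperplanes give smaller, more specific buckets.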

analog gust Jan 16, 2025, 10:31 PM

#

I'm rly trying to write all of this down but I already know this is probably gonna push my thesis back at least another month until I get all of that running Sigh3

#

I appreciate all the effort tho! Since my previous approach obviously ran me into a wall haha

iron basalt Jan 16, 2025, 10:33 PM

#

analog gust I'm rly trying to write all of this down but I already know this is probably gon...

If you do what that video does, it will work, but you may notice that in that video there are not that many game states. In a realtime game you have an absurd number of possible unique game states happening quickly, so you need to group similar ones together to wrangle this complexity. This is where other parts of machine learning (clustering) come into play.

#

Imagine you had a lookup table that maps from a game state to the set of moves the boss should play in that state (it can pick any of them as needed, learning which ones are best in that case). This is fine for something like tic-tac-toe: there are not many entries in the table, since there are not that many possibilities. But if you now have, say, chess, you have a big problem: your table is massive. So we have to try to reduce it. You can do this by treating different but "similar" states as a single entry in the table, so they all map to the same set of moves. This is where machine learning started, and it's still the same problem.

#

To make this fun for the player, so they can see the boss getting better, the boss at first just picks random moves from the set of moves the state was mapped to. Over time, it can remove moves that made it lose (with some random chance, so a losing move is not always removed right away).
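A minimal sketch of that idea, loosely modeled on the matchbox scheme from the video. The class name, the state key `"player_close"`, the move names and the `removal_chance` value are all invented for this example, and it only implements the "remove losing moves" half described above.

```python
import random
from collections import defaultdict

class MatchboxLearner:
    """MENACE-style learner: one 'matchbox' of candidate moves per game state."""

    def __init__(self, all_moves, seed=None):
        self.all_moves = list(all_moves)
        self.rng = random.Random(seed)
        # Every state starts with all moves available, so early play is random.
        self.boxes = defaultdict(lambda: list(self.all_moves))
        self.history = []  # (state, move) pairs chosen during the current round

    def choose(self, state):
        """Pick a random move from this state's box and remember the choice."""
        move = self.rng.choice(self.boxes[state])
        self.history.append((state, move))
        return move

    def end_round(self, boss_won, removal_chance=0.5):
        """After a lost round, randomly drop some of the moves played in it."""
        if not boss_won:
            for state, move in self.history:
                box = self.boxes[state]
                # Keep at least one move so the boss can always act.
                if len(box) > 1 and move in box and self.rng.random() < removal_chance:
                    box.remove(move)
        self.history.clear()

# Toy usage: pretend that only "block" ever wins when the player is close.
boss = MatchboxLearner(["swing", "jump_back", "block"], seed=0)
for _ in range(50):
    move = boss.choose("player_close")
    boss.end_round(boss_won=(move == "block"))
print(boss.boxes["player_close"])
```

Because a winning move is never removed, the box for a state gradually shrinks toward the moves that work against this particular player, which is the visible "getting better" effect.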

analog gust Jan 16, 2025, 10:40 PM

#

iron basalt If you do what that video does, it will work, but you may notice that in that vi...

well, i kinda already gave up on realtime adaptation, hence why i'm working with rounds now. my boss already has states, since he already decides when to locate, approach, attack or defend. however, just adding more states and conditions to this is not enough machine learning according to my prof?

#

if thats what you mean

iron basalt Jan 16, 2025, 10:42 PM

#

analog gust well, i kinda already gave up on realtime adaptation , hence why i'm working wit...

I mean for example, if the player is at position (10, 10) -> {swing, jump back, etc}. Problem is, the player can also be at say (10.0000001, 10) -> {other set of things}.
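One simple way around that, sketched here under the assumption that positions are plain floats: snap each position to a grid cell before looking anything up, so that (10, 10) and (10.0000001, 10) share one entry. The function name and `cell_size` are hypothetical.

```python
def quantize_position(x, y, cell_size=1.0):
    """Snap a continuous position to the integer grid cell that contains it."""
    return (int(x // cell_size), int(y // cell_size))

# One table entry now covers every position inside the same cell.
moves_by_cell = {(10, 10): ["swing", "jump_back"]}

cell_a = quantize_position(10.0, 10.0)
cell_b = quantize_position(10.0000001, 10.0)
print(cell_a == cell_b, moves_by_cell[cell_b])
```

Positions just on either side of a cell border (for example 9.9999999 and 10.0) still land in different cells, which is one reason fuzzier grouping such as clustering gets used instead.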

#

When I say game state, I mean the state of the whole game.

analog gust Jan 16, 2025, 10:44 PM

#

maybe I dont understand floof_cry

iron basalt Jan 16, 2025, 10:44 PM

#

#

This is a chess game state.

#

The boss takes actions based on the current (and past) game states.

#

If you take this state, and run it through a hashing function, you get a single number representing this unique game state.

#

You can then use that number to lookup a set of moves (lookup table).
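As a small illustration of those two steps, here is a sketch using a tic-tac-toe board rather than chess, to keep it short. The board layout and the `moves_for` helper are assumptions made for this example.

```python
# A tic-tac-toe board as a flat tuple, read left to right, top to bottom.
board = ("X", "O", " ",
         " ", "X", " ",
         " ", " ", "O")

# hash() turns the whole state into a single integer. For strings it is only
# stable within one run of the program, which is enough for an in-memory table.
state_id = hash(board)

# Lookup table: state number -> moves (board indices) the boss may try there.
move_table = {state_id: [2, 3, 5, 6, 7]}

def moves_for(state):
    """Return the stored moves for this exact state, or an empty list."""
    return move_table.get(hash(state), [])

print(moves_for(board))
```

In practice a Python dict keyed by the tuple itself does the hashing internally; the explicit `hash()` call is only there to make the "state becomes a number" step visible.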

#

Imagine I gave you a book with every possible chess game state in it, each followed by the perfect move to play in that state (on the same page), and then asked you to play the perfect move. You can jump to the page that has the matching game state and play the associated move.

#

But, try to imagine how big that book would be, how many possible game states chess has.

#

(It does not fit in this universe levels of big)

analog gust Jan 16, 2025, 10:50 PM

#

so a game state is all of the boss's stats and unlocked abilities in the current round the player is playing? since it really only changes once the next round starts

#

floof_think

iron basalt Jan 16, 2025, 10:51 PM

#

It's more than that: it's the boss's position, the player's position, the map geometry, their health, etc. Literally every variable in the game.

#

In chess I can show all of that with a single screenshot, because chess does not have any hidden state (both players know the full exact game state at all times).

#

This is in contrast to poker, where you have hidden state.

#

Chess also does not have to handle stuff like a position of 1.00001, it's all integers.

iron basalt Jan 16, 2025, 10:55 PM

#

iron basalt But, try to imagine how big that book would be, how many possible game states ch...

So anyhow, instead of doing this, we instead make a much smaller book, with only a few states listed in it and each has a list of "decent/good moves." And now when I tell you to play the perfect (or just good) move, I give you the book and tell you to find the state that is most similar to the one given, then see if any of the moves listed there are valid, and if they are, there is a good chance they are decent moves to play (you still need to manually check this, but you have narrowed down your choices a lot, spending much less time to find a good move).
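The "smaller book" can be sketched as a nearest-neighbor lookup. Everything concrete here is assumed for illustration: the three features (distance to player, boss health, player health), the stored states, and the move names.

```python
import math

# Hypothetical "small book": a few representative states, each with decent moves.
# Features: (distance_to_player, boss_health, player_health).
BOOK = [
    ((1.0, 0.9, 0.9), ["swing", "jump_back"]),
    ((8.0, 0.9, 0.9), ["approach", "ranged_attack"]),
    ((1.0, 0.2, 0.9), ["defend", "jump_back"]),
]

def closest_entry(state):
    """Find the book entry whose stored state is most similar (Euclidean distance)."""
    return min(BOOK, key=lambda entry: math.dist(entry[0], state))

def candidate_moves(state, is_valid):
    """Moves listed for the most similar state, filtered by a validity check."""
    _, moves = closest_entry(state)
    return [move for move in moves if is_valid(move)]

# Example: the boss cannot jump back right now (say, a wall is behind it).
print(candidate_moves((1.3, 0.85, 0.5), is_valid=lambda m: m != "jump_back"))
```

Note that raw Euclidean distance lets the feature with the largest range dominate, so features would normally be scaled to comparable ranges before comparing states.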

thick rapids Jan 16, 2025, 10:57 PM

#

Hey guys

#

Is there any sql Postgres expert here

versed axle Jan 16, 2025, 11:03 PM

#

iron basalt

Rxd8

#

(I'm bad at chess)

calm thicket Jan 16, 2025, 11:15 PM

#

thick rapids Is there any sql Postgres expert here

what would you ask them

serene scaffold Jan 16, 2025, 11:15 PM

#

thick rapids Is there any sql Postgres expert here

You're probably looking for #databases . Remember to always ask your actual question and not if someone knows about the topic of a secret question

iron basalt Jan 16, 2025, 11:30 PM

#

versed axle Rxd8

Bf8.

warm copper Jan 16, 2025, 11:55 PM

#

what do you think? Is F1 better in most cases when you care about the trade-off?

#

Ive seen cases where FP and FN can be both problematic

#

especially for spam and fraud

thick rapids Jan 16, 2025, 11:56 PM

#

serene scaffold You're probably looking for #databases. Remember to always ask your ...

Its not secret I just need some paragraphs to explain

warm copper Jan 16, 2025, 11:56 PM

#

your emails get labeled as spam when they are not supposed to, or they don't get labeled as spam when they are supposed to

serene scaffold Jan 16, 2025, 11:57 PM

#

thick rapids Its not secret I just need some paragraphs to explain

you're asking if anyone is a postgres expert. but postgres experts can't answer your question if they don't know what it is. so it saves everyone a step if you just ask the postgres question, and then postgres experts can answer it if they see it.

warm copper Jan 16, 2025, 11:57 PM

#

minimizing false positives vs minimizing false negatives xD
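For reference, a short sketch of how F1 combines the two error types. The counts in the usage line are made-up numbers for a spam filter, not results from anyone's project.

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0   # hurt by false positives
    recall = tp / (tp + fn) if (tp + fn) else 0.0      # hurt by false negatives
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean of the two
    return precision, recall, f1

# Hypothetical spam filter: 80 spam caught, 20 good emails flagged, 40 spam missed.
precision, recall, f1 = precision_recall_f1(tp=80, fp=20, fn=40)
print(round(precision, 3), round(recall, 3), round(f1, 3))
```

F1 weighs both error types equally, so when one kind of mistake is clearly more costly (a blocked legitimate email versus a missed fraud case), looking at precision and recall separately is usually more informative.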

#

I did work with Postgres last semester for my DBMS course but Im not an expert

#

whats the question?

#

also isnt that more of a database question 🥲

thick rapids Jan 17, 2025, 12:00 AM

#

Im going to ask at databases

warm copper Jan 17, 2025, 12:00 AM

#

I hate databases

#

like SQL was a nightmare for me last semester

warm copper Jan 17, 2025, 12:00 AM

#

serene scaffold you're asking if anyone is a postgres expert. but postgres experts can't answer ...

do you know RL?

#

Reinforcement Learning

serene scaffold Jan 17, 2025, 12:01 AM

#

warm copper do you know RL?

are you asking to ask?

warm copper Jan 17, 2025, 12:01 AM

#

Just wondering how hard it is

#

Im taking RL next semester

#

The syllabus looked horrifying

serene scaffold Jan 17, 2025, 12:01 AM

#

I don't know. there isn't really an application for it in NLP.

warm copper Jan 17, 2025, 12:01 AM

#

really?

#

I thought LLMs use RLHF to help with bias

thick rapids Jan 17, 2025, 12:02 AM

#

warm copper like SQL was a nightmare for me last semester

For me too

warm copper Jan 17, 2025, 12:03 AM

#

To mitigate hallucinations in LLMs we can use RLHF

#

we get human feedback during training and use a reward model

#

which is basically RL

#

also they ask examples of supervised and unsupervised learning?

thick rapids Jan 17, 2025, 12:05 AM

#

Databases is dead

warm copper Jan 17, 2025, 12:05 AM

#

I would say Logistic Regression is a supervised learning

#

and unsupervised learning would be like clustering techniques

#

like K-Means

#

I used Isolation Forest for my spam detection which is unsupervised too xD

spring field Jan 17, 2025, 12:08 AM

#

I have recently found out that the proper term for "unsupervised" is "self-supervised"

thick rapids Jan 17, 2025, 12:08 AM

#

Loooool

warm copper Jan 17, 2025, 12:09 AM

#

yeah that is correct

#

self supervised is the right term

thick rapids Jan 17, 2025, 12:09 AM

#

Yeah but not that usual

warm copper Jan 17, 2025, 12:10 AM

#

Basically you can use CNNs and RNNs, which are usually used for supervised learning, and apply anomaly detection methods to them to make them self-supervised

thick rapids Jan 17, 2025, 12:10 AM

#

Why

serene scaffold Jan 17, 2025, 12:10 AM

#

spring field I have recently found out that the proper term for "unsupervised" is "self-super...

terms are only "proper" insofar as there's a consensus about what they mean. and there often isn't.
I don't recognize those as synonyms.

spring field Jan 17, 2025, 12:10 AM

#

though a "proper term" in this field is a bit ironic

warm copper Jan 17, 2025, 12:10 AM

#

you can use CNNs in GANs

#

if someone asks me if CNN is supervised or unsupervised I would say it depends

#

how CNN is used

spring field Jan 17, 2025, 12:13 AM

#

serene scaffold terms are only "proper" insofar as there's a consensus about what they mean. and...

true true, the "proper" was a bit rushed I suppose
though, well, that doesn't make life any easier that new terms are introduced, meaning different things for different people and institutions bread_pensive
as a linguist, that must drive you crazy (at least you won't have to worry about a job in that sense, lol)

thick rapids Jan 17, 2025, 12:14 AM

#

Why aren’t they synonyms

warm copper Jan 17, 2025, 12:14 AM

#

I started to forget a lot of things from my linguistics degree tbh

#

I havent practiced it for a long time

#

I mean self supervised is like you supervise yourself

#

unsupervised means no supervision at all

thick rapids Jan 17, 2025, 12:15 AM

#

Ok got it

spring field Jan 17, 2025, 12:16 AM

#

interesting, I guess there is a difference after all, though they both have in common that you don't have labeled/known targets
https://ai.stackexchange.com/questions/40341/what-is-the-difference-between-self-supervised-and-unsupervised-learning

thick rapids Jan 17, 2025, 12:16 AM

#

Guys do you think all ML after all is statistics written in code

warm copper Jan 17, 2025, 12:17 AM

#

would you say a kid aged 7 at a pool is self supervised or unsupervised?

#

can they supervise themselves?

spring field Jan 17, 2025, 12:17 AM

#

that is certainly one of the analogies of all time

warm copper Jan 17, 2025, 12:18 AM

#

thats why they cant be synonyms

thick rapids Jan 17, 2025, 12:18 AM

#

warm copper would you say a kid aged 7 at a pool is self supervised or unsupervised?

Pretty on point

warm copper Jan 17, 2025, 12:20 AM

#

this is what you need to know:
Supervised learning is learning from labeled data
Unsupervised learning is learning from unlabeled data
Self-supervised learning is learning from unlabeled data with learned labels

#

in Self-supervised learning those learned labels are synthetic
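A tiny sketch of where such labels can come from, assuming the common setup in which they are generated from the unlabeled data itself. Next-word prediction is used as the example, and the sentence and function name are invented.

```python
def make_next_word_pairs(text, context_size=2):
    """Turn raw, unlabeled text into (context, target) training pairs."""
    words = text.split()
    pairs = []
    for i in range(len(words) - context_size):
        context = tuple(words[i:i + context_size])
        target = words[i + context_size]   # the "label" is just the next word
        pairs.append((context, target))
    return pairs

pairs = make_next_word_pairs("the boss learns from the player")
for context, target in pairs:
    print(context, "->", target)
```

No human labeled anything here, yet each pair can be fed to an ordinary supervised training loop, which is roughly what separates self-supervised from plain unsupervised methods such as clustering.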

#

Theres also Zero-Shot, One-Shot and Few-Shot Learning @spring field

#

which are used in NLP

#

you will see that one-shot and few-shot learning are types of supervised learning

#

and zero-shot is transfer learning

#

I was going crazy with all those last semester

#

ugh

spring field Jan 17, 2025, 12:27 AM

#

I am yet to, lol

warm copper Jan 17, 2025, 12:42 AM

#

spring field I am yet to, lol

are you doing DS?

spring field Jan 17, 2025, 12:50 AM

#

In a sense I guess, I'm employed currently and the position does involve ML and DS

warm copper Jan 17, 2025, 12:51 AM

#

so lucky

#

I need a job like that

rich river Jan 17, 2025, 3:48 AM

#

tensor[..., 0, True, 1::2, torch.tensor([1, 2])]

tensor.index({"...", 0, true, Slice(1, None, 2), torch::tensor({1, 2})})

can anyone explain what this is?
https://pytorch.org/cppdocs/notes/tensor_indexing.html

desert oar Jan 17, 2025, 4:53 AM

#

rich river ``` tensor[..., 0, True, 1::2, torch.tensor([1, 2])] tensor.index({"...", 0, tr...

what do you want to know about it? this is a particularly contrived example of advanced indexing functionality that is possible with pytorch tensors

#

admittedly i can't find any official docs on the "Python API" that they refer to -- but as far as i know (and as far as i've used) it's similar to the Numpy API, which is described here https://numpy.org/doc/stable/user/basics.indexing.html

#

aha:

When accessing the contents of a tensor via indexing, PyTorch follows Numpy behaviors that basic indexing returns views, while advanced indexing returns a copy. Assignment via either basic or advanced indexing is in-place. See more examples in Numpy indexing documentation.
https://pytorch.org/docs/stable/tensor_view.html

#

that's at least a clue

#

so this is the (contrived) PyTorch Python code: `tensor[..., 0, True, 1::2, torch.tensor([1, 2])]`
and this is the C++ equivalent: `tensor.index({"...", 0, true, Slice(1, None, 2), torch::tensor({1, 2})})`

rich river Jan 17, 2025, 5:29 AM

#

desert oar what do you want to know about it? this is a particularly contrived example of a...

yeah I can understand previous codes

inland crown Jan 17, 2025, 5:48 AM

#

I have a datastream of 100 crypto coins. The top 100 for the hour. It charts their battle for the top. I get these vertical lines in the graph and I feel they are the result of some synch problem. Thoughts?

#

The system is all supposed to run on a 60 second cycle.

#

so 1 small gap is 60 seconds.

#

maybe a synch issue with the incoming datastream's update cycle?

final cobalt Jan 17, 2025, 5:51 AM

#

https://paste.pythondiscord.com/TNJQ

#

Q.Q

#

I'm just getting black images from the generate() function. The thing seems to be learning pretty good, but I seem to be getting nans/infs in my output. Should I be clamping something?

inland crown Jan 17, 2025, 7:06 AM

#

SUCCESS!

final cobalt Jan 17, 2025, 8:05 AM

#

You won crypto!

fickle shale Jan 17, 2025, 11:22 AM

#

inland crown SUCCESS!

Boom!

rare lynx Jan 17, 2025, 11:47 AM

#

hey guys, can you help me build a model for my EEG analysis? You can find the notebook here: https://www.kaggle.com/code/pramitroy/data-processing (DM me if you have suggestions or a better model)

Data Processing

Explore and run machine learning code with Kaggle Notebooks | Using data from Rest eyes open - Parkinsons Disease 64-Channel EEG

granite mica Jan 17, 2025, 2:33 PM

#

Plz send me a code that creates its own answers for any questions asked

cursive oriole Jan 17, 2025, 4:13 PM

#

granite mica Plz send me a code that creates its own answers for any questions asked

just API call a LLM or use Ollama

subtle glade Jan 17, 2025, 7:16 PM

#

This GPU cost thing is a problem, programming with anxiety about spending $ sucks

serene scaffold Jan 17, 2025, 7:26 PM

#

subtle glade This GPU cost thing is a problem, programming with anxiety about spending $ suck...

The unfortunate reality is that cutting edge ML depends on the very best hardware, and the title for best hardware keeps getting won by bigger and more expensive devices

#

If you're a student, you can see if your university has a compute environment that you can use

subtle glade Jan 17, 2025, 7:42 PM

#

Nope, no university. Think I'm going to try a huggingface subscription

iron basalt Jan 17, 2025, 7:57 PM

#

subtle glade This GPU cost thing is a problem, programming with anxiety about spending $ suck...

What are you trying to make?

subtle glade Jan 17, 2025, 7:58 PM

#

RAG chatbots, just for study

final cobalt Jan 17, 2025, 9:25 PM

#

Success?

#

Something is still wrong of course, but this is progress

#

Here's the issue: what you'd normally expect from a half- or poorly-trained diffusion model is blobs of noise somewhat resembling structure. This looks more like a perfectly clean image with noise laid overtop of it. This image above was generated from pure noise

#

Anyone have any idea what might cause this?

ionic valley Jan 17, 2025, 11:21 PM

#

is there a point in pre-normalizing your data if your model already contains batch normalization?

final cobalt Jan 17, 2025, 11:30 PM

#

ionic valley is there a point in pre-normalizing your data if your model already contains bat...

It depends on what you're doing. That begs the question, what are you doing?

ionic valley Jan 18, 2025, 12:55 AM

#

final cobalt It depends on what you're doing. That begs the question, what are you doing?

this isn't a question on a project, I'm just trying to learn more about batch norm

final cobalt Jan 18, 2025, 2:54 AM

#

So, I've got a question for y'all

#

Something I think one can't learn from a book

#

How does one debug and tune a neural network? I mean, when you've got a network that is theoretically sound but isn't working (or could work better), what's the process for figuring it out?

#

Aside from virgin sacrifice, that is

serene scaffold Jan 18, 2025, 3:16 AM

#

final cobalt How does one debug and tune a neural network? I mean, when you've got a network ...

You need to figure out in which specific situations it's not working and what those situations have in common

neat sparrow Jan 18, 2025, 3:28 AM

#

I'm not fully sure what I'm looking for, but I'm attempting to train and fine-tune a model. I have a high-end gaming pc that can process the datasets, however, this would take me very long. I'm going to be processing multiple terabytes of data. Is there a cheap cloud server or remote server I can run this all from and process data faster?

serene scaffold Jan 18, 2025, 3:29 AM

#

neat sparrow I'm not fully sure what I'm looking for, but I'm attempting to train and fine-tu...

It will be very expensive to do this no matter what.

#

If you're trying to do this as a private person (and not on behalf of a company or institution that can pay for it), I would scale this down by orders of magnitude

neat sparrow Jan 18, 2025, 3:31 AM

#

serene scaffold It will be very expensive to do this no matter what.

Is there an alternative? So I just have to wait it out? Also, if my computer restarts or goes into sleep for some reason, how can I save the data?

serene scaffold Jan 18, 2025, 3:31 AM

#

neat sparrow Is there an alternative? So I just have to wait it out? Also, if my computer res...

What kind of data are the terabytes of it that you have?

neat sparrow Jan 18, 2025, 3:31 AM

#

serene scaffold If you're trying to do this as a private person (and not on behalf of a company ...

Hm. Well, I really don't want to do that.

neat sparrow Jan 18, 2025, 3:33 AM

#

serene scaffold What kind of data are the terabytes of it that you have?

HuggingFace datasets such as FineWeb or Common Crawl. I already trained it on smaller datasets, however.

serene scaffold Jan 18, 2025, 3:33 AM

#

neat sparrow HuggingFace datasets such as FineWeb or Common Crawl. I already trained it on sm...

and what do you want to train the model to do?

neat sparrow Jan 18, 2025, 3:38 AM

#

serene scaffold and what do you want to train the model to do?

Text-2-text generation/multi-turn dialogue.

serene scaffold Jan 18, 2025, 3:39 AM

#

neat sparrow Text-2-text generation/multi-turn dialogue.

I do not think you should try to do this on your own computer with terabytes of data, and I do not think there is a cloud compute platform where you can do this cheaply.

final cobalt Jan 18, 2025, 3:44 AM

#

serene scaffold I do not think you should try to do this on your own computer with terabytes of ...

Salad comes to mind

#

But you'd have to be buying in bulk

neat sparrow Jan 18, 2025, 4:03 AM

#

serene scaffold I do not think you should try to do this on your own computer with terabytes of ...

Okay, if I can't do it cheaply, then what would it be?

pine escarp Jan 18, 2025, 9:29 AM

#

Hello.
What are some advanced projects i can add in my portfolio?

tawdry sundial Jan 18, 2025, 12:01 PM

#

i am making an agent with lots of functions to use in function call, i assume adding hundreds of functions to a llm request would be quite expensive.

#

how could i make it cheaper? i was thinking of implementing rag but not so sure about how that will work

#

currently the functions are split into files where each file has functions that relate to each other, all these files are stored in scripts folder

#

i am sure this is a common challenge when making agents

#

would appreciate any suggestions on how to deal with a large number of functions to add to an llm request
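One possible reading of the RAG idea, sketched under assumptions: keep a short description for every function, retrieve only the few descriptions most relevant to the user's request, and send just those with the LLM call. The tool names and descriptions below are invented, and plain word overlap stands in for the embedding search a real setup would more likely use.

```python
def tokenize(text):
    """Very rough tokenizer: lowercase words as a set."""
    return set(text.lower().split())

def top_k_tools(query, tools, k=2):
    """Rank tool descriptions by Jaccard word overlap with the query."""
    query_words = tokenize(query)

    def score(item):
        _, description = item
        desc_words = tokenize(description)
        union = query_words | desc_words
        return len(query_words & desc_words) / len(union) if union else 0.0

    ranked = sorted(tools.items(), key=score, reverse=True)
    return [name for name, _ in ranked[:k]]

# Hypothetical tool registry: function name -> one-line description.
TOOLS = {
    "get_weather": "get the current weather forecast for a city",
    "send_email": "send an email message to a recipient",
    "create_event": "create a calendar event with a date and time",
}

selected = top_k_tools("what is the weather forecast in paris", TOOLS, k=1)
print(selected)  # only these function schemas would go into the request
```

Since the functions are already grouped into related files, another option in the same spirit would be to retrieve at the file level first and then include every function from the best-matching file.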

late lichen Jan 18, 2025, 2:55 PM

#

uhm.... training llm using my discord data is legal???

lapis sequoia Jan 18, 2025, 3:05 PM

#

What does game have to do with RL?

agile cobalt Jan 18, 2025, 3:30 PM

#

late lichen uhm.... training llm using my discord data is legal???

Training it using your personal data export, including only your own messages, is probably fine - but may not deliver good results as it'll be very out of context
Training it on data scraped from discord including other people's messages is not cool

fervent canopy Jan 18, 2025, 5:50 PM

#

pine escarp Hello. What are some advanced projects i can add in my portfolio?

https://github.com/SanshruthR/WGAN-GP

GitHub

GitHub - SanshruthR/WGAN-GP: Advanced GAN architecture designed for...

Advanced GAN architecture designed for generating high-quality images. - SanshruthR/WGAN-GP

flat token Jan 18, 2025, 6:06 PM

#

fervent canopy https://github.com/SanshruthR/WGAN-GP

I heard about this guy when I did my undergrad at NYU

neat sparrow Jan 18, 2025, 6:12 PM

#

neat sparrow Okay, if I can't do it cheaply, then what would it be?

Nobody answered that ^

proven pier Jan 18, 2025, 6:42 PM

#

Are exponential based reward mechanisms good for reinforcement learning? Should provide globally differentiable training feedback?

fervent canopy Jan 18, 2025, 6:54 PM

#

neat sparrow Nobody answered that ^

I would suggest runpod. Try applying for aws credits if it's research based. But, runpod is as cheap as it gets.

fervent canopy Jan 18, 2025, 6:55 PM

#

flat token I heard about this guy when I did my undergrad at NYU

lol yep it took me quite a bit to understand the entire working of that and write the code

fervent canopy Jan 18, 2025, 6:56 PM

#

tawdry sundial i am making an agent with lots of functions to use in function call, i assume ad...

is it domain based?

#

So, you can easily run a highly quantized model on cpus without even using a gpu and they perform quite well

#

you just need to know where to look tbh lol

#

https://github.com/SanshruthR/CPU_BlazeChat This might be useful for you

GitHub

GitHub - SanshruthR/CPU_BlazeChat: Generate text and images using t...

Generate text and images using the CPU. Contribute to SanshruthR/CPU_BlazeChat development by creating an account on GitHub.

fervent canopy Jan 18, 2025, 7:06 PM

#

proven pier Are exponential based reward mechanisms good for reinforcement learning? Should ...

RL is genuinely trial and error; there's no definite approach or guide to what would give the best output. You'd have to monitor the model quite closely, as exponential rewards can lead to gradient explosion, but you can always implement gradient clipping etc. TL;DR: their effectiveness depends on the problem being solved.
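For anyone unfamiliar with gradient clipping, here is a framework-free sketch of clipping by global norm. The flat list of floats stands in for a model's gradients and `max_norm` is an arbitrary example value; deep learning frameworks ship their own utilities for this.

```python
import math

def clip_by_global_norm(grads, max_norm):
    """Rescale the gradient vector so its L2 norm is at most max_norm."""
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm <= max_norm or total_norm == 0.0:
        return list(grads)              # already small enough, leave unchanged
    scale = max_norm / total_norm
    return [g * scale for g in grads]   # same direction, shorter length

# A large gradient (norm 5.0) is scaled down to norm 1.0.
print(clip_by_global_norm([3.0, 4.0], max_norm=1.0))
```

Clipping keeps the update direction and only limits its size, so an occasional huge reward cannot blow up the weights in a single step.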

spring field Jan 18, 2025, 8:04 PM

#

TL;DR RL is hard

inland crown Jan 18, 2025, 8:20 PM

#

Are there any message board or social media site scripts? I don't know if it would be easier to start from scratch these days or to port my 20 year old PERL scripts. My searches keep pulling up spambots for various platforms instead of software for platforms.

rich moth Jan 18, 2025, 8:38 PM

#

inland crown Are there any message board or social media site scripts? I don't know if it wou...

Lets build a modern one using a MERN stack.

├── backend/
│   ├── models/
│   │   ├── User.js
│   │   └── Post.js
│   ├── routes/
│   │   ├── auth.js
│   │   └── posts.js
│   ├── middleware/
│   │   └── authMiddleware.js
│   ├── server.js
│   └── config/
│       └── db.js
├── frontend/
│   ├── public/
│   └── src/
│       ├── components/
│       │   ├── Auth/
│       │   │   ├── Login.js
│       │   │   └── Register.js
│       │   ├── Posts/
│       │   │   ├── CreatePost.js
│       │   │   └── PostList.js
│       │   └── Layout/
│       │       └── Navbar.js
│       ├── context/
│       │   └── AuthContext.js
│       ├── App.js
│       ├── index.js
│       └── api.js
├── .env
└── package.json

inland crown Jan 18, 2025, 8:50 PM

#

is it really just that easy these days? LOL!

#

So, how do I implement that at Blahblah.com?

#

(that's really the name, it's not a placeholder)

#

f u d g e... (only he didn't say fudge)

rich moth Jan 18, 2025, 9:11 PM

#

I rewrote the entire thing with security, UI, WebSockets and everything cool. It works a bit like twitter but with some unique differences. It uses Go for the backend and Angular for the front. I'll paste it in the other channel

inland crown Jan 18, 2025, 9:39 PM

#

How do I implement it for testing? I haven't had a good system at blahblah in a long time and with everyone bailing on FB and X it really would be the PERFECT time!

#

I had the popular message boards before FB took over.

#

lol

#

I can set up a cloud account with a subdomain like blah.blahblah.com ( I think ,I've never actually done that yet lol)

rich moth Jan 18, 2025, 11:06 PM

#

i bought a domain, pyposh.org, a while back; we can test it on that. i bought it via the google cloud platform

inland crown Jan 18, 2025, 11:28 PM

#

ok, we could also use blahblah.net or blahblah.org, I don't have anything there yet.

#

zencoder just totally screwed my code.. Been trying to get it back this whole time... UGH...

rich moth Jan 19, 2025, 12:08 AM

#

https://github.com/plunder707/social-experiment

GitHub

GitHub - plunder707/social-experiment: A social media platform buil...

A social media platform built using a Go backend with an Angular frontend to provide seamless user authentication, efficient post management, and real-time updates through WebSockets. - plunder707/...

spring field Jan 19, 2025, 12:21 AM

#

what is this?
SEaaS: Social Experiment as a Service? ducky_skull

rich moth Jan 19, 2025, 2:11 AM

#

Its main goal is to facilitate user engagement and interactions through seamless content sharing. For now it lets you register, log in, and create and share posts in real time. It's pretty simple right now; I'm gonna add more features for content sharing.

inland crown Jan 19, 2025, 3:36 AM

#

Slowly making progress

#

opaque condor Jan 19, 2025, 5:26 AM

#

could i have some help with pytorch using Visual Studio Code, because i don't understand the documentation that I've read through.

odd meteor Jan 19, 2025, 7:26 AM

#

opaque condor could i have some help with ``pytorch`` using *Visual code studio* be cause i d...

Can you provide more context?

lilac crest Jan 19, 2025, 10:24 AM

#

i dont understand why newTen is not getting updated after i call append on sum/4 over newTen

weary timber Jan 19, 2025, 11:03 AM

#

when i try to print out the weights of my nets after training, nothing seems to have changed even when the net is trained well. like, it has an accuracy of 94% but the weights are all printed out the same. can someone help me with this?

weary timber Jan 19, 2025, 11:29 AM

#

forgot to tell , pytorch

tawdry sundial Jan 19, 2025, 12:22 PM

#

fervent canopy is it domain based?

i am working on the project to learn RAG and llm techniques, not trying to cut down costs by using cheaper models

fervent canopy Jan 19, 2025, 12:23 PM

#

weary timber when i try to print out the weights of my net's after training, nothing seems to...

check if `requires_grad` is set to True on the parameters. If you are using pretrained weights they are already good, so they wouldn't update that much. You can also try raising the lr

#

like use 1e-2 or something idk

fervent canopy Jan 19, 2025, 12:25 PM

#

tawdry sundial i am working on the project to learn RAG and llm techniques, not trying to cut d...

which vector database are you using to learn RAG, are you following a yt tutorial or a course?

#

and do you wanna run a llm on your own server/ machine or do you prefer an api response?

weary timber Jan 19, 2025, 1:59 PM

#

fervent canopy check if the requires grad is set to True, if you are using pretrained weights ...

it is a siamese net i coded myself

#

and i looked up a tutorial online. in the video the code works for the guy, so i copied the exact code from the video to check if something's wrong on my end, and yeah, the code from the video doesn't work on my pc

#

when i run it

rancid sorrel Jan 19, 2025, 2:27 PM

#

inland crown How do I implement it for testing? I haven't had a good system at blahblah in a ...

just create a Dockerfile and spin up the MERN stack

#

https://hub.docker.com/r/03192859189254/node-mern-stack/
just drop your code into it using
COPY /filepathonhost/ /containerdestination

#

that's assuming you don't want persistence; if you do, you would create/use a volume

opaque condor Jan 19, 2025, 4:02 PM

#

odd meteor Can you provide more context?

I don't understand the documentation, and I lose my place when I read it and just get confused. I understand what a tensor is: it's just an array of numbers, which could be an image broken into numerical sequences. I understand tokenizing text, etc.

rancid sorrel Jan 19, 2025, 4:08 PM

#

that's about right: for an image, the tensor is a grid of pixel values

#

but it's a tensor vs a vector because it usually has more than one axis (height, width, channels); a vector is the 1-D case

fervent canopy Jan 19, 2025, 5:39 PM

#

weary timber it is a siamese net i coded myself

I think the main problem could be the sharing of weights, make sure you are sharing the weights and not instantiating two separate models.

#

and try to run that in a cloud environment

#

like kaggle or colab

#

also check if optimizer.step() is being called after loss.backward()

weary timber Jan 19, 2025, 7:31 PM

#

fervent canopy I think the main problem could be the sharing of weights, make sure you are shar...

ohhhhhhhhhhhhh

#

i did it but didnt work

#

you want me to send the codE?

fervent canopy Jan 19, 2025, 9:53 PM

#

weary timber you want me to send the codE?

please do that

opaque condor Jan 19, 2025, 9:59 PM

#

rancid sorrel but its a tensor vs a vector because it contains its start cordinates usally

would you like the link to my live share

rancid sorrel Jan 19, 2025, 10:00 PM

#

it's a bit late where i am to do much stuff like that 😉

opaque condor Jan 19, 2025, 10:01 PM

#

rancid sorrel its a bit late where i am do do much of stuff like that 😉

I'm sorry i had a few places to go and time got away from me

rancid sorrel Jan 19, 2025, 10:02 PM

#

its cool, sadly i am on hol so not around much till thursday really, i just pop in here for a quick read this week

opaque condor Jan 19, 2025, 10:05 PM

#

should I put the link in anyway?

rancid sorrel Jan 19, 2025, 10:09 PM

#

sure others will love to review

devout cloak Jan 19, 2025, 10:24 PM

#

I made a cool thing from a paper yesterday, It is a CNN that learns the group of transformations on an image by encoding within an embedding for a CNN

opaque condor Jan 19, 2025, 10:46 PM

#

rancid sorrel sure others will love to review

i haven't made anything yet, i need help learning it from the beginning

opaque condor Jan 19, 2025, 11:33 PM

#

https://prod.liveshare.vsengsaas.visualstudio.com/join?F0FFF0F0DE287C908B86DD92DCC6C9BB23EE

Visual Studio Code for the Web

Build with Visual Studio Code, anywhere, anytime, entirely in your browser.

worldly wagon Jan 20, 2025, 12:50 AM

#

does anyone have issues with pos_tag and the lemmatizer of nltk

serene scaffold Jan 20, 2025, 12:55 AM

#

worldly wagon does anyone have issues with pos_tag and the lemmatizer of nltk

use spacy for this and do not use nltk.

worldly wagon Jan 20, 2025, 12:57 AM

#

serene scaffold use spacy for this and do not use nltk.

😭 will do

serene scaffold Jan 20, 2025, 12:59 AM

#

In [1]: import spacy

In [2]: nlp = spacy.load('en_core_web_sm')

In [3]: doc = nlp("the boy walked to the store")

In [4]: doc
Out[4]: the boy walked to the store

In [5]: list(doc)
Out[5]: [the, boy, walked, to, the, store]

In [6]: doc[2]
Out[6]: walked

In [7]: doc[2].has_morph()
Out[7]: True

In [8]: doc[2].suffix
Out[8]: 13622047838477328034

In [9]: doc[2].suffix_
Out[9]: 'ked'

worldly wagon Jan 20, 2025, 12:59 AM

#

serene scaffold ```ipython In [1]: import spacy In [2]: nlp = spacy.load('en_core_web_sm') In ...

🤔 why was ked returned and not ed?

serene scaffold Jan 20, 2025, 12:59 AM

#

worldly wagon 🤔 why was ked returned and not ed?

yeah, idk

#

kinda sus

#

In [10]: doc[2].morph
Out[10]: Tense=Past|VerbForm=Fin
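
On the `ked` question: as far as I know, spaCy documents `Token.suffix_` as simply the last N characters of the token text (N=3 by default), not a morphological ending, so `'ked'` would be expected for `walked`. A minimal pure-Python sketch of that behaviour; `last_n_chars` is a made-up helper for illustration, not a spaCy function:

```python
def last_n_chars(text: str, n: int = 3) -> str:
    """Mimic a fixed-length character suffix: the last n characters of text."""
    return text[-n:]

# Words shorter than n come back unchanged
words = ["walked", "store", "to"]
suffixes = {w: last_n_chars(w) for w in words}
print(suffixes)
```

The linguistic information (tense, verb form) lives in `doc[2].morph`, as the last cell shows, rather than in the character suffix.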

worldly wagon Jan 20, 2025, 1:01 AM

#

ahh i'll go check it out myself

ornate iris Jan 20, 2025, 3:14 AM

#

I started developing this method called horizon mapping that's kind of a higher-level partner to MCTS. It's supposed to analyze upper decision boundaries, compute the entropy of decision trees, help identify horizon points (points of uncertainty) to aid in triggering surprise minimization, and generate adversarial interactions. Overall, it should find and visualize areas where the model can train and adapt. And the damn thing just won't work.

Even though everything looks right, imports then logging for global mapping, it just seems like one of those weird things with programming where a file just wont initialize properly. So I'm taking a break.

@left tartan but to answer your question, I'm not training it yet with the new method I'm just trying to get through the errors it's causing with the system.

left tartan Jan 20, 2025, 3:14 AM

#

ornate iris I started developing this method called horizon mapping thats kind of a higher l...

What kind of errors?

ornate iris Jan 20, 2025, 3:21 AM

#

left tartan What kind of errors?

The logging is saying it's not defined, but the errors are popping up on lines 90 and 113. Meanwhile the reference for global mapping is on line 80. So maybe the problem is in the file itself for the resilient error guard.

left tartan Jan 20, 2025, 3:21 AM

#

ornate iris The logging is saying it's not defined but the errors are popping up on 90 and 1...

Could you open a help thread and paste code and exact error?

#

#❓｜how-to-get-help

ornate iris Jan 20, 2025, 3:23 AM

#

Actually thanks for being a rubber duck! Just figured it out!

tawdry sundial Jan 20, 2025, 11:06 AM

#

fervent canopy which vector database are you using to learn RAG, are you following a yt tutoria...

VectorStore and i use VectorStoreIndex for retrieving.

#

currently i am scraping information from yt, docs to learn how to efficiently design the workflow

tawdry sundial Jan 20, 2025, 11:09 AM

#

fervent canopy and do you wanna run a llm on your own server/ machine or do you prefer an api r...

the main llm (and the only one currently) is gpt 4 mini

#

so my current plan is to use llamaindex dataloader docstringwalker to get all the python functions (the functions are split into files based on relevancy and dependency), store it in VectorStore (vector db), then retrieve relevant file(s) with VectorStoreIndex

#

then query the llm

#

i am currently not sure which RAG and retrieval process i will use, there are a lot of options. I am using top_k 2 at the moment

ashen latch Jan 20, 2025, 11:49 AM

#

There is a question in my head

With the current developments in deep learning, do traditional ML algorithms such as SVM, decision trees, k-means, etc. still need to be known, or can someone who wants to specialize in ML research skip them and focus only on deep learning?

weary timber Jan 20, 2025, 12:10 PM

#

fervent canopy please do that

https://paste.pythondiscord.com/2Y2Q

#

the dataset im using is at&t

#

the loss seems to go down but the accuracy and weights dont change

rancid sorrel Jan 20, 2025, 12:42 PM

#

Just don't do it in danish or some non English languages

#

As the postfix is incredibly important there

kindred fable Jan 20, 2025, 2:14 PM

#

ashen latch There is a question in my head Currently, with the development in Deep Learning...

I would personally say yes, as some problems might be better off with a more classic ML model than a deep learning model.

As an extra, this also builds up a base of understanding of how you would tackle problems instead of always using deep learning; a broader arsenal is never bad.

ashen latch Jan 20, 2025, 2:20 PM

#

kindred fable I would personally say yes, as some problems might be better off with a more cla...

I am interested in ML Optimization research.

regal sun Jan 20, 2025, 2:22 PM

#

What are some recommended ways to set up a version control system for a small data science project utilizing Jupyter Notebook? I am considering using GitHub as I will be working with a group of friends, but it seems like the notebook metadata differs between our devices.

Sorry if this is the wrong channel to ask this question. Redirect me to the correct one if necessary, thanks!

kindred fable Jan 20, 2025, 2:33 PM

#

ashen latch I am interested in ML Optimization research.

I would still say yes as it builds an understanding of the fundamental concepts as these are often the foundations for more advanced ML concepts.

I would suggest starting with the basics like linear regression, logistic regression, ... and focusing on understanding the optimization methods like gradient descent and quadratic programming.

Once you get this, you can start by implementing easy deep learning models with optimizations.

If you want you can quickly go over the "classic" ML but i wouldn't skip out on it entirely.

kindred fable Jan 20, 2025, 2:44 PM

#

regal sun What are some recommended ways to set up a version control system for a small da...

You can try to use something like nbstripout. This removes cell outputs and metadata when you commit.

I've never used it myself but you can surely try it out.

You can also use google Colab.
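
To give a rough idea of what "stripping" a notebook means: an `.ipynb` file is JSON, and the parts that differ between machines are mostly each code cell's `outputs`, `execution_count`, and cell `metadata`. The sketch below is a simplified illustration using only the standard library; `strip_notebook` is a hypothetical helper, not how nbstripout is actually implemented, and in practice you would use the tool itself.

```python
import copy
import json

def strip_notebook(nb: dict) -> dict:
    """Return a copy of a notebook dict with outputs and run counters removed."""
    clean = copy.deepcopy(nb)
    for cell in clean.get("cells", []):
        cell["metadata"] = {}  # drop per-cell, per-machine metadata
        if cell.get("cell_type") == "code":
            cell["outputs"] = []            # drop rendered results
            cell["execution_count"] = None  # drop the In[n] counter
    return clean

# Tiny made-up notebook, only for demonstration
notebook = {
    "cells": [
        {"cell_type": "code", "execution_count": 7,
         "metadata": {"collapsed": False},
         "outputs": [{"output_type": "stream", "name": "stdout", "text": ["hi\n"]}],
         "source": ["print('hi')"]},
        {"cell_type": "markdown", "metadata": {}, "source": ["# Notes"]},
    ],
    "metadata": {},
    "nbformat": 4,
    "nbformat_minor": 5,
}

stripped = strip_notebook(notebook)
print(json.dumps(stripped["cells"][0], indent=2))
```

With outputs and counters gone, two people running the same cells produce the same file, so Git diffs only show real source changes.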

ashen latch Jan 20, 2025, 2:45 PM

#

kindred fable I would still say yes as it builds an understanding of the fundamental concepts ...

Okay thank you ❤️

weary timber Jan 20, 2025, 4:18 PM

#

weary timber https://paste.pythondiscord.com/2Y2Q

can someone help me with this

#

the loss goes down but the accuracy doesnt go up

#

im stuck atp

limber belfry Jan 20, 2025, 4:59 PM

#

How to make real time object detection with a yolo11 model that i trained? The code part

kindred fable Jan 20, 2025, 4:59 PM

#

weary timber can someone help me with this

Do you have an example of the output?

rancid sorrel Jan 20, 2025, 5:49 PM

#

regal sun What are some recommended ways to set up a version control system for a small da...

Git

blazing wedge Jan 20, 2025, 6:04 PM

#

Does anyone know optimization well?

rancid sorrel Jan 20, 2025, 6:05 PM

#

Hyper parameter tuning and pipelines

#

https://www.tensorflow.org/tutorials/keras/keras_tuner

TensorFlow

Introduction to the Keras Tuner | TensorFlow Core

#

You're also gonna need this
https://www.tensorflow.org/api_docs/python/tf/keras/callbacks/Callback
And tensorboard

TensorFlow

tf.keras.callbacks.Callback | TensorFlow v2.16.1

Base class used to build new callbacks.

fervent canopy Jan 20, 2025, 6:08 PM

#

weary timber https://paste.pythondiscord.com/2Y2Q

Maybe try this `# Before training
initial_weights = {name: param.detach().clone() for name, param in siamese_net.named_parameters()}

# After training
for name, param in siamese_net.named_parameters():
    diff = torch.sum(torch.abs(initial_weights[name] - param.data))
    print(f"Parameter {name} changed by: {diff.item()}") `

serene scaffold Jan 20, 2025, 6:08 PM

#

blazing wedge Does anyone know optimization well?

remember to always always ask your actual question. don't ask if someone knows about the topic of a secret question.

fervent canopy Jan 20, 2025, 6:09 PM

#

tawdry sundial currently i am scraping information from yt, docs to learn how to efficiently de...

they have a really good mongodb course so use that and maybe try using faiss library

rancid sorrel Jan 20, 2025, 6:10 PM

#

Are you trying to optimise by speed or accuracy is a really good thing to know too

fervent canopy Jan 20, 2025, 6:13 PM

#

tawdry sundial the main llm (and the only one currently) is gpt 4 mini

try using groq and reduce the tokenisation length and use faiss

kindred fable Jan 20, 2025, 6:46 PM

#

How would you guys handle extracting specific data from insurance documents?
Right now I can extract all the text using pdfplumber and OCR, but I still need to extract the specific fields like names, conditions, dates, ...

The data should be put into a CSV

Note: I cant share the data here because its sensitive data that falls under an NDA

rancid sorrel Jan 20, 2025, 6:54 PM

#

Use ms vision

#

Very good very cheap

#

On azure

#

https://azure.microsoft.com/en-us/products/ai-services/ai-vision

Azure AI Vision with OCR and AI | Microsoft Azure

Accelerate computer vision development with Microsoft Azure. Get insights from image and video content using OCR, object detection, and image analysis.

rancid sorrel Jan 20, 2025, 7:22 PM

#

fyi there is a specific document OCR and its even cheaper than the main AI

young cairn Jan 20, 2025, 7:24 PM

#

I imported dotenv but it still says 'module not found'
this is how i did it:
from dotenv import load_dotenv

any idea why it's throwing this err?

#

pip install python-dotenv
this is what i installed

rancid sorrel Jan 20, 2025, 7:28 PM

#

Is that not venv?

#

Or am I thinking something else
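
A common cause of this (an assumption here, since the full error isn't shown) is that `pip` installed `python-dotenv` into a different interpreter or virtual environment than the one running the script. A quick stdlib-only check of which interpreter is running and whether it can see the `dotenv` module; `module_visible` is just an illustrative helper name:

```python
import importlib.util
import sys

def module_visible(name: str) -> bool:
    """True if the running interpreter can locate the named top-level module."""
    return importlib.util.find_spec(name) is not None

print("interpreter:", sys.executable)
print("dotenv importable:", module_visible("dotenv"))
```

If this prints `False`, installing with that same interpreter (`python -m pip install python-dotenv`) usually lines the two up.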

weary timber Jan 20, 2025, 7:39 PM

#

kindred fable Do you have an example of the output?

it simply goes down like to 0.003 from 1.1

#

and when i test it with the test data

#

its very bad

weary timber Jan 20, 2025, 7:39 PM

#

fervent canopy Maybe try this `# Before training initial_weights = {name: param.clone() for nam...

ok wait

#

#

strotmic this is for you

#

example output

kindred fable Jan 20, 2025, 8:10 PM

#

and the accuracy you're talking about, is this accuracy on the test or train set?

weary timber Jan 20, 2025, 8:32 PM

#

kindred fable and the accuracy your talking about is this accuracy on test or train set?

oh the accuracy wait lemme send that too

#

wait wtf

#

i changed a tiny thing and now the accuracy comes pretty high

#

#

the reason i have test epochs is that the test count is only 30 and it selects random photos as test elements, so to get a clearer result

#

i added test epoch

rancid sorrel Jan 20, 2025, 8:45 PM

#

is this time series data?

#

is
1. your data time shifted?
2. shuffle off?
are you not splitting 80:20 randomly using sklearn, but instead slicing the data like
df[:80]

#

```py
split_index = int(len(df) * 0.8)
train_df = df[:split_index]
test_df = df[split_index:]
```

serene scaffold Jan 20, 2025, 9:05 PM

#

(make sure that you shuffle the data before doing that, and use iloc)

#

@rancid sorrel ^

rancid sorrel Jan 20, 2025, 9:06 PM

#

if you're doing time series you explicitly don't shuffle

#

or it screws you

serene scaffold Jan 20, 2025, 9:07 PM

#

I see

rancid sorrel Jan 20, 2025, 9:07 PM

#

cause it gives you 100% accuracy

weary timber Jan 20, 2025, 9:15 PM

#

rancid sorrel is this time serise data?

are you talking to me?

#

it is at&t

#

face

rancid sorrel Jan 20, 2025, 9:16 PM

#

oh

spring field Jan 20, 2025, 9:47 PM

#

rancid sorrel cause it gives you 100% accuracy

How does shuffling time series give you 100% accuracy? Like what's the intuition behind it?

rancid sorrel Jan 20, 2025, 9:48 PM

#

spring field How does shuffling time series give you 100% accuracy? Like what's the intuition...

cause for time series data it predicts the missing data,
if you're trying to predict x+1, you include the gradient of x+1 in your training data by shuffling and then splitting

#

essentially you're near enough including testing data in your training data to bork your model
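
A small stdlib-only sketch of that leakage on a made-up series of 100 timestamps: with a shuffled split, test timestamps typically have an immediate neighbour (t-1 or t+1) sitting in the training set, so the model has effectively seen them; with a chronological split only the first test point does. `neighbour_leak_fraction` is a hypothetical helper written for this illustration.

```python
import random

def neighbour_leak_fraction(train_idx, test_idx):
    """Fraction of test timestamps whose immediate neighbour (t-1 or t+1) is in train."""
    train = set(train_idx)
    leaked = [t for t in test_idx if (t - 1) in train or (t + 1) in train]
    return len(leaked) / len(test_idx)

n = 100
timestamps = list(range(n))
split = int(n * 0.8)

# Chronological split: train on the past, test on the future
chrono_train, chrono_test = timestamps[:split], timestamps[split:]

# Shuffled split: random 80/20, ignoring time order
rng = random.Random(0)
shuffled = timestamps[:]
rng.shuffle(shuffled)
shuf_train, shuf_test = shuffled[:split], shuffled[split:]

chrono_leak = neighbour_leak_fraction(chrono_train, chrono_test)
shuf_leak = neighbour_leak_fraction(shuf_train, shuf_test)
print(f"chronological: {chrono_leak:.2f}, shuffled: {shuf_leak:.2f}")
```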

fervent canopy Jan 20, 2025, 10:52 PM

#

This project demonstrates real-time object detection using the YOLOv5n6 model with low-resolution inference for high-speed processing, while drawing the results on high-resolution frames.
https://github.com/SanshruthR/CCTV_YOLO

GitHub

GitHub - SanshruthR/CCTV_YOLO: Fast Real-time Object Detection with...

Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 - SanshruthR/CCTV_YOLO

molten elk Jan 20, 2025, 11:34 PM

#

I would like to do the following. How much math do I need to learn?

Finding the winning strategy in a card game
Assessing online ad clicks for significance
Tracking disease outbreaks using news headlines
Using online job postings to improve your data science resume
Predicting future friendships from social network data

weary timber Jan 20, 2025, 11:34 PM

#

fervent canopy This project demonstrates real-time object detection using the YOLOv5n6 model wi...

what do i do and for how long to get good at ml like you?

obtuse perch Jan 21, 2025, 3:02 AM

#

I can't do this least square fit in excel I don't know why. Could python work well and how to do that? or Mathematica

fervent canopy Jan 21, 2025, 6:08 AM

#

weary timber what do i do and for how long to get good at ml like you?

dm'ed you

#

cuz i don't wanna send a block of text here lol

fervent canopy Jan 21, 2025, 6:13 AM

#

obtuse perch I can't do this least square fit in excel I don't know why. Could python work we...

use =PY in excel

#

and write that in python
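
For a straight-line fit, least squares has a closed form, so plain Python is enough. The data below is made up for illustration and `fit_line` is just an example helper; for more complex model functions, `scipy.optimize.curve_fit` is the usual tool (not shown here).

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations in x, and co-deviations of x and y
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Made-up data lying exactly on y = 2x + 1
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```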

fervent canopy Jan 21, 2025, 6:24 AM

#

molten elk I would like to do the ff: How much math do I need to learn? - Finding the winni...

lol that's a really vague question. There is no such thing as a bare minimum in maths; every problem can be solved with different methods. Please pardon my crude analogy here, but it's like saying "hey, I want to kill a person, what should I use?" You can use a bat, a gun, a rocket launcher or just your fists. So it all depends on what you are trying to use. For example, if you need to find the value of tan 37, you can use algebraic methods, geometric methods or, idk, a Taylor series. You can use high school maths or advanced calculus and it's going to give you the same thing. One would take, idk, 3 pages and the other would solve it in 2 lines; one requires little knowledge and the other requires knowledge of calculus and permutations. So just dive into it, man, start solving, and you'll learn that stuff as you progress 🙂

hallow badger Jan 21, 2025, 9:33 AM

#

Is DeepSeek R1 better than V3?

rancid sorrel Jan 21, 2025, 9:43 AM

#

obtuse perch I can't do this least square fit in excel I don't know why. Could python work we...

Excel supports python

rancid sorrel Jan 21, 2025, 10:53 AM

#

hello

rancid sorrel Jan 21, 2025, 11:32 AM

#

sigh .py removed his post now i look like a muppet

tender hearth Jan 21, 2025, 12:22 PM

#

hallow badger Deepseek R1 better V3?

it's v3 with CoT-reasoning

#

if you read the paper, you'll see they used v3 and then some fancy RL for the CoT training

umbral tide Jan 21, 2025, 2:42 PM

#

How to build a simple personal chatbot👇🏻
https://www.wahyuikbal.web.id/blog/AI-Engineer-How-to-Integrate-a-Ai-into-Your-Personal-Website

Wahyu ikbal website

AI Engineer: How to Integrate a Ai into Your Personal Website

Do you feel FOMO about the current pace of AI development? Want to learn AI but not sure where to start? As an IT student, the first step you can take is to build an AI application

somber fractal Jan 21, 2025, 6:53 PM

#

i'm having a problem with a text classification task using the hf transformers bert library

#

anyone has experience with that?

serene scaffold Jan 21, 2025, 6:59 PM

#

somber fractal anyone has experience with that?

be sure to never ask if someone has enough experience to answer your question. just ask your whole question. give enough information for someone to start answering right away.

lapis sequoia Jan 21, 2025, 7:50 PM

#

fervent canopy This project demonstrates real-time object detection using the YOLOv5n6 model wi...

this is short code can you explain in vc

fervent canopy Jan 21, 2025, 9:27 PM

#

lapis sequoia this is short code can you explain in vc

sure

oblique comet Jan 21, 2025, 11:32 PM

#

what

#

why are these even different

serene scaffold Jan 21, 2025, 11:36 PM

#

oblique comet what

remember to always give text as text and not as a screenshot. if this is part of an error message, please give the whole error message, including the parts that you don't think are important.

oblique comet Jan 21, 2025, 11:37 PM

#

i am not asking for help here actually, just rambling and being annoyed of pytorch

#

why do two incompatible types for float exist, torch.cuda.HalfTensor and torch.HalfTensor

serene scaffold Jan 21, 2025, 11:38 PM

#

oblique comet why do two incompatible types for float exist, torch.cuda.HalfTensor and torch.H...

because things on the GPU are different than those on the CPU

oblique comet Jan 21, 2025, 11:38 PM

#

hm

serene scaffold Jan 21, 2025, 11:39 PM

#

if you decide that you want help, show the code and the error message, and I or someone else might take a look.

oblique comet Jan 21, 2025, 11:39 PM

#

alright, one second.

fallow coyote Jan 21, 2025, 11:41 PM

#

when learning the mathematics for ML, what topics should I focus on more? Bear in mind, I will be learning univariate and multivariate calculus, as well as some introductory lessons on matrices later in the semester at my uni

oblique comet Jan 21, 2025, 11:42 PM

#

https://paste.pythondiscord.com/OKMQ basically the stuff around line 71 in load_model seems to be loaded to cpu in that case

serene scaffold Jan 21, 2025, 11:42 PM

#

fallow coyote when learning the mathematics for ML, what topics should I focus on more? Bare i...

missing from what you said is probability theory

oblique comet Jan 21, 2025, 11:43 PM

#

oblique comet https://paste.pythondiscord.com/OKMQ basically the stuff around line 71 in load_...

already struggled an hour here to get bitsandbytes to load it on the device I want using .to() or device_map but both are not supported yet; so I came up with torch.cuda.set_device(0) instead

serene scaffold Jan 21, 2025, 11:43 PM

#

@oblique comet and also the whole error message. (remember to always post both at the same time.)

oblique comet Jan 21, 2025, 11:44 PM

#

serene scaffold @oblique comet and also the whole error message. (remember to always post...

https://paste.pythondiscord.com/JY2Q

fallow coyote Jan 21, 2025, 11:45 PM

#

serene scaffold missing from what you said is probability theory

I'm struggling to find good resources for learning the statistical/probability side of ML. Maths has always been my strongest subject, so I don't struggle with learning the maths itself (even with limited knowledge); it's the resources I can't find. I'm going through ISLP and I understand the maths, but I want to understand it further so I fully know what the values are saying

serene scaffold Jan 21, 2025, 11:46 PM

#

@oblique comet I've never seen all these extra cuda settings (like torch.backends.cuda.enable_cudnn_sdp), but hopefully someone who's experienced in that area will come along.

oblique comet Jan 21, 2025, 11:48 PM

#

disabling cudnn sdp was required for para attention; i later replaced that one with teacache instead so that part is obsolete
the error remains sadly even if removing it

#

just in case; https://paste.pythondiscord.com/VVSQ here is a simplified version that still produces the same error without all the extras

oblique comet Jan 21, 2025, 11:51 PM

#

serene scaffold @oblique comet I've never seen all these extra cuda settings (like `torch...

thanks for looking into it at least

#

adding device_map="balanced" to LTXImageToVideoPipeline fixed it for some reason

thorny geode Jan 22, 2025, 1:32 AM

#

hello, this is pretty much off topic, but i am a high schooler trying to decide whether i should focus on my data and statistics research instead of improving my grades (my average is around 92), since I am still not sure which of the two universities care about for scholarships

serene scaffold Jan 22, 2025, 2:05 AM

#

thorny geode hello, this is a pretty much out of topic, but i am a highschooler trying to cho...

Your grades are probably more important

#

When you say "research" can you be very extra specific about the context and objectives? @thorny geode

#

@thorny geode I need to know if this "research" is a personal side project, or something you're doing in an official capacity.

warm copper Jan 22, 2025, 2:45 AM

#

0.0

warm copper Jan 22, 2025, 2:46 AM

#

serene scaffold @thorny geode I need to know if this "research" is a personal side proje...

I freaked out the server by talking about recall and precision in off topic channel 😦

serene scaffold Jan 22, 2025, 2:57 AM

#

warm copper I freaked out the server by talking about recall and precision in off topic chan...

Why would that freak people out

thorny geode Jan 22, 2025, 3:21 AM

#

serene scaffold When you say "research" can you be very extra specific about the context and obj...

My bad

thorny geode Jan 22, 2025, 3:24 AM

#

serene scaffold @thorny geode I need to know if this "research" is a personal side proje...

I am planning to win my national research competition around the end of this year, or at least place at city level. Competition in the mathematics/statistics field in my country is very scarce. For example, the city regional winner only used ANOVA as its main methodology.

#

For context, I've been steadily working through the book Introduction to Statistical Programming with only basic statistical knowledge from high school, such as expected value and distributions, and I'm a bronze national olympiad winner in mathematics for general skills (in junior high school though)

warm copper Jan 22, 2025, 3:27 AM

#

ANOVAAAA

#

bring the t-test

#

and f-statistics

#

statistical programming usually focuses on R

#

have you ever used R?

thorny geode Jan 22, 2025, 3:28 AM

#

But I don’t know how this can even contribute to my probability of getting a scholarship (or maybe some internship opportunities), since my teachers suggest improving my scores, while edu fairs and university seminars just give a vague idea of “good academic record, extracurricular activities, leadership” stuff

thorny geode Jan 22, 2025, 3:29 AM

#

warm copper bring the t-test

I did use a t-test to win my first “mathematical modelling” competition, even though it was mostly just statistics

warm copper Jan 22, 2025, 3:30 AM

#

I mean f test is preferred for ANOVA

thorny geode Jan 22, 2025, 3:30 AM

#

warm copper have you ever used R?

I have used R before, but I prefer Python as most of the machine learning models are based on Python, so now I have a good grasp on using pandas and matplotlib

warm copper Jan 22, 2025, 3:30 AM

#

you would get much more information with R if your aim is just statistical programming

#

ML and AI use Python

#

for example for ANOVA you need to focus on F-statistics

thorny geode Jan 22, 2025, 3:32 AM

#

yes, of course, since ANOVA compares the means of more than 2 groups, and the F-statistic is made for that

warm copper Jan 22, 2025, 3:32 AM

#

if you are gonna use ANOVA for the championship

#

focus on F-statistics

#

😄

thorny geode Jan 22, 2025, 3:33 AM

#

thank you for the info

warm copper Jan 22, 2025, 3:33 AM

#

you can use F statistics for 2 variables too

#

For two variables (i.e. two groups) the F-statistic in ANOVA is the square of the pooled-variance t-statistic
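
A quick numeric check of that identity in plain Python, on made-up samples. It assumes the equal-variance (pooled) two-sample t-test and one-way ANOVA; `pooled_t` and `one_way_f` are helper names written just for this sketch.

```python
import math
from statistics import mean

def pooled_t(a, b):
    """Two-sample t-statistic with pooled (equal) variance."""
    na, nb = len(a), len(b)
    ma, mb = mean(a), mean(b)
    ss_a = sum((x - ma) ** 2 for x in a)
    ss_b = sum((x - mb) ** 2 for x in b)
    pooled_var = (ss_a + ss_b) / (na + nb - 2)
    return (ma - mb) / math.sqrt(pooled_var * (1 / na + 1 / nb))

def one_way_f(*groups):
    """One-way ANOVA F-statistic: between-group mean square over within-group mean square."""
    all_values = [x for g in groups for x in g]
    grand = mean(all_values)
    k, n = len(groups), len(all_values)
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

# Made-up samples
a = [4.1, 5.0, 5.5, 6.2, 4.8]
b = [6.0, 6.8, 7.1, 5.9, 7.4]
t = pooled_t(a, b)
f = one_way_f(a, b)
print(t ** 2, f)  # the two values agree up to floating-point error
```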

thorny geode Jan 22, 2025, 3:34 AM

#

but i'm planning to use more advanced models to win my championship, and lasso regression looks very nice… I mentioned ANOVA because even simple hypothesis testing already wins the city championship, so improving my statistical knowledge and skills should get me to the national competition without much difficulty (hopefully)

thorny geode Jan 22, 2025, 3:35 AM

#

warm copper For two variables the F-statistic in ANOVA is the square of the t-statistic

Yes, and I believe that is to check the partial effect of adding that specific variable to the multiple regression model

warm copper Jan 22, 2025, 3:35 AM

#

lasso regression is used as a feature selection method

#

if your aim is to find the most important variables that can work

thorny geode Jan 22, 2025, 3:36 AM

#

yes, hopefully I can finish that chapter before my semester ends, but Chapter 3 of regression would be really sufficient in my research

thorny geode Jan 22, 2025, 3:37 AM

#

warm copper if your aim is to find the most important variables that can work

we can use forward selection, backward selection, or mixed selection

warm copper Jan 22, 2025, 3:37 AM

#

yeah stepwise regression too

thorny geode Jan 22, 2025, 3:37 AM

#

and for a low number of variables, we can test every combination of variables to check all the possibilities
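
A minimal sketch of that "try every combination" idea (best-subset selection), assuming only a handful of variables since the count grows as 2^p - 1. The feature names and `toy_score` are hypothetical stand-ins; in a real analysis the score would come from a fitted model, e.g. adjusted R² or negative AIC.

```python
from itertools import combinations

def all_subsets(features):
    """Yield every non-empty subset of features (2**p - 1 of them)."""
    for size in range(1, len(features) + 1):
        yield from combinations(features, size)

def best_subset(features, score):
    """Return the subset with the highest score; score is any callable."""
    return max(all_subsets(features), key=score)

# Hypothetical feature names and a toy scoring rule, standing in for a real
# criterion such as adjusted R^2 or (negative) AIC from a fitted model.
features = ["rainfall", "temperature", "humidity"]
toy_gain = {"rainfall": 0.50, "temperature": 0.30, "humidity": 0.02}

def toy_score(subset):
    # Reward useful features, penalise model size
    return sum(toy_gain[f] for f in subset) - 0.05 * len(subset)

subsets = list(all_subsets(features))
best = best_subset(features, toy_score)
print(len(subsets), best)
```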

thorny geode Jan 22, 2025, 3:37 AM

#

warm copper yeah stepwise regression too

ah i see, very nie

#

nice

warm copper Jan 22, 2025, 3:37 AM

#

https://github.com/KadirOrcunAltunel/HeartFailureAnalysis/tree/main

GitHub

GitHub - KadirOrcunAltunel/HeartFailureAnalysis

Contribute to KadirOrcunAltunel/HeartFailureAnalysis development by creating an account on GitHub.

#

check this 😄

thorny geode Jan 22, 2025, 3:38 AM

#

@serene scaffold I’m sorry, i moved into another conversation

warm copper Jan 22, 2025, 3:38 AM

#

https://github.com/KadirOrcunAltunel/HeartFailureAnalysis/blob/main/Methods.md

GitHub

HeartFailureAnalysis/Methods.md at main · KadirOrcunAltunel/HeartFa...

Contribute to KadirOrcunAltunel/HeartFailureAnalysis development by creating an account on GitHub.

#

this part can be helpful for you

#

for feature selection tasks

thorny geode Jan 22, 2025, 3:40 AM

#

ooh yeah that will be a nice cheatsheet if im confused what to do in my research later

warm copper Jan 22, 2025, 3:41 AM

#

what are you planning to do for your research?

#

whats the project

thorny geode Jan 22, 2025, 3:44 AM

#

I’m thinking about using BMKG meteorological data to predict crop yield, as my country is really focused on agriculture

warm copper Jan 22, 2025, 3:46 AM

#

so its not logistic regression

#

y is not categorical I assume

#

you can use linear regression

#

Im not sure if you learned tree based models but they are good as well

#

XGBoost can be good with nonlinear relations

obtuse perch Jan 22, 2025, 4:14 AM

#

rancid sorrel Excel supports python

I gave up on that. The function is too complex; I can't fit it

somber fractal Jan 22, 2025, 5:13 AM

#

```py
data = [
    {"job_title": "Asia Finance Controller", "tags": ["Manager", "Director"]},
    {"job_title": "Assistant Audit Manager AVP", "tags": ["Manager", "Director"]},
    {"job_title": "Business Controller", "tags": ["Manager", "Director"]}
]

# Preprocess data
df = pd.DataFrame(data)
mlb = MultiLabelBinarizer()
df['labels'] = list(mlb.fit_transform(df['tags']))

# Convert to Hugging Face dataset
dataset = Dataset.from_pandas(df)

# Load tokenizer and model
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=len(mlb.classes_), problem_type="multi_label_classification")

# Tokenize data
def preprocess_function(examples):
    return tokenizer(examples['job_title'], truncation=True, padding=True)

tokenized_dataset = dataset.map(preprocess_function, batched=True)

# Ensure labels are of type torch.float (this is required for multi-label classification)
def cast_to_float(example):
    example['labels'] = torch.tensor(example['labels'], dtype=torch.float)  # Convert labels to torch.float
    return example

# tokenized_dataset = tokenized_dataset.map(cast_to_float)

# Training arguments
training_args = TrainingArguments(
    output_dir="./results",
    evaluation_strategy="epoch",
    save_strategy="epoch",
    num_train_epochs=3,
    per_device_train_batch_size=8,
    logging_dir="./logs",
)

# Trainer
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_dataset,
    eval_dataset=tokenized_dataset,  # Ideally, you should split this into train/test datasets.
)

# Train model
trainer.train()```

#

I get this error : RuntimeError: result type Float can't be cast to the desired output type Long

serene scaffold Jan 22, 2025, 5:19 AM

#

somber fractal I get this error : RuntimeError: result type Float can't be cast to the desired ...

!traceback

arctic wedgeBOT Jan 22, 2025, 5:19 AM

#

Traceback

Please provide the full traceback for your exception in order to help us identify your issue.
While the last line of the error message tells us what kind of error you got,
the full traceback will tell us which line, and other critical information to solve your problem.
Please avoid screenshots so we can copy and paste parts of the message.

A full traceback could look like:

Traceback (most recent call last):
  File "my_file.py", line 5, in <module>
    add_three("6")
  File "my_file.py", line 2, in add_three
    a = num + 3
        ~~~~^~~
TypeError: can only concatenate str (not "int") to str

If the traceback is long, use our pastebin.

somber fractal Jan 22, 2025, 5:26 AM

#

serene scaffold !traceback

what do you mean

serene scaffold Jan 22, 2025, 5:26 AM

#

somber fractal I get this error : RuntimeError: result type Float can't be cast to the desired ...

this is the last line of the error message. please show the whole entire thing.

somber fractal Jan 22, 2025, 5:26 AM

#

ok

#

https://pastebin.com/xXQPXK2X

Pastebin

RuntimeError Traceback (most recent ca...

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

shut bloom Jan 22, 2025, 5:42 AM

#

!traceback

arctic wedgeBOT Jan 22, 2025, 5:42 AM

#

Traceback

Please provide the full traceback for your exception in order to help us identify your issue.
While the last line of the error message tells us what kind of error you got,
the full traceback will tell us which line, and other critical information to solve your problem.
Please avoid screenshots so we can copy and paste parts of the message.

A full traceback could look like:

Traceback (most recent call last):
  File "my_file.py", line 5, in <module>
    add_three("6")
  File "my_file.py", line 2, in add_three
    a = num + 3
        ~~~~^~~
TypeError: can only concatenate str (not "int") to str

If the traceback is long, use our pastebin.

serene scaffold Jan 22, 2025, 5:44 AM

#

somber fractal ```data = [ {"job_title": "Asia Finance Controller", "tags": ["Manager", "Di...

how did you produce this code?

somber fractal Jan 22, 2025, 5:45 AM

#

with ai

serene scaffold Jan 22, 2025, 5:46 AM

#

somber fractal with ai

how much experience do you have writing python code?

somber fractal Jan 22, 2025, 5:46 AM

#

many

#

but first time with transformers.

serene scaffold Jan 22, 2025, 5:47 AM

#

@somber fractal I'm concerned that you don't know enough about what you're trying to do to benefit from any help I might give you.

somber fractal Jan 22, 2025, 5:48 AM

#

i see

serene scaffold Jan 22, 2025, 5:50 AM

#

somber fractal ```data = [ {"job_title": "Asia Finance Controller", "tags": ["Manager", "Di...

if you look at this code, you'll see that there's only three data instances, and that you don't divide them into training and testing. which means that the model will be useless.

this AI output is just intended to be used as an example.

somber fractal Jan 22, 2025, 5:51 AM

#

I'm not that dumb :) I know

#

I just have to solve the bug

#

I'm open to any solution, not to criticism

serene scaffold Jan 22, 2025, 5:52 AM

#

@somber fractal the problem is probably that tokenized_dataset contains the wrong data type.

#

it looks like you commented out # tokenized_dataset = tokenized_dataset.map(cast_to_float). I wonder what error you were getting before, if any

somber fractal Jan 22, 2025, 5:54 AM

#

to avoid this float data type error I commented out that function, but it didn't work

#

I wonder which parameters I should pass to the tokenizer function to make the dtype long,

#

if that would have any positive effect, of course.

#

I have a deadline, so I don't have enough time to research the documentation; that's why I'm here.
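A minimal sketch of the likely fix, assuming the error comes from integer labels: with `problem_type="multi_label_classification"` the model computes a binary cross-entropy loss (`BCEWithLogitsLoss`), which expects float targets, so it is the labels, not the tokenizer output, that need a float dtype. The helper `multi_hot` and the toy rows below are illustrative and not part of the posted script.

```python
def multi_hot(tags, classes):
    """Return a multi-hot label vector of floats: 1.0 where a class is present."""
    present = set(tags)
    return [1.0 if c in present else 0.0 for c in classes]


# Toy rows (hypothetical); the second row has one tag so the two vectors differ.
rows = [
    {"job_title": "Asia Finance Controller", "tags": ["Manager", "Director"]},
    {"job_title": "Business Controller", "tags": ["Manager"]},
]

# Fixed, sorted class order so every row uses the same label positions.
classes = sorted({tag for row in rows for tag in row["tags"]})

for row in rows:
    row["labels"] = multi_hot(row["tags"], classes)
    print(row["job_title"], row["labels"])
```

In the posted script, something like `df['labels'] = mlb.fit_transform(df['tags']).astype('float32').tolist()` before `Dataset.from_pandas(df)` may have the same effect; that line is untested here.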

young cairn Jan 22, 2025, 7:06 AM

#

How would you guys deal with prompt condensation? i want to reduce process time for parsing and for that i need to reduce input-output tokens

rancid sorrel Jan 22, 2025, 9:28 AM

#

obtuse perch I gave up that. the function is too complex I can't fit

Use =LINEST(), then go into the graph, right-click the line, and you can change it to polynomial in the graph window when you click Properties

hallow badger Jan 22, 2025, 11:10 AM

#

OpenAI and SoftBank are building Stargate, a new company, investing 500 billion over four years

thorny geode Jan 22, 2025, 11:20 AM

#

warm copper y is not categorical I assume

possibly 🙂

thorny geode Jan 22, 2025, 11:20 AM

#

warm copper Im not sure if you learned tree based models but they are good as well

thank you for your tips

regal light Jan 22, 2025, 11:36 AM

#

Hey guys, has anyone worked with OCR fine-tuning? I have Omani number plate datasets, but I couldn't find a proper OCR model to fine-tune. Can anyone help me with it?

agile cobalt Jan 22, 2025, 3:15 PM

#

regal light Hey guys, have anyone worked with ocr fine tuning. I have omani number plate da...

Take a look around "YOLO (you only look once)" models or just roboflow, e.g. https://universe.roboflow.com/roboflow-universe-projects/license-plate-recognition-rxg4e

Roboflow

License Plate Recognition Object Detection Dataset and Pre-Trained ...

10125 open source license-plates images plus a pre-trained License Plate Recognition model and API. Created by Roboflow Universe Projects

fervent canopy Jan 22, 2025, 5:07 PM

#

regal light Hey guys, have anyone worked with ocr fine tuning. I have omani number plate da...

Just use openCV lol

#

It works good enough

regal light Jan 22, 2025, 5:08 PM

#

Okay cool

#

But I wanted to know if anyone has experience with fine tuning an ocr model

final cobalt Jan 23, 2025, 12:14 AM

#

https://paste.pythondiscord.com/4NJA Could I get a triple check on my math

#

It's a bit tough telling whether the problem is the model or the diffusion logic

fervent canopy Jan 23, 2025, 6:51 AM

#

regal light But I wanted to know if anyone has experience with fine tuning an ocr model

dude ocr is just a ml model

lavish wraith Jan 23, 2025, 8:18 AM

#

Is AI a difficult field, or is it easy to learn?

small wedge Jan 23, 2025, 8:26 AM

#

AI/ML is a very difficult field, but you can easily learn to leverage existing models and even make your own with libraries that abstract away all the complex math and understanding required to build them from scratch

serene scaffold Jan 23, 2025, 4:36 PM

#

I just tried to make a matplotlib figure so large that I got a warning saying I might be getting DOS'ed

rancid sorrel Jan 23, 2025, 7:04 PM

#

lol

inland crown Jan 23, 2025, 7:42 PM

#

Would this be the right place to discuss zencoder and copilot?

left tartan Jan 23, 2025, 7:57 PM

#

inland crown Would this be the right place to discuss zencoder and copilot?

What about them? How to build one? Or how to use them? If build, yes... if just how to use them, that's more a general (#python-discussion) or OT discussion, depending

inland crown Jan 23, 2025, 8:39 PM

#

They aren't working. I'll ask in gen thank you!

left tartan Jan 23, 2025, 8:42 PM

#

inland crown They aren't working. I'll ask in gen thank you!

If zencoder isn't working, you're better off asking their slack. I'm not sure about copilot's community.

inland crown Jan 23, 2025, 8:44 PM

#

Thank you for digging that link for me! I appreciate it!

floral elm Jan 23, 2025, 11:12 PM

#

Can someone tell me where to get help with using my gpu with tensorflow on windows? I've tried multiple combinations of cuda/cudnn/drivers/python/tensorflow. I made sure they're all compatible each time. I've tried miniconda, anaconda3, and WSL2 with docker, and although I seem to have set them up correctly, tensorflow can't see my gpu in each case. I've also tried about 25 hours of consulting chatgpt. nvidia-smi does correctly show my gpu.

serene scaffold Jan 23, 2025, 11:32 PM

#

floral elm Can someone tell me where to get help with using my gpu with tensorflow on windo...

tensorflow development is winding down--I recommend switching to pytorch

#

I can help you install pytorch on windows.

floral elm Jan 23, 2025, 11:39 PM

#

serene scaffold tensorflow development is winding down--I recommend switching to pytorch

😂 Yeah I just realized pytorch would be a much better solution. I'm installing an earlier version of cuda now, just for pytorch, and I'll let you know how it goes. Thanks for answering!

#

in my defense, I thought pytorch uses tensorflow. I just found out that it doesnt after I posted for help

floral elm Jan 23, 2025, 11:50 PM

#

serene scaffold tensorflow development is winding down--I recommend switching to pytorch

Got it working already - Ty again!

serene scaffold Jan 24, 2025, 12:10 AM

#

floral elm Got it working already - Ty again!

nice!

final cobalt Jan 24, 2025, 4:31 AM

#

I think

#

Though I'm having a bloody hard time telling if so

#

https://github.com/lucaswalkeryoung/Diffusion that it's working

GitHub

GitHub - lucaswalkeryoung/Diffusion: a small diffusion model

a small diffusion model. Contribute to lucaswalkeryoung/Diffusion development by creating an account on GitHub.

#

Could I get someone to look over my DDPM class to check my denoising logic? The loss is dropping really nicely, but it's kinda hard to tell if it's working or not

final cobalt Jan 24, 2025, 5:07 AM

#

YESSSSSSS

#

#

That sure as hell looks like learning to me!!!!!

weary timber Jan 24, 2025, 10:40 AM

#

final cobalt

nightmare fuel

final cobalt Jan 24, 2025, 11:15 AM

#

weary timber nightmare fuel

XD Yeah

#

That's about the size of it

spring field Jan 24, 2025, 12:00 PM

#

serene scaffold I just tried to make a matplotlib figure so large that I got a warning saying I ...

lol
also, what was that lib you were shilling now? was it seaborn or plotly?

#

ah, right
I had seaborn on my mind for some reason, so was a bit confused when I found out it's a matplotlib wrapper when I thought plotly was that 😅

#

plotly do be looking nice indeed, yeah

magic grove Jan 24, 2025, 3:35 PM

#

Hi, I made a post on #1035199133436354600 but it got closed for inactivity. I'm using Anaconda on Windows and have installed Jupyter Themes using conda install -c conda-forge jupyterthemes and tried to change the theme to Onedork by running jt -t onedork and restarting the Jupyter Notebook and refreshing my browser cache but the theme does not change, nor do I have the option in the themes menu to switch it to Onedork. Here is my log on a fresh reinstall https://paste.pythondiscord.com/MBDA

light lichen Jan 24, 2025, 5:36 PM

#

hi

#

https://www.kaggle.com/code/harunshimanto/machine-learning-algorithms-for-epileptic-seizures#What-is-Data-Pre-pocessing?

Machine Learning Algorithms for Epileptic Seizures

Explore and run machine learning code with Kaggle Notebooks | Using data from Epileptic Seizure Recognition

#

can anyone go through this notebook and explain to me what's in the dataset

#

i cant understand anything

#

please ping me if you answer

lapis sequoia Jan 24, 2025, 5:54 PM

#

RL is incredibly hard, where did you guys start?

marble bough Jan 24, 2025, 7:11 PM

#

I am really interested in shifting towards this focus. SWE is my love but AI is rapidly replacing in this field.

spring field Jan 24, 2025, 7:18 PM

#

AI is definitely not replacing SWEs

glacial yoke Jan 24, 2025, 7:48 PM

#

How do you build a discord bot AI that gives answers based on specific sources like a google document? How to start?

spring field Jan 24, 2025, 7:54 PM

#

light lichen i cant understand anything

the text is actually barely comprehensible, it is really badly written, took me several rereads to understand what is even going on and I still don't understand a couple things
the gist of it though is that each row in that table represents a recording made over several hours, but with a total recording time of 23.6 seconds, that was then split into 4097 "buckets" where each bucket represents a 23.6/4097 seconds from the recording, then those 4097 buckets were split into 23 chunks where each represents 1 second of that recording in which you have 178 of those buckets and so each row is those 178 buckets and there is the label at the end
basically, as I understand, you can think of the X1..178 as something like

X1 recorded at 00:00:00
X2 recorded at 00:00:30
X3 recorded at 00:01:00
X4 recorded at 00:01:30
...

where each record is some value from the EEG data
so, you have 178 features (X values) that summed together by how long each record is would make up a whole second, but the actual observation time might span several minutes/hours and then at the end you have the label, 1 for a seizure and the others for no seizure
so they are essentially trying to predict a seizure from say 30 minutes of observation
again, idk what is the actual interval of the recordings or what is the total observation period or if I even understood those 23.6 seconds correctly, but that is what I understand from the poorly written text

untold bloom Jan 24, 2025, 8:40 PM

#

- this is the data collected from a single subject's (human's) brain (orange is the data, blue is the interpolating line, i.e., what you get with, e.g., plt.plot)
- they recorded each human for 23 seconds (23.6 or something, but that's unimportant)
- the recording device takes samples at some frequency; it turns out it takes 178 samples per second (cool)
- then for each subject, we have 23 * 178 = 4094 data points (orange dots)
- we need to make a dataset out of this; how?
- they do it like this: crop each 23-second measurement into 1-second parts. Then your X values (features) will be those orange points in each 1-second interval (178 of them)
- what is y? y is what type of seizure happened in that interval (one of 1, 2, 3, 4, 5)
- ok so we have 178 features and 5 classes
- so X.shape[1] is 178; what is X.shape[0]? In other words, how many instances do we have that have 178 features?
- well we have 500 subjects, and for each of them, we have 23 1-second chunks; so 23 * 500 = 11500
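The chunking described above can be sketched in plain Python, assuming 500 recordings of 23 * 178 samples each. The recordings here are placeholder zeros and `chunk_recording` is an illustrative name, not something from the notebook.

```python
SAMPLES_PER_SECOND = 178
SECONDS = 23
N_SUBJECTS = 500


def chunk_recording(signal, chunk_size=SAMPLES_PER_SECOND):
    """Split one recording into consecutive, non-overlapping 1-second chunks."""
    return [signal[i:i + chunk_size] for i in range(0, len(signal), chunk_size)]


# Placeholder recordings: one list of 23 * 178 = 4094 values per subject.
recordings = [[0.0] * (SECONDS * SAMPLES_PER_SECOND) for _ in range(N_SUBJECTS)]

# Every 1-second chunk becomes one row of the feature matrix X.
X = [chunk for recording in recordings for chunk in chunk_recording(recording)]

n_rows, n_features = len(X), len(X[0])
print(n_rows, n_features)
```

Each row of `X` would then be paired with the label for that one-second interval.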

fading wigeon Jan 24, 2025, 11:54 PM

#

Huh

#

This is actually well within my experience

#

I worked as an R&D engineer at a neuroscience company in my last role

#

Although, uhhh... is there a question somewhere? 😅

#

On a separate note, I'm trying to wrap my head around tree ensembles/random forests. Am I correct in my understanding here?

Basically, we can make a decision tree off of a dataset. A random forest involves changing the dataset up a bit and creating decision trees off of that dataset, with the hope of having a bunch of decision trees that we can hope will agree with each other on the important bits?

As for changing the dataset, I believe with random forests it's random sampling with replacement to create each tree?

final cobalt Jan 25, 2025, 12:01 AM

#

Does anyone know of a good server specifically for diffusion model training/mechanics?

#

I've done the reading, but I need some good old human to human learning

fading wigeon Jan 25, 2025, 12:03 AM

#

Haha I might try to work on that, it's my weakest area of ML and I do keep seeing those jobs

#

Also, from what I just learned, I think random forest also biases feature node decisions to be more random, to differentiate from other tree ensembles

fading wigeon Jan 25, 2025, 12:24 AM

#

Where does a machine learning engineer go camping? ||In a random forest||

serene scaffold Jan 25, 2025, 1:27 AM

#

fading wigeon Where does a machine learning engineer go camping? ||In a random forest||

If a consumer product/service uses ML, but the ml that it uses is a random forest, that's how you know it's shit

fading wigeon Jan 25, 2025, 1:28 AM

#

Out of curiosity, are you speaking to tree ensembles in general or just that specific algorithm?

serene scaffold Jan 25, 2025, 1:29 AM

#

fading wigeon Out of curiosity, are you speaking to tree ensembles in general or just that spe...

specifically random forests.

past meteor Jan 25, 2025, 1:33 AM

#

fading wigeon On a separate note, I'm trying to wrap my head around treen ensembles/random for...

Yes, the two parts that introduce randomness are:

Sampling from the dataset with replacement to create a new one (bootstrapping)
Only considering a random subset of the features at each split, not all of them

#

The idea is that decision trees overfit too much and generally have high variance

#

introducing the randomness places you in a way better place in the bias-variance trade-off
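A rough sketch of those two sources of randomness, using only the standard library. The function names are made up for illustration, and taking the square root of the feature count is just one common choice for the subset size, assumed here.

```python
import random


def bootstrap_indices(n_rows, rng):
    """Sample n_rows row indices with replacement (bootstrapping)."""
    return [rng.randrange(n_rows) for _ in range(n_rows)]


def random_feature_subset(n_features, k, rng):
    """Pick k distinct feature indices to consider at a single split."""
    return sorted(rng.sample(range(n_features), k))


rng = random.Random(0)  # seeded so the sketch is repeatable
n_rows, n_features = 10, 9
k = int(n_features ** 0.5)  # assumed subset size: sqrt of the feature count

rows_for_tree = bootstrap_indices(n_rows, rng)
features_for_split = random_feature_subset(n_features, k, rng)
print(rows_for_tree)
print(features_for_split)
```

Each tree in the forest would get its own `rows_for_tree`, and each split inside a tree would draw a fresh `features_for_split`.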

fading wigeon Jan 25, 2025, 1:38 AM

#

Oh yeah I agree that specifically random forests are not the optimal tree ensemble. Probably an improvement on bagged decision trees, but I like the boosted trees whose further iterations focus on what was misclassified in earlier trees if I’m understanding the algorithm correctly.

past meteor Jan 25, 2025, 1:39 AM

#

Sure, but the drawback there is that you need to train them in sequence ig

#

On paper training RF should be faster (but it isn't in any of the implementations I've tried)

#

Boosting is inherently sequential

#

But yeah, either way nothing stops you from trying both

#

No free lunch after all

#

There will be problems where RF > gbms

fading wigeon Jan 25, 2025, 1:48 AM

#

That's one thing I've kind of been struggling to learn/figure out. I know the ins and outs of neural networks, and could implement one with pen and paper if need be (preferably would at least want numpy please....). I've been learning the ins and outs of quite a few different machine learning algorithms.

I'm just struggling with the insight of when to use which for what kind of issue/problem.

#

Like I'd have no idea if you asked me to give an example where a RF > gbms

#

And the only thing pushing me towards a neural network over other stuff is only feature amount

#

But even then that's more of just a gut feeling

#

than a thing I could defend as truth

#

I suppose in practice you just modify an existing implementation that works on something similar?

spring field Jan 25, 2025, 2:59 AM

#

untold bloom - this is the data collected from a single subject's (human's) brain (orange are...

as far as i could tell, 1 was a seizure and 2, 3, 4, 5 were not seizures at all

a binary classification between classes of label 1 and the rest (2,3,4,5)

also, what was confusing me was the mention of:

EEG signals are to ensure the accuracy of diagnosing disease that usually is taken 8-10 hours in the form of records.

The EEG data used in our study were downloaded from 24-h EEG recorded (..)

Which leads me to believe that it was not an actual continuous 23.6 seconds, but rather, that was the total recording time, but it was different than the observation time, which may have been several hours and so the measurements were taken only every couple seconds/minutes, but again, I don't know, it's really hard to read what they have written (as in it's not written very clearly).

light lichen Jan 25, 2025, 3:21 AM

#

spring field the text is actually barely comprehensible, it is really badly written, took me ...

oh i see, thank you. Actually im a beginner in data science and I've been given a project in my college so i need some help

#

thanks to everyone who explained it

#

im watching videos step by step and working on this project

#

is it fine with any of you guys if i add you and ask you my doubts

light lichen Jan 25, 2025, 3:29 AM

#

fading wigeon This is actually well within my experience

oh i see, im a student and ive been given this project

#

the only problem is we were just told to study on our own and complete it

#

they just provided us with a problem statement

#

no resources, no dataset

#

and as a beginner im really confused what to do

#

in the first week, we are just supposed to do analysis

#

preprocessing, cleaning, eda, visualization

#

but i just couldn't understand the data

fading wigeon Jan 25, 2025, 3:33 AM

#

Hmm. I'm not sure like... how much depth to go into

#

But EEG is typically time series data that is generally artifact heavy, but artifact cleaning can sometimes clean seizure activity so you have to be careful

#

If you have any questions about EEG specifically I'd be happy to help, though. Not sure if it's too indepth/specialized for your problem though

light lichen Jan 25, 2025, 3:35 AM

#

fading wigeon If you have any questions about EEG specifically I'd be happy to help, though. ...

i found a notebook on kaggle

#

im referring to it

#

https://www.kaggle.com/code/harunshimanto/machine-learning-algorithms-for-epileptic-seizures#What-is-Data-Pre-pocessing?

Machine Learning Algorithms for Epileptic Seizures

Explore and run machine learning code with Kaggle Notebooks | Using data from Epileptic Seizure Recognition

#

if i dont understand anything, ill ask it

#

#

this is what im working on

#

is there any other dataset, i tried finding but the one which i sent was the most common one

final cobalt Jan 25, 2025, 5:10 AM

#

So

#

Brass tacks it for me, guys

#

Can I or can I not use mixed precision on an M3 Apple Silicon macbook?

past meteor Jan 25, 2025, 6:54 AM

#

fading wigeon I suppose in practice you just modify an existing implementation that works on s...

don't overthink it imo

#

Because a couple of things matter: one algo isn't intrinsically better than another one

#

In practice being able to robustly evaluate several ML algos matters wayyyyy more than knowing how any specific one works

#

Because you'd just try them all

fading wigeon Jan 25, 2025, 6:57 AM

#

True, fair…

#

Try different models see what happens

limber spear Jan 25, 2025, 8:08 AM

#

Agree. Fast isn’t always the best. Something may break

trim cedar Jan 25, 2025, 1:29 PM

#

Hi all, is Scrapy the best python web scraper?

odd meteor Jan 25, 2025, 1:35 PM

#

trim cedar Hi all, is Scrapy the best python web scraper?

It's subjective. So it depends on the nature of the task and the website involved.

Some prefer Playwright, some prefer Selenium, BeautifulSoup, etc.

odd meteor Jan 25, 2025, 2:02 PM

#

final cobalt Can I or can I not use mixed precision on an M3 Apple Silicon macbook?

I believe you can use mixed precision on M3.

untold bloom Jan 25, 2025, 4:20 PM

#

spring field as far as i could tell, `1` was a seizure and `2, 3, 4, 5` were not seizures at ...

> as far as i could tell, 1 was a seizure and 2, 3, 4, 5 were not seizures at all
>
> a binary classification between classes of label 1 and the rest (2,3,4,5)

opposite; 1 is non-seizure, the others are some types of seizures (e.g., tonic-clonic, complex partial). they are binarizing the problem

> EEG signals are to ensure the accuracy of diagnosing disease that usually is taken 8-10 hours in the form of records.
> The EEG data used in our study were downloaded from 24-h EEG recorded (..)
>
> Which leads me to believe that [...] the measurements were taken only every couple seconds/minutes

the first one is a generic fact; the second one implies the dataset used in the notebook is a (rather small) subset of an original, big dataset. usually these are on the order of 10s or even 100s of GBs (what they have in the notebook is < 10MB). Also, you'd lose a lot of information in between if your sampling period was on the order of seconds; the temporal resolution of EEG recordings is rather high and typically on the order of milliseconds (in this dataset, it's 1/178 * 1000 ≈ 5.6 milliseconds)

calm thicket Jan 25, 2025, 4:30 PM

#

any high performance alternatives for networkx? i see snap.py but i'm struggling to compile it 🥴. i have found igraph

pine wolf Jan 25, 2025, 4:34 PM

#

igraph and graph-tool are it

calm thicket Jan 25, 2025, 4:36 PM

#

igraph seems good but the docs have massive ads covering everything 😩

pine wolf Jan 25, 2025, 4:38 PM

#

graph-tool is really good, it's just a bit harder to setup

#

has good numpy support too

modest lotus Jan 25, 2025, 6:06 PM

#

Hi all, I have a model trained based on LayoutLM. The training is done, when I run inference on an image, I get the expected result. But I want the result in JSON, so that I can process it further. But there seems to be no way. One thing that I tried is to crop the image with the help of bounding boxes and give it to an OCR tool to recognise the text. But this doesn't work consistently, I'm not sure if it is due to cropping the image. So in short, LayoutLM gives an output with bounding boxes and labels, I use the bounding boxes to crop the image and provide the image to an OCR software to recognise the image. If someone could help me or point me to some resource, it would be really helpful. Thank you in advance.

PS: Mention me here or you can DM if you have experience working with LayoutLM or similar kinds of models.

unique ridge Jan 25, 2025, 6:28 PM

#

Is my understanding of dataset prepping correct?

Annotate Data:

For single-object classification: Label each image with a category (e.g., "dog", or "cat").
For multi-object detection: Annotate images with bounding boxes. Label Studio is a solution to do this.

#

There are scenarios where multiple categories appear in an image. Should you therefore always label images with bounding boxes?

serene scaffold Jan 25, 2025, 6:32 PM

#

unique ridge Is this my understanding of dataset prepping correct? Annotate Data: For singl...

It sounds like you're mixing up whole-image classification and detection

unique ridge Jan 25, 2025, 6:48 PM

#

Yeah you're right

#

You need to classify some images first before you can detect if an image contains a category I would assume?

fallow coyote Jan 25, 2025, 9:31 PM

#

has anyone read essential math for data science? is it considered a good book for getting a good basic understanding of the maths needed for ML?

fading wigeon Jan 25, 2025, 11:53 PM

#

For multi object detection unless you have something more specific in mind you can just label it with each type of object that appears. There are several strategies for that sort of algorithm, the simplest just being running each individual algorithm on it lol

#

Otherwise you can use a soft max activation

worldly wagon Jan 26, 2025, 1:20 AM

#

are there any good algorithms/models that break ovo words into morphemes?

serene scaffold Jan 26, 2025, 1:51 AM

#

worldly wagon are there any good algorithms/models that break ovo words into morphemes?

the only library I can find for this is abandoned https://polyglot.readthedocs.io/en/latest/Installation.html

worldly wagon Jan 26, 2025, 1:52 AM

#

serene scaffold the only library I can find for this is abandoned https://polyglot.readthedocs.i...

yea i also found that, been reading into it

serene scaffold Jan 26, 2025, 1:52 AM

#

I tried to use it just now and the website that hosts the models appears to be gone.

worldly wagon Jan 26, 2025, 1:52 AM

#

serene scaffold I tried to use it just now and the website that hosts the models appears to be g...

ahhh

serene scaffold Jan 26, 2025, 1:52 AM

#

In [5]: downloader.supported_languages_table("morph2")
HTTPError: HTTP Error 404: Not Found

worldly wagon Jan 26, 2025, 1:53 AM

#

damm that's kinda bad lol

serene scaffold Jan 26, 2025, 1:56 AM

#

worldly wagon damm that's kinda bad lol

I wonder if you could make your own using this and this

Wiktionary

Category:English prefixes

Affixes attached to the beginning of English words.
For more information, see Appendix:English prefixes.

Category:English prefix forms: English prefixes that are inflected to display grammatical relations other than the main form.
Category:English terms by prefix: English terms categorized by their prefixes.

Wiktionary

Category:English suffixes

Affixes attached to the end of English words.
For more information, see Appendix:English suffixes.

Category:English suffix forms: English suffixes that are inflected to display grammatical relations other than the main form.
Category:English derivational suffixes: English suffixes that are used to create new words.
Category:English diminutive s...

worldly wagon Jan 26, 2025, 1:58 AM

#

serene scaffold I wonder if you could make your own using [this](https://en.wiktionary.org/wiki/...

dam u need to drop research tips it took me like 4-5days to find that

#

but yea i've been using it
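A very naive sketch of what a home-made segmenter built from affix lists might look like. The tiny `PREFIXES` and `SUFFIXES` lists stand in for the Wiktionary categories, `segment` is a hypothetical helper, and the approach ignores spelling changes and real morphology, so treat it as a starting point only.

```python
# Tiny stand-ins for the much longer Wiktionary prefix/suffix lists.
PREFIXES = ["un", "re", "dis"]
SUFFIXES = ["ness", "ing", "ly", "ed"]


def segment(word, min_stem=3):
    """Greedily strip one known prefix and any known suffixes from a word."""
    stem = word
    before, after = [], []

    # Strip at most one prefix, trying longer candidates first.
    for prefix in sorted(PREFIXES, key=len, reverse=True):
        if stem.startswith(prefix) and len(stem) - len(prefix) >= min_stem:
            before.append(prefix + "-")
            stem = stem[len(prefix):]
            break

    # Strip suffixes repeatedly, trying longer candidates first,
    # and never shrink the stem below min_stem characters.
    stripped = True
    while stripped:
        stripped = False
        for suffix in sorted(SUFFIXES, key=len, reverse=True):
            if stem.endswith(suffix) and len(stem) - len(suffix) >= min_stem:
                after.insert(0, "-" + suffix)
                stem = stem[:-len(suffix)]
                stripped = True
                break

    return before + [stem] + after


print(segment("unhappiness"))
print(segment("quickly"))
print(segment("red"))
```

The `min_stem` guard is there so short words such as "red" are not split into a bogus affix plus a one-letter stem.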

serene scaffold Jan 26, 2025, 1:58 AM

#

worldly wagon dam u need to drop research tips it took me like 4-5days to find that

I am a computational linguist.

worldly wagon Jan 26, 2025, 1:59 AM

#

serene scaffold I am a computational linguist.

true, i remember, i'm new to NLP

#

lemmatization has been a decent fall back

serene scaffold Jan 26, 2025, 2:01 AM

#

worldly wagon true, i remember, i'm new to NLP

why do you need to do this

worldly wagon Jan 26, 2025, 2:02 AM

#

serene scaffold why do you need to do this

in case a word isn't present in my pre-defined dataset i'm creating but is a valid word

#

feel like i'm butchering the explanation

serene scaffold Jan 26, 2025, 2:03 AM

#

worldly wagon incase a word isn't present in my pre-defined dataset i'm creating but is a vali...

you want to replace out-of-vocabulary (OOV) words with the in-vocabulary word that most closely approximates its meaning?

worldly wagon Jan 26, 2025, 2:04 AM

#

serene scaffold you want to replace out-of-vocabulary (OOV) words with the in-vocabulary word th...

yea kind of i'd also like to be able to segment them into their morphemes

serene scaffold Jan 26, 2025, 2:05 AM

#

worldly wagon yea kind of i'd also like to be able to segment them into their morphemes

why?

worldly wagon Jan 26, 2025, 2:05 AM

#

serene scaffold why?

no real reason really just think it could be good meta-data

serene scaffold Jan 26, 2025, 2:06 AM

#

worldly wagon no real reason really just think it could be good meta-data

I suspect there's no good solution for this because people don't really need to do it these days

worldly wagon Jan 26, 2025, 2:08 AM

#

serene scaffold I suspect there's no good solution for this because people don't really need to ...

ahh fair fair

bitter harbor Jan 26, 2025, 2:13 AM

#

serene scaffold you want to replace out-of-vocabulary (OOV) words with the in-vocabulary word th...

Just out of curiosity how would you approach that

serene scaffold Jan 26, 2025, 2:15 AM

#

bitter harbor Just out of curiosity how would you approach that

I would give up.
part of life is recognizing what you can't do and cutting your losses.

just kidding. I mostly deal with interactive LLMs these days, where that isn't an issue. but I suppose you could take the word in the vocabulary with the shortest cosine distance to the OOV word.
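A small sketch of that nearest-word idea, assuming you already have an embedding for the OOV word (for example from a subword-based model). The three-dimensional vectors and the word list are invented for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest_in_vocab(oov_vector, vocab_vectors):
    """Return the vocabulary word with the highest cosine similarity,
    which is the same as the shortest cosine distance."""
    return max(
        vocab_vectors,
        key=lambda word: cosine_similarity(oov_vector, vocab_vectors[word]),
    )


# Made-up toy embeddings for three in-vocabulary words.
vocab_vectors = {
    "happy": [0.9, 0.1, 0.0],
    "sad": [-0.8, 0.2, 0.1],
    "table": [0.0, 0.1, 0.9],
}
oov_vector = [0.8, 0.2, 0.1]  # pretend embedding of the OOV word "joyful"

best = nearest_in_vocab(oov_vector, vocab_vectors)
print(best)
```

With a real vocabulary you would likely vectorize this with a numerical library rather than loop in Python.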

modest lotus Jan 26, 2025, 2:18 AM

#

unique ridge There are scenarios that more categories appear in an image. Should you thereby ...

Yes, I have annotated images with labels using Label studio. I do not have an issue with training or running inference on the model. Those work perfectly fine. I have run an inference on an image. Now to run the inference I give an image, and output gives an image with its identified labels. See here the output is an image, but I want an output in a different format, let's say a JSON so that I could do some post processing on the identified data.

modest lotus Jan 26, 2025, 2:20 AM

#

serene scaffold It sounds like you're mixing up whole-image classification and detection

I really don't want to classify the image, I want to know what is in an image, group it with labels, so I can do some process with that data.

#

Here's a gist of what I'm trying to do, maybe this could help. Let's say I have some 100 invoices (in images). What I would need is, I would like to get the details from an invoice, such as invoice number, amount etc. So, instead of plain OCR to recognise text, LayoutLM also has been used to identify what type of text it is.

#

Everything is good now, I give an image, LayoutLM tells me what the invoice number is. But the problem is the output is an image, with a bounding box and labels. So I'm not really able to do anything with the data. I can visually see it, but I would need it in a JSON format or something so I can write some code on top. Hope this helps.

tulip epoch Jan 26, 2025, 5:02 AM

#

Hi, Guys

#

"Can I get a 'Hi' from individuals who have successfully established their careers in data science?"

final cobalt Jan 26, 2025, 5:07 AM

#

How do I undo this:
```py
self.transforms = transforms.Compose([
    transforms.Resize(64, interpolation=transforms.InterpolationMode.BILINEAR),
    transforms.RandomCrop(64),
    transforms.RandomHorizontalFlip(),
    transforms.RandomVerticalFlip(),
    transforms.ToTensor(),
    transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)),
])
```

The normalization part I mean.

jaunty helm Jan 26, 2025, 5:14 AM

#

final cobalt How do I undo this: ```py self.transforms = transforms.Compose([ tr...

normalize is x' = (x - mean) / std, so to undo it should be just x = x' * std + mean
you can rephrase that into (x' - (-mean / std)) / (1 / std) so you can throw it into a Normalize if you want to
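
A quick numeric check of that rearrangement in plain Python, as a sketch. The helper names are hypothetical; with mean 0.5 and std 0.5 the inverse parameters come out to mean -1 and std 2 per channel, which is what you'd pass to a second `Normalize`.

```python
def normalize(x, mean, std):
    """Same arithmetic as a per-channel Normalize: (x - mean) / std."""
    return (x - mean) / std


def inverse_normalize_params(mean, std):
    """Parameters that make a second normalize() call undo the first."""
    return -mean / std, 1.0 / std


mean, std = 0.5, 0.5
x = 0.2                                    # a pixel value in [0, 1]
x_norm = normalize(x, mean, std)           # now roughly in [-1, 1]

inv_mean, inv_std = inverse_normalize_params(mean, std)
x_back = normalize(x_norm, inv_mean, inv_std)

print(inv_mean, inv_std)
print(x_back)
```

This only undoes the normalization. The random crop and flips in the pipeline can't be recovered from the output alone.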

grand breach Jan 26, 2025, 6:31 AM

#

no matter what I do I can't update scikit-learn on kaggle, I also changed this option and restarted my notebook but an older version of sklearn still gets imported

jaunty helm Jan 26, 2025, 7:47 AM

#

grand breach no matter what I do I can't update scikit-learn on kaggle, I also changed this o...

iirc you can run !pip install in cells
latest environment prob just means latest kaggle environment, which may not necessarily have the latest sklearn

rich condor Jan 26, 2025, 11:40 AM

#

It's been all over headlines recently that China has been bypassing a lot of the legwork for training actual models by shortcutting with knowledge distillation.

Surely knowledge distillation has to run into hiccups at some point? I'm not a technical expert but it makes intuitive sense that shortcuts are not sustainable

What are the disadvantages of knowledge distillation?

lapis sequoia Jan 26, 2025, 2:22 PM

#

Anyone successful integrated chatgpt api into python? What you use it for?

serene scaffold Jan 26, 2025, 2:33 PM

#

lapis sequoia Anyone successful integrated chatgpt api into python? What you use it for?

to test if ChatGPT is actually good at certain tasks.

past meteor Jan 26, 2025, 2:44 PM

#

lapis sequoia Anyone successful integrated chatgpt api into python? What you use it for?

What do you mean precisely? Calling the ChatGPT api with Python?

#

If so, yes.. I use it for a ton of things but mostly good ol' retrieval augmented generation

daring fiber Jan 26, 2025, 4:01 PM

#

File is not loading permission error

serene scaffold Jan 26, 2025, 4:06 PM

#

daring fiber File is not loading permission error

that file isn't a path to a CSV. it looks like it's a whole folder.

spare badger Jan 26, 2025, 5:30 PM

#

Hi guys, i need little help with sklearn library and decision tree classifier. I need to find out why sorted data have higher impact on classification than not sorted but don't know how to start

shadow vortex Jan 26, 2025, 5:46 PM

#

anyone here had issues with Letta framework?

#

My Letta Free model keeps showing Failed to send message

serene scaffold Jan 26, 2025, 6:02 PM

#

shadow vortex My Letta Free model keeps showing Failed to send message

Any time you need help with an error message, always show the whole entire error message and the code that caused it, even if you don't think it would help

shadow vortex Jan 26, 2025, 6:03 PM

#

Got it, sorry about that

past bramble Jan 26, 2025, 6:21 PM

#

any recommendations on next AI/ML project?

#

(I'm gonna sleep I'll respond tomorrow)

final cobalt Jan 26, 2025, 7:03 PM

#

Pardon me smart people

#

Is anyone here familiar with diffusion model internals? Or, can anyone point me to someone who is?

#

My forehead is sore from beating my head against the wall. I need to talk with a human who knows this stuff.

serene scaffold Jan 26, 2025, 7:23 PM

#

final cobalt My forehead is sore from beating my head against the wall. I need to talk with a...

Always give the information that people would need to start helping you.

final cobalt Jan 26, 2025, 7:24 PM

#

XD I know the rule about asking to ask. Throwing a butt load of context out though can result in a big wall of text that people often don't want to contend with

#

The TL;DR is that I've built a diffusion model, all the parts are where they should be, and I've debugged and fine tuned as best I can. But it still won't learn - dead on arrival kind of thing, not just poor learning

#

Somewhere between the datapreprocessing phase, forward noise injection, the model's architecture, hyperparameter choices, the training regimen, the denoising process, and the potential for programmer error, something is going wrong

serene scaffold Jan 26, 2025, 7:42 PM

#

final cobalt Somewhere between the datapreprocessing phase, forward noise injection, the mode...

How do you know that it's wrong? What is the delta between the current behavior and the desired behavior?

#

If it's just outputting random noise, you should probably share the training code in a way that shows all your hyperparameters

final cobalt Jan 26, 2025, 7:52 PM

#

As with most things about ML and diffusion models in specific, that's a question with a multidimensional answer. The short version is that after 25K steps I'm still getting pure noise despite what seems to me to be substantial improvements in loss.

MSE loss, scaled by a factor of 10000 (no exploding gradients) drops from the 10K range down to between 10 - 100. That's a three to four order of magnitude drop in loss. While I know loss isn't the best metric for assessing a model like this, it's still worth noting. At the start of training I see the faint imprint of what might eventually become structure - but I wouldn't exactly call it structure on its own. Loss drops precipitously before hitting a hard wall. It falls from 10Kish to 1Kish in a few batches, 1Kish to 100ish in a few dozen, and then to a lower bound usually around 20 or so in another few dozen. Then loss stops decreasing almost entirely, and after hitting that point and continuing for about 8 hours (while I slept) it dropped from 20 to, like, 18.

In short, fast learning and then hitting a wall.

This speaks to me of a few things: the model is learning the easy stuff well enough and then getting to a point where it fails (not struggles, but fails) to learn anything beyond this. I've tried a few different datasets and configurations of augmentation, and so I'm pretty sure the issue isn't lack of variety.

Hitting a hard wall sounds a lot like settling in on a trivial solution to me. It's found a minimum and it won't budge.

In terms of actual output, the model quickly starts producing almost-structure as it learns then hits the wall. It keeps on with this for a while after learning stops but eventually even this disappears and all I get out is noise again. This, too, sounds like falling towards some kind of trivial solution.

#

Now, I'm almost definitely over-normalizing. I'm using instancenorm to normalize feature maps individually, and weight norm to complement this. I read an article which said this approach helped their model converge in a fraction of the time and it outperformed a number of modern benchmarks.

Even if over normalization were the problem, though, I've been told that if any reasonably structured and capacitied model can't overfit to a single image then the issue is probably structure and not an issue of normalization/fine tuning. This of course comes with the caveat that lack of variety means lack of interesting gradient.

#

https://github.com/lucaswalkeryoung/Diffusion-2

bitter harbor Jan 26, 2025, 8:26 PM

#

serene scaffold I would give up. part of life is recognizing what you can't do and cutting your ...

We get it you use gpt :)

Whatre you referring to by distance between words?

How do you deal with gibberish OOV words, are you taking the full context into account (/other lexicons) or do they just get their own lil vector space?

serene scaffold Jan 26, 2025, 8:27 PM

#

bitter harbor We get it you use gpt :) Whatre you referring to by distance between words? H...

ChatGPT is the LLM I deal with the least, since it's proprietary

#

Cosine distance between two embedded representations of the words. Which requires them to both be in the vocabulary of that embedder

#

I haven't had to deal with gibberish on a scale worth accounting for.

bitter harbor Jan 26, 2025, 8:43 PM

#

serene scaffold Cosine distance between two embedded representations of the words. Which require...

What is the embedder embedding though? Does it just depend on what kind of preprocessing is being done or is it just a handwavey throwing things against the wall

final cobalt Jan 26, 2025, 8:55 PM

#

bitter harbor What is the embedder embedding though? Does it just depend on what kind of prepr...

Just jumping in here, jumping back a few messages it sounds to me (with little context) like you're asking what embedding a word actually means

#

If so, this is super cool. I remember a small sense of awe when I learned this

#

Ping and I'll expand

bitter harbor Jan 26, 2025, 9:00 PM

#

I think ive got a decent understanding of what the embedder does, im more curious about what's being fed into it in the first place

jade jay Jan 27, 2025, 3:16 AM

#

Hey, I am relatively new to python (finance major). would someone mind setting me up with some resources for python basics and data science essentials?

#

landed a DS internship for the summer but want to make sure i know a good amount before i get there, still quite a bit behind

serene scaffold Jan 27, 2025, 3:18 AM

#

jade jay landed a DS internship for the summer but want to make sure i know a good amount...

I recommend doing the kaggle pandas tutorial so that you'll know how to manipulate tabular data.

jade jay Jan 27, 2025, 3:18 AM

#

serene scaffold I recommend doing the kaggle pandas tutorial so that you'll know how to manipula...

ill take a look at this right now

serene scaffold Jan 27, 2025, 3:18 AM

#

Have fun!

jade jay Jan 27, 2025, 3:19 AM

#

thank you! would you recommend any subscriptions?

serene scaffold Jan 27, 2025, 3:19 AM

#

No, don't do any.

#

!zen now

arctic wedgeBOT Jan 27, 2025, 3:19 AM

#

The Zen of Python (line 14):

Now is better than never.

serene scaffold Jan 27, 2025, 3:19 AM

#

!zen right now

arctic wedgeBOT Jan 27, 2025, 3:19 AM

#

The Zen of Python (line 15):

Although never is often better than right now.

jade jay Jan 27, 2025, 3:19 AM

#

sounds good

#

@serene scaffold https://www.kaggle.com/learn/pandas

is this the right one

Learn Pandas Tutorials

Solve short hands-on challenges to perfect your data manipulation skills.

serene scaffold Jan 27, 2025, 3:22 AM

#

Yes

jade jay Jan 27, 2025, 3:22 AM

#

perfect, thanks!

#

also how much time would you recommend i dedicate per day leading up to my internship start date (may 18th)? i know everyone has a different learning curve but just want to get an idea of how much i should do

serene scaffold Jan 27, 2025, 3:28 AM

#

jade jay also how much time would you recommend i dedicate per day leading up to my inter...

As much as you feel like doing. Don't burn yourself out.

jade jay Jan 27, 2025, 3:28 AM

#

thats a good point

#

thanks again for your help, ill be back in here pretty frequently

serene scaffold Jan 27, 2025, 3:32 AM

#

Sounds good. I'm here every day because I have issues

jade jay Jan 27, 2025, 4:06 AM

#

i think its cool so

#

not a bad issue

final cobalt Jan 27, 2025, 5:21 AM

#

I need an arithmetic check

#

https://paste.pythondiscord.com/AHUA

#

I'm trying to simulate the reverse diffusion process without the model so I can be sure it's working properly. The forward process seems to be working, but the reverse isn't

remote stream Jan 27, 2025, 5:44 AM

#

Guys i have a problem where when i build an app using pyinstaller , the app i have currently selected or windows explorer automatically closes

final cobalt Jan 27, 2025, 5:52 AM

#

remote stream Guys i have a problem where when i build an app using pyinstaller , the app i ha...

Wrong channel XD

remote stream Jan 27, 2025, 5:59 AM

#

final cobalt Wrong channel XD

it doesnt change the fact that i am in need of desperate help 🥲

#

@sudden canyon can i get help

final cobalt Jan 27, 2025, 6:20 AM

#

    def forward(self, xₜ: torch.Tensor) -> tuple[torch.Tensor, ...]:

        ϵ = []

        for t in range(1, self.timesteps + 1):

            ϵ.append(ϵₜ := torch.randn_like(xₜ))

            ãₜ = self.ã[t]
            b̃ₜ = self.b̃[t]

            xₜ = (xₜ * ãₜ) + (ϵₜ * b̃ₜ)

            if not t % 10:
                self.transforms_reverse(xₜ).save(f"Outputs/forward_{t}.png")

        return xₜ, ϵ


    def reverse(self, xₜ: torch.Tensor, ϵ: list[torch.Tensor]) -> torch.Tensor:

        for t in reversed(range(1, self.timesteps + 1)):

            zₜ = torch.randn_like(xₜ) if t > 1 else 0
            ϵₜ = ϵ.pop()

            bₜ = self.b[t]
            b̃ₜ = self.b̃[t]
            b̃̄ₜ = self.b̃̄[t]
            ãₜ = self.ã[t]

            xₜ = ((xₜ - (ϵₜ * (bₜ / b̃̄ₜ))) / ãₜ) + (zₜ * b̃ₜ)

            if not t % 10:
                self.transforms_reverse(xₜ).save(f"Outputs/reverse_{t}.png")

        return xₜ

#

I've recreated the formal algorithms for forward and reverse perfectly. It should be the case that I take the original and apply noise one step at a time and save the noise, since I don't have a model to predict it for me. Then I pop the noise off in reverse order and apply the denoising algorithm. It should do the thing. But all I get is noise coming back out
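
For comparison, a minimal sketch of a model-free round trip on a single scalar, assuming `ã[t]` is sqrt(1 - βₜ) and `b̃[t]` is sqrt(βₜ) (the linear schedule and all names below are hypothetical, not taken from the paste). With the per-step noise saved, each forward step x ← a·x + b·ϵ has the exact algebraic inverse x ← (x − b·ϵ) / a, with no fresh noise added. The DDPM sampling update is a different thing: as far as I know it expects ϵ to be the cumulative noise a model would predict for step t, and it injects new noise z on top, so feeding it the saved per-step noise isn't expected to return the original.

```python
import math
import random


def make_schedule(timesteps, beta_start=1e-4, beta_end=0.02):
    """Hypothetical linear beta schedule; returns per-step (a_t, b_t) lists."""
    betas = [
        beta_start + (beta_end - beta_start) * i / (timesteps - 1)
        for i in range(timesteps)
    ]
    a = [math.sqrt(1.0 - beta) for beta in betas]  # signal scale per step
    b = [math.sqrt(beta) for beta in betas]        # noise scale per step
    return a, b


def forward(x, a, b, rng):
    """Noise x one step at a time and keep every noise sample."""
    noises = []
    for a_t, b_t in zip(a, b):
        eps = rng.gauss(0.0, 1.0)
        noises.append(eps)
        x = a_t * x + b_t * eps
    return x, noises


def reverse(x, a, b, noises):
    """Undo each forward step exactly, last step first, using the saved noise."""
    for a_t, b_t, eps in zip(reversed(a), reversed(b), reversed(noises)):
        x = (x - b_t * eps) / a_t
    return x


rng = random.Random(0)
a, b = make_schedule(100)
x0 = 0.75
xT, noises = forward(x0, a, b, rng)
x_rec = reverse(xT, a, b, noises)
print(x0, x_rec)
```

If this kind of exact inversion round-trips but the sampling formula doesn't, that points at the mismatch between per-step and cumulative noise rather than at the schedule itself.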

grand breach Jan 27, 2025, 2:59 PM

#

is 5:30 hours for inferencing with distilbert over 300k samples too slow ?

odd meteor Jan 27, 2025, 4:08 PM

#

grand breach is 5:30 hours for inferencing with distilbert over 300k samples too slow ?

5 hours for inference is crazy. If it takes 5hrs just for inference, I can only imagine how computationally expensive the actual model training was 😮

Are you running this on a CPU? If yes, then that explains why.

fervent canopy Jan 27, 2025, 4:10 PM

#

Real-time monitoring, object tracking, and line-crossing detection for CCTV camera streams. https://github.com/SanshruthR/CCTV_SENTRY_YOLO11 https://github.com/user-attachments/assets/e29ad9df-b810-4308-b6a8-4ff81019edea

grand breach Jan 27, 2025, 4:22 PM

#

odd meteor 5 hours for inference is crazy. If it takes 5hrs just for inference, I can only ...

yes even I was thinking this was insane

#

this is with distilbert

#

https://paste.pythondiscord.com/GAEQ

grand breach Jan 27, 2025, 4:46 PM

#

i'm using p100 gpu

#

are there serious bottleneck issues with my code

odd meteor Jan 27, 2025, 5:06 PM

#

grand breach are there serious bottleneck issues with my code

Yes. There's a much better way to do what you're trying to do. If I have time later tonight, I'll respond to this again with an example.

Meanwhile, are you just using the pretrained model for evaluation without fine-tuning on your target data?

grand breach Jan 27, 2025, 5:08 PM

#

odd meteor Yes. There's a much better way to do what you're trying to do. If I have time la...

yes, it's ok thank you, i'll try on my own i'm looking for a good tutorial, one question is my code more cpu intensive

errant bison Jan 27, 2025, 5:10 PM

#

Where could i find some real world ai problem statements which can be then useful for future

past bramble Jan 27, 2025, 5:17 PM

#

how are embeddings calculated for any model? i feel there must be human involvement cuz what other way does it have to know how to tokenize words

unkempt apex Jan 27, 2025, 5:38 PM

#

fervent canopy Real-time monitoring, object tracking, and line-crossing detection for CCTV came...

make it 30fps , so that it will work lag free real time

odd meteor Jan 27, 2025, 6:30 PM

#

grand breach yes, it's ok thank you, i'll try on my own i'm looking for a good tutorial, one ...

You mentioned you're using Tesla P100, so I presume the machine you're using has an active accelerator (based on line 12 of your code)

To verify, print(device) and confirm it's not showing 'cpu'

final cobalt Jan 27, 2025, 8:58 PM

#

https://paste.pythondiscord.com/2R6A

I present to you the cleanest DDPM denoising logic ever written

empty mantle Jan 27, 2025, 9:51 PM

#

The space after the dot at line 13 triggers me for some reason

final cobalt Jan 27, 2025, 9:51 PM

#

empty mantle The space after the dot at line 13 triggers me for some reason

XD

#

My code can be triggering

#

It's been described as "different" more than once. Honestly, I find most people's code illegible

spice ravine Jan 27, 2025, 10:14 PM

#

is deepseek a chatgpt wrapper

serene scaffold Jan 27, 2025, 10:19 PM

#

spice ravine is deepseek a chatgpt wrapper

No, it's an entirely separate LLM

quartz karma Jan 27, 2025, 11:09 PM

#

serene scaffold No, it's an entirely separate LLM

but trained from results given by chatgpt and others?

spring field Jan 27, 2025, 11:11 PM

#

quartz karma but trained from results given by chatgpt and others?

that's unlikely, they probably had their own dataset

quartz karma Jan 27, 2025, 11:13 PM

#

spring field that's unlikely, they probably had their own dataset

Thanks. I kept hearing from news and videos that they kinda used something from LLM predecessors but am totally not sure what that was.

spring field Jan 27, 2025, 11:14 PM

#

tbf, I haven't looked into it too much, but using an LLM to train another seems like a pretty terrible idea ducky_skull

jaunty helm Jan 28, 2025, 3:04 AM

#

quartz karma Thanks. I kept hearing from news and videos that they kinda used something from ...

it's likely that it does
i.e., think about how much of the internet's texts are now LLM-generated; if any of that goes into the training dataset, then technically yes it uses something from predecessors

#

it's also why you'll see a lot of the same LLM-isms across multiple models
cause nearly everyone uses synthetic datasets generated from larger llms

final cobalt Jan 28, 2025, 3:08 AM

#

spring field tbf, I haven't looked into it too much, but using an LLM to train another seems ...

For now

jaunty helm Jan 28, 2025, 3:08 AM

#

example: look up ShareGPT datasets
as the name suggests, all of these originate from conversations between a human and llm; they might've undergone further processing, but still

final cobalt Jan 28, 2025, 3:08 AM

#

There's going to be a tipping point where generative models can produce works good enough to feed other models

jaunty helm Jan 28, 2025, 3:12 AM

#

final cobalt There's going to be a tipping point where generative models can produce works go...

we're already doing that tbh, very commonly in fact
at the very least, stuff spit out by very large llms are often good enough for training smaller ones

final cobalt Jan 28, 2025, 3:13 AM

#

jaunty helm we're already doing that tbh, very commonly in fact at the very least, stuff spi...

Totally. I'm also going to use ChatGPT or similar to build the initial embeddings for magic cards to train a deck builder

#

I'm thinking of a GNN based diffusion model which can diffuse either decks from cards or cards from decks

#

In other news

#

My diffusion model is learning!!!!!

jaunty helm Jan 28, 2025, 3:15 AM

#

sick

final cobalt Jan 28, 2025, 3:15 AM

#

This is definitely structure

#

Early days, but it's further than I've gotten before and it seems to still be learning

past bramble Jan 28, 2025, 4:25 AM

#

final cobalt This is definitely structure

cool what's it tho

final cobalt Jan 28, 2025, 4:25 AM

#

past bramble cool what's it tho

Well it's just a blob right now XD

#

It isn't just pure noise. So... progress

past bramble Jan 28, 2025, 4:31 AM

#

final cobalt Well it's just a blob right now XD

nice what is it supposed to be?

#

i just came here

final cobalt Jan 28, 2025, 4:40 AM

#

Pokemon 🙂

#

Its a little dataset of pokemon images

wispy junco Jan 28, 2025, 4:44 AM

#

guys, so I have this task where I pull files using R, I need to automate this,
I usually use R studio to pull csv files for that particular dates,
can someone share some links that will help me automate it on databricks?
(I'm not even sure if this is the right channel to ask this, if not, do let me know, I'll post it in the correct one lemon_pleading )

serene scaffold Jan 28, 2025, 4:45 AM

#

wispy junco guys, so I have this task where I pull files using R, I need to automate this, ...

is databricks a python thing?

wispy junco Jan 28, 2025, 5:24 AM

#

serene scaffold is databricks a python thing?

databricks is an environment where we can run all kinds of languages and also we can use it to automate stuff and revert to prev versions of the code

#

it's like jupyter but better

peak thorn Jan 28, 2025, 7:42 AM

#

has anyone here done freelance work in AI, ML or DS? can you please share your experience and journey? bcs i'm a beginner in the freelance field

frank niche Jan 28, 2025, 9:49 AM

#

Is anyone still using tflite-model-maker? I cannot get it installed, even the colab notebook referenced in google's tutorial is broken. The devs seem to be aware of this and recommend mediapipe_model_maker, but that does not support audio.

lilac sonnet Jan 28, 2025, 10:30 AM

#

Hey guys I am new here hope everything is fine 🙂

steady hawk Jan 28, 2025, 10:32 AM

#

Welcome to the server :)

grand breach Jan 28, 2025, 12:08 PM

#

what is the right way to choose a model for generating contextual word embeddings ?

buoyant vine Jan 28, 2025, 1:17 PM

#

Normally I just test models with the model or system they're intended to be used with (let's say a classifier) and compare the evaluation results.

#

Normally BERT based models are the gold standard, although pre-computed systems like GloVe can be useful in situations where you have a lot of data and not a lot of compute since GloVe just becomes a lookup in a table rather than sets of matrix operations.
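
To make the "lookup in a table" point concrete, here's a rough sketch of static-embedding text vectors: look each word up and average. The two-dimensional table and the `embed_text` name are invented for illustration; real GloVe files map each word to a much longer vector, and a BERT-style model would instead run the whole sentence through its layers to get context-dependent vectors.

```python
def embed_text(text, table, dim):
    """Average the static vectors of the words that appear in the table."""
    vectors = [table[word] for word in text.lower().split() if word in table]
    if not vectors:
        return [0.0] * dim  # nothing recognised, fall back to a zero vector
    return [sum(column) / len(vectors) for column in zip(*vectors)]


# Hypothetical toy table standing in for a loaded GloVe file.
table = {
    "good": [1.0, 0.0],
    "movie": [0.0, 1.0],
}

print(embed_text("Good movie xyz", table, dim=2))
```

Note that the same word always gets the same vector here, whatever the sentence around it says, which is the main thing contextual models change.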

#

Personally, I've found intfloat/multilingual-e5-large and intfloat/e5-{small/medium/large} to be excellent models for their size and compute cost. Worth making sure whether or not you need a model that can understand multiple languages and the association between words in different languages or if just a single language model works for you.

#

From what I have seen, if you want multilingual models, you will likely have to go with larger models with bigger embedding sizes in order to maintain good accuracy, although again, it depends on the use case

serene grail Jan 28, 2025, 1:40 PM

#

Cool, I haven't heard of GloVe before, I'll have to read about it

buoyant vine Jan 28, 2025, 1:52 PM

#

GloVe is like one of the OG ways of doing text embeddings before BERT and Transformer LLMs became all the rage

grand breach Jan 28, 2025, 3:37 PM

#

buoyant vine Personally, I've found `intfloat/multilingual-e5-large` and `intfloat/e5-{small/...

i'm performing information retrieval (semantic search), i've 300k samples of english text, i thought to go with adv nlp techniques as this is an adv nlp project

#

i'm not really sure about GloVe and how it scales well with large data

frail flower Jan 28, 2025, 4:06 PM

#

I am working on building an app that manages my pantry, my recipes, and my shopping list. I was wondering how effectively I could integrate such paid features as price tracking (such as for bread, eggs, milk, rice, and beans), and ai-generated recipes "using what you have in stock". how "reliable" is ai for recipe creation, and what tricks could I use with my prompts?

serene scaffold Jan 28, 2025, 4:10 PM

#

frail flower I am working on building an app that manages my pantry, my recipes, and my shopp...

I've used ChatGPT to produce recipes. but LLMs can't do math. I once got a ChatGPT recipe and said "scale the recipe by half and convert all the units to grams", and it generated Python code to do that conversion.

frail flower Jan 28, 2025, 4:10 PM

#

serene scaffold I've used ChatGPT to produce recipes. but LLMs can't do math. I once got a ChatG...

yeah, that's what I was worried about. but I guess it could implement the teachings from The Flavor Bible rather well, since not everyone has the time to read that book

serene scaffold Jan 28, 2025, 4:11 PM

#

frail flower yeah, that's what I was worried about. but I guess it could implement the teachi...

so you'd be doing RAG?

frail flower Jan 28, 2025, 4:12 PM

#

serene scaffold so you'd be doing RAG?

Well, that's a possibility, especially if it is selfhosted, but I do want to eventually turn it into a native app on the appstores and/or fdroid, so then we'd get size constraints and copyright issues.

olive crag Jan 28, 2025, 4:23 PM

#

Hello, I'm looking for the best dependency for handwritten OCR.

remote stream Jan 28, 2025, 5:52 PM

#

https://stackoverflow.com/questions/79390642/exe-file-written-in-python-closes-the-currently-selected-app-when-opening

Help!!!!!!!!!!!!!!!!!!!!!!!1
Help!!!!!!!!!!!!!!

serene scaffold Jan 28, 2025, 6:15 PM

#

@remote stream please move your question to #packaging-and-distribution

final cobalt Jan 28, 2025, 7:14 PM

#

I'm having fun

#

I'm working on building an embedding system for magic cards

#

And I'ma build a diffusion based deck/card builder 😄 😄

lapis sequoia Jan 28, 2025, 7:44 PM

#

are the token type ids needed for multilabel classification? Using Bert

serene scaffold Jan 28, 2025, 7:49 PM

#

lapis sequoia are the token type ids needed for multilabel classification? Using Bert

if you're trying to classify each token, then the y value for each token needs to be an n dimensional vector for n classes, where each element i is 1 if that token belongs to the ith class, else 0.

lapis sequoia Jan 28, 2025, 7:49 PM

#

is it BCEWithLogitsLoss?

serene scaffold Jan 28, 2025, 7:49 PM

#

and if you have a sequence of m tokens (such as a sentence), then the y value for the whole sequence is an array of shape (n, m)
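
A small sketch of building that target in plain Python, using the (n, m) layout described above: one row per class, one column per token. The class names, the example labels and the `multi_hot_targets` helper are all hypothetical.

```python
def multi_hot_targets(token_labels, classes):
    """Build an (n, m) 0/1 matrix: y[i][j] is 1 if token j has class i."""
    index = {name: i for i, name in enumerate(classes)}
    n, m = len(classes), len(token_labels)
    y = [[0] * m for _ in range(n)]
    for j, labels in enumerate(token_labels):
        for label in labels:
            y[index[label]][j] = 1
    return y


classes = ["PER", "ORG", "LOC"]
# One set of labels per token; a token may have none, one, or several.
token_labels = [{"PER"}, set(), {"ORG", "LOC"}]

for row in multi_hot_targets(token_labels, classes):
    print(row)
```

Many training loops store this transposed, as (tokens, classes), so it's worth checking which layout your loss function expects.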

lapis sequoia Jan 28, 2025, 7:49 PM

#

for the loss function

serene scaffold Jan 28, 2025, 7:50 PM

#

lapis sequoia is it BCEWithLogitsLoss?

you need a target for calculating the loss, yes.

lapis sequoia Jan 28, 2025, 7:50 PM

#

serene scaffold and if you have a sequence of `m` tokens (such as a sentence), then the y value ...

even if there are five targets ?

serene scaffold Jan 28, 2025, 7:52 PM

#

lapis sequoia even if there are five targets ?

as in, each token can have between zero and five labels?

lapis sequoia Jan 28, 2025, 7:58 PM

#

serene scaffold as in, each token can have between zero and five labels?

ok, with multilabel classification, with Bert, is it bcelosswithlogits because each token is being ran through Bert and the probability that the token type is accurate to one of the tokens has to be x (- [0,1] with the accuracy being higher for each feature being assigned to one of the five(just, you know, some categorical target) target values? is that why it is Bceloss?

serene scaffold Jan 28, 2025, 7:59 PM

#

lapis sequoia ok, with multilabel classification, with Bert, is it bcelosswithlogits because e...

does -[0, 1] mean x: -1 <= x <= 1?

lapis sequoia Jan 28, 2025, 7:59 PM

#

no

#

x is a value between 0 and 1

serene scaffold Jan 28, 2025, 8:02 PM

#

the - is confusing.
suppose you have a sequence with m tokens, and each token can belong to n classes (potentially none of them). then the output from BERT's classification head, after a sigmoid, will be an array of shape (n, m), where each element (i, j) is a number between 0 and 1, representing the probability that the jth token belongs to the ith class.

lapis sequoia Jan 28, 2025, 8:03 PM

#

ok, and is sigmoid the optimizer?

serene scaffold Jan 28, 2025, 8:03 PM

#

sigmoid is an activation function

lapis sequoia Jan 28, 2025, 8:03 PM

#

ok

#

for binary features

serene scaffold Jan 28, 2025, 8:04 PM

#

binary features?

lapis sequoia Jan 28, 2025, 8:05 PM

#

the activation function when trying to predict a value for a target between 0 and 1, not softmax

#

I get it now

serene scaffold Jan 28, 2025, 8:07 PM

#

you can use sigmoid in any situation where you want to squeeze an individual number to be between 0 and 1.

softmax is nice because it squeezes each element in a vector to be between 0 and 1, but proportionally to each other, such that they sum to 1.
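
Here's a rough plain-Python sketch of the difference, plus the binary cross-entropy that a "with logits" loss computes for multi-label targets. The function names are mine, and the loss is averaged over elements as one common convention; a real framework version would work on tensors.

```python
import math


def sigmoid(z):
    """Squash one number into (0, 1), independently of any other number."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)  # avoids overflow for very negative z
    return e / (1.0 + e)


def softmax(zs):
    """Squash a vector into (0, 1) so that the entries sum to 1."""
    shift = max(zs)  # subtracting the max keeps exp() from overflowing
    exps = [math.exp(z - shift) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


def bce_with_logits(logits, targets):
    """Mean binary cross-entropy on raw scores, sigmoid folded in stably."""
    total = 0.0
    for z, y in zip(logits, targets):
        total += max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
    return total / len(logits)


logits = [2.0, 1.0, 0.1]
print([sigmoid(z) for z in logits])  # each in (0, 1), need not sum to 1
print(softmax(logits))               # each in (0, 1), sums to 1
print(bce_with_logits(logits, [1.0, 0.0, 1.0]))
```

With sigmoid every label gets its own yes/no probability, which is why it pairs with binary cross-entropy when a token can carry several labels at once.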

lapis sequoia Jan 28, 2025, 8:07 PM

#

ok, bcewithlogits and sigmoid, no, I was thrown off because multi-label classification values are not treated as categorical

sick eagle Jan 28, 2025, 8:33 PM

#

This is the craziest thing I've seen today.

serene scaffold Jan 28, 2025, 8:34 PM

#

sick eagle This is the craziest thing I've seen today.

does that person define "math" as "doing calculations by hand"?

sick eagle Jan 28, 2025, 8:38 PM

#

serene scaffold does that person define "math" as "doing calculations by hand"?

absolutely not

#

i think he is talking about linear algebra, statistics...

iron basalt Jan 28, 2025, 11:44 PM

#

sick eagle This is the craziest thing I've seen today.

Classic mistake of believing something to be universally true because it was true in your personal experience. And boldly stating it to be so without first looking into it further. A very large portion of the most engaging posts involving knowledge fall in this category in my experience online.

obsidian talon Jan 29, 2025, 3:24 AM

#

Any recognized or valuable certificates for analytics/ML/data science?

serene scaffold Jan 29, 2025, 3:25 AM

#

obsidian talon Any recognized or valuable certificates for analytics/ML/data science?

None. I work for a research company and participate in hiring decisions for the AI division.

obsidian talon Jan 29, 2025, 4:00 AM

#

serene scaffold None. I work for a research company and participate in hiring decisions for the A...

none? IBM? Oracle? Google Analytics?

fervent canopy Jan 29, 2025, 7:57 AM

#

unkempt apex make it 30fps , so that it will work lag free real time

I implemented that

grand breach Jan 29, 2025, 2:34 PM

#

obsidian talon none? IBM? Oracle? Google Analytics?

what i keep hearing is that certificates don't really matter much and they're like cherry on top of cake

serene scaffold Jan 29, 2025, 2:57 PM

#

grand breach what i keep hearing is that certificates don't really matter much and they're li...

that's correct.

spice ravine Jan 29, 2025, 7:06 PM

#

Well seems like Ima be cancelling my gpt subscription then

serene scaffold Jan 29, 2025, 7:07 PM

#

because of deepseek or what

spice ravine Jan 29, 2025, 7:07 PM

#

yea deepseek

serene scaffold Jan 29, 2025, 7:09 PM

#

why does deepseek make you want to cancel your chatgpt subscription?

spice ravine Jan 29, 2025, 9:02 PM

#

Becuz is free and is better than gpt

serene scaffold Jan 29, 2025, 9:07 PM

#

spice ravine Becuz is free and is better than gpt

sure, but even if you download the model weights, setting it up so that you can start asking it stuff is non-trivial and requires beefy hardware.

past bramble Jan 30, 2025, 3:46 AM

#

it's easier with ollama but hardware is the main thing

delicate cargo Jan 30, 2025, 4:51 AM

#

Is anyone around that wants to voice chat? I am working on a system to allow a genetic algorithm to define self organizing automata, and I need to step away from it for a bit, but I want to talk about it with someone

plucky pagoda Jan 30, 2025, 6:07 AM

#

Is there an open-source automated content moderation system that is pre-built and robust?

The approach is to filter content on a CDN as it's coming in, in transmission to the database, and at rest within the database/CDN network.

Machine learning is what I heard I need for this. Can I use ray?

#Project scope.
This is a federation of decentralized cdn.

stable hollow Jan 30, 2025, 8:08 AM

#

me when I do data visualization

#

fervent canopy Jan 30, 2025, 10:59 AM

#

Real-time monitoring, object tracking, and line-crossing detection for CCTV camera streams. https://github.com/SanshruthR/CCTV_SENTRY_YOLO11 https://github.com/user-attachments/assets/e29ad9df-b810-4308-b6a8-4ff81019edea

GitHub

GitHub - SanshruthR/CCTV_SENTRY_YOLO11: Real-time monitoring, objec...

Real-time monitoring, object tracking, and line-crossing detection for CCTV camera streams. - SanshruthR/CCTV_SENTRY_YOLO11

▶ Play video

vivid skiff Jan 30, 2025, 1:37 PM

#

hey guys, do you know any resource to learn about TorchInductor's IR?

nimble mist Jan 30, 2025, 2:05 PM

#

vivid skiff hey guys, do you know any resource to learn about TorchInductor's IR?

I'd also like to know because i'm curious about what's behind PyTorch 2.0.

#

This article might not be very helpful to you, but it's still an interesting read https://dev.to/aaronlangford31/lessons-learned-from-using-torch-inductor-for-inference-1ma7

DEV Community

Lessons Learned from Using Torch Inductor For Inference

The purpose of this blog post is to give an intro to compiling models using Torch Inductor along with...

limber grotto Jan 30, 2025, 2:37 PM

#

Hello everyone
I'm looking to automate report production from datascience and ML reprocessing.
I produce my stats, reprocessings, graphs with pandas, matplotlib, sns ...
I don't have any particular problems with content creation, but I'm more concerned with layout and content use.
What would you recommend for clean formatting/layout to produce printable reports?
I'd like to stick to scriptable python, and avoid PowerBI or similar.
Thanks

serene scaffold Jan 30, 2025, 2:53 PM

#

limber grotto Hello everyone I'm looking to automate report production from datascience and ML...

what do you want the format of the output to be? pdf? html?

serene scaffold Jan 30, 2025, 2:53 PM

#

plucky pagoda Is there an open-source automated content moderation system that is pre-built and...

!source filter

arctic wedgeBOT Jan 30, 2025, 2:53 PM

#

Command: filter

Group for managing filters.

Source Code

Go to GitHub

limber grotto Jan 30, 2025, 2:56 PM

#

serene scaffold what do you want the format of the output to be? pdf? html?

no preference , just easy printable
I believe HTML is not the best format to print, perhaps pdf or docx could be better

#

the main question is how to achieve a clean layout

vivid skiff Jan 30, 2025, 3:30 PM

#

nimble mist This article might not be very helpful to you, but it's still an interesting rea...

Thank you

vivid skiff Jan 30, 2025, 3:30 PM

#

nimble mist I'd also like to know because i'm curious about what's behind PyTorch 2.0.

I'll let you know if I find something

fluid basalt Jan 30, 2025, 4:05 PM

#

What opened the passion to you all for Data science? Been plucking through the code academy career path, but about 45% through I have been losing steam on doing daily four to six hour steady sessions.

#

I really enjoy each part of the actual practical data analysis but man there is a wide world of things to learn. Just wondering what projects yall have undertaken that are exciting to give me a glimpse of the finish line ya know?

stuck tapir Jan 30, 2025, 4:09 PM

#

fluid basalt What opened the passion to you all for Data science? Been plucking through the c...

yea theres alot to take in but imo instead of putting it all on one day or smth just have fun and take time to digest it

fluid basalt Jan 30, 2025, 4:10 PM

#

I have been chewing through it for the last two weeks, trying not to burn out, but dont wanna lose steam on learning ya know?

#

preparing for my UoT Data science program come september

stuck tapir Jan 30, 2025, 4:12 PM

#

ooh
what works for me is just try pacing yourself like do some projects which excite u or smth bcuz for me that always helps with burnout n stuff

#

like in between learning do some fun projs and then reinforce too

fluid basalt Jan 30, 2025, 4:13 PM

#

I agree, I run into the logical fallacy of: if I keep learning, I will increase my mental toolbelt to solve x or y problem, ya know?

#

I api called a lot of defunct insurance data and have been making a jupyter notebook as a portfolio project but I feel I am approaching the unknown unknowns of what I can do with it

peak thorn Jan 30, 2025, 7:36 PM

#

i have cuda version 11.5 so which version of cuDNN should i install for running tensorflow?

sullen herald Jan 30, 2025, 8:33 PM

#

peak thorn i have cuda version 11.5 so which version of cuDNN should i install for running ...

you can refer to this list for tensorflow gpu compatibility w/ cuda versions
https://www.tensorflow.org/install/source#gpu

TensorFlow

Build from source | TensorFlow

unkempt wigeon Jan 30, 2025, 10:41 PM

#

How fast is pytorch compared to tensorflow and keras?

serene scaffold Jan 30, 2025, 10:45 PM

#

unkempt wigeon How fast is pytorch compared to tensorflow and keras?

there isn't a straightforward answer for this question. just use pytorch.

plucky pagoda Jan 31, 2025, 12:34 AM

#

unkempt wigeon How fast is pytorch compared to tensorflow and keras?

Are you using cpu pytorch or gpu pytorch ?

unkempt wigeon Jan 31, 2025, 12:36 AM

#

plucky pagoda Are you using cpu pytorch or gpu pytorch ?

Well, I've been trying to figure out if my computer can handle CPU, because I can't tell if I have a GPU. I've been trying to look into my computer's model but I've been getting a headache lately

plucky pagoda Jan 31, 2025, 12:37 AM

#

There is something called ray.
https://www.ray.io/
@unkempt wigeon

Scale Machine Learning & AI Computing | Ray by Anyscale

Ray is an open source framework for managing, executing, and optimizing compute needs. Unify AI workloads with Ray by Anyscale. Try it for free today.

unkempt wigeon Jan 31, 2025, 1:19 AM

#

plucky pagoda Are you using cpu pytorch or gpu pytorch ?

Is there a way of figuring out if my terminal has a GPU?
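
One way to check without installing anything: the sketch below asks `nvidia-smi`, the tool that ships with the NVIDIA driver, for the GPU names. The function name `nvidia_gpu_names` is made up for illustration, and it only detects NVIDIA cards, on the assumption that a CUDA build of PyTorch is what matters here.

```python
import shutil
import subprocess


def nvidia_gpu_names():
    """Return a list of NVIDIA GPU names, or [] if none can be detected."""
    # nvidia-smi is installed with the NVIDIA driver; if it is not on PATH,
    # there is probably no usable NVIDIA GPU (or no driver installed).
    exe = shutil.which("nvidia-smi")
    if exe is None:
        return []
    try:
        result = subprocess.run(
            [exe, "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True, text=True, timeout=10, check=True,
        )
    except (subprocess.SubprocessError, OSError):
        # The tool exists but failed, e.g. the driver is not loaded.
        return []
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


if __name__ == "__main__":
    names = nvidia_gpu_names()
    print(names if names else "No NVIDIA GPU detected; PyTorch would run on CPU.")
```

If PyTorch is already installed, `torch.cuda.is_available()` answers the same question from inside Python.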

stuck tapir Jan 31, 2025, 1:55 AM

#

yall whats a good platform to start freelancing on w/ python

peak thorn Jan 31, 2025, 3:03 AM

#

Why are there so many problems installing tensorflow with your gpu config..

Whereas pytorch is so simple compared to the tensorflow installation.

serene scaffold Jan 31, 2025, 3:14 AM

#

peak thorn Why there are so many problems with tensorflow to install it your gpu config..<:...

Tensorflow development is winding down, so there might be new compatibility issues coming up that aren't getting or won't be fixed.

sinful surge Jan 31, 2025, 5:09 AM

#

i want to scrape the data from the nansen website for the realtime update of the values, this is the code :-

# Extract data
trending_data = []
for row in soup.select("div.MuiBox-root.mui-style-70qvj9"):  # Update selector based on actual HTML
    try:

how shall i do that can anyone help

Screenshot_2025-01-31_at_10.39.37_AM.png

#

No data found. Check your selectors or the website structure.

am getting this again and again.

coarse rivet Jan 31, 2025, 5:59 AM

#

Hi, i am new to ML. I know basic ml training, like how to train a model to recommend music to the user based on their age, gender etc. Now i want to train a model to give insight on how a website path is doing from its daily metrics like session time, total sessions, bounce rate etc. Any advice on anything will help, i am still researching how to start and what to do. Talking to chatgpt

untold fable Jan 31, 2025, 9:05 AM

#

what type of project should i make for a research internship

#

to get a research internship

storm kelp Jan 31, 2025, 3:44 PM

#

spice ravine Becuz is free and is better than gpt

Is it actually better though?

red ledge Jan 31, 2025, 3:51 PM

#

Hello

untold bloom Jan 31, 2025, 4:17 PM

#

i used keras v3 with torch backend and compared to pure pytorch it was much faster, both on GPU

#

but neither i have a reproducible example of that nor i claim i managed to do GPU adaptation properly in PyTorch

#

but the intriguing thing was that i didn't have to do anything (nothing) in keras for GPU adaptation

#

but the drawback was writing a custom loss function, passing the epoch index, using an adaptive learning rate was way harder to bake into the Keras code

#

so tradeoffs yet again

untold bloom Jan 31, 2025, 4:22 PM

#

untold bloom i used keras v3 with torch backend and compared to pure pytorch it was much fast...

quantitatively: 10-15 times faster; also I had found a post on the PyTorch discourse where another person was suffering a similar loss of performance, so I'm not alone on that front, I thought/think

sullen herald Jan 31, 2025, 5:10 PM

#

untold bloom quantitatively: 10-15 times faster; also I had found a post on discourse of PyTor...

10-15x faster seems like a huge number. Are you sure the pytorch pipeline is correctly implemented? In my experience, I have also found the keras v3 torch backend a little faster than pytorch (tabular/image data), but not a significant boost.

keras v3 was a significant improvement over prior versions, supporting jax/torch backend. But I still find it complex when adding custom callbacks as you mentioned. I only prefer using it for quick prototyping or sometimes tabular datasets.
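
For a comparison like this, the timing method itself can produce big gaps. Below is a minimal timing sketch, not anyone's actual benchmark: the helper name `benchmark` and its parameters are made up for illustration. It discards warmup runs and takes an optional `sync` callable, because GPU work is queued asynchronously and a timer stopped without waiting may measure only the launch.

```python
import statistics
import time


def benchmark(step, warmup=3, repeats=10, sync=None):
    """Time `step()` and return the median seconds per call.

    `sync` is an optional callable that blocks until queued device work
    has finished; without it, GPU timings can be misleadingly small.
    """
    for _ in range(warmup):
        step()                      # discard one-time costs (compilation, caches)
    if sync is not None:
        sync()                      # make sure warmup work is done before timing
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        step()
        if sync is not None:
            sync()                  # wait for the work to really finish
        timings.append(time.perf_counter() - start)
    return statistics.median(timings)
```

With PyTorch on a GPU, `torch.cuda.synchronize` could be passed as `sync`; the same harness can wrap one training step of each framework so both are measured the same way.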

stuck tapir Jan 31, 2025, 5:11 PM

#

Yeah 10-15x sounds like a significant change

untold bloom Jan 31, 2025, 5:20 PM

#

yeah as said, I don't claim i managed to do GPU adaptation properly in PyTorch

hollow pagoda Jan 31, 2025, 5:22 PM

#

coarse rivet Hi i am new to ML. I know basic ml training how to train ml to recommend music t...

Using those metrics or even features made to be used within the path as features trained on their retention/interaction data is a good starting stone

smoky basalt Jan 31, 2025, 5:26 PM

#

wat tutorial should i use as beginner to data visualisation

#

bc im pretty sure data visualisation is required as a start to learning ml

sullen herald Jan 31, 2025, 5:27 PM

#

untold bloom yeah as said, I don't claim i managed to do GPU adaptation properly in PyTorch

If you are doing any benchmarking keras v3 v/s torch, I will be very keen to help/contribute.

sullen herald Jan 31, 2025, 5:31 PM

#

smoky basalt wat tutorial should i use as beginner to data visualisation

I personally feel EDA is something which you learn more with practise. I would recommend doing a couple of basic courses from coursera (I did this one: "Applied Plotting, Charting & Data Representation in Python"), or the data visualization mini course on Kaggle at https://www.kaggle.com/learn, it's fun.

Learn Python, Data Viz, Pandas & More | Tutorials | Kaggle

Practical data skills you can apply immediately: that's what you'll learn in these no-cost courses. They're the fastest (and most fun) way to become a data scientist or improve your current skills.

untold bloom Jan 31, 2025, 5:33 PM

#

sullen herald If you are doing any benchmarking keras v3 v/s torch, I will be very keen to hel...

hmm thanks, maybe I can come up with an MRE, data was public (a Kaggle competition actually), then share what I have here

sullen herald Jan 31, 2025, 5:34 PM

#

untold bloom hmm thanks, maybe I can come up with an MRE, data was public (a Kaggle competiti...

Sounds good, thanks. Which competition btw?

untold bloom Jan 31, 2025, 5:35 PM

#

an old one actually, https://www.kaggle.com/competitions/seizure-prediction/

sullen herald Jan 31, 2025, 5:37 PM

#

EEG signals, interesting. Did you use Recurrent networks or transformers?

untold bloom Jan 31, 2025, 5:44 PM

#

CNN actually :p

#

on the spectrogram of the signals

#

it was not my original idea, of course

#

and my aim was to compare some models under this dataset with some specific stuff, so neither the dataset itself nor the base model was too critical

#

only that it ought to have been a time-series based dataset

#

and that the base model wasn't too "incapable" (sorry logistic regression, you are loved too)

spiral whale Jan 31, 2025, 5:56 PM

#

can i somehow download an open source llm model and run it locally with tensorflow?

#

or keras?

serene scaffold Jan 31, 2025, 5:57 PM

#

spiral whale can i somehow download an open source llm model and run it locally with tensorfl...

use pytorch
the huggingface website has code for each model for how to run it locally with pytorch. but keep in mind that a lot of models require enterprise hardware.

#

which one do you want to use?

spiral whale Jan 31, 2025, 5:57 PM

#

why pytorch tho? and any, but small one, not rlly planning to do super accurate things

#

i used to use keras. Is it outdated or something?

serene scaffold Jan 31, 2025, 5:58 PM

#

yes, it is

spiral whale Jan 31, 2025, 5:58 PM

#

oh, sadge. okey then

#

so hugging face has model weights?

serene scaffold Jan 31, 2025, 5:58 PM

#

exactly

spiral whale Jan 31, 2025, 5:58 PM

#

and layers i believe

#

okey, ty

spiral whale Jan 31, 2025, 5:59 PM

#

one last thing. I remember doing my custom data augmentation class with keras for my own project. Tho i needed to fork keras and make it not to convert images into RGB (RGBA images will be RGB). Can i do the same with pytorch?

serene scaffold Jan 31, 2025, 6:00 PM

#

so you're trying to do a task with images, but the color channels are RGBA and not RGB?

smoky basalt Jan 31, 2025, 6:00 PM

#

would i need to learn data visualisation for ml?

spiral whale Jan 31, 2025, 6:00 PM

#

serene scaffold so you're trying to do a task with images, but the color channels are RGBA and n...

yeah like, my augmentation was giving a random background to RGBA images. Couldnt with RGB

serene scaffold Jan 31, 2025, 6:01 PM

#

smoky basalt would i need to learn data visualisation for ml?

matplotlib is the standard data visualization tool, though it's not very pythonic, unfortunately.

spiral whale Jan 31, 2025, 6:01 PM

#

this has nothing to do with the LLM, is a different project, but wondering if i could, since imma move to pytorch

smoky basalt Jan 31, 2025, 6:01 PM

#

serene scaffold matplotlib is the standard data visualization tool, though it's not very pythoni...

so i should learn it first then jump to tensorflow

serene scaffold Jan 31, 2025, 6:01 PM

#

smoky basalt so i should learn it first then jump to tensorflow

learn pytorch instead of tensorflow. by the time you're ready to get a job, tensorflow will probably be completely dead.

smoky basalt Jan 31, 2025, 6:02 PM

#

serene scaffold learn pytorch instead of tensorflow. by the time you're ready to get a job, tens...

ok

#

so where should i start?

#

bc theres pandas, numpy, and all this stuff

#

sci kit

serene scaffold Jan 31, 2025, 6:03 PM

#

spiral whale this has nothing to do with the LLM, is a different project, but wondering if i...

I don't know enough about image processing for this. I imagine you can have an adapter to handle any RGBA <-> RGB conversions.
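
A rough sketch of what such an adapter could look like, assuming the goal is the augmentation described above (flatten RGBA onto a random background and end up with RGB). It is plain Python on lists of pixel tuples so the blending is visible; the names `composite_over` and `random_background` are made up for illustration, and wiring it into a PyTorch dataset is not shown.

```python
import random


def composite_over(pixel_rgba, background_rgb):
    """Alpha-blend one RGBA pixel (0-255 ints) over an opaque RGB background."""
    r, g, b, a = pixel_rgba
    alpha = a / 255.0
    return tuple(
        round(fg * alpha + bg * (1.0 - alpha))
        for fg, bg in zip((r, g, b), background_rgb)
    )


def random_background(image_rgba, rng=random):
    """Flatten an RGBA image (rows of RGBA tuples) onto one random solid colour."""
    background = tuple(rng.randrange(256) for _ in range(3))
    return [[composite_over(px, background) for px in row] for row in image_rgba]


if __name__ == "__main__":
    # One opaque red pixel and one fully transparent pixel.
    image = [[(255, 0, 0, 255), (0, 0, 0, 0)]]
    print(random_background(image))
```

Since the blending happens on the raw pixels before anything is converted to RGB, the framework never has to see the alpha channel, so no fork should be needed for this part.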

spiral whale Jan 31, 2025, 6:03 PM

#

okey, will look for it. Thanks 🙂

latent girder Jan 31, 2025, 6:03 PM

#

Hi. Do you have any recommended pandas tutorial for beginners? Most stuffs from youtube are too advanced, not detailed enough and too fast paced

serene scaffold Jan 31, 2025, 6:03 PM

#

smoky basalt bc theres pandas, numpy, and all this stuff

I recommend that you first learn how to use pandas to manipulate and explore data, so that you get a sense for what "data" is in the context of data science.

smoky basalt Jan 31, 2025, 6:03 PM

#

serene scaffold I recommend that you first learn how to use pandas to manipulate and explore dat...

ok

smoky basalt Jan 31, 2025, 6:04 PM

#

latent girder Hi. Do you have any recommended pandas tutorial for beginners? Most stuffs from ...

well this was pretty convenient

serene scaffold Jan 31, 2025, 6:04 PM

#

latent girder Hi. Do you have any recommended pandas tutorial for beginners? Most stuffs from ...

the kaggle pandas tutorial. it's interactive. (which is important, because you won't passively learn from watching youtube)

latent girder Jan 31, 2025, 6:04 PM

#

serene scaffold the kaggle pandas tutorial. it's interactive. (which is important, because you w...

can you send me the link please?

smoky basalt Jan 31, 2025, 6:04 PM

#

serene scaffold the kaggle pandas tutorial. it's interactive. (which is important, because you w...

any yt tutorials that can help if stuck?

smoky basalt Jan 31, 2025, 6:04 PM

#

latent girder can you send me the link please?

https://www.kaggle.com/learn

Learn Python, Data Viz, Pandas & More | Tutorials | Kaggle

Practical data skills you can apply immediately: that's what you'll learn in these no-cost courses. They're the fastest (and most fun) way to become a data scientist or improve your current skills.

serene scaffold Jan 31, 2025, 6:04 PM

#

latent girder can you send me the link please?

sure, but it's the first search result for "kaggle pandas tutorial". https://www.kaggle.com/learn/pandas

Learn Pandas Tutorials

Solve short hands-on challenges to perfect your data manipulation skills.

latent girder Jan 31, 2025, 6:05 PM

#

Thank you

spiral whale Jan 31, 2025, 6:08 PM

#

@serene scaffold why do u recommend learning pytorch rather than TF itself?

serene scaffold Jan 31, 2025, 6:08 PM

#

spiral whale @serene scaffold why do u recommend learning pytorch rather than TF itself?

why do you say "TF itself"? is there a relationship that you think exists between the two?

spiral whale Jan 31, 2025, 6:09 PM

#

yeah, but same with keras / TF. One is a higher level framework. I know pytorch is built in top of tf, but tf gives more flexibility, doesnt it?

#

like, is pytorch just more user friendly?

serene scaffold Jan 31, 2025, 6:09 PM

#

"I know pytorch is built in top of tf"
this is false.

spiral whale Jan 31, 2025, 6:09 PM

#

😮

#

oh well, still, why pytorch over tf?

smoky basalt Jan 31, 2025, 6:10 PM

#

spiral whale like, is pytorch just more user friendly?

i think its bc tensorflow is outdated and maybe not as fast and lacking features

spiral whale Jan 31, 2025, 6:10 PM

#

oh, tf is outdated too?

#

i thought TF2 was a thing

smoky basalt Jan 31, 2025, 6:10 PM

#

apparently

#

from wat im inferring

latent girder Jan 31, 2025, 6:10 PM

#

serene scaffold sure, but it's the first search result for "kaggle pandas tutorial". https://www...

can i do my inputs on vscode or any other ide? Ill just be referring to the guide right?

#

the first section is creating my own dataframe anyway, which i could do with another ide?

serene scaffold Jan 31, 2025, 6:11 PM

#

I've never seen anyone in industry use tensorflow, and it seems that development of tensorflow is winding down. as far as I can tell, the only reason anyone still uses tensorflow is because of tutorials that have been written for it.

smoky basalt Jan 31, 2025, 6:11 PM

#

yk for this y is there 0, 1

#

#

or is that js to help

#

auto generated?

spiral whale Jan 31, 2025, 6:12 PM

#

serene scaffold I've never seen anyone in industry use tensorflow, and it seems that development...

i see... I thought TF was maintained by google, same as keras. Didnt know they abandoned both

smoky basalt Jan 31, 2025, 6:12 PM

#

to help navigate row numbers?

spiral whale Jan 31, 2025, 6:12 PM

#

Pytorch is from community i guess?

serene scaffold Jan 31, 2025, 6:12 PM

#

latent girder can i do my inputs on vscode or any other ide? Ill just be referring to the guid...

you run the code in the kaggle pandas tutorial
but in general, you can use whatever code editor you want. VSC, pycharm, etc. have no bearing on how the code is actually executed.

serene scaffold Jan 31, 2025, 6:12 PM

#

spiral whale i see... I thought TF was maintained by google, same as keras. Didnt know they a...

google loves to abandon stuff.

smoky basalt Jan 31, 2025, 6:13 PM

#

oh wait u can set custom index

spiral whale Jan 31, 2025, 6:13 PM

#

okey, ty

sullen herald Jan 31, 2025, 6:14 PM

#