past meteor Jul 29, 2024, 12:05 PM

#

I haven't started the new role yet but I'm curious how big their maturity is on keeping costs down

lapis sequoia Jul 29, 2024, 12:05 PM

#

pretty nice summary:

SIMD is the 'concept', SSE/AVX are implementations of the concept. All SIMD instruction sets are just that, a set of instructions that the CPU can execute on multiple data points. As long as the CPU supports executing the instructions, then it is feasible for multiple SIMD instruction sets to coexist, regardless of data size.

past meteor Jul 29, 2024, 12:05 PM

#

In my current role people are just happy if it's running

#

My boss keeps paying absolutely insane cloud bills without knowing why they're so high

buoyant vine Jul 29, 2024, 12:06 PM

#

I guess it depends what market you're in

#

I am in adtech so cost per user/site/operation has an big impact

past meteor Jul 29, 2024, 12:06 PM

#

Right now I'm in R&D with many people that aren't really concerned about going to prod

#

The worst thing is that one guy decided to put all our services in 1 azure resource group

#

Drives me crazy

buoyant vine Jul 29, 2024, 12:08 PM

#

How is the whole Azure AI experience? I could never understand their naming schemes or pricing of services

past meteor Jul 29, 2024, 12:08 PM

#

To figure out the cost of a project you need to find/filter each individual resource and tally up their costs instead of just taking the entire RG

#

Hmm, in all honesty. We do all ML/AI stuff on prem

#

We have 2 Quadro cards, it's very comfy

buoyant vine Jul 29, 2024, 12:09 PM

#

Fair enough

past meteor Jul 29, 2024, 12:10 PM

#

The place I'm moving to does everything in databricks so that's that

#

I'm not sure how I feel about being bound to spakr for everything

#

I guess streamlining your stack saves engineering time

buoyant vine Jul 29, 2024, 12:11 PM

#

Tbh imo, ML/AI wise, as long as what ever you are doing can go into ONNX, you're entire deploy setup and infra side is just a breeze

past meteor Jul 29, 2024, 12:12 PM

#

I don't want to get philosophical but

#

The bigger issue is that the people working on the models are typically not engineers and have never heard of ONNX

#

But my sample size are just people I know irl of course

buoyant vine Jul 29, 2024, 12:13 PM

#

So I agree, but I argue that it is not hard to introduce them to onnx. Especially if you're using something like PyTorch, really all it becomes is a "Hey just add this 10 lines of python" or you can do it via CI or some CLI tool for example

#

Sometimes a magic script is the best kind of script

past meteor Jul 29, 2024, 12:14 PM

#

I see what you mean

#

I'm projecting a bit, most of my colleagues deliberately scoped things to never have to worry about prod

#

which is entirely possible in R&D, just call it something like "proof-of-value" and then it's done

buoyant vine Jul 29, 2024, 12:16 PM

#

Yeah, I mean I think that is probably a good thing, but maybe the missing step there is "we create this model, send it to the blackbox in CI that does the training and then spits back results"

past meteor Jul 29, 2024, 12:16 PM

#

Which is unironically what I was trying to make

buoyant vine Jul 29, 2024, 12:17 PM

#

Effectively what I ended up doing for our web classifier model which I actually don't understand why it isn't a mainstream library, is a framework that lets multiple people work on seperate models within the repo, and then run them via just run_model("my.path.to.pthon_file:MyClass") and then everything else is handled for them, which also lets me keep the system generic and abstract enough to automatically convert the models, load datasets, etc...

#

Probably the best bit of time I invested in and 2k LOC was setting that up, because it just made life so much better

past meteor Jul 29, 2024, 12:19 PM

#

What was your reasoning of not just having them give you the trained model

#

Is it something they couldn't do from their machines?

#

Or machines they have access to

buoyant vine Jul 29, 2024, 12:20 PM

#

They depend on CI /remote servers for training since which multiple GPUs and 24GB VRAM

#

and they are all working on the same repo

#

the biggest issue is when you have multiple people working on different models to do the same thing in order to compare, is you end up with them writing lots of bits of random code all over the place and different entrypoints to train and test the models

past meteor Jul 29, 2024, 12:21 PM

#

Specifically CI for model training is interesting because I can think of many times I'd want a new mode in a way that is decoupled to my commits

#

Like, new data arriving

#

But I suspect your use case is vastly different from the ones I have in mind

buoyant vine Jul 29, 2024, 12:22 PM

#

this approach meant the entry points for training, testing and exporting all are the same across models, and the testing system didn't have to be specialized for each model

buoyant vine Jul 29, 2024, 12:23 PM

#

past meteor But I suspect your use case is vastly different from the ones I have in mind

I think so yeah, in our case we are targetting one end goal with the models, and it is more just a way of comparing different models with rapid development

#

the only painpoint is the system never really ended up setting up checkpoints

#

so if it failed right at the end you'd be 🫡 Waiting another 24 hours

lapis sequoia Jul 29, 2024, 12:24 PM

#

pheeeww

covert cave Jul 29, 2024, 1:54 PM

#

hi, from sklearn.preprocessing import MinMaxScaler,StandardScaler sc=StandardScaler()
X_train= sc.fit_transform(X_train)
X_test= sc.fit_transform(X_test)
for these codes I got this error: TypeError: Property names are only supported if all input properties have string names, but your input contains ['str', 'tuple'] as property name/column name types. If you want the property names to be stored and validated, you should convert them all to strings, for example using X.columns = X.columns.astype(str). Otherwise, you can remove the property/column names from your input data or convert them all to a non-string data type. do u know?how can I fix this error?

#

How do you suggest I do this?

past meteor Jul 29, 2024, 2:01 PM

#

Put it in 3 backticks like so ``` your code ```

past meteor Jul 29, 2024, 2:01 PM

#

covert cave How do you suggest I do this?

Also, you can put python between the 3 backticks, like so:

```python
X_train= sc.fit_transform(X_train)
X_test= sc.fit_transform(X_test)
```

that gives you

X_train= sc.fit_transform(X_train)
X_test= sc.fit_transform(X_test)

covert cave Jul 29, 2024, 2:09 PM

#

past meteor Also, you can put python between the 3 backticks, like so: \```python X_train= ...

No unfourtanetly, I got this error in this time :SyntaxError: invalid syntax

serene scaffold Jul 29, 2024, 2:10 PM

#

covert cave No unfourtanetly, I got this error in this time :SyntaxError: invalid syntax

can you show the whole exact code and the whole error message?

#

!paste

arctic wedgeBOT Jul 29, 2024, 2:10 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

past meteor Jul 29, 2024, 2:11 PM

#

covert cave No unfourtanetly, I got this error in this time :SyntaxError: invalid syntax

I was just talking about the formatting

covert cave Jul 29, 2024, 2:20 PM

#

I solved it now , It worked after disabling name verification like X_train= sc.fit_transform(X_train.values) . thanks a lott

errant bison Jul 29, 2024, 3:04 PM

#

I'm trying to create an ai which helps in traffic congestion. How can i do that

serene scaffold Jul 29, 2024, 3:17 PM

#

errant bison I'm trying to create an ai which helps in traffic congestion. How can i do that

I don't recommend this as a first project as it will be exceptionally challenging and you probably won't make any progress before losing motivation.

You'd need a traffic simulation where the model can control things like the color of each traffic light. and you can probably use a reinforcement learning approach where the model is trying to maximize the speed of each vehicle.

orchid forge Jul 29, 2024, 3:18 PM

#

what does this "notions of dimensional modelling, ETL and the basics of data engineering go a long way" part means?

errant bison Jul 29, 2024, 4:10 PM

#

serene scaffold I don't recommend this as a first project as it will be exceptionally challengin...

Not my first project, but i had created to detect the vehicles, but i want to add any unique feature. This seems just like google maps. If u have any idea then pls share!

spare forum Jul 29, 2024, 5:41 PM

#

orchid forge what does this "notions of dimensional modelling, ETL and the basics of data eng...

Etl is extract transform load, the first step to data project to prepare data i recommend seeing thing in detail, in some team it will be the entire job of a data engineer, but you still need to know about it sometimes you have to do it on smaller project

#

also there is different data architectures note that etl isn't the only way to go

past meteor Jul 29, 2024, 7:00 PM

#

orchid forge what does this "notions of dimensional modelling, ETL and the basics of data eng...

Yeah, the answer that @spare forum gave is spot on, exactly what I meant

umbral delta Jul 29, 2024, 7:58 PM

#

so i have a bunch of images that look like this, along with the same image with a red outline around the number, and a mask black/white image. i have an iot device thats takes centered pictures of the water meter centered / always in a similar place, how could i center the data in the dataset?

#

pallid badge Jul 29, 2024, 8:54 PM

#

Hi, are there also Discord channels for scientific computing around or Slack communities? How does one find such things?

pallid badge Jul 29, 2024, 8:56 PM

#

spare forum Etl is extract transform load, the first step to data project to prepare data i ...

Can I ask a question about ETL? Is this not similiar to data processing from HDF5 to eiher another HDF5 or something else?

#

I import data, clean it, process it, and save it somewhere?

serene scaffold Jul 29, 2024, 11:38 PM

#

@dreamy topaz "object oriented programming" means different things to different people. If you want to do machine learning in python, you should know how to make a class in python, and know how to use classes that other people have created.

nova matrix Jul 30, 2024, 2:07 AM

#

I have a dataframe column with one trend where the null values lay between 2 known rows
and then sudden blocks of NaN values (50 rows)
i want to fil only the first trend of Nans
and not the second
how would I achieve this

serene scaffold Jul 30, 2024, 3:11 AM

#

@nova matrix what method do you want to use to fill nans in the first trend

nova matrix Jul 30, 2024, 3:18 AM

#

serene scaffold <@879805921302290472> what method do you want to use to fill nans in the first t...

mean ideally

#

mean of the value before and value after

#

I tried different methods but my dataset has 50 million rows so I was wondering if I could do this with a package

serene scaffold Jul 30, 2024, 3:26 AM

#

@nova matrix you can use isna and shift
I'm standing on a train or I'd show you

#

https://pandas.pydata.org/docs/reference/api/pandas.Series.shift.html

#

https://pandas.pydata.org/docs/reference/api/pandas.Series.isna.html

#

https://pandas.pydata.org/docs/reference/api/pandas.Series.notna.html

#

@nova matrix these are the key

scenic parcel Jul 30, 2024, 3:58 AM

#

What's a nice solvable problem for LSTMs

nova matrix Jul 30, 2024, 4:04 AM

#

serene scaffold <@879805921302290472> you can use isna and shift I'm standing on a train or I'd ...

that saves sooo much time thankss man

serene scaffold Jul 30, 2024, 4:04 AM

#

nova matrix that saves sooo much time thankss man

Did you solve it

nova matrix Jul 30, 2024, 4:05 AM

#

serene scaffold Did you solve it

About to implement But think it will work
will lyk

orchid forge Jul 30, 2024, 4:28 AM

#

spare forum also there is different data architectures note that etl isn't the only way to g...

Oh okay

eager plume Jul 30, 2024, 4:51 AM

#

I'm new to python world.
I wanna ask about data science's prerequisites, roadmap, curriculum.

Is the ML,DL and DA,Stats combines together to become Data Science?????

unkempt apex Jul 30, 2024, 5:07 AM

#

eager plume I'm new to python world. I wanna ask about data science's prerequisites, roadmap...

There is a ocean named as "AI" and all rivers ended in that ocean!

orchid forge Jul 30, 2024, 5:14 AM

#

I'm so fucking happy, finally I'm understanding how to do exploratory data properly, I'm literally getting it now.

#

The world ways of understanding is so fake and stupid, I came up with my way to understand which is crazy

#

My solution to solve a freaking problem is "don't ask people for help"

serene scaffold Jul 30, 2024, 5:16 AM

#

orchid forge I'm so fucking happy, finally I'm understanding how to do exploratory data prope...

What did you do

wooden sail Jul 30, 2024, 5:17 AM

#

orchid forge The world ways of understanding is so fake and stupid, I came up with my way to ...

i would generally be wary of this, especially when doing something math-heavy

orchid forge Jul 30, 2024, 5:18 AM

#

THAT YOU DON'T QUESTION THE FREAKING DATA AT THE BEGINNING ITSELF

serene scaffold Jul 30, 2024, 5:18 AM

#

What? Why are you shouting?

orchid forge Jul 30, 2024, 5:18 AM

#

People keep saying this thing that first you understand your problem statement bla bla bla bullshit

orchid forge Jul 30, 2024, 5:18 AM

#

serene scaffold What? Why are you shouting?

Because I was so stupid back then

serene scaffold Jul 30, 2024, 5:20 AM

#

orchid forge People keep saying this thing that first you understand your problem statement b...

The first step is actually to accept that you haven't done any data science.

orchid forge Jul 30, 2024, 5:21 AM

#

The first step is actually to accept the fact that even a data scientist/analyst can't help you because he/she would only tell you what everyone would tell you

wooden sail Jul 30, 2024, 5:22 AM

#

you might consider that if everyone tells you something, including experts in the field, maybe you're wrong

orchid forge Jul 30, 2024, 5:23 AM

#

I'm not questioning any different solution coming from a expert, it's just that now I have found "MY OWN" way to solve shit

#

That's all which is 100000% better and it's working for me

wooden sail Jul 30, 2024, 5:24 AM

#

and are you well equipped to show and explain why the method works?

past meteor Jul 30, 2024, 7:55 AM

#

orchid forge People keep saying this thing that first you understand your problem statement b...

because it gives you jumping off points for (interesting) places to look at in your data

#

Otherwise you're looking for a needle in a haystack. Look at it this way, how much harder would it be to do data analysis if I took a dataset with a meaning and renamed all the columns into A, B, C, D etc.

#

I've seen domain experts make assumptions and hypotheses that were incorrect but that in and of itself is very important. It lets you find out other interesting things like maybe your data quality is bad, the measurements are incorrect, you made a coding error and if you can exclude all of this their assumptions may have been wrong, which lead to additional questions

orchid forge Jul 30, 2024, 8:09 AM

#

past meteor I've seen domain experts make assumptions and hypotheses that were incorrect but...

I read it, I get it, but I'm sorry I'm trying to find my way and idc to find a solution from an expert. I wanna make mistakes and learn from it than simply getting help.
Thank you

past meteor Jul 30, 2024, 8:10 AM

#

Wait

past meteor Jul 30, 2024, 8:10 AM

#

orchid forge I read it, I get it, but I'm sorry I'm trying to find my way and idc to find a s...

It's not about getting help, do you know what a domain expert is? I'm not asking this to sound demeaning, I'm just trying to make sure we're on the same wavelength

orchid forge Jul 30, 2024, 8:11 AM

#

Oh okay

#

I have a bad habit to not trust myself when it comes to having a solution in my head and then I would just simply ask people if I'm correct or not which has kinda made me feel like I don't understand data analysis, but idk today morning I woke up feeling something else like I was just studying and suddenly things start to make sense because I was legit studying. I tried to have lil confidence in my way of understanding a data and it kinda worked for the first time and I just wanna be this person everyday now.

#

Somebody who trusts herself

past meteor Jul 30, 2024, 8:15 AM

#

I have a few tips about this later but before we go there, can you still try and explain with your own words what you think I meant with domain expert?

orchid forge Jul 30, 2024, 8:16 AM

#

Someone who is dope in his/her work field (in their industry work)

#

I guess

#

Why?

past meteor Jul 30, 2024, 8:16 AM

#

Okay we're on the same wavelength then

orchid forge Jul 30, 2024, 8:16 AM

#

Oh okay idk the professional words but I do understand the professional words okay

#

Zester

past meteor Jul 30, 2024, 8:17 AM

#

So, I've done data projects (only talking about work ones) in additive manufacturing, health, sociology, finance, ... I'm absolutely not an expert in any of those. The only way I could get those done is by asking experts where to look when I get my dataset and to validate my findings

#

It's not about not trusting yourself

#

Often times data professionals are an expert in working with data and they throw you in the deep end in places you have no clue about

orchid forge Jul 30, 2024, 8:19 AM

#

Ik but I wanna have my own journey with data.

past meteor Jul 30, 2024, 8:19 AM

#

But you want to do this for a living?

orchid forge Jul 30, 2024, 8:19 AM

#

Yeah for a while not forever

past meteor Jul 30, 2024, 8:20 AM

#

Honestly, the more involved datasets I've worked with were not ones where you could "have your own journey with data". Simply because it would be like having all of your columns be named A, B, C, D, ...

orchid forge Jul 30, 2024, 8:20 AM

#

😂

past meteor Jul 30, 2024, 8:21 AM

#

Then you need someone at your side. Because the subject matter is just the way it is

#

9 times out of 10 I'd not know where to look and that's absolutely normal. All the things I'd find were obvious and that makes sense.

orchid forge Jul 30, 2024, 8:22 AM

#

Ofc but I love to play a role of some "smart" person who is finding things which already exists but i just love that "wow i just discover something new" feeling

past meteor Jul 30, 2024, 8:22 AM

#

For my current project I do find stuff that matters but when we meet the doctor we work with he always says "but could you look at X, Y and Z in conjunction with ..."

orchid forge Jul 30, 2024, 8:22 AM

#

past meteor + 9 times out of 10 I'd not know where to look and that's absolutely normal. All...

Well for those times I have this server, people are beautiful here

past meteor Jul 30, 2024, 8:23 AM

#

And that's where the gold is

#

Well, we're not domain experts in this server

#

I'm talking specifically about getting assistance from someone working in the field

#

And how that shouldn't make you feel like you're not doing a good job

orchid forge Jul 30, 2024, 8:24 AM

#

I'm just a girl who wants to be happy thinking she's smart even if that's like basic for you all expert people

orchid forge Jul 30, 2024, 8:24 AM

#

past meteor I'm talking specifically about getting assistance from someone working in the fi...

I wish I had someone but I don't

past meteor Jul 30, 2024, 8:24 AM

#

You're misunderstanding what I mean

orchid forge Jul 30, 2024, 8:24 AM

#

I just have this server and Google

past meteor Jul 30, 2024, 8:24 AM

#

My point is simple

#

No matter how smart you are, if you get an average dataset you'd find in real life you'd struggle because you're missing a lot of context. This applies to me, you and anyone else

orchid forge Jul 30, 2024, 8:25 AM

#

past meteor And how that shouldn't make you feel like you're not doing a good job

Yeah I know, i don't have any professional around actually IRL

orchid forge Jul 30, 2024, 8:25 AM

#

past meteor No matter how smart you are, if you get an average dataset you'd find in real li...

Oh

past meteor Jul 30, 2024, 8:25 AM

#

Then you go for different sources, news articles, books, papers, blogs, ...

orchid forge Jul 30, 2024, 8:26 AM

#

past meteor Then you go for different sources, news articles, books, papers, blogs, ...

Yeah I do that, I do that a lot

past meteor Jul 30, 2024, 8:26 AM

#

I always try and immerse myself (even if there's people I can ask questions to)

orchid forge Jul 30, 2024, 8:26 AM

#

Oh that's nice

iron basalt Jul 30, 2024, 8:27 AM

#

past meteor No matter how smart you are, if you get an average dataset you'd find in real li...

Just be an expert in every field.

past meteor Jul 30, 2024, 8:27 AM

#

So yeah, to circle back. I'd say for an EDA the critical thing is to have background knowledge of the problem you're trying to solve. When you do this for a living there will always be someone you can ask

orchid forge Jul 30, 2024, 8:27 AM

#

I love this server

past meteor Jul 30, 2024, 8:27 AM

#

For now all you can do is probably just read info about the topic online etc

orchid forge Jul 30, 2024, 8:28 AM

#

past meteor So yeah, to circle back. I'd say for an EDA the critical thing is to have backgr...

Yeah ik that

orchid forge Jul 30, 2024, 8:28 AM

#

past meteor For now all you can do is probably just read info about the topic online etc

And try to make sense out of that simple data

past meteor Jul 30, 2024, 8:29 AM

#

If you want to practice what worked for me is doing Kaggle competitions. Specifically those called "tabular playground". They're ML competitions but people always do an EDA, you can stop there if you want. The gist is, you make your own notebook, your own solution and then you read others

#

I'd always make it 100 % before looking at others

#

It's a mix of learning technical stuff "wow, is that how to make that kind of plot with Matplotlib?" and data analysis/ML intuitions "Oh that's how they looks at stuff, interesting conclusions they drew from ..."

iron basalt Jul 30, 2024, 8:30 AM

#

past meteor Honestly, the more involved datasets I've worked with were not ones where you co...

Reading this Wikipedia article simulates the experience of seeing all those new strange column names for a specific field of work: https://en.wikipedia.org/wiki/Glossary_of_baseball_terms

Glossary of baseball terms

This is an alphabetical list of selected unofficial and specialized terms, phrases, and other jargon used in baseball, along with their definitions, including illustrative examples for many entries.

serene grail Jul 30, 2024, 8:31 AM

#

past meteor If you want to practice what worked for me is doing Kaggle competitions. Specifi...

chocojNoted

iron basalt Jul 30, 2024, 8:31 AM

#

What you don't know what a "cement mixer" is?

orchid forge Jul 30, 2024, 8:31 AM

#

past meteor If you want to practice what worked for me is doing Kaggle competitions. Specifi...

Omg thank you so much, you guys are all helping Angels
Thank you I'll look through that

past meteor Jul 30, 2024, 8:32 AM

#

iron basalt Reading this Wikipedia article simulates the experience of seeing all those new ...

I'm stealing this, this is such a nice way to describe it

iron basalt Jul 30, 2024, 8:32 AM

#

past meteor I'm stealing this, this is such a nice way to describe it

I like to call all jargon in every field its "baseball terminology," or the "baseball barrier of entry."

#

Most extreme in things like business and law.

#

(And math, what you don't know the "hairy ball theorem?")

past meteor Jul 30, 2024, 8:34 AM

#

I like this one because even the explanation has references to things you wouldn't know. Torii Hunter, grand slam, walk-off homer, ...

#

#

(especially for us Europeans lol)

iron basalt Jul 30, 2024, 8:34 AM

#

past meteor

Yeah, like trying to understand a legal document in another language.

past meteor Jul 30, 2024, 8:35 AM

#

When I did the job in (additive) manufacturing I spent ym first week on the shop floor with the technicians and reading ISO standards in my spare time

iron basalt Jul 30, 2024, 8:36 AM

#

past meteor When I did the job in (additive) manufacturing I spent ym first week on the shop...

Yeah, reading standards is a great way to get somewhere in stuff like that.

past meteor Jul 30, 2024, 8:36 AM

#

Also like just seeing the production steps

#

if I saw "wire EDM, 2 seconds"

#

You have to see the production process to gauge the plausibility of that

#

I suppose you could also make a histogram/boxplot of all lengths and come to the same conclusion

#

This was some part time gig I did as a student. By far the worst data job I've ever done.

iron basalt Jul 30, 2024, 8:38 AM

#

past meteor This was some part time gig I did as a student. By far the worst data job I've e...

I have noticed the pattern of part time student gigs for data science often making no sense.

past meteor Jul 30, 2024, 8:39 AM

#

I had others that were nice though

#

This one just had a terrible company culture. To do anything I needed authorization from Denver (-7h) and India (+7h) needed to execute iti

#

The "database" was Excel files on a network drive with a SQL view on top, queries could kill the entire plant

iron basalt Jul 30, 2024, 8:40 AM

#

past meteor This one just had a terrible company culture. To do anything I needed authorizat...

Did you have to fax them?

past meteor Jul 30, 2024, 8:41 AM

#

iron basalt Did you have to fax them?

Luckily not, but I worked 2 days a week which meant if I needed to do the denver India cycle the week was over

#

Denver tells me I have a go end of day on the first day I ask or if they're busy on the next. By the time I can ask India it's their midnight

iron basalt Jul 30, 2024, 8:44 AM

#

past meteor The "database" was Excel files on a network drive with a SQL view on top, querie...

This is way too common, it's a strange property of accessibility, and being able to linearly add to something very easily, but it does not scale.

past meteor Jul 30, 2024, 8:45 AM

#

Ah and the other thing that I didn't like was people couldn't agree on basic terminology. For instance, what "how much % of parts were without defects this month" meant

#

For one group it meant parts manufactured this month / parts returned this month while for the other group (... me) it's parts manufactured in month A / parts returned of month A (irrespective of when they were returned)

#

So instead of cool stuff you can imagine I spent a lot of time in meetings getting people to agree (and write down) definitions

iron basalt Jul 30, 2024, 8:51 AM

#

past meteor Ah and the other thing that I didn't like was people couldn't agree on basic ter...

(It's funny because this happens to be part of the 1944 CIA organization sabotage guidelines, arguing on basic terminology (it's a really good way to get nothing done))

orchid forge Jul 30, 2024, 8:52 AM

#

Hmm

iron basalt Jul 30, 2024, 8:53 AM

#

orchid forge Hmm

(Not implying that the CIA is involved, just that it's a pretty bad thing for an organization to do if even the CIA recommends it as a method to disrupt)

orchid forge Jul 30, 2024, 8:54 AM

#

I don't understand what are you trying to say

serene grail Jul 30, 2024, 8:54 AM

#

I actually often argue about basic terminology...
Self-reflection time...

orchid forge Jul 30, 2024, 8:54 AM

#

But I wanna be a CIA man

#

Like it's so cool

#

Haha @small wedge

#

Funny right

small wedge Jul 30, 2024, 8:57 AM

#

bro wants to be a fed whyme

orchid forge Jul 30, 2024, 8:58 AM

#

small wedge bro *wants* to be a fed <:whyme:857641310860476447>

I wish I had permission to say "shut up" in this server

small wedge Jul 30, 2024, 8:58 AM

#

you do my friend and so much more

#

but i digress I don't wanna fill ds/ai channel with nonsense

orchid forge Jul 30, 2024, 8:59 AM

#

small wedge but i digress I don't wanna fill ds/ai channel with nonsense

Yeah

lapis sequoia Jul 30, 2024, 11:05 AM

#

PyTorch running on amd gpu (more https://discuss.pytorch.org/t/how-to-run-torch-with-amd-gpu):

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.1

JAX does not support it.
TF has some instructions here (https://rocm.docs.amd.com/projects/install-on-linux/en/latest/how-to/3rd-party/tensorflow-install.html).

#

also has anyone used MLX ? https://github.com/ml-explore/mlx
ig it's not ready for prod, but being added to Keras (well, queued.)

GitHub

GitHub - ml-explore/mlx: MLX: An array framework for Apple silicon

MLX: An array framework for Apple silicon. Contribute to ml-explore/mlx development by creating an account on GitHub.

#

noticed that lscpu in linux will tell you whether your cpu has avx,avx2 etc support (if i understand correctly)

buoyant vine Jul 30, 2024, 11:22 AM

#

lapis sequoia noticed that `lscpu` in linux will tell you whether your cpu has avx,avx2 etc su...

easiest is to do cat /proc/cpuinfo

#

And under flags it will tell you, the supported CPU flags

#

On most x86 hardware, you will have AVX2 and the generations that came before it, i.e. SSE4, AVX, SSE3, etc...

lapis sequoia Jul 30, 2024, 11:23 AM

#

thanks, yes

buoyant vine Jul 30, 2024, 11:24 AM

#

Only very new consumer hardware like AMD's Zen4 CPUs have AVX512 features.
Some Xeons Golds had AVX512 several years ago, but some problems they can have is it causes the chip to overhead and thermal throttle

lapis sequoia Jul 30, 2024, 11:24 AM

#

interesting, i've got those on intel, not a new laptop

buoyant vine Jul 30, 2024, 11:24 AM

#

AVX512?

lapis sequoia Jul 30, 2024, 11:25 AM

#

i was noticing that colab has xeon processors

buoyant vine Jul 30, 2024, 11:26 AM

#

Most ARM server chips will ship with NEON, notably Ampere, atlthough they don't mark it in their CPU info on VMs.
Some newer gen ARM chips like AWS Gravaton have SVE1 & 2 as well which provide more powerful SIMD operations on the cores

lapis sequoia Jul 30, 2024, 11:27 AM

#

avx51f avx512dq avx512ifma avx512cd sha_ni avx512bw avx512vl avx512vbmi umip avx512_vbmi2 avx512_vnni avx512_bitalg avx512_vpopcntdq rdpid movdiri movdir64b fsrm avx512_vp2in...

buoyant vine Jul 30, 2024, 11:28 AM

#

Fair enough, what chip is it?

lapis sequoia Jul 30, 2024, 11:28 AM

#

i7

#

bought it second hand, 2 years ago

buoyant vine Jul 30, 2024, 11:28 AM

#

I meant what generation

#

I'm guessing either Icelake or Skylake

lapis sequoia Jul 30, 2024, 11:29 AM

#

11th

#

idk the codenames

buoyant vine Jul 30, 2024, 11:29 AM

#

Intel just with the worst possible naming schemes on their chips 😔 And then AMD is now doing the same

lapis sequoia Jul 30, 2024, 11:30 AM

#

just use the number

#

:-)

buoyant vine Jul 30, 2024, 11:30 AM

#

Right it's Rocket lake, so post-icelake

lapis sequoia Jul 30, 2024, 11:31 AM

#

post ice age

#

have u tried torch/rocm in amd gpus?

buoyant vine Jul 30, 2024, 11:31 AM

#

not via torch

lapis sequoia Jul 30, 2024, 11:31 AM

#

via tf?

buoyant vine Jul 30, 2024, 11:32 AM

#

no just onnxruntime

#

we dont use TF, which is think is a general pattern in the industry

lapis sequoia Jul 30, 2024, 11:32 AM

#

i see, for inference or training as well?

buoyant vine Jul 30, 2024, 11:32 AM

#

TF is just not a very fun framework to try use in comerical aspects and then deploying

#

Mostly inference

lapis sequoia Jul 30, 2024, 11:32 AM

#

interesting, i've used tf for training, onnxruntime for web deployments

buoyant vine Jul 30, 2024, 11:33 AM

#

Can do training, but normally it is easier to use our PT framework so we just use nvidia

lapis sequoia Jul 30, 2024, 11:33 AM

#

i looked at it today and ratio of stars to issue was high

#

sorry, low

buoyant vine Jul 30, 2024, 11:34 AM

#

What on earth is ratio of starts to issues supposed to suggest lol

lapis sequoia Jul 30, 2024, 11:35 AM

#

well, i'd expect that there aren't that many users, but there are many issues

#

if it works 4u, that's fine though...

buoyant vine Jul 30, 2024, 11:36 AM

#

🤨

lapis sequoia Jul 30, 2024, 11:36 AM

#

could also mean programmers aren't very concerned about fixing issues. it's nor far from tf's n of issues

buoyant vine Jul 30, 2024, 11:37 AM

#

that is... 😅 Definitely not how you want to judge that lol

lapis sequoia Jul 30, 2024, 11:37 AM

#

to me the response to user queries is quite a good proxy for success of a library

wooden sail Jul 30, 2024, 11:39 AM

#

tf has both way more stars and fewer issues than numpy, and pytorch has way more issues and fewer stars than tf

lapis sequoia Jul 30, 2024, 11:41 AM

#

imho it's a good library though

buoyant vine Jul 30, 2024, 11:41 AM

#

They are both massive libraries

#

Commercially I would generally lean towards pytorch over TF though

#

Theres quite a few beginner tutorials for TF

#

but generally in our experience PyTorch's frameworks are just more intuitive and normally faster on things like train time and memory

lapis sequoia Jul 30, 2024, 11:42 AM

#

on Keras metrics Pytorch is the slowest

#

but could be a limitation of keras, idk

buoyant vine Jul 30, 2024, 11:43 AM

#

Keras metrics... With pytorch ?

lapis sequoia Jul 30, 2024, 11:44 AM

#

i meant as a quick glance of a repository, low n of stars but high of issues does not seem great, was a minor observation..

#

Yes, why?

buoyant vine Jul 30, 2024, 11:44 AM

#

why on earth would you use that xD

wooden sail Jul 30, 2024, 11:44 AM

#

keras did split back out of tf

lapis sequoia Jul 30, 2024, 11:44 AM

#

I'll let you guess..

buoyant vine Jul 30, 2024, 11:44 AM

#

Keras is effectively built for TF

lapis sequoia Jul 30, 2024, 11:44 AM

#

no

buoyant vine Jul 30, 2024, 11:44 AM

#

It is

wooden sail Jul 30, 2024, 11:44 AM

#

keras started out and is now again separate from tf

lapis sequoia Jul 30, 2024, 11:44 AM

#

no, just read about it

buoyant vine Jul 30, 2024, 11:44 AM

#

In reality, it is

lapis sequoia Jul 30, 2024, 11:44 AM

#

Keras 3 is multibackend, and runs well with Jax and Torch as well

buoyant vine Jul 30, 2024, 11:45 AM

#

Like, I hate to break it to you, but using Keras and PyTorch is just a weird choice

lapis sequoia Jul 30, 2024, 11:45 AM

#

In fact, is in the road to support MLX

#

it's not because you have the flexibility of testing the speed of many models

#

in != frameworks

wooden sail Jul 30, 2024, 11:45 AM

#

(also for completeness, jax and tf use the same backend)

buoyant vine Jul 30, 2024, 11:46 AM

#

If you're using torch, it is infinitely easier to use something like PT lightning & friends than Keras

lapis sequoia Jul 30, 2024, 11:46 AM

#

wooden sail (also for completeness, jax and tf use the same backend)

for gpu calculations? their support matrix is very different

lapis sequoia Jul 30, 2024, 11:46 AM

#

buoyant vine If you're using torch, it is infinitely easier to use something like PT lightnin...

meh

wooden sail Jul 30, 2024, 11:47 AM

#

lapis sequoia for gpu calculations? their support matrix is very different

they both use XLA as default backend

buoyant vine Jul 30, 2024, 11:47 AM

#

yeah

#

I think it is a bit odd keras chose to abstract themselves out of just TF

#

ultimiately it is probably going to still be solely used with TF 😅 Because that is what all the tutorials will use

wooden sail Jul 30, 2024, 11:50 AM

#

probably for a while, yeah. it did start out separately from tf though so maybe they can pull it off ok

lapis sequoia Jul 30, 2024, 11:50 AM

#

it's actually nice conceptually

wooden sail Jul 30, 2024, 11:50 AM

#

i don't remember when keras was acquired

lapis sequoia Jul 30, 2024, 11:50 AM

#

it's an abstract symbolic layer

#

that specifies computations

wooden sail Jul 30, 2024, 11:50 AM

#

looks like google acquired keras in 2017

lapis sequoia Jul 30, 2024, 11:51 AM

#

keras has tutorials, they run on any backend

buoyant vine Jul 30, 2024, 11:51 AM

#

wooden sail probably for a while, yeah. it did start out separately from tf though so maybe ...

Maybe, but they are generally, if looking at PT, competing with PyTorch + PyTorch Lightning which is already generally very well supported

#

and does some things like Multi-GPU and multi-machine training without a lot of the footguns

wooden sail Jul 30, 2024, 11:52 AM

#

i have to admit i've never heard of pytorch lightning before

buoyant vine Jul 30, 2024, 11:52 AM

#

I'd describe it as the thing you end up finding when you start training large scale models or multi-gpu models

#

because it manages all the checkpoints and device mounting for you

#

They also did https://lightning.ai/torchmetrics which is seperate to lightning but a great metric lib

Lightning AI | Turn ideas into AI, Lightning fast

The all-in-one platform for AI development. Code together. Prototype. Train. Scale. Serve. From your browser - with zero setup. From the creators of PyTorch Lightning.

wooden sail Jul 30, 2024, 11:53 AM

#

same here, i've yet to work with more than one a100

#

usually in the team "if it doesn't fit there, go back to your notebook and do better math"

lapis sequoia Jul 30, 2024, 11:54 AM

#

you can test that in tf

#

create multiple logical gpus from 1 gpu

buoyant vine Jul 30, 2024, 11:56 AM

#

I generally like it for that reason

#

but more so in the context of when working with teams

#

it just helps prevent people writing some cursed loops or unreadable blocks

lapis sequoia Jul 30, 2024, 11:57 AM

#

wooden sail they both use XLA as default backend

i don't know about xla apart from the name tbh

buoyant vine Jul 30, 2024, 11:57 AM

#

If you're in a team where everyone is good at Python and best practices, then it matters less imo, other than the multi-machine stuff

wooden sail Jul 30, 2024, 11:58 AM

#

lapis sequoia i don't know about xla apart from the name tbh

https://en.wikipedia.org/wiki/Accelerated_Linear_Algebra

lapis sequoia Jul 30, 2024, 11:58 AM

#

i admit i think pytorch will win to some extent

#

i just like fchollet general thought process i think

buoyant vine Jul 30, 2024, 11:59 AM

#

From the work I have done, I think it has already largely won the comerical space

wooden sail Jul 30, 2024, 11:59 AM

#

they work very differently, it actually makes sense to try out different backends for some problems

lapis sequoia Jul 30, 2024, 11:59 AM

#

he is so smart lol

wooden sail Jul 30, 2024, 11:59 AM

#

the way jax/xla and pytorch build the computational graph is very different, and this affects the speed and memory usage of some operations

buoyant vine Jul 30, 2024, 11:59 AM

#

I think a lot of people start with TF and Keras doing tutorials and learning

lapis sequoia Jul 30, 2024, 11:59 AM

#

maybe, i did the opposite, started with pytorch

buoyant vine Jul 30, 2024, 12:00 PM

#

But seems to be the defactor for comercial projects where we go "okay we need a classifier or something" it is pytorch time

#

I am guessing because PyTorch seems to be the choice for academic related things

wooden sail Jul 30, 2024, 12:01 PM

#

that is the case, though i do see (and push for) more jax lately

buoyant vine Jul 30, 2024, 12:01 PM

#

Flash attention comes to mind with the recent LLM stuff

lapis sequoia Jul 30, 2024, 12:01 PM

#

are you guys trying to help the world with your models

buoyant vine Jul 30, 2024, 12:01 PM

#

no

wooden sail Jul 30, 2024, 12:02 PM

#

no

buoyant vine Jul 30, 2024, 12:02 PM

#

we are trying to make money

#

and in my case, classify half the internet

wooden sail Jul 30, 2024, 12:02 PM

#

what i work on is either to publish papers or to make money, or both

jaunty helm Jul 30, 2024, 12:02 PM

#

(guess I'll ask here as well)
sry to interrupt, but
If you were getting people, who are completely new to programming, into python (mainly for statistical analysis), would you recommend installing from python.org, or use something like anaconda, or even just colab?
personally I've not used anaconda much at all so idrk what's good/bad about it

lapis sequoia Jul 30, 2024, 12:02 PM

#

do you recycle

buoyant vine Jul 30, 2024, 12:03 PM

#

wooden sail that is the case, though i do see (and push for) more jax lately

I can to some extent understand this, because it lets you do a more high level implementation

#

Personally I am not a huge jax fan, but I can understand why it exists

wooden sail Jul 30, 2024, 12:04 PM

#

the biggest seller for me is really just that it looks like numpy, and most people already know that as background

buoyant vine Jul 30, 2024, 12:04 PM

#

jaunty helm (guess I'll ask here as well) sry to interrupt, but If you were getting people, ...

Depends, I think colab is a good initial place to start, but then setting up conda (not anaconda) can make the next steps easier

wooden sail Jul 30, 2024, 12:04 PM

#

well, i say that, but the next thing right up there is the ease of writing the code to look just like the math on paper does

lapis sequoia Jul 30, 2024, 12:05 PM

#

oh i think i've seen you in the math forums now

wooden sail Jul 30, 2024, 12:05 PM

#

i would say conda is still my preferred way of using python, it brings most of the stuff i need bundled together. also requires no sudo perms so it's easier to use it on compute clusters with restricted perms

lapis sequoia Jul 30, 2024, 12:05 PM

#

i think you explained a bit about the condition number to me

#

could be it wasnt you

wooden sail Jul 30, 2024, 12:06 PM

#

that sounds like something i would mention, but i can't say i remember

lapis sequoia Jul 30, 2024, 12:06 PM

#

i had no idea about it, it was quite useful

#

(it was years ago XD)

wooden sail Jul 30, 2024, 12:08 PM

#

then there's no way i remember

buoyant vine Jul 30, 2024, 12:08 PM

#

Not python related, but one of the ML frameworks I am watching is Burn https://github.com/tracel-ai/burn
The Wgpu backend is just wearyaf So fucking nice for when you dont want to deal with nvidia drivers

GitHub

GitHub - tracel-ai/burn: Burn is a new comprehensive dynamic Deep L...

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals. - tracel-ai/burn

lapis sequoia Jul 30, 2024, 12:09 PM

#

ig you use more Julia than Python for ML? Or maybe it's not the right question

wooden sail Jul 30, 2024, 12:11 PM

#

i've been meaning to learn julia, but don't know it. i mainly use python. used to use matlab before

serene grail Jul 30, 2024, 12:11 PM

#

I've only heard bad things about matlab

lapis sequoia Jul 30, 2024, 12:11 PM

#

i see. it seems quite a beautiful language, at least from examples

wooden sail Jul 30, 2024, 12:12 PM

#

serene grail I've only heard bad things about matlab

in a very real sense, it's better than numpy for many numerical problems

lapis sequoia Jul 30, 2024, 12:12 PM

#

buoyant vine Not python related, but one of the ML frameworks I am watching is Burn https://g...

good ratio

buoyant vine Jul 30, 2024, 12:12 PM

#

I think Julia is cool, but it suffers a bit from the issue of being very niche.

At work it is always a weigh up of:

Does it need good performance?
- No -> Use Python
- Yes -> Use Rust

It is hard to justify Julia just for the data and ML side of things and getting everyone in the team to learn another language

wooden sail Jul 30, 2024, 12:13 PM

#

it jits everything by default and has faster run times. it's also industry standard in many applications. sure burns a hole through your pocket though

serene grail Jul 30, 2024, 12:13 PM

#

wooden sail in a very real sense, it's better than numpy for many numerical problems

Oh really? Interesting

buoyant vine Jul 30, 2024, 12:13 PM

#

Julia as a language is great, and makes awesome use of LLVMjit

#

but I think it is very niche

#

that being said, wish we used it instead of Python at one of my old jobs

lapis sequoia Jul 30, 2024, 12:14 PM

#

i've no knowledge about that, but i think it's precise numerically (for linear algebra operations)

buoyant vine Jul 30, 2024, 12:14 PM

#

would have been amazing there

#

The hard thing is always getting devs to learn it though

#

unless all you do is numeric computing or ~adjacent stuff

lapis sequoia Jul 30, 2024, 12:15 PM

#

yeah, i guess we are on a tangent on a tangent

wooden sail Jul 30, 2024, 12:15 PM

#

buoyant vine unless all you do is numeric computing or ~adjacent stuff

which ML is

#

but wrapping it as part of a bigger product, not so much

lapis sequoia Jul 30, 2024, 12:16 PM

#

do you have favourite ml researcher?

buoyant vine Jul 30, 2024, 12:16 PM

#

True, but with ML specifically I think it can also be hard to weight up VS PyTorch/TF/Onnx

lapis sequoia Jul 30, 2024, 12:17 PM

#

(can be a list)

buoyant vine Jul 30, 2024, 12:17 PM

#

there are a lot of devs that know python, and a lot that know Python and an ML framework

lapis sequoia Jul 30, 2024, 12:17 PM

#

would be interesting to see node js hop in

buoyant vine Jul 30, 2024, 12:18 PM

#

Also RIP my data engine, 32 cores and 64GB of memory and it still OOMs when aggregating 😔

lapis sequoia Jul 30, 2024, 12:18 PM

#

do you guys open problems to freelancers anywhere?

lapis sequoia Jul 30, 2024, 12:20 PM

#

buoyant vine Also RIP my data engine, 32 cores and 64GB of memory and it still OOMs when aggr...

a gpu w 32 cores you mean?

buoyant vine Jul 30, 2024, 12:20 PM

#

No I mean CPU

lapis sequoia Jul 30, 2024, 12:21 PM

#

you do onnx inference in cpu?

buoyant vine Jul 30, 2024, 12:21 PM

#

This isn't the inference server

#

this is the system that just logs the data being spat out by the inference cluster

#

the inference system itself is on a GPU machine(s)

lapis sequoia Jul 30, 2024, 12:21 PM

#

i see

buoyant vine Jul 30, 2024, 12:22 PM

#

Although ngl, I am kinda excited for the AI CPU chips

#

they have enough FLOPS processing power to make it realistically pretty cost affective to run on them instead of GPUs

agile cobalt Jul 30, 2024, 12:22 PM

#

buoyant vine Also RIP my data engine, 32 cores and 64GB of memory and it still OOMs when aggr...

which library are you using to aggregate?

buoyant vine Jul 30, 2024, 12:23 PM

#

I am biased because I used to work there, but https://quickwit.io/

Search more with less | Quickwit

Sub-second search & analytics engine on cloud storage

serene grail Jul 30, 2024, 12:23 PM

#

Oooh, working on a search engine must have been interesting

buoyant vine Jul 30, 2024, 12:25 PM

#

Was good, I got a bit burnt out though

#

Learnt a relatively important life lesson that on those sorts of projects and systems, it really makes a difference to actively use the tool so you understand why things are done in certain ways and can see it from the end user's POV

#

otherwise it can seem a bit like a lot of stuff not quite lining up

lapis sequoia Jul 30, 2024, 12:29 PM

#

wooden sail <https://en.wikipedia.org/wiki/Accelerated_Linear_Algebra>

interesting page, thx

#

basically compiles computation graph to machine code and has fusion operations which i hear are useful...

#

but idk what it means

wooden sail Jul 30, 2024, 12:32 PM

#

the best part about jax, aside from looking just like numpy (and therefore looking very similar to matlab and julia as well), is that it exposes XLA's features directly

#

you can choose to grad, jit, and map however you like

lapis sequoia Jul 30, 2024, 12:36 PM

#

As a part of the OpenXLA project, XLA is built collaboratively by industry-leading ML hardware and software companies, including Alibaba, Amazon Web Services, AMD, Apple, Arm, Google, Intel, Meta, and NVIDIA.

#

's got some backers

past meteor Jul 30, 2024, 12:40 PM

#

I started out with TF, it was what we used in uni for neural nets (that and matlab)

lapis sequoia Jul 30, 2024, 12:42 PM

#

i'm unsure that tensorflow uses XLA compilation by default, where is that information?

buoyant vine Jul 30, 2024, 12:44 PM

#

lapis sequoia i'm unsure that tensorflow uses XLA compilation by default, where is that inform...

I believe it is the default when your enable the JIT going of the XLA docs

#

Also 👏 AMD please give me a desktop chip of this

#

Otherwise I am going to be building a really cursed laptop server

lapis sequoia Jul 30, 2024, 12:45 PM

#

yeah, so it's not the default since that's false by default

wooden sail Jul 30, 2024, 12:45 PM

#

you probably have more tops on a usual server processor

#

ah, tf doesn't have jit enabled by default?

#

you can always try to cause a weird error and check the traceback

buoyant vine Jul 30, 2024, 12:46 PM

#

wooden sail you probably have more tops on a usual server processor

I don't think so

wooden sail Jul 30, 2024, 12:46 PM

#

jax spits out hundres of lines of xla calls whenever you look at it wrong

buoyant vine Jul 30, 2024, 12:46 PM

#

If their measurements are in anyway close to what they say they are

#

that chip is several times faster than what you could do with a 64 core Epyc

wooden sail Jul 30, 2024, 12:47 PM

#

i somehow strongly doubt that from a 28w processor

lapis sequoia Jul 30, 2024, 12:47 PM

#

im checking

buoyant vine Jul 30, 2024, 12:48 PM

#

wooden sail i somehow strongly doubt that from a 28w processor

Honestly same, but then again, maybe not since having dedicated hardware makes a significant difference

#

In reality is a 54W chip, which I suspect is what they are actually doing to get those numbers

#

they are doing Max boost frequency to get those tops

wooden sail Jul 30, 2024, 12:49 PM

#

the cache sizes would also have to be compared because that neuters the real life performance

#

time to fish up some benchmarks

buoyant vine Jul 30, 2024, 12:50 PM

#

For reference a 16 AMD ryzen on zen 4 will do about 1 TOPS from my testing

#

although I think Openblas tends to struggle past 800 GFLOPs because of memory bandwidth, also unsure if OMP is pinning the cores

lapis sequoia Jul 30, 2024, 12:50 PM

#

tf.config.optimizer.get_jit()
returns ''

wooden sail Jul 30, 2024, 12:54 PM

#

some old numbers from intel xeons from 2022 say 419 tflops with float32, and upwards of 1500 with int8

buoyant vine Jul 30, 2024, 12:55 PM

#

That sounds far too high

wooden sail Jul 30, 2024, 12:55 PM

#

the h100 lists 26 to 3000 tflops depending on the data type

buoyant vine Jul 30, 2024, 12:55 PM

#

a A10g GPU is rated at 250 TOPS

wooden sail Jul 30, 2024, 12:56 PM

#

for which datatype?

buoyant vine Jul 30, 2024, 12:56 PM

#

int8

wooden sail Jul 30, 2024, 12:57 PM

#

i need to look up the a10g cuz i'Ve never worked with it

buoyant vine Jul 30, 2024, 12:57 PM

#

It is a nice GPU tbh, it is what the AWS G5 instances use

#

I am using it here just because it was the first one I know that has the specs clearly listed on nvidia's page

#

Sometimes getting spec sheets with numbers is a pain 😔

#

I suspect AMD's numbers there are int8 operations

#

complete guess, but any sort of floating point op seems far too high

wooden sail Jul 30, 2024, 12:58 PM

#

yeah i would also think so

#

let's see here

#

if techpowerup is to be trusted, an a10g

#

and an a100

buoyant vine Jul 30, 2024, 1:00 PM

#

wooden sail if techpowerup is to be trusted, an a10g

I think those numbers seem relatively OK, but the FP16 seems off, at work there is a noticable perf diff between FP16 ops and FP32 ops on the A10g, so them being the same seems a bit off

wooden sail Jul 30, 2024, 1:00 PM

#

yeah

#

this is the a100 from nvidia's site

buoyant vine Jul 30, 2024, 1:01 PM

#

that probably about lines up with what i'd expect

wooden sail Jul 30, 2024, 1:02 PM

#

and the h100 with 3000 tflops on int8 seems reasonable then

buoyant vine Jul 30, 2024, 1:02 PM

#

Maybe that 50 TOPS number isn't the most unrealistic thing in the world then

#

1/5th the compute of the A10g, but 1/3rd the TDP

wooden sail Jul 30, 2024, 1:03 PM

#

indeed, if they disclose the data type 😛 otherwise this will become like another "nanometer" thing

buoyant vine Jul 30, 2024, 1:03 PM

#

Yeah, well once they release I'm sure we'll see some more numbers

#

otherwise 😅 I am doing some shopping and we'll do a comparison

wooden sail Jul 30, 2024, 1:04 PM

#

TOPS with 1bit ints

buoyant vine Jul 30, 2024, 1:05 PM

#

Man that would really suck to find out xD

#

Although I am sure it could be utalised for some interesting bits of compute

#

like data filtering

#

We can actually probably work out

#

since I suspect, AMD's TOPS will be MS' TOPS

#

because of the whole AI PC shit

wooden sail Jul 30, 2024, 1:07 PM

#

mhm

buoyant vine Jul 30, 2024, 1:08 PM

#

Least scuffed MS webpage

lapis sequoia Jul 30, 2024, 1:09 PM

#

https://github.com/google/flax

GitHub

GitHub - google/flax: Flax is a neural network library for JAX that...

Flax is a neural network library for JAX that is designed for flexibility. - google/flax

buoyant vine Jul 30, 2024, 1:09 PM

#

PepeHands Litterally everyone is just saying TOPS but not the datatype

#

god damn it MS!

wooden sail Jul 30, 2024, 1:10 PM

#

what i did find in an arstechnica link is that current amd 7000 and 8000 chips offer "12 to 16 TOPS"

buoyant vine Jul 30, 2024, 1:10 PM

#

that is what the AMD website lists

#

I think that is using the internal GPU

#

because they dont have a NPU

#

but do all have the Radeon graphics

#

so might be that what they are doing via their Ryzen AI™️ drivers, is effectively proxying the operations as graphics ops

wooden sail Jul 30, 2024, 1:12 PM

#

mhm

#

this whole thing is a shitshow

buoyant vine Jul 30, 2024, 1:12 PM

#

Idk why they are so focused on them being in laptops tho

#

Like... Guys do you not see if you're getting 50TOPS of int8 performance how big that potential is for the server and desktop space?

#

just for inference and numeric computing

#

Ignore the whole MS AI PC stuff that no one actually gives a shit about

wooden sail Jul 30, 2024, 1:13 PM

#

they're already selling it that way on the server lines for 2 or 3 years though

buoyant vine Jul 30, 2024, 1:13 PM

#

Qualcom are making ARM based laptops with excellent battery lives, but don't put that as their biggest feature

#

Everyone trying to push for AI but 99% of applications dont support it

buoyant vine Jul 30, 2024, 1:14 PM

#

wooden sail they're already selling it that way on the server lines for 2 or 3 years though

I don't think so, at least not in the same way

wooden sail Jul 30, 2024, 1:14 PM

#

pretty sure yes, the xeon i mentioned from 2022 explicitly says neural something or another

buoyant vine Jul 30, 2024, 1:14 PM

#

Like the best we have gotten really is better SIMD support and larger caches which mean more per-core performance

wooden sail Jul 30, 2024, 1:14 PM

#

#

efficiency this, performance that, AI everywhere

buoyant vine Jul 30, 2024, 1:15 PM

#

They dont actually have any dedicated hardware IIRC

lapis sequoia Jul 30, 2024, 1:15 PM

#

but they do have a page detailing why and how at least.

buoyant vine Jul 30, 2024, 1:15 PM

#

what Intel have done is taken their perf cores and efficiency cores from the desktop market, and put it on Xeons

#

with a bigger cache

#

Going off their 'fact sheet' it seems it is just the E-cores and P-cores

wooden sail Jul 30, 2024, 1:17 PM

#

#

i still have to find out what exactly that accelerator is

#

ah, intel AMX they call it

#

https://www.intel.com/content/www/us/en/products/docs/accelerator-engines/what-is-intel-amx.html

Intel

What Is Intel® Advanced Matrix Extensions (Intel® AMX)? – Intel

Take advantage of Intel® Advanced Matrix Extensions (Intel® AMX) accelerator capabilities to improve the performance of deep learning workloads.

#

dedicated matmul hardware on the P cores

#

for 3 gens already

lapis sequoia Jul 30, 2024, 1:19 PM

#

they've got some new stuff for gpus also

wooden sail Jul 30, 2024, 1:20 PM

#

the trend i've seen in the last 10 years is that the server scene is like "upstream" of desktop and laptop hardware, just like this

#

and the neat features trickle down... with arguable salesmanship like shoving AI down your throat

buoyant vine Jul 30, 2024, 1:21 PM

#

thonk_line Reading their performance sheet tho

#

it just looks weird

#

wait nvm

#

In their graph they did (old)FP32 vs (new) BF16

#

and then in the next did (old)int8 vs (new)int8

lapis sequoia Jul 30, 2024, 1:21 PM

#

for gpus they got some charts https://intel.github.io/intel-extension-for-tensorflow/latest/docs/guide/performance.html

#

and here i think https://www.intel.com/content/www/us/en/developer/topic-technology/artificial-intelligence/performance.html

Intel

Performance Data for 4th Gen Intel® AI Data Center Products

Find performance data and hardware and software configurations for 5th and 4th gen Intel® Xeon® Scalable processors, 3rd gen Intel® Xeon® processors and Intel® Data Center GPU Flex Series processors.

worldly wagon Jul 30, 2024, 1:22 PM

#

Is there any documentation on using arrow keys to move sliders in plotly? pithink I just wanna confirm this can only be done in dash not standard plotly GO

serene grail Jul 30, 2024, 1:23 PM

#

I think a few years in the future "AI on chip" on desktop will actually be quite useful and the chip manufacturers don't want to fall behind the competition on that front

wooden sail Jul 30, 2024, 1:24 PM

#

i think that's still a handful of generations away. the gap between AI tasks you can run on a desktop vs realistic tasks to be run on compute clusters is still comparable to the distance between heaven and hell

#

unless you put server grade hardware in your desktop, which requires you to shell out several tens of thousands of moneyz

buoyant vine Jul 30, 2024, 1:25 PM

#

wooden sail

TBF this looks like it is very recent

wooden sail Jul 30, 2024, 1:25 PM

#

like 3 years i think

#

no, my bad, 2023?

#

i thought it was older

buoyant vine Jul 30, 2024, 1:26 PM

#

All the AMX stuff and Xeon 6th gen stuff is 2024 from what i can see

wooden sail Jul 30, 2024, 1:27 PM

#

amx was introduced in 2020, but only implemented in 2023 with xeon 4 (According to google)

#

but that means they were selling the AI on cpu idea for a while already

buoyant vine Jul 30, 2024, 1:28 PM

#

I wonder if that is because of all the chip and arch issues intel were having previously

#

I remember they were having some serious issues with their new arch in the chip making processes

wooden sail Jul 30, 2024, 1:29 PM

#

like self-destructing chips :x

buoyant vine Jul 30, 2024, 1:29 PM

#

Mostly a desktop thing

#

But one thing is going to be interesting is looking at the price of the 6th gens

#

Going off their press release sheet, they make it seem like the 6th gens are the ones with actually good performance with AMX

#

the rest are just AVX512 VNNI

#

so what is the price of that chip

#

and how much does it weight up against the Epycs?

#

Looks like AMD are rapidly catching up as well https://www.amd.com/en/products/processors/server/epyc/4th-generation-9004-and-8004-series/amd-epyc-9754s.html

#

128 physical cores

#

fucking monster of a CPU

#

So the top line 6th gen Xeon is 128 P- cores and 288 E cores

#

so it is no slouch either

#

Will be interesting to see what the TDP and cost of that chip is

#

Competition is going to be 2x 9754S @ ~$11k a piece

left tartan Jul 30, 2024, 2:15 PM

#

buoyant vine Will be interesting to see what the TDP and cost of that chip is

I bet I could heat my house water on one of those.

coral field Jul 30, 2024, 5:09 PM

#

what are good statistics to measure the strength and direction of association between a continuous variable and multi-class categorical output?

proven inlet Jul 30, 2024, 5:47 PM

#

Can someone help me? I tried to make my own neural network without ai libraries but it does not learn properly. https://pastebin.com/K8HLiZB7

Pastebin

Artificial Intelligence - Pastebin.com

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

#

#

Console:

Epoch 1/10 - Accuracy: 19.78%
Epoch 2/10 - Accuracy: 15.88%

#

it goes down 🙄

proven inlet Jul 30, 2024, 5:49 PM

#

proven inlet

it updates sometimes but 50% of it has no change

still parrot Jul 30, 2024, 6:34 PM

#

huh

proven inlet Jul 30, 2024, 6:37 PM

#

still parrot huh

🥺

still parrot Jul 30, 2024, 6:38 PM

#

idk I just started learning deep learning and all I had is errors Be glad you made it this far.

proven inlet Jul 30, 2024, 6:38 PM

#

still parrot idk I just started learning deep learning and all I had is errors Be glad you ma...

It's not that hard actually

#

the code im using

still parrot Jul 30, 2024, 6:39 PM

#

I know

proven inlet Jul 30, 2024, 6:39 PM

#

but idk where error

proven inlet Jul 30, 2024, 6:39 PM

#

still parrot idk I just started learning deep learning and all I had is errors Be glad you ma...

Keep learning !

still parrot Jul 30, 2024, 6:39 PM

#

proven inlet Keep learning !

yep

still parrot Jul 30, 2024, 6:41 PM

#

proven inlet Keep learning !

where did you learn

proven inlet Jul 30, 2024, 6:41 PM

#

still parrot where did you learn

i already know programmin for years i just had to learn the algorithm behind ai so a few tutorials were enough

proven inlet Jul 30, 2024, 6:42 PM

#

proven inlet i already know programmin for years i just had to learn the algorithm behind ai ...

tutorial about math behind the ai

still parrot Jul 30, 2024, 6:42 PM

#

proven inlet

by the way can't you do something like back feeding the info like telling what the answer should be and what it could be also?

proven inlet Jul 30, 2024, 6:43 PM

#

still parrot by the way can't you do something like back feeding the info like telling what t...

i do train ai using back proganation

still parrot Jul 30, 2024, 6:43 PM

#

ohh

proven inlet Jul 30, 2024, 6:43 PM

#

Which checks the result and compares with true one

#

if its right good 👍 if not change the weights and bias

#

By delta

#

derivative of sigmoid or ReLU

#

depends on what you are using

odd meteor Jul 30, 2024, 6:43 PM

#

coral field what are good statistics to measure the strength and direction of association be...

You can compute the mutual information score. Unlike computing the correlation coefficient, I think one of the drawbacks of mutual information score is in its result's explanability.

For a more detailed analysis, you'll need a statistical test. The kind of statistical test you need depends on the kind of categorical feature you're dealing with.

Do you have have an ordinal or nominal categorical feature?

Since you've already mentioned that the categorical feature is not dichotomous, we can rule out Point Biseral Test.

still parrot Jul 30, 2024, 6:44 PM

#

I think you should just test it more with more data.

proven inlet Jul 30, 2024, 6:44 PM

#

still parrot I think you should just test it more with more data.

i already have over 1k data

still parrot Jul 30, 2024, 6:44 PM

#

ohh my

proven inlet Jul 30, 2024, 6:44 PM

#

and it's just basically number recognition

#

still parrot Jul 30, 2024, 6:44 PM

#

ohh

proven inlet Jul 30, 2024, 6:44 PM

#

all of them has 100 hand written numbers

coral field Jul 30, 2024, 6:44 PM

#

odd meteor You can compute the mutual information score. Unlike computing the correlation c...

ordinal, the categories represent various "levels"

still parrot Jul 30, 2024, 6:45 PM

#

there's a 25 hour long video teaching it

proven inlet Jul 30, 2024, 6:45 PM

#

still parrot there's a 25 hour long video teaching it

Does it use libraries like pytorch & tensorflow or by scratch?

proven inlet Jul 30, 2024, 6:46 PM

#

odd meteor You can compute the mutual information score. Unlike computing the correlation c...

could you help me please? 🙂

still parrot Jul 30, 2024, 6:46 PM

#

pytorch but there's a video by another creator about doing it be scratch.

proven inlet Jul 30, 2024, 6:46 PM

#

still parrot pytorch but there's a video by another creator about doing it be scratch.

can you give link

still parrot Jul 30, 2024, 6:47 PM

#

ok

#

https://www.youtube.com/watch?v=Wo5dMEP_BbI&list=PLQVvvaa0QuDcjD5BAw2DxE6OF2tius3V3 https://www.youtube.com/watch?v=w8yWXqWQYmU https://www.youtube.com/watch?v=cAkMcPfY_Ns

YouTube

sentdex

Neural Networks from Scratch - P.1 Intro and Neuron Code

Building neural networks from scratch in Python introduction.

Neural Networks from Scratch book: https://nnfs.io

Playlist for this series: https://www.youtube.com/playlist?list=PLQVvvaa0QuDcjD5BAw2DxE6OF2tius3V3

Python 3 basics: https://pythonprogramming.net/introduction-learn-python-3-tutorials/
Intermediate Python (w/ OOP): https://pythonpr...

▶ Play video

YouTube

Samson Zhang

Building a neural network FROM SCRATCH (no Tensorflow/Pytorch, just...

Kaggle notebook with all the code: https://www.kaggle.com/wwsalmon/simple-mnist-nn-from-scratch-numpy-no-tf-keras

Blog article with more/clearer math explanation: https://www.samsonzhang.com/2020/11/24/understanding-the-math-behind-neural-networks-by-building-one-from-scratch-no-tf-keras-just-numpy.html

▶ Play video

YouTube

Green Code

I Built a Neural Network from Scratch

Don't click this: https://tinyurl.com/bde5k7d5

💚 Link to Code: https://www.patreon.com/greencode

How I Learned This: https://nnfs.io/ (by the awesome @sentdex )

I'm not an AI expert by any means, I probably have made some mistakes. So I apologise in advance :)

Also, I only used PyTorch to test the forward pass. Apart from that, everything el...

▶ Play video

#

https://www.youtube.com/watch?v=OGxgnH8y2NM&list=PLQVvvaa0QuDfKTOs3Keq_kaG2P55YRn5v

YouTube

sentdex

Practical Machine Learning Tutorial with Python Intro p.1

The objective of this course is to give you a holistic understanding of machine learning, covering theory, application, and inner workings of supervised, unsupervised, and deep learning algorithms.

In this series, we'll be covering linear regression, K Nearest Neighbors, Support Vector Machines (SVM), flat clustering, hierarchical clustering, a...

▶ Play video

proven inlet Jul 30, 2024, 6:52 PM

#

Oh i watched the last 2 videos already but thanks ill watch 2 other 🙂

proven inlet Jul 30, 2024, 6:53 PM

#

still parrot https://www.youtube.com/watch?v=Wo5dMEP_BbI&list=PLQVvvaa0QuDcjD5BAw2DxE6OF2tius...

Samson's looks good tutorial ngl

#

Thanks again 🙂

still parrot Jul 30, 2024, 6:54 PM

#

you can also get a copy of his code

odd meteor Jul 30, 2024, 6:55 PM

#

proven inlet could you help me please? 🙂

In that case you need a non-parametric test.

You can use Kendal Tau or Spearman-Rank correlation instead of Pearson correlation.

Another alternative is, to carry out a Chi-Square test of independence to determine the association between the two variables.

You can even use ANOVA as well to test for differences in means of the continuous variable across the different categories.

still parrot Jul 30, 2024, 6:55 PM

#

proven inlet Samson's looks good tutorial ngl

The code is here https://www.kaggle.com/code/wwsalmon/simple-mnist-nn-from-scratch-numpy-no-tf-keras

Simple MNIST NN from scratch (numpy, no TF/Keras)

Explore and run machine learning code with Kaggle Notebooks | Using data from Digit Recognizer

proven inlet Jul 30, 2024, 6:56 PM

#

odd meteor In that case you need a non-parametric test. You can use Kendal Tau or Spearman...

Too many definitations and terms that i don't know but thanks ill research them

proven inlet Jul 30, 2024, 6:56 PM

#

still parrot The code is here https://www.kaggle.com/code/wwsalmon/simple-mnist-nn-from-scr...

i prefer watching tutorials

odd meteor Jul 30, 2024, 6:56 PM

#

proven inlet Too many definitations and terms that i don't know but thanks ill research them

Oops sorry I quoted wrongly. My bad

proven inlet Jul 30, 2024, 6:57 PM

#

odd meteor Oops sorry I quoted wrongly. My bad

Oh

odd meteor Jul 30, 2024, 6:57 PM

#

coral field ordinal, the categories represent various "levels"

In that case you need a non-parametric test.

You can use Kendal Tau or Spearman-Rank correlation instead of Pearson correlation.

Another alternative is, to carry out a Chi-Square test of independence to determine the association between the two variables.

You can even use ANOVA as well to test for differences in means of the continuous variable across the different categories.

summer bolt Jul 30, 2024, 6:58 PM

#

can anyone help me create a countdown bot that stays in a vc 24/7 and countdowns like this: 3 minutes till countdown
2 minutes til countdown
1 minute til countdown
30 seconds till countsown
10
9
8
7
6
5
4
3
2
1
Go
Please help me with this

odd meteor Jul 30, 2024, 6:58 PM

#

proven inlet could you help me please? 🙂

What do you need help with?

proven inlet Jul 30, 2024, 6:58 PM

#

proven inlet Can someone help me? I tried to make my own neural network without ai libraries ...

This

summer bolt Jul 30, 2024, 7:00 PM

#

odd meteor What do you need help with?

can y help me

still parrot Jul 30, 2024, 7:00 PM

#

summer bolt can anyone help me create a countdown bot that stays in a vc 24/7 and countdowns...

import time

def countdown(t):
while t:
mins, secs = divmod(t, 60)
timer = '{:02d}:{:02d}'.format(mins, secs)
print(timer, "minutes till countdown")
time.sleep(60)
t -= 1

print("1 minute till countdown")
time.sleep(30)
print("30 seconds till countdown")
time.sleep(20)
print("10 seconds till countdown")

for i in range(10, 0, -1):
    print(i)
    time.sleep(1)

print("Go!")

Start countdown for 3 minutes

countdown(3)

summer bolt Jul 30, 2024, 7:01 PM

#

still parrot import time def countdown(t): while t: mins, secs = divmod(t, 60) ...

dms and ty

proven inlet Jul 30, 2024, 7:02 PM

#

summer bolt can anyone help me create a countdown bot that stays in a vc 24/7 and countdowns...

bro how is this releated to artificial intelligence

summer bolt Jul 30, 2024, 7:03 PM

#

wrong channel xd

proven inlet Jul 30, 2024, 7:03 PM

#

İts ok lol

odd meteor Jul 30, 2024, 7:21 PM

#

proven inlet This

@proven inlet I'm currently not free at the moment but I will take a look at it once I'm a bit free.

proven inlet Jul 30, 2024, 7:21 PM

#

odd meteor <@1018096765225938985> I'm currently not free at the moment but I will take a lo...

It's okay thanks 👍

lapis sequoia Jul 30, 2024, 8:48 PM

#

i ran it with 1 hidden layer

#

Loading Dataset...
Loaded.
Training Epoch: 0
Training Epoch 0 finished with %18.366666666666667 accuracy
Training Epoch: 1
Training Epoch 1 finished with %31.266666666666666 accuracy
Training Epoch: 2
Training Epoch 2 finished with %34.766666666666666 accuracy
Training Epoch: 3
Training Epoch 3 finished with %37.4 accuracy
Training Epoch: 4

#

seems fine, what's the issue @proven inlet

proven inlet Jul 30, 2024, 8:50 PM

#

lapis sequoia seems fine, what's the issue <@1018096765225938985>

accuracy goes down with every epoch

lapis sequoia Jul 30, 2024, 8:50 PM

#

i don't remember if the remaining steps of backprop are correct, but if you are using MSE the derivative is correct

#

i'm showing you it does not with 1 hidden layer at least

#

https://colab.research.google.com/drive/16GCl7IZ3ZBwc3Vp3pCSpF8tfhqcDdki_?usp=sharing

Google Colab

proven inlet Jul 30, 2024, 8:50 PM

#

is the problem using more than 1 hidden layer

proven inlet Jul 30, 2024, 8:51 PM

#

lapis sequoia Loading Dataset... Loaded. Training Epoch: 0 Training Epoch 0 finished with %18....

if i print new - old weights

#

most of them are 0

lapis sequoia Jul 30, 2024, 8:51 PM

#

i commented some stuff

#

and changed the delta, and learning rate, i can't test much more now, just run it and check

proven inlet Jul 30, 2024, 8:52 PM

#

proven inlet if i print new - old weights

its not learning 😦

proven inlet Jul 30, 2024, 8:52 PM

#

lapis sequoia https://colab.research.google.com/drive/16GCl7IZ3ZBwc3Vp3pCSpF8tfhqcDdki_?usp=sh...

thats not my code

lapis sequoia Jul 30, 2024, 8:53 PM

#

https://colab.research.google.com/drive/16GCl7IZ3ZBwc3Vp3pCSpF8tfhqcDdki_?usp=sharing
is that one? @proven inlet

Google Colab

#

does seem to increase with extra hidden layers as well.

proven inlet Jul 30, 2024, 8:57 PM

#

How??

#

it does not for me

#

lemme try on colab

lapis sequoia Jul 30, 2024, 8:57 PM

#

i said i modified a few things

#

in any case, im unsure if it'd get very far, since the surface can have many pockets

proven inlet Jul 30, 2024, 8:58 PM

#

Uh you are using ai libraries

lapis sequoia Jul 30, 2024, 8:58 PM

#

i ran a similar model with keras to compare

proven inlet Jul 30, 2024, 8:59 PM

#

i need to get it working without ai libraries

#

thats my goal

lapis sequoia Jul 30, 2024, 8:59 PM

#

the ai library is completely irrelevant, you can remove the cell.

proven inlet Jul 30, 2024, 9:00 PM

#

no its not irrelevant you load x & y data with ai libraries

#

lapis sequoia Jul 30, 2024, 9:00 PM

#

how do you want me to test the code otherwise?

#

that data is the mnist dataset, a dataset of handwritten digits.

proven inlet Jul 30, 2024, 9:01 PM

#

i have it as local file in my project dir

#

but it doesn't work

lapis sequoia Jul 30, 2024, 9:01 PM

#

well, maybe you didn't normalise the data

#

i can't guess the part in your laptop

proven inlet Jul 30, 2024, 9:01 PM

#

it's just like that

lapis sequoia Jul 30, 2024, 9:02 PM

#

(x_train.astype("float32") / 255)

#

i.e normalising

proven inlet Jul 30, 2024, 9:02 PM

#

img_array = np.array(image).flatten() / 255

#

i do this

lapis sequoia Jul 30, 2024, 9:03 PM

#

idk what is the issue with your data, i can't access it, but the network seems not totally wrong

proven inlet Jul 30, 2024, 9:03 PM

#

The data has no issue

#

it's all correct

lapis sequoia Jul 30, 2024, 9:04 PM

#

ok, maybe someone else can help

proven inlet Jul 30, 2024, 9:04 PM

#

okay thanks

#

:)

#

@lapis sequoia This is what happens 🙏

lapis sequoia Jul 30, 2024, 9:11 PM

#

try it, please, this settings:

#

input_size = 28*28
hidden_layers = [10,10]
output_size = 10
print("Loading Dataset...")
images, labels = load_dataset()
labels = onehot(labels, output_size)

print("Loaded.")
nn = Brain(input_size, hidden_layers, output_size)
nn.train(x_test, y_test, 10, .2)

proven inlet Jul 30, 2024, 9:20 PM

#

lapis sequoia ```py input_size = 28*28 hidden_layers = [10,10] output_size = 10 print("Loading...

lapis sequoia Jul 30, 2024, 9:22 PM

#

what happens with hidden_layers=[] ?

proven inlet Jul 30, 2024, 9:22 PM

#

they're being added to self.layers which is list of Layer objects

lapis sequoia Jul 30, 2024, 9:22 PM

#

no, if you use it empty

proven inlet Jul 30, 2024, 9:23 PM

#

it would not work

lapis sequoia Jul 30, 2024, 9:23 PM

#

why not?

proven inlet Jul 30, 2024, 9:23 PM

#

inputs will be connected to output without anything

#

it didnt work

lapis sequoia Jul 30, 2024, 9:24 PM

#

then it must be the input data-labels pairs the problem

proven inlet Jul 30, 2024, 9:25 PM

#

https://pastebin.com/K8HLiZB7 is something wrong??

Pastebin

Artificial Intelligence - Pastebin.com

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

lapis sequoia Jul 30, 2024, 9:25 PM

#

you can load it into a grid of images and paste the labels

#

with the image

#

just for a subset

proven inlet Jul 30, 2024, 9:26 PM

#

i do that like this:

def load_dataset():
    images = []
    labels = []

    for label in range(10):
        path = os.path.join("Dataset", str(label))
        for filename in os.listdir(path):
            if filename.endswith(".png"):
                image_path = os.path.join(path, filename)

                image = Image.open(image_path).convert("L")
                img_array = np.array(image).flatten() / 255
                images.append(img_array)
                labels.append(label)

    return np.array(images), np.array(labels)

lapis sequoia Jul 30, 2024, 9:26 PM

#

imho there isn't, just errors.reverse() is not used

#

to me it seems correct, but we won't know until you plot image-label pair

proven inlet Jul 30, 2024, 9:27 PM

#

proven inlet i do that like this: ```py def load_dataset(): images = [] labels = [] ...

The returned values from here passed to train function (labels are converted to onehot) and they are being used in for loop with zip method

#

lapis sequoia Jul 30, 2024, 9:28 PM

#

plot it and check

proven inlet Jul 30, 2024, 9:28 PM

#

input is image, expected_output is label. atleast it should be,

proven inlet Jul 30, 2024, 9:28 PM

#

lapis sequoia plot it and check

Alr

#

seems fine

#

onehot format

lapis sequoia Jul 30, 2024, 9:29 PM

#

no, it's not what i am saying

#

plot the images with the label, like the image of 0 with the label, then you know it's matching

proven inlet Jul 30, 2024, 9:30 PM

#

lapis sequoia plot the images with the label, like the image of 0 with the label, then you kno...

Oh

lapis sequoia Jul 30, 2024, 9:30 PM

#

and the image isn't wrong etc

proven inlet Jul 30, 2024, 9:31 PM

#

its just in array format tho

#

between 1 and 0

#

wait lemme append their path instead of their pixels so we can check

lapis sequoia Jul 30, 2024, 9:32 PM

#

like this

#

proven inlet Jul 30, 2024, 9:32 PM

#

i changed load_data function to put img path instead of pixel array and it gave me these results:

#

it seems to work

#

because its 0

#

0*

lapis sequoia Jul 30, 2024, 9:33 PM

#

also, if you have [] you still have 10 neurons with activations, and corresponding weights

#

so with empty list, it should do something

proven inlet Jul 30, 2024, 9:34 PM

#

empty list to which param

lapis sequoia Jul 30, 2024, 9:34 PM

#

hidden layers

proven inlet Jul 30, 2024, 9:34 PM

#

hiddenlayers didn't work for empty layers

proven inlet Jul 30, 2024, 9:35 PM

#

proven inlet it didnt work

this one is with empty hiddenlayer array

lapis sequoia Jul 30, 2024, 9:35 PM

#

i've tried it without trouble, i'm just saying that it should have been a simple test to try first

proven inlet Jul 30, 2024, 9:36 PM

#

i already tried it without hiddenlayers

#

it does not work

lapis sequoia Jul 30, 2024, 9:36 PM

#

could it be possible that you are only loading one number?

#

i can't guess if you don't plot random data points.

proven inlet Jul 30, 2024, 9:37 PM

#

lapis sequoia could it be possible that you are only loading one number?

no i printed onehot array and it showed me all numbers

#

i've tested that before running ai

lapis sequoia Jul 30, 2024, 9:39 PM

#

can you show a screenshot of the images in 0\1

#

and 1\1

proven inlet Jul 30, 2024, 9:39 PM

#

Alr

#

0/1.png

lapis sequoia Jul 30, 2024, 9:40 PM

#

and how many are u using? in total

proven inlet Jul 30, 2024, 9:40 PM

#

lapis sequoia and how many are u using? in total

10 numbers, all of them has 100 photos

#

1000 images total

#

Alr

arctic wedgeBOT Jul 30, 2024, 9:41 PM

#

:incoming_envelope: :ok_hand: applied timeout to @proven inlet until <t:1722376265:f> (10 minutes) (reason: attachments spam - sent 10 attachments).

The <@&831776746206265384> have been alerted for review.

lapis sequoia Jul 30, 2024, 9:41 PM

#

just one screenshot of the images in miniature

#

should fit 20 in one image

atomic tide Jul 30, 2024, 9:42 PM

#

!unmute 1018096765225938985

arctic wedgeBOT Jul 30, 2024, 9:42 PM

#

:incoming_envelope: :ok_hand: pardoned infraction timeout for @proven inlet.

proven inlet Jul 30, 2024, 9:42 PM

#

Thx

proven inlet Jul 30, 2024, 9:42 PM

#

lapis sequoia just one screenshot of the images in miniature

Dataset/1

#

Dataset/0

lapis sequoia Jul 30, 2024, 9:54 PM

#

it's not shuffled @proven inlet , maybe try that

#

you are passing first all 0s, then all 1s, ...

proven inlet Jul 30, 2024, 9:55 PM

#

lapis sequoia you are passing first all 0s, then all 1s, ...

Yes

#

is that critical problem for ai ?

lapis sequoia Jul 30, 2024, 9:55 PM

#

yeah, you are forcing the net to a local pocket probably

#

and it can't get out of it

proven inlet Jul 30, 2024, 9:55 PM

#

HMMM

#

That makes sense

#

Lemme try that

lapis sequoia Jul 30, 2024, 9:56 PM

#

mine are shuffled ;-)

proven inlet Jul 30, 2024, 9:56 PM

#

good point

#

but the ai tries all of the numbers

#

per epoch

lapis sequoia Jul 30, 2024, 9:57 PM

#

but you update the weights as you go don't you

proven inlet Jul 30, 2024, 9:57 PM

#

Yep

lapis sequoia Jul 30, 2024, 9:58 PM

#

so try, you can create a random array of 1000 numbers and pick from there

#

and then labels[i], image[i]

proven inlet Jul 30, 2024, 9:58 PM

#

lapis sequoia so try, you can create a random array of 1000 numbers and pick from there

or i can just shuffle the order of images and labels

#

thats better i think

lapis sequoia Jul 30, 2024, 9:58 PM

#

as long as you shuffle them in the same order yes

proven inlet Jul 30, 2024, 9:58 PM

#

Yep

lapis sequoia Jul 30, 2024, 9:58 PM

#

otherwise they don't match

proven inlet Jul 30, 2024, 9:59 PM

#

Yeah ima try it rn

#

it's literally unlearning 😭

#

permutation = np.random.permutation(len(images))
images = images[permutation]
labels = labels[permutation]

lapis sequoia Jul 30, 2024, 10:02 PM

#

chollet likes amd apparently

lapis sequoia Jul 30, 2024, 10:10 PM

#

proven inlet ```py permutation = np.random.permutation(len(images)) images = images[permutati...

can you try with no hidden layers, and lr=.2

proven inlet Jul 30, 2024, 10:11 PM

#

lr was 0.002

#

changing it to 0.2

lapis sequoia Jul 30, 2024, 10:11 PM

#

and hidden_layers=[]

proven inlet Jul 30, 2024, 10:11 PM

#

that was absolutely random tho but %20 crazy

#

😳 😳 😳

lapis sequoia Jul 30, 2024, 10:12 PM

#

yup

proven inlet Jul 30, 2024, 10:12 PM

#

i think

#

it starts to work

lapis sequoia Jul 30, 2024, 10:12 PM

#

been sayin

proven inlet Jul 30, 2024, 10:12 PM

#

hiddenlayers were already empty

#

for a long time

lapis sequoia Jul 30, 2024, 10:13 PM

#

alright, gotta sleep

proven inlet Jul 30, 2024, 10:13 PM

#

learning rate changed the game here i guess

proven inlet Jul 30, 2024, 10:13 PM

#

lapis sequoia alright, gotta sleep

Goodnight 🙂 thanks for helping

lapis sequoia Jul 30, 2024, 10:13 PM

#

yes, shuffling as well

#

ur welcome

proven inlet Jul 30, 2024, 10:13 PM

#

lapis sequoia yes, shuffling as well

Yeah i forgot about that part

violet gull Jul 31, 2024, 4:30 AM

#

rl batch size how much?

small wedge Jul 31, 2024, 4:36 AM

#

violet gull rl batch size how much?

depends on the task but 16-32 is usually a good starting range

violet gull Jul 31, 2024, 4:36 AM

#

how know if need more?

small wedge Jul 31, 2024, 4:36 AM

#

if your model struggles to converge

#

it's a hard variable to isolate especially in rl where there might not always be a clear metric for convergence

#

might be safer to start high like 64 and work your way down

violet gull Jul 31, 2024, 4:37 AM

#

im at 512

small wedge Jul 31, 2024, 4:38 AM

#

ideally it should be the minimum number of samples required to get a consistent estimation of the true gradient

#

if your samples are balanced well then that's probably super overkill, otherwise it's probably fine you just lose a bit of training speed from going so high

violet gull Jul 31, 2024, 4:40 AM

#

thanks you

eager plume Jul 31, 2024, 1:01 PM

#

Data science is vast field.

8 am confused in Ai/ML/Ds/Da.
That what to choose first.

serene scaffold Jul 31, 2024, 1:05 PM

#

eager plume Data science is vast field. 8 am confused in Ai/ML/Ds/Da. That what to choose f...

there's a lot of terms that aren't mutually exclusive.
machine learning is pretty much a subset of AI.
"data science" is mostly a buzzword.

spare forum Jul 31, 2024, 1:17 PM

#

DS means "using data technique to solve problem" kinda, it's large, it can be ml ai etc... But it is not necessary, DA is analytics the name speak for itself you do analytics to track business things (kpi), DL is a subset of ML which is a subset of AI

lapis sequoia Jul 31, 2024, 1:29 PM

#

this seems a nice article https://medium.com/decisionforce/understanding-mathematics-behind-floating-point-precisions-24c7aac535e3

Medium

Understanding Mathematics behind floating-point precisions

Introduction

#

"Understanding Mathematics behind floating-point precisions"

#

wait...

#

#

results are right though...but..

serene grail Jul 31, 2024, 1:39 PM

#

32 bits is 8x4 right

#

maybe a typo

lapis sequoia Jul 31, 2024, 1:39 PM

#

yes..!article reads nicely though, ig it's just a typo

lapis sequoia Jul 31, 2024, 3:04 PM

#

Recently Microsoft released a 1-bit LLM variant namely BitNet b1.58 which uses ternary {-1, 0, 1} for every single parameter. Surprisingly it matches the FP16 or BF16 precision transformer model.

serene grail Jul 31, 2024, 3:06 PM

#

lapis sequoia > Recently Microsoft released a 1-bit LLM variant namely BitNet b1.58 which uses...

I don't understand what this means

lapis sequoia Jul 31, 2024, 3:06 PM

#

it seems just to be about the values that the weights can take

#

thr paper is here: https://arxiv.org/pdf/2402.17764

#

you can see that on the first matrix

#

(it may be a lot more complex)

#

paper ends with:

Recent work like Groq5 has demonstrated promising results and great potential for building specific hardware (e.g., LPUs) for LLMs. Going one step further, we envision and call for actions to design new hardware and system specifically optimized for 1-bit LLMs, given the new computation paradigm enabled in BitNet [...

#

so i guess that's the underlying idea / goal

serene grail Jul 31, 2024, 3:10 PM

#

Interesting, so it's simplifying the values in the matrices for simpler calculations and that (in theory) gives the same results for less compute?
From my skimming that's what I got

lapis sequoia Jul 31, 2024, 3:10 PM

#

exactly

#

this happens in quantisation (converting weights from FP to Int) as well, but up to a byte (int8 numbers.)

serene grail Jul 31, 2024, 3:11 PM

#

very interesting

lapis sequoia Jul 31, 2024, 3:12 PM

#

idk why it's called bit though, probably just being ignorant

serene grail Jul 31, 2024, 3:12 PM

#

8 bits is one byte right?

lapis sequoia Jul 31, 2024, 3:12 PM

#

yes

#

i've just seen this device called 'friend' has anyone seen it?

serene grail Jul 31, 2024, 3:13 PM

#

I haven't

lapis sequoia Jul 31, 2024, 3:14 PM

#

this is a random link but...https://techcrunch.com/2024/07/30/friend-is-an-ai-companion-backed-by-founders-of-solana-perplexity-and-zfellows/

TechCrunch

Ivan Mehta

Friend's $99 necklace uses AI to help combat loneliness | TechCrunch

AI hardware is all the rage in startup land -- though receptions have thus far been mixed. Two notable examples, Rabbit and Humane, released devices to

#

if they would be able to run a 1 bit llm in small devices, it'd be a big money thing

#

(that device must use wifi)

serene grail Jul 31, 2024, 3:15 PM

#

hmm I don't know if AI is good enough yet for a device like that

lapis sequoia Jul 31, 2024, 3:16 PM

#

maybe, i do enjoy talking to llms a lot. but never buy trendy tech

#

just mentioned cuz of the 1 bit llm

serene grail Jul 31, 2024, 3:16 PM

#

oh yeah, if LLMs get much smaller that would be very good for this market

lapis sequoia Jul 31, 2024, 3:17 PM

#

do you find dull talking to llms?

serene grail Jul 31, 2024, 3:18 PM

#

not really, I'm just kind of too lazy to open up a tab in a browser and use an LLM lol
and I don't have a real use case for it besides having fun

lapis sequoia Jul 31, 2024, 3:18 PM

#

i think you can 'hey google' with Gemini

#

but yeah, not that useful

serene grail Jul 31, 2024, 3:19 PM

#

I like the idea of something like AI dungeon/novel AI more than a chatbot, they basically let you write a story together with an LLM. that can be more fun for me

lapis sequoia Jul 31, 2024, 3:20 PM

#

nice, haven't heard ab it

serene grail Jul 31, 2024, 3:22 PM

#

that's what got me interested in AI in the first place, although I didn't take it seriously until recently
AI dungeon used to use GPT2 in the beginning because that was the most advanced model

lapis sequoia Jul 31, 2024, 3:27 PM

#

just realising log(3) is 1.58, the number in the llm

#

not sure what it means

#

basically having -1,1,0 reduces matrix muls to additions

serene grail Jul 31, 2024, 3:29 PM

#

there are 3 options for the numbers, -1, 0, and 1, maybe it has something to do with that

lapis sequoia Jul 31, 2024, 3:30 PM

#

yeah, the measure of the entropy/information of each bit i think, but still idk

#

so for the alphabet is log(26), more possibilities, more entropy (i mean, not for the alphabet, cause it's not random in english text.)

tranquil ledge Jul 31, 2024, 3:32 PM

#

hello has anyone worked with TABLEAU DESKTOP before for data analysis ?

past meteor Jul 31, 2024, 3:34 PM

#

tranquil ledge hello has anyone worked with TABLEAU DESKTOP before for data analysis ?

Hi there, as a rule of thumb it's better to ask your question directly instead of looking for someone that may answer your question https://dontasktoask.com/

Many of us quickly look at the questions in the chat and prefer quickly answering something concrete

Don't ask to ask, just ask

cinder elk Jul 31, 2024, 4:06 PM

#

hey, I need some help, does anyone know how can I count the number of screws if they're overlapping or touching each other in this image, I tried dilation but didn't work

lapis sequoia Jul 31, 2024, 4:12 PM

#

i'd guess SAM could be useful there

agile cobalt Jul 31, 2024, 4:31 PM

#

maybe something like https://sites.google.com/view/f-vlm/home ?
might be overkill, not sure

F-VLM

Abstract
We present F-VLM, a simple open-vocabulary object detection method built uponFrozenVision andLanguageModels. F-VLM simplifies the current multi-stage training pipeline by eliminating the need for knowledge distillation or detection-tailored pretraining. Surprisingly, we observe that a

cinder elk Jul 31, 2024, 4:33 PM

#

I've been asked to do this by classical image segmentation methods and not use any AI💀💀
that's why I'm scratching my head over this problem

serene grail Jul 31, 2024, 4:34 PM

#

Do you need the exact number of screws or is there an acceptable error margin?

cinder elk Jul 31, 2024, 4:34 PM

#

95% accuracy it says

whole stone Jul 31, 2024, 4:38 PM

#

are the pics always at the same distance and using same sized screws?

unkempt apex Jul 31, 2024, 4:44 PM

#

cinder elk I've been asked to do this by classical image segmentation methods and not use a...

without AI? really?

cinder elk Jul 31, 2024, 4:45 PM

#

whole stone are the pics always at the same distance and using same sized screws?

no the dataset contains 3 types of screws/nuts in total. these are the other 2

cinder elk Jul 31, 2024, 4:46 PM

#

unkempt apex without AI? really?

ikr, sucks

unkempt apex Jul 31, 2024, 4:46 PM

#

then opencv will be best ( wait,.. is there any other options?)

spare forum Jul 31, 2024, 4:46 PM

#

tranquil ledge hello has anyone worked with TABLEAU DESKTOP before for data analysis ?

A bit

whole stone Jul 31, 2024, 4:47 PM

#

i would try opening instead of dilation

spare forum Jul 31, 2024, 4:47 PM

#

unkempt apex without AI? really?

Every task doesn't require AI

past meteor Jul 31, 2024, 4:47 PM

#

If you're going to only use classical computer vision (no deep nets) you'll need an entire pipeline of steps

whole stone Jul 31, 2024, 4:47 PM

#

also your class looks super cool, idk what you mean with it sucks

unkempt apex Jul 31, 2024, 4:47 PM

#

if you change the original image and maybe apply some filter , with identifying contour for each pixel , it is possible

past meteor Jul 31, 2024, 4:48 PM

#

The ones that are facing up could probably be found with a hough transform (circle detection)

#

The ones that are flat ... maybe with template matching

#

I'd try it iteratively, first use basic template matching to see how many "hits" you have

#

the nuts are super simple, that's something you can 100 % do with template matching and post processing

lapis sequoia Jul 31, 2024, 4:52 PM

#

i know of people doing it with ROI algorithms

past meteor Jul 31, 2024, 4:52 PM

#

@cinder elk https://docs.opencv.org/4.x/d4/dc6/tutorial_py_template_matching.html

lapis sequoia Jul 31, 2024, 4:52 PM

#

it can not be that hard..(not saying it's easy either.)

agile cobalt Jul 31, 2024, 4:53 PM

#

past meteor <@757655435988959354> https://docs.opencv.org/4.x/d4/dc6/tutorial_py_template_m...

Does that handles rotation?

past meteor Jul 31, 2024, 4:53 PM

#

yes

#

Or rather, there's extensions of the basic template matching that can handle rotation and scale

#

Like SIFT

cinder elk Jul 31, 2024, 4:55 PM

#

past meteor Or rather, there's extensions of the basic template matching that can handle rot...

gotta check this out

past meteor Jul 31, 2024, 4:56 PM

#

https://docs.opencv.org/4.x/da/df5/tutorial_py_sift_intro.html

#

If you know the right words you'll find an example pretty quickly

#

cinder elk Jul 31, 2024, 4:58 PM

#

wow, thanks @past meteor

serene grail Jul 31, 2024, 4:59 PM

#

woah this is cool, I should look into cv

past meteor Jul 31, 2024, 4:59 PM

#

https://stackoverflow.com/questions/76090694/how-to-find-all-matching-objects-in-an-image-with-sift

Stack Overflow

How to find all matching objects in an image with SIFT

I have a picture of a diamond card, and a small picture of one diamond, I'm trying to find all the diamonds in the big picture
Below are the pictures:
Below is the experimental code:
using System.

cinder elk Jul 31, 2024, 4:59 PM

#

I had only been trying to do this just by OTSU and contouring. Definitely learning new things here

past meteor Jul 31, 2024, 4:59 PM

#

And we're at the right sentence

#

"How to find all objects in the scene with SIFT"

past meteor Jul 31, 2024, 5:00 PM

#

cinder elk wow, thanks <@260493929047130113>

Anyway, you'll have to keep googling etc. that's what I'd do. A lot of googling and trial and error, but you should be on the right path now 🙂

#

okay, I really got nerdsniped by this one

#

Maybe just template matching but rotating your template and changing the scale can help

#

So basically, doing half of SIFTs algo

cinder elk Jul 31, 2024, 5:41 PM

#

at that point I might use sift as well, I'll sit to code once again after my dinner let's see how this pans out

cinder elk Jul 31, 2024, 5:42 PM

#

whole stone also your class looks super cool, idk what you mean with it sucks

ignore my rants, I do find it interesting actually

left tartan Jul 31, 2024, 5:51 PM

#

past meteor Maybe just template matching but rotating your template and changing the scale c...

Altho some of those nails look like they're facing up. 😢

past meteor Jul 31, 2024, 5:52 PM

#

left tartan Altho some of those nails look like they're facing up. 😢

Yup, you'd need 2 templates

#

Or template + hough transform like I mentioned

#

Hard to say what'll work without trying it 😄

spare forum Jul 31, 2024, 6:48 PM

#

tranquil ledge hello has anyone worked with TABLEAU DESKTOP before for data analysis ?

Don't dm people, thx

violet gull Jul 31, 2024, 7:31 PM

#

hi friends, I ran this for several hours https://github.com/pytorch/examples/blob/main/reinforcement_learning/reinforce.py and i am wondering why it does not look as expected. its made by pytorch so i would expect it to converge on such a simple model but it clearly hasnot converged. The red is the running average and the blue are individual episode scores

GitHub

examples/reinforcement_learning/reinforce.py at main · pytorch/exam...

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc. - pytorch/examples

agile cobalt Jul 31, 2024, 7:37 PM

#

it sounds like it's been broken for a while?
that example hasn't been updated since 2022 too, and there is an issue claiming it does not works at all https://github.com/pytorch/examples/issues/1213

edit; could be that this guy is trying to run an older version of gym or maybe gym vs Gymnasium

just guessing though, I haven't tried running it myself though

violet gull Jul 31, 2024, 7:38 PM

#

agile cobalt it sounds like it's been broken for a while? that example hasn't been updated si...

do you know of any working RL examples? i have searched for hours cant havent found one

violet gull Jul 31, 2024, 7:40 PM

#

agile cobalt it sounds like it's been broken for a while? that example hasn't been updated si...

it runs for me just the statistics it uses dont make sense

agile cobalt Jul 31, 2024, 7:43 PM

#

I mean, https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html should work, iirc they actually run the Notebooks and the time/output displayed should be the real time/output from their CI/CD

violet gull Jul 31, 2024, 7:43 PM

#

agile cobalt I mean, https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html...

look at the graph on the bottom of that

#

its not convergent

agile cobalt Jul 31, 2024, 7:44 PM

#

...........oh

lapis sequoia Jul 31, 2024, 8:10 PM

#

do you guys know how to get to formula (1) ? from the previous one

#

it's from this paper https://arxiv.org/abs/1412.0233

arXiv.org

The Loss Surfaces of Multilayer Networks

We study the connection between the highly non-convex loss function of a simple model of the fully-connected feed-forward neural network and the Hamiltonian of the spherical spin-glass model under the assumptions of: i) variable independence, ii) redundancy in network parametrization, and iii) uniformity. These assumptions enable us to explain t...

wooden sail Jul 31, 2024, 8:16 PM

#

it's explained in the text just underneath

#

you'll have to take out a pen and paper and try out their construction with 2 layers to see it

lapis sequoia Jul 31, 2024, 8:19 PM

#

it has to be derivable algebraically i assume

#

i did test it, but that's not my doubt

wooden sail Jul 31, 2024, 8:20 PM

#

what's the question?

lapis sequoia Jul 31, 2024, 8:20 PM

#

how to get from formula on top, to the one in the bottom, algebraically

#

or if it's got a name or smth

#

for example, sigma isn't there

wooden sail Jul 31, 2024, 8:21 PM

#

matrix multiplication can be defined as a double sum over the individual elements of the vectors and matrices involved

#

that's about it

lapis sequoia Jul 31, 2024, 8:21 PM

#

that's not just mmul, you've got an activation

wooden sail Jul 31, 2024, 8:21 PM

#

they replace the relus with a binary matrix A

lapis sequoia Jul 31, 2024, 8:22 PM

#

that seems an approximation

wooden sail Jul 31, 2024, 8:22 PM

#

why?

lapis sequoia Jul 31, 2024, 8:22 PM

#

sigma doesn't just output 1 or 0

wooden sail Jul 31, 2024, 8:22 PM

#

it does if you use a relu

lapis sequoia Jul 31, 2024, 8:22 PM

#

oh, it

wooden sail Jul 31, 2024, 8:22 PM

#

if you take an input x, a relu outputs either x or 0

lapis sequoia Jul 31, 2024, 8:23 PM

#

it's not s sigmoid

wooden sail Jul 31, 2024, 8:23 PM

#

the paper says everything it's using

lapis sequoia Jul 31, 2024, 8:23 PM

#

i still don't think it's obvious the multiplicatory of weights

wooden sail Jul 31, 2024, 8:23 PM

#

lapis sequoia Jul 31, 2024, 8:23 PM

#

which is why i asked if there was some derivation

wooden sail Jul 31, 2024, 8:23 PM

#

they explain it explicitly in words right under the equation

lapis sequoia Jul 31, 2024, 8:24 PM

#

we are talking ab different things

wooden sail Jul 31, 2024, 8:24 PM

#

#

everything you've asked is written there

lapis sequoia Jul 31, 2024, 8:24 PM

#

you don't just build the formula reading the paragraph, do you?

wooden sail Jul 31, 2024, 8:24 PM

#

yes you do

lapis sequoia Jul 31, 2024, 8:25 PM

#

ok, sorry im not like you, so i asked to try to understand

wooden sail Jul 31, 2024, 8:25 PM

#

using the paragraph and the definition of matrix multiplication, plus the info they gave above (using only dense layers and relu)

#

that's why i suggested you do it explicitly for 2 layers on paper

lapis sequoia Jul 31, 2024, 8:25 PM

#

i already did that

wooden sail Jul 31, 2024, 8:25 PM

#

did you really?

lapis sequoia Jul 31, 2024, 8:26 PM

#

yes, why?

wooden sail Jul 31, 2024, 8:26 PM

#

if you shows pics of your work on paper, others will be able to help more easily

#

since you'll have more concise questions about the parts that seem to be off

lapis sequoia Jul 31, 2024, 8:28 PM

#

ill ask elsewhere, thanks, you must feel good now.

brave yew Jul 31, 2024, 9:16 PM

#

guys... has anyone here worked with sentiment analysis pipeline of transformers library? for some reason my longer inputs are taking up to 10x less time than my one line inputs

#

i am using my gpu to process... atleast that is what i have specified in my pipeline method

unkempt wigeon Jul 31, 2024, 9:23 PM

#

I I can use some help I'm trying to learn no networks unfortunately I need to hear somebody saying it for me to truly learn because I always lose my place with reading and sorry

violet gull Jul 31, 2024, 9:34 PM

#

unkempt wigeon I I can use some help I'm trying to learn no networks unfortunately I need to he...

good thing youtube videos exist

unkempt wigeon Jul 31, 2024, 9:35 PM

#

I tried and I still don't understand it I'm sorry

thorn flame Jul 31, 2024, 10:10 PM

#

Hi! I'm trying to build a recommendation system as an ML noob, is there some kind of good resource that could help?

#

I believe the process is the same for all types, just training data could be different

#

I'm given 1hr30mins to solve this. For an ML noob, is this feasible?

buoyant vine Jul 31, 2024, 11:44 PM

#

Eh kind of

#

maybe not for a complete noob tho

#

A basic recommendation system, you want some vectors which represent user interests

#

and the easiest method is to mean pool those vectors of the users lists of interests

#

to get an average vector that hopefully encapsulates the context of all rolled up vectors

#

and then KNN search over some dataset of all the vectors to find similar things

thorn flame Jul 31, 2024, 11:58 PM

#

buoyant vine A basic recommendation system, you want some vectors which represent user intere...

How do I convert to vectors.

#

I understand I need to convert the input datasets to pandas dataframes.

buoyant vine Jul 31, 2024, 11:59 PM

#

Ehhhhhhh it kind of depends

thorn flame Jul 31, 2024, 11:59 PM

#

I also understand I could use the surprise package for the algorithm and model evaluation (I could be wrong)

buoyant vine Jul 31, 2024, 11:59 PM

#

easiest honestly is spit out a bunch of keywords for what ever content

#

and then feed it into some LLM encoder to generate the emebeddings that can be used for KNN

#

Have a look at the sentence transformers library for that

mental rampart Aug 1, 2024, 12:06 AM

#

hey , i was having some doubt regarding tensor conversion from text in pytorch and nlp

#

if my tokens are words, how should i convert them to tensor?

thorn flame Aug 1, 2024, 12:07 AM

#

buoyant vine and then feed it into some LLM encoder to generate the emebeddings that can be u...

Pretty lost. How do I use the embeddings with KNNBasic class from surprise package for example?

thorn flame Aug 1, 2024, 12:23 AM

#

Also by transformers lib, do you mean hugging face's?

thorn flame Aug 1, 2024, 12:26 AM

#

thorn flame Pretty lost. How do I use the embeddings with KNNBasic class from surprise pack...

I think what I need to do is content-based filtering not collaborative filtering so the KNN class won't be needed I guess

thorn flame Aug 1, 2024, 12:29 AM

#

thorn flame Also by transformers lib, do you mean hugging face's?

Gotcha: https://huggingface.co/sentence-transformers

sentence-transformers (Sentence Transformers)

thorn flame Aug 1, 2024, 12:36 AM

#

thorn flame I think what I need to do is content-based filtering not collaborative filtering...

@buoyant vine let me know what you think pls

#

I believe I could get the cosine similarity after getting the embeddings with sentence transformers

buoyant vine Aug 1, 2024, 12:53 AM

#

Yes

#

I am about to go to sleep, but what you want

#

Is say a user wants some recommendations relating to star wars, you would encode that text/keywords into an embedding with sentence transformers

#

And then do knn search for close/similar results in your database

#

Where the keywords / records in your database have also been encoded with the same sentence transformer model

#

If you want recommendation by something like, "given a user's watch history recommend some other shows the user might like" then you take the average of all the embeddings the user has watched and use that averaged embedding to do the knn search

#

Or if you want "user watched X video, recommend them some other similar videos" you'd take the embedding of video X (be that generated from keywords or what not) and use that for the knn

#

Basically your whole goal is to get a index of various embeddings for the dataset of content you want to be able to select and return

#

Then it is just a case of generating a query embeddings to suite what sort of application you want

#

Make some sense?

thorn flame Aug 1, 2024, 12:58 AM

#

It's basically an ecommerce platform

#

To provide products for users based on purchase history and whatnot

#

And browser activity

#

Doesn't knn search compare with preferences of other users??

buoyant vine Aug 1, 2024, 1:01 AM

#

No?

#

It is completely arbitrary

#

All it does is calculate the distance between two points in a graph effectively

#

A vector is like a set of coordinates or a postcode

thorn flame Aug 1, 2024, 1:02 AM

#

Which lib are you suggesting I use for that?

#

scikit-learn ??

buoyant vine Aug 1, 2024, 1:03 AM

#

When you do KNN search, you effectively are asking "what are the closest other data points available to me from this point?"

thorn flame Aug 1, 2024, 1:03 AM

#

Cos the KNN class from surprise lib works with some kind of Reader format

buoyant vine Aug 1, 2024, 1:03 AM

#

I would suggest Sentence transformers to generate the embeddings, and PyNNDescent as the index

thorn flame Aug 1, 2024, 1:03 AM

#

Index how?

buoyant vine Aug 1, 2024, 1:04 AM

#

You give it a bunch of embeddings, generated by what ever content you have

#

And it will make a index that can be searched, i.e. you give it a query vector/embedding, it gives you the top K back

thorn flame Aug 1, 2024, 1:04 AM

#

I dig

#

It basically implements its own knn algo I guess

buoyant vine Aug 1, 2024, 1:05 AM

#

Quickly* rather than brute forcing checking every point

#

I wouldn't worry about the algo ATM.

#

It doesn't really matter to your use case

#

Just that it is faster than brute force

#

And provides a convenient way of going "hey here are all my embeddings, give me the closest points to X"

#

Start with something basic, i.e. don't worry about taking in the user history or what ever

thorn flame Aug 1, 2024, 1:06 AM

#

so what I'm imagining is having to pass a user with necessary context to my function, and it returns a list of products based on my products dataset

buoyant vine Aug 1, 2024, 1:07 AM

#

Just take some input text to begin with, encode it then do the knn search

#

Once you have that it should start to make more sense

#

And then you can start looking at doing it off of the user history

thorn flame Aug 1, 2024, 1:07 AM

#

Do I need a db for this?

buoyant vine Aug 1, 2024, 1:07 AM

#

No

#

You can do it in memory, doesn't really matter

thorn flame Aug 1, 2024, 1:08 AM

#

Everything is stored in memory?

#

I see

buoyant vine Aug 1, 2024, 1:08 AM

#

Unless your dataset is huge, but I suspect that is not really a issue here

thorn flame Aug 1, 2024, 1:08 AM

#

buoyant vine Unless your dataset is huge, but I suspect that is not really a issue here

Yeah

#

I'm wondering if a possible result can return multiple similarities

#

Instead of just one

buoyant vine Aug 1, 2024, 1:09 AM

#

That is what the K is

thorn flame Aug 1, 2024, 1:09 AM

#

I see

#

Makes sense now :)

buoyant vine Aug 1, 2024, 1:09 AM

#

so say with nndescent you can tell it "take the top 10" etc...

thorn flame Aug 1, 2024, 1:09 AM

#

That's dope

#

Thanks. I'll run with this

#

Would be my first actual intro to ML if I'm successful with the engine heh

cloud cosmos Aug 1, 2024, 4:00 AM

#

Hello

#

Im trying to make a text-speech ai and I don't know where to start exactly any tips?

brave yew Aug 1, 2024, 4:26 AM

#

brave yew guys... has anyone here worked with sentiment analysis pipeline of transformers ...

anyone? please?

brave yew Aug 1, 2024, 4:27 AM

#

cloud cosmos Im trying to make a text-speech ai and I don't know where to start exactly any t...

well pick up python if you don't know it yet, and start exploring the hugging face transformers libarary and look into its pipelines, if the pipelines are not suitable for your application you can fine tune it into your own nlp model using pytorch libarary

cloud cosmos Aug 1, 2024, 4:28 AM

#

brave yew well pick up python if you don't know it yet, and start exploring the hugging fa...

Okay thank you so much

quasi crag Aug 1, 2024, 4:51 AM

#

Would be my furst actual im trying to make twxt spech to im succesfull with the engine 🏋️

cinder elk Aug 1, 2024, 9:21 AM

#

past meteor okay, I really got nerdsniped by this one

tried to use sift with flann but it still isn't that accurate

raw tree Aug 1, 2024, 10:14 AM

#

Hey - quick question - can you/what is the sanest way to load safetensors into keras ?

raw tree Aug 1, 2024, 10:15 AM

#

brave yew guys... has anyone here worked with sentiment analysis pipeline of transformers ...

less ?
like the longer string is faster ?

#

wut O.O

brave yew Aug 1, 2024, 10:16 AM

#

raw tree less ? like the longer string is faster ?

Yeah

raw tree Aug 1, 2024, 10:18 AM

#

brave yew Yeah

it does have more tokens too right ?

#

If so, absolutlely no clue

brave yew Aug 1, 2024, 10:19 AM

#

raw tree it does have more tokens too right ?

Yes one was a 5 letter sentence the other was a whole paragraph, I will send a screenshot once I get back from classes

raw tree Aug 1, 2024, 10:20 AM

#

raw tree Hey - quick question - can you/what is the sanest way to load safetensors into k...

should I open up a help channel ?

stoic topaz Aug 1, 2024, 10:30 AM

#

hi, im in high school and have a free choice data analysis project (using powerbi)

any interesting dataset/analysis ideas?

woeful sorrel Aug 1, 2024, 10:38 AM

#

Hellllo friends

#

Man how to read a CSV file in pandas

cinder elk Aug 1, 2024, 10:45 AM

#

can I use feature matching algorithms like sift/surf to find matches that are close enough but not perfect matches?

#

reference

toxic mortar Aug 1, 2024, 10:48 AM

#

Code:

!pip install -q transformers==4.31.0
from transformers import DistilBertTokenizerFast
from transformers import TFDistilBertForSequenceClassification
  sentiment_model = TFDistilBertForSequenceClassification.from_pretrained('distilbert-base-uncased',num_labels=2)

Error:

tokenizer_config.json:   0%|          | 0.00/48.0 [00:00<?, ?B/s]
vocab.txt:   0%|          | 0.00/232k [00:00<?, ?B/s]
tokenizer.json:   0%|          | 0.00/466k [00:00<?, ?B/s]
config.json:   0%|          | 0.00/483 [00:00<?, ?B/s]
model_training/kinesiologie_tape_new/sentiment/tf_model.h5 exists on GCP = False
Sentiment analysis stage (1/2) - est. time is 2 minutes
model.safetensors:   0%|          | 0.00/268M [00:00<?, ?B/s]
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-31-7a684e7281c2> in <cell line: 2>()
     30   ))
     31 
---> 32   sentiment_model = TFDistilBertForSequenceClassification.from_pretrained('distilbert-base-uncased',num_labels=2)
     33   optimizer = tf.keras.optimizers.Adam(learning_rate=5e-5, epsilon=1e-08)
     34   losss = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)

2 frames
/usr/local/lib/python3.10/dist-packages/transformers/modeling_tf_utils.py in build(self, input_shape)
   1129     def build(self, input_shape=None):
   1130         call_context = get_call_context_function()
-> 1131         if self.built or call_context().in_call:
   1132             self.built = True
   1133         else:

TypeError: 'NoneType' object is not callable

#

Any help?

#

I followed their official docs:
https://huggingface.co/docs/transformers/model_doc/distilbert

DistilBERT

cerulean violet Aug 1, 2024, 10:51 AM

#

Hello I am trying to make a small AI model which is for my bot used for moderation,any idea where to start?

cinder elk Aug 1, 2024, 10:56 AM

#

what are your intended goals?
does it need to process images/videos or just for text?
if text only look into nltk for sentiment analysis and keywords.
Or you can use langchain

tranquil ledge Aug 1, 2024, 11:17 AM

#

Hello does anyone worked with Tableau desktop before

#

#

i want to create this table

#data-science-and-ml

Start countdown for 3 minutes