muted crypt May 4, 2023, 2:01 PM

#

Makes sense yeah

past meteor May 4, 2023, 2:01 PM

#

Do you rescale your output?

muted crypt May 4, 2023, 2:02 PM

#

I've done that with just one of the coordinates, latitude vs time and just a column

past meteor May 4, 2023, 2:02 PM

#

I assume you're using MSE or something similar but lat, lon and altitude are on different scales

muted crypt May 4, 2023, 2:02 PM

#

I guess that is wrong

#

This is the X data

past meteor May 4, 2023, 2:03 PM

#

Like, on paper what you're doing is that you have a 3D input that is fed into your model and together with the latent vector Z it produces a 3D output. This 3D output is what produces the loss, this is pretty much like multi task learning.

muted crypt May 4, 2023, 2:05 PM

#

past meteor Like, on paper what you're doing is that you have a 3D input that is fed into yo...

what I don't get is how it can compute the loss if you don't provide it also the real trajectory

past meteor May 4, 2023, 2:05 PM

#

What you should be predicting is the real trajectory

muted crypt May 4, 2023, 2:06 PM

#

indeed, but in the train dataframe you do need to have it, right?

cold osprey May 4, 2023, 2:06 PM

#

huh

past meteor May 4, 2023, 2:06 PM

#

Are you using Pytorch or Tensorflow? I'm going to look for an example

muted crypt May 4, 2023, 2:06 PM

#

Keras

muted crypt May 4, 2023, 2:06 PM

#

past meteor Are you using Pytorch or Tensorflow? I'm going to look for an example

that would help because I'm starting to get lost

past meteor May 4, 2023, 2:13 PM

#

muted crypt that would help because I'm starting to get lost

https://www.tensorflow.org/guide/keras/rnn is a good place to start

TensorFlow

Recurrent Neural Networks (RNN) with Keras | TensorFlow Core

muted crypt May 4, 2023, 2:15 PM

#

past meteor https://www.tensorflow.org/guide/keras/rnn is a good place to start

okay I'll check that out! thank you!
before leaving though, can you give some guidance on what your approach would be? Is it testing on one flight first and then, once loss=0, adding other flights?

past meteor May 4, 2023, 2:16 PM

#

You just need a SimpleRNN and Dense layer to start with

muted crypt May 4, 2023, 2:16 PM

#

as different flights are unrelated, can they all be dumped in the same array?

young granite May 4, 2023, 2:17 PM

#

array is just a form of input

past meteor May 4, 2023, 2:18 PM

#

I don't know about Keras specifically but you typically end up with a dataset that looks like this: num_steps x features x num_outputs so you can have unrelated ones

#

num_outputs is your y_train

#

You're conditioning over windows meaning if your window is of size 3 it'll look like [1,2,3] -> 3(pred) ; [2, 3, 4] -> 4 (pred) ; [3, 4, 5] -> 5 (pred) ....
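A minimal sketch of that windowing, assuming NumPy and that the intended and real series are already aligned row by row. The helper name `make_windows` and the toy numbers are made up for illustration. Each window of inputs is paired with the target at the window's last step, as in `[1, 2, 3] -> 3 (pred)`.

```python
import numpy as np


def make_windows(inputs, targets, window):
    """Pair each run of `window` consecutive inputs with the target at the run's last step."""
    inputs = np.asarray(inputs, dtype=float)
    targets = np.asarray(targets, dtype=float)
    n_windows = len(inputs) - window + 1
    X = np.stack([inputs[i:i + window] for i in range(n_windows)])
    y = targets[window - 1:]
    return X, y


# Toy data (hypothetical): inputs 1..5, targets 10..50
X, y = make_windows([1, 2, 3, 4, 5], [10, 20, 30, 40, 50], window=3)
# X rows: [1, 2, 3], [2, 3, 4], [3, 4, 5]; y: 30, 40, 50
```

Windows from unrelated flights can be stacked into the same `X` as long as no single window crosses a flight boundary.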

muted crypt May 4, 2023, 2:21 PM

#

Say that I have 50 flights, each with 10 features. They vary over time: the first flight has a latitude that varies over time, plus speed, altitude... Do all of these have to be an array inside every cell of the matrix, or is it better to have an id column and simply have as many rows as there are points across all the flights?

#

This is where I struggle the most, I don't really find the best approach on the dataset you mentioned

past meteor May 4, 2023, 2:22 PM

#

muted crypt This is where I struggle the most, I don't really find the best approach on the ...

https://tsfresh.readthedocs.io/en/latest/text/forecasting.html read this

muted crypt May 4, 2023, 2:22 PM

#

I had tested this for 12 flights

#

FPLlat being the latitude of the intended trajectory and TELlat the latitude of the real trajectory

muted crypt May 4, 2023, 2:26 PM

#

past meteor https://tsfresh.readthedocs.io/en/latest/text/forecasting.html read this

yet once again the same question appears for me. These train on a single sequence and looking at the previous values it gives a prediction. Should the Y be the real and the window size contain the intended points?

#

Like [1_intended, 2_intended, 3_intended] -> 3_real (pred)?

#

The thing here is that i don't want to extend a timeseries but rather generate a new one from a given one, this messes up my head

past meteor May 4, 2023, 2:28 PM

#

You need to try putting it into words what your task is. Are you trying to map a coordinate from intended to flown (fully markovian) or are you trying to map a coordinate from intended to flown, given the past few coordinates (markovian over a window)

#

Depending on how these drones work, your initial results and what you want you may even need a bi-directional RNN (but don't start doing this yet)

#

Maybe a reasonable feature-set and approach is using a feedforward net (no RNN) with [X,Y,Z, starting X, starting Y, starting Z, time since start, time to end] as features

muted crypt May 4, 2023, 2:32 PM

#

past meteor May 4, 2023, 2:33 PM

#

Why? I think that potentially points in the middle are the ones that are off. The points close to the start and close to the end are typically similar (according to the pics)

#

You can also make the problem easier by predicting the diff between intended and flown etc etc

#

You just need to focus on understanding your task better on the ML side I think

muted crypt May 4, 2023, 2:36 PM

#

past meteor You can also make the problem easier by predicting the diff between intended an...

So what I've done is a study of this already, like the evolution of the error along the trajectory, finding the weak points, comparing both datasets... but now I have to implement this in ML

#

I've been working on that for months and I feel like I have all the necessary data to feed it into a NN but can't figure it out because I haven't been able to find any similar example

muted crypt May 4, 2023, 2:39 PM

#

past meteor You need to try putting it into words what your task is. Are you trying to map a...

and this is fully markovian, no previous info from the flown flight. Just the intended points

muted crypt May 4, 2023, 3:12 PM

#

past meteor You just need to focus on understanding your task better on the ML side I think

Is this what you meant by loss=0? I think I've managed something

cold osprey May 4, 2023, 3:26 PM

#

seems good

#

loss should be very close to 0

past meteor May 4, 2023, 3:28 PM

#

muted crypt Is this what you meant by loss=0? I think I've managed something

Indeed this is what I meant

muted crypt May 4, 2023, 3:33 PM

#

but i've done that by training the model with just a feature (latitude, and the time stamps) of a single flight

cold osprey May 4, 2023, 3:34 PM

#

try with lat long and alt

muted crypt May 4, 2023, 3:35 PM

#

cold osprey try with lat long and alt

how?

past meteor May 4, 2023, 3:35 PM

#

Are you only doing one of the 3 dimensions now @muted crypt ?

muted crypt May 4, 2023, 3:35 PM

#

past meteor Are you only doing one of the 3 dimensions now @muted crypt ?

in this test, yes

past meteor May 4, 2023, 3:36 PM

#

Okay I'll try explaining what I meant again with a lot less jargon:

muted crypt May 4, 2023, 3:36 PM

#

My dataframe is originally just a column, should I do 3 columns now and then what the Y is the next 3 values of lat, long, alt too?

past meteor May 4, 2023, 3:36 PM

#

Your task: X, Y, Z (intended) -> X, Y, Z (actual) for all time steps T for all flights.

muted crypt May 4, 2023, 3:36 PM

#

past meteor Okay I'll try explaining what I meant again with a lot less jargon:

please do. By the way, I am not a computer scientist or similar, so this is very appreciated. I have to deliver this as a thesis but my knowledge is quite low, hence my dumb questions

past meteor May 4, 2023, 3:37 PM

#

Looking at your images, intended != actual typically in the middle of your flight (look at your plots)

#

So a baseline model (could be linear regression even 😄 ) is the following: time_since_start, time_to_end and you predict 3 things: difference_intended_flown_X, difference_intended_flown_Y, difference_intended_flown_Z

#

My intuition is that this model is already going to be quite good! 😄 This is a drastic simplification of your problem

cold osprey May 4, 2023, 3:40 PM

#

diff_X, diff_Y, diff_Z as a function of time since start and time to end?

#

hmmm

past meteor May 4, 2023, 3:40 PM

#

No. EDIT: actually yes, I misread.

muted crypt May 4, 2023, 3:41 PM

#

how can I know the diff_X before the flight? does diff_X mean the distance to the real? or the distance to the start?

past meteor May 4, 2023, 3:41 PM

#

muted crypt how can I know the diff_X before the flight? does diff_X mean the distance to th...

Do you need to know all of these quantities before the flight?

muted crypt May 4, 2023, 3:42 PM

#

Just the positions where the drone will fly by and at which time

#

and velocity for instance of each segment, from which you can get the time

past meteor May 4, 2023, 3:43 PM

#

So, I suspect that the model will be used to "adjust" the intended path to correspond to the actual path ahead of flying right? (and not during)

muted crypt May 4, 2023, 3:43 PM

#

past meteor So, I suspect that the model will be used to "adjust" the intended path to corre...

that is correct

past meteor May 4, 2023, 3:43 PM

#

Then you can definitely make a model like I described above

muted crypt May 4, 2023, 3:43 PM

#

so you can't "rely" on previous information from that intended flight

past meteor May 4, 2023, 3:44 PM

#

Even if it's a bad model, you need to make it imo because it's a good baseline to compare other models to

cold osprey May 4, 2023, 3:44 PM

#

is the predicting real-time? as in as the drone flies, it will show the predicted path it will take

past meteor May 4, 2023, 3:45 PM

#

Based on looking at this I suspect that there is indeed a relationship between the time to start, time to end and the difference between intended and flown

muted crypt May 4, 2023, 3:46 PM

#

past meteor Even if it's a bad model, you need to make it imo because it's a good baseline t...

I would like to do a simple model yes, but it really varies a lot, the flown trajectories are quite unexpected sometimes so I'm not sure it will perform too nice

cold osprey May 4, 2023, 3:46 PM

#

with the simple model past meteor proposed:

have intended flight path
use model to get diffX diffY diffZ
use diff(s) and intended flight path to get predicted flight path
compare to actual flight path

muted crypt May 4, 2023, 3:47 PM

#

so combining diff with intended I see

#

Yet the model has to take both intended and real right?

cold osprey May 4, 2023, 3:48 PM

#

for past meteor's model, the model only takes time since start and time to end as inputs

past meteor May 4, 2023, 3:48 PM

#

Exactly that

cold osprey May 4, 2023, 3:48 PM

#

it doesn't 'care' about the X Y Z per se

past meteor May 4, 2023, 3:49 PM

#

You can use this: https://scikit-learn.org/stable/modules/generated/sklearn.multioutput.MultiOutputRegressor.html to wrap any regression model to predict diff_X, diff_Y and diff_Z. Do note you're fitting three models at the same time.

scikit-learn

sklearn.multioutput.MultiOutputRegressor

Examples using sklearn.multioutput.MultiOutputRegressor: Comparing random forests and the multi-output meta estimator
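A rough sketch of that baseline with `MultiOutputRegressor`, assuming scikit-learn and NumPy are installed. The data is synthetic and every coefficient is made up; `Ridge` is just one possible choice of base regressor, not something fixed by the discussion.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.multioutput import MultiOutputRegressor

rng = np.random.default_rng(0)

# Synthetic stand-in for real flights (all values are made up)
n = 200
duration = rng.uniform(30.0, 90.0, size=n)            # total flight time per sample (s)
time_since_start = rng.uniform(0.0, 1.0, size=n) * duration
time_to_end = duration - time_since_start
X = np.column_stack([time_since_start, time_to_end])

# Fake targets: diff_X, diff_Y, diff_Z with a little noise
y = np.column_stack([
    0.02 * time_since_start,
    0.01 * time_to_end,
    0.005 * (time_since_start + time_to_end),
]) + rng.normal(0.0, 0.01, size=(n, 3))

# One Ridge model is fitted per output column
model = MultiOutputRegressor(Ridge(alpha=1.0))
model.fit(X, y)

pred = model.predict(X[:5])   # shape (5, 3): diff_X, diff_Y, diff_Z per row
```

The fitted per-column models are available as `model.estimators_`, which makes it easy to check each axis separately.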

muted crypt May 4, 2023, 3:49 PM

#

and what would be the Y in the model then?

cold osprey May 4, 2023, 3:49 PM

#

muted crypt and what would be the Y in the model then?

lat or long ig

#

X and Y - lat and long

past meteor May 4, 2023, 3:49 PM

#

muted crypt and what would be the Y in the model then?

y = [diff_X, diff_Y and diff_Z]

cold osprey May 4, 2023, 3:49 PM

#

Z - altitude

#

oh soz

#

u mean that y

past meteor May 4, 2023, 3:50 PM

#

Indeed

muted crypt May 4, 2023, 3:50 PM

#

past meteor y = [diff_X, diff_Y and diff_Z]

and we get this one beforehand right?

past meteor May 4, 2023, 3:50 PM

#

Yeah, it's easy to compute this no?

#

For all flights you subtract the intended from the actual
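As a small sketch of that subtraction, assuming NumPy and that intended and actual samples are already aligned row by row (the arrays below are invented for illustration):

```python
import numpy as np

# Hypothetical aligned samples: columns are X, Y, Z
intended = np.array([[0.0, 0.0, 10.0],
                     [1.0, 0.0, 10.0],
                     [2.0, 0.0, 10.0]])
actual = np.array([[0.0, 0.1, 10.0],
                   [1.2, 0.3, 9.5],
                   [2.0, 0.0, 10.0]])

# Training target: actual minus intended, per axis
diff = actual - intended

# At prediction time the estimated path is intended + predicted diff;
# here the true diff is used just to show the round trip
reconstructed = intended + diff
```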

muted crypt May 4, 2023, 3:50 PM

#

here a big question arises though, do you take into account the time shift?

cold osprey May 4, 2023, 3:51 PM

#

what is the time shift?

past meteor May 4, 2023, 3:51 PM

#

What do you mean with time shift indeed?

muted crypt May 4, 2023, 3:51 PM

#

cold osprey May 4, 2023, 3:51 PM

#

i think we r using euclidean here

past meteor May 4, 2023, 3:51 PM

#

Your time series are not aligned?

cold osprey May 4, 2023, 3:51 PM

#

oh

#

Think

muted crypt May 4, 2023, 3:52 PM

#

check this out:
Intended: https://paste.pythondiscord.com/budagaqufi
Real: https://paste.pythondiscord.com/oduxafuziv

cold osprey May 4, 2023, 3:52 PM

#

real link doesnt work

#

whats FLPturn and FLPwpt?

muted crypt May 4, 2023, 3:53 PM

#

cold osprey real link doesnt work

updated!

past meteor May 4, 2023, 3:54 PM

#

So for the intended one you have a lot less samples than for flown?

muted crypt May 4, 2023, 3:54 PM

#

So the Real is the data recorded by the drone in 0.1 second increments ('secs' column); the intended is just the trajectory to be followed

past meteor May 4, 2023, 3:54 PM

#

Because it generates a bunch of waypoints

cold osprey May 4, 2023, 3:54 PM

#

hmmm

muted crypt May 4, 2023, 3:54 PM

#

past meteor So for the intended one you have a lot less samples than for flown?

it's just the waypoint that you specify -> the drone flies towards them

cold osprey May 4, 2023, 3:55 PM

#

and we want to get from intended to real without actually flying the drone

past meteor May 4, 2023, 3:55 PM

#

Does the drone pass every waypoint?

muted crypt May 4, 2023, 3:55 PM

#

cold osprey and we want to get from intended to real without actually flying the drone

correct

muted crypt May 4, 2023, 3:55 PM

#

past meteor Does the drone pass every waypoint?

yes:

#

well not exactly on top, but very close at least

cold osprey May 4, 2023, 3:56 PM

#

1st data row of real corresponds to 1st waypoint?

past meteor May 4, 2023, 3:56 PM

#

Okay, then I would only predict 12 points per flight, the ones closest to the waypoint

muted crypt May 4, 2023, 3:57 PM

#

cold osprey 1st data row of real corresponds to 1st waypoint?

not necessarily, it's when the drone sensor is turned on

past meteor May 4, 2023, 3:57 PM

#

Why? Otherwise you're making big assumptions about the flight. You can't upsample the data / linearly interpolate between waypoints to have the same sample rate as the drone unless you're 100 % sure the drone is programmed to move between waypoints in a straight line

cold osprey May 4, 2023, 3:58 PM

#

need a way to align them

past meteor May 4, 2023, 3:58 PM

#

if you are sure that the path between the waypoints is linear then you can upsample to get the same sample rate as the flight

muted crypt May 4, 2023, 3:59 PM

#

past meteor Why? Otherwise you're making big assumptions about the flight. You can't upsampl...

theoretically it has to. The goal is basically to learn the patterns from all the data that I have (I have much more intended + real dataframes) and then apply it to a new intended to predict the real

muted crypt May 4, 2023, 3:59 PM

#

cold osprey need a way to align them

I've done that

#

Like this:
The evolution of the error along the flight

cold osprey May 4, 2023, 3:59 PM

#

ah

past meteor May 4, 2023, 3:59 PM

#

Look, if you're sure that the drone flies in a straight line you should upsample between the waypoints to have an observation every 0.1 s
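A sketch of that upsampling with `np.interp`, under the straight-line assumption. The waypoint times and latitudes are invented, and `dt = 0.1` is an assumption taken from the 0.1 s logging interval mentioned for the 'secs' column; the same call would be repeated for longitude and altitude.

```python
import numpy as np

# Hypothetical waypoints: time (s) and latitude at each one
wp_time = np.array([0.0, 1.0, 3.0])
wp_lat = np.array([41.0, 41.2, 41.2])

dt = 0.1  # assumed sampling interval of the recorded flight
n_samples = int(round((wp_time[-1] - wp_time[0]) / dt)) + 1
t = np.linspace(wp_time[0], wp_time[-1], n_samples)

# Linear interpolation between consecutive waypoints
lat = np.interp(t, wp_time, wp_lat)
```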

cold osprey May 4, 2023, 3:59 PM

#

^

muted crypt May 4, 2023, 4:00 PM

#

I've interpolated the intended trajectory so that the number of rows matches the number of rows in the real one, is that what you mean?

past meteor May 4, 2023, 4:00 PM

#

yes

muted crypt May 4, 2023, 4:01 PM

#

I've done this yeah

cold osprey May 4, 2023, 4:01 PM

#

with the straight-line assumption for the intended data, you can try past meteor's simple model first then

past meteor May 4, 2023, 4:01 PM

#

Then you don't need to align them or what am I missing?

cold osprey May 4, 2023, 4:01 PM

#

he's already aligned it right

muted crypt May 4, 2023, 4:02 PM

#

Depends on what you mean by align

past meteor May 4, 2023, 4:02 PM

#

I'd truncate your flight dataset to be between waypoint 1 and the last way point as well

muted crypt May 4, 2023, 4:03 PM

#

It's impossible to align them perfectly; you can find the best time shift, as I like to call it (the amount of time lag between real and intended)

muted crypt May 4, 2023, 4:03 PM

#

past meteor I'd truncate your flight dataset to be between waypoint 1 and the last way point...

yes that can be done too

#

I have this now for instance

#

For 16 flights, already interpolated

past meteor May 4, 2023, 4:04 PM

#

After truncating it to be between way point 1 and N I'm not sure you need to align them? Especially if you've already interpolated

muted crypt May 4, 2023, 4:04 PM

#

past meteor After truncating it to be between way point 1 and N I'm not sure you need to ali...

yes, some corrections should be done as well

muted crypt May 4, 2023, 4:05 PM

#

muted crypt I have this now for instance

let's say that this is already aligned though, which almost is, is it even the right format for the model?

cold osprey May 4, 2023, 4:06 PM

#

not for the time since start and time to end model

muted crypt May 4, 2023, 4:08 PM

#

cold osprey not for the time since start and time to end model

and apart from these 2 extra columns? this shouldn't be too hard

past meteor May 4, 2023, 4:08 PM

#

You'd need to create variables such as time since waypoint and time to waypoint
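One possible way to build those two variables, sketched with NumPy. The waypoint times and sample times are hypothetical, and the sketch assumes the samples are already truncated to lie between the first and last waypoint.

```python
import numpy as np

wp_time = np.array([0.0, 2.0, 11.0])                  # hypothetical waypoint times (s)
t = np.array([0.0, 1.0, 2.0, 4.0, 9.0, 11.0])         # sample times within the flight

# Index of the last waypoint at or before each sample,
# clipped so the final sample still belongs to the last segment
prev_idx = np.searchsorted(wp_time, t, side="right") - 1
prev_idx = np.clip(prev_idx, 0, len(wp_time) - 2)
next_idx = prev_idx + 1

time_since_wp = t - wp_time[prev_idx]
time_to_wp = wp_time[next_idx] - t
```

With unequal segment lengths the two columns are no longer mirror images of each other: the sample at `t = 4` is 2 s past one waypoint but 7 s before the next.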

cold osprey May 4, 2023, 4:08 PM

#

^

past meteor May 4, 2023, 4:08 PM

#

If the drone passes each waypoint then the idea is that it deviates from the path between waypoints

muted crypt May 4, 2023, 4:09 PM

#

yet this is just 2 arrays which are flipped, aren't they?
time since start: [1, 2, 3, 4, 5]
time to end: [5, 4, 3, 2, 1]

past meteor May 4, 2023, 4:10 PM

#

If the time between each waypoint is equal then yes

cold osprey May 4, 2023, 4:10 PM

#

for 5 data points

muted crypt May 4, 2023, 4:10 PM

#

past meteor If the drone passes each waypoint then the idea is that it deviates from the pat...

it will never cross it exactly as the resolution is very different. See how many decimals it shows in the csv

cold osprey May 4, 2023, 4:10 PM

#

wait, is the model predicting for each waypoint or on all points(from interpolation)

past meteor May 4, 2023, 4:10 PM

#

but I can imagine you could be 2 time points from a given waypoint but 7 time points to the next

muted crypt May 4, 2023, 4:11 PM

#

so the time to end refers to the time to the next waypoint?

past meteor May 4, 2023, 4:11 PM

#

yes

muted crypt May 4, 2023, 4:11 PM

#

How does that differ from the actual end of the flight?

past meteor May 4, 2023, 4:12 PM

#

My assumption is that the drone comes pretty close, if not exactly, to each waypoint and it deviates in the middle

muted crypt May 4, 2023, 4:12 PM

#

muted crypt How does that differ from the actual end of the flight?

if we have this we make sure to know that we are in the middle of the flight

muted crypt May 4, 2023, 4:12 PM

#

past meteor My assumption is that the drone comes pretty close, if not exactly, to each way...

just in turns though

past meteor May 4, 2023, 4:12 PM

#

Like, you can calculate this easily before you build a model to see if it's true

muted crypt May 4, 2023, 4:13 PM

#

it won't deviate a thing on these horizontal long segments

past meteor May 4, 2023, 4:13 PM

#

All of this is "feature engineering" and is the cornerstone of ML. You have to be a bit creative haha, if you're creative enough you can really simplify the problem from LSTMs to linear regression

past meteor May 4, 2023, 4:13 PM

#

muted crypt it won't deviate a thing on these horizontal long segments

You can add the segment type as a feature as well then?

muted crypt May 4, 2023, 4:13 PM

#

past meteor Like, you can calculate this easily before you build a model to see if it's true

this is it:

#

where the peaks correspond to the turns

#

yet for the rest it is not really true

past meteor May 4, 2023, 4:14 PM

#

What is this? The difference?

muted crypt May 4, 2023, 4:14 PM

#

past meteor What is this? The difference?

yes

past meteor May 4, 2023, 4:14 PM

#

Just add the segment type as a variable for your model

muted crypt May 4, 2023, 4:14 PM

#

Difference from real to intended

past meteor May 4, 2023, 4:14 PM

#

And add time between waypoints

muted crypt May 4, 2023, 4:15 PM

#

past meteor Just add the segment type as a variable for your model

this is much harder than it seems somehow

#

do you mean to categorize each segment?

past meteor May 4, 2023, 4:15 PM

#

https://paste.pythondiscord.com/budagaqufi.py don't you have this as a variable?

#

CurvaturePassed etc.

muted crypt May 4, 2023, 4:16 PM

#

I mean a turn is not a segment but it falls into a certain length of 2 segments

past meteor May 4, 2023, 4:16 PM

#

Can't you know if you're turning between 2 waypoints by looking at the X, Y and Z coordinates?

muted crypt May 4, 2023, 4:16 PM

#

past meteor https://paste.pythondiscord.com/budagaqufi.py don't you have this as a variable?

this is not reliable sadly

past meteor May 4, 2023, 4:17 PM

#

In a straight line you only have X that varies, no?

muted crypt May 4, 2023, 4:18 PM

#

past meteor https://paste.pythondiscord.com/budagaqufi.py don't you have this as a variable?

but again you need to add more rows or something in here

#

Because you cant just have 3 points like a triangle and tell where are the turns

past meteor May 4, 2023, 4:19 PM

#

Can there even be a turn between 2 way points?

#

Especially since you said you linearly interpolate

muted crypt May 4, 2023, 4:19 PM

#

past meteor Can there even be a turn between 2 way points?

no, that's why it's hard to categorize that

past meteor May 4, 2023, 4:20 PM

#

Tbh, I can't chat all evening but you just need to "distill" your knowledge of the problem into variables and simplify the problem as much as possible

muted crypt May 4, 2023, 4:20 PM

#

past meteor Especially since you said you linearly interpolate

when I interpolate i'm just adding points to the segments, on top

past meteor May 4, 2023, 4:20 PM

#

And afterwards you need to build a baseline model

#

Then you start relaxing your assumptions 1 by 1 and creating more powerful models. At the end of this process you get to RNN, LSTMs, maybe even bi-directional RNNs

muted crypt May 4, 2023, 4:21 PM

#

yeah the thing is that there's not an example of something similar

past meteor May 4, 2023, 4:22 PM

#

using time from waypoint, time to waypoint to predict the diffs is already making many assumptions that you can then start relaxing later on (or you add variables to make more reliable assumptions)

muted crypt May 4, 2023, 4:22 PM

#

and really, predicting something like temperature is pretty easy but this is very different and I don't really know why

past meteor May 4, 2023, 4:23 PM

#

No, the thing with predicting temperature is that they've already done all of what I've said and that it's just documented and makes sense because all of the tricks/thinking are written down already 😛

muted crypt May 4, 2023, 4:23 PM

#

past meteor using time from waypoint, time to waypoint to predict the diffs is already maki...

and what model would you use here?

muted crypt May 4, 2023, 4:24 PM

#

past meteor No, the thing with predicting temperature is that they've already done all what ...

i mean, one thing is predicting the continuation of a sequence and the other predicting the whole sequence based on 12 points

past meteor May 4, 2023, 4:24 PM

#

I always use a mix of Ridge, Lasso, Random Forest, xgboost, SVMs (depending on my dataset's size) and neural networks

muted crypt May 4, 2023, 4:25 PM

#

and can you predict 3 columns at a time for instance?

past meteor May 4, 2023, 4:25 PM

#

muted crypt and can you predict 3 columns at a time for instance?

yes...

#

https://scikit-learn.org/stable/modules/generated/sklearn.multioutput.MultiOutputRegressor.html

#

This fits 3 models, neural networks on the other hand fit all of it at once with 3 output neurons

muted crypt May 4, 2023, 4:27 PM

#

wait but the y is diffX, diffY, diffZ. so now I have to compute the error in 3 dimensions. I just had the absolute distance :(

past meteor May 4, 2023, 4:27 PM

#

Yes...

untold cliff May 4, 2023, 4:27 PM

#

If I have a categorical feature with lots of categories and some portion of these categories don't have a lot of instances in the dataset, like maybe less than 10 for each of them, does it make sense to group them all in just one new category, since I believe they wouldn't provide much information because they have very few instances?

muted crypt May 4, 2023, 4:27 PM

#

damn this is mad but I guess I'll try it

past meteor May 4, 2023, 4:27 PM

#

You need to ensure diffX, diffY and diffZ are on the same scale and then take the mean of the error
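A sketch of one way to do that, assuming NumPy: divide each axis by the standard deviation of its true values before averaging the squared error. The function name `scaled_mse` and the numbers are made up, and standardizing by the true values' spread is only one of several reasonable scalings.

```python
import numpy as np


def scaled_mse(y_true, y_pred):
    """Mean squared error after putting every output column on the same scale."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    scale = y_true.std(axis=0)
    scale[scale == 0] = 1.0          # avoid dividing by zero for constant columns
    err = (y_true - y_pred) / scale
    return float(np.mean(err ** 2))


# Hypothetical diffs whose columns differ in scale by factors of 10
y_true = np.array([[0.0, 0.0, 0.0],
                   [1.0, 10.0, 100.0],
                   [2.0, 20.0, 200.0]])
y_pred = y_true + np.array([0.1, 1.0, 10.0])

loss = scaled_mse(y_true, y_pred)
```

Without the scaling, the third column would dominate the mean even though each column is off by the same relative amount.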

muted crypt May 4, 2023, 4:28 PM

#

oh yes do you recommend scaling?

past meteor May 4, 2023, 4:28 PM

#

It's not that mad tbh 🤷‍♂️

past meteor May 4, 2023, 4:28 PM

#

muted crypt oh yes do you recommend scaling?

If you're unsure I'd always recommend scaling

muted crypt May 4, 2023, 4:30 PM

#

past meteor If you're unsure I'd always recommend scaling

fair

muted crypt May 4, 2023, 4:30 PM

#

muted crypt I have this now for instance

so it is essentially adding the diff and time to start/end here?

#

then doing the model

muted crypt May 4, 2023, 4:34 PM

#

past meteor It's not that mad tbh 🤷‍♂️

also one last question: wouldn't the time to waypoint be problematic? The segments are relatively short, so many rows will have the same times, yet the diff can be quite different.

tall tulip May 4, 2023, 4:44 PM

#

We use the ADF test and the KPSS test to check stationarity. Is there any method available to check the seasonality of the data?

past meteor May 4, 2023, 4:52 PM

#

muted crypt also one last question, wouldn't the time to waypoint be controversial because a...

Is there a big difference in 0,1 seconds?

plucky raft May 4, 2023, 4:55 PM

#

hey guys, im trying to save a file to a variable like so
dataset = 'filename'

#

but its not working

#

do i have to include the path to the file?

#

or the extensions?

faint mist May 4, 2023, 6:20 PM

#

I just figured out interesting observation when dealing with time series problems

#

I usually train my model after scaling down the data in values between 0 and 1

#

The model will always learn to predict a value between 0-1

#

However, in case of regression this may be limiting the model capability to generalise

#

For example if the maximum value in the training dataset is 1000

#

Scaled down to 0-1 the 1000 will become 1

#

On the other hand, if the maximum value in the testing split is 2000

#

Scaled down based on the scaler of the training data set

#

The value will be 2

#

The model will never predict 2

past meteor May 4, 2023, 6:25 PM

#

That's precisely why fitting your normalization stuff on your entire dataset is cheating

faint mist May 4, 2023, 6:26 PM

#

Yes, this is what I did at first

#

Then normalised only the training split and used the same scaler for testing
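The numbers from the example above, as a minimal sketch with hand-rolled min-max scaling in NumPy (the arrays are made up; the scaling parameters come from the training split only):

```python
import numpy as np

train = np.array([0.0, 250.0, 1000.0])
test = np.array([500.0, 2000.0])

# Fit the min-max parameters on the training split only
lo, hi = train.min(), train.max()


def scale(x):
    return (x - lo) / (hi - lo)


train_scaled = scale(train)   # stays within [0, 1]
test_scaled = scale(test)     # 2000 maps to 2.0, outside the training range
```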

past meteor May 4, 2023, 6:26 PM

#

I did an experiment a while ago with synthetic time series and noticed that if you have preprocessing such as normalization and do not update it over time (esp. if you have a trend), the drift in the normalization alone is enough to kill your model

faint mist May 4, 2023, 6:27 PM

#

Exactly!

past meteor May 4, 2023, 6:27 PM

#

I refit the normalization online at each time step, once y_actual became available

faint mist May 4, 2023, 6:37 PM

#

Makes sense, but again I still think this limits the model capability

wheat snow May 4, 2023, 6:43 PM

#

i want to analyze my youtube data. And i received a huge html package for that... I dont know much about html's so is it possible to pull that data out and restore it in a csv format?

faint mist May 4, 2023, 6:45 PM

#

wheat snow i want to analyze my youtube data. And i received a huge html package for that.....

Check out github copilot, will speed up this kind of labor work

wheat snow May 4, 2023, 6:45 PM

#

faint mist Check out github copilot, will speed up this kind of labor work

got a link?

faint mist May 4, 2023, 6:46 PM

#

its a service provided by github

#

chatgpt optimized for coding

wheat snow May 4, 2023, 6:46 PM

#

uhhh

#

i dunno what u mean sorry

faint mist May 4, 2023, 6:47 PM

#

Sorry if I confused you

wheat snow May 4, 2023, 6:47 PM

#

ah

#

i see

faint mist May 4, 2023, 6:47 PM

#

https://github.com/features/copilot

GitHub

GitHub Copilot · Your AI pair programmer

GitHub Copilot works alongside you directly in your editor, suggesting whole lines or entire functions for you.

wheat snow May 4, 2023, 6:47 PM

#

basically AI pylance

#

but it knows what project ur working on

#

so it knows what command you might need next?

faint mist May 4, 2023, 6:48 PM

#

yes it will suggest multiple

#

you can ask chatgpt too

#

these tools are really amazing if you want a head start

wheat snow May 4, 2023, 6:48 PM

#

true bruh last time i used chatgpt was in january

#

it sucked back then lmao

faint mist May 4, 2023, 6:49 PM

#

then you take it from there and modify as needed

wheat snow May 4, 2023, 6:49 PM

#

faint mist these tools are really amazing if you want a head start

i think i'll just use it to prepare my data for analysis, i want to analyze it on my own tho

faint mist May 4, 2023, 6:49 PM

#

Yes

#

Will speed up the process

#

outsource the labor work

wheat snow May 4, 2023, 6:53 PM

#

from bs4 import BeautifulSoup
import csv

# Open the HTML file and read its contents
with open('Wiedergabeverlauf.html', encoding='utf8') as file:
    contents = file.read()

# Parse the HTML data using BeautifulSoup
soup = BeautifulSoup(contents, 'html.parser')

# Find the table containing the watch history data
table = soup.find('table', {'class': 'table-section'})

# Create a list to hold the extracted data
data = []

# Loop through each row in the table and extract the data
for row in table.find_all('tr'):
    # Extract the title and watch time for each video
    title = row.find('a', {'class': 'content-link'}).text.strip()
    time = row.find('span', {'class': 'accessible-description'}).text.strip()
    
    # Add the data to the list
    data.append([title, time])

# Save the data to a CSV file
with open('watch_history.csv', 'w', newline='', encoding='utf8') as file:
    writer = csv.writer(file)
    writer.writerow(['Title', 'Watch Time'])
    writer.writerows(data)

"This code uses BeautifulSoup to parse the HTML data and find the table containing the watch history data. It then loops through each row in the table and extracts the title and watch time for each video, and saves the data to a list. Finally, it saves the data to a CSV file called 'watch_history.csv'.

Note that the above code assumes that the watch history data is contained within a table with the class 'table-section'. If your HTML file has a different structure, you may need to modify the code accordingly."

#

im not sure bout that

#

idk if the watch history data is stored in tables...

#

it looks like that

#

#

here a better pic

#

@faint mist normal that the script runs so long? i mean its a 50MB html

faint mist May 4, 2023, 7:02 PM

#

Hmm, Ideally no

#

I will leave it for someone else to pitch in and help you with the matter

#

I am no expert in parsing html files and not sure how to help

#

I apologize

past meteor May 4, 2023, 7:16 PM

#

faint mist Makes sense, but again I still think this limits the model capability

You can do an inverse transformation after predicting btw

wheat snow May 4, 2023, 7:18 PM

#

faint mist I apologize

no worries mate

faint mist May 4, 2023, 7:19 PM

#

past meteor You can do an inverse transformation after predicting btw

Yes, in theory, but in the real world it will never be able to predict a value higher than 1. In other words, the model will never have the capability of predicting the next "All time high"

#

if you get what I mean

#

It will be close

#

ofc

#

but it could be closer
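
A minimal sketch of the inverse transformation mentioned above, assuming plain min-max scaling fitted on the training targets (the numbers and the helper names `scale` / `inverse_scale` are invented for illustration). Whether a scaled prediction can exceed 1 depends on the output layer: a sigmoid output is bounded by 1, a linear output is not.

```python
import numpy as np

# Hypothetical training targets (e.g. prices); the values are invented.
y_train = np.array([10.0, 12.0, 15.0, 20.0])

# Fit the min-max scaling on the training targets only.
y_min, y_max = y_train.min(), y_train.max()

def scale(y):
    """Map the training range [y_min, y_max] onto [0, 1]."""
    return (y - y_min) / (y_max - y_min)

def inverse_scale(y_scaled):
    """Undo scale(); values above 1 map above the training maximum."""
    return y_scaled * (y_max - y_min) + y_min

# A model with a linear output can emit scaled values above 1 ...
pred_scaled = np.array([0.5, 1.0, 1.2])
# ... and they map back to values above the training maximum.
pred = inverse_scale(pred_scaled)
```

Here the last prediction maps back to 22, above the training maximum of 20, so the scaling itself is not what caps the output.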

hard thicket May 4, 2023, 7:44 PM

#

Hi, might not be the right channel so apologies if that’s so (let me know and I’ll delete / move it)
Looking for input on how people like to develop data pipelines for AWS, from development to production. I.e. how do you start locally, when do you move to AWS, what account separation from production do you go through; anything and everything would be interesting.
We have some new projects where I’m trying AWS Glue / EMR (for PySpark) and I'm not sure what resources to make for the team around an IDP and/or a testable, workable starting point

wheat snow May 4, 2023, 7:47 PM

#

smth is wrong... that script has been running for like the last hour

#

and nothing happened

#

no errors... its just processing
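
One thing that might be worth trying on a file this size, sketched under assumptions: parse it as a stream with the standard-library `html.parser` instead of building a full tree. The class name `content-link` is taken from the script above and may not match the real file, `LinkTextCollector` and `extract_titles` are hypothetical names, and whether tree building is actually the bottleneck here is a guess.

```python
from html.parser import HTMLParser


class LinkTextCollector(HTMLParser):
    """Collect the text of every <a class="content-link"> without building a tree."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._buffer = None  # list of text pieces while inside a matching <a>

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "a" and "content-link" in classes:
            self._buffer = []

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._buffer is not None:
            self.titles.append("".join(self._buffer).strip())
            self._buffer = None


def extract_titles(path, chunk_size=1 << 20):
    """Feed the file to the parser in 1 MiB chunks and return the collected titles."""
    parser = LinkTextCollector()
    with open(path, encoding="utf8") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), ""):
            parser.feed(chunk)
    parser.close()
    return parser.titles
```

Calling `extract_titles('Wiedergabeverlauf.html')` would return a list of link texts; the watch times would need a second handler along the same lines.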

waxen tusk May 4, 2023, 11:59 PM

#

Thoughts on Data Factory?

dusty bay May 5, 2023, 12:46 AM

#

I want to make a plot from a csv file using matplotlib. I have made the code but there is an error 'csv2df' object has no attribute 'plt'. Can anyone help me. Here is the code.

import pandas as pd
import matplotlib.pyplot as plt


class csv2df():
    
    def __init__(self):
        self.df = pd.read_csv("RMS level.csv")
        self.sheet = self.df[3:]
        
    def plot(self):
        self.x = self.sheet["RMS Level"]
        self.plt.plot(self.x)
        
        
show = csv2df()
show.plt.show()

astral path May 5, 2023, 12:47 AM

#

i'm working on a project where i'm trying to predict a player's success after four seasons in a basketball video game based on their high school ratings. basically, there's 20 features for a player in high school, and i'm trying to predict a specific statistic (PER) in the game during their senior year. the catch is that players who have a particularly high rating for their high school features won't play until their senior season, so there should be a soft limit for how good a player is, and if a player is too good, their predicted PER should also be lower.

my current model fails to take this into account and will predict the best high school players to have high PER as a senior, even though most of them won't return for multiple seasons. How do I fix this?

velvet abyss May 5, 2023, 1:32 AM

#

i applied for a data engineering job

#

I mean, an internship to be more exact

#

How should I prepare myself if I somehow reach the interview phase?

restive path May 5, 2023, 2:11 AM

#

Hello

#

For those who started in data science without experience in a job, what is the most common thing they are asked to do?

stark zenith May 5, 2023, 4:16 AM

#

dusty bay I want to make a plot from a csv file using matplotlib. I have made the code but...

I think you'd just need to do show.plot() ?

patent pivot May 5, 2023, 4:19 AM

#

stark zenith I think you'd just need to do show.plot() ?

they may also need to change self.plt.plot(self.x) to self.plt = plt.plot(self.x) otherwise they are still referencing self.plt which does not exist
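
Putting the two replies together, a corrected sketch could look like the following. It assumes the CSV has an `RMS Level` column as in the question; a small made-up DataFrame stands in for `RMS level.csv` so the example runs without the file, which is why the constructor takes a `df` argument here. `plt` is the imported module, so it is called directly rather than through `self` or `show`.

```python
import pandas as pd
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend so the sketch runs anywhere; drop for interactive use
import matplotlib.pyplot as plt


class csv2df():

    def __init__(self, df):
        # The original reads pd.read_csv("RMS level.csv"); a DataFrame is passed in
        # here only so that the sketch runs without that file.
        self.df = df
        self.sheet = self.df[3:]  # skip the first three rows, as in the original

    def plot(self):
        # plt is the module-level matplotlib.pyplot import, not an attribute of self
        x = self.sheet["RMS Level"]
        return plt.plot(x)


# Made-up data standing in for the CSV file
demo = pd.DataFrame({"RMS Level": [0.1, 0.4, 0.3, 0.8, 0.6, 0.9]})
show = csv2df(demo)
lines = show.plot()  # draw first ...
plt.show()           # ... then call show() on the pyplot module
```

Note that `plt.plot` returns a list of line objects, so storing its result as `self.plt` would not by itself make `show.plt.show()` work.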

cinder urchin May 5, 2023, 5:43 AM

#

Hello. Does anyone know of or have a chatbot? If not, can you tell me the name of a model that I can use with the "transformers" library that doesn't need a lot of memory to work? I tried a few models and 2 of them managed to crash my computer.

past meteor May 5, 2023, 5:56 AM

#

waxen tusk Thoughts on Data Factory?

It's the bread and butter of data in the azure stack, it's pretty intuitive imo

tall tulip May 5, 2023, 10:19 AM

#

I have a dataset with 5-minute timestamps, which I resampled to hourly data. The data has daily and 12-day seasonality and is not stationary. I made the data stationary and then used a SARIMAX model, which gives a negative AIC value, but when I try to predict, it gives me a straight line. I also tried auto-ARIMA, but it still didn't work for me. How can I improve its accuracy?

here is the model summary:

boreal gale May 5, 2023, 1:28 PM

#

tall tulip I've dataset with 5 min time stamp which I changed to hourly data, and this data...

you seem to be just using an AR(1) model here..? which is not the full capability of SARIMAX

i assume you are using statsmodels, have a look here, https://www.statsmodels.org/dev/generated/statsmodels.tsa.statespace.sarimax.SARIMAX.html#statsmodels.tsa.statespace.sarimax.SARIMAX-parameters
particularly the order parameter.

restive path May 5, 2023, 4:41 PM

#

Hey guys, any data scientists on here?

A question about learning mathematics: from what I have researched, a data scientist must learn a lot, but in reality the most important areas are algebra, calculus and statistics. If you had to name the most important topics in algebra and calculus, what would they be?

wooden sail May 5, 2023, 4:59 PM

#

what do you mean when you say algebra here? mathematicians say algebra to mean abstract algebra, which is way different from your high school algebra. what one uses very often in data science is linear algebra, which is one of the elementary parts of abstract algebra

mint palm May 5, 2023, 4:59 PM

#

how does CLS token work with transformer?

wooden sail May 5, 2023, 4:59 PM

#

regarding calculus, really all of it. you'll be looking at gradients, jacobians and hessians (so multivar calc) and integration used for optimization

sleek harbor May 5, 2023, 5:08 PM

#

could someone explain this behavior of optuna? When I set the sampler as optuna.samplers.TPESampler(n_startup_trials=300) with 300-400 random initial samplings everything is fine.. at first. You can indeed see it taking 300-400 random hyperparameter combinations, after which the graph becomes more stable as the "smart algorithms" kick in.. but that only lasts for around 200 samplings.. after which it seems that optuna reverts to random sampling again..! How can this be explained? Is it supposed to be like this? I can't make sense of it..

agile cobalt May 5, 2023, 5:18 PM

#

sleek harbor could someone explain this behavior of optuna? When I set the sampler as optuna....

if I had to guess, it just assumes that it reached a local minimum/maximum then starts going further away from it to try to find a different (hopefully the global) local minimum/maximum?
that is, going further into X direction wouldn't make it any better, so it tries to find another Z direction that might make it better

#

the alternative would be pretty much overfitting then staying there

past meteor May 5, 2023, 5:23 PM

#

I should look at Optuna sometime 🤔 I always just use scikit-learn's hyperparameter tuner or Keras tuner (even with Torch etc.) if I need more flexibility.

agile cobalt May 5, 2023, 5:24 PM

#

remember to be careful when tuning hyperparameters, otherwise you might end up overfitting your model's hyperparameters to your ~~test~~ validation data

past meteor May 5, 2023, 5:25 PM

#

wdym? You should never test on your test set before you've fixed 1 set of hyper parameters

restive path May 5, 2023, 5:26 PM

#

wooden sail what do you mean when you say algebra here? mathematicians say algebra to mean a...

If I asked you what are the most important contents of linear algebra, which would you tell me?

agile cobalt May 5, 2023, 5:26 PM

#

do you call it like
train / validation / test
or
train / test / validation
or only
train / test

#

in my mind, validation is after freezing everything, test is how you would measure if it gets better or worse, not sure what is the standard

past meteor May 5, 2023, 5:27 PM

#

train /validation (find best model + hyperparameters) => test once

wooden sail May 5, 2023, 5:27 PM

#

restive path If I asked you what are the most important contents of linear algebra, which wou...

almost all of it, since it's the bread and butter

#

(sub)spaces, linear transformations, projections, diagonalization/EVD, SVD, low rank approximation. in fact, the other stuff (calculus and statistics) will always be applied on TOP of linear algebra
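
As a small illustration of the SVD and low-rank approximation items, here is a sketch with NumPy on made-up data (a rank-2 matrix plus a little noise). Truncating the SVD to the top `k` singular values gives the best rank-`k` approximation in the Frobenius norm.

```python
import numpy as np

rng = np.random.default_rng(0)
# Made-up data: a rank-2 matrix plus a little noise.
A = rng.standard_normal((50, 2)) @ rng.standard_normal((2, 20))
A_noisy = A + 0.01 * rng.standard_normal(A.shape)

# Thin SVD: A_noisy = U @ diag(s) @ Vt, singular values sorted in descending order.
U, s, Vt = np.linalg.svd(A_noisy, full_matrices=False)

# Keep only the k largest singular values and their vectors.
k = 2
A_k = (U[:, :k] * s[:k]) @ Vt[:k]

# Relative error of the rank-k approximation.
rel_err = np.linalg.norm(A_noisy - A_k) / np.linalg.norm(A_noisy)
```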

past meteor May 5, 2023, 5:28 PM

#

agile cobalt do you call it like train / validation / test or train / test / validation or on...

I think in most literature / texts validation is what you select hyperparameters on, hence why k-fold crossvalidation etc.
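
A minimal sketch of the train / validation / test workflow described here, on made-up data, with the polynomial degree standing in for the hyperparameter. The 60/20/20 split and the range of degrees are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 300)
y = np.sin(3 * x) + 0.1 * rng.standard_normal(x.size)  # made-up data

# 60 % train, 20 % validation, 20 % test
idx = rng.permutation(x.size)
train, val, test = idx[:180], idx[180:240], idx[240:]

def mse(coeffs, rows):
    """Mean squared error of a fitted polynomial on the given rows."""
    return float(np.mean((np.polyval(coeffs, x[rows]) - y[rows]) ** 2))

# Select the hyperparameter (polynomial degree) on the validation set only.
val_scores = {}
for degree in range(1, 10):
    coeffs = np.polyfit(x[train], y[train], degree)
    val_scores[degree] = mse(coeffs, val)

best_degree = min(val_scores, key=val_scores.get)

# Only now, with the choice frozen, touch the test set once.
final_coeffs = np.polyfit(x[train], y[train], best_degree)
test_mse = mse(final_coeffs, test)
```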

sleek harbor May 5, 2023, 5:28 PM

#

agile cobalt remember to be careful when tuning hyperparameters, otherwise you might end up o...

this is really infuriating.. there's not much you could do here, but the way optuna just takes that one lucky hyperparam combo and claims it is the "best".. and it does that all the time, even when you really could choose an optimal combination.. it just throws out this random combo that happened to get lucky, even while you can see the actual algorithm at work moving in another direction.. why don't they fix this? it's obvious this is random luck, not actually a good combo..

past meteor May 5, 2023, 5:29 PM

#

sleek harbor this is really infuriating.. there's not much you could do here, but the way opt...

You should read about the optimization algo that you're using

agile cobalt May 5, 2023, 5:30 PM

#

sleek harbor this is really infuriating.. there's not much you could do here, but the way opt...

whatever is happening, the issue is probably you using the tool incorrectly, not the tool itself being objectively bad
like zestar said, make sure to read the documentation and perhaps even relevant papers if you haven't yet

sleek harbor May 5, 2023, 5:31 PM

#

past meteor You should read about the optimization algo that you're using

I did, but that didn't give me much.. the algorithm itself seems to work, somewhat at least. But then it says that some other combo is the "best" just because it scores once

wooden sail May 5, 2023, 5:31 PM

#

sleek harbor this is really infuriating.. there's not much you could do here, but the way opt...

are you maybe under the impression that non convex optimization is easy? this approach is very similar to simulated annealing, which is a good heuristic. but heuristics and local optimality are about as good as it gets

past meteor May 5, 2023, 5:31 PM

#

I personally only vaguely know about TPE hence why I would not touch it over the ones I know and trust like Bayes opt (sequential problems) or random search

agile cobalt May 5, 2023, 5:31 PM

#

it is also possible that the method you are using is just not appropriate for your model

past meteor May 5, 2023, 5:32 PM

#

If you can run your trials in parallel I think random search and iteratively making your grids smaller is a good option

#

Assuming you have many combinations otherwise you could just run grid search ofc

sleek harbor May 5, 2023, 5:32 PM

#

past meteor I personally only vaguely know about TPE hence why I would not touch it over the...

random search, same as even grid search, results in the same problem tho. The best isn't actually the best, and u have to look at the graphs to see that

past meteor May 5, 2023, 5:33 PM

#

sleek harbor random search, same as even grid search, results in the same problem tho. The be...

Why do you care about the exact best?

wooden sail May 5, 2023, 5:33 PM

#

sleek harbor random search, same as even grid search, results in the same problem tho. The be...

there is no good way of finding a global optimum for nonconvex problems. if you find one, you'd win a nice prize

sleek harbor May 5, 2023, 5:33 PM

#

past meteor Why do you care about the exact best?

that's the thing.. I don't. I care about the average best.. but it gives me the single one best

past meteor May 5, 2023, 5:34 PM

#

Why do you care about the average best? What is your exact problem?

sleek harbor May 5, 2023, 5:34 PM

#

wooden sail there is no good way of finding a global optimum for nonconvex problems. if you ...

but you could at least get something close to good..

wooden sail May 5, 2023, 5:34 PM

#

sleek harbor but you could at least get something close to good..

no way of knowing what "good" is without knowing what the best is

#

all you can do is compare to the results you get

past meteor May 5, 2023, 5:34 PM

#

I did a bunch of graduate courses on global optimization for non-convex problems. This is one of the god particles.

wooden sail May 5, 2023, 5:35 PM

#

there's probably a parameter you pass to optuna to choose the cost function with which it picks the hyper params

past meteor May 5, 2023, 5:35 PM

#

There's ideas you can do if you want good results on average but I'm curious to know what your exact problem is? Is it really just hyperparameter tuning?

sleek harbor May 5, 2023, 5:35 PM

#

past meteor Why do you care about the average best? What is your exact problem?

lets just say i have a curve, and I'd like to get a value close to the bottom of that curve. But the curve isn't a perfect line.. I'd still like the averaged bottom, not one dot somewhere to the side that randomly happens to be lower than the average bottom

past meteor May 5, 2023, 5:36 PM

#

sleek harbor lets just say i have a curve, and I'd like to get a value close to the bottom of...

Why do you want this though? Is this really just hyperparameter tuning or not?

sleek harbor May 5, 2023, 5:37 PM

#

past meteor Why do you want this though? Is this really just hyperparameter tuning or not?

yeah, it's just hyperparameter tuning.. and curiosity

cold osprey May 5, 2023, 5:37 PM

#

wooden sail there's probably a parameter you pass to optuna to choose the cost function with...

Q on cost function and loss function. When we pass a loss function to say a pytorch neural network, thats a loss function right?

#

coz its evaluated on the batch size

#

if its evaluated on the entire dataset, then its cost function?

#

https://stats.stackexchange.com/questions/179026/objective-function-cost-function-loss-function-are-they-the-same-thing

wooden sail May 5, 2023, 5:38 PM

#

sleek harbor lets just say i have a curve, and I'd like to get a value close to the bottom of...

this isn't really what's happening though. you get a different curve for each set of hyperparams, they parametrize a family of curves. then you pick among the curves with some criterion

wooden sail May 5, 2023, 5:38 PM

#

cold osprey if its evaluated on the entire dataset, then its cost function?

"cost function" just refers to what you're minimizing

royal void May 5, 2023, 5:38 PM

#

Hi, I need to find a way to get the size of the center cluster on these maps, do you know a way to compute that? Like in the first one I would like a size of 5 and in the second one of 1 or 2

Capture_decran_2023-05-05_a_19.34.17.png

Capture_decran_2023-05-05_a_19.33.54.png

cold osprey May 5, 2023, 5:38 PM

#

is this just a semantic thing?

royal void May 5, 2023, 5:39 PM

#

whoops i just mixed the first and the second *

past meteor May 5, 2023, 5:39 PM

#

sleek harbor lets just say i have a curve, and I'd like to get a value close to the bottom of...

there's no guarantees that this line is smooth and continuous

wooden sail May 5, 2023, 5:39 PM

#

royal void Hi, I need to find a way to get the size of the center cluster on these maps, do ...

you could compute some statistics on the background noise looking at the corners of the image, then use that to define a threshold
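
A rough sketch of that idea in NumPy, on a made-up 41x41 map (a Gaussian blob on a noisy background). The 5x5 corner size and the factor 3 on the standard deviation are arbitrary assumptions, and the real maps may need different values.

```python
import numpy as np

rng = np.random.default_rng(0)
# Made-up 41x41 map: Gaussian blob (sigma = 3 px) on top of a noisy background.
yy, xx = np.mgrid[:41, :41]
blob = np.exp(-((xx - 20) ** 2 + (yy - 20) ** 2) / (2 * 3.0 ** 2))
image = 0.2 + 0.02 * rng.standard_normal((41, 41)) + blob

# Estimate the background from the four 5x5 corners, where no signal is expected.
c = 5
corners = np.concatenate([
    image[:c, :c].ravel(), image[:c, -c:].ravel(),
    image[-c:, :c].ravel(), image[-c:, -c:].ravel(),
])
threshold = corners.mean() + 3 * corners.std()  # the factor 3 is an arbitrary choice

# Pixels above the noise floor, and the cluster width along the central row.
mask = image > threshold
width = int(mask[20].sum())
```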

sleek harbor May 5, 2023, 5:39 PM

#

wooden sail this isn't really what's happening though. you get a different curve for each se...

what I was talking about would prove a problem even if we had just one hyperparameter tho.. is that really that difficult to fix?

wooden sail May 5, 2023, 5:40 PM

#

sleek harbor what I was talking about would prove a problem even if we had just one hyperpara...

yes, this is a very difficult problem in general with no good solution

#

you can pick an "average best" if you like, but there's no special reason why that would be any better

royal void May 5, 2023, 5:40 PM

#

wooden sail you could compute some statistics on the background noise looking at the corners...

I tried but it's not really efficient...

wooden sail May 5, 2023, 5:40 PM

#

what did you try?

past meteor May 5, 2023, 5:41 PM

#

Tbh the plot you showed doesn't even tell the full story as we can't see what parameters were tried

#

If I were you I would most likely do a small search around the n lowest points and call it a day @sleek harbor

wooden sail May 5, 2023, 5:42 PM

#

cold osprey is this just a semantic thing?

the cost is in general defined w.r.t. all the data. the batches part comes later. but yeah the distinction is just semantic
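
One common way to picture the distinction, sketched with NumPy on made-up predictions: a per-example loss, the cost as its mean over the whole dataset, and mini-batch means as estimates of that cost. The terminology is not standardized, so this is one convention among several.

```python
import numpy as np

rng = np.random.default_rng(0)
y_true = rng.standard_normal(1000)
y_pred = y_true + 0.5 * rng.standard_normal(1000)  # made-up predictions

per_sample_loss = (y_pred - y_true) ** 2   # one loss value per example
cost_full = per_sample_loss.mean()         # cost over the whole dataset

# Mini-batch estimates of the same cost (batch size 100 is arbitrary).
batch_costs = per_sample_loss.reshape(10, 100).mean(axis=1)
```

With equal-sized batches, the mean of the batch costs equals the full-dataset cost; each individual batch is only a noisy estimate of it.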

past meteor May 5, 2023, 5:42 PM

#

But most likely I would just select whatever came up lowest, hyperparameter tuning is imo something that is rarely worth it time vs. reward wise

restive path May 5, 2023, 5:42 PM

#

wooden sail almost all of it, since it's the bread and butter

1. Basic properties of matrices and vectors: scalar multiplication, linear transformation, transpose, conjugate, rank, determinant
2. Inner and outer products, matrix multiplication rule and various algorithms, matrix inverse
3. Special matrices: square matrix, identity matrix, triangular matrix, idea of sparse and dense matrices, unit vectors, symmetric matrix, Hermitian, skew-Hermitian and unitary matrices
4. Matrix factorization/LU decomposition concept, Gauss/Gauss-Jordan elimination, solving the linear equation system Ax=b
5. Vector space, basis, span, orthogonality, orthonormality, linear least squares
6. Eigenvalues, eigenvectors, diagonalization, singular value decomposition

royal void May 5, 2023, 5:42 PM

#

wooden sail what did you try?

I tried to define a threshold by taking the mean value, as I have a lot of points, and defining the radius as the first value below the mean + 0.01 (to be a little higher), but in the first image, for example, the cluster expands a little even when we are below the threshold

#

sorry for my english I'm french

restive path May 5, 2023, 5:43 PM

#

restive path 1. Basic properties of matrices and vectors: scalar multiplication, linear trans...

this?

wooden sail May 5, 2023, 5:43 PM

#

royal void I tried to define a threshold by taking the mean value as I have a lot of points ...

wdym by "expands a little"? it's larger than you'd like it to be? (your english is fine)

wooden sail May 5, 2023, 5:44 PM

#

restive path 1. Basic properties of matrices and vectors: scalar multiplication, linear trans...

these are the bare essentials, yeah

cold osprey May 5, 2023, 5:44 PM

#

restive path 1. Basic properties of matrices and vectors: scalar multiplication, linear trans...

oh my, reminds me of uni KEKW

past meteor May 5, 2023, 5:44 PM

#

I disagreeish on these being the essentials because there's so much abstraction in ML nowadays that you can get away with knowing less

#

If you want to make novel stuff then yes, it is the bare minimum

restive path May 5, 2023, 5:45 PM

#

wooden sail these are the bare essentials, yeah

are you a data scientist? is it what is most used?

cold osprey May 5, 2023, 5:45 PM

#

past meteor I disagreeish on these being the essentials because there's so much abstraction ...

this tbh, i dont rmb most of the maths ive learnt

wooden sail May 5, 2023, 5:45 PM

#

royal void I tried to define a threshold by taking the mean value as I have a lot of points ...

ah, you mean the shape keeps going but falls under the noise floor. ok. yeah so, as soon as you have noise, it's not always possible to recover the shape perfectly. if you have a model for the shape we're looking for, we might be able to do better. for example, we can fit a 2d gaussian to the image

wooden sail May 5, 2023, 5:45 PM

#

restive path are you a data scientist? is it what is most used?

i'm doing a phd in signal processing rn. the things you listed are the things you should be able to do with your hands tied behind your back if someone suddenly wakes you up at 3 am

royal void May 5, 2023, 5:46 PM

#

Here I can get below the threshold in the orange square but I would like to get the red square as size of the cluster

Capture_decran_2023-05-05_a_19.45.15.png

#

Oh yes, making a fit should work

past meteor May 5, 2023, 5:46 PM

#

wooden sail i'm doing a phd in signal processing rn. the things you listed are the things yo...

In industry less so

#

Even in my context (applied research) I don't think anyone remembers what SVD is or how to do PCA from their time in uni

royal void May 5, 2023, 5:47 PM

#

wooden sail ah, you mean the shape keeps going but falls under the noise floor. ok. yeah so,...

but I have no idea on how to do this in C lmao but thank you I'm going to try !!

wooden sail May 5, 2023, 5:48 PM

#

oh oof, in C. well my suggestion would be to set up the math on paper and then code that :p but there surely exists a library that can help you with it

wooden sail May 5, 2023, 5:48 PM

#

past meteor Even in my context (applied research) I don't think anyone remembers what SVD is...

i mean, you should never do an SVD by hand unless your problem is AT MOST 3D

#

but you should understand it inside out

past meteor May 5, 2023, 5:48 PM

#

I'm not even talking about by hand I meant the general procedure 🤣

wooden sail May 5, 2023, 5:49 PM

#

the conceptual understanding is the most important

past meteor May 5, 2023, 5:49 PM

#

People I work with know what it does and why you'd need it but not the internals

royal void May 5, 2023, 5:49 PM

#

wooden sail oh oof, in C. well my suggestion would be to set up the math on paper and then c...

I already did a linear regression i guess that I can make a gaussian fit
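
One way a Gaussian fit can be reduced to linear regression, sketched in Python with NumPy rather than C: after subtracting the background, the logarithm of an isotropic 2D Gaussian is a quadratic in the pixel coordinates, so its parameters come out of an ordinary least-squares fit. The map is made up, the isotropic shape is an assumption, and the 0.2 cut-off for the pixels used in the fit is an arbitrary choice.

```python
import numpy as np

rng = np.random.default_rng(1)
yy, xx = np.mgrid[:41, :41]
# Made-up map: isotropic Gaussian (sigma = 3 px, centre (20, 20)) on a noisy background.
image = (0.2 + 0.02 * rng.standard_normal((41, 41))
         + np.exp(-((xx - 20) ** 2 + (yy - 20) ** 2) / (2 * 3.0 ** 2)))

# Remove the background level; the median works because the blob covers few pixels.
signal = image - np.median(image)
sel = signal > 0.2 * signal.max()  # keep pixels well above the noise

# ln(signal) = a + b*x + c*y + d*(x^2 + y^2), with d = -1 / (2 sigma^2)
X = np.column_stack([np.ones(sel.sum()), xx[sel], yy[sel],
                     xx[sel] ** 2 + yy[sel] ** 2])
a, b, c, d = np.linalg.lstsq(X, np.log(signal[sel]), rcond=None)[0]

sigma = np.sqrt(-1.0 / (2.0 * d))
x0, y0 = -b / (2.0 * d), -c / (2.0 * d)

# One possible definition of "size": the full width at half maximum.
fwhm = 2.0 * np.sqrt(2.0 * np.log(2.0)) * sigma
```

The same normal equations can be written out by hand for a C implementation, since the model is linear in `a`, `b`, `c` and `d`.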

past meteor May 5, 2023, 5:49 PM

#

For most stuff in my context that is more than enough. In pure industry you can get away with even less

wooden sail May 5, 2023, 5:49 PM

#

i think that's the most important, yeah. if you understand that, you can read an algorithm and understand why it'd work

sleek harbor May 5, 2023, 5:50 PM

#

@wooden sail @past meteor I'm kinda dumb, so pls bear with me a bit. Am I wrong in assuming, that in such a graph, where we want the lowest value, that 1 (or a value very close to 1) would be the best obvious choice? Cus that's what I'd want to get as the "best" hyperparam value. However, usually, just because of how the dataset is split, and random factors one can't control, with enough repetitions and tries, some combination of parameters (and even if we are just tuning this one hyperparam) will have a lower target value (y) at a value with a lower than 1 param value (x).. those (or that one) combo will Not be good when you try it on another dataset.. no? I suck at talking, so I'm not even sure I'm getting my point across..

wooden sail May 5, 2023, 5:51 PM

#

what even is x here

#

what are we looking at

past meteor May 5, 2023, 5:51 PM

#

The number of trials I suppose?

#

1 is most likely the last trial

sleek harbor May 5, 2023, 5:51 PM

#

a hyperparameter, eta, its values

wooden sail May 5, 2023, 5:52 PM

#

and what does that control?

past meteor May 5, 2023, 5:52 PM

#

lemon_thinking

sleek harbor May 5, 2023, 5:52 PM

#

the learning rate basically

#

of XGBoost

#

just chose a random hyperparam.. a similar picture could be painted for many hyperparams

wooden sail May 5, 2023, 5:52 PM

#

well, you have 2 hyperparams, yeah?

past meteor May 5, 2023, 5:53 PM

#

The value you get in hyper parameter tuning is the average over all of your folds you tried the parameters on

sleek harbor May 5, 2023, 5:53 PM

#

wooden sail well, you have 2 hyperparams, yeah?

could be 2, could be 1, could be 100, the question would be the same

wooden sail May 5, 2023, 5:53 PM

#

sleek harbor could be 2, could be 1, could be 100, the question would be the same

right, so then comes my point. why does it matter whether eta is close to 1?

#

or any other hyperparam for that matter

past meteor May 5, 2023, 5:54 PM

#

Hence why you can take the best one. It should be relatively robust and not something that wildly overfits on your data

sleek harbor May 5, 2023, 5:54 PM

#

wooden sail right, so then comes my point. why does it matter whether eta is close to 1?

because being close to 1 has "proven" to "consistently" provide good results?

wooden sail May 5, 2023, 5:54 PM

#

what is "close to 1"

#

this will depend entirely on the problem at hand

sleek harbor May 5, 2023, 5:54 PM

#

past meteor Hence why you can take the best one. It should be relatively robust and not some...

yeah.. but it could just be overfit to your cv fold combination...

past meteor May 5, 2023, 5:55 PM

#

It's fit on ALL folds

#

it's not 1 hyperparam instance on 1 fold

sleek harbor May 5, 2023, 5:55 PM

#

wooden sail what is "close to 1"

would you rather have 1 or 0.1 in that picture?

wooden sail May 5, 2023, 5:55 PM

#

there's no reason why eta has to be close to 1 always, and as zestar says, i would expect any hyperparameter tuning tool to already average over all the folds and trials

wooden sail May 5, 2023, 5:55 PM

#

sleek harbor would you rather have 1 or 0.1 in that picture?

whatever gives the lower loss. the value of the hyperparam itself doesn't matter

past meteor May 5, 2023, 5:56 PM

#

The default eta of xgboost is apparently 0.3 so I wouldn't know why it should be close to 1

sleek harbor May 5, 2023, 5:56 PM

#

past meteor it's not 1 hyperparam instance on 1 fold

it can still be overfit to ALL the folds together, as in a different set of folds would result in drastically different results

past meteor May 5, 2023, 5:56 PM

#

sleek harbor it can still be overfit to ALL the folds together, as in a different set of fold...

??

wooden sail May 5, 2023, 5:56 PM

#

what?

#

if there's a problem with all the folds, there's a problem with your dataset

sleek harbor May 5, 2023, 5:57 PM

#

wooden sail there's no reason why eta has to be close to 1 always, and as zestar says, i wou...

but that's my whole problem.. they don't average over trials.. only over folds, but that's not enough

past meteor May 5, 2023, 5:57 PM

#

I'm not sure you fully understand k-fold and/or hyperparameter tuning?

sleek harbor May 5, 2023, 5:58 PM

#

past meteor The default `eta` of xgboost is apparently 0.3 so I wouldn't know why it should ...

different values will work differently on different datasets, obviously.. the point is that I'm tuning parameters for this dataset.. if one set of hyperparams were objectively the best for all datasets nobody would tune them in the first place...

past meteor May 5, 2023, 6:00 PM

#

9 times out of 10 for something like xgboost I don't tune it 🤷‍♂️

sleek harbor May 5, 2023, 6:01 PM

#

wooden sail if there's a problem with all the folds, there's a problem with your dataset

no.. it just means that.. 🤦 I can't explain this. For the same reason cv exists in the first place, repeated kfold has been invented to compensate for the problems of kfolds, which is great, but it doesn't fix the problem entirely, only helps.. You can overfit to a combination of KFolds same as u can overfit to a random split
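
To make the repeated k-fold point concrete, here is a sketch in plain NumPy, with closed-form ridge regression standing in for the model and its penalty `lam` for the hyperparameter. The data is made up and the function names are hypothetical. The spread across repeats gives a rough sense of how much of a score difference between two settings is just split noise.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge regression: (X^T X + lam I)^-1 X^T y."""
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

def repeated_kfold_mse(X, y, lam, k=5, repeats=10, seed=0):
    """Mean and spread of the validation MSE over `repeats` different k-fold splits."""
    rng = np.random.default_rng(seed)  # same seed, so every lam sees the same splits
    per_repeat = []
    for _ in range(repeats):
        folds = np.array_split(rng.permutation(len(y)), k)
        fold_mse = []
        for i in range(k):
            val = folds[i]
            train = np.concatenate([folds[j] for j in range(k) if j != i])
            w = ridge_fit(X[train], y[train], lam)
            fold_mse.append(np.mean((X[val] @ w - y[val]) ** 2))
        per_repeat.append(np.mean(fold_mse))
    return float(np.mean(per_repeat)), float(np.std(per_repeat))

# Made-up regression data
rng = np.random.default_rng(42)
X = rng.standard_normal((200, 5))
y = X @ np.array([1.0, -2.0, 0.0, 0.5, 3.0]) + 0.3 * rng.standard_normal(200)

# (mean, spread) of the validation MSE for a few candidate penalties
scores = {lam: repeated_kfold_mse(X, y, lam) for lam in (0.01, 1.0, 100.0)}
```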

wooden sail May 5, 2023, 6:02 PM

#

certainly, that can happen

#

but are you aware that this problem is at least as difficult as the original one you were solving?

sleek harbor May 5, 2023, 6:02 PM

#

past meteor 9 times out of 10 for something like xgboost I don't tune it 🤷‍♂️

that's cool. I'm obviously a noob and have no idea what I'm talking about.. maybe I shouldn't tune at all.. I'm just trying to understand here

wooden sail May 5, 2023, 6:02 PM

#

optimizing the hyperparams is a completely separate optimization problem of its own

#

not only that, you won't even be able to check you got the "best" or even "good" hyperparams

past meteor May 5, 2023, 6:03 PM

#

It can indeed happen that your specific instance of hyperparameters do a strangely good job on one fold which biases the result on average but you have to draw the line somewhere imo

wooden sail May 5, 2023, 6:04 PM

#

you validate, and if it performs well, you call it a day

#

you can only check by using arbitrarily large amounts of data

past meteor May 5, 2023, 6:04 PM

#

It's also fine to be "lazy" with hyperparameters imo. For boosting type models I would only tune the rounds of boosting I'm doing

#

Intuition tells me that this is likely the most important hyperparam (unlike for bagged models)

#

Overfitting is mostly related to fitting too many models and not the complexity of each individual one in sequential set-ups

sleek harbor May 5, 2023, 6:21 PM

#

idk.. I just feel like a value of eta of 0.13 would be objectively a bad choice, especially when you can see a graph of points that look to steadily improve the closer to 1 u get.. to me it seems like that value of 0.13 is pretty much an outlier that should be ignored, since other values around it seem to be on average worse than those closer to 1. Which imo means one should choose a value closer to the average "good". The thing that bugs me is that the optimization algorithm, as far as I understand, agrees with me on that, cus it keeps "suggesting" values closer to 1. But since those values, though they improve on average, don't manage to "abnormally score" the way 0.13 did, 0.13 remains the "winner". I would choose a winner that, say, scores the best among the best group of 10 consecutive averages..
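
The selection rule described in this message can be sketched as a post-processing step on the trial results. This is not something Optuna is known to do here; `smoothed_best` is a hypothetical helper, the trials are made up, and the sketch only covers a single hyperparameter, since with several of them neighbours along one axis are not really neighbours.

```python
import numpy as np

def smoothed_best(params, scores, window=10):
    """Return the parameter at the centre of the window with the lowest mean score."""
    order = np.argsort(params)
    p, s = np.asarray(params)[order], np.asarray(scores)[order]
    # Moving average over `window` neighbouring trials (in parameter order).
    rolling = np.convolve(s, np.ones(window) / window, mode="valid")
    start = int(np.argmin(rolling))
    return float(np.median(p[start:start + window])), float(rolling[start])

# Made-up trials: the score improves towards eta = 1, plus one lucky outlier at 0.13.
rng = np.random.default_rng(0)
eta = rng.uniform(0.01, 1.0, 300)
score = 1.0 - 0.3 * eta + 0.05 * rng.standard_normal(300)
eta = np.append(eta, 0.13)
score = np.append(score, 0.40)

raw_best = float(eta[np.argmin(score)])               # picks the single lowest trial
smooth_eta, smooth_score = smoothed_best(eta, score)  # picks the best neighbourhood
```

On this toy data the raw minimum lands on the outlier while the smoothed rule lands near the high end of the range. Whether that is actually a better choice on real data is exactly what is being debated above.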

sleek harbor May 5, 2023, 6:28 PM

#

past meteor Intuition tells me that this is likely the most important hyperparam (unlike for...

that's great, when you have enough experience to have intuition.. which I do not. Btw, the optuna algorithm strongly disagrees with that statement.. 😅

#

personally, to me, the eta graph looked a lot more informative, with a visible trend.. this looks.. pretty much random (already narrowed down a bit tho, when the range was 30-500 u could see that too low and too high results aren't good)

past meteor May 5, 2023, 6:30 PM

#

If you're tuning multiple hyperparameters then the importance of n_estimators might be subsumed

#

Kind of similar to collinearity

sleek harbor May 5, 2023, 6:32 PM

#

anyone have a guide to how to tune them properly? cus.. I see tons of various methods, and some of them seem fundamentally wrong to me. For example, the popular "tune one at a time" seems to be a strange choice to me, specifically because of collinearity..

past meteor May 5, 2023, 6:33 PM

#

Tune one at a time is bad as well because the surface is non-convex and some parameters are just unimportant

#

I would only tune n_estimators and call it a day. Maybe tuning 5 others would be better than that one but this is such an easy one to tune because it's discrete, you can grid search it even if you want

frigid lion May 5, 2023, 7:57 PM

#

hey so i've been learning data science for a while, displaying, analyzing data and mostly machine learning models using scikit-learn and the math behind them but I hear a lot about numpy and ye i've learned about it but still i don't feel like there are so many options that I use in it for it to be talked about so much.
I just want to know how much are you actually using numpy while doing any data science projects

#

and ye i know that many other libraries are based on numpy as well but I just dk if i'm missing sth about it that i don't use it that often by just calling something straight from numpy

#

not sure if you know what i mean but whenever some1 mentions data science 2 things that are mentioned are numpy and pandas and I don't know what it means to have knowledge of numpy

wooden sail May 5, 2023, 8:03 PM

#

frigid lion not sure if you know what i mean but whenever some1 mentions data science 2 thin...

scikit and pandas are built on top of numpy. that is to say, numpy is comparatively "low level" and requires you to code the math yourself

#

it gives you the most control, but you need to know how to do all the math

frigid lion May 5, 2023, 8:04 PM

#

wooden sail scikit and pandas are built on top of numpy. that is to say, numpy is comparativ...

ye this i know but for example if job offer says knowledge of numpy

#

what does it mean

wooden sail May 5, 2023, 8:05 PM

#

it means, if someone gives you some math, e.g. from a recent paper, you can implement it yourself on numpy (because no library will have an implementation of something recent)

frigid lion May 5, 2023, 8:06 PM

#

is there any way to train sth like that because i dont see myself need to ever do things like this so far

wooden sail May 5, 2023, 8:07 PM

#

by doing/reading math and implementing it yourself from scratch

#

for example many people try to set up basic neural networks from scratch using numpy

#

it helps you review both your math and numpy at the same time

frigid lion May 5, 2023, 8:07 PM

#

i havent got to neural networks yet so far so cant speak about it

wooden sail May 5, 2023, 8:08 PM

#

things like linear regression, then

#

anything you've ever done with pandas can also be done with numpy

frigid lion May 5, 2023, 8:08 PM

#

oh k maybe ill try it then this seems a bit hard to code from scratch but i may give it a try

#

or it just seems like that and may turn out not that hard

#

ive implemented knn, naive bayes and decision tree from scratch

#

how do you think linear regression compares to it when it comes to coding from scratch

wooden sail May 5, 2023, 8:10 PM

#

linear regression should be a lot easier

#

it's a good problem to practice many things though. pseudo inverses, gradient descent, newton methods, etc
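
As a rough illustration of the practice problem suggested above, here is a minimal sketch of linear regression in plain numpy, solved once with the pseudo-inverse and once with gradient descent. The synthetic data, the learning rate and the helper name `fit_gd` are assumptions made for the example, not something from the discussion.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (hypothetical): y = 2*x + 1 plus a little noise
x = rng.uniform(-1.0, 1.0, size=200)
y = 2.0 * x + 1.0 + 0.01 * rng.normal(size=200)

# Design matrix with a column of ones for the intercept
X = np.column_stack([x, np.ones_like(x)])

# 1) Closed form via the Moore-Penrose pseudo-inverse
w_pinv = np.linalg.pinv(X) @ y


# 2) Gradient descent on the mean squared error
def fit_gd(X, y, lr=0.1, n_steps=2000):
    w = np.zeros(X.shape[1])
    for _ in range(n_steps):
        grad = 2.0 / len(y) * X.T @ (X @ w - y)  # gradient of mean((Xw - y)^2)
        w -= lr * grad
    return w


w_gd = fit_gd(X, y)
print("pinv:", w_pinv, "gradient descent:", w_gd)
```

Both routes should land on roughly the same slope and intercept; comparing them is a quick sanity check on the gradient.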

frigid lion May 5, 2023, 8:12 PM

#

ok thanks a lot

wooden sail May 5, 2023, 8:13 PM

#

from the things you mentioned though, sounds like you're already pretty familiar with numpy

wheat snow May 5, 2023, 8:38 PM

#

is this the right place to ask for help on transforming an html to an csv file?

#

since csv is kinda data science related

hasty mountain May 5, 2023, 10:14 PM

#

Does anyone has experience with Pytorch Geometric? I'd really like to know how its Dataloader does its batching process. It feels like it simply considers batch size = 1 for every sample, and then modifies the tensor dimensions so the model can analyze the graph node, its edges and bonds...

(Yes, I've tried reading the docs, but still didn't figure it out)

#

I'm trying to implement a Unsupervised Pre-training process on a Graph Neural Network, so the way the API is batching the samples is causing me some trouble...

hasty mountain May 6, 2023, 3:47 AM

#

hasty mountain Does anyone has experience with Pytorch Geometric? I'd really like to know how i...

Nevermind, that was an error in my code. The batching is working fine, now. Or at least seem to be...

#

It gets annoyingly slow when it's too big, something that I find strange, but ok...

granite bronze May 6, 2023, 5:31 AM

#

question, im pretty new to python and i wanna learn ai and machine learning. what kinds of things would you suggest me know how to do as a prerequisite, and also do you have any tutorials you would suggest me watch/read when it does come time to learning?

#

sorry if this question is out of place btw

stark zenith May 6, 2023, 6:07 AM

#

granite bronze question, im pretty new to python and i wanna learn ai and machine learning. wha...

Honestly the fast.ai course is pretty good. It more puts you in a spot where you can do something with it, then works backwards from there.

granite bronze May 6, 2023, 6:07 AM

#

thx man i will check that out

stark zenith May 6, 2023, 6:08 AM

#

No problem, enjoy! Try to really commit to it, follow along with the notebooks, and make your own projects.

inland heath May 6, 2023, 7:41 AM

#

hey im trying to use regularisation to improve a linear regression i did. i have an excel spreadsheet with x and y values and i'm not sure how to split the data so that i have a dataset of x and y train and another dataset of x and y test which have to be a numpy array (the extracted data from the excel spreadsheet is in the form of a list within a list (inner list is row values)

#

if yall can provide any suggestions feel free to ping me :)

lapis sequoia May 6, 2023, 7:58 AM

#

.

zinc nova May 6, 2023, 8:03 AM

#

hey hi everyone , anyone interested in nlp and classification of texts ?

wooden sail May 6, 2023, 8:06 AM

#

inland heath hey im trying to use regularisation to improve a linear regression i did. i have...

you can split them at random, and try different realizations of the split

#

as for the regularization, what are you trying to do? which property are you trying to enforce?
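
The random split suggested above could look roughly like this in numpy, assuming the spreadsheet rows arrive as a list of `[x, y]` lists. The function name `train_test_split_rows` and the 80/20 ratio are illustrative choices, not from the question.

```python
import numpy as np


def train_test_split_rows(rows, test_fraction=0.2, seed=0):
    """Shuffle rows of [x, y] pairs and split them into train and test arrays."""
    data = np.asarray(rows, dtype=float)       # shape (n_samples, 2)
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(data))           # a random ordering of the row indices
    n_test = int(round(test_fraction * len(data)))
    test_idx, train_idx = idx[:n_test], idx[n_test:]
    x_train, y_train = data[train_idx, 0], data[train_idx, 1]
    x_test, y_test = data[test_idx, 0], data[test_idx, 1]
    return x_train, y_train, x_test, y_test


# Hypothetical rows as they might come out of the spreadsheet: [x, y] per row
rows = [[i, 3.0 * i + 1.0] for i in range(10)]
x_train, y_train, x_test, y_test = train_test_split_rows(rows, test_fraction=0.2, seed=0)
```

Changing `seed` gives a different realization of the split, which is one way to check that a result does not depend on one lucky partition.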

young granite May 6, 2023, 8:22 AM

#

does anyone have an idea how i could 3d plot complex numbers in a "unit sphere"?

wooden sail May 6, 2023, 8:25 AM

#

what are you trying to plot?

young granite May 6, 2023, 8:27 AM

#

wooden sail what are you trying to plot?

i was thinking of a way to plot and do a kind of clustering of FFT frequencies maybe with their magnitude

#

this just came up to my mind and would be a cool way to show distribution

wooden sail May 6, 2023, 8:28 AM

#

i don't see where the 3d part comes in though

young granite May 6, 2023, 8:28 AM

#

i could do it 2D on the unit circle

#

but for many datapoints it gets unstructured

wooden sail May 6, 2023, 8:29 AM

#

why the unit circle or sphere though

young granite May 6, 2023, 8:29 AM

#

to get a better understanding visualization of the distribution

wooden sail May 6, 2023, 8:29 AM

#

the distribution of what

young granite May 6, 2023, 8:29 AM

#

the complex values

wooden sail May 6, 2023, 8:30 AM

#

i'm not sure i follow what you're trying to do

#

let's forget for a second that they're complex numbers, because they're isomorphic to R2. so we have a set of points in R2. why would you want to project them onto the unit circle for this? this gets rid of the magnitude information and keeps only the phase

gloomy saddle May 6, 2023, 8:31 AM

#

isn't it more I and Q for stuff like this, magnitude and phase?

#

e.g. frequency for X, magnitude for Z, Phase for Y?

young granite May 6, 2023, 8:33 AM

#

thats why i try to figure out how i can use the magnitude as z

wooden sail May 6, 2023, 8:34 AM

#

you only have 1 input axis though. if you want to see the magnitude, you'd just get frequency vs magnitude

#

what's the actual problem? you have some data in spectral domain, and you want to figure out which frequencies have some property?

young granite May 6, 2023, 8:36 AM

#

wooden sail what's the actual problem? you have some data in spectral domain, and you want t...

kinda i got some spectrum and want to compare the resulting frequencies

wooden sail May 6, 2023, 8:37 AM

#

compare them to what?

young granite May 6, 2023, 8:37 AM

#

each other

wooden sail May 6, 2023, 8:37 AM

#

ok

#

that's very different

#

cuz then you have vectors in C^n, where n is the length of the spectrum. you'd have to do some sort of projection first

young granite May 6, 2023, 8:38 AM

#

why tho lets assume we got a spectrum resulting in 5 freqs, when i plot them all into the unit sphere and lets say another one i can directly compare?

wooden sail May 6, 2023, 8:38 AM

#

hold up

#

each spectrum you want to compare has 5 frequency bins? 5 samples, each one a complex number?

young granite May 6, 2023, 8:39 AM

#

wooden sail each spectrum you want to compare has 5 frequency bins? 5 samples, each one a co...

yes 5 complex value per spectrum

wooden sail May 6, 2023, 8:40 AM

#

ok. the sphere here is the 4-sphere, a 4-dimensional object in 5d space

#

if you want something you can visualize, you have to do a projection onto a lower dimensional space first

#

and again, projecting onto the sphere gets rid of the magnitude information and leaves only the angle of the vector

young granite May 6, 2023, 8:41 AM

#

mhhh

wooden sail May 6, 2023, 8:42 AM

#

it keeps info regarding the magnitudes of the complex values relative to each other in each spectrum

past meteor May 6, 2023, 8:42 AM

#

@wooden sail I'm curious how you would solve an issue we had at work recently:

wooden sail May 6, 2023, 8:42 AM

#

is that enough info? you tell us

young granite May 6, 2023, 8:43 AM

#

wooden sail ok. the sphere here is the 4-sphere, a 4-dimensional object in 5d space

i would struggle to build something like this tbh

wooden sail May 6, 2023, 8:43 AM

#

i'm not sure why you wanted to project onto the sphere yet. there are cases where it makes sense, but visualization is also a completely separate matter

young granite May 6, 2023, 8:44 AM

#

wooden sail i'm not sure why you wanted to project onto the sphere yet. there are cases wher...

how would u compare complex values of spectrums then?

wooden sail May 6, 2023, 8:44 AM

#

depends on what i'm looking for

young granite May 6, 2023, 8:44 AM

#

would u at all do something like this

gloomy saddle May 6, 2023, 8:44 AM

#

start by better explaining what your trying to visualise, what you have described is very fuzzy?

young granite May 6, 2023, 8:44 AM

#

similarities distributions etc.

past meteor May 6, 2023, 8:44 AM

#

We had a 3d point cloud with each point being an EMG sensor. It's a person moving along a line from back to front (but the direction differs from person to person) the task was to find the right heel

wooden sail May 6, 2023, 8:44 AM

#

young granite similarities distributions etc.

this is also completely different

#

are we doing a statistical comparison or a deterministic one regarding shape?

young granite May 6, 2023, 8:45 AM

#

i draw something, give me a sec 😄

past meteor May 6, 2023, 8:45 AM

#

So we had measurements every few ms. of the position of each sensor. obviously people are moving (raising, lowering their body parts and thus the sensors)

lone plaza May 6, 2023, 8:46 AM

#

Is this efficient enough for the cost function?

gloomy saddle May 6, 2023, 8:46 AM

#

past meteor We had a 3d point cloud with each point being an EMG sensor. It's a person movi...

text not image :/

past meteor May 6, 2023, 8:47 AM

#

gloomy saddle text not image :/

wdym?

wooden sail May 6, 2023, 8:47 AM

#

lone plaza Is this efficient enough for the cost function?

looks like some type of entropy, what exactly is your question?

lone plaza May 6, 2023, 8:48 AM

#

Is there a built-in np function that takes care of 0s and sets them to something slightly bigger than 0, so as to avoid taking a log of 0?
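
One common way to handle this is `np.clip`, which bounds every value to a given range before the log is taken. The sketch below assumes the cost is a binary cross-entropy (the posted formula is not reproduced in this log); the function name and the `eps` value are illustrative.

```python
import numpy as np


def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    """Mean binary cross-entropy with predictions clipped away from 0 and 1."""
    y_true = np.asarray(y_true, dtype=float)
    # Clip so that neither log(p) nor log(1 - p) ever sees an exact zero
    p = np.clip(np.asarray(y_pred, dtype=float), eps, 1.0 - eps)
    return float(-np.mean(y_true * np.log(p) + (1.0 - y_true) * np.log(1.0 - p)))


# Predictions of exactly 1.0 and 0.0 would otherwise produce log(0)
loss = binary_cross_entropy([1, 0, 1], [1.0, 0.0, 0.5])
print(loss)
```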

wooden sail May 6, 2023, 8:49 AM

#

past meteor So we had measurements every few ms. of the position of each sensor. obviously p...

i'm not sure what kind of data an EMG sensor gives, but what comes to mind are those stick figure models. maybe the parameters of one of those could be fit given the sensor info

young granite May 6, 2023, 8:49 AM

#

1) spectrum, 2) FFT, 3) complex values, 4) sphere plot

past meteor May 6, 2023, 8:49 AM

#

We have some activation values but it's mostly X, Y, Z we're working with. after that we use the EMG activation of a reference point to make our models

#

The stick figure models are a good one! I know it from the context of facial recognition but I hadn't thought of applying it here

wooden sail May 6, 2023, 8:50 AM

#

young granite 1) spectrum, 2) FFT, 3) complex values, 4) sphere plot

what's the difference between spectrum and fft here

past meteor May 6, 2023, 8:51 AM

#

We have a heuristic in place right now, I'll try and see if I can make what you're suggesting work indeed

young granite May 6, 2023, 8:51 AM

#

wooden sail what's the difference between spectrum and fft here

condensation of data

wooden sail May 6, 2023, 8:51 AM

#

what?

young granite May 6, 2023, 8:52 AM

#

i keep 5 freq of the resulting FFT

#

or in that example 3

wooden sail May 6, 2023, 8:53 AM

#

ok so the original thing isn't a spectrum

#

cuz the fft yields the spectrum

gloomy saddle May 6, 2023, 8:53 AM

#

1 is normally your raw input data

wooden sail May 6, 2023, 8:53 AM

#

and here i mean spectrum as in spectral domain, its physical meaning notwithstanding

young granite May 6, 2023, 8:53 AM

#

wooden sail and here i mean spectrum as in spectral domain, its physical meaning notwithstan...

ok that's where we were mismatching 😄

#

so yes 1 is input data

wooden sail May 6, 2023, 8:54 AM

#

anyway. you have some data, you fft it to get the spectral domain, you keep some fourier bins

young granite May 6, 2023, 8:54 AM

#

wooden sail May 6, 2023, 8:54 AM

#

do you keep the same bins for all the data?

young granite May 6, 2023, 8:55 AM

#

i can choose whether i keep the 5 with highest power spectrum or [:5]
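
The two selection options mentioned here can be sketched in numpy as follows. The helper `keep_bins`, the use of the one-sided `np.fft.rfft` and the test signal are assumptions for illustration only.

```python
import numpy as np


def keep_bins(signal, k=5, by_power=True):
    """Return (indices, complex values) of k bins of the one-sided FFT."""
    spectrum = np.fft.rfft(signal)
    if by_power:
        power = np.abs(spectrum) ** 2
        idx = np.sort(np.argsort(power)[-k:])   # the k strongest bins, in frequency order
    else:
        idx = np.arange(k)                      # simply the first k bins, i.e. [:k]
    return idx, spectrum[idx]


# Hypothetical test signal: two sines, 64 samples over one second
t = np.arange(64) / 64.0
signal = np.sin(2 * np.pi * 5 * t) + 0.5 * np.sin(2 * np.pi * 12 * t)

idx_top, bins_top = keep_bins(signal, k=2, by_power=True)
idx_first, bins_first = keep_bins(signal, k=2, by_power=False)
print(idx_top, idx_first)
```

With the power-based choice the kept indices can differ from signal to signal, so two spectra are only directly comparable bin by bin when the same indices are kept for both.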

wooden sail May 6, 2023, 8:55 AM

#

ok. and after doing this, we wanna check how similar the bins are

young granite May 6, 2023, 8:56 AM

#

so not necessarily the 5 highest and therefore could differ

gloomy saddle May 6, 2023, 8:56 AM

#

and after getting frequency and magnitude, a 2 dimensional value, now what? e.g. are you say slicing the input into small time periods, and plotting how the FFT changes over time?

young granite May 6, 2023, 8:56 AM

#

wooden sail ok. and after doing this, we wanna check how similar the bins are

yes

wooden sail May 6, 2023, 8:56 AM

#

in this case the meaning of the fourier axis doesn't really matter

#

these are basically just vectors in C^n

#

is the magnitude of the bins important? or only their ratios?

#

e.g. is the vector [10, 5] the same as the vector [2, 1]? or is the "energy content" important?

young granite May 6, 2023, 8:58 AM

#

wooden sail is the magnitude of the bins important? or only their ratios?

i would argue only the ratios

wooden sail May 6, 2023, 8:58 AM

#

ok, then the magnitude doesn't matter and you can indeed project on the unit sphere

#

that can make the distance... tricky to measure, but we can ignore that for now
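
A small numpy sketch of what projecting onto the unit sphere does to the `[10, 5]` versus `[2, 1]` example: both collapse to the same point because only the ratios survive. The helper name `to_unit_sphere` is made up for the example.

```python
import numpy as np


def to_unit_sphere(v):
    """Scale a complex vector to unit Euclidean norm."""
    v = np.asarray(v, dtype=complex)
    norm = np.linalg.norm(v)
    if norm == 0:
        raise ValueError("the zero vector has no direction")
    return v / norm


a = to_unit_sphere([10, 5])
b = to_unit_sphere([2, 1])     # same ratios as a, so the same point on the sphere
c = to_unit_sphere([2, 1j])    # same magnitudes as b but a different phase
print(a, b, c)
```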

young granite May 6, 2023, 8:59 AM

#

yeh i think the idea is pretty cool but i struggle with embedding the code 😄

wooden sail May 6, 2023, 9:00 AM

#

now we have unit vectors in C^n. and you want to project this to R^3 you say

young granite May 6, 2023, 9:00 AM

#

to get all values inside the sphere

young granite May 6, 2023, 9:00 AM

#

wooden sail now we have unit vectors in C^n. and you want to project this to R^3 you say

wooden sail May 6, 2023, 9:00 AM

#

that's fairly difficult. hmm

#

i don't think there's a very meaningful way of doing that tbh

young granite May 6, 2023, 9:01 AM

#

mhhh

wooden sail May 6, 2023, 9:01 AM

#

the only way to guarantee you get real values out of a function with complex inputs is to make it a constant function 😛

#

you can make 2 spheres, one for the real parts and another for the imaginary parts

young granite May 6, 2023, 9:02 AM

#

thats fairly simple 😄

#

i didnt know u are that pragmatic edd 😄

#

😛

wooden sail May 6, 2023, 9:03 AM

#

i'm usually a "why visualize" kind of person tbh

#

all right, and then this still leaves the problem that we need a matrix that maps from C^5 to C^3 while approximately preserving distances

young granite May 6, 2023, 9:03 AM

#

wooden sail i'm usually a "why visualize" kind of person tbh

cause looks nice and makes it easy to understand for topic foreign persons

wooden sail May 6, 2023, 9:04 AM

#

the thing is that low dimensional representations never tell the full story 😛 projections lose information

young granite May 6, 2023, 9:04 AM

#

+1

#

just get best of both worlds id say 😛

wooden sail May 6, 2023, 9:04 AM

#

in this case, for example, if your C^5 vectors do not have a sparse representation, it'll be very difficult to embed them while preserving distance

#

the easiest approach is to make a random matrix size 3 x 5 where the entries are random, and just use that
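
A minimal sketch of the random-matrix idea, assuming a complex Gaussian 3 x 5 matrix and some made-up unit vectors in C^5. The scaling factor is one conventional choice that keeps squared norms unchanged on average; it is an assumption here, not something specified above.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: 4 unit vectors in C^5 (one row per set of kept bins)
Z = rng.normal(size=(4, 5)) + 1j * rng.normal(size=(4, 5))
Z /= np.linalg.norm(Z, axis=1, keepdims=True)

# Random complex 3 x 5 matrix; the 1/sqrt(2*3) scaling gives E[|Ax|^2] = |x|^2
A = (rng.normal(size=(3, 5)) + 1j * rng.normal(size=(3, 5))) / np.sqrt(2 * 3)

Y = Z @ A.T                               # projected vectors in C^3, shape (4, 3)
real_part, imag_part = Y.real, Y.imag     # each can be drawn as points in R^3
print(real_part.shape, imag_part.shape)
```

The real and imaginary parts of `Y` are the two sets of 3D points that the "2 spheres" suggestion would plot side by side.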

young granite May 6, 2023, 9:07 AM

#

wont it be possible to use the PS for Z and norm them?

wooden sail May 6, 2023, 9:07 AM

#

what's PS?

young granite May 6, 2023, 9:07 AM

#

power spectrum, but nah then i lose information

#

mhhh

wooden sail May 6, 2023, 9:08 AM

#

right, you'd lose info

#

you can try, why not. compare it to the approach with 2 spheres

young granite May 6, 2023, 9:08 AM

#

always a pleasure to hear (read lel) ur thoughts ❤️

#

but then i would only represent data in 1/2 the sphere

#

so maybe not the PS

wooden sail May 6, 2023, 9:11 AM

#

also note that a 3 x 5 matrix has a spark that is at most 4, i.e. in the BEST case no 3 columns are linearly dependent, but any 4 are. unique recovery of k-sparse vectors needs spark > 2k, so you can only really COMPLETELY discern vectors that are 1-sparse

#

which is pretty strict

young granite May 6, 2023, 9:11 AM

#

🗿

#

2 spheres it is then 😄

#

but ill see what i can come up with after ur input

#

maybe i ask a colleague as well what he thinks bout this

wooden sail May 6, 2023, 9:12 AM

#

this will be a problem regardless of what you do, i'm just saying you will very likely not get anything useful out of this approach

#

regardless of using power spectrum or not

young granite May 6, 2023, 9:12 AM

#

mhh

wooden sail May 6, 2023, 9:12 AM

#

the problem is projecting down to C^3

young granite May 6, 2023, 9:12 AM

#

so better sticking with 2D?

wooden sail May 6, 2023, 9:13 AM

#

better not project and do it in C^5, then make plots of the distances

#

the more you project, the worse the problem gets

#

but go ahead and try. maybe we'll be pleasantly surprised. but if it doesn't give anything interesting, you shouldn't be surprised
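
The "stay in C^5 and plot the distances" option can be sketched as a pairwise distance matrix in numpy. The three example vectors are invented for the illustration.

```python
import numpy as np


def pairwise_distances(Z):
    """Euclidean distances between the rows of a complex array Z of shape (n, d)."""
    diff = Z[:, None, :] - Z[None, :, :]       # shape (n, n, d)
    return np.linalg.norm(diff, axis=-1)       # shape (n, n)


# Hypothetical unit vectors in C^5; rows 0 and 2 are identical
Z = np.array([[1, 0, 0, 0, 0],
              [0, 1j, 0, 0, 0],
              [1, 0, 0, 0, 0]], dtype=complex)
D = pairwise_distances(Z)
print(D)
```

The matrix `D` can then be shown as a heatmap or fed to a clustering method, with no projection step and therefore no loss of information.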

young granite May 6, 2023, 9:14 AM

#

pushing boundries lel

wooden sail May 6, 2023, 9:14 AM

#

try making one sphere in R^3 using the power spectrum, and two spheres (real and imag) using the complex fourier bins and see if anything looks nice

inland heath May 6, 2023, 9:33 AM

#

wooden sail you can split them at random, and try different realizations of the split

all good i got it

obsidian peak May 6, 2023, 11:10 AM

#

https://github.com/YashIndane/platefetcher

GitHub

GitHub - YashIndane/platefetcher: Scan the number plate and get all...

Scan the number plate and get all the details of the vehicle! 🚘 - GitHub - YashIndane/platefetcher: Scan the number plate and get all the details of the vehicle! 🚘

lone plaza May 6, 2023, 11:42 AM

#

Is there any experienced python developer who's willing to look through my self-written ai? Nothing impressive tho, it is just a proof of concept for me

young granite May 6, 2023, 12:37 PM

#

@wooden sail i created worms 🗿

wooden sail May 6, 2023, 12:39 PM

#

lol

young granite May 6, 2023, 12:42 PM

#

wooden sail lol

somewhat clustering 🗿

#

#

generated sine functions with noise and some freqs

#

but thats it for now i guess, first discussing this with my colleague next week so i dont waste more time xD

steady bronze May 6, 2023, 1:35 PM

#

hey guys do i need to pay for the open ai gpt api
because when i create a api key and try to use it its not working

from langchain.llms import OpenAI
llm = OpenAI()
llm("explain large language models in one sentence")

this is my code but the response i get is
RateLimitError: You exceeded your current quota, please check your plan and billing details.
i have never even used my api key before
i just created it

spiral smelt May 6, 2023, 1:50 PM

#

Hello, I was just wondering whether anyone had any experience in neural network image classification? I've written a Python script that image classifies two categories, however I would like to extend it to 10 categories. Any help would be really appreciated, because I'm a bit lost on how to do this 🙂

cold osprey May 6, 2023, 1:52 PM

#

increase outputs to 10 at your fully connected layer

spiral smelt May 6, 2023, 1:53 PM

#

# Adds in our layers

# Adds a convolutional layer and a max pooling layer
# Has 16 filters (3,3 pixels in size)
# Stride moving one pixel by one
# Extracts the relevant information to make a classification
# Applies a relu activation - taking into account non-linear patterns
# Image shape is going to be 256 wide by 256 high, 3 channels deep
model.add(Conv2D(16, (3,3), 1, activation='relu', input_shape=(256,256,3)))
model.add(MaxPooling2D())

# Adds a convolutional layer and a max pooling layer
# Has 32 filters (3,3 pixels in size)
# Stride moving one pixel by one
model.add(Conv2D(32, (3,3), 1, activation='relu'))
model.add(MaxPooling2D())

# Adds a convolutional layer and a max pooling layer
# Has 16 filters (3,3 pixels in size)
# Stride moving one pixel by one
model.add(Conv2D(16, (3,3), 1, activation='relu'))
model.add(MaxPooling2D())

# Flattens to remove the channels value
model.add(Flatten())

# 256 values will now be the output
model.add(Dense(256, activation='relu'))

# Creates a single output, 0 or 1
model.add(Dense(1, activation='sigmoid'))

# Compiles the model using the 'adam' optimiser. Specifying what the loss is. The metric tracked is accuracy, shows how well the model is classifying either 0 or 1.
model.compile('adam', loss=tf.losses.BinaryCrossentropy(), metrics=['accuracy'])

# Displays how the model transforms the data
model.summary()

spiral smelt May 6, 2023, 1:54 PM

#

cold osprey increase outputs to 10 at your fully connected layer

Sorry where about would I put this, I'm very new to this

cold osprey May 6, 2023, 1:55 PM

#

model.add(Dense(9, activation='sigmoid'))

9 or 10

#

should be 10, one for each class

spiral smelt May 6, 2023, 1:56 PM

#

Okay one second I'll have a go 🙂

spiral smelt May 6, 2023, 2:00 PM

#

cold osprey ```py model.add(Dense(9, activation='sigmoid')) ``` 9 or 10

So I did that but it still doesn't work. I think the problem at the moment I need to assign 0 to 9 to 10 categories before hand but at the moment I haven't figured it out

cold osprey May 6, 2023, 2:01 PM

#

how does ur data look

spiral smelt May 6, 2023, 2:02 PM

#

I have a folder called 'data' within the folder I have three sub-folders 'train' , 'test' and 'validation', within those folder is 10 categories that contain different items of clothing

cold osprey May 6, 2023, 2:02 PM

#

u using data loaders or?

spiral smelt May 6, 2023, 2:03 PM

#

cold osprey how does ur data look

would it be worth sharing my entire code? thank you so much for this, been working on this for about 30 hours :/ im using os to load the data from the directories

cold osprey May 6, 2023, 2:03 PM

#

sure

#

im in a dota game rn tho hahha

spiral smelt May 6, 2023, 2:04 PM

#

Oh don't worry if you're busy 🙂 I can keep working on it @cold osprey

cold osprey May 6, 2023, 2:39 PM

#

spiral smelt Oh don't worry if you're busy 🙂 I can keep working on it @cold osprey

donez

sleek harbor May 6, 2023, 2:39 PM

#

I've never seen this done before (summing up results of predictions of the test set made with models trained on train-validation sets across kfolds, and then divided by the total folds). Is this a common practice? Cus so far I've only come across the popular "refit all training data with best cross val results and then predict test data with that model".. never seen something like this before in courses or tutorials, but it does kinda make sense
source: https://aetperf.github.io/2021/02/16/Optuna-+-XGBoost-on-a-tabular-dataset.html

Architecture & Performance

Optuna + XGBoost on a tabular dataset

Databases, Dataviz, Machine Learning.
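
The fold-averaging scheme described in the question above can be sketched like this in numpy: one model is fit per fold on the remaining training rows, each predicts the test set, and the predictions are averaged. Ordinary least squares stands in for XGBoost here, and all names and data are illustrative assumptions rather than the linked article's code.

```python
import numpy as np


def fit_linear(X, y):
    """Stand-in model: ordinary least squares weights."""
    return np.linalg.lstsq(X, y, rcond=None)[0]


def kfold_average_predictions(X_train, y_train, X_test, n_folds=5, seed=0):
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(X_train)), n_folds)
    test_pred = np.zeros(len(X_test))
    for val_idx in folds:
        # Fit on everything except the held-out validation fold
        fit_idx = np.setdiff1d(np.arange(len(X_train)), val_idx)
        w = fit_linear(X_train[fit_idx], y_train[fit_idx])
        test_pred += X_test @ w                # accumulate this fold's test predictions
    return test_pred / n_folds                 # average over the folds


# Hypothetical noise-free data so the result is easy to check
rng = np.random.default_rng(1)
X_train = rng.normal(size=(50, 2))
y_train = X_train @ np.array([1.0, -2.0])
X_test = rng.normal(size=(5, 2))
avg_pred = kfold_average_predictions(X_train, y_train, X_test)
print(avg_pred)
```

In effect this is a small ensemble of the per-fold models, as opposed to a single refit on all of the training data.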

quartz ivy May 6, 2023, 3:12 PM

#

spiral smelt # Adds in our layers # Adds a convolutional layer and a max pooling layer # Has...

you can't use binary cross entropy for multi class, need to change that, i think

cold osprey May 6, 2023, 3:23 PM

#

yes

#

can just use CrossEntropyLoss

#

hmm thats pytorch

#

not sure what the tf equivalent is

spiral smelt May 6, 2023, 3:37 PM

#

quartz ivy you can't use binary cross entropy for multi class, need to change that, i think

Okay thank you, I'm still looking into how to fix it. AI is really new to me

#

https://github.com/KatieCook12/Neural-Networks/blob/f775208e1302c14905ff7b2a4e2a643afe028807/Python - here's the code I've already written

GitHub

Neural-Networks/Python at f775208e1302c14905ff7b2a4e2a643afe028807 ...

Neural Networks Image Classification. Contribute to KatieCook12/Neural-Networks development by creating an account on GitHub.

cold osprey May 6, 2023, 3:41 PM

#

model.add(Dense(1, activation='sigmoid'))

#

if u change this to 10 and the loss to CategoricalCrossentrypy, what happens?

#

https://www.tensorflow.org/api_docs/python/tf/keras/losses/CategoricalCrossentropy

TensorFlow

tf.keras.losses.CategoricalCrossentropy | TensorFlow v2.12.0

Computes the crossentropy loss between the labels and predictions.

#

the way uve set up ur code is abit weird too

#

```py
if yhat < 0.5:
    print(f'Predicted class is dress.')
else:
    print(f'Predicted class is hat.')
```
like this bit

#

are u following a course for this or?

spiral smelt May 6, 2023, 3:54 PM

#

cold osprey are u following a course for this or?

Thank you, just coding it now. I'm following a YouTube video

cold osprey May 6, 2023, 3:55 PM

#

ah ic

spiral smelt May 6, 2023, 3:57 PM

#

cold osprey ah ic

# Compiles the model using the 'adam' optimiser. Specifying what the loss is. The metric tracked is accuracy, shows how well the model is classifying either 0 or 1.

model.compile('adam', loss=tf.losses.CategoricalCrossentrypy(), metrics=['accuracy']) - so when I ran this it came up with this error

#

AttributeError                            Traceback (most recent call last)
Cell In[52], line 2
      1 # Compiles the model using the 'adam' optimiser. Specifying what the loss is. The metric tracked is accuracy, shows how well the model is classifying either 0 or 1.
----> 2 model.compile('adam', loss=tf.losses.CategoricalCrossentrypy(), metrics=['accuracy'])

File ~\lib\site-packages\tensorflow\python\util\lazy_loader.py:59, in LazyLoader.__getattr__(self, item)
     57 def __getattr__(self, item):
     58   module = self._load()
---> 59   return getattr(module, item)

AttributeError: module 'keras.api._v2.keras.losses' has no attribute 'CategoricalCrossentrypy'

cold osprey May 6, 2023, 3:57 PM

#

lel theres a typo

#

CategoricalCrossentrypy - > CategoricalCrossentropy

spiral smelt May 6, 2023, 3:58 PM

#

yeah just realised sorry

spiral smelt May 6, 2023, 3:59 PM

#

cold osprey lel theres a typo

so now when I run -

# Model.fit takes in the training data
# Epochs is how long we're going to train for
# Passes through the validation data, to see how well the model is performing in real time
# Stores in a variable called history
hist = model.fit(train, epochs=20, validation_data=val, callbacks=[tensorboard_callback])

- it comes out as:

cold osprey May 6, 2023, 4:00 PM

#

yes epochs is how many times we pass through the whole dataset

spiral smelt May 6, 2023, 4:00 PM

#

I'm getting an error when I run it saying: ValueError: Shapes (None, 1) and (None, 10) are incompatible

cold osprey May 6, 2023, 4:02 PM

#

where is the error from?

#

like which line

spiral smelt May 6, 2023, 4:02 PM

#

hist = model.fit(train, epochs=20, validation_data=val, callbacks=[tensorboard_callback]) - its coming from this

#

oh wait one sec

spiral smelt May 6, 2023, 4:07 PM

#

cold osprey where is the error from?

Unfortunately I'm still getting the error

cold osprey May 6, 2023, 4:09 PM

#

does ur data only have 2 classes?

#

how does y look for ur data?

spiral smelt May 6, 2023, 4:09 PM

#

10 categories, but maybe I didn't set it up right, should I print y?

#

so this is how I set up the classes:

#

# Builds an image dataset, using keras

test = tf.keras.utils.image_dataset_from_directory('data/test')
train = tf.keras.utils.image_dataset_from_directory('data/train')
val= tf.keras.utils.image_dataset_from_directory('data/validation')

#

this is the output: Found 249 files belonging to 10 classes.
Found 3054 files belonging to 10 classes.
Found 194 files belonging to 10 classes.

#

# Allows us to convert to a numpy iterator, allows access to the image dataset

data_iterator_test = test.as_numpy_iterator()
data_iterator_train = train.as_numpy_iterator()
data_iterator_val = val.as_numpy_iterator()

cold osprey May 6, 2023, 4:12 PM

#

ye

#

looking at ur code

spiral smelt May 6, 2023, 4:12 PM

#

Thank you, honestly I appreciate this so much

cold osprey May 6, 2023, 4:13 PM

#

basically the last layer should output 10 numbers

#

logits or probabilities

#

which the highest will be what it classifies the image as

#

```py
{'T-shirt/top': 0,
 'Trouser': 1,
 'Pullover': 2,
 'Dress': 3,
 'Coat': 4,
 'Sandal': 5,
 'Shirt': 6,
 'Sneaker': 7,
 'Bag': 8,
 'Ankle boot': 9}
```
then u would have something like this

#

so say the first '0th' was the highest, then its a tshirt/top
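
The "highest output wins" step can be sketched in numpy as below. The class order follows the mapping discussed above and the scores are made up. With a Keras dataset built by `image_dataset_from_directory`, the actual order should be available as the dataset's `class_names` attribute, which follows the folder names, so it is worth checking rather than assuming this list.

```python
import numpy as np

# Assumed class order, index i corresponds to output i of the last layer
CLASS_NAMES = ['T-shirt/top', 'Trouser', 'Pullover', 'Dress', 'Coat',
               'Sandal', 'Shirt', 'Sneaker', 'Bag', 'Ankle boot']


def predicted_class(outputs, class_names=CLASS_NAMES):
    """Map a vector of 10 scores (logits or probabilities) to a class name."""
    outputs = np.asarray(outputs)
    return class_names[int(np.argmax(outputs))]


# Hypothetical model output for one image
scores = np.array([0.05, 0.02, 0.03, 0.60, 0.05, 0.05, 0.05, 0.05, 0.05, 0.05])
label = predicted_class(scores)
print(label)
```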

spiral smelt May 6, 2023, 4:14 PM

#

okay that makes sense, so how do I assign the categories to their number

#

So I guess at the moment it's only assigning to either 0 or 1 and not the entire range

cold osprey May 6, 2023, 4:15 PM

#

ya when u set ur last layer to output 1 only, its outputting one number which u then see if its < 0.5 or >= 0.5 (yhat)

#

which is a ok way to do it but harder when u want to modify it for multiclass classification

#

what i wouldve done for binary classification is just output 2 classes with the same idea as 10 classes

#

i think the error is coming from how the data is set up hmmmm

#

am comparing to my pytorch code rn

#

been a while since i used tensorflow

spiral smelt May 6, 2023, 4:18 PM

#

Okay, thank you, I'm googling too, to see what solution there is

cold osprey May 6, 2023, 4:22 PM

#

could u print one of ur data and see how it looks like?

spiral smelt May 6, 2023, 4:24 PM

#

okay I think I figured out the label problem I included this:

#

num_classes = 10 # Replace 10 with the actual number of classes in your dataset

test = test.map(lambda x, y: (x / 255, tf.one_hot(y, num_classes)))
train = train.map(lambda x, y: (x / 255, tf.one_hot(y, num_classes)))
val = val.map(lambda x, y: (x / 255, tf.one_hot(y, num_classes)))

#

I'm now running the training, which is working (yay!) I'll let you know the results
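
To show why the one-hot step resolves the `(None, 1)` versus `(None, 10)` shape mismatch, here is a numpy sketch of one-hot encoding and the categorical cross-entropy it feeds. The helper names are illustrative. In Keras itself, alternatives worth checking are `label_mode='categorical'` in `image_dataset_from_directory` or the `SparseCategoricalCrossentropy` loss, which takes integer labels directly.

```python
import numpy as np


def one_hot(labels, num_classes):
    """Turn integer labels of shape (n,) into one-hot rows of shape (n, num_classes)."""
    labels = np.asarray(labels, dtype=int)
    return np.eye(num_classes)[labels]


def categorical_cross_entropy(y_onehot, probs, eps=1e-12):
    """Mean cross-entropy between one-hot targets and predicted probabilities."""
    probs = np.clip(probs, eps, 1.0)
    return float(-np.mean(np.sum(y_onehot * np.log(probs), axis=1)))


labels = np.array([3, 0, 9])            # integer labels, one per image
y = one_hot(labels, 10)                 # now the same width as a 10-unit output layer
uniform = np.full((3, 10), 0.1)         # a model that guesses all classes equally
loss = categorical_cross_entropy(y, uniform)
print(y.shape, loss)
```

A model that guesses uniformly over 10 classes has a loss of ln(10), about 2.30, which is a useful baseline when reading the training log.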

cold osprey May 6, 2023, 4:25 PM

#

👍

spiral smelt May 6, 2023, 4:26 PM

#

cold osprey could u print one of ur data and see how it looks like?

It might take a while cause my laptops slow, and I've set it to 20 epoche

cold osprey May 6, 2023, 4:26 PM

#

if loss is going down and accuracy/other metrics is going up, should be fine

spiral smelt May 6, 2023, 4:28 PM

#

cold osprey if loss is going down and accuracy/other metrics is going up, should be fine

Hopefully, sorry one other thing. So I want to see what number is assigned to each image. When I run this:

#

# Checks which class is assigned to which image
# Checks that they've been scaled correctly
fig, ax = plt.subplots(ncols=1, figsize=(20,20))
for idx, img in enumerate(batch_train[0][:10]):
    ax[idx].imshow(img)
    ax[idx].title.set_text(batch_train[1][idx])

#

it doesn't display a grid of images, with their number assigned to them

#

on my 4th Epoch, it's being incredibly slow

cold osprey May 6, 2023, 4:29 PM

#

training on a gpu?

#

if no, u can try google colab for free gpu

spiral smelt May 6, 2023, 4:30 PM

#

cold osprey training on a gpu?

That's awesome, I'll check it out

spiral smelt May 6, 2023, 4:33 PM

#

cold osprey if no, u can try google colab for free gpu

On 7 epoche now, the tension is getting to me 😆

cold osprey May 6, 2023, 4:37 PM

#

if ure using tensorboard, i think u can view the loss and accuracy in real time?

spiral smelt May 6, 2023, 4:37 PM

#

cold osprey if ure using tensorboard, i think u can view the loss and accuracy in real time?

Yeah it doesn't look good though, I'm hoping it'll improve

#

I'm on epoche 9 and it says the loss is 2.1518 and the accuracy is 0 :/

#

Sorry was looking at the wrong metric the accuracy is 0.2603 but isn't improving

cold osprey May 6, 2023, 4:47 PM

#

```py
model.add(Dense(1, activation='sigmoid'))
```
may need to change this to relu

#

id suggest looking for a tutorial on multi class classification and working from that instead

#

also pytorch > tensorflow hahah

#

high chance the problem is from the data

#

else the model just isnt good enough

spiral smelt May 6, 2023, 4:52 PM

#

cold osprey else the model just isnt good enough

My uni supplied the data so I have to use it, but I guess I'll write about it in the report. I tried relu but it didn't work so I've changed it to softmax
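
For context on the sigmoid/relu/softmax back-and-forth: with one-hot targets over several classes, the output layer usually needs one unit per class with a softmax, rather than a single sigmoid unit. A rough NumPy sketch (the 10-class count is an assumption for illustration, not taken from the dataset) shows that softmax outputs sum to 1 and that a model guessing uniformly scores a cross-entropy of ln(K), which is a handy baseline when a loss seems stuck.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax, shifted by the row max for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def categorical_cross_entropy(one_hot_targets, probs, eps=1e-12):
    """Mean cross-entropy between one-hot targets and predicted probabilities."""
    return float(-np.mean(np.sum(one_hot_targets * np.log(probs + eps), axis=1)))


num_classes = 10  # assumed for illustration
targets = np.eye(num_classes)                           # one sample per class, one-hot
uniform_logits = np.zeros((num_classes, num_classes))   # a model that has learned nothing

probs = softmax(uniform_logits)
baseline_loss = categorical_cross_entropy(targets, probs)
print(f"uniform-guess loss: {baseline_loss:.4f}")
```

In Keras terms this would roughly correspond to a final `Dense(num_classes, activation='softmax')` trained with `categorical_crossentropy`.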

cold osprey May 6, 2023, 4:53 PM

#

u can use more layers too

#

or bigger layers

spiral smelt May 6, 2023, 4:56 PM

#

cold osprey or bigger layers

Thank you, I'm just re-running it again 🙂 hopefully the outcome will be better

spiral smelt May 6, 2023, 4:57 PM

#

cold osprey or bigger layers

Accuracy is looking better this time

frigid lion May 6, 2023, 6:20 PM

#

hey so atm im doing jose portilla machine learning course on udemy and i would also like to do the andrew ng course on coursera but i see a lot of the content is behind the paywall do you think the free part of the course is good enough or wont make much sense without the paid lessons as well

#

i will soon end the jose portilla course i have just a few lessons left

past meteor May 6, 2023, 6:24 PM

#

My tip: go for a book after that course

frigid lion May 6, 2023, 6:26 PM

#

which book?

#

and why do you think so

past meteor May 6, 2023, 6:31 PM

#

My personal favourite is statlearning.com

frigid lion May 6, 2023, 6:32 PM

#

ive been reading a bit from this book while taking this course cuz jose recommended it as well

#

do you have any idea doe if the andrew ng course makes sense if i were not to pay for it

past meteor May 6, 2023, 6:33 PM

#

Normally you can always audit courses, which is follow them for free but some content is "hidden"

frigid lion May 6, 2023, 6:34 PM

#

ye i know i can audit for free but the amount of the things that are locked behind paywall seems like a lot and i feel like these are also important topics that are there

past meteor May 6, 2023, 6:35 PM

#

You can also read the sci-kit learn user guide. Some things might not make sense but you can google the terms to understand them better

#

You should read chapter 6, 10 and then 1, 2, 3 and 4

frigid lion May 6, 2023, 6:39 PM

#

past meteor You should read chapter 6, 10 and then 1, 2, 3 and 4

ok thanks

#

if some1 else has some knowledge about the andrew course i'd appreciate as well

cold osprey May 6, 2023, 6:42 PM

#

past meteor My personal favourite is statlearning.com

how maths heavy is this?

#

this was my first ml book https://www.oreilly.com/library/view/hands-on-machine-learning/9781492032632/

O’Reilly Online Learning

Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow,...

#

but i already knew some sk learn before this

#

mainly regression

#

now doing a pytorch course then have some personal projects planned

past meteor May 6, 2023, 6:45 PM

#

cold osprey how maths heavy is this?

Imo it's not super math heavy, there's no proofs or so in the book

cold osprey May 6, 2023, 6:46 PM

#

ah okay

past meteor May 6, 2023, 6:46 PM

#

Some formulas aren't derived fully either so sometimes it feels like they're making a "jump" but that also means it's pretty hands-on

cold osprey May 6, 2023, 6:47 PM

#

hand wavy maths is my fav kind

#

xd

past meteor May 6, 2023, 6:48 PM

#

For me it depends. I did an entire course on just support vector machines in uni. Most of it was math, most of it was fun. Doesn't really make you significantly better at using SVMs though 🤷‍♂️

sinful kelp May 6, 2023, 6:52 PM

#

I did a course in machine learning which was very maths and stats focused. It felt like it gave me a good foundation for a lot of the concepts, but when it comes to actual machine learning, there seemed to be a bit of a disconnect between the ideas and the actual methods in practice.

past meteor May 6, 2023, 7:06 PM

#

My first ML course actually only made sense to me after I did other courses... It was very theoretical and also covered stuff that is not really relevant like theta subsumption, inductive logic programming, ...

cold osprey May 6, 2023, 7:08 PM

#

hmm i dont have formal education for ML

#

but got the maths from my degree

sinful kelp May 6, 2023, 7:09 PM

#

I would say that the most useful concepts mainly came from statistics. I have found Bayesian statistics a very useful way to think about ML and data in general.

past meteor May 6, 2023, 7:10 PM

#

Bayesian stuff is cool until you run out of memory and that's the part they don't talk about in stats classes. In ML classes they will, they'll also tell you variational inference exists but they won't tell you that the probabilities you get out of it aren't great.

sinful kelp May 6, 2023, 7:11 PM

#

I would agree with that.

cerulean kayak May 6, 2023, 8:06 PM

#

does anyone know of an alternative to feature importance? at me if u respond

next valley May 6, 2023, 8:22 PM

#

Foundation in the mathematical theory and concepts is important if you want to make novel models; if you're just copying and pasting pre-made models, all you really need to get going are some hands to manipulate the data to fit the inputs of the pre-made model

cerulean kayak May 6, 2023, 8:46 PM

#

next valley Foundation in the mathematical theory and concepts is important if you want to ...

okay so I have a model that I made (you dont have to read it all but I wanted to be as specific as possible):

from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
x = data.drop('Rings', axis=1)
y = data['Rings']
x_train, x_test, y_train,y_test=train_test_split(x,y,test_size=0.3)
clf_gini = DecisionTreeClassifier(criterion='gini', max_depth=3, random_state=0)
clf_gini.fit(x_train, y_train)
y_pred_gini = clf_gini.predict(x_test)
print("Accuracy with gini index: {0:0.4f}".format(accuracy_score(y_test,y_pred_gini)))

and then I got 0.2706, which is of course abysmal (note: data is a pandas DataFrame created by read_csv). I want to be able to say "because this is really bad, we need to find out what is throwing us off." I know the answer is that we need to drop the sex column, but I don't know how to come to that conclusion. My friend did this using a random forest instead of a decision tree, so he used feature importance. I read online that feature importance is more for random forests than decision trees, so what should I use?
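
A single `DecisionTreeClassifier` exposes `feature_importances_` too, so it is not random-forest-only. An alternative that works with any fitted model is permutation importance (`sklearn.inspection.permutation_importance`): shuffle one column of held-out data and see how much the score drops. A small sketch on made-up data (the `signal` and `noise` columns are hypothetical, not the abalone features):

```python
import numpy as np
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Made-up data: the label depends only on `signal`; `noise` is irrelevant
rng = np.random.default_rng(0)
n = 600
signal = rng.normal(size=n)
noise = rng.normal(size=n)
X = np.column_stack([signal, noise])
y = (signal > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)
clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)

# Impurity-based importance, computed from the training splits
impurity_importance = clf.feature_importances_

# Permutation importance, measured on held-out data
perm = permutation_importance(clf, X_test, y_test, n_repeats=10, random_state=0)
perm_importance = perm.importances_mean

for name, imp, pimp in zip(["signal", "noise"], impurity_importance, perm_importance):
    print(f"{name}: impurity={imp:.3f} permutation={pimp:.3f}")
```

A column whose permutation importance is near zero (or negative) is a reasonable candidate to try dropping, though the final check is still to retrain without it and compare scores.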

mint palm May 6, 2023, 8:48 PM

#

I have an interview for Data Science role. I realise i am more into ML stuff, and been a while since I did my project and statistics stuff on R.
Can someone give me list of topic that you would revise before interview? Also i realise i forgot about various distributions, quantiles, QQ plot etc. so please try to include related important stuff.
Thanks in advance for your time.

sinful kelp May 6, 2023, 8:49 PM

#

mint palm I have an interview for Data Science role. I realise i am more into ML stuff, an...

can you be a bit more specific? what type of position is it? at what level?

cerulean kayak May 6, 2023, 8:49 PM

#

mint palm I have an interview for Data Science role. I realise i am more into ML stuff, an...

would you mind if I dmed you?

mint palm May 6, 2023, 8:49 PM

#

cerulean kayak would you mind if I dmed you?

please.

mint palm May 6, 2023, 8:50 PM

#

sinful kelp can you be a bit more specific? what type of position is it? at what level?

entry level but i expect some detailed question too as its a startup

agile cobalt May 6, 2023, 8:50 PM

#

cerulean kayak okay so I have a model that I made (you dont have to read it all but I wanted to...

you know that a random forest is literally a bunch of decision trees thrown together right?

a single decision tree is an extremely limited model

#

max depth 3 also sounds a bit shallow, though that depends on your data

cerulean kayak May 6, 2023, 8:53 PM

#

agile cobalt max depth 3 also sounds a bit shallow, though that depends on your data

including the target varible(y): 9. Do you have any idea how I should find out a good depth?

agile cobalt May 6, 2023, 8:54 PM

#

cerulean kayak including the target varible(y): 9. Do you have any idea how I should find out a...

you mean number of columns or?..

next valley May 6, 2023, 8:54 PM

#

cerulean kayak okay so I have a model that I made (you dont have to read it all but I wanted to...

There are many ways. Also, a random forest is technically just a bunch of decision trees that average their results to give an output
since you are using a decision tree classifier, it's best to first see your datas total columns as decision trees branch based columns averages

One reason may be that your data has way more dimensions than your tree can handle, so perhaps it'll pay off to increase the depth of the tree

If the depth of the tree exceeds the total columns of the data, there may be issues with the data itself; try seeing if the data is complete, i.e. there are no missing values

It may also be the case that the data itself is non-linear in nature, therefore it'll be hard for a decision tree to model the data
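
On picking a depth: rather than tying it to the column count, one common approach is to treat `max_depth` as a hyperparameter and compare a few values by cross-validation. A minimal sketch with `GridSearchCV`, using synthetic data from `make_classification` as a stand-in (the candidate depths are arbitrary choices for illustration):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data with 8 feature columns
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=4, random_state=0
)

# Candidate depths to compare; None lets the tree grow until leaves are pure
depths = [2, 3, 4, 6, 8, 12, None]
search = GridSearchCV(
    DecisionTreeClassifier(random_state=0),
    param_grid={"max_depth": depths},
    cv=5,
    scoring="accuracy",
)
search.fit(X, y)

best_depth = search.best_params_["max_depth"]
best_score = search.best_score_
print(f"best max_depth={best_depth}, mean CV accuracy={best_score:.3f}")
```

Very deep trees tend to overfit, so the best cross-validated depth is often somewhere in the middle of the range.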

agile cobalt May 6, 2023, 8:54 PM

#

but about the original question, as far as I am aware, using random forests (even if your final model isn't a random forest) is the best way to automatically determine which features are or aren't important
usually you may want to filter by hand as well

cerulean kayak May 6, 2023, 8:55 PM

#

agile cobalt you mean number of columns or?..

yes.

next valley May 6, 2023, 9:01 PM

#

you can also speed up inference or sending data for inference via dimension reducing techniques such as PCA, which is basically finding which feature (columns) influence the labels (prediction) the most

Course this also comes at the cost of you potentially discarding important information that may pop up in the future that would be important to your model

#

@cerulean kayak that help?

cerulean kayak May 6, 2023, 9:04 PM

#

give me a second, I vaguely know what you are talking about. If you can't tell, I'm in a college class and a lot of my problems stem from the fact that I know stuff but I don't know its specific name.

next valley May 6, 2023, 9:05 PM

#

Pca is principal component analysis

sinful kelp May 6, 2023, 9:10 PM

#

https://scikit-learn.org/stable/modules/generated/sklearn.decomposition.PCA.html

scikit-learn

sklearn.decomposition.PCA


cerulean kayak May 6, 2023, 9:11 PM

#

next valley Pca is principal component analysis

is this a form of preprocessing?

sinful kelp May 6, 2023, 9:11 PM

#

you could also consider using SHAP to determine feature importance https://towardsdatascience.com/using-shap-values-to-explain-how-your-machine-learning-model-works-732b3f40e137

Medium

Using SHAP Values to Explain How Your Machine Learning Model Works

Learn to use a tool that shows how each feature affects every prediction of the model

next valley May 6, 2023, 9:12 PM

#

Kind of, it's more of an analysis of what columns you don't need, hence analysis in principal component analysis

#

@cerulean kayak

cerulean kayak May 6, 2023, 9:14 PM

#

okay, and PCA is not limited to clustering algorithms? because I know it deals with knn, which is a clustering algorithm

sinful kelp May 6, 2023, 9:15 PM

#

no PCA is a method for transforming your data into a new space where each axis explains how the data varies.

#

It's typically used for visualizing high-dimensional data (probably where you saw it being used for KNNs), but it can also be used to generate new, more relevant features from your data
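
To make that concrete, here is a small sketch with `sklearn.decomposition.PCA` on made-up data (three columns, two of them strongly correlated; all names are illustrative). Note that PCA never looks at the labels: it only finds directions of high variance in the features, and each new axis is a mix of the original columns.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Made-up features: column 1 is almost a multiple of column 0, column 2 is independent
rng = np.random.default_rng(0)
base = rng.normal(size=(200, 1))
X = np.hstack([
    base,
    2 * base + 0.1 * rng.normal(size=(200, 1)),
    rng.normal(size=(200, 1)),
])

# PCA is scale-sensitive, so standardise the columns first
X_scaled = StandardScaler().fit_transform(X)

pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X_scaled)   # new coordinates, shape (200, 2)
ratios = pca.explained_variance_ratio_    # share of variance per new axis

print(X_reduced.shape, ratios)
```

Because the first two columns carry nearly the same information, the first component should absorb most of their variance, which is why two components are enough here.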

cerulean kayak May 6, 2023, 9:17 PM

#

agile cobalt but about the original question, as far as I am aware, using random forests (_ev...

also what the heck do you mean by hand as well? That can't be machine learning.

cold osprey May 6, 2023, 9:18 PM

#

as in drop them from X

next valley May 6, 2023, 9:18 PM

#

cerulean kayak also what the heck do you mean by hand as well? That can't be machine learning.

Actually, most of machine learning is sifting through a sample of the dataset by hand to make sense of it at first

cold osprey May 6, 2023, 9:18 PM

#

like if u know feature_45 is not useful, u may not even query the data from the database say

sinful kelp May 6, 2023, 9:20 PM

#

The predictions are learnt automatically from the model (e.g. the decision tree). The feature engineering (pre-processing of the data) can often be done by hand.

agile cobalt May 6, 2023, 9:23 PM

#

cerulean kayak also what the heck do you mean by hand as well? That can't be machine learning.

look into your data and make sure that the things you are feeding into the model make sense before putting your data into a model?
data collection and preprocessing are extremely important steps, despite not being part of the model itself

cerulean kayak May 6, 2023, 9:24 PM

#

agile cobalt look into your data and make sure that the things you are feeding into the model...

okay so ya. a line I omitted was

sex_map={'M':1,'F':2,'I':3}
data['Sex']=data['Sex'].map(sex_map)

which is kinda an example of that.

cold osprey May 6, 2023, 9:24 PM

#

oh hmmm

#

gender isnt ordinal tho

#

i guess it doesnt matter for a tree based model?

agile cobalt May 6, 2023, 9:25 PM

#

"i"?

cold osprey May 6, 2023, 9:25 PM

#

no it does i think

agile cobalt May 6, 2023, 9:26 PM

#

a tree based model would do something like 1 goes left, 2 and 3 goes right or 1, 2 goes left and 3 goes right

cold osprey May 6, 2023, 9:26 PM

#

I for 'i dont know'

cerulean kayak May 6, 2023, 9:26 PM

#

agile cobalt "i"?

infant apparently

cold osprey May 6, 2023, 9:26 PM

#

agile cobalt a tree based model would do something like `1 goes left, 2 and 3 goes right` or ...

still treats it as categorical? thought it would do <=2 go left, <1 go right

#

which implies some order in the feature

agile cobalt May 6, 2023, 9:27 PM

#

no, it does treat it as numbers - I'm just using the discrete labels because those are all the possible values

cold osprey May 6, 2023, 9:27 PM

#

ah okay

sinful kelp May 6, 2023, 9:27 PM

#

next valley Actually, most of machine learning is sifting through a sample of the dataset b...

what type of explanation are you hoping to find for this project? If you determined that this feature hurts your model's performance, that's evidence enough to remove it from training.

agile cobalt May 6, 2023, 9:27 PM

#

but yeah it cannot do 1, 3 left, 2 right in one split I think

cold osprey May 6, 2023, 9:27 PM

#

yeah idt it can if its treating it as numbers

cerulean kayak May 6, 2023, 9:30 PM

#

so is mapping it like this wise? because im basing this off a lab my ta did for a DT and they did the same thing but with doors on a car: {2 doors:2, 3 doors:3 4+ doors : 3}

cold osprey May 6, 2023, 9:30 PM

#

read up ordinal vs nominal data

#

stuff like gender, brand & colour is nominal

#

generation (boomer, millennial, genZ) is an example of ordinal

#

hmm thinking about it, generation may not necessarily be ordinal too, depending on the context

cloud marsh May 6, 2023, 9:33 PM

#

cupy basically reimplements numpy methods to use CUDA where possible right? how are dependencies resolved for higher level projects that depend on numpy?

cerulean kayak May 6, 2023, 9:35 PM

#

okay and real quick @agile cobalt what do you think I should do for the depth of my tree?

next valley May 6, 2023, 9:35 PM

#

cerulean kayak okay and real quick @agile cobalt what do you think I should do for the ...

bruh, did you not read the large ass blurb i sent your first?

cold osprey May 6, 2023, 9:54 PM

#

Just experimented with a transfer learning model

#

does the constant up and down fluctuations of loss and accuracy mean anything?

next valley May 6, 2023, 9:57 PM

#

academic term for it is called high variance iirc, there are a lot of things that can affect this and it depends highly on what exactly your model is and how you are feeding the data

cold osprey May 6, 2023, 9:58 PM

#

for background, its a EfficientNet B0 model that im tuning the fully connected layer to classify 3 food classes

#

proly an overkill model but ye

next valley May 6, 2023, 9:58 PM

#

may be that the layers you didn't freeze have too high of a learning rate set to them

#

or it may be beneficial to increase the batch size

cold osprey May 6, 2023, 9:59 PM

#

0.001 seems pretty small hahah

#

batch size is 32 en. lemme double it

#

another q, do we need train val test datasets for NNs?

#

or is 2 sets enough

next valley May 6, 2023, 10:00 PM

#

what?

#

oh

cold osprey May 6, 2023, 10:00 PM

#

currently im only splitting my data into train and test

next valley May 6, 2023, 10:01 PM

#

train/val(dev)/test is extremely important for fine-tuning a model; without a test set you risk overfitting your model to the val/dev set as well when trying to address variance issues between your train and val/dev sets
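
A minimal sketch of a three-way split, assuming a 60/20/20 ratio and using `train_test_split` on sample indices (the index and label arrays are stand-ins; the resulting index sets could then select samples for each dataset or loader):

```python
import numpy as np
from sklearn.model_selection import train_test_split

indices = np.arange(1000)                       # stand-in for sample indices
labels = np.repeat([0, 1, 2], [334, 333, 333])  # stand-in for 3 class labels

# First carve off 20% as the untouched test set
train_val_idx, test_idx = train_test_split(
    indices, test_size=0.2, stratify=labels, random_state=0
)
# Then split the remaining 80% into 60% train / 20% val (0.25 * 0.8 = 0.2)
train_idx, val_idx = train_test_split(
    train_val_idx, test_size=0.25, stratify=labels[train_val_idx], random_state=0
)

print(len(train_idx), len(val_idx), len(test_idx))
```

The validation set is used for tuning (learning rate, epochs, dropout), and the test set is only evaluated once at the end.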

cold osprey May 6, 2023, 10:01 PM

#

cool

#

thats what i thought

#

larger batch size = more gpu memory usage, coz more data has to be in memory when updating the model params?

next valley May 6, 2023, 10:04 PM

#

depends on where you are loading the batches to yes

#

also i would suggest you reduce the epoch count, it doesn't seem like the dataset you are fine tuning it to is large enough to justify 100 iterations on it

cold osprey May 6, 2023, 10:05 PM

#

yeah haha

#

50 seems more than enough

#

oh hmm something went weird

#

epoch 81 onwards, both loss became nan

#

and accuracy tanked

next valley May 6, 2023, 10:08 PM

#

something may be wrong with your data

cold osprey May 6, 2023, 10:08 PM

#

next valley May 6, 2023, 10:08 PM

#

oh

cold osprey May 6, 2023, 10:08 PM

#

some params went to zero in the model im guessing

#

or overflow?

next valley May 6, 2023, 10:09 PM

#

no, try clearing the variable holding the accuracy train_accuracy test_accuracy

gloomy saddle May 6, 2023, 10:09 PM

#

probably means something went wrong in one of your scoring or loss functions 🙂

next valley May 6, 2023, 10:10 PM

#

note how train_accuracy and test_accuracy go beyond 80 epochs

cold osprey May 6, 2023, 10:10 PM

#

ya coz loss at epoch 80 onwards is nan

next valley May 6, 2023, 10:11 PM

#

i have no clue what your code structure is but it may have been that you forgot to reinitialize the variables you used to graph the loss and accuracy

cold osprey May 6, 2023, 10:12 PM

#

ye i have hella code

#

sec

#

!code

arctic wedgeBOT May 6, 2023, 10:12 PM

#

Formatting code on discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

cold osprey May 6, 2023, 10:12 PM

#

https://paste.pythondiscord.com/adigefukaz

#

there

#

at the bottom, i have a train function which i call with all the parameters i need

#

seems like the only way loss can be nan is the len(dataloader) being 0

#

line 69 and 122

#

https://discuss.pytorch.org/t/nan-loss-coming-after-some-time/11568/6

PyTorch Forums

Nan Loss coming after some time

You could use a normalization layer. Alternatively, you can try dividing by some constant first (perhaps equal to the max value of your data?) The idea is to get the values low enough that they don’t cause really large gradients.

#

gradients...

next valley May 6, 2023, 10:18 PM

#

unfortunately I use tensorflow, but based on my limited knowledge of pytorch my assumption is that you didn't update the test_dataloader variable to be the new epoch

#

try adding a print statement with print(len(test_dataloader)) at line 122

#

that's the best i can come up with to verify that you did things right

#

PepegaPls

cold osprey May 6, 2023, 10:20 PM

#

haha i cba tbh since i wont be running 100 epochs

#

will leave it for future me to figure out when i run a model that does require that many epochs

next valley May 6, 2023, 10:22 PM

#

other than that I'm unsure what parts of the model you are training, but I think adding some form of regularization may help, or maybe reshuffle the images in the dataset to see if you have some batches that are easier than others

cold osprey May 6, 2023, 10:23 PM

#

the base model EfficientNet B0

#

trained on 1 mil images, 1k classes

next valley May 6, 2023, 10:24 PM

#

like, the whole model

#

MonkaS

cold osprey May 6, 2023, 10:24 PM

#

yeye

#

pretty strong base model

#

just fine tuned on the fully connected layer

#

https://pytorch.org/vision/main/models/generated/torchvision.models.efficientnet_b0.html

next valley May 6, 2023, 10:25 PM

#

oh, i thought you were training all layers; by the whole model i meant that you unfroze all the layers

cold osprey May 6, 2023, 10:25 PM

#

nah nah hahaha

#

froze all the CNN bits of it

next valley May 6, 2023, 10:27 PM

#

besides those things, i guess another thing to help with regularization to smooth out training loss would be to add dropout to the MLP if it isn't already there, I'm unsure about the architecture of efficientnet

cold osprey May 6, 2023, 10:27 PM

#

yep there a dropout layer

#

p=0.2

#

can maybe increase it

next valley May 6, 2023, 10:35 PM

#

Yup

cerulean kayak May 7, 2023, 12:03 AM

#

next valley bruh, did you not read the large ass blurb i sent your first?

1) understand and read are 2 very different verbs
2) I am not as smart as you. And you've been saying a lot of stuff, so that could refer to a lot of different posts.

next valley May 7, 2023, 12:11 AM

#

cerulean kayak 1) understand and read are 2 very diffrent verbs 2). I am not as smart as you. ...

You could have asked me questions on what parts sounded confusing

cerulean kayak May 7, 2023, 12:26 AM

#

next valley You could have asked me questions on what parts sounded confusing

okay.

> it's best to first see your datas total columns as decision trees branch based columns averages
what do you mean by "datas total columns as decision tree branch based columns averages"?

#

also if you are not supposed to do mapping to values for nominal data, should I use dummies instead?

next valley May 7, 2023, 12:35 AM

#

cerulean kayak okay. > it's best to first see your datas total columns as decision trees branc...

Poorly worded on my part

Decision trees work at a high level by choosing a column and then deciding how to branch into another column
Let's say that your data is a matrix/tensor of dimensions/shape [m, n]
Therefore in theory the most optimal branch depth would be where depth = n before the tree starts looping over all columns

On the subject of mapping to values for nominal data, traditionally it doesn't matter as decision trees can also split on nominal data

however sklearn decision tree classifier cannot handle nominal data and therefore you must transform it to be ordinal

#

@cerulean kayak

#

Also, when i mean optimal i mean by accuracy, if you want optimal in terms of accuracy and inference speed then you'll need to figure out how to reduce how many features (columns) are in your data set

#

Hence you can perform pca to prune features that are deemed irrelevant

cerulean kayak May 7, 2023, 12:45 AM

#

next valley Also, when i mean optimal i mean by accuracy, if you want optimal in terms of ac...

no I don't care about speed. Python's motto should be "what speed, lol"
but seriously just accuracy.

also, so will mapping sex to 1s and 2s make it inaccurate?

cold osprey May 7, 2023, 12:46 AM

#

u can one hot encode them

cerulean kayak May 7, 2023, 12:46 AM

#

%$#$@#&%
okay...

next valley May 7, 2023, 12:47 AM

#

cerulean kayak no I don't care about speed. Python's motto should be "what speed, lol" but seri...

Yes if there's more than 2 items you are trying to map but as someone else said, just one hot encode them, which will not decrease accuracy
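
One-hot encoding the column could look roughly like this with `pd.get_dummies` (the tiny DataFrame is made up and only mimics the `Sex` and `Rings` columns mentioned above; `Length` and its values are placeholders):

```python
import pandas as pd

# Made-up rows standing in for the real CSV
data = pd.DataFrame({
    "Sex": ["M", "F", "I", "M"],
    "Length": [0.455, 0.530, 0.330, 0.440],
    "Rings": [15, 9, 7, 10],
})

# Replace the single Sex column with one 0/1 column per category
encoded = pd.get_dummies(data, columns=["Sex"], dtype=int)

x = encoded.drop("Rings", axis=1)
y = encoded["Rings"]
print(x.columns.tolist())
```

Each row has exactly one of `Sex_F`, `Sex_I`, `Sex_M` set to 1, so the tree can split on a category directly without any implied ordering between M, F and I.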

#

It all depends on how you manipulate your data, welcome to machine learning where 80% of the time is asking how tf do i make my data work

cerulean kayak May 7, 2023, 12:50 AM

#

o trust me, yesterday my model had 0.76 accuracy and today it has 0.23 and i changed the print from a .format to print(f"")

cold osprey May 7, 2023, 12:50 AM

#

sounds like a bigger problem than the print

cerulean kayak May 7, 2023, 12:52 AM

#

well i have 2 other witnesses who say the same thing

cold osprey May 7, 2023, 1:46 AM

#

Anyone got any sklearn subclassing code i can refer to?

#

or is that not a thing?

serene scaffold May 7, 2023, 1:51 AM

#

cold osprey Anyone got any sklearn subclassing code i can refer to?

what sklearn thing did you want to subclass?

cold osprey May 7, 2023, 1:52 AM

#

serene scaffold what sklearn thing did you want to subclass?

an estimator maybe?

#

like how its done in pytorch/tensorflow

serene scaffold May 7, 2023, 1:53 AM

#

an estimator? not sure what you're referring to.

cold osprey May 7, 2023, 1:55 AM

#

any classifier

#

https://towardsdatascience.com/how-to-build-a-custom-estimator-for-scikit-learn-fddc0cb9e16e found this

Medium

How to Build a Custom Estimator for scikit-learn

Implementing a custom ensemble model with under-sampling for imbalanced data

#

nothing this complicated but what im saying is, whats the diff of declaring a classifier like

```py
lin_reg = LinearRegression()
```

#

and
```py
class SpecialLinearRegression(LinearRegression):
    def __init__(self):
        pass

special_lin_reg = SpecialLinearRegression()
```
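
On the difference: a plain `LinearRegression()` runs the library's own `__init__`, which stores its hyperparameters on the instance, while an overriding `__init__` that just does `pass` skips that, so a later `fit` is likely to fail on missing attributes. A hedged sketch of a subclass that keeps the parent behaviour and changes one thing (`NonNegativeLinearRegression` is a made-up example class, not part of scikit-learn):

```python
import numpy as np
from sklearn.linear_model import LinearRegression


class NonNegativeLinearRegression(LinearRegression):
    """Hypothetical subclass: ordinary least squares, predictions clipped at 0."""

    def __init__(self, fit_intercept=True):
        # Let the parent store its hyperparameters instead of skipping them
        super().__init__(fit_intercept=fit_intercept)

    def predict(self, X):
        # Reuse the parent's prediction, then apply the custom behaviour
        return np.maximum(super().predict(X), 0.0)


# Toy data lying on the line y = x - 2
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = np.array([-2.0, -1.0, 0.0, 1.0])

model = NonNegativeLinearRegression().fit(X, y)
preds = model.predict(np.array([[0.0], [3.0]]))
print(preds)
```

If no behaviour needs changing, subclassing adds nothing over instantiating `LinearRegression` directly.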

cloud marsh May 7, 2023, 2:16 AM

#

what are some good options for managing data you plan on sending to tensorboard?

#

or just tensorboard tooling and data science log/benchmark data in general

dense oar May 7, 2023, 5:43 AM

#

Any good resource recommendations (websites, books, etc) for learning AI with Python? For complete beginners

cloud marsh May 7, 2023, 6:19 AM

#

dense oar Any good resource recommendations (websites, books, etc) for learning AI with Py...

what do you mean by complete beginner?

magic dune May 7, 2023, 6:21 AM

#

import numpy as np
import matplotlib.pyplot as plt


class NeuralNetwork:
    def __init__(self, layers, lr, epoch, X, t):
        self.lr = lr
        self.epoch = epoch
        self.layers = layers
        self.X = X
        self.t = t
        self.weights = {layer_idx: np.random.randn(layers[layer_idx + 1], layers[layer_idx]) / 5 for layer_idx in
                        range(len(layers) - 1)}
        self.bias = np.random.randn((len(layers) - 1), 1) / 5
        self.z_dict = {i: np.zeros((layers[i])) for i in range(len(layers))}
        self.z_dict[0] = X[0].flatten()
        delta_3 = (self.z_dict[2][0] - t[0]) * (self.z_dict[2][0] * (1 - self.z_dict[2][0]))
        delta_4 = (self.z_dict[2][1] - t[1]) * (self.z_dict[2][1] * (1 - self.z_dict[2][1]))
        self.delta = np.array([delta_3, delta_4])
        self.plot_data = []

    def forward(self):
        # Use the instance's own data and layer sizes rather than module-level globals
        for z in self.X:
            z = z.reshape(-1, 1)
            for layer_idx in range(1, len(self.layers)):
                a = np.matmul(self.weights[layer_idx - 1], z) + self.bias[layer_idx - 1]
                z = 1 / (1 + np.exp(-a))
                self.z_dict[layer_idx] = z.flatten()
            error = 0.5 * (z.flatten() - self.t) ** 2
        total_error = np.sum(error)
        return total_error

    def sigmoid(self, z):
        # Despite the name, this is the sigmoid's derivative expressed via its output z
        return z * (1 - z)

    def backward(self):
        for l in reversed(range(len(self.weights))):
            diag = np.diag(self.delta)
            arr = np.array([self.z_dict[l], self.z_dict[l]])
            new_derivatives = np.matmul(diag, arr)
            self.weights[l] = self.weights[l] - (self.lr * new_derivatives)
            self.bias[l] = sum(self.delta)
            sigmoid_arr = np.diag(self.sigmoid(self.z_dict[l]))
            self.delta = np.matmul(sigmoid_arr, np.matmul(self.weights[l].T, self.delta))
        return self.weights, self.bias

    def train(self):
        for e in range(self.epoch):
            total_error = self.forward()
            self.plot_data.append([e, total_error])
            self.backward()
            print(f"{e}: {total_error}")
        return self.weights, self.bias,

    def predict(self):
        return self.z_dict[2]

    def plot(self):
        data = np.array(self.plot_data)
        plt.scatter(data[:, 0], data[:, 1])
        print(data[:, 0])
        print(data[:, 1])
        plt.xlabel("Epoch")
        plt.ylabel("Total Error")
        plt.show()



if __name__ == '__main__':
    X = np.array([[0.05, .10]])
    t = np.array([1.00, 3.00])
    lr = 0.5
    n = 2
    H = 2
    output = 2
    epoch = 40000
    layers = [n, H, output]
    nn = NeuralNetwork(layers, lr, epoch, X, t)
    nn.train()
    print(nn.predict())
    nn.plot()

rate my neural network code?

magic dune May 7, 2023, 6:21 AM

#

cloud marsh what do you mean by complete beginner?

this

cloud marsh May 7, 2023, 6:22 AM

#

magic dune this

ok well thank god it's not javascript

magic dune May 7, 2023, 6:23 AM

#

cloud marsh ok well thank god it's not javascript

lol

cloud marsh May 7, 2023, 6:27 AM

#

magic dune lol

i have no control-f here. wtf lol. does that sigmoid function work?
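
For reference, a quick numerical check of what `z * (1 - z)` computes: it is the derivative of the sigmoid written in terms of the sigmoid's output, not the sigmoid itself. The function names below are illustrative and not taken from the posted class.

```python
import numpy as np


def sigmoid(a):
    """Logistic function: squashes any real input into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-a))


def sigmoid_grad_from_output(z):
    """Derivative of the sigmoid, given z = sigmoid(a)."""
    return z * (1.0 - z)


a = np.linspace(-3.0, 3.0, 7)
z = sigmoid(a)

# Compare against a central finite difference of sigmoid itself
h = 1e-6
numeric = (sigmoid(a + h) - sigmoid(a - h)) / (2 * h)
analytic = sigmoid_grad_from_output(z)
max_gap = float(np.max(np.abs(numeric - analytic)))
print(f"largest difference: {max_gap:.2e}")
```

Since sigmoid outputs stay strictly between 0 and 1, a target such as 3.00 cannot be reached by a network whose last layer is a sigmoid.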

magic dune May 7, 2023, 6:28 AM

#

cloud marsh i have no control-f here. wtf lol. does that sigmoid function work?

have u ever heard of squash function in math

cloud marsh May 7, 2023, 6:31 AM

#

dense oar Any good resource recommendations (websites, books, etc) for learning AI with Py...

quantecon has a few good books on finance, but also a good intro to python & data science:

I would start here: basics on the python ecosystem for data science programming for econ/finance: https://python-programming.quantecon.org/intro.html

quantecon with python: this is probably a bit too much if you're in HS, but it gives you information about where to find data sets for econ/finance. https://python.quantecon.org/intro.html
quantecon with julia (mostly the same problems as above, but in Julia) https://julia.quantecon.org/intro.html
network economics: https://networks.quantecon.org/

when i looked at finance/econ in the past, it seemed that getting data sets and access to data streams were about as complicated as any programming.

cloud marsh May 7, 2023, 6:32 AM

#

magic dune have u ever heard of squash function in math

i don't think i've heard it called that.

#

so z is a probability then. does np.diag actually do diagonalization (like with Jordan Normal Form, i might be mincing terminology here) or does it just take the diagonal?

https://en.wikipedia.org/wiki/Jordan_normal_form

wooden sail May 7, 2023, 6:39 AM

#

cloud marsh so z is a probability then. does np.diag actually diagonalization (like with Jor...

not at all

#

also jordan normal forms are not diagonalization in general

#

what np diag does is one of two things:

if you give it a vector, it spits out a diagonal matrix that is 0 everywhere except on its diagonal where it has your vector
if you give it a matrix, it takes the diagonal of that matrix and spits it out as a vector

cloud marsh May 7, 2023, 6:42 AM

#

so, it's maybe useful when you want the variances and not the covariances

wooden sail May 7, 2023, 6:42 AM

#

!e

import numpy as np
M = np.random.normal(size=(3,3))
m = np.diag(M)
print(M)
print(m)

M_hat = np.diag(m)
print(M_hat)

arctic wedgeBOT May 7, 2023, 6:42 AM

#

@wooden sail :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | [[-0.36296454 -1.46717512  2.73763072]
002 |  [ 1.42493807  0.14579904  0.99692921]
003 |  [-1.60806289  0.37930145  1.77627134]]
004 | [-0.36296454  0.14579904  1.77627134]
005 | [[-0.36296454  0.          0.        ]
006 |  [ 0.          0.14579904  0.        ]
007 |  [ 0.          0.          1.77627134]]

wooden sail May 7, 2023, 6:42 AM

#

that could be an example, sure
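
As a small sketch of that use case (the random data and the names `X`, `cov`, and `variances` are made up for illustration): `np.cov` returns the full covariance matrix, and `np.diag` pulls out just the variances on its diagonal.

```python
import numpy as np

# Hypothetical data: 500 samples of 3 features.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))

# Full 3x3 covariance matrix; rowvar=False means each column is a variable.
cov = np.cov(X, rowvar=False)

# np.diag on a matrix extracts its diagonal, i.e. the per-feature variances.
variances = np.diag(cov)

print(cov.shape)        # (3, 3)
print(variances.shape)  # (3,)
```

The off-diagonal covariances are simply dropped; calling `np.diag(variances)` would rebuild a diagonal matrix with zeros in their place.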

past meteor May 7, 2023, 6:55 AM

#

cold osprey Anyone got any sklearn subclassing code i can refer to?

Do you still need this?

cloud marsh May 7, 2023, 7:02 AM

#

why would errors like this "Unsatisfied version of shared singleton module @jupyter-widgets/base" occur?

i'm getting a similar warning when trying to render textures with k3d. i'm trying to evaluate whether I can write data to the texture and update it, but it's not rendering.

https://github.com/jupyterlab/jupyterlab-desktop/issues/576

#

i was looking into diagonalizability about 6 months ago. i can't remember why, but i came across jordan normal form. i guess it's what you can do if it's not diagonalizable @wooden sail. if i could ever get past the boilerplate of setting up environments/languages, then i could actually apply things and then probably retain them.

cloud marsh May 7, 2023, 9:12 AM

#

ok i guess i could just use PyVista/Trame then

flint gazelle May 7, 2023, 10:10 AM

#

Why can't I use an activation function in the last layer here:
```python3
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(70,), dtype=tf.int8),
    tf.keras.layers.Dense(400, activation='relu'),
    tf.keras.layers.Dense(2000, activation='relu'),
    tf.keras.layers.Dense(1500, activation='relu'),
    tf.keras.layers.Dense(1000, activation='relu'),
    tf.keras.layers.Dense(400, activation='relu'),
    tf.keras.layers.Dense(200, activation='relu'),
    tf.keras.layers.Dense(1, activation='sigmoid'),
])
```

Epoch 1/2000
518/518 [==============================] - 13s 22ms/step - loss: 0.4892 - mae: 0.4892 - val_loss: 0.4873 - val_mae: 0.4873
Epoch 2/2000
518/518 [==============================] - 11s 22ms/step - loss: 0.4891 - mae: 0.4891 - val_loss: 0.4873 - val_mae: 0.4873
Epoch 3/2000
518/518 [==============================] - 11s 21ms/step - loss: 0.4891 - mae: 0.4891 - val_loss: 0.4873 - val_mae: 0.4873
Epoch 4/2000
518/518 [==============================] - 11s 22ms/step - loss: 0.4891 - mae: 0.4891 - val_loss: 0.4873 - val_mae: 0.4873
Epoch 5/2000
518/518 [==============================] - 11s 22ms/step - loss: 0.4891 - mae: 0.4891 - val_loss: 0.4873 - val_mae: 0.4873
Epoch 6/2000
518/518 [==============================] - 11s 21ms/step - loss: 0.4891 - mae: 0.4891 - val_loss: 0.4873 - val_mae: 0.4873

#

but if i use no activation function

Epoch 1/2000
518/518 [==============================] - 13s 22ms/step - loss: 0.7076 - mae: 0.7076 - val_loss: 0.2777 - val_mae: 0.2777
Epoch 2/2000
518/518 [==============================] - 11s 22ms/step - loss: 0.2634 - mae: 0.2634 - val_loss: 0.2562 - val_mae: 0.2562
Epoch 3/2000
518/518 [==============================] - 11s 22ms/step - loss: 0.2405 - mae: 0.2405 - val_loss: 0.2353 - val_mae: 0.2353
Epoch 4/2000
518/518 [==============================] - 13s 24ms/step - loss: 0.2238 - mae: 0.2238 - val_loss: 0.2233 - val_mae: 0.2233
Epoch 5/2000
518/518 [==============================] - 11s 21ms/step - loss: 0.2098 - mae: 0.2098 - val_loss: 0.2134 - val_mae: 0.2134
Epoch 6/2000
518/518 [==============================] - 11s 21ms/step - loss: 0.1987 - mae: 0.1987 - val_loss: 0.2025 - val_mae: 0.2025

using any other activation function in the output layer results in no progress in training. I also tried to just use

def custom(x):
    return tf.clip_by_value(x, clip_value_min=0, clip_value_max=1)

The output has to be between 0 and 1. Can anyone help me here ? I appreciate any help.

past meteor May 7, 2023, 10:14 AM

#

flint gazelle Why can't I use an activation function in the last layer here: ...

What is your x and y?

flint gazelle May 7, 2023, 10:15 AM

#

x_shape(70,) y_shape(1,)

#

x is a chessboard 8*8 + extra info 6 bytes and y the evaluation score for the board

past meteor May 7, 2023, 10:16 AM

#

y is real valued between 0 and 1?

flint gazelle May 7, 2023, 10:17 AM

#

yes

past meteor May 7, 2023, 10:17 AM

#

Then you need a linear activation and not sigmoid in your last layer

flint gazelle May 7, 2023, 10:18 AM

#

but

def custom(x):
    return tf.clip_by_value(x, clip_value_min=0, clip_value_max=1)

is linear, right ?

past meteor May 7, 2023, 10:19 AM

#

Linear is the default, just remove sigmoid.

flint gazelle May 7, 2023, 10:19 AM

#

yeah I know, but the output values have to be between 0 and 1. If I just don't use an activation function, there will be values higher than 1 and lower than 0

past meteor May 7, 2023, 10:20 AM

#

Have you tested this so far?

flint gazelle May 7, 2023, 10:20 AM

#

Yes

past meteor May 7, 2023, 10:20 AM

#

And it was outside the range 0 to 1?

flint gazelle May 7, 2023, 10:20 AM

#

Yes

past meteor May 7, 2023, 10:20 AM

#

You can also just use a linear activation and clip inside of your forward method. EDIT: it's called `call` in TensorFlow

#

What loss are you using?

flint gazelle May 7, 2023, 10:25 AM

#

mae

#

I also tried mse but mae worked better

#

But to me it's still weird that I can't use any activation function, even the clip one. And I think I said something wrong: it's not the score of the board but the expectancy that white might win. 1 is white completely winning, 0.75 is white having an advantage, 0.5 is even, 0.25 is a disadvantage for white, and 0 is completely losing, but with continuous values, so any value in between is possible

#

The y values for training are also continuous between 0 and 1

past meteor May 7, 2023, 10:27 AM

#

For example for predicting pixels I've done linear => sigmoid before with binary cross entropy loss

#

MSE could work as well in this case

flint gazelle May 7, 2023, 10:32 AM

#

I am currently trying, but it doesn't look that promising

past meteor May 7, 2023, 10:34 AM

#

I'd also just make your network a lot smaller

flint gazelle May 7, 2023, 10:34 AM

#

I have to finish training to see that

lavish kraken May 7, 2023, 10:34 AM

#

if you want to learn about XAI with Python: https://github.com/PacktPublishing/Hands-On-Explainable-AI-XAI-with-Python

past meteor May 7, 2023, 10:34 AM

#

Helping people debug their networks is hard if I'm not sitting next to them 🤣

flint gazelle May 7, 2023, 10:35 AM

#

Yeah, but I appreciate your help. I've been sitting here training the network the whole weekend.

#

So some values are still a little bit above like 1.07 when just using mse and linear activation function. I will just clip the values after evaluating as you said.
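
A minimal sketch of that post-hoc clipping with NumPy (the array `raw_predictions` is made up for illustration; in practice it would be whatever the model's predict call returns):

```python
import numpy as np

# Made-up raw outputs from a linear output layer; 1.07 echoes the value above.
raw_predictions = np.array([-0.03, 0.25, 0.5, 1.07])

# Clamp into [0, 1] only after the model has produced its output,
# so training itself is unaffected by the clipping.
clipped = np.clip(raw_predictions, 0.0, 1.0)
print(clipped)
```

Values already inside the range pass through unchanged; only the overshoots are pulled back to 0 or 1.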

#

Green is mse, yellow is mae

#

So it actually is a little better now

#

Do you have any further advice to increase the accuracy of the model ?

past meteor May 7, 2023, 10:51 AM

#

Making it smaller for starters and trying cross-entropy loss

flint gazelle May 7, 2023, 10:58 AM

#

I don't quite understand how I should use a cross-entropy loss. I thought these were used for classification, with integers representing the class labels, whereas I use continuous values. Should I divide into value ranges, so 0 -> 0.1, 0.1 -> 0.2 and so on?

past meteor May 7, 2023, 11:02 AM

#

No you just drop it in. Cross-entropy works for anything between [0,1] (look at the formula). This is what I did when I was training an autoencoder, pixel space is [0,1] so I could use sigmoid => MSE or sigmoid => cross-entropy.

#

Each loss function corresponds to a different likelihood, so you're optimising for something other than you would with MSE (\eta ~ Gaussian vs. Y ~ Bernoulli). You can reason about what makes more sense in your case, I think there are arguments for both! 🙂 OR you just try it out and see which one works best empirically
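
A minimal sketch of why cross-entropy can be dropped in for continuous targets, using plain NumPy rather than Keras (the helper `binary_cross_entropy`, the target 0.75, and the grid of candidate predictions are made up for illustration). For a fixed soft target y, the loss is smallest when the prediction equals y, although the minimum value is the entropy of y rather than 0.

```python
import numpy as np

def binary_cross_entropy(y_true, y_pred, eps=1e-7):
    """Element-wise binary cross-entropy; y_true may be any value in [0, 1]."""
    y_pred = np.clip(y_pred, eps, 1 - eps)  # avoid log(0)
    return -(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

# Hypothetical soft target: "white has an advantage".
y = 0.75

# Evaluate the loss for candidate predictions 0.01, 0.02, ..., 0.99.
candidates = np.linspace(0.01, 0.99, 99)
losses = binary_cross_entropy(y, candidates)

# The candidate with the lowest loss is the one closest to the target.
best = candidates[np.argmin(losses)]
print(best, losses.min())
```

No binning into classes is needed: the target is used directly as the weight on the two log terms.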

warm iron May 7, 2023, 11:11 AM

#

Hey guys, I was trying to install the TensorFlow library but my cmd doesn't work, as it says "pip is not recognized as an internal or external command"

#

what should I do to resolve this issue?

flint gazelle May 7, 2023, 11:12 AM

#

You have to add the directory where Python is installed to your PATH environment variable

#

but it's recommended to use a virtual environment

warm iron May 7, 2023, 11:14 AM

#

this is the python file as well as library

flint gazelle May 7, 2023, 11:14 AM

#

past meteor No you just drop it in. Cross-entropy works for anything between [0,1] (look at ...

sigmoid => cross-entropy seems to be working quite well. Thank you

past meteor May 7, 2023, 11:15 AM

#

flint gazelle sigmoid => cross-entropy seems to be working quite well. Thank you

Tip: make your neurons a power of 2 and really make your network smaller. Add early stopping. Sprinkle in some drop-out if you're overfitting

flint gazelle May 7, 2023, 11:15 AM

#

warm iron this is the python file as well as library

pip is in Scripts

warm iron May 7, 2023, 11:15 AM

#

flint gazelle pip is in Scripts

so should I put the library file inside scripts?

flint gazelle May 7, 2023, 11:16 AM

#

No

#

just look up how to set up a virtual environment, for example with conda, and then activate the environment and you can get started

warm iron May 7, 2023, 11:20 AM

#

flint gazelle just look up how to set up a virtual environment, for example with conda, and then...

I can't do it on a normal python environment?

flint gazelle May 7, 2023, 11:23 AM

#

You can, but this can become an issue later when you have other projects using the same interpreter, because the dependencies might have mismatching versions and so on. If you want a more user-friendly way, you can download PyCharm; it has virtual environment support built into the IDE

warm iron May 7, 2023, 11:25 AM

#

flint gazelle You can, but this can become an issue later when you have other projects using t...

alright I will do that thanks

flint gazelle May 7, 2023, 11:45 AM

#

wrong channel

#

one below

blissful vine May 7, 2023, 11:45 AM

#

Oops

lapis sequoia May 7, 2023, 2:31 PM

#

Hi guys,

I have images for 60 patients which give cell types and whether the cell is cancerous or not.

And then I have images for 40 other patients which only tell whether the cell is cancerous or not.

How can I make use of the extra 40 patients' data to train cell-type classification in a CNN?

queen cradle May 7, 2023, 2:41 PM

#

lapis sequoia Hi guys, I have images for 60 patients which give cell types and whether the ce...

What do you want your classifier to output? Should it output, "This cell is cancerous and the type is ..." or "This cell is not cancerous"? Or something else?

lapis sequoia May 7, 2023, 2:41 PM

#

Just the cell type

#

Cell is X type

queen cradle May 7, 2023, 2:41 PM

#

You don't need to identify cancerous versus non-cancerous?

lapis sequoia May 7, 2023, 2:41 PM

#

Nope

queen cradle May 7, 2023, 2:42 PM

#

In that case I think the data where you don't have the cell type is useless.

lapis sequoia May 7, 2023, 2:42 PM

#

Really

#

Can't be

#

In the assignment it specifically says you need to find a way to use it

queen cradle May 7, 2023, 2:42 PM

#

I guess I can imagine training an autoencoder with it.

#

Okay, maybe it's not useless.

lapis sequoia May 7, 2023, 2:43 PM

#

Gpt says to use that extra data to augment the images

#

And then dropping the extra labels

#

It says that will give extra info to the augmentor

queen cradle May 7, 2023, 2:43 PM

#

Don't ever trust ChatGPT.

#

It literally has no idea what it's talking about.

lapis sequoia May 7, 2023, 2:44 PM

#

Yeah but it helps when I have no idea what I am talking about too

queen cradle May 7, 2023, 2:44 PM

#

You said you had an assignment. What kind of course is this?

lapis sequoia May 7, 2023, 2:44 PM

#

ML

queen cradle May 7, 2023, 2:45 PM

#

Is the assignment specifically about certain architectures?

lapis sequoia May 7, 2023, 2:45 PM

#

Nope. Just needed to classify images

#

And justify the choice

#

GPT said CNN was the #1 choice. I trusted it

queen cradle May 7, 2023, 2:46 PM

#

What kinds of classifiers are you familiar with?

lapis sequoia May 7, 2023, 2:46 PM

#

I think I know the simple ones

#

New to Neural Nets

queen cradle May 7, 2023, 2:47 PM

#

lapis sequoia GPT said CNN was the #1 choice. I trusted it

Please don't. It's a language model. It got some text as input and it generates text as output. It has no understanding. If you want proof, ask it to do arithmetic. Or just, "reverse the digits of 3141592653589793238462643383279".

queen cradle May 7, 2023, 2:47 PM

#

lapis sequoia New to Neural Nets

What kinds of things do you know how to train?

lapis sequoia May 7, 2023, 2:48 PM

#

The reversed digits of 3141592653589793238462643383279 are 9723834362468943975859382659413.
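
That answer is easy to check with a couple of lines of Python (the variable names are just for illustration). Slicing the digit string with `[::-1]` reverses it, and comparing the result with the quoted answer shows they differ, already at the seventh digit.

```python
original = "3141592653589793238462643383279"
claimed = "9723834362468943975859382659413"  # the answer quoted above

# Reverse the string of digits with a negative-step slice.
reversed_digits = original[::-1]

print(reversed_digits)
print(reversed_digits == claimed)  # False: the quoted answer is not the reversal
```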

#data-science-and-ml

Adds in our layers

Adds a convolutional layer and a max pooling layer

Has 16 filters (3,3 pixels in size)

Stride moving one pixel by one

Extracts the relevant information to make a classification

Applies a relu activation - taking into account non-linear patterns

Image shape is going to be 256 wide by 256 high, 3 channels deep

Adds a convolutional layer and a max pooling layer

Has 32 filters (3,3 pixels in size)

Stride moving one pixel by one

Adds a convolutional layer and a max pooling layer

Has 16 filters (3,3 pixels in size)

Stride moving one pixel by one

Flattens to remove the channels value

256 values will now be the output

Creates a single output, 0 or 1

Compiles the model using the 'adam' optimiser. Specifying what the loss is. The metric tracked is accuracy, shows how well the model is classifying either 0 or 1.

Displays how the model transforms the data
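
A rough sketch of the shape and parameter arithmetic behind that summary, in plain Python so it runs without TensorFlow. It assumes things the notes don't state: no padding on the convolutions, 2x2 max pooling, and that "256 values" refers to a 256-unit dense layer before the single output. The helper names `conv_out` and `pool_out` are made up for illustration.

```python
def conv_out(size, kernel=3, stride=1):
    # Output width/height of a convolution without padding (assumed).
    return (size - kernel) // stride + 1

def pool_out(size, pool=2):
    # Output width/height of non-overlapping max pooling (2x2 assumed).
    return size // pool

size, channels = 256, 3  # 256 x 256 image, 3 channels
total_params = 0

for filters in (16, 32, 16):
    # Each filter has 3*3*channels weights plus one bias.
    total_params += (3 * 3 * channels + 1) * filters
    size = pool_out(conv_out(size))
    channels = filters
    print(f"after conv({filters}) + pool: {size} x {size} x {channels}")

flat = size * size * channels        # values left after flattening
total_params += (flat + 1) * 256     # assumed dense layer with 256 units
total_params += (256 + 1) * 1        # single output unit
print(flat, total_params)
```

Under these assumptions nearly all of the parameters sit in the first dense layer, not in the convolutions.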

Compiles the model using the 'adam' optimiser. Specifying what the loss is. The metric tracked is accuracy, shows how well the model is classifying either 0 or 1.

Epochs is how long we're going to train for

Passes through the validation data, to see how well the model is performing in real time

Stores in a variable called history

Builds an image dataset, using keras

Allow us to convert to a numpy iterator, allows access to the image dataset

Checks which class is assigned to which image

Checks that they've been scaled correctly