#data-science-and-ml | Python | Page 100

dry geyser Feb 10, 2024, 3:39 PM

#

"BUT THEY BEAT KASPAROV, SON"

#

"there is no chess board"

#

on a serious note

#

write_json is a blessing to isolate test cases with polars

final kiln Feb 10, 2024, 3:40 PM

#

I wouldn't use polars, spark is the way

dry geyser Feb 10, 2024, 3:41 PM

#

sssshhh

final kiln Feb 10, 2024, 3:41 PM

#

Polars may promise multi processing, but spark actually delivers

dry geyser Feb 10, 2024, 3:42 PM

#

for my use case polars is absolutely great

#

i rewrote all my validation and coalescing into expr

#

and it didnt suck

final kiln Feb 10, 2024, 3:43 PM

#

Yeah I remember some parts being nice, but when it came time to scale it didn't deliver

#

There were also issues with some of the interfaces

#

And the IO didn't work as promised

#

That one killed me >.<

dry geyser Feb 10, 2024, 3:44 PM

#

i didnt see any great features in spark for data transformation in the context of what im doing/need to do

final kiln Feb 10, 2024, 3:44 PM

#

It will use all your cores while maintaining ram below 4gb

#

Spark is not without fault tho, there's some serious issues with memory leaking

#

I just re create the session when I use it, but it's not ideal cuz of the random split

#

It's probly something I'm doing wrong tho, somehow

#

Clearly states java heap

dry geyser Feb 10, 2024, 3:47 PM

#

java will never beat rust in anything

#

re memory and performance

final kiln Feb 10, 2024, 3:49 PM

#

Execution is everything

#

You can benchmark C to be slower than python

#

If you don't got the skills for C

dry geyser Feb 10, 2024, 3:50 PM

#

not that java is impossible to run anywhere constrained... you could argue that smartcard applets are java bytecode and indeed they are.... but thats besides the point.

#

writing shit C that performs worse than python sounds like a challenge

#

whereas writing shit py is arguably trivial

#

just like js

#

the entry bar is low

final kiln Feb 10, 2024, 3:51 PM

#

All I'm saying is that the language is not everything if the person writing doesn't do a good job

dry geyser Feb 10, 2024, 3:51 PM

#

there are intrinsic/inherent requirements for certain languages that make the mistakes also comparatively more complex. ex. in C memory safety.

final kiln Feb 10, 2024, 3:52 PM

#

I don't even know if java allows intereopability with low level anyway

dry geyser Feb 10, 2024, 3:52 PM

#

rust is opinionated to an unbearable level if you write shit code

#

it can...

final kiln Feb 10, 2024, 3:52 PM

#

I don't think that will protect you enough

dry geyser Feb 10, 2024, 3:52 PM

#

android is java and you have tons of JNI/native code

#

most android software protections are actually using native code to interop with java for obfuscation etc

final kiln Feb 10, 2024, 3:53 PM

#

The type system can't predict if you're reading your CSV right

#

Or if you're making good use of your cores

dry geyser Feb 10, 2024, 3:53 PM

#

sure, but thats a different issue

final kiln Feb 10, 2024, 3:53 PM

#

It's not, that's what drove me away from polars

dry geyser Feb 10, 2024, 3:53 PM

#

rust forces constraints for a reasonable baseline of safety

final kiln Feb 10, 2024, 3:56 PM

#

Spark is written in Scala apparently

dry geyser Feb 10, 2024, 3:56 PM

#

btw ontopic: how can I "pretty print" a df in polars?

#

the console output scales horizontally

buoyant vine Feb 10, 2024, 3:56 PM

#

just printing the df should work

#

although it will cap the width

dry geyser Feb 10, 2024, 3:57 PM

#

i run terminal like it's 1973

#

80x75

#

yeah

#

gimme deathscrolls plz

buoyant vine Feb 10, 2024, 3:57 PM

#

I assume print(df) makes an ugly output with that size

dry geyser Feb 10, 2024, 3:57 PM

#

@buoyant vine also where do i send you a case of beer/drinks/whatever?

buoyant vine Feb 10, 2024, 3:57 PM

#

😅 I wouldn't worry about it

dry geyser Feb 10, 2024, 3:57 PM

#

lol

buoyant vine Feb 10, 2024, 3:58 PM

#

Unless you have a miracle cure for whooping cough lol

dry geyser Feb 10, 2024, 3:58 PM

#

i believe in the honorable practice of displaying gratefulness to those who help

#

i mean, i wouldnt rob a hospital of cough suppressants, but if that pays the moral debt....

final kiln Feb 10, 2024, 4:00 PM

#

There's a context manager that lets you alter the width of the rows

#

Scala seems appropriate for this stuff

#

Rust may be the wrong choice as it's more of a systems language

buoyant vine Feb 10, 2024, 4:01 PM

#

002_think

dry geyser Feb 10, 2024, 4:01 PM

#

whats the keyword?

#

re ctx manager

final kiln Feb 10, 2024, 4:01 PM

#

Ask gpt

dry geyser Feb 10, 2024, 4:01 PM

#

lol

final kiln Feb 10, 2024, 4:02 PM

#

But it was something Config

buoyant vine Feb 10, 2024, 4:02 PM

#

well that is an unhelpful awnser lol

final kiln Feb 10, 2024, 4:02 PM

#

What am I gonna do, lie ?

dry geyser Feb 10, 2024, 4:02 PM

#

humiliate yourself saying 'i dunno'

final kiln Feb 10, 2024, 4:02 PM

#

That's exactly what I said tho

#

While pointing you to where I got it in the first place

dry geyser Feb 10, 2024, 4:03 PM

#

gpt just threw a pandas answer to me

#

derp

#

#

ever polite.

#

lets hope those fearsome hostile AIs never become a thing. i bet she will remember.

buoyant vine Feb 10, 2024, 4:05 PM

#

https://docs.pola.rs/py-polars/html/reference/config.html#use-as-a-context-manager

dry geyser Feb 10, 2024, 4:05 PM

#

gpt-4 is pretty awful for polars

#

out of date and also gets quite some things awfully wrong

final kiln Feb 10, 2024, 4:06 PM

#

That's why you cross check with the docs

buoyant vine Feb 10, 2024, 4:06 PM

#

Or alternatively you just go straight to the docs 😅 Makes stuff much quicker and simpler

final kiln Feb 10, 2024, 4:07 PM

#

This model has like 100M params, gonna take forever to train

buoyant vine Feb 10, 2024, 4:07 PM

#

shouldn't take too long

final kiln Feb 10, 2024, 4:07 PM

#

buoyant vine Or alternatively you just go straight to the docs 😅 Makes stuff much quicker an...

Has been working out for me, bad stuff happens when I don't cross check tho

dry geyser Feb 10, 2024, 4:09 PM

#

@buoyant vine im missing more code samples

final kiln Feb 10, 2024, 4:10 PM

#

one is 500dim, the other is 1000dim (they are transformers )

dry geyser Feb 10, 2024, 4:11 PM

#

@buoyant vine quick q: im using a public csv for testing, and have been teasing the idea of doing the coalescing and grouping entirely via expr:

mapping = {
    "CUSTOMER_FORENAME": "first_name",
    "CUSTOMER_SURNAME": "last_name",
    "CUSTOMER_GENDER": "gender"
}

structured_column = pl.struct({
    new_key: pl.col(old_key) for new_key, old_key in mapping.items()
}).alias("person")

How can I retrieve this to create a dict properly containing the keys-values? ex. person : { first_name: ...., }

#

df.select(structured_column)?

buoyant vine Feb 10, 2024, 4:12 PM

#

hmm

#

I think that should work

#

Or at least I can't see anything wrong with the idea]

#

or it might be a with_columns otherwise

dry geyser Feb 10, 2024, 4:14 PM

#

pprint.pprint(df_with_struct.rows(named=True)[0]) still shows the column names though

#

{'person': {'CUSTOMER_FORENAME': 'JOHN',
'CUSTOMER_GENDER': 'male',
'CUSTOMER_SURNAME': 'DOE'}}

buoyant vine Feb 10, 2024, 4:15 PM

#

Isn't that what you have defined as your new_key?

dry geyser Feb 10, 2024, 4:16 PM

#

if i swap them they obviously dont exist: polars.exceptions.ColumnNotFoundError: first_name

fossil forum Feb 10, 2024, 4:16 PM

#

Hello everyone, I am an AI/ML engineer working in the US. We have recently started a discord channel aimed at

Sharing AI ideas
Finding project mates for AI projects
Study groups to learn AI
Resource sharing
Networking

This is in it's early stages, but if you are one of those who are interested in leading conversations and building a beautiful AI community join this channel.
Reach out to me if you are interested

final kiln Feb 10, 2024, 4:17 PM

#

#

this one is 50, 200, 500, 1000 if im not mistaken

buoyant vine Feb 10, 2024, 4:18 PM

#

dry geyser if i swap them they obviously dont exist: polars.exceptions.ColumnNotFoundError:...

What if you rename the columns before hand?

dry geyser Feb 10, 2024, 4:19 PM

#

i could but that seems against my ocd tendency towards not touching the original columns

buoyant vine Feb 10, 2024, 4:19 PM

#

could do a .rename(mapping) before so the columns are correct before they go into the struct

#

https://docs.pola.rs/py-polars/html/reference/series/api/polars.Series.struct.rename_fields.html can do this aswell

dry geyser Feb 10, 2024, 4:20 PM

#

lemme try the rename

#

@buoyant vine where do i place the rename_fields? can i concat/chain it directly to the pl.struct?

final kiln Feb 10, 2024, 4:25 PM

#

I know for sure its gonna do the thing, but it sure is taking a while

buoyant vine Feb 10, 2024, 4:25 PM

#

I think you can just do .alias("person").rename_fields(["field_1", "field_2"])

#

it expects the new fields to be in the order you defined the struct in

dry geyser Feb 10, 2024, 4:26 PM

#

.struct.rename... seems to work

#

yep

#

structured_column = pl.struct({
new_key: pl.col(original_column) for new_key, original_column in mapping.items()
}).struct.rename_fields(list(mapping.values())).alias("person")

final kiln Feb 10, 2024, 4:27 PM

#

you know you can also do stuff using sql, probly easy for those kinds of operations

#

easier *

dry geyser Feb 10, 2024, 4:27 PM

#

@final kiln i havent got started with the sql side yet

#

does it go thru the same engine?

#

exprs seem blazing fast

buoyant vine Feb 10, 2024, 4:28 PM

#

it uses the datafusion engine yeah

#

but it goes through a few more hoops

final kiln Feb 10, 2024, 4:29 PM

#

i mean if all you're doing is renaming stuff

#

i think its about to do the thing, or am i losing my mind already

dry geyser Feb 10, 2024, 4:32 PM

#

@buoyant vine now im rewriting the dynamic expression stuff. the basic things like "if this bool column is set to True, then the field value is foo" does not seem too complicated

#

i wrote my own sandboxed asteval-like expression engine, but it was horribly slow

buoyant vine Feb 10, 2024, 4:34 PM

#

👍

final kiln Feb 10, 2024, 4:41 PM

#

final kiln i think its about to do the thing, or am i losing my mind already

I'm just gonna leave it and go enjoy my Saturday ._.

dry geyser Feb 10, 2024, 4:43 PM

#

you should

final kiln Feb 10, 2024, 4:43 PM

#

Ah I see the issue tho, it's just super slow

#

It's still on the third slice after an hour

dry geyser Feb 10, 2024, 4:43 PM

#

it was hailing here so ill be cranking out shitcode

#

is there a way to limit/condition an expression to the presence of a non null value in a specific column?

buoyant vine Feb 10, 2024, 5:01 PM

#

can combine the expression with a col.not_null() expr

#

i.e. (pl.col(col_name).not_null() & other_expr)

dry geyser Feb 10, 2024, 5:04 PM

#

ex. if column X is not null and set to boolean true, set a new column FOO to value XYZ

buoyant vine Feb 10, 2024, 5:05 PM

#

hmm

#

should be able to

dry geyser Feb 10, 2024, 5:12 PM

#

on it

#

@buoyant vine is it possible to add a new field to a struct without recomposing it?

buoyant vine Feb 10, 2024, 5:16 PM

#

I dont think so

dry geyser Feb 10, 2024, 5:17 PM

#

i suppose then the way to do it is to create an intermediate column

#

and add it

#

can map_* be used to do something like what i asked earlier re conditional field values?

buoyant vine Feb 10, 2024, 5:21 PM

#

it basically is a pandas.apply method

#

i.e. it gives you the column value, and expects a value returned

#

what you do inbetween those points it doesn't really care about

#

the problem is it limits your performance signficantly

dry geyser Feb 10, 2024, 5:25 PM

#

expr?

#

@buoyant vine http://pastie.org/p/4yTznKtTuJeKQQLQFm2m2z not getting the phones part to work

buoyant vine Feb 10, 2024, 5:41 PM

#

dont you want pl.concat_list(list(fields)).unique().drop_nulls().alias(name) rather than doing it after the explode?

dry geyser Feb 10, 2024, 5:42 PM

#

lemme check

#

polars.exceptions.InvalidOperationError: unique operation not supported for dtype list[str]

buoyant vine Feb 10, 2024, 6:04 PM

#

what if you do .arr.unique?

final kiln Feb 10, 2024, 6:09 PM

#

I changed the LR schedule, increased the warmup period.

I really need to look up the rationale behind the 2017 LR scheduler

#

It had the opposite effect on the scheduler, but it looks like it improved the situation

Which would be awesome, except that it totally means idk what I'm doing >.>

#

Ok so.

With max LR of 500e-6 it stayed up there for a long time but I could notice a slight slope downwards + the mini batch loss was becoming more stable.

A smaller max LR (which implies smaller LR throughout), has not changed it very much except that the slope downwards has increased, but still nothing major

late ruin Feb 10, 2024, 6:19 PM

#

hi, need a quick help, i've got an xlsx file that contains these numeric columns, they are float64, now as you can see, they are not very pretty id say, i mean the way they are represented are too long, i tried to change using with as type by doing

df['FF','Rs','Rsh','VOC(mV)','jsc(µA/cm2)']= df['FF','Rs','Rsh','VOC(mV)','jsc(µA/cm2)'].astype(double)

which haven't worked cause of the name of each column, would love to hear some tips and tricks

final kiln Feb 10, 2024, 6:20 PM

#

My intention was to increase max LR to speed up the process. But since the opposite occurred, it means that the model is overshooting the minima.

left tartan Feb 10, 2024, 6:21 PM

#

late ruin hi, need a quick help, i've got an xlsx file that contains these numeric columns...

You want to change the output format, the way the dataframe displays the floats?

late ruin Feb 10, 2024, 6:21 PM

#

id like to change the way each column has its information displayed

left tartan Feb 10, 2024, 6:22 PM

#

Start here, see precision/etc: https://pandas.pydata.org/pandas-docs/stable/user_guide/options.html

#

There’s more advanced things you can do to render it differently, but that might be enough

dry geyser Feb 10, 2024, 6:23 PM

#

@buoyant vine polars.exceptions.ComputeError: expected array dtype

late ruin Feb 10, 2024, 6:24 PM

#

left tartan There’s more advanced things you can do to render it differently, but that might...

thats good it helps but thing is some of the data is not decimal but more in the millions, and thats why it shows this way

final kiln Feb 10, 2024, 6:24 PM

#

final kiln My intention was to increase max LR to speed up the process. But since the oppos...

Since I cannot increase LR to speed it up, my only option is to decrease the size of the batch so that the number of weight updates is higher

dry geyser Feb 10, 2024, 6:24 PM

#

@buoyant vine the fields array is Expr(col(thecolumn))

#

phone1, phone2, etc

buoyant vine Feb 10, 2024, 6:25 PM

#

what is your code rn?

left tartan Feb 10, 2024, 6:25 PM

#

late ruin thats good it helps but thing is some of the data is not decimal but more in the...

See float_format

languid moss Feb 10, 2024, 6:27 PM

#

Hello, I need to develop an AI to play the 2048 Game, I have a large experience with Python, I know i have to use Tensorflow but I just need a roadmap for my learning, thanks in advance !

late ruin Feb 10, 2024, 6:27 PM

#

left tartan See float_format

if i saw you irl id kiss you , thank you very much

final kiln Feb 10, 2024, 6:33 PM

#

final kiln Since I cannot increase LR to speed it up, my only option is to decrease the siz...

In gonna do the opposite, increase batch size and decrease LR/keep it as it is. Higher batch size = more accurate gradient calculation

dry geyser Feb 10, 2024, 6:35 PM

#

taking break, stuck on this one

final kiln Feb 10, 2024, 6:37 PM

#

final kiln In gonna do the opposite, increase batch size and decrease LR/keep it as it is. ...

Batch size of 128 now

#

It's also possible that it's just not a good idea to do d=1000 and N=12, since that's equivalent to what they used in 2017. And they used a lot more GPU than what I'm using.

#

I'm gonna let this one roll and start doing from 800

long canopy Feb 10, 2024, 6:59 PM

#

anything out there to scrape or query github?

tidal bough Feb 10, 2024, 7:03 PM

#

long canopy anything out there to scrape or query github?

well, it has an api

#

and googling pypi github gets me at least one popular library implementing it, even.

long canopy Feb 10, 2024, 7:06 PM

#

hm right, i hope the rate limit doesn't affect me

#

maybe there's a github dataset of repositories somewhere?

tidal bough Feb 10, 2024, 7:09 PM

#

a dataset of what info from the repos?

#

googling dataset of github repos gets me https://www.kaggle.com/datasets/github/github-repos

GitHub Repos

Code and comments from 2.8 million repos

final kiln Feb 10, 2024, 7:10 PM

#

#

one is batch size of 32. the other 128

#

32 has higher learning rate, 128 lower

#

one is betting that the gradient calculation is accurate enough, so I just need to double down on them (double the updates and larger steps), the other is betting that the model was overshooting the minima so it needs to calculate more accurate gradients and take smaller steps

long canopy Feb 10, 2024, 7:13 PM

#

tidal bough googling `dataset of github repos` gets me https://www.kaggle.com/datasets/githu...

wow i suck i should google

#

thanks a lot!

final kiln Feb 10, 2024, 7:18 PM

#

final kiln

New approach, I'm starting from 500, which I already saw that converges, then increase it til 1000 in chunks of 100

late ruin Feb 10, 2024, 7:53 PM

#

hey im trying to normalize my data, in the next way (pic) , but the 'B.C' column is string and id like to keep it without applying anything to it, any suggestions?

tidal bough Feb 10, 2024, 7:57 PM

#

well, simplest way would be normalized_df["B.C."] = df["B.C."]. :p

#

the nice way would be to exclude that column from df before doing this stuff

late ruin Feb 10, 2024, 8:00 PM

#

im omega pepega lol ty

final kiln Feb 10, 2024, 8:21 PM

#

final kiln New approach, I'm starting from 500, which I already saw that converges, then in...

This helped a lot.

There's something funky going on with my gradient accumulation code. It doesn't look wrong at all tho. I suspect that it is something related to the order of magnitude of the values when I use small mini batches

long canopy Feb 10, 2024, 8:22 PM

#

will two prompts with the same tokens in different positions have different vector embeddings?

final kiln Feb 10, 2024, 8:24 PM

#

long canopy will two prompts with the same tokens in different positions have different vect...

I believe so yes, on top of the vector embedding, it is then added a positional encoding.

long canopy Feb 10, 2024, 8:24 PM

#

ah right that makes sense

teal lance Feb 10, 2024, 11:29 PM

#

Can anybody help with this error ☝🏽

neon crystal Feb 11, 2024, 12:43 AM

#

Hey guys I am trying to run GMMHMM model for regime detection on a time series. But I am not sure about the amount of clusters distribution I need. I remember from normal ML for K means I can use metrics like WSS and other methods based on the centroids. But now I am working with Gaussian distributions here what metrics can I use? I am thinking maybe KL or JS divergence but are these really a good metrics to use?

signal holly Feb 11, 2024, 2:30 AM

#

can anyone help mentor me for my ml journey
I keep giving up easily relying only on myself

limber mesa Feb 11, 2024, 3:25 AM

#

teal lance Can anybody help with this error ☝🏽

pip install pandas ?

teal lance Feb 11, 2024, 3:26 AM

#

limber mesa pip install pandas ?

Weird fix had two instances of Python but thank you 🔥

limber mesa Feb 11, 2024, 3:26 AM

#

Jep that sounds about right

dense pawn Feb 11, 2024, 5:54 AM

#

signal holly can anyone help mentor me for my ml journey I keep giving up easily relying only...

I completely understand the struggle with giving up easily. While I'm not at the point of being a mentor someone, I'm also on this ML journey and I've found this roadmap helpful: [https://i.am.ai/roadmap/#note] It includes specific steps and resources that helped me stay motivated when I felt stuck. Perhaps some of them could be useful for you too!

AI Roadmap

Follow these roadmaps to become an Artificial Intelligence expert.

limber mesa Feb 11, 2024, 6:57 AM

#

dense pawn I completely understand the struggle with giving up easily. While I'm not at the...

Thanks!
100% bookmarked that 🙂

final kiln Feb 11, 2024, 7:49 AM

#

I've checked the commit hashes, compared them using github, there's no code differences

#

800 and 900 are missing because celery failed, I need to implement a circuit breaker and a timeout thing

#

This is turning out to be random. I need to step back and reflect on why this is random and how to make it, not be random. Otherwise I won't get anywhere with this.

#

I'm reshuffling the batches to prevent the network from capturing any patterns coming from the order in which it sees the sequences

#

My hypothesis is that the small batch size is at fault.

The way it is, it might be throwing the dices until it finds a sequence of batches that happen to accurately represent the gradient landscape, when it finds them in sequence and in sufficient number it then finds the direction towards the local minima and from there I suppose the slope is large and all directions are funneling towards the minima so the loss starts decreasing sharply, until it no longer is the case and the situation regresses to where it was but at a lower level, which would explain why they all converge to the same value more or less.

#

Guesswork is no good here. Since this setup is highly efficient memory wise, I can fit the transformer from 2017 and possibly their batch configuration. There's not gonna be a one to one correspondence but I can calculate the information content on each batch from 2017 and try to match it in my batches. The model hyper parameters are more or less the same too, especially if the MetaFormer stuff translates to NLP.

hushed scarab Feb 11, 2024, 11:10 AM

#

Hello everyone; let’s say you want to train a dataset, where can I find the data instead of creating it from scratch?

versed pilot Feb 11, 2024, 11:26 AM

#

Hey, I came across this today, does it help? https://insar.dev/

pygmtsar

PyGMTSAR (Python InSAR)

versed pilot Feb 11, 2024, 11:52 AM

#

It's focused on interferometry, you didn't say what exactly you are doing with SAR

tired otter Feb 11, 2024, 1:01 PM

#

Hi. Question about variational autoencoders. Is the main idea that during encoding, latent space is sampled from multidimensional distribution which is shaped by parameters obtained from input data?

mental radish Feb 11, 2024, 1:38 PM

#

Hey! I tried to fit the flux of cosmic rays versus energy for AllParticles&H. I used the CRDB package to extract the data. Then, i used the power law to fit. However, the shape of x is (1236,) and the error in y is (1236,2). how do i fix the shape issue?

Here is my code:

x = t_combined.e
y = t_combined.value
err = t_combined.err_sta
lsq = LeastSquares(x, y, err, power_law)
m = Minuit(lsq, a=1, gamma=-2.0)
plt.errorbar(x, y, err, fmt="o", label="data")
plt.plot(x, (x, *m.values), label="fit") # what does this line do?
ax.scatter(x, y, label="Combined original data", marker="x")
a_fit = minuit.values.a gamma_fit = minuit.values.gamma
x_fit = np.logspace(np.log10(t_combined['e'].min()), np.log10(t_combined['e'].max()), 100) y_fit = power_law(x_fit, a_fit, gamma_fit)
ax.plot(x_fit, y_fit, label="Fitted power law", linestyle='--', color='red')
plt.xlabel(r"𝐸𝑘 [GeV]")
plt.ylabel(r"𝐸𝑘 d𝐽/d𝐸𝑘 [1/(m2 s sr)]")
plt.title('Power Law Fit')
plt.legend()
plt.xscale('log')
plt.yscale('log')
plt.show()
print("Fitted parameters (a, gamma):", m)
plt.show()

dry geyser Feb 11, 2024, 1:48 PM

#

dang

#

still stuck with the dedup + struct expt

#

expr

merry oak Feb 11, 2024, 1:50 PM

#

mental radish Hey! I tried to fit the flux of cosmic rays versus energy for AllParticles&H. I ...

As mentioned in the help post, please format your code with markdown to make it easier to read

final kiln Feb 11, 2024, 2:16 PM

#

behold, mah pipline

#

it now uses pull requests to centralize note taking

#

so I open a PR, it automatically detects if it's an experiment, creates it in mlflow, when I merge it starts running it

odd meteor Feb 11, 2024, 3:13 PM

#

hushed scarab Hello everyone; let’s say you want to train a dataset, where can I find the data...

Check Kaggle.com

sharp zenith Feb 11, 2024, 3:32 PM

#

the AI bubble blew up ? looks like there's nothing new in huggingface.co for a long time

agile owl Feb 11, 2024, 3:43 PM

#

maybe there's just too many piglets for the teats

#

so is polars just generally faster than pandas

#

or is it situational

final kiln Feb 11, 2024, 4:26 PM

#

may I try to convert you to the religion of Spark

zinc moat Feb 11, 2024, 4:30 PM

#

What's the best low pc computing cost object detector? I want create my own security cam (only a detector of people)

final kiln Feb 11, 2024, 4:32 PM

#

zinc moat What's the best low pc computing cost object detector? I want create my own secu...

I think mediapipe has something of the sort

zinc moat Feb 11, 2024, 4:33 PM

#

ty

#

I'll see

final kiln Feb 11, 2024, 4:41 PM

#

Now I can use the PRs as logbooks for each experiment. And they can refer each other and all that stuff so everything is gonna be neatly organized

sterile talon Feb 11, 2024, 4:50 PM

#

versed pilot Hey, I came across this today, does it help? https://insar.dev/

Hah neat, It has come up during the week! I'm not sure yet if it suits my application, but it does InSAR. Thanks!

dry geyser Feb 11, 2024, 5:24 PM

#

any polars guy around?

agile owl Feb 11, 2024, 7:52 PM

#

final kiln may I try to convert you to the religion of Spark

my data isn't bigger than memory so what would spark do for me

final kiln Feb 11, 2024, 7:54 PM

#

agile owl my data isn't bigger than memory so what would spark do for me

Lets you use all your cores, afaik polars doesnt do that

agile owl Feb 11, 2024, 7:54 PM

#

Polars is written from the ground up with performance in mind. Its multi-threaded query engine is written in Rust and designed for effective paralellism. I

#

pandas is also that way afaik

final kiln Feb 11, 2024, 7:55 PM

#

agile owl >Polars is written from the ground up with performance in mind. Its multi-thread...

The docs can say what they wanna say, but it don't do it

#

I mean ig the lib wasn't even able to read my data so idk

#

All I know is spark took my kaggle and got CPU up to 300% with no effort from my part

#

While polars wasn't able to read a basic CSV in lazy mode thing

iron basalt Feb 11, 2024, 7:59 PM

#

agile owl so is polars just generally faster than pandas

Generally, yes.

agile owl Feb 11, 2024, 7:59 PM

#

under what conditions might pandas be faster

#

or is it universally the case

final kiln Feb 11, 2024, 8:06 PM

#

I will risk and say always cuz I've never seen pandas be fast in my life

agile owl Feb 11, 2024, 8:06 PM

#

pandas is faster than pyspark lol

#

if it's a small dataset

#

the cost of spinning up the workers is not worth it oftentimes

final kiln Feb 11, 2024, 8:07 PM

#

Ig if the dataset is small I'll be using python constructs

agile owl Feb 11, 2024, 8:07 PM

#

that is not performant at all

#

you need to use numpy or pyarrow

final kiln Feb 11, 2024, 8:07 PM

#

It's way more performant than pandas

iron basalt Feb 11, 2024, 8:07 PM

#

agile owl under what conditions might pandas be faster

Maybe in some specific single-threaded cases the Numpy operations used are faster in their C/Fortran implementations than those written in Rust (unlikely, even a simple loop with optimizations (auto-vectorization) enabled in LLVM will be fast).

final kiln Feb 11, 2024, 8:07 PM

#

If you're using numpy

agile owl Feb 11, 2024, 8:07 PM

#

that's just factually incorrect

#

that's not a python construct though

#

that's numpy

final kiln Feb 11, 2024, 8:07 PM

#

True

#

Still

#

Small dataset, a dict + numpy does the trick for me

agile owl Feb 11, 2024, 8:10 PM

#

by small dataset I mean smaller than RAM

#

or your RAM budget

#

I still wouldn't use a dict for nested indexes or anything like that

final kiln Feb 11, 2024, 8:11 PM

#

Why do you find the spark init so expensive tho, the memory management alone makes it worth it

agile owl Feb 11, 2024, 8:11 PM

#

because you don't need memory management if the data is smaller than your RAM budget?

final kiln Feb 11, 2024, 8:11 PM

#

It's nice if you can cap it a 4gb at will

final kiln Feb 11, 2024, 8:12 PM

#

agile owl because you don't need memory management if the data is smaller than your RAM bu...

But yes, you still need management

agile owl Feb 11, 2024, 8:12 PM

#

for what purpose

final kiln Feb 11, 2024, 8:12 PM

#

For the purpose of having memory for your other stuff

boreal gale Feb 11, 2024, 8:12 PM

#

my 2 cents is that polars is generally faster than pandas (pandas<2 for sure, less so for pandas>=2)
but i don't think time series support in polars is really fully there yet (if you care about those stuff, iirc you deal with finance so i thought this would be relevant)

final kiln Feb 11, 2024, 8:12 PM

#

Memory leaking and etcs too ofc

#

If you want performance

#

You think memory management

agile owl Feb 11, 2024, 8:13 PM

#

that is not what I think at all

final kiln Feb 11, 2024, 8:13 PM

#

Then you're thinking wrong

agile owl Feb 11, 2024, 8:13 PM

#

when I think performance I think using all the memory I have

#

because that's the axiom of computer science

final kiln Feb 11, 2024, 8:14 PM

#

Idk none of that, all I know is that performance is about where you put your memory and how you lay it out

#

And pandas and polars are not the tool for that

agile owl Feb 11, 2024, 8:14 PM

#

the tradeoff between using memory and not using memory is that you use the memory to get things done faster

#

im not worried about constraining memory usage

final kiln Feb 11, 2024, 8:15 PM

#

I think you're arguing against a straw

agile owl Feb 11, 2024, 8:15 PM

#

you said i need memory management but in fact that stuff has an overhead

#

and i was asking for performance

final kiln Feb 11, 2024, 8:16 PM

#

Memory constraint is not the only thing in memory management

final kiln Feb 11, 2024, 8:16 PM

#

agile owl and i was asking for performance

If you want true performance I suggest using Cython to do intereopability with C

agile owl Feb 11, 2024, 8:17 PM

#

not willing to go that far

final kiln Feb 11, 2024, 8:17 PM

#

As the code will be specifically made for your use case

#

Then numpy and polars are the next best thing afaik

#

Ah and spark ofc

agile owl Feb 11, 2024, 8:18 PM

#

boreal gale my 2 cents is that polars is generally faster than pandas (pandas<2 for sure, le...

whats the problem with time series in polars?

final kiln Feb 11, 2024, 8:18 PM

#

Spark being better since it makes better use of resources

agile owl Feb 11, 2024, 8:18 PM

#

I'm not "against" spark but adding dependencies especially ones that need separate runtimes has a cost

final kiln Feb 11, 2024, 8:19 PM

#

Execution is everything

#

If it's a well done thing, you don't care

#

That's my take at least, I was very impressed with it

iron basalt Feb 11, 2024, 8:20 PM

#

Why is there any issue here? Just download and try all of them on your data. Measure it.

final kiln Feb 11, 2024, 8:20 PM

#

iron basalt Why is there any issue here? Just download and try all of them on your data. Mea...

I second this, best thing is always to measure

agile owl Feb 11, 2024, 8:20 PM

#

I would have to rewrite a lot of code and I'm not sure if it's the best use of my time rn vs other things I need to write for this project

final kiln Feb 11, 2024, 8:21 PM

#

Otherwise we just arguing about the size of the angels wings

agile owl Feb 11, 2024, 8:21 PM

#

so I was trying to figure out as much as I could from ppl who have used both

iron basalt Feb 11, 2024, 8:21 PM

#

agile owl I would have to rewrite a lot of code and I'm not sure if it's the best use of m...

If it may end up making the difference between something like 1 hour or 10 hours of training (or whatever you are doing), yeah, probably worth.

agile owl Feb 11, 2024, 8:22 PM

#

that's what I'm trying to figure out.. lol

#

if it's like 5% faster

#

then there's no point right now

warm copper Feb 11, 2024, 8:22 PM

#

hey guys

iron basalt Feb 11, 2024, 8:22 PM

#

agile owl that's what I'm trying to figure out.. lol

To know that you need to know if you are compute bound, memory bound, IO bound.

warm copper Feb 11, 2024, 8:22 PM

#

I have a question

slim wigeon Feb 11, 2024, 8:22 PM

#

can someone help me to debug my implementation of DNN and backprop from scratch? i cannot provide provide any more information about the problem im having in the code because idk where the problem is
my goal is to write DNN using only numpy, but after i finished the implementation, my implementation of DNN just doesnt learn
i have been debugging for a few days and i couldnt find the problem
please dm me and ill send you the code

agile owl Feb 11, 2024, 8:23 PM

#

it's bound by different resources at different stages. the ultimate bound is the CPU-GPU mem interface

#

at some stages it's bound by my shitty python code

#

at others by pandas implementations

#

but CPU bound in general

iron basalt Feb 11, 2024, 8:24 PM

#

agile owl but CPU bound in general

CPU bound is not a thing, compute bound or memory bound is.

warm copper Feb 11, 2024, 8:24 PM

#

# here we initialize a random data matrix X and random numerical labels y
import numpy as np

X = np.random.randn(10,3)
y = np.random.randn(10,1)

# we also initialize a hypothetical hyperplane defined by w and b
w = np.random.randn(1,3)
b = -1

# (i) find the numerical labels predicted by the model (w,b) for the points in X
#     your code should be a single numpy line
#     hint: we wrote this equation for a single point x in class
#           try to generalize it by expressing everying in terms of matrices

# your code goes here

y_predicted = np.dot(X, w.T) + b

print(y_predicted)

# (ii) find the updated weights after one application of gradient descent with lr = 0.1
#      your code should be a single numpy line

y_ = np.random.randn(10,1)

w_updated = w - 0.1 * np.dot((y_predicted - y_).T, X)

print(w_updated)

final kiln Feb 11, 2024, 8:24 PM

#

warm copper I have a question

What is your question

warm copper Feb 11, 2024, 8:24 PM

#

for the last part of the code

#

should I use y_ or y

boreal gale Feb 11, 2024, 8:25 PM

#

agile owl whats the problem with time series in polars?

i think my old gripe was just there is no groupby rolling and/or the interface was fairly clunky for my specific usecase - this is probably fixed, i can't recall my exact issue

my latest gripe is that ewm_mean in poalrs doesn't take a times like in pandas

warm copper Feb 11, 2024, 8:25 PM

#

y_ was given in the last part of question

#

kinda felt weird to me

agile owl Feb 11, 2024, 8:25 PM

#

iron basalt CPU bound is not a thing, compute bound or memory bound is.

https://stackoverflow.com/questions/868568/what-do-the-terms-cpu-bound-and-i-o-bound-mean was using it like this

Stack Overflow

What do the terms "CPU bound" and "I/O bound" mean?

#

so it's a reinforcement learning environment

iron basalt Feb 11, 2024, 8:25 PM

#

agile owl https://stackoverflow.com/questions/868568/what-do-the-terms-cpu-bound-and-i-o-b...

I know, it's not super useful in optimization.

agile owl Feb 11, 2024, 8:26 PM

#

the environment itself is computationally complex and run on the CPU

warm copper Feb 11, 2024, 8:26 PM

#

I used y_ nevertheless

agile owl Feb 11, 2024, 8:26 PM

#

the GPU is doing the actual network

warm copper Feb 11, 2024, 8:26 PM

#

it must be there for a reason right?

#

@final kiln

agile owl Feb 11, 2024, 8:26 PM

#

and it's bound by the CPU-GPU memory interface

final kiln Feb 11, 2024, 8:26 PM

#

warm copper ```python # here we initialize a random data matrix X and random numerical label...

What is y_ ?

agile owl Feb 11, 2024, 8:26 PM

#

but I'm also reading/writing with DB

warm copper Feb 11, 2024, 8:26 PM

#

I dunno?

#

it was prewritten in the code

final kiln Feb 11, 2024, 8:27 PM

#

I think you gotta use y right ? Since that's your data

warm copper Feb 11, 2024, 8:27 PM

#

whats y_ doing there then?

#

🥲

final kiln Feb 11, 2024, 8:27 PM

#

Compare y_predict with y and apply grad desc

final kiln Feb 11, 2024, 8:28 PM

#

warm copper whats y_ doing there then?

It may be to simulate the validation step idk

iron basalt Feb 11, 2024, 8:28 PM

#

agile owl and it's bound by the CPU-GPU memory interface

So you load your data in your RAM, since you can fit it all. What are you doing with it? Are you even running any Pandas operations on it?

agile owl Feb 11, 2024, 8:29 PM

#

i'm doing numpy operations on it but I use pandas to read from sql

#

also to index it

final kiln Feb 11, 2024, 8:30 PM

#

My love for numpy is undying

iron basalt Feb 11, 2024, 8:30 PM

#

Ok, what kind of operations are you doing on the CPU on that data?

agile owl Feb 11, 2024, 8:30 PM

#

I could have the data in an in-memory database instead of postgres

iron basalt Feb 11, 2024, 8:31 PM

#

agile owl I could have the data in an in-memory database instead of postgres

I assume you just fetch all of it, so it's all in main memory (RAM).

agile owl Feb 11, 2024, 8:31 PM

#

yeah

#

!pastebin

arctic wedgeBOT Feb 11, 2024, 8:32 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

agile owl Feb 11, 2024, 8:32 PM

#

here's an example: https://paste.pythondiscord.com/5JNA

#

don't mind the asserts I'm still cleaning it up

#

I just wanted to quiet the typechecker without comments all over the place

wooden sail Feb 11, 2024, 8:33 PM

#

warm copper ```python # here we initialize a random data matrix X and random numerical label...

this doesn't really make sense, y is not used anywhere here at all. which parts did you write and which were already given?

warm copper Feb 11, 2024, 8:34 PM

#

#

here @wooden sail

wooden sail Feb 11, 2024, 8:34 PM

#

here what? what am i looking at

warm copper Feb 11, 2024, 8:34 PM

#

the question

final kiln Feb 11, 2024, 8:34 PM

#

Ig they want to apply gradient descent with y_

iron basalt Feb 11, 2024, 8:35 PM

#

agile owl here's an example: https://paste.pythondiscord.com/5JNA

Stuff like concatenate is a performance red flag.

agile owl Feb 11, 2024, 8:35 PM

#

why

iron basalt Feb 11, 2024, 8:35 PM

#

Memory allocations.

final kiln Feb 11, 2024, 8:35 PM

#

Memory management

agile owl Feb 11, 2024, 8:35 PM

#

so it's copying everything?

wooden sail Feb 11, 2024, 8:35 PM

#

warm copper the question

doesn't look like y is used anywhere, then

iron basalt Feb 11, 2024, 8:35 PM

#

agile owl so it's copying everything?

Yes, and allocation new memory / a chunk of it from heap.

warm copper Feb 11, 2024, 8:35 PM

#

yeah

agile owl Feb 11, 2024, 8:36 PM

#

isn't that what Spark does too?

final kiln Feb 11, 2024, 8:36 PM

#

Can't you ask the professor

iron basalt Feb 11, 2024, 8:36 PM

#

If you allocate in a loop on the heap, all performance goes out the window.

warm copper Feb 11, 2024, 8:36 PM

#

for y_predicted I used this line of code

#

y_predicted = np.dot(X, w.T) + b

#

@wooden sail

final kiln Feb 11, 2024, 8:36 PM

#

Like it's becoming a matter of interpretation

agile owl Feb 11, 2024, 8:36 PM

#

so are you saying I should make a big numpy array first

#

and then assign the individual elements

iron basalt Feb 11, 2024, 8:37 PM

#

agile owl isn't that what Spark does too?

Not sure what it does, but normally you would either just avoid allocation entirely (probably not needed), or if you really need to, a fast memory arena (arenas are often used by big fast data projects (probably Spark does)).

#

A memory arena basically just being a pre-allocated chunk that is O(1) to allocate on.

warm copper Feb 11, 2024, 8:37 PM

#

y_ = np.random.randn(10,1)

w_updated = w - 0.1 * np.dot((y_predicted - y_).T, X)

print(w_updated)
``` for the second part of the question

agile owl Feb 11, 2024, 8:37 PM

#

so should I just make a big numpy array with the shape of the output

#

and then assign the elements

#

rather than concatenating

iron basalt Feb 11, 2024, 8:38 PM

#

agile owl so should I just make a big numpy array with the shape of the output

Yes, that helps.

warm copper Feb 11, 2024, 8:38 PM

#

what do you think? @wooden sail sounds good to me but Im not sure why we dont use y at all

wooden sail Feb 11, 2024, 8:38 PM

#

warm copper what do you think? <@467435887236612106> sounds good to me but Im not sure why w...

because the code you were given is poorly written

warm copper Feb 11, 2024, 8:38 PM

#

lol

iron basalt Feb 11, 2024, 8:38 PM

#

Any way you can avoid memory allocation. Without that, you have no chance at fast speeds in a tight loop.

final kiln Feb 11, 2024, 8:39 PM

#

That's why asking the person who wrote it is the best option

warm copper Feb 11, 2024, 8:40 PM

#

hmmm

#

I asked openai just now

#

it did what I did hahahaha

final kiln Feb 11, 2024, 8:40 PM

#

Be careful with those things when using for learning.

iron basalt Feb 11, 2024, 8:41 PM

#

iron basalt Any way you can avoid memory allocation. Without that, you have no chance at fas...

Note that with something like polars, since it chains operations together and does them all together, it can avoid many allocations that you would have to do if you had to do it step by step in something like Pandas.

wooden sail Feb 11, 2024, 8:43 PM

#

i can't comment on the gradient since the cost function isn't written there

warm copper Feb 11, 2024, 8:43 PM

#

yup

#

weird function

iron basalt Feb 11, 2024, 8:43 PM

#

Consider something like np.sum(a + b) in Numpy. Numpy has to run its elementwise addition, and then sum on that. That is looping over all the elements twice. But something that chains operations together like polars can just do that in a single loop. ```c
int sum = 0;
for (int i = 0; i < N; ++i) {
sum += a[i] + b[i];
}

#

So even though it may be nicely vectorized and whatever in numpy, it's still just doing more work.

slim wigeon Feb 11, 2024, 8:44 PM

#

is there a place where i can upload python notebook to share it?

wooden sail Feb 11, 2024, 8:44 PM

#

warm copper weird function

not using least squares cost?

final kiln Feb 11, 2024, 8:44 PM

#

I think it's least squares, I recall him showing a slides from a lecture

sterile talon Feb 11, 2024, 8:45 PM

#

versed pilot It's focused on interferometry, you didn't say what exactly you are doing with S...

You are right and I may very well be doing a type of InSAR as well. It seems I'll be doing Pol-SAR as well

slim wigeon Feb 11, 2024, 8:45 PM

#

slim wigeon can someone help me to debug my implementation of DNN and backprop from scratch?...

here's my python code
https://paste.pythondiscord.com/RCBA

iron basalt Feb 11, 2024, 8:45 PM

#

iron basalt Consider something like `np.sum(a + b)` in Numpy. Numpy has to run its elementwi...

(This is why libraries that build compute graphs can optimize better, they don't do uneeded work)

final kiln Feb 11, 2024, 8:46 PM

#

Tensorflow ftw

iron basalt Feb 11, 2024, 8:46 PM

#

(This is especially important for reducing allocations)

warm copper Feb 11, 2024, 8:46 PM

#


1.  y_predicted

If you meant y_predicted, it represents the predicted output of the model for the given data matrix X. After calculating y_predicted = np.dot(X, w.T) + b, this variable holds the model's predicted values for each corresponding sample in X.

2. y_

If you actually did mean y_, it seems to be  an unused and undefined variable  in the provided context. Perhaps it was meant to serve a purpose but got left in the code without assignment.

Explanation

Here's a breakdown of what typically happens in a simple linear regression model, as seems to be intended in your code:

y: This variable holds the true labels or target values for each data point in your dataset.
y_predicted: This would store the model's predicted labels after being calculated using the weights w, bias b, and the data X.
Model Update: y_predicted and y are then used to compute the errors in the model's predictions and to update the weights w through gradient descent.
Let me know if you have a specific portion of the code where you're seeing y_ and I can provide a more tailored explanation!






profile picture
# here we initialize a random data matrix X and random numerical labels y
import numpy as np

X = np.random.randn(10,3)
y = np.random.randn(10,1)

# we also initialize a hypothetical hyperplane defined by w and b
w = np.random.randn(1,3)
b = -1

# (i) find the numerical labels predicted by the model (w,b) for the points in X
#  your code should be a single numpy line
#  hint: we wrote this equation for a single point x in class
#     try to generalize it by expressing everying in terms of matrices

# your code goes here

y_predicted = np.dot(X, w.T) + b

print(y_predicted)

# (ii) find the updated weights after one application of gradient descent with lr = 0.1
#   your code should be a single numpy line

y_ = np.random.randn(10,1)

w_updated = w - 0.1 * np.dot((y_predicted - y).T, X)

print(w_updated)

what does y_ do here



Show drafts

In this code, y_ serves as a source of randomness or noise, and it does not directly contribute to the linear regression process. Here's why:

 Gradient Descent Calculation:

 Let's focus on the line where you update the weights using gradient descent:

Python
w_updated = w - 0.1 * np.dot((y_predicted - y).T, X)
Use code with caution. Learn more

 Notice that  in this expression, you are using the following:

y_predicted: The values predicted by the current model.
y: The true labels for your data.
X: The data matrix itself.
 The calculation doesn't involve y_. The goal of gradient descent is to update the weights w to improve the model's accuracy. This improvement is based on the difference between the true labels (y) and the model's predictions (y_predicted).

 Possible Scenarios for y_:

 Typo:   It's very possible that y_ is the result of a typo, and the intention was to reuse y (the true labels) instead.

 Experimentation: The code might be  intended for experimentation. Perhaps the developer had the idea of introducing random noise during the gradient descent step, although this is not a standard practice in a typical linear regression context.

 Key Point:  When performing linear regression with gradient descent, the key elements for weight updates are the true labels (y), the predicted labels (y_predicted), and the data matrix (X).

 Let me know if you'd like me to analyze a different section of code or explore  more advanced variations on  gradient descent!

#

I asked Gemini

#

Im gonna use y instead of y_

wooden sail Feb 11, 2024, 8:47 PM

#

you need to ask your lecturer because the way it's written, y is not used anywhere

final kiln Feb 11, 2024, 8:47 PM

#

R u in the US, idk if Gemini ultra is already the Gemini latest in the EU

wooden sail Feb 11, 2024, 8:47 PM

#

what cost function are you using?

warm copper Feb 11, 2024, 8:47 PM

#

non

#

none

wooden sail Feb 11, 2024, 8:47 PM

#

yes you are

warm copper Feb 11, 2024, 8:47 PM

#

he just wants one line of code there

wooden sail Feb 11, 2024, 8:47 PM

#

you are using one, otherwise there is no gradient to speak of

final kiln Feb 11, 2024, 8:47 PM

#

It's the square function from one of your lectures

wooden sail Feb 11, 2024, 8:47 PM

#

what is the gradient being computed of?

#

that's the cost function

warm copper Feb 11, 2024, 8:48 PM

#

lr 0.1?

final kiln Feb 11, 2024, 8:49 PM

#

0.1*dy/dw, 0.1 is the dw

#

In the numerator, not the other one

#

calculus of variations anyone

wooden sail Feb 11, 2024, 8:50 PM

#

.latex the way you have written it, in column vector form, would be
[
\bm{y} = \bm{Xw} + \bm{b}
]
for which the gradient, assuming a least squares cost of the form
[
\Vert \bm {y} - \bm{Wx} - \bm{b} \Vert_2^2
]
is
[
g(\bm{w}) = 2(\bm{X}^T\bm{Xw} - \bm{X}^T(\bm{y} - \bm{b}))
]

strange elbowBOT Feb 11, 2024, 8:50 PM

#

$latex.png$

wooden sail Feb 11, 2024, 8:50 PM

#

which you'd then scale by 0.1

#

gradient means you took the derivative of something. what did you take the derivative of?

warm copper Feb 11, 2024, 8:51 PM

#

this is an exercise

wooden sail Feb 11, 2024, 8:51 PM

#

because what you got does not match a least squares function. what was it instead, then?

warm copper Feb 11, 2024, 8:51 PM

#

it doesnt have any previous stuff

#

in the code

wooden sail Feb 11, 2024, 8:51 PM

#

then it doesn't make sense

#

i can't say "find the derivative" and not tell you what to take the derivative of

#

it says there you wrote the equation in class, so the answer is in your notes from class

tidal bough Feb 11, 2024, 8:52 PM

#

wooden sail this doesn't really make sense, y is not used anywhere here at all. which parts ...

i'd wager the mistake is defining a random y_ instead of using y.

wooden sail Feb 11, 2024, 8:53 PM

#

tidal bough i'd wager the mistake is defining a random y_ instead of using y.

there's more issues than that. since y is also random, there is additionally no ground truth for w and b

agile owl Feb 11, 2024, 8:53 PM

#

Invalid index type "tuple[slice, ndarray[Any, dtype[bool_]]]" for "_LocIndexerFrame"; expected type "slice | ndarray[Any, dtype[integer[Any]]] | Index[Any] | list[int] | Series[int] | Series[bool] | ndarray[Any, dtype[bool_]] | list[bool] | Callable[[DataFrame], slice | ndarray[Any, dtype[integer[Any]]] | Index[Any] | list[int] | Series[int] | Series[bool] | ndarray[Any, dtype[bool_]] | list[bool] | list[<nothing>]] | list[<nothing>] | tuple[slice | ndarray[Any, dtype[integer[Any]]] | Index[Any] | list[int] | Series[int] | Series[bool] | ndarray[Any, dtype[bool_]] | list[bool] | list[<nothing>] | tuple[Index[Any] | Series[bool] | ndarray[Any, dtype[bool_]] | list[bool] | str | bytes | date | datetime | timedelta | datetime64 | timedelta64 | bool | int | float | Timestamp | Timedelta | complex | list[Any] | slice | tuple[str | bytes | date | datetime | timedelta | datetime64 | timedelta64 | bool | int | float | Timestamp | Timedelta | complex, ...], ...] | Callable[..., Any], list[<nothing>] | slice | Series[bool] | Callable[..., Any]]"Mypyindex

thanks mypy!

wooden sail Feb 11, 2024, 8:53 PM

#

this is pretty contrived

warm copper Feb 11, 2024, 8:53 PM

#

the question is called Linear Regression with numpy 1liners

wooden sail Feb 11, 2024, 8:53 PM

#

standard linear regression is based on least squares

#

go to your notes and find the cost function that was used

final kiln Feb 11, 2024, 8:53 PM

#

strange elbow

isnt this a simpler expression, 2(y - ...) X or something

tidal bough Feb 11, 2024, 8:54 PM

#

wooden sail there's more issues than that. since y is also random, there is additionally no ...

that's just generating some example data, I believe. as in, that part of the code comes with the problem statement.

wooden sail Feb 11, 2024, 8:54 PM

#

tidal bough that's just generating some example data, I believe. as in, that part of the cod...

which is then not used, and instead a new y_ is drawn

tidal bough Feb 11, 2024, 8:54 PM

#

yup, y_ shouldn't exist

wooden sail Feb 11, 2024, 8:54 PM

#

which is fine, X has full rank with high probability

warm copper Feb 11, 2024, 8:54 PM

#

I think its probably a typo

#

Im using y

#

y_predicted - y

#

y_ doesnt make any sense there

wooden sail Feb 11, 2024, 8:55 PM

#

final kiln isnt this a simpler expression, 2(y - ...) X or something

the same expression with X^T factored out

tidal bough Feb 11, 2024, 8:55 PM

#

agile owl ```py Invalid index type "tuple[slice, ndarray[Any, dtype[bool_]]]" for "_LocInd...

^typechecking errors in any program I write be like

#

~~and i like it this way~~

final kiln Feb 11, 2024, 8:55 PM

#

wooden sail the same expression with X^T factored out

ah didnt see that

wooden sail Feb 11, 2024, 8:55 PM

#

warm copper y_ doesnt make any sense there

neither makes sense, you can use wichever and it will work because of how the problem is written

#

i'd use y_ because it's what they put in the code block, but you have to ask them

#

and go verify in your notes whether it's least squares, cuz your gradient looks wrong

agile owl Feb 11, 2024, 8:56 PM

#

I'm starting to think that typecheckers just can't handle get item

#

at least not with pandas

iron basalt Feb 11, 2024, 8:57 PM

#

agile owl ```py Invalid index type "tuple[slice, ndarray[Any, dtype[bool_]]]" for "_LocInd...

Reminds me of earlier C++ errors with templated types.

warm copper Feb 11, 2024, 9:01 PM

#

#

@wooden sail this is from the notes

wooden sail Feb 11, 2024, 9:03 PM

#

ok, so it's least squares and you're absorbing the scaling factors into the 0.1

warm copper Feb 11, 2024, 9:03 PM

#

yup

#

what I dont understand is

#

this is not a linear regression

#

I dont see any fitting or nothing

#

feels stupid

wooden sail Feb 11, 2024, 9:04 PM

#

yes there is

warm copper Feb 11, 2024, 9:04 PM

#

where is the fit function?

final kiln Feb 11, 2024, 9:05 PM

#

L

warm copper Feb 11, 2024, 9:05 PM

#

oh the learning rate?

final kiln Feb 11, 2024, 9:05 PM

#

no, L is the error being minimized I think

#

in your notes

wooden sail Feb 11, 2024, 9:06 PM

#

it failed

final kiln Feb 11, 2024, 9:06 PM

#

r u writing latex manually

wooden sail Feb 11, 2024, 9:06 PM

#

wdym manually

warm copper Feb 11, 2024, 9:06 PM

#

so should I use y_ or y?

wooden sail Feb 11, 2024, 9:06 PM

#

warm copper so should I use y_ or y?

this has nothing to do with your previous question

final kiln Feb 11, 2024, 9:06 PM

#

without at least real time feedback of the result

#

there's a nice web app thing for it

warm copper Feb 11, 2024, 9:06 PM

#

what? lol

wooden sail Feb 11, 2024, 9:07 PM

#

.latex "regression" is another word for "fitting" or "finding parameters". you're doing gradient descent on the function
[
L = \frac{1}{n} \Vert \bm{y} - (\bm{Wx} + \bm{b}) \Vert _2 ^2
]

strange elbowBOT Feb 11, 2024, 9:07 PM

#

$latex.png$

wooden sail Feb 11, 2024, 9:07 PM

#

there we go

tidal bough Feb 11, 2024, 9:07 PM

#

edd, crunching his bones: back in my day we had to write latex on a piece of paper, and get it right the first time

wooden sail Feb 11, 2024, 9:07 PM

#

this function L is what you're minimizing. and you're doing so by tuning w via gradient descent

final kiln Feb 11, 2024, 9:07 PM

#

https://latex.codecogs.com/eqneditor/editor.php

Online LaTeX Equation Editor - create, integrate and download

HTML LaTeX equation editor that creates graphical equations (gif, png, swf, pdf, emf). Produces code for directly embedding equations into HTML websites, forums or blogs. Images may also be dragged into other applications like Word. Open source and XHTML compliant.

wooden sail Feb 11, 2024, 9:07 PM

#

no paper writer worth their salt uses that

warm copper Feb 11, 2024, 9:07 PM

#

so what do you guys think I should do?

#

for my question

wooden sail Feb 11, 2024, 9:07 PM

#

use y_, and more importantly, go review your course material

warm copper Feb 11, 2024, 9:08 PM

#

thats all the course material

wooden sail Feb 11, 2024, 9:08 PM

#

cuz it sounds like you aren't grasping the key ideas

final kiln Feb 11, 2024, 9:08 PM

#

what's wrong with it, i'd say it's quite handy

warm copper Feb 11, 2024, 9:08 PM

#

I dont see any mention of y_ in my notes

wooden sail Feb 11, 2024, 9:08 PM

#

it's just slower. i'll just write a large chunk of raw tex and compile it later

wooden sail Feb 11, 2024, 9:08 PM

#

warm copper I dont see any mention of y_ in my notes

because it has nothing to do with the rest of the problem

#

i keep telling you, the code you were given is not consistent

#

you could use either y or y_ and it will work

#

this is just a parameter you evaluate into the loss function L

#

i assure you the person grading won't care either, but since they went through the trouble of making y_ in the new cell, just use that

warm copper Feb 11, 2024, 9:09 PM

#

#

so this is my loss function

wooden sail Feb 11, 2024, 9:10 PM

#

yes

#

that's the same as i wrote above

#

the sum of squared errors, which you then minimize to achieve the "least" value

#

hence "least squares"

warm copper Feb 11, 2024, 9:11 PM

#

#

this is a and b

wooden sail Feb 11, 2024, 9:12 PM

#

a is the same as w in your task

warm copper Feb 11, 2024, 9:12 PM

#

yup

final kiln Feb 11, 2024, 9:12 PM

#

what is the question

wooden sail Feb 11, 2024, 9:12 PM

#

the real question is, why do they insist on using row vectors. yuck

warm copper Feb 11, 2024, 9:13 PM

#

#data-science-and-ml message here @final kiln

final kiln Feb 11, 2024, 9:13 PM

#

I have been converted actually

#

row is better cuz pytorch

wooden sail Feb 11, 2024, 9:13 PM

#

no, pytorch bad cuz row

warm copper Feb 11, 2024, 9:13 PM

#

lol

#

I did a great job with the rest of the questions tho

final kiln Feb 11, 2024, 9:13 PM

#

warm copper https://discord.com/channels/267624335836053506/366673247892275221/1206337423731...

your prof is asking you to apply gradient descent on y_

warm copper Feb 11, 2024, 9:13 PM

#

😄

agile owl Feb 11, 2024, 9:14 PM

#

I'd guess it's a lot more intuitive for most people to think of tables as a set of rows stitched together than to think of it as columns where each index represents a different individual

warm copper Feb 11, 2024, 9:14 PM

#

okay then y_predicted - y_

final kiln Feb 11, 2024, 9:14 PM

#

yeah once it gets to several dimensions row is easier to think about

#

like (x, y, z, d, v, c) shapes

wooden sail Feb 11, 2024, 9:14 PM

#

agile owl I'd guess it's a lot more intuitive for most people to think of tables as a set ...

that completely throws away all of the power of linalg, hopefully they at least think of the rows as spanning a vector space still

final kiln Feb 11, 2024, 9:15 PM

#

linalg is agnostic to this tho

wooden sail Feb 11, 2024, 9:15 PM

#

it honestly makes no difference as long as you're consistent and keep in mind your fundamental vector spaces

#

but math books canonically use column vectors, so

final kiln Feb 11, 2024, 9:15 PM

#

warm copper okay then `y_predicted - y_`

either one will work, you can leave a note saying you were unsure which one to use, but both look equivalent

warm copper Feb 11, 2024, 9:15 PM

#

If our given data set is linearly separable, does the same hold true for the transformed set? In the following cells you can plot a transformed version of the Iris dataset, so that you see how it behaves (for your choice of 𝑎 , 𝑏 , 𝑐 .) But you should also try and justify your answer in a theoretical way: if there exists a 'good' perceptron for the original data set, what would be the weights for the perceptron that works on the transformed set? Are there any issues that might arise?

#

I answered this question using linear algebra

wooden sail Feb 11, 2024, 9:16 PM

#

linalg would be the way

warm copper Feb 11, 2024, 9:16 PM

#

wooden sail Feb 11, 2024, 9:16 PM

#

rank-reducing transformations will give you a nontrivial kernel

wooden sail Feb 11, 2024, 9:17 PM

#

warm copper

only for full rank transformations 😛

tidal bough Feb 11, 2024, 9:17 PM

#

wooden sail but math books canonically use column vectors, so

well you see, an (n,m) matrix consists of n column vectors, the ith one obtained by matrix[i, :].reshape(-1,1),

warm copper Feb 11, 2024, 9:17 PM

#

he wanted us to use chatGPT for this question as well

slim wigeon Feb 11, 2024, 9:18 PM

#

guys, i need help with finding out why my implementation of dnn is not learning 😭 i have been trying to debug for 3 days and im slowly going insane

#

Here's my code https://paste.pythondiscord.com/RCBA
notebook version: https://colab.research.google.com/drive/1R4tpsRi4gHXrAcUU9zGRj76FQBt9LJtX?usp=sharing

Google Colaboratory

warm copper Feb 11, 2024, 9:18 PM

#

But I assume linear transformations dont cause loss of linear separability? @wooden sail

final kiln Feb 11, 2024, 9:19 PM

#

warm copper he wanted us to use chatGPT for this question as well

eh, i thought academia was afraid of new tech

wooden sail Feb 11, 2024, 9:19 PM

#

warm copper But I assume linear transformations dont cause loss of linear separability? <@46...

they do, if they're rank deficient. that's what i'm telling you

#

go ahead and try to separate your data if T is the zero matrix

warm copper Feb 11, 2024, 9:19 PM

#

oh hsit yeah

wooden sail Feb 11, 2024, 9:19 PM

#

the same will be true for any T that is not full rank

#

well, there's a discussion to be had about domains, kernels, and pre-images

#

this is exactly what i meant about your fundamental subspaces

#

you can play with the rank-nullity theorem or the fundamental theorem of linear algebra here

#

things get a little bit more tricky for nonzero b because it becomes an affine transformation, but the spirit of the discussion is the same

warm copper Feb 11, 2024, 9:22 PM

#

😄

#

Arent you a math wiz

#

you wanna see how stupid my DSA assignment is?

#

@wooden sail

agile owl Feb 11, 2024, 9:26 PM

#

linear algebra has always hurt my head tbh

wooden sail Feb 11, 2024, 9:26 PM

#

i want to sleep 😔

agile owl Feb 11, 2024, 9:26 PM

#

I prefer calculus

#

it's a lot more intuitive to me

warm copper Feb 11, 2024, 9:26 PM

#


Option 1:  all arriving passengers are placed in a single queue, and service stations take passengers from the front of that queue.

Option 2:  each service station has its own queue, and arriving passengers are dispatched to a queue according to one of many policies:

2.A:  round robin (1, 2, 3, 4, 5, 1, 2, 3, 4, 5, 1, 2, ...).

2.B:  arriving passenger is placed in a shortest queue.

2.C:  arriving passenger is placed in a random queue.

 Inputs to the simulation:
The duration of the simulation measured in minutes (D:  make it arbitrarily long, do not worry about it being or not being realistic).
The average arrival rate measured in minutes (A:  arrivals are random, but on average there is one new passenger every A minutes), 
The average service rate measured in minutes (S:  service rates are random, but on average they need about S minutes of service).
For the sake of this study, make sure to crowd the system, by choosing S >> 5*A, without causing an overflow of your queues.  Also choose D to be long enough to get rid of any transitory effects.

 Outputs of the Simulation for each queuing policy:
The duration of the simulation (which may be longer than the input parameter, as when check-in closes, there may be passengers in the waiting queues and service stations).
The maximum length of the queue for each queue.
The average and maximum waiting time for each queue.
The rate of occupancy of each service station (percentage of time each station was busy).
If you want: show the real-time evolution of the queues during the run-time simulation.

final kiln Feb 11, 2024, 9:27 PM

#

try linear algebra with complex numbers

warm copper Feb 11, 2024, 9:27 PM

#

I asked teacher if I could use Markov Queues from Queue Theory

#

M/M/c

#

He didnt even know what that was

final kiln Feb 11, 2024, 9:27 PM

#

what is it

#

just searched on google, ofc queue theory is a thing in math

warm copper Feb 11, 2024, 9:28 PM

#

#

this was my response

#

and he was like what?

agile owl Feb 11, 2024, 9:30 PM

#

where does that say markov queue though

warm copper Feb 11, 2024, 9:30 PM

#

Its a markovian model

final kiln Feb 11, 2024, 9:30 PM

#

everything circles back to markov

agile owl Feb 11, 2024, 9:31 PM

#

glad I don't need to know that

warm copper Feb 11, 2024, 9:32 PM

#

https://web.mit.edu/modiano/www/6.263/lec5-6.pdf

#

similar to what I learnt

#

in my stats class

final kiln Feb 11, 2024, 9:33 PM

#

check knot theory

#

it's also a thing

#

surprsingly useful tho

warm copper Feb 11, 2024, 9:33 PM

#

oh god

#

you will learn markov chains as a part of your AI journey @agile owl

agile owl Feb 11, 2024, 9:35 PM

#

I know what they are

#

I meant the whole queueiing thing

final kiln Feb 11, 2024, 9:35 PM

#

where are markov chains used in ai ?

warm copper Feb 11, 2024, 9:35 PM

#

its an implementation of markov chains on queues

agile owl Feb 11, 2024, 9:35 PM

#

theyre used in reinforcement learning

final kiln Feb 11, 2024, 9:35 PM

#

oh, havent gotten to that stuff

warm copper Feb 11, 2024, 9:35 PM

#

yup

agile owl Feb 11, 2024, 9:36 PM

#

state-action-reward-state-action-reward...

final kiln Feb 11, 2024, 9:36 PM

#

makes sense

warm copper Feb 11, 2024, 9:36 PM

#

whoever this markov guy was

#

he was genius

agile owl Feb 11, 2024, 9:36 PM

#

the learning curves in reinforcement learning go up because we are trying to maximize rewards instead of minimizing losses

#

I like to joke that it's the optimistic branch of ML

final kiln Feb 11, 2024, 9:36 PM

#

agile owl state-action-reward-state-action-reward...

yeah it's a sort of random walker right

agile owl Feb 11, 2024, 9:36 PM

#

yeah

warm copper Feb 11, 2024, 9:36 PM

#

https://en.wikipedia.org/wiki/Andrey_Markov

Andrey Markov

Andrey Andreyevich Markov (14 June 1856 – 20 July 1922) was a Russian mathematician best known for his work on stochastic processes. A primary subject of his research later became known as the Markov chain. He was also a strong, close to master-level chess player.
Markov and his younger brother Vladimir Andreevich Markov (1871–1897) proved the M...

agile owl Feb 11, 2024, 9:37 PM

#

the way it fits is random exploration of states

#

but it has what is called a policy gradient

#

so it's partially random in that the state it explores are randomly generated but it optimizes the policy gradient which is the function that links actions to rewards

#

a | s -> R

final kiln Feb 11, 2024, 9:38 PM

#

interesting

#

so you have a state, and you throw a dsiplacement at random and see if that improves the reward

warm copper Feb 11, 2024, 9:39 PM

#

agile owl Feb 11, 2024, 9:39 PM

#

yeah

#

your actions lead to states

final kiln Feb 11, 2024, 9:39 PM

#

so it's proposal based right

#

like, if a certain displacement doesnt work you throw it away

agile owl Feb 11, 2024, 9:40 PM

#

yeah it learns to make the right action given the state to optimize the rewards

#

not just for one step but for an entire episode of steps

warm copper Feb 11, 2024, 9:40 PM

#

are you studying Data Science? @agile owl

agile owl Feb 11, 2024, 9:40 PM

#

yea sure

final kiln Feb 11, 2024, 9:40 PM

#

wait so it's like a genetic thing

#

i need to read up on this stuff

warm copper Feb 11, 2024, 9:40 PM

#

agile owl yea sure

nice that was my undergrad are you enjoying it

agile owl Feb 11, 2024, 9:41 PM

#

sure

#

I already got a degree in it they just didn't cover reinforcement learning very much so I'm self-studying it now

warm copper Feb 11, 2024, 9:41 PM

#

yeah we didnt either

#

we will do in my grad program

final kiln Feb 11, 2024, 9:42 PM

#

i might go for phd, really liking ml and stuff

#

down the line tho

warm copper Feb 11, 2024, 9:42 PM

#

queue theoryy

#

learn some of that

final kiln Feb 11, 2024, 9:43 PM

#

i wonder what other mundane things mathematicians have turned into fields

agile owl Feb 11, 2024, 9:43 PM

#

I think reinforcement learning is really a lot more exciting than static learning problems

#

that's how they get robots to walk etc.

tidal bough Feb 11, 2024, 9:43 PM

#

final kiln i wonder what other mundane things mathematicians have turned into fields

the reals👍

final kiln Feb 11, 2024, 9:43 PM

#

the thing that drew me to ML was llms, really fascinated by them

warm copper Feb 11, 2024, 9:43 PM

#

https://www.amazon.com/Fundamentals-Queueing-Theory-Probability-Statistics-dp-111894352X/dp/111894352X/ref=dp_ob_title_bk

Fundamentals of Queueing Theory (Wiley Series in Probability and St...

Fundamentals of Queueing Theory (Wiley Series in Probability and Statistics)

agile owl Feb 11, 2024, 9:43 PM

#

I would be surprised if chatgpt didn't use some kind of reinforcement learning

final kiln Feb 11, 2024, 9:44 PM

#

tidal bough the reals👍

no like, is there a sock theory

warm copper Feb 11, 2024, 9:44 PM

#

it does

agile owl Feb 11, 2024, 9:44 PM

#

I think every time you tell it it did something wrong it learns from that

tidal bough Feb 11, 2024, 9:44 PM

#

google "RLHF".

agile owl Feb 11, 2024, 9:45 PM

#

plus side is you get to use your CPU and GPU at the same time 🙂

final kiln Feb 11, 2024, 9:45 PM

#

tidal bough google "RLHF".

yeah I remember this

warm copper Feb 11, 2024, 9:45 PM

#

#

Gemini is good

agile owl Feb 11, 2024, 9:46 PM

#

reinforcement learning to the moon

#

I need to learn more about the implementation of these models instead of using sb3 though

#

I want to try to reimplement SAC in rust from scratch using their version of torch

final kiln Feb 11, 2024, 9:48 PM

#

rust has torch ?

agile owl Feb 11, 2024, 9:48 PM

#

it has a torch alternative

#

that has a torch-like API

final kiln Feb 11, 2024, 9:49 PM

#

python is awesome but when the project starts growing I feel the need for type safety

#

mypy dont cut it cuz a lot of libs dont have stubs

warm copper Feb 11, 2024, 9:49 PM

#

I also like NLP @agile owl

agile owl Feb 11, 2024, 9:50 PM

#

I like NLP too but more for quantizing things like sentiment than generation

warm copper Feb 11, 2024, 9:50 PM

#

cool!

final kiln Feb 11, 2024, 9:50 PM

#

ive been burning a lot of gpu for sentiment analysis

#

,_.

warm copper Feb 11, 2024, 9:51 PM

#

yeah

#

I dont do GPU intensive stuff on my computer

agile owl Feb 11, 2024, 9:51 PM

#

I have an RTX 2080 Ti

#

it does alright

final kiln Feb 11, 2024, 9:51 PM

#

I wrote a data and training pipelin in github actions

warm copper Feb 11, 2024, 9:51 PM

#

I did a CNN implementation once during my undergrad. My computer shut down @agile owl RTX 2080

final kiln Feb 11, 2024, 9:51 PM

#

models are trained in spot

warm copper Feb 11, 2024, 9:51 PM

#

not Ti tho

final kiln Feb 11, 2024, 9:52 PM

#

they all train during the night

agile owl Feb 11, 2024, 9:52 PM

#

I think the Ti has 3 extra gb of VRAM

warm copper Feb 11, 2024, 9:52 PM

#

I had to use Google Collab

#

50 bucks a month

#

REEEEE

agile owl Feb 11, 2024, 9:52 PM

#

VRAM is super expensive did you guys know that

warm copper Feb 11, 2024, 9:53 PM

#

no o.O

agile owl Feb 11, 2024, 9:53 PM

#

the good stuff is

warm copper Feb 11, 2024, 9:53 PM

#

is that why Graphics Cards are expensive?

final kiln Feb 11, 2024, 9:53 PM

#

agile owl Feb 11, 2024, 9:53 PM

#

part of the reason yes

final kiln Feb 11, 2024, 9:53 PM

#

everything is automated

warm copper Feb 11, 2024, 9:53 PM

#

I have 2080 and it cant run Cities Skylines 2 properly

#

saddddd

agile owl Feb 11, 2024, 9:54 PM

#

I'm honestly somewhat surprised it took people so long to realize the potential of GPUs for machine learning

#

they had GPUs for a long time

warm copper Feb 11, 2024, 9:54 PM

#

but they were not as capable

agile owl Feb 11, 2024, 9:54 PM

#

I don't think they have changed the basic premise of it that much

#

but in relation to the CPU they always had more but worse cores didn't they?

warm copper Feb 11, 2024, 9:54 PM

#

also we didnt have capable computers back then

#

I mean Im 90s kids

agile owl Feb 11, 2024, 9:55 PM

#

right but think about it

#

using GPUs for compute

#

that only really became a thing when NVIDIA wrote Cuda

#

even though in principle it could have been done earlier

#

is it because the GPU instruction sets are proprietary?

#

I don't understand why no one did something like CUDA independently

final kiln Feb 11, 2024, 9:56 PM

#

warm copper Feb 11, 2024, 9:57 PM

#

yeah thats a good question @agile owl

#

o_O I dont know either

agile owl Feb 11, 2024, 9:57 PM

#

I asked chatGPT and it gave me a reasonable sounding answer

warm copper Feb 11, 2024, 9:57 PM

#

what did it say?

agile owl Feb 11, 2024, 9:58 PM

#

lack of standardization

warm copper Feb 11, 2024, 9:58 PM

#

ohhh

agile owl Feb 11, 2024, 9:58 PM

#

limited programmability of earlier generations of GPUs

#

lack of awareness

final kiln Feb 11, 2024, 9:58 PM

#

given how hard it is to use the AMD environment, may make sense

#

ive spent a whole weekend trying to get an amd gpu in the cloud to do stuff

#

took me an hour to do the same in nvidia

agile owl Feb 11, 2024, 9:59 PM

#

I got AMD GPUs to work with ROCm years ago

#

it was alright

final kiln Feb 11, 2024, 9:59 PM

#

ig my issue is the lack of docker support

agile owl Feb 11, 2024, 9:59 PM

#

the experience of getting it set up sucked tho

final kiln Feb 11, 2024, 10:00 PM

#

it's not zero support, but everything is so badly done

agile owl Feb 11, 2024, 10:00 PM

#

it's like one dude

final kiln Feb 11, 2024, 10:00 PM

#

10gb docker images for ex

agile owl Feb 11, 2024, 10:00 PM

#

or it used to be

#

it was like literally one dude

final kiln Feb 11, 2024, 10:00 PM

#

fr ?

agile owl Feb 11, 2024, 10:00 PM

#

lol

final kiln Feb 11, 2024, 10:00 PM

#

like one person supporting it ?

agile owl Feb 11, 2024, 10:00 PM

#

I remember looking for resources about it and it was ONE GUY answering everyone's questions on github

final kiln Feb 11, 2024, 10:01 PM

#

yeah that explains it

agile owl Feb 11, 2024, 10:01 PM

#

i'd be surprised if they haven't grown the team

#

I mean he probably had a few coworkers but still

final kiln Feb 11, 2024, 10:01 PM

#

behind every bad code is one overworked developer

agile owl Feb 11, 2024, 10:02 PM

#

I still think AMD is underinvesting in AI

#

I don't understand it

#

if they could undercut Nvidia in AI it would be a massive coup

final kiln Feb 11, 2024, 10:02 PM

#

massive corps are hard to change

#

people don't get compensated for innovating

#

so they kinda dont

#

I think AI was nvidias end game from the start

agile owl Feb 11, 2024, 10:03 PM

#

I mean they don't need to be the first mover

#

being the second mover is also good if there's only one other company

#

they just need to invest in having something as good or better than cuda

final kiln Feb 11, 2024, 10:04 PM

#

there seems to be a market vacum of sorts

agile owl Feb 11, 2024, 10:04 PM

#

or even slightly worse

final kiln Feb 11, 2024, 10:04 PM

#

no competitor to nvidia

agile owl Feb 11, 2024, 10:04 PM

#

exactly

#

even if you're worse than nvidia if you're a viable option and can compete on price

#

their problem is rocm is barely viable

#

it just has to be good enough to work with common libraries

#

and they would get a massive sales boost from people going for the value alternative

final kiln Feb 11, 2024, 10:05 PM

#

idk if it's easy to do that, you're saying like make cheaper gpu rite

agile owl Feb 11, 2024, 10:06 PM

#

they already make cheaper GPU

#

I'm saying they need the SOFTWARE

#

so people buy them for compute

final kiln Feb 11, 2024, 10:06 PM

#

ah, yeah that's for sure

agile owl Feb 11, 2024, 10:06 PM

#

their gpus are usually slightly worse than nvidias

#

from a hardware perspective

#

but the software can be worse too

#

it just has to like, work

final kiln Feb 11, 2024, 10:07 PM

#

I personally dont care or would notice the hardware

agile owl Feb 11, 2024, 10:07 PM

#

which rocm barely does

#

you'd notice it would take like 10-20% longer to train

final kiln Feb 11, 2024, 10:07 PM

#

I do notice the complete lack of support on the software side

agile owl Feb 11, 2024, 10:07 PM

#

but the GPUs are like half the cost

#

so it's value from that perspective

final kiln Feb 11, 2024, 10:07 PM

#

I mean rn I just schedule the thing and let it do it during the night

#

so I wouldn't notice it

agile owl Feb 11, 2024, 10:09 PM

#

I'm making a webserver to provide a UI for model creation

final kiln Feb 11, 2024, 10:09 PM

#

yeah I get the feeling a lot of modelling can be done with UI or some DSL

agile owl Feb 11, 2024, 10:10 PM

#

the alternative is to write a DSL

#

but then it wouldn't have charts

final kiln Feb 11, 2024, 10:10 PM

#

something Like labview could be cool

#

both a UI and a DSL

agile owl Feb 11, 2024, 10:12 PM

#

this server is currently just for training an already curated dataset and plotting results

#

the next step I want to do is add the ability to do ETL from different APIs into a joined table before standardizing and slicing for CV etc.

final kiln Feb 11, 2024, 10:13 PM

#

have you tried mlflow

agile owl Feb 11, 2024, 10:13 PM

#

is that some paid service

final kiln Feb 11, 2024, 10:13 PM

#

a webserver that does graphs sounds an awful lot like it

final kiln Feb 11, 2024, 10:13 PM

#

agile owl is that some paid service

no its open source

agile owl Feb 11, 2024, 10:13 PM

#

does it work for reinforcement learning

final kiln Feb 11, 2024, 10:14 PM

#

its what Ive been using to log my stuff

#

you kinda just do .log_metric("metric_name", spme_val)

#

and it saves it and you can see it real time on the UI

#

theres also an auto log feature, but ive never used it

#

like it does some magic that you dont even have to explicitly log stuff

agile owl Feb 11, 2024, 10:17 PM

#

like in a callback?

final kiln Feb 11, 2024, 10:17 PM

#

#

there's a lot of automation done by me, each experiment is a PR, when I merge it automatically runs the training loops and they appear as runs

final kiln Feb 11, 2024, 10:18 PM

#

agile owl like in a callback?

idk how it does it, but doesnt seem to use callbacks, there seems to be a ton of py magic to it

agile owl Feb 11, 2024, 10:19 PM

#

I'd rather roll my own

#

I don't like magic

final kiln Feb 11, 2024, 10:19 PM

#

I meant the auto log feature

#

what im doing you just do .log_param, .log_metric and .log_artifact

#

so it backs up your models and everythng

#

#

surely you'd prefer not to reimplement all this

#

there are others similar to this

agile owl Feb 11, 2024, 10:23 PM

#

I'm honestly not sure if that API can work with sb3

#

without doing deep surgery

#

they already have their own logging functions

#

I just need to visualize them

#

I already have the points

final kiln Feb 11, 2024, 10:23 PM

#

technically it's an http api

#

been using it with pytorch, works quite well

agile owl Feb 11, 2024, 10:24 PM

#

I mean, getting access to that state at the right level of granularity to plot it

warm copper Feb 11, 2024, 10:24 PM

#

someone asked me what ML agorithms don't require regularization and which ones do

#

I said this

#


There are some ML algorithms where overfitting is not a problem at all. For example Naive Bayes is known for its conditional independence which makes it resistant to overfitting. KNN is another algorithm that is resistant to overfitting as it works by memorizing the training data. Random forest is also resilient when it comes to overfitting thanks to the way it combines many independently trained decision trees.

#

am I right?

final kiln Feb 11, 2024, 10:25 PM

#

knn doesnt overfit ?

agile owl Feb 11, 2024, 10:25 PM

#

knn has a very good calibration

final kiln Feb 11, 2024, 10:25 PM

#

i thought every model can overfit

agile owl Feb 11, 2024, 10:26 PM

#

it depends on what your assumptions are

#

some people say RL can't overfit but I'm pretty sure it can as soon as you introduce different data to the same environment

final kiln Feb 11, 2024, 10:27 PM

#

it's way too late for me to use my brain to learn new stuff

warm copper Feb 11, 2024, 10:28 PM

#

I found this online:

#


Non-parametric: KNN doesn't learn a fixed set of parameters. It essentially relies on memorizing the training data.
Focus on local regions: KNN makes predictions based on localized neighborhoods in the data space, reducing its susceptibility to extreme patterns that might mislead parametric models.

final kiln Feb 11, 2024, 10:30 PM

#

i mean yeah, does it even make sense to say "knn overfits"

#

it's just a database query almost

agile owl Feb 11, 2024, 10:30 PM

#

it's an interesting property though

#

I think you could make a more sophisticated model based on the same principle

final kiln Feb 11, 2024, 10:31 PM

#

its called a transformer

warm copper Feb 11, 2024, 10:31 PM

#

is there any other algorithm that uses memorization?

final kiln Feb 11, 2024, 10:32 PM

#

likely yeah

#

not neural networks tho, they do compression

#

im gonna sleep

warm copper Feb 11, 2024, 10:32 PM

#

Locally Weighted Regression: A non-parametric regression method that fits simple models to localized subsets of the training data. The focus is on predictions made close to a query point, relying more heavily on training examples in that local region.

#

found this

#

Locally Weighted Regression

#

https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/cohn96a-html/node7.html#:~:text=Locally weighted regression (LWR) is,local'' to that point.

Locally Weighted Regression

agile owl Feb 11, 2024, 10:33 PM

#

if you're localizing to time then that's just a rolling window

warm copper Feb 11, 2024, 10:33 PM

#

interesting read

agile owl Feb 11, 2024, 10:33 PM

#

you could like weight by time too

#

exponentially weight the cost with some halflife with respect to time

warm copper Feb 11, 2024, 10:34 PM

#

When to use Locally Weighted Linear Regression?
When n (number of features) is small.
If you don’t want to think about what features to use.

quartz karma Feb 11, 2024, 10:45 PM

#

Hi, does anyone know how to get simba work on top of spark? is it enough to just install the driver?

agile owl Feb 12, 2024, 1:42 AM

#

so I got my code working with polars and it's actually slower

#

by quite a bit

#

gonna profile it and see what the problem seems to be

#

same stuff as where it was taking a lot of time with pandas except the flamechart is shallower

#

and it's taking longer

#

I feel like I got memed

#

I'm not sure my results were actually correct because I got a different end result but I'll save learning polars for a new project

#

I think it kind of stands to reason that a dataframe with an index built in is going to be faster than doing a filter on a column...

teal lance Feb 12, 2024, 4:57 AM

#

rapid isle Feb 12, 2024, 5:48 AM

#

guys is there a difference between the two

#

rn_image_picker_lib_temp_0b087b9c-8119-4634-b5f4-c293554487ef.jpg

worldly dawn Feb 12, 2024, 6:38 AM

#

rapid isle guys is there a difference between the two

can you think about a case where it might behave differently?

Also that's not really related to #data-science-and-ml

rapid isle Feb 12, 2024, 8:10 AM

#

sorry could u refer me to the respective channel

#

and no thats why im asking

worldly dawn Feb 12, 2024, 8:12 AM

#

rapid isle sorry could u refer me to the respective channel

#python-discussion or #❓｜how-to-get-help are a good start

worldly dawn Feb 12, 2024, 8:12 AM

#

rapid isle and no thats why im asking

think about the different boundaries and examples of values that fit in each one

#

that's also why I am asking you

#

so you can think about it

dry geyser Feb 12, 2024, 8:29 AM

#

morning!

#

An interesting side effect in polars:

        filtered_df = df.with_columns(
            pl.when(
                pl.col(column).is_not_null() & pl.col(column).str.contains(phone_regexp)
            ).then(
                pl.col(column).str.extract_groups(phone_regexp)
            ).otherwise(
                pl.lit(None)
            ).alias(column)
        )

This will create a structured column with as many None values as capture groups

#

I tried to fix that behavior to no avail

#

[.when([(col("PHONE3").is_not_null()) & (col("PHONE3").str.contains([String(^(?:(?P<country_code>+\d+)[\s-]+)?(?P<number>(?:\d[\s-]*)+)$)]))]).then(col("PHONE3").str.extract_groups()).otherwise(null.cast(Struct([Field { name: "country_code", dtype: String }, Field { name: "number", dtype: String }]))).alias("PHONE3")]

#

the dataframe does not contain any nulls in the columns parsed

#

 'phones': [{'country_code': None, 'number': None},
                 {'country_code': None, 'number': '5551234'},
                 {'country_code': None, 'number': '5551234'},
                 {'country_code': None, 'number': '5551234'}]}

sonic yoke Feb 12, 2024, 9:27 AM

#

Guys can someone recommend a good beginner course for python for datascience

dry geyser Feb 12, 2024, 10:49 AM

#

[2024-02-12 11:45:17,350] [MainProcess:MainThread] INFO: CSV: Processed 499999 lines in 1.66 seconds, 300776.14 lines/second

#

rewrote all the coalescing/transforms into expr engine query plans

final kiln Feb 12, 2024, 11:05 AM

#

sonic yoke Guys can someone recommend a good beginner course for python for datascience

Check the pinned messages

feral blade Feb 12, 2024, 11:24 AM

#

Does pytorch have yolo model? My lab asked to train object detecting using it and specifically asked to switch to pytorch for this assignment.

molten acorn Feb 12, 2024, 12:41 PM

#

Hi, I have a pandas dataframe which is grouped by a column named 'run'. Each group should have more or less the same amount of rows. In this dataframe, there is another column called 'total_data' and I would like to merge these groups into a single group, effectively eliminating the need for a 'run' column. While merging it would be nice if it took the mean of the row value of 'total_data' horizontally across each 'run' group, rather than the mean on the column itself. The end result should be a Series with the same amount of rows as a 'run' group. Could someone please assist me with this? I've been trying to solve this with AI but I can't seem to figure out the right combination of functions to call. Any help will be appreciated

#

Hopefully I explained that correctly. I'm new to pandas so I'm not sure if I'm describing the problem correctly

left tartan Feb 12, 2024, 1:29 PM

#

molten acorn Hi, I have a pandas dataframe which is grouped by a column named 'run'. Each gro...

Maybe let's tackle the first question first: you have two dataframes, and you want to "combine" then.

#

Do they have a common index or something to "join" them on?

molten acorn Feb 12, 2024, 1:30 PM

#

They have all the same columns, might differ in the number of rows by less than 5%

left tartan Feb 12, 2024, 1:31 PM

#

Yes, so sounds like you want to a left outer join then.

#

The place to start is: https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.merge.html:

#

df1.merge(df2, how="outer", ....)

#

There's a few different ways to merge. You can use "on=[list of columns to join on]", or left_index/right_index if the indexes is what you want to use.

molten acorn Feb 12, 2024, 1:34 PM

#

Ok, great, this gives me a path to follow. This will then be possible to merge values by taking the mean?

#

Across the dataframes

#

I'll experiment, thank you for the help

final kiln Feb 12, 2024, 1:38 PM

#

Not using seeds was a mistake

#

But also, I may be coming to the realization that 16gb of GPU is not gonna be enough to train this thing. What else could make this thing always converge to the same value other than it's just too small. Bert uncased is like 100M parameters

jagged latch Feb 12, 2024, 1:50 PM

#

I have a question to those experienced in Dash. Can you call another function that you defined earlier in the Python script to generate the dataframe inside a Callback function to replace the old dataframe with a new one?

#

Like I have a program that generates a Pandas dataframe based on a given date and I want to implement a date picker that will replace the data with different data from the other dates entered by the user in the text box.

#

Inside the Callback function, I called the function that originally generated the dataframe so that it can generate a new dataframe with the new date.

agile owl Feb 12, 2024, 2:52 PM

#

       spot = self.data.loc[curr_date, "spot"]
        spot_window = self.data.loc[prev_dates, "spot"].to_frame()
        log_spot_window = spot_window.apply(np.log)
        if self.current_step > 0:
            spot_returns = log_spot_window.unstack().diff().dropna().iloc[-1]
            spot_returns = spot_returns.values
        else:
            spot_returns = np.zeros(self.no_symbols)
        spot_window_vals = spot_window.values
        spot_values = spot.values
        spot_rank = get_percentile(spot_values, spot_window_vals, axis=0)

@left tartan pd.loc and pd.unstack are faster than pl.filter and pl.pivot in this code. particularly in the definition of spot_window and spot_returns

left tartan Feb 12, 2024, 2:53 PM

#

(I'm leaving this for the resident polars experts 🙂

left tartan Feb 12, 2024, 2:54 PM

#

agile owl ```py spot = self.data.loc[curr_date, "spot"] spot_window = self....

But could you describe the problem you're trying to solve?

agile owl Feb 12, 2024, 2:54 PM

#

my difficulty is always figuring out the right place to start and stop

#

for the appropriate amount of context

#

so basically

left tartan Feb 12, 2024, 2:56 PM

#

agile owl my difficulty is always figuring out the right place to start and stop

Basically, it's: You're starting with Dataframe X, and you want Dataframe Y... so maybe describe the starting state and ending state?

agile owl Feb 12, 2024, 2:56 PM

#

the starting state is I have the dataframe that represents the entire chunk deserialized as self.data

#

self.dates is the list of unique dates in data

#

I have an index counting which date I'm on

#

I need to update the current state given the date and self.data

#

!pastebin

arctic wedgeBOT Feb 12, 2024, 2:57 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

agile owl Feb 12, 2024, 2:57 PM

#

https://paste.pythondiscord.com/Y2RQ here's the whole method

#

the class is 550 lines so don't wanna post it and be rude unless you're really interested

left tartan Feb 12, 2024, 2:59 PM

#

Looks like you want the percentile of spot price over a window from current date -10 to +1?

agile owl Feb 12, 2024, 2:59 PM

#

that's part of it

#

I should probably split this up into multiple methods

#

at the end you see everything that goes into the return

#

lines 31-38

#

the docstring is also wrong

#

I forgot to update it

#

I was originally clustering for a single stock and haven't updated it

#

there's obviously some repetition and dumbness going on here but the critical part is I think the part I posted originally

#

polars profile:

#

pandas profile

#

the two most expensive high-level polars operations are filter and pivot

#

this corresponds to the pandas indexing and pandas unstack

#

so that kind of makes sense

#

but the polars version is just 2x slower

#

unfortunately I lost the polars code I had because I never committed it out of disgust at the results

#

surely not having an index comes at some cost though

#

if pandas can't beat polars at a loc index vs a generic filter then I'd be surprised because the whole reason pandas has the badness it does have is so it can optimize around indexing

final kiln Feb 12, 2024, 3:17 PM

#

It always comes down to the data

agile owl Feb 12, 2024, 3:22 PM

#

I mean, it also comes down to how the data is structured for access

#

I don't understand how polars gets rid of the index and doesn't pay a price

#

you either have an index that's set as a discrete action, set it every time you do an operation which seems extremely expensive, or don't have it

final kiln Feb 12, 2024, 3:26 PM

#

Ah I was talking about my thing

agile owl Feb 12, 2024, 3:26 PM

#

oo mb

final kiln Feb 12, 2024, 3:27 PM

#

Need to step back and re process the data using all the lessons I learned til now

agile owl Feb 12, 2024, 3:31 PM

#

the polars code was something like this before I tossed it

spot = self.data.filter(pl.col("date") == curr_date))
spot_window = self.data.filter(pl.col("date").is_in(prev_dates))

...
spot_returns = log_spot_window.pivot(index = "date", columns = "ticker", values="spot").drop("date").diff()

final kiln Feb 12, 2024, 3:31 PM

#

A 40M parameter model having the same loss graph as a 1M parameter one, like I can't even

final kiln Feb 12, 2024, 3:39 PM

#

agile owl the polars code was something like this before I tossed it ```py spot = self.da...

if you encode your text I bet you could use GPU for that

#

Probly not advisable

#

I think

final kiln Feb 12, 2024, 3:55 PM

#

#

It's also possible that these things converge slowly, that graph looks an awful lot like mine

#

well more or less, x axis is number of steps w max range being the end of the dataset

jagged latch Feb 12, 2024, 4:11 PM

#

I have a question. I'm trying to load into my Dash App with Debug Mode enabled. Does it usually take long?

past meteor Feb 12, 2024, 4:47 PM

#

agile owl ```py spot = self.data.loc[curr_date, "spot"] spot_window = self....

Can you show me the polars

agile owl Feb 12, 2024, 4:50 PM

#

I didn't commit it unfortunately but it was something like this:

spot = self.data.filter(pl.col("date") == curr_date))
spot_window = self.data.filter(pl.col("date").is_in(prev_dates))

...
spot_returns = log_spot_window.pivot(index = "date", columns = "ticker", values="spot").drop("date").diff()

#

I don't think polars has diff actually

#

but if my polars code was correct to begin with then I probably wouldn't have had this issue in the first place hah

past meteor Feb 12, 2024, 4:51 PM

#

It has diff

agile owl Feb 12, 2024, 4:53 PM

#

I also don't understand why polars should be faster than pandas if it never indexes the data?

past meteor Feb 12, 2024, 4:53 PM

#

They answer this very question in their docs

#

https://docs.pola.rs/user-guide/migration/pandas/

final kiln Feb 12, 2024, 4:54 PM

#

Is there a docs for training transformers, I'd be really happy if there was one

#

I'm just trying all this stuff until something sticks, not very efficient

past meteor Feb 12, 2024, 4:55 PM

#

agile owl I also don't understand why polars should be faster than pandas if it never inde...

It does index the data

#

There's just no Pandas index weirdness

#

that's the nuance

agile owl Feb 12, 2024, 4:55 PM

#

I see

#

so is filter wrong then

past meteor Feb 12, 2024, 4:56 PM

#

Giving an index based on inter position in Pandas is also just a bit pointless

#

It's the default, of course there's smarter ways to do it

#

But I don't see people doing that, llike picking an index in Pandas that aligns with their data access patterns

#

You also need to check what type of index Pandas uses, as you know in DBs there's many different kinds

#

Hash based indexes don't give you a lot if you're filtering like <

agile owl Feb 12, 2024, 4:57 PM

#

it's a multiindex on date as datetime64 and ticker as a string

#

I'm not using lt or gt just equals and isin

past meteor Feb 12, 2024, 4:58 PM

#

hmm then a hash index is good

#

I refer to: https://www.postgresql.org/docs/current/indexes-types.html for a concise overview

agile owl Feb 12, 2024, 4:59 PM

#

the issue isn't really with the DB

past meteor Feb 12, 2024, 4:59 PM

#

Yeah, I didn't mean that

#

I just linked this as a general resource about indexes

agile owl Feb 12, 2024, 4:59 PM

#

ah gotcha

past meteor Feb 12, 2024, 4:59 PM

#

Because they aren't magic ✨

past meteor Feb 12, 2024, 5:00 PM

#

agile owl I didn't commit it unfortunately but it was something like this: ```py spot = s...

Do you use the lazy api

#

or eager only?

agile owl Feb 12, 2024, 5:00 PM

#

I believe the read_database returned a lazy frame

past meteor Feb 12, 2024, 5:00 PM

#

No

agile owl Feb 12, 2024, 5:01 PM

#

it returns an eager frame?

past meteor Feb 12, 2024, 5:01 PM

#

yes

agile owl Feb 12, 2024, 5:01 PM

#

then i was using eager

past meteor Feb 12, 2024, 5:01 PM

#

Never use eager unless you have to imo

#

that's a speedup left on the table

agile owl Feb 12, 2024, 5:01 PM

#

can you read from db using lazy

past meteor Feb 12, 2024, 5:01 PM

#

No, query the db and call .lazy() immediately

agile owl Feb 12, 2024, 5:02 PM

#

thanks you're a lot more helpful than the polars discord

#

they have a beginner questions channel where no one answers beginner questions lol

past meteor Feb 12, 2024, 5:02 PM

#

🤣

agile owl Feb 12, 2024, 5:02 PM

#

typical of the rust community memes I have to admit

past meteor Feb 12, 2024, 5:02 PM

#

👈 polars simp # 1

agile owl Feb 12, 2024, 5:02 PM

#

they have a reputation for being elitists and thinking people haven't done enough work if they need help

#

I used to think like that then I realized being eager to ask questions has almost no downside on the internet

#

as long as they are somewhat reasonable..

past meteor Feb 12, 2024, 5:04 PM

#

Answering questions makes you think

#

they're without obligation

#

Nobody loses, unless someone is spamming or so

#

Oh, good addition about the lazy API is that it removes all footguns

#

You can't iterrows, maprows or whatever I see people doing

agile owl Feb 12, 2024, 5:05 PM

#

well time to rewrite everything I wrote yesterday but with lazy this time

#

I thought I must have really screwed something up bad so I tossed it all

#

lesson learned

past meteor Feb 12, 2024, 5:06 PM

#

last recommendation

agile owl Feb 12, 2024, 5:06 PM

#

I also didn't feel like branching because I was lazy

past meteor Feb 12, 2024, 5:06 PM

#

It will sound crazy

#

but 1) read all the docs. do it while you're eating cornflakes or whatever 2) browse through the method names in the API ref

#

Polars has the GOD tier method for time series

#

The danger is, if you focus on translating Pandas to Polars you'd never find it

#

https://docs.pola.rs/py-polars/html/reference/dataframe/api/polars.DataFrame.group_by_dynamic.html

#

https://docs.pola.rs/py-polars/html/reference/dataframe/api/polars.DataFrame.rolling.html

agile owl Feb 12, 2024, 5:08 PM

#

reading the whole API sounds a bit ambitious

past meteor Feb 12, 2024, 5:08 PM

#

Not reading it, just look at the names

#

ofc I didn't read the entire API

agile owl Feb 12, 2024, 5:09 PM

#

here's a funny thing

#

I asked copilot to retrieve what I wrote yesterday but it said it's not allowed to do it

#

but when I start retyping it it suggests what I had written

past meteor Feb 12, 2024, 5:09 PM

#

That there

#

Is sentience /s

agile owl Feb 12, 2024, 5:10 PM

#

that would be an interesting use for AI

#

"help I accidentally deleted this code I forgot to commit, replay your telemetry buffer"

#

XD

past meteor Feb 12, 2024, 5:11 PM

#

While you're at it replay Jeff Bezos' credit card details thanks

boreal gale Feb 12, 2024, 5:12 PM

#

past meteor Polars has the GOD tier method for time series

i had the exact opposite take O.o - i only just started using polars again after a long hiatus so a lot probably changed.
#data-science-and-ml message

past meteor Feb 12, 2024, 5:12 PM

#

boreal gale i had the exact opposite take O.o - i only just started using polars again after...

When was this?

boreal gale Feb 12, 2024, 5:13 PM

#

probably 1 year ago or even more

past meteor Feb 12, 2024, 5:13 PM

#

This time last year is when I turned the >1h Pandas data pipeline to ~15s polars

#

And I definitely used group by dynamic

#

Maybe it was longer ago then yeah

agile owl Feb 12, 2024, 5:14 PM

#

this is in the environment step function in a reinforcement learning setup