#data-science-and-ml | Python | Page 97

tidal bough Jan 26, 2024, 7:57 PM

#

seems right

#

in 3.12 you can simplify it a bit by using math.sumprod

scarlet siren Jan 26, 2024, 8:00 PM

#

tidal bough seems right

For
[[-1.0, 0.0, 1.0], [1.0, 0.0, -2.0], [-1.0, -1.0, 2.0]]
dot [[26], [20], [970]]
I'm getting
[[944.0], [-1914.0], [1894.0]]
but it should be
944
1966
1894

#

It's like a 3x3 matrix dot 1x3 matrix

tidal bough Jan 26, 2024, 8:01 PM

#

scarlet siren For [[-1.0, 0.0, 1.0], [1.0, 0.0, -2.0], [-1.0, -1.0, 2.0]] dot [[26], [20], [9...

!e no, your program is right

import numpy as np
A = np.array([[-1.0, 0.0, 1.0], [1.0, 0.0, -2.0], [-1.0, -1.0, 2.0]])
b = np.array([[26], [20], [970]])
print(A@b)

arctic wedgeBOT Jan 26, 2024, 8:01 PM

#

@tidal bough :white_check_mark: Your 3.12 eval job has completed with return code 0.

001 | [[  944.]
002 |  [-1914.]
003 |  [ 1894.]]

scenic token Jan 26, 2024, 8:38 PM

#

I am generating a plot of a graph with the networkx library

How can I make the spaces between my nodes larger in a circular draw

scarlet siren Jan 26, 2024, 8:43 PM

#

r1 + r2 = d1
r2 + d3 = d2
r3 + r4 = d3
r4 + r5 = d4

knowing that r1 = 2r5 how is the linear equation matrix constructed? (assuming r1 = 2R and r5 = R)

#

2R + r2 = d1
r2 + r3 = d2
r3 + r4 = d3
r4 + R = d4

would it be

2 1 0 0 R d1
0 1 1 0 x r2 = d2
0 0 1 1 r3 d3
1 0 0 1 r4 d4

or should I ignore r1 = 2r5

tidal bough Jan 26, 2024, 9:12 PM

#

either you solve for r1 manually like that and get a 4x4 matrix, yeah, or you rewrite r1 = 2r5 as r1 - 2r5 = 0 and then you have a 5x5 matrix.

scarlet siren Jan 26, 2024, 9:23 PM

#

And for the equation to have an answer, A has to have an inverse, right?

tidal bough Jan 26, 2024, 9:32 PM

#

If A has an inverse, then a solution exists, but I don't think the opposite has to be true - it's generally https://en.wikipedia.org/wiki/Rouché–Capelli_theorem

Rouché–Capelli theorem

In linear algebra, the Rouché–Capelli theorem determines the number of solutions for a system of linear equations, given the rank of its augmented matrix and coefficient matrix. The theorem is variously known as the:

Rouché–Capelli theorem in English speaking countries, Italy and Brazil;
Kronecker–Capelli theorem in Austria, Poland, Croatia, Ro...

scarlet siren Jan 26, 2024, 9:38 PM

#

A not having an inverse doesn't mean AX = B doesn't have an answer?

tidal bough Jan 26, 2024, 9:41 PM

#

No. Consider A = [[1,1],[0,0]], b = [[1],[0]]. A has determinant 0 and hence is noninvertible, yet A x = b has infinite solutions.

desert oar Jan 26, 2024, 10:24 PM

#

i think it's pretty common to do things like sentiment analysis etc. using simple models on top of pre-trained word vectors

#

i've certainly done it for text classification. word vectors basically just acting as dimension reduction at that point.

final kiln Jan 26, 2024, 10:25 PM

#

desert oar i think it's pretty common to do things like sentiment analysis etc. using simpl...

Yeah that's pretty much what happens

#

Tomorrow I'm gonna see if there's a threshold where the feed forward doesn't work

#

Presumably it won't work in cases where the text is more complex

#

And the context window is larger

final kiln Jan 26, 2024, 10:28 PM

#

final kiln Yeah that's pretty much what happens

Like, the transformer blocks were just getting in the way. Once I made the model super small it found a way to skip forward the attention heads and just use the embedder module + feed forward

#

What I did next was to just delete the transformer blocks altogether. The embedder + feed forward converged crazy quick

#

Haven't checked these things but

#

I think the embeddings themselves will come grouped into regions, negative words to one side and positive words to the other

#

And the only thing the feed forward does is count them in the input

scarlet siren Jan 26, 2024, 10:44 PM

#

Inverse on
[2, 1, 0, 0],
[0, 1, 1, 0],
[0, 0, 1, 1]
[1, 0, 0, 1]

gives back
[[-0.0, -0.0, -0.0, -0.0], [1.0, -0.0, -0.0, -0.0], [-1.0, 1.0, -0.0, -0.0], [1.0, -1.0, 1.0, -0.0]]

when tested with numpy I got
[[ 1 -1 1 -1]
[-1 2 -2 2]
[ 1 -1 2 -2]
[-1 1 -1 2]]

#

numpy version:

import numpy as np
import numpy.linalg as alg


def main():
    matrix = np.array([
        [2, 1, 0, 0],
        [0, 1, 1, 0],
        [0, 0, 1, 1],
        [1, 0, 0, 1]
    ])
    print(f'A = \n{matrix}')
    inv = alg.inv(matrix).astype(int)
    print(f'det(A) = {alg.det(matrix)}')
    print(f'A-1 = \n{inv}')
    print(f'det(A-1) = {alg.det(inv)}')
if __name__ == '__main__':
    main()

desert oar Jan 26, 2024, 11:13 PM

#

final kiln What I did next was to just delete the transformer blocks altogether. The embedd...

what that tells me is that the actual order and structure of the words is much less important than simply knowing which combinations of words are present in the sentence

desert oar Jan 26, 2024, 11:15 PM

#

final kiln I think the embeddings themselves will come grouped into regions, negative words...

that makes sense as well if you think about what the model is doing, it's basically just regression on top of embeddings at that point. it's also an illustration of why learning the embeddings along with the model is better than using pre-trained if possible

final kiln Jan 26, 2024, 11:25 PM

#

desert oar what that tells me is that the actual order and structure of the words is much l...

I reckon that it's true for small phrases. But when you get to essay type texts the model is gonna need to do almost like summarization internally

desert oar Jan 26, 2024, 11:26 PM

#

final kiln I reckon that it's true for small phrases. But when you get to essay type texts ...

I instinctively assumed you were looking at smaller phrases because that's what everybody does with sentiment analysis self study projects 😆 but I shouldn't have made that assumption

#

however I don't think it's necessarily invalid even on longer documents as long as they are well separated by their vocabulary. Consider book reviews for example

final kiln Jan 26, 2024, 11:26 PM

#

desert oar I instinctively assumed you were looking at smaller phrases because that's what ...

It's small phrases at the moment, but I'm gonna need to find a dataset with long texts

#

This is one of the tasks that I'll use to make the ablation study on the transformer

desert oar Jan 26, 2024, 11:27 PM

#

I bet I could build a book review sentiment classifier with > 50% accuracy by just looking for words in some fixed-size neighborhood of "bad" and "good" in a pretrained fasttext embedding space

final kiln Jan 26, 2024, 11:29 PM

#

Perhaps, depends on the complexity of the text. If it's thesis type of text for example, with an opinion on some geopolitical matter, a transformer is likely needed since it requires actual conceptual understanding

desert oar Jan 26, 2024, 11:30 PM

#

remember what transformers do: they construct a new sequence of vectors such that each individual vector in the new sequence represents its own context in the original sequence

#

so transformers only improve your model if context is important

desert oar Jan 26, 2024, 11:30 PM

#

final kiln Perhaps, depends on the complexity of the text. If it's thesis type of text for ...

right, it will definitely depend on the task

#

but I wouldn't say it's just a matter of length, more about the subtlety of ideas involved in the text

final kiln Jan 26, 2024, 11:31 PM

#

Yeah I'd say so too

#

Gonna need to scrape the web for datasets

#

This is actually a good exercise to get to speed with all the NLP tasks

#

Sentiment analysis, machine translation, topic classification, and a couple others I don't recall

#

I'm gonna use them to compare my variant, and then replicate the MetaFormer study but for NLP

#

I haven't found anyone doing it yet, don't know why

cinder jay Jan 26, 2024, 11:50 PM

#

Hi, i have the following code to segment the blood vessels of the eye:

import cv2
import numpy as np
import skimage

def vessel_segmentation(image):
    im_rgb = cv2.imread(image)
    
    # Extract green channel
    im_green = im_rgb[:, :, 1]
    
    # CLAHE enhancement
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8,8))
    im_enh = clahe.apply(im_green)

    # Negative
    im_gray = cv2.bitwise_not(im_enh)
    
    # Use Top-Hat transform
    se = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (21, 21))
    im_top = cv2.morphologyEx(im_gray, cv2.MORPH_TOPHAT, se)

    # OTSU Thresholding
    _, im_thre = cv2.threshold(im_top, 50, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)


    return im_thre

i have the following result:

#

but should be something like this:

manic linden Jan 27, 2024, 4:10 AM

#

Hey if I traspose a dataframe, shouldn't the size of the df always be the same? In my program I lose 3 columns when I do it?

serene scaffold Jan 27, 2024, 4:10 AM

#

manic linden Hey if I traspose a dataframe, shouldn't the size of the df always be the same? ...

if you transpose a dataframe, the shape will go from (a, b) to (b, a)

#

if you appear to be losing data, there must be more going on.

ashen galleon Jan 27, 2024, 4:41 AM

#

Q:
I have a discord bot, and, for every guild,
I have an image banning method using embeddings of the image, comparing to stored banned embeddings.
Currently,
the structure is just every embedding in one tensor
and it works, but I can't have the same embeddings for every guild.

I could simply store an element of the tensor that's the guild_id,
but that feels like an antipattern,
and that I should use the Pony ORM database I have, and then have the guild_id-tensor pairs.

Is there a preexisting standard for this sort of application?

viral field Jan 27, 2024, 6:23 AM

#

I have seen lots of videos about creating object detectors using a camera or webcam
But can't we do the same thing on our PC or mobile screen?

limber mesa Jan 27, 2024, 6:44 AM

#

ashen galleon Q: I have a discord bot, and, for every guild, I have an image banning method u...

maybe check #discord-bots ?

final kiln Jan 27, 2024, 9:07 AM

#

anyone knows a good literature review on transformers ? I'm looking for something done in 2023

fallen dagger Jan 27, 2024, 9:46 AM

#

Demo of a CLI tool I built over the weekend that connects with Google's Gemini LLM and use it with your files.

It lets you add your own custom commands as well, so you can further enhance the CLI or use it to interact with the LLM and your files however you want.

final kiln Jan 27, 2024, 10:07 AM

#

#

don't know if this is a limitation of the dataset or of the feedforward

orchid cargo Jan 27, 2024, 1:59 PM

#

Does anyone know how to make a weather forecast? Maybe it's impossible.

desert oar Jan 27, 2024, 2:06 PM

#

final kiln anyone knows a good literature review on transformers ? I'm looking for somethin...

let me know if you find one

desert oar Jan 27, 2024, 2:07 PM

#

orchid cargo Does anyone know how to make a weather forecast? Maybe it's impossible.

it's possible, but whether it's something you can do at home versus a serious multi-year research project depends a lot on the scale and scope that you intend

#

if you're trying to forecast the temperature at a single location, you can do OK with traditional timeseries methods, but your predictions will at best only be interpretable as an average

#

beyond that, you're getting into meteorology, not just statistics or machine learning

feral sand Jan 27, 2024, 2:13 PM

#

orchid cargo Does anyone know how to make a weather forecast? Maybe it's impossible.

for what i know you can try LSTM for some decent results

river cape Jan 27, 2024, 2:23 PM

#

Hey guys I have these 3 problem statements , how do I forward with these problem?
PS1 - Anonymise user identities in large databases to ethically employ machine learning in understanding customer trends and behaviour without violating their ri`ghts.
PS2 - Create a blockchain-powered platform allowing users to lease their information to social media services, with the assurance that the data will not be retained upon exiting the service.
PS3 - Develop an AI based solution to offer timely insights into current global hacking trends, prioritising potential threats based on their likelihood of targeting specific enterprises.

final kiln Jan 27, 2024, 2:34 PM

#

For sentiment analysis, I can't have the embedder module be learnable, it's gonna overfit every single time. I looked around and people seem to start with a pre trained one and stick a feed forward on top.

So I decided to just train one by getting a transformer to first do next token prediction.

But at that point might as well just let the next token prediction be the sentiment. I have a bit of padding on the sequences and the last token is the classification

#

The expectation is that the transformer will be obliged to learn syntax as it does on the normal next token pred. It's training rn will see if it works out. At least it's not overfiti g

left tartan Jan 27, 2024, 3:19 PM

#

river cape Hey guys I have these 3 problem statements , how do I forward with these problem...

What’s the context here? Are you expected to do all three? Or to explain how you’d approach them? Is this just an essay question?

final kiln Jan 27, 2024, 3:39 PM

#

final kiln

50 epochs in with the next token prediction way, no overfitting and passed a similar test as this one

molten elk Jan 27, 2024, 3:50 PM

#

Does anyone know if in this code the binomial function only counts whether something is a 0 or 1? These values refer to heads and tails respectively

desert oar Jan 27, 2024, 3:55 PM

#

molten elk Does anyone know if in this code the `binomial` function only counts whether som...

the "binomial distribution" describes the number of "successes" after a sequence of a "experiments" or "trials". the classic example is the number of heads in a sequence of coin flips. this code generates random numbers according to that distribution. each draw from that distribution is simulating 1000 coin flips with a 70% chance of heads.

desert oar Jan 27, 2024, 3:59 PM

#

final kiln For sentiment analysis, I can't have the embedder module be learnable, it's gonn...

i haven't seen that technique before, seems worth exploring. you might need to "clip" the final token to only have nonzero score for the classes and 0 for actual words

final kiln Jan 27, 2024, 4:01 PM

#

desert oar i haven't seen that technique before, seems worth exploring. you might need to "...

I've allocated 5 special tokens, one for each sentiment, which I'm assigning to the last position in the sequence

#

I think it is working, the limitation will be the dataset, which won't include stuff I can come up with, like sarcasm or the entire phrase being positive and then end with "jk, it was the opposite"

molten elk Jan 27, 2024, 4:04 PM

#

desert oar the "binomial distribution" describes the number of "successes" after a sequence...

Can you explain how the values of the numbers generated mean success or failure?

molten elk Jan 27, 2024, 4:08 PM

#

desert oar the "binomial distribution" describes the number of "successes" after a sequence...

Btw if instead of a coin, can we still use the binomial distribution to check success in a die for instance

final kiln Jan 27, 2024, 4:21 PM

#

Well, it seems like it worked, gonna run over some stats after a well deserved break

But this will do, I can train both the transformer and the metric tensor net, it gives me clear performance metrics, etc

#

Next task will be summarization

desert oar Jan 27, 2024, 4:29 PM

#

molten elk Can you explain how the values of the numbers generated mean success or failure?

each number generated is the number of successes. if you have binomial(1000,0.7) and get 670, that's 670/1000 heads

wooden sail Jan 27, 2024, 4:31 PM

#

as for the dice, you have to do some prep work yourself in defining what counts as a "success"

#

for a standard 6-sided fair die, you'd think of which sides represent a success. then for a single roll of the die, this determines the value of p, the probability of success

desert oar Jan 27, 2024, 5:43 PM

#

molten elk Btw if instead of a coin, can we still use the binomial distribution to check su...

in general you need 6 "categories" for a die. success/fail as in binomial is only 2. the more general case of >2 categories is called "multinomial".

#

but yes if you can interpret some outcome of the die roll as "success" and other outcomes as "failure" (eg a saving throw in D&D) then yes you can use the binomial distribution

#

and by the way, binomial with exactly 1 trial has a special name: the Bernoulli distribution. a binomial distribution is the sum of independent draws from a Bernoulli distribution

#

and likewise for multinomial. a multinomial distribution is the sum of independent draws from a categorical distribution

#

Wikipedia articles for probability distributions are usually interesting reference points, even though most other stats articles on Wikipedia are not great

final kiln Jan 27, 2024, 6:05 PM

#

What does it mean when the validation loss is lower than the training loss

#

I don't think I have data leak, it only happens on certain hyper parameters

desert oar Jan 27, 2024, 6:08 PM

#

final kiln What does it mean when the validation loss is lower than the training loss

is it robust to resampling? could be just weird luck

final kiln Jan 27, 2024, 6:09 PM

#

desert oar is it robust to resampling? could be just weird luck

Resampling ?

#

I'm shuffling the training data on every epoch

#

Have a running average for both sets

#

Ah they just flipped

desert oar Jan 27, 2024, 6:10 PM

#

final kiln Resampling ?

like, re-splitting train and test

final kiln Jan 27, 2024, 6:11 PM

#

desert oar like, re-splitting train and test

Oh, I haven't tried. The dataset readme instructs to use the indicated split for comparison with literature

#

Now it's overfiting ah

desert oar Jan 27, 2024, 6:12 PM

#

final kiln Oh, I haven't tried. The dataset readme instructs to use the indicated split for...

is it just a train/test split, or train/validation/test?

final kiln Jan 27, 2024, 6:13 PM

#

desert oar is it just a train/test split, or train/validation/test?

The labels they gave are train/test/dev

desert oar Jan 27, 2024, 6:14 PM

#

final kiln The labels they gave are train/test/dev

okay, so you're hopefully not using test for the loss curve during training

final kiln Jan 27, 2024, 6:14 PM

#

desert oar okay, so you're hopefully not using test for the loss curve during training

I'm training on train and using test for validation

desert oar Jan 27, 2024, 6:14 PM

#

is that what they say to do? normally "test" is reserved for checking at the very end

final kiln Jan 27, 2024, 6:14 PM

#

I decreased the size of the model and now the val is larger

desert oar Jan 27, 2024, 6:15 PM

#

the names are confusing and disagree with common english usage

final kiln Jan 27, 2024, 6:15 PM

#

desert oar is that what they say to do? normally "test" is reserved for checking at the ver...

Uhm, I just need a split to follow along to check when the loop is overfiting

cinder jay Jan 27, 2024, 6:24 PM

#

final kiln anyone knows a good literature review on transformers ? I'm looking for somethin...

search in pages like Springer, Mendeley
there's a lot of works in that topics bro

final kiln Jan 27, 2024, 6:31 PM

#

Thanks, I'll check it out

#

But I reckon it might be picking up on some pattern that is more pronounced in the validation set

desert oar Jan 27, 2024, 6:34 PM

#

final kiln Uhm, I just need a split to follow along to check when the loop is overfiting

right, use dev for that

final kiln Jan 27, 2024, 6:35 PM

#

desert oar right, use dev for that

Bit confusing terminology

#

But shouldn't matter right

desert oar Jan 27, 2024, 6:35 PM

#

a bit? very confusing

final kiln Jan 27, 2024, 6:35 PM

#

As long as there's no leakage

desert oar Jan 27, 2024, 6:35 PM

#

yes, as long as the set you use to "follow along" during training is not the one you use for final score

final kiln Jan 27, 2024, 6:36 PM

#

I'm now just reducing model size till it stops overfiting

#

I know I got this before

river cape Jan 27, 2024, 6:39 PM

#

left tartan What’s the context here? Are you expected to do all three? Or to explain how you...

I have selected the PS1 as my statement , so i need suggestions or advice on how do I proceed?

left tartan Jan 27, 2024, 6:39 PM

#

river cape I have selected the PS1 as my statement , so i need suggestions or advice on how...

You didn’t answer my questions.

river cape Jan 27, 2024, 6:40 PM

#

left tartan You didn’t answer my questions.

These are 3 problem statements given to me I have to choose one

left tartan Jan 27, 2024, 6:40 PM

#

Ok, now that you’ve chosen one, what are you expected to do?

river cape Jan 27, 2024, 6:40 PM

#

Anonymise user identities in large databases to ethically employ machine learning in understanding customer trends and behaviour without violating their rights.

left tartan Jan 27, 2024, 6:41 PM

#

Is this just an essay question? A coding assignment? Etc

river cape Jan 27, 2024, 6:41 PM

#

I need to implement a machine learning model for a given database which understands the customer trends and behaviour of the customer without enclosing thier details

#

Its a hackathon

ashen galleon Jan 27, 2024, 6:43 PM

#

limber mesa maybe check <#343944376055103488> ?

I don't think so-
The issue is entirely about the database, so possibly databases, but most database people probably know less about embeddings than most ML people.

merry briar Jan 27, 2024, 6:47 PM

#

@lapis sequoia the problem was the learn function I was doing self.weights[j][i] instead of self.weights[layer_i][i][j] so the changing one weight would change the other instead so it couldn't caluate the cost for the weight it was thinking about

final kiln Jan 27, 2024, 6:53 PM

#

I'm on one block with 2 heads

#

Kind of insane that this miniscule model is threatening to memorize the data ._.

cinder jay Jan 27, 2024, 7:02 PM

#

hey, how opencv subtract works?
i don't get it

final kiln Jan 27, 2024, 7:11 PM

#

Aaaah back to regularization

left tartan Jan 27, 2024, 7:18 PM

#

river cape I need to implement a machine learning model for a given database which understa...

With any data science type question, first step (imo) is to understand the data. Basic EDA stuff. Determine how what cleansing, transformation, normalization, etc is needed to prepare the data. Come up with a train/test split strategy. Determine your X and y variables, etc.

final kiln Jan 27, 2024, 7:26 PM

#

Increased model size, included L1 and L2, model size affects LR schedule which might've actually been messing up the other loops

#

But at some point I'm gonna have to do data augmentation or find myself a larger set

frigid owl Jan 27, 2024, 8:25 PM

#

hey guys i need help saving architecture from autokeras

#

i trained a text classifier and i just want to save architecture not the already trained model

#

is there a way to do this?

left tartan Jan 27, 2024, 10:12 PM

#

@agile owl it’s not that it gets used the most times… usually you’re sliding the model forward and doing N tests, not a single test

#

Ie; train on 2019-2022, and test against 2023

#

Or start at 2010-2015, then walk forward

agile owl Jan 27, 2024, 10:13 PM

#

how do you get any sort of signal against a recent trend then

#

if you want behavior from a high rate environment

#

etc.

left tartan Jan 27, 2024, 10:13 PM

#

agile owl how do you get any sort of signal against a recent trend then

You want to train on 2023 and see how it would’ve performed against 2020?

agile owl Jan 27, 2024, 10:14 PM

#

let's say you're trading equity indices right

#

now equity indices have different correlations to rates/inflation in different historical periods

left tartan Jan 27, 2024, 10:14 PM

#

(We haven’t gotten there yet, but Monte Carlo is also a topic to discuss)

agile owl Jan 27, 2024, 10:14 PM

#

let's say you went from a low inflation environment where inflation was associated with higher returns on equities

#

but now you're in a high inflation environment and the opposite is true

#

and the last time you had something like this was 10 years ago

#

what is your sliding window gonna do

left tartan Jan 27, 2024, 10:15 PM

#

agile owl now equity indices have different correlations to rates/inflation in different h...

In your case, you’re asking how well a strategy would’ve worked in different market regimes?

agile owl Jan 27, 2024, 10:15 PM

#

right

#

if you use too short a window, for instance

#

and any window is probably too short given how far back our datasets go in finance

#

the bias of your dataset depends on the timeframe right

#

if you made a trading bot in 2008-2009 to trade treasuries and it just always bought, that would be the right thing

#

if you made a trading bot including 2012-2021 and tried to use it in 2022 and it just always bought bonds

#

you'd just get blown up by the Fed

left tartan Jan 27, 2024, 10:18 PM

#

You could certainly test a model against some historical era, it’s just somewhat inevitable that you’re overfitting tho. This is where, perhaps, I’d Monte Carlo it rather that test against market actual

agile owl Jan 27, 2024, 10:19 PM

#

sure but implicit in the montecarlo design is that you understand the market dynamics well enough to produce a better sample than historical conditioned on the current environment

#

which is a big claim

left tartan Jan 27, 2024, 10:19 PM

#

Yah, absent a multiverse, what’s the alternative?

agile owl Jan 27, 2024, 10:19 PM

#

pretending that history repeats itself

#

that's the necessary axiom to any of this anyway right

left tartan Jan 27, 2024, 10:21 PM

#

Not precisely, the necessary axiom is that there’s patterns, but not that the patterns exhibit the same order/etc

left tartan Jan 27, 2024, 10:23 PM

#

agile owl that's the necessary axiom to any of this anyway right

But I feel you, enjoying talking about this (few folks here engage on this topic)!

agile owl Jan 27, 2024, 10:23 PM

#

sure me too

#

so it's hard to think about the assumptions we are implicitly making about what historical behavior means about future behavior sometimes

left tartan Jan 27, 2024, 10:23 PM

#

My favorite pres on this topic: https://www.davidhbailey.com/dhbtalks/battle-quants.pdf

#

YouTube version: https://youtu.be/e3h9xf3p1DE?feature=shared

agile owl Jan 27, 2024, 10:24 PM

#

I think it often goes unsaid what people are actually assuming with respect to that

#

my favorite thing that has no real theoretical basis but gets used by everyone is implied to realized volatility ratios

left tartan Jan 27, 2024, 10:26 PM

#

agile owl so it's hard to think about the assumptions we are implicitly making about what ...

Yah, the entire market is so circular

agile owl Jan 27, 2024, 10:26 PM

#

one is future looking and the other is past looking

#

but everyone uses them in every asset class

#

what people should look at is the implied volatility from X days ago vs the realized volatility

#

but that doesn't even answer the same question

left tartan Jan 27, 2024, 10:27 PM

#

lol, that’s interesting to people like us. But to the market players, they have zero hindsight

agile owl Jan 27, 2024, 10:28 PM

#

also the meaning of realized volatility is completely different if you're delta hedging vs not

#

but yeah who cares

#

low ratio good high ratio bad

left tartan Jan 27, 2024, 10:31 PM

#

Whelp, off to dinner, nice chat!

agile owl Jan 27, 2024, 10:31 PM

#

yup u 2

stiff axle Jan 27, 2024, 11:46 PM

#

I installed miniconda but when I type python I get the non conda version. My terminal also doesn't detect the conda command. Do I need to add this manually to my environment variables? I'm asking because during the installation process it said doing so was not recommended.

serene scaffold Jan 27, 2024, 11:49 PM

#

stiff axle I installed miniconda but when I type python I get the non conda version. My ter...

is there a reason you want to use *conda? because my suggestion is to not.

stiff axle Jan 27, 2024, 11:50 PM

#

serene scaffold is there a reason you want to use *conda? because my suggestion is to not.

I thought it was the de facto virtual environment manager for python

left tartan Jan 28, 2024, 12:15 AM

#

stiff axle I thought it was the de facto virtual environment manager for python

-some- people use conda, but there are multiple package managers. And conda is certainly not the most popular.

#

That said; In some data science circles, Conda is well entrenched.

#

For a discussion of this: https://www.reddit.com/r/Python/comments/10bxkjp/what_are_people_using_to_organize_virtual/

sterile flare Jan 28, 2024, 1:09 AM

#

HELP idk whats wrong

sterile flare Jan 28, 2024, 1:09 AM

#

left tartan -some- people use conda, but there are multiple package managers. And conda is c...

is conda the same as Anaconda ?

serene scaffold Jan 28, 2024, 1:11 AM

#

stiff axle I thought it was the de facto virtual environment manager for python

conda was designed to solve certain problems for data scientists, not the whole python community. but in my opinion, those problems have since largely been solved for all of python, so conda only serves to create a barrier between its users and the rest of the python community.

serene scaffold Jan 28, 2024, 1:11 AM

#

sterile flare HELP idk whats wrong

Do print(df.columns.tolist()) and put the text in the chat. As a general rule, please do not post screenshots of text.

sterile flare Jan 28, 2024, 1:11 AM

#

serene scaffold Do `print(df.columns.tolist())` and put the text in the chat. As a general rule,...

why?

serene scaffold Jan 28, 2024, 1:12 AM

#

sterile flare why?

Because they are harder to read and can't be copied from.

sterile flare Jan 28, 2024, 1:12 AM

#

serene scaffold Because they are harder to read and can't be copied from.

oh, sorry then

serene scaffold Jan 28, 2024, 1:12 AM

#

helping people often involves googling error messages or running segmants of their code. and it's rude to expect people to retype stuff by hand.

sterile flare Jan 28, 2024, 1:13 AM

#

serene scaffold helping people often involves googling error messages or running segmants of the...

no i'm not expecting people to repeat my code, i only want an explanation or a guideness

#

i apologize again

serene scaffold Jan 28, 2024, 1:14 AM

#

Don't worry about it for now. Just run print(df.columns.tolist()) and put the resultant text in the chat.

sterile flare Jan 28, 2024, 1:17 AM

#

serene scaffold Don't worry about it for now. Just run `print(df.columns.tolist())` and put the ...

our professor said in order to display a column with all its rows in the data frame we should do df.loc[: , 'wanted column name'].head() , the print(df.columns.tolist()) is an equivalent to this ? and why didn't this code work , why it said keyerror and what does it mean

serene scaffold Jan 28, 2024, 1:17 AM

#

sterile flare our professor said in order to display a column with all its rows in the data fr...

No, I'm asking you to run the code and show the result; it's information that I need to be able to help you

sterile flare Jan 28, 2024, 1:17 AM

#

serene scaffold No, I'm asking you to run the code and show the result; it's information that I ...

ohhh

#

okey then wait

serene scaffold Jan 28, 2024, 1:17 AM

#

But it appears that 'England' is not the name of a column in your dataframe

sterile flare Jan 28, 2024, 1:18 AM

#

exactly it is thats the problem

#

thats why i was confused

#

wanna show u my dataframe ?

#

can i do a ss ?

serene scaffold Jan 28, 2024, 1:18 AM

#

I wanted you to run print(df.columns.tolist()) and put the text in the chat.

sterile flare Jan 28, 2024, 1:19 AM

#

['A', 'B', 'c', 'D'] this is what appeared, i guess it is the D column named england

#

this is weird tho

serene scaffold Jan 28, 2024, 1:20 AM

#

Okay, so there's no column named England. But you expected there to be one with that name. Should the England column have been there when the dataframe was initially created?

sterile flare Jan 28, 2024, 1:20 AM

#

serene scaffold I wanted you to run `print(df.columns.tolist())` and put the text in the chat.

so this gives me the names of the my dataframe columns? and u wanted to check if england was one of them ?

serene scaffold Jan 28, 2024, 1:20 AM

#

It might help if you show the code that creates the dataframe, and everything that comes after it

#

!paste

arctic wedgeBOT Jan 28, 2024, 1:20 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

serene scaffold Jan 28, 2024, 1:21 AM

#

sterile flare so this gives me the names of the my dataframe columns? and u wanted to check if...

Yes

sterile flare Jan 28, 2024, 1:21 AM

#

serene scaffold Okay, so there's no column named England. But you expected there to be one with ...

yes , wanna show u my dataframe ?

serene scaffold Jan 28, 2024, 1:22 AM

#

sterile flare yes , wanna show u my dataframe ?

Copy and paste all the code and put it in the paste bin, per the instructions a few messages up.

#

I want to see the code, in this instance. Not the dataframe.

sterile flare Jan 28, 2024, 1:22 AM

#

whats a paste bin

serene scaffold Jan 28, 2024, 1:22 AM

#

arctic wedge

Please read this message

sterile flare Jan 28, 2024, 1:25 AM

#

https://paste.pythondiscord.com/PHWQ

#

i guess that's what i was asked to do ?

serene scaffold Jan 28, 2024, 1:25 AM

#

Yes

#

Thank you

sterile flare Jan 28, 2024, 1:25 AM

#

i think i have 2 df

serene scaffold Jan 28, 2024, 1:25 AM

#

It looks like you have two dataframes: data_frame and df.

#

and data_frame is based on whatever defra_consumption.csv is.

sterile flare Jan 28, 2024, 1:25 AM

#

yes, sorry i'm not thinking straight

serene scaffold Jan 28, 2024, 1:26 AM

#

That's okay

sterile flare Jan 28, 2024, 1:26 AM

#

serene scaffold and `data_frame` is based on whatever `defra_consumption.csv` is.

i didn't understand this

serene scaffold Jan 28, 2024, 1:26 AM

#

sterile flare i didn't understand this

Look at lines 3 and 4. You define file_path as the location for a certain CSV file. And then data_frame is a dataframe that represents whatever is in file_path

#

so whatever columns are in defra_consumption.csv will be the columns for data_frame (but not df). It might be that 'England' is one of them.

#

make sense, @sterile flare?

sterile flare Jan 28, 2024, 1:31 AM

#

serene scaffold Look at lines 3 and 4. You define `file_path` as the location for a certain CSV ...

yes true

sterile flare Jan 28, 2024, 1:31 AM

#

serene scaffold so whatever columns are in `defra_consumption.csv` will be the columns for `data...

exactly

serene scaffold Jan 28, 2024, 1:31 AM

#

great 🙏

sterile flare Jan 28, 2024, 1:32 AM

#

this is what i realized after remembering i have 2 dfs

sterile flare Jan 28, 2024, 1:32 AM

#

serene scaffold great 🙏

thank you so much, you're the sweetest

#

and sorry if i was rude

serene scaffold Jan 28, 2024, 1:32 AM

#

No worries, now you know for the future 🙏

stiff axle Jan 28, 2024, 2:18 AM

#

I appreciate y’alls input on the conda advice. Ima just run with it and see how it goes.

cinder jay Jan 28, 2024, 2:19 AM

#

Hi guys, im programming a python script to segment retinal blood vessel, the current result is this:

#

I've applied CLAHE and another pre processing

#

how do i remove the circle and the small particles¿¿¿

agile owl Jan 28, 2024, 2:35 AM

#

you could use object detection and mask over the pixels where they are detected

past meteor Jan 28, 2024, 6:21 AM

#

agile owl so it's hard to think about the assumptions we are implicitly making about what ...

What is often said about time series is that ML models aren't made to predict the future, we just use them to do so

final kiln Jan 28, 2024, 8:55 AM

#

this model is gonna get trained, whether it likes it or not

ashen axle Jan 28, 2024, 8:57 AM

#

I am attempting to optimize a baseline fitting algorithm for signal preprocessing by instantiating it as a SKLearn Custom Estimator and using GridSearchCV. How can I define a custom score to optimize the baseline fit, i.e. maximise fit function smoothness and intersections with the signal function? Also, is that a rational way of defining an optimal fit?

The end goal is a SKLearn Pipeline with several preprocessing steps prior to deconvolution through model fit optimization.

past meteor Jan 28, 2024, 9:13 AM

#

final kiln this model is gonna get trained, whether it likes it or not

is this vanilla SGD?

past meteor Jan 28, 2024, 9:14 AM

#

ashen axle I am attempting to optimize a baseline fitting algorithm for signal preprocessin...

@wooden sail this is one for you

final kiln Jan 28, 2024, 9:14 AM

#

past meteor is this vanilla SGD?

I'm batching the dataset and reshuffling it, using Adam as the optimizer

#

loss/train is large because I'm adding l1

past meteor Jan 28, 2024, 9:15 AM

#

I'm too used to using drop out to ever look at train loss

final kiln Jan 28, 2024, 9:16 AM

#

when I started this thing I didn't know about dropout, it's not coded into the transformer yet

past meteor Jan 28, 2024, 9:17 AM

#

I was mostly intrigued by how large the learning rate is and how the val loss isn't dropping that much at the end

#

But honestly, it's a meaningless observation on my part 😄 they could be reasonable numbers for your domain

final kiln Jan 28, 2024, 9:18 AM

#

past meteor But honestly, it's a meaningless observation on my part 😄 they could be reasona...

it's a valid observation, it has been overfitting every time

bronze robin Jan 28, 2024, 9:18 AM

#

Any method to calculate standard error of c, for a best fit line given by the equation log(y) = mlog(x) + log(c) ?

final kiln Jan 28, 2024, 9:18 AM

#

at some point val will plateu and start to grow

past meteor Jan 28, 2024, 9:18 AM

#

I always start with a learning rate of exactly 3e-4

#

And train with early stopping

final kiln Jan 28, 2024, 9:19 AM

#

I'm using the LR schedule originally proposed in the 2017 paper, where the transformer was introduced

#

tho the formula doesn't seem to work as intended for small dimensions like these

#

I adapted it so that it defaults to 1e-3 if it tries to output larger values

#

but the intend behaviour is that it starts super small, grows up to 1e-3 or something, and then exponentially decays

past meteor Jan 28, 2024, 9:21 AM

#

final kiln I'm using the LR schedule originally proposed in the 2017 paper, where the trans...

oh that's solid then. The most important thing is having a decent lr to start with. If the paper suggests one, roll with that 😄

final kiln Jan 28, 2024, 9:25 AM

#

I might need to modify it tho, I don't think they tested it for these values

wooden sail Jan 28, 2024, 9:35 AM

#

ashen axle I am attempting to optimize a baseline fitting algorithm for signal preprocessin...

this is a pretty broad topic and the score depnds a lot on your application. kinda rough without further context. regarding smoothness and intersections with the data, you can achieve this by having an L2 error term, which measures how good the fit is at points you specify, and then an additional term measuring how large the derivative is, trying to minimize both. this is also roughly the idea behind classical methods like splitting the signal with low and high pass filters first

ashen axle Jan 28, 2024, 9:42 AM

#

wooden sail this is a pretty broad topic and the score depnds a lot on your application. kin...

its a chromatographic signal with gradient elution producing a curved baseline which i need to remove. ideally after correction, a maximal number of peak bases intercept with zero.

ashen axle Jan 28, 2024, 9:43 AM

#

wooden sail this is a pretty broad topic and the score depnds a lot on your application. kin...

the baseline fitting algorithm essentially smoothes windows of the signal until the baseline is estimated, the hyperparameters to optimize are window size and number of smoothing iterations

ashen axle Jan 28, 2024, 9:46 AM

#

wooden sail this is a pretty broad topic and the score depnds a lot on your application. kin...

so as I understand it, i would mask the signal to find the “zero” points, i.e. local minima, and see how well the estimation fits them? Is the size of the derivative the smoothness metric? I thought minimizing the second derivative would be the way to go.

I was thinking of scoring by measuring how many zero points existed in the signal minus the baseline, avoiding the need to find local minima

wooden sail Jan 28, 2024, 9:49 AM

#

the size of the derivative is a measure of a type of smoothness, yes. there are several kinds of smoothness, not just the basic "is the function even differentiable". many of the definitions involve finding a bound for how much the function changes if you change the input

#

and indeed, for spectral data like yours, one tends to look for the local minima and try to make those points zero, meaning the baseline function doesn't need to pass through all data points exactly

#

there are many papers discussing this topic of you just look up "baseline fitting algorithm" on google scholar, most of them on spectrographic data

#

sadly you'll probably find that in your data, even after optimization, probably 0 points will be exactly 0 😛 so oyu'll have 0 or only very few zero crossings

#

1d peak-finding is not very complicated though, you could almost consider this an input to your procedure and just calculate them ahead of time. this would tell you which points in the domain to include into your cost function

ashen axle Jan 28, 2024, 9:59 AM

#

wooden sail 1d peak-finding is not very complicated though, you could almost consider this a...

absolutely. the baseline correction is actually an attempt to increase the accuracy of scipy.peak_widths by removing convolution caused by presence of baseline.

ashen axle Jan 28, 2024, 10:03 AM

#

wooden sail there are many papers discussing this topic of you just look up "baseline fittin...

This is true. My question is however, how to implement optimisation through SKLearn grid search, as I want to construct the processing pipeline in SKLearn with XGBoost classification modeling on the processed signals. I get the feeling im on the right track though - define the metrics, say L2 norm and derivative minimization, and implement as a custom scorer for the grid search

wooden sail Jan 28, 2024, 10:04 AM

#

sadly the specifics of sklearn are beyond me, i don't use it. but that sounds about right

#

sklearn's grid search says something about defining the estimator with a score function, which i presume entails defining a function for the baseline, a score, and having sklearn fiddle with the baseline function's parameters through grid search

ashen axle Jan 28, 2024, 10:06 AM

#

wooden sail sklearn's grid search says something about defining the estimator with a score f...

fair dues. btw this is the algorithm I applying:

https://journals.sagepub.com/doi/10.1366/000370208783412762

final kiln Jan 28, 2024, 10:09 AM

#

made a small change in the output data, removed all regularization, it's looking promising

#

too soon tho

#

but gonna let it cook

ashen axle Jan 28, 2024, 10:12 AM

#

is there a particular framework you recommend working in for problems such as this? Thus far everything had been written as python classes from scratch, based around pandas, with scipy.optimize for curve fitting

final kiln Jan 28, 2024, 10:25 AM

#

reduced the model size by half and adapted the LR scheduler to have its intended behavior, I'm getting there for sure

#

like, it's not baaad

#

it's very sensitive to punctuation changes, so there's quite a bit of data augmentation that can be done here

river cape Jan 28, 2024, 11:34 AM

#

can anyone provide some references to federated learning and differential privacy

final kiln Jan 28, 2024, 11:50 AM

#

final kiln like, it's not baaad

I think the lesson I'm gonna take from this one is that I should define my end goal more clearly. I'm trying to make the model not overfit, but I don't even know if the value b4 overfit is good or not.

cinder jay Jan 28, 2024, 11:50 AM

#

Hi, im implementing a python script using opencv that segment the retinal blood vessel, the current result are this one:

#

how do i remove the small points????

final kiln Jan 28, 2024, 11:51 AM

#

cinder jay how do i remove the small points????

Apply an erosion kernel

#

But also, try to not produce them in the first place

cinder jay Jan 28, 2024, 11:52 AM

#

which kernel size???

final kiln Jan 28, 2024, 11:52 AM

#

No idea, you have to experiment with it

outer jasper Jan 28, 2024, 11:53 AM

#

can anyone help me with pytorch? torch.cuda.is_available() returns false even though i installed cuda

final kiln Jan 28, 2024, 11:53 AM

#

outer jasper can anyone help me with pytorch? torch.cuda.is_available() returns false even th...

Uhm, run nvidia-smi in the terminal

cinder jay Jan 28, 2024, 11:54 AM

#

final kiln No idea, you have to experiment with it

kay

#

ty

outer jasper Jan 28, 2024, 11:55 AM

#

final kiln Uhm, run nvidia-smi in the terminal

it opened then closed immediately

final kiln Jan 28, 2024, 11:56 AM

#

outer jasper it opened then closed immediately

Shouldn't open anything at all, should just print out a table reporting the status of the GPU

#

Which OS r u using

outer jasper Jan 28, 2024, 11:56 AM

#

windows

final kiln Jan 28, 2024, 11:56 AM

#

Uhm

outer jasper Jan 28, 2024, 11:56 AM

#

it opend a console then closed

final kiln Jan 28, 2024, 11:57 AM

#

Literally never debugged this on windows, but I expect funky non sensical behavior like this

#

Maybe try to go via the Linux subsystem thing

tidal bough Jan 28, 2024, 11:57 AM

#

outer jasper can anyone help me with pytorch? torch.cuda.is_available() returns false even th...

Torch doesn't use the system's cuda, it uses a bundled one. How did you install torch?

outer jasper Jan 28, 2024, 11:57 AM

#

i really should switch to linux

outer jasper Jan 28, 2024, 11:58 AM

#

tidal bough Torch doesn't use the system's cuda, it uses a bundled one. How did you install ...

using pip

#

pip install

tidal bough Jan 28, 2024, 11:58 AM

#

If you just did pip install torch, that'd get you the CPU-only version.

final kiln Jan 28, 2024, 11:58 AM

#

Yeah Linux is a good choice since almost all production servers are Ubuntu

#

I think that's actually how I've been installing it tho

#

Gonna check

outer jasper Jan 28, 2024, 11:59 AM

#

i used this command .\Scripts\pip install torch==1.8.0+cu101 torchvision==0.9.0+cu101 torchaudio===0.8.0 -f https://download.pytorch.org/whl/torch_stable.html then it gave me error saying it does not exist then i used the one in the website pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

final kiln Jan 28, 2024, 12:00 PM

#

I just do pip install torch

outer jasper Jan 28, 2024, 12:00 PM

#

i am following a guide to install a Text to spech thingy

odd meteor Jan 28, 2024, 12:00 PM

#

river cape can anyone provide some references to federated learning and differential privac...

https://github.com/FedML-AI/FedML/blob/master/research/Awesome-Federated-Learning.md

GitHub

FedML/research/Awesome-Federated-Learning.md at master · FedML-AI/F...

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o...

outer jasper Jan 28, 2024, 12:01 PM

#

and every thing was going well unil i reached the pytorch steb

#

step

tidal bough Jan 28, 2024, 12:01 PM

#

final kiln I just do pip install torch

huh, and that lets you use the GPU? that's weird; the builds on pypi aren't even the right size (the ones with bundled CUDA are way bigger).

outer jasper Jan 28, 2024, 12:02 PM

#

for some reason they say (the guide) you need an older version of pytorch

past meteor Jan 28, 2024, 12:02 PM

#

final kiln reduced the model size by half and adapted the LR scheduler to have its intended...

You should really set up some of robust hyper parameter tuning regime

tidal bough Jan 28, 2024, 12:03 PM

#

outer jasper i used this command .\Scripts\pip install torch==1.8.0+cu101 torchvision==0.9.0+...

--index-url https://download.pytorch.org/whl/cu118 should be the right way, yeah. Maybe you didn't uninstall torch before that, and so pip didn't replace the CPU one with the GPU one?..

past meteor Jan 28, 2024, 12:03 PM

#

I did this as a student as well, fiddle with tons of hyperparameters for ages. It's not time efficient, best to set this all up, run it, sleep and check the results in the morning

tidal bough Jan 28, 2024, 12:03 PM

#

i used this command .\Scripts\pip install torch==1.8.0+cu101 torchvision==0.9.0+cu101 torchaudio===0.8.0 -f https://download.pytorch.org/whl/torch_stable.html then it gave me error saying it does not exist
torch 1.8.0 only supports up to python 3.9, probably that's why these versions didn't work

outer jasper Jan 28, 2024, 12:03 PM

#

oh

#

okay

#

why is every thing conflicting

#

very annoying for a beginner

final kiln Jan 28, 2024, 12:06 PM

#

tidal bough huh, and that lets you use the GPU? that's weird; the builds on pypi aren't even...

Uhm, might be because colab comes with a lot of pre installed stuff

final kiln Jan 28, 2024, 12:06 PM

#

past meteor I did this as a student as well, fiddle with tons of hyperparameters for ages. I...

Yeah so true, honestly it even gets to not being healthy

#

I got a bit of infra setup, I can build on top of it

#

I'm thinking of using the GitHub actions API to programmatically start several training loops

#

Instead of manually triggering them

past meteor Jan 28, 2024, 12:08 PM

#

Yeah, that's the way to go

#

Well, at least some variant of it

#

Before I do experiments at work nowadays I think about what I want to evaluate etc. and then build something ad hoc to automate training / hyperparams etc.

#

but it's probably better to use mlflow, tensorboard, optuna, ... for this

final kiln Jan 28, 2024, 12:10 PM

#

The infra I have saves me on GPU compute. I could setup on kaggle/colab, but the free tiers will just run out over night

#

I think kaggle connects to Google cloud but all our credit is on AWS

#

But yeah, lesson about knowing the goal is well learned here

#

Gonna run over the equations for the cross entropy in the context of my output data (which is kinda funky) and make a Fermi estimate of what I would consider a good result

#

Or just an actual estimate

final kiln Jan 28, 2024, 12:14 PM

#

past meteor I did this as a student as well, fiddle with tons of hyperparameters for ages. I...

But so, this basically means that I need to get used to having an overnight delay between setting up experiment and getting the results

past meteor Jan 28, 2024, 12:14 PM

#

final kiln But so, this basically means that I need to get used to having an overnight dela...

yes and no

#

Hmmm

#

I can only speak of my personal experiences but I usually think "okay this is my task" and then I draw out a schematic about how I'll try and evaluate what I want to do, what types of models, what types of metrics and I code this all up.

#

Then I might run experiment A manually a few times to see if it runs and try and make sense of the initial results, afterwards I run the experiment pipeline.

As it's running I prepare experiment B and repeat.

final kiln Jan 28, 2024, 12:17 PM

#

Yeah that sounds reasonable

#

There's quite a lot of stuff tho. So there's dropout, L1 and L2 regularization, the various LR schedules that themselves may have several parameters, and then there's potentially 3 models to compare across various sizes with quite a bit of parameters themselves

#

And this is just the first task, I'd wanna do this for at least 3 or 4, which I think is what they did in the MetaFormer study

#

And I haven't even gotten to the data

past meteor Jan 28, 2024, 12:23 PM

#

Yeah, hence why you should parametize it and use some optimizer.

It's a nasty problem. One of the first things they teach you in intermediate ML courses is you train models to solve a problem that is usually convex. On top of this you have a argmin_Loss wrt hypermaramters: Loss = F(hyperparameters) but this isn't a convex problem whatsoever.

crisp raptor Jan 28, 2024, 12:43 PM

#

I feel like this image seems so unprofessional in a paper on NLG

final kiln Jan 28, 2024, 1:36 PM

#

past meteor Yeah, hence why you should parametize it and use some optimizer. It's a nasty p...

I think this whole exercise is gonna be good for me, even if it does not come out as the perfect workflow at first.

#

I do have some ideas on how to make this hyper parameter stuff searchable with gradient descent. But I'm sure a lot of people have tried it b4 me

past meteor Jan 28, 2024, 1:37 PM

#

final kiln I think this whole exercise is gonna be good for me, even if it does not come ou...

No you're right, do it like this reflect on it and improve 👊

agile owl Jan 28, 2024, 2:00 PM

#

past meteor What is often said about time series is that ML models aren't made to predict th...

that's not really true with reinforcement learning. in a sense, a robot learning to stand up and walk is learning to predict the future

#

and you aren't predicting the future per se you are predicting the best action given the current state

molten elk Jan 28, 2024, 2:05 PM

#

Can anyone tell me how this is different np.random.binomial(1000, 0.7, 500) from np.random.binomial(1000, 0.7)? Is the size parameter different from the number of times?

agile owl Jan 28, 2024, 2:12 PM

#

#

I agree that could be clearer

#

the question with reinforcement learning is if the environment admits information about the reward

molten elk Jan 28, 2024, 2:34 PM

#

agile owl

I get it now it runs the binomial check in the number specified by size

agile owl Jan 28, 2024, 2:35 PM

#

the number of trials is a distribution parameter but the number of samples is not

mint palm Jan 28, 2024, 2:50 PM

#

#

I know what conditional probability is, but man it will be great if anyone of you could please help me interpret the equations

final kiln Jan 28, 2024, 4:00 PM

#

looking at cross entropy loss is not a good way to do it, I care more about the percentage of correct guesses

#

random chance is on the order of 1e-5

desert oar Jan 28, 2024, 4:46 PM

#

final kiln looking at cross entropy loss is not a good way to do it, I care more about the ...

cross entropy loss is basically telling you whether the probability distribution of guesses fits the data. i wouldn't write it off so quickly

desert oar Jan 28, 2024, 4:46 PM

#

mint palm I know what conditional probability is, but man it will be great if anyone of yo...

what about it specifically?

final kiln Jan 28, 2024, 4:52 PM

#

desert oar cross entropy loss is basically telling you whether the probability distribution...

In this case it's not the full picture. I should even be separating the loss into two components. One for replicating the input the other to predict the next token, which is the classification

#

All this time I thought I needed to reduce model size, now I see that I get better results by increasing it

#

#

see how val actually increased, but the percentage of correct guesses got better

river cape Jan 28, 2024, 5:02 PM

#

odd meteor https://github.com/FedML-AI/FedML/blob/master/research/Awesome-Federated-Learnin...

Anonymise user identities in large databases to ethically employ machine learning in understanding customer trends and behaviour without violating their rights.

Any idea on how to go forward for this problem statement?

odd meteor Jan 28, 2024, 5:26 PM

#

river cape Anonymise user identities in large databases to ethically employ machine learnin...

I suggested using Federated Learning + Differential Privacy earlier. Have you looked in them?

I also shared a github repo on research paper implementation of Federated Learning.

Try checking that as well.

If you however don't fancy the idea of reading a research paper and using the code implementation of that paper to learn a new topic, then I'll suggest taking your time to go through this nice detailed blog and tutorial from Flower.

https://flower.dev/docs/framework/tutorial-series-what-is-federated-learning.html

odd meteor Jan 28, 2024, 5:34 PM

#

river cape Anonymise user identities in large databases to ethically employ machine learnin...

It's not compulsory you combine Federated Learning & Differential Privacy though.

Both concept addresses privacy concerns in the context of data-driven tasks using different approach.

You can just implement either Differential Privacy or Federated Learning and you'll still be on point as well.

dark lichen Jan 28, 2024, 5:36 PM

#

anyone here good in advanced math?

final kiln Jan 28, 2024, 5:39 PM

#

Honestly at this point I'll be happy with an overfit 💀

#

Ok training set is getting to 50% correct guesses

#

So I think I'm getting somewhere

odd meteor Jan 28, 2024, 5:58 PM

#

river cape Anonymise user identities in large databases to ethically employ machine learnin...

Is this a school project? If yes, then I think, maybe using the approach I suggested earlier might be an overkill.

A more straightforward concept is really just annonymizing information about the identity of people captured in your dataset.

Annonymize their name, address, county, and any other sensity information about the person in the data.

final kiln Jan 28, 2024, 5:58 PM

#

Omg validation loss just started converging out of nowhere around 75% correct guesses on the train set

#

It was at like 12 randomly jumping around, now it's under 1 and dropping

#

Honestly I gotta just let it cook

river cape Jan 28, 2024, 6:03 PM

#

odd meteor I suggested using Federated Learning + Differential Privacy earlier. Have you lo...

I did look into them especially the recommendation systems , and this is for a hackathon so we need to implement a technique which is suitable for the above problem statement.

So what I initially thought is we have a large dataset and I would just search on Google for the entities which are Personally Identifiable . Check the similarity of those against the columns of the dataset. And i am stuck here as to what to do

umbral charm Jan 28, 2024, 6:03 PM

#

when would one use matplotlib compared to plotly

#

i dont know if i should stick to matplotlib or learn plotly

odd meteor Jan 28, 2024, 6:04 PM

#

umbral charm when would one use matplotlib compared to plotly

If you prefer better aesthetics and interactivity of plot

final kiln Jan 28, 2024, 6:05 PM

#

my notebook tab just crashed, : D........

odd meteor Jan 28, 2024, 6:07 PM

#

umbral charm i dont know if i should stick to matplotlib or learn plotly

I think almost everyone started with Matplotlib and Seaborn before learning either Plotly, Bokeh, Cufflinks for creating interactive plot.

odd meteor Jan 28, 2024, 6:08 PM

#

final kiln my notebook tab just crashed, : D........

Are you running the experiment in your local machine or cloud?

umbral charm Jan 28, 2024, 6:08 PM

#

odd meteor I think almost everyone started with Matplotlib and Seaborn before learning eith...

Yes this was the case for me too

#

I was just wondering since i feel fluent enough with matplotlib to learn somethign more complicated

#

But there would be no point learning it if its worse / inefficient

final kiln Jan 28, 2024, 6:10 PM

#

odd meteor Are you running the experiment in your local machine or cloud?

It was kaggle

#

I'm gonna give it a rest, tomorrow I'll implement the cloud infra stuff so I don't have to babysit notebooks

odd meteor Jan 28, 2024, 6:12 PM

#

river cape I did look into them especially the recommendation systems , and this is for a h...

Hackathon hmmm...

If I were in your team, I'd be using differential privacy or Federated learning lol at least to have a shot at winning the money.

Are winners gonna be announced based on the concept used or solely based on model performance on a specific evaluation metric?

river cape Jan 28, 2024, 6:12 PM

#

odd meteor Hackathon hmmm... If I were in your team, I'd be using differential privacy or ...

Its mostly the concept and accuracy plus I just need a roadmap of how I can do it inorder to submit an abstract ? Because this is my first time diving into differential privacy

odd meteor Jan 28, 2024, 6:15 PM

#

final kiln It was kaggle

Oh I see. Having to randomly move your mouse every 30 mins in order to keep the notebook active 😂😂😂

Well, I still prefer Kaggle to free tier of Colab

final kiln Jan 28, 2024, 6:17 PM

#

odd meteor Oh I see. Having to randomly move your mouse every 30 mins in order to keep the ...

Colab ran out yesterday for me. I think I got the right idea of what I want to do so I'll just have spot instances training with diff hyper parameters

#

But damn, this really shouldn't be so hard, it's just a classification problem

odd meteor Jan 28, 2024, 6:22 PM

#

river cape Its mostly the concept and accuracy plus I just need a roadmap of how I can do i...

I'm afraid I might not be of much help at this time; since I've not worked on any project where I had to implement differential privacy yet.

However, I'm sure there are more knowledgeable people here with much experience in Differential Privacy who can be of help.

If I were you, I'd go down the rabbit hole of checking research paper with code implementation on this same topic or even learning from YouTube or something. ( Devot 1 or 2 days and you'll have a lot to write in the abstract you're expected to submit)

Hopefully, your team wins this Hackathon. All the best 👍

odd meteor Jan 28, 2024, 6:31 PM

#

umbral charm I was just wondering since i feel fluent enough with matplotlib to learn somethi...

I don't think Plotly is complicated though. Since you're confident enough with Matplotlib, you shouldn't struggle with Plotly.

The syntax of Plotly is somewhat similar to Seaborn

fleet hemlock Jan 28, 2024, 6:46 PM

#

Hi can you tell me what can i do to improve in python and what are the projects can I do

still coyote Jan 28, 2024, 6:55 PM

#

fleet hemlock Hi can you tell me what can i do to improve in python and what are the projects ...

!kindling has a list of projects you can do

arctic wedgeBOT Jan 28, 2024, 6:55 PM

#

Kindling Projects

The Kindling projects page on Ned Batchelder's website contains a list of projects and ideas programmers can tackle to build their skills and knowledge.

fleet hemlock Jan 28, 2024, 6:56 PM

#

Thank you

mint palm Jan 28, 2024, 8:31 PM

#

desert oar what about it specifically?

cat, N,
I know conditioal prob, but what the heck are these chained "="

final kiln Jan 28, 2024, 8:42 PM

#

mint palm cat, N, I know conditioal prob, but what the heck are these chained "="

You mean ~ ?

#

That's the notation for saying that a given random var follows a given dist

mint palm Jan 28, 2024, 8:42 PM

#

no the conditional prob equation

final kiln Jan 28, 2024, 8:43 PM

#

What about it ? The || ?

mint palm Jan 28, 2024, 8:43 PM

#

#

i know one |

#

not ||

final kiln Jan 28, 2024, 8:44 PM

#

Yeah that's a good question

mint palm Jan 28, 2024, 8:44 PM

#

also the double arrow?

final kiln Jan 28, 2024, 8:45 PM

#

Seems to only happen within KL

#

So it's probably defined somewhere back

mint palm Jan 28, 2024, 8:45 PM

#

This paper is nutz.
5 %contribution 95 % flassy equation

final kiln Jan 28, 2024, 8:46 PM

#

Double arrow is not any standard notation I am aware of

mint palm Jan 28, 2024, 8:46 PM

#

this is start of methodology

#

let me share paper

final kiln Jan 28, 2024, 8:46 PM

#

Anything that's not standard they have to define it

mint palm Jan 28, 2024, 8:46 PM

#

https://proceedings.neurips.cc/paper/2020/file/6740526b78c0b230e41ae61d8ca07cf5-Paper.pdf

final kiln Jan 28, 2024, 8:46 PM

#

And its preferable not to use anything that's not standard

mint palm Jan 28, 2024, 8:47 PM

#

no supplementry nothing

final kiln Jan 28, 2024, 8:48 PM

#

Yeah that's kinda wild, I wonder if it becomes standard notation around some specialized niche

#

Maybe follow the closest related citation

#

See if they define it there

mint palm Jan 28, 2024, 8:49 PM

#

Its like those fancy restaurants

final kiln Jan 28, 2024, 8:50 PM

#

Check 37

#

Bet it's gonna be there

mint palm Jan 28, 2024, 8:51 PM

#

yeah i was checking that one only

#

found it

#

but its a bigger night mare

#

thanks for the suggestion

final kiln Jan 28, 2024, 8:52 PM

#

It's defined back there, I'm reading it rn

#

mint palm Jan 28, 2024, 8:53 PM

#

yup integration of kl loss from one of the term

final kiln Jan 28, 2024, 8:53 PM

#

The double arrow thing, maybe it relates to the concept of clustering somehow idk

mint palm Jan 28, 2024, 8:54 PM

#

i will try to find.

final kiln Jan 28, 2024, 8:56 PM

#

Is the pytorch documentation an open source thing ? I really wanna contribute to it if it is

#

I see, they are generated from the docstrings

#

I wonder if they're open to mods on this stuff, I can make them easier to understand

desert oar Jan 28, 2024, 10:10 PM

#

mint palm https://proceedings.neurips.cc/paper/2020/file/6740526b78c0b230e41ae61d8ca07cf5-...

the || i think is just part of their KL notation, it might signify something else but i would just read it as part of the KL divergence operator/function

#

"Cat" is the categorical distribution, Bernoulli with >2 categories or multinomial with n=1 trial

#

that double arrow notation is new to me, it might be defined in reference 37

#

i agree that it appears to denote some kind of clustering structure, but i can only guess as to what it means

abstract wasp Jan 28, 2024, 11:10 PM

#

Hi, I’m trying to make a model that converts bullet points into full sentences. I’m not sure how to structure my dataset. I currently just have a .txt file with something like:
Input:

finished data collection
started cleaning data
Output:
I finished the data collection and I started cleaning the data.
Is this good enough? I’ve seen some people use json format for this, what’s best?

final kiln Jan 28, 2024, 11:14 PM

#

Uhm, I'd just pickle a python object, or put it into a parquet file or an SQLite file

serene scaffold Jan 28, 2024, 11:20 PM

#

abstract wasp Hi, I’m trying to make a model that converts bullet points into full sentences. ...

What matters most from an ML perspective is that the outputted sentences are useful. There is no "best" output format. It's just a matter of how you plan to use them downstream.

abstract wasp Jan 28, 2024, 11:23 PM

#

serene scaffold What matters most from an ML perspective is that the outputted sentences are use...

Ok, thank you!

serene scaffold Jan 28, 2024, 11:24 PM

#

abstract wasp Hi, I’m trying to make a model that converts bullet points into full sentences. ...

What kind of model is it btw?

#

Presumably there are examples that are more intricate than "I ... And I ... And I ..."

abstract wasp Jan 28, 2024, 11:30 PM

#

serene scaffold What kind of model is it btw?

I’m thinking of a sequence to sequence model.
So I’m in this research hub and we have to send weekly emails about our progress through that week. I forget to send the emails most of the time 🤡😭 I usually type down my hours and what I completed—I’m making this so that it can take the notes I have, convert it into full sentences and have the email be sent out automatically 🤡😂

abstract wasp Jan 28, 2024, 11:30 PM

#

serene scaffold Presumably there are examples that are more intricate than "I ... And I ... And ...

True, but I’m making this model just for this use lmao

desert oar Jan 28, 2024, 11:43 PM

#

final kiln Uhm, I'd just pickle a python object, or put it into a parquet file or an SQLite...

I definitely would not use pickle if I could avoid it

agile owl Jan 29, 2024, 12:34 AM

#

~ means distributed as

#

like x ~ N means x is normally distributed

#

you know the big fancy N

dense yarrow Jan 29, 2024, 1:55 AM

#

does anyone have a pdf of this book?Pandas for Everyone: Python Data Analysis, 2nd Edition by daniel chen

#

i'd reeally appreciate it

#

i made the wrong orielly account and cannot access it but i have hw due

#

in a few hours

#

nvm i got it now

magic dune Jan 29, 2024, 2:59 AM

#

@serene scaffold can I ask you a question about markov chains? (I saw you were a Computational Linguist). I know it can create sudo realstic senetences but sometimes they don't flow very well. Are there any solutions for this type of markov chain problem or is that just a known limitation. (Sorry to bother.)

serene scaffold Jan 29, 2024, 3:00 AM

#

magic dune <@253696366952316929> can I ask you a question about markov chains? (I saw you w...

that's a known limitation. generating text with markov chains is a primitive form of technologies like ChatGPT.

#

which obviously doesn't have that particular limitation.

magic dune Jan 29, 2024, 3:03 AM

#

serene scaffold that's a known limitation. generating text with markov chains is a primitive for...

I see but using Transfomers are more expensive because of training time right?

#

or am I wrong?

serene scaffold Jan 29, 2024, 3:06 AM

#

magic dune I see but using Transfomers are more expensive because of training time right?

if you can generate text with markov chains, all the data you have about which tokens can be chained together is, collectively, a language model.
I'm not aware of a non-arbitrary point at which a language model becomes a "large" language model. but creating transformer-based LLMs from scratch is quite computationally expensive, yes.

#

you might enjoy this reddit post I wrote when I created a markov chain language model for my homework

#

https://www.reddit.com/r/exmormon/comments/f0vvp0/i_used_the_book_of_mormon_to_train_my_random/

From the exmormon community on Reddit

Explore this post and more from the exmormon community

#

I therefore submit to all of you one of those most statistically probable passages to appear in the Book of Mormon.
Ha, I'm even funnier than I remember.

magic dune Jan 29, 2024, 3:08 AM

#

lol

#

I have been having a lot of fun with markov chain

#

And wondered if there is any fix. thanks for answering all my questions super informatively!

serene scaffold Jan 29, 2024, 3:10 AM

#

magic dune And wondered if there is any fix. thanks for answering all my questions super in...

of course pepefedora

serene scaffold Jan 29, 2024, 3:13 AM

#

magic dune And wondered if there is any fix. thanks for answering all my questions super in...

if the "problem" is that your chains are producing sentences that seem to switch sentence structure partway (you can see a lot of examples of that in the reddit post), the only solution really is to increase the n for the ngrams

#

or decrease the temperature. I guess.

#

but both of these will just make the text more similar to passages from the training data.

magic dune Jan 29, 2024, 3:13 AM

#

ya

serene scaffold Jan 29, 2024, 3:13 AM

#

markov chains with ngrams aren't sophisticated enough to produce things that are "new"

magic dune Jan 29, 2024, 3:14 AM

#

serene scaffold markov chains with ngrams aren't sophisticated enough to produce things that are...

ya what are the best sources for this?

serene scaffold Jan 29, 2024, 3:15 AM

#

I dunno

magic dune Jan 29, 2024, 3:17 AM

#

serene scaffold I dunno

fair enough!

ionic umbra Jan 29, 2024, 5:09 AM

#

I'm trying to run the example code for d3graph from here: https://erdogant.github.io/d3graph/pages/html/Edge properties.html

... but whenever I execute, I just get a "File not found" error from Firefox, saying the temporary file it's trying to create doesn't exist.

"Firefox can’t find the file at /tmp/tmpog5luy4x/d3graph.html"

...any ideas what to do here?

final kiln Jan 29, 2024, 8:12 AM

#

This network

#

Will get trained whether it likes it or not 😈

final kiln Jan 29, 2024, 8:15 AM

#

desert oar I definitely would not use pickle if I could avoid it

Yeah just listing it out, I sometimes use pickle temporarily, but even that I should avoid tbh

#

I'm gonna try to design a generalized pipeline

#

Only one input, which will be in json form

#

I gotta brainstorm

final kiln Jan 29, 2024, 10:06 AM

#

here's the initial version of the game plan

#

this is similar to what I already have, the only difference is that I'm extending it so that train.py can be selected

#

there's possibly a much better way though

#

if I get an AWS AMI with self hosted runners pre-installed

#

I need to read up on it

#

https://github.com/machulav/ec2-github-runner#save-costs

#

https://github.com/machulav/ec2-github-runner#example

this is awesome if it actually works

final kiln Jan 29, 2024, 12:11 PM

#

looks almost done, but it' not stopping the instance

final kiln Jan 29, 2024, 12:33 PM

#

this gonna be epic

final kiln Jan 29, 2024, 1:11 PM

#

this would be amazing if it works out

#

and it worked out, this is great, im very happy rn

spiral whale Jan 29, 2024, 2:22 PM

#

hello, ive discovered LM studio, where u can download free open source models and run them locally. Is there a way to download them and import them on my own python script? with keras or tensorflow?

livid goblet Jan 29, 2024, 4:21 PM

#

Hi
Guys, which topic would you think is more interesting to work on for a Master Thesis ? I'm kinda on the fence about that
DeepStereoBrush – Depth Map Interpolation Using Deep Learning
Neural Networks Optimization for Edge/Mobile Computing (such as CLIP network, etc..)

barren fable Jan 29, 2024, 4:23 PM

#

Hi, I have two questions about dummy variables and feature selection in machine learning.

First, so I know that to avoid the dummy variable trap, I should drop a column from the dummy variables. So now I see some people who say that you don't need to drop a column from your dummy variable because Sklearn will do it automatically.
I read some articles that said, "You don't have to do this because the sci-kit learn library automatically removes one of the variables for you in the following code."

# X is the training dataset and we are using Sci-Kit Learn
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder
ct = ColumnTransformer(transformers = [('encoder', OneHotEncoder(), [3])], remainder = 'passthrough')
X = np.array(ct.fit_transform(X))
print(X)

Other people said that you need to do it manually, and when I searched and looked at the OneHotEncoder parameters on the Sklearn website, I found that
drop{‘first’, ‘if_binary’} or an array-like of shape (n_features,), default=None
so the default of the drop is None, not first.
So, does the SK Learn actually take care of the dummy variable trap and remove a column, or should I do it manually? im confused?

Second, I read an article that says, "Backward Elimination is irrelevant in Python, because the Scikit-Learn library automatically takes care of selecting the statistically significant features when training the model to make accurate predictions."
So again, should I do feature selection manually? or Sklearn takes care of it?

Thanks.

odd meteor Jan 29, 2024, 4:45 PM

#

livid goblet Hi Guys, which topic would you think is more interesting to work on for a Master...

Tbh I think it's only you that can decide this, because what's interesting to me might be boring to you, and vice versa.

Pick what you really have interest in and perhaps, narrow it down to availability of nice / approachable professors who are specialised in that same field in your school.

fading compass Jan 29, 2024, 5:03 PM

#

Hello, I have to generate a 'random on a grid' grid, every mesh has to be occupied by a point, can someone help ?
Here is my code in the case of a regular grid :

def generate_gridded(dim,nb_pts):
    dx,dy=dim
    x=np.linspace(0,dx,int(np.sqrt(nb_pts)))
    y=np.linspace(0,dy,int(np.sqrt(nb_pts)))
    X,Y=np.meshgrid(x,y)
    X1,Y1=np.meshgrid(x,y)
    return X.flatten(),Y.flatten()

Thank you

mint palm Jan 29, 2024, 5:30 PM

#

why some ROC curve life left, and some like right?

#

is right one having high discretisation of thresholds?

#

and due to rectangular interpolation, it looks choppy?

final kiln Jan 29, 2024, 6:07 PM

#

one job per hyperparameter set

final kiln Jan 29, 2024, 6:12 PM

#

final kiln one job per hyperparameter set

@past meteor what do you think ? I literally just setup my set of hyper parameters as a matrix in the GitHub workflow and it runs them sequentially (or concurrently if possible)

#

It doesn't really handle fault tolerance though, if AWS takes away the spot instance it just kinda fails

#

Still working on how to handle it

#

I think I can possibly check for unfinished jobs and try to schedule them, but idk yet

past meteor Jan 29, 2024, 6:31 PM

#

final kiln <@260493929047130113> what do you think ? I literally just setup my set of hyper...

definite improvement! Good job. I admire your dedication 👀

final kiln Jan 29, 2024, 6:33 PM

#

thanks!

#

here's how the workflow is looking iff you're curious

#

ideally I think I'm gonna do 1 workflow per experiment instead of having a bunch of input parameters

past meteor Jan 29, 2024, 6:34 PM

#

Do all of them fail as soon as the first fails or what?

final kiln Jan 29, 2024, 6:35 PM

#

no, I'm disabling that on fail-fast: false

#

but I worry about the case where the spot instance is removed by aws

#

I can make this sort of "idempotent"

#

all jobs run when I run the workflow, but they will first check if that set of hyper parameters were already trained

#

yeah that's how it's gonna go

#

and I can set it to dispatch a totally seperate workflow

#

that checks for failed jobs and restarts this one in case of it

river cape Jan 29, 2024, 7:06 PM

#

After making a model , what do we do?

serene scaffold Jan 29, 2024, 7:14 PM

#

river cape After making a model , what do we do?

kick back and relax
pour another coffee

river cape Jan 29, 2024, 7:21 PM

#

serene scaffold kick back and relax pour another coffee

No what I meant was how to use that model

serene scaffold Jan 29, 2024, 7:27 PM

#

river cape No what I meant was how to use that model

depends on the model
what model did you make?

river cape Jan 29, 2024, 7:28 PM

#

serene scaffold depends on the model what model did you make?

Okay how do you deploy and integrate the model

serene scaffold Jan 29, 2024, 7:28 PM

#

river cape Okay how do you deploy and integrate the model

same answer

river cape Jan 29, 2024, 7:29 PM

#

serene scaffold same answer

I didnt get you

serene scaffold Jan 29, 2024, 7:29 PM

#

river cape Okay how do you deploy and integrate the model

the way you deploy a model depends on the model

river cape Jan 29, 2024, 7:30 PM

#

Doesnt every model need to deployed on the cloud

serene scaffold Jan 29, 2024, 7:30 PM

#

no. but even if they did, the way that you deploy it depends on the model

river cape Jan 29, 2024, 7:31 PM

#

serene scaffold no. but even if they did, the way that you deploy it depends on the model

Could you give me an example?

serene scaffold Jan 29, 2024, 7:31 PM

#

river cape Could you give me an example?

one technique is to have the model in a docker container that has a REST API.

river cape Jan 29, 2024, 7:32 PM

#

serene scaffold one technique is to have the model in a docker container that has a REST API.

What are the other ways

serene scaffold Jan 29, 2024, 7:33 PM

#

river cape What are the other ways

there's probably lots more ways than even I know about. it really depends on lots of different factors.

#

this is like asking "how do people deploy software" without being any more specific.

river cape Jan 29, 2024, 7:35 PM

#

Hmmm My bad

river cape Jan 29, 2024, 7:35 PM

#

serene scaffold one technique is to have the model in a docker container that has a REST API.

Is REST a framework?

serene scaffold Jan 29, 2024, 7:35 PM

#

if you want to make the question more specific, I or someone else might be able to help

serene scaffold Jan 29, 2024, 7:35 PM

#

river cape Is REST a framework?

No, it's a style of API. you can make them with FastAPI, among other frameworks.

river cape Jan 29, 2024, 7:36 PM

#

serene scaffold No, it's a style of API. you can make them with FastAPI, among other frameworks.

WHat about flask and django?

serene scaffold Jan 29, 2024, 7:37 PM

#

river cape WHat about flask and django?

people who want to make REST APIs are switching from flask to FastAPI, from what I understand.

I wouldn't try to make a REST API with django.

river cape Jan 29, 2024, 7:38 PM

#

Ahhhhh sorry mate if I didnt comprehend my question properly , such a newbie trying to figure out things

serene scaffold Jan 29, 2024, 7:42 PM

#

river cape Ahhhhh sorry mate if I didnt comprehend my question properly , such a newbie try...

No problem. You are welcome to look through my early message history.

desert oar Jan 29, 2024, 8:06 PM

#

serene scaffold people who want to make REST APIs are switching from flask to FastAPI, from what...

django is still a great choice if you know you need a database and don't mind doing a little bit of DIY work when it comes to processing inputs and formatting outputs

#

however it think there is a library that makes doing JSON CRUD stuff easier with django

#

"django rest framework" i think it's called

past meteor Jan 29, 2024, 8:07 PM

#

or django ninja

desert oar Jan 29, 2024, 8:07 PM

#

that one is new to me. i haven't used django since 2017

past meteor Jan 29, 2024, 8:12 PM

#

I used django rest framework (DRF) for the backend of an app I never finished and it's soul crushing 🤣

primal agate Jan 29, 2024, 9:21 PM

#

Guys I have a question I dont know how to start with numbpy and pandas

#

Any ideas

left tartan Jan 29, 2024, 9:22 PM

#

primal agate Any ideas

Perhaps you should ask the question?

primal agate Jan 29, 2024, 9:23 PM

#

What I should learn at first

left tartan Jan 29, 2024, 9:24 PM

#

Kaggle.com/learn is a good start

primal agate Jan 29, 2024, 9:24 PM

#

Thank you so much

#

Thats what I needed

final kiln Jan 29, 2024, 9:46 PM

#

left tartan Kaggle.com/learn is a good start

Oooh that's super cool

primal agate Jan 29, 2024, 9:47 PM

#

You have been learning by this?

left tartan Jan 29, 2024, 9:51 PM

#

primal agate You have been learning by this?

Me? No, but I’ve been using this stuff for a while

#

It’s a nice intro because it doesn’t try to teach too much at once, I recommend it a lot

valid tree Jan 29, 2024, 11:37 PM

#

Hi folks, in pandas, why is this the case:

import pandas as pd

df = pd.read_csv("...")
df.groupby('country').size() # calling the size() function on the group

def size_filter(grp):
    return grp.size() > 2

df.groupby('country').filter(size_filter) # error on size is not callable

Also notably, when I do type(grp) from within that filter function, it's a Series. I'm just not sure what I'm operating on fundamentally in my filter function - why is it a Series? Which Series is it?

serene scaffold Jan 30, 2024, 12:03 AM

#

valid tree Hi folks, in pandas, why is this the case: ```python import pandas as pd df = ...

when you do df.groupby('country').size(), then df.groupby('country') is a DataFrameGroupBy object, and size is a method of DataFrameGroupBy.

When you call DataFrameGroupBy.filter, and you pass a function to that method, the function is applied to a DataFrame for each group.

#

not a DataFrameGroupBy

valid tree Jan 30, 2024, 12:06 AM

#

serene scaffold when you do `df.groupby('country').size()`, then `df.groupby('country')` is a Da...

First paragraph makes sense. But the function is being applied to a Series, not a DF, no? If that’s the type being printed

#

Thanks for getting back to me

serene scaffold Jan 30, 2024, 12:06 AM

#

valid tree First paragraph makes sense. But the function is being applied to a Series, not ...

can you create an entirely self-contained example, where df is defined in terms of (for example) a dict literal?

#

one way to do this is to do print(df.head().to_dict('list')) with the actual dataframe that you have.

#

but it needs to have enough unique values for the country column to be interesting.

valid tree Jan 30, 2024, 12:10 AM

#

yeah just double checked, I was calling type on the wrong thing - pitfall of notebooks. I see it's a DataFrame now

valid tree Jan 30, 2024, 12:11 AM

#

serene scaffold when you do `df.groupby('country').size()`, then `df.groupby('country')` is a Da...

Well, that certainly clears up a lot - thank you

serene scaffold Jan 30, 2024, 12:11 AM

#

valid tree yeah just double checked, I was calling type on the wrong thing - pitfall of not...

I appreciate that you recognize the pitfall of notebooks joe_salute

#

(I use notebooks for some things, for as much as I shit on them.)

valid tree Jan 30, 2024, 12:16 AM

#

serene scaffold (I use notebooks for some things, for as much as I shit on them.)

Useful for visual representations of things during exploration, but yeah, makes keeping track of things very difficult

lapis sequoia Jan 30, 2024, 1:16 AM

#

is there any better models than Black Scholes in terms of call option price?

serene scaffold Jan 30, 2024, 1:48 AM

#

lapis sequoia is there any better models than Black Scholes in terms of call option price?

For what? The people who typically answer questions in this channel are scientists. So we don't think of models in terms of their monetization schemes.

lapis sequoia Jan 30, 2024, 1:49 AM

#

Ah ok mb

serene scaffold Jan 30, 2024, 1:50 AM

#

That model you mentioned. What does it do?

lapis sequoia Jan 30, 2024, 1:56 AM

#

So the model, which I've recently heard throughout my researches throughout youtube. The model: Black-scholes models is a pricing model used for the valuation of stock options. I would say it looks good to me as it refers to the volatility of price, risk free rate, etc.

#

However, I would like to know if there's any better model I could use or should I stick with learning this new model

left tartan Jan 30, 2024, 2:55 AM

#

lapis sequoia So the model, which I've recently heard throughout my researches throughout yout...

Black scholes is a very fundamental model and worth learning. A lot of things are based or derived from it.

#

It’s one of those; everyone uses it to some extent, even if there’s some flaws and weaknesses, it’s still pretty good.

#

The interesting thing about it is that implied volatility is derived from it (since you can observe the current price, you can solve for ivol)

lapis sequoia Jan 30, 2024, 3:00 AM

#

intresting, anyways I'll learn this new model and hopefully know how I can implement or apply it to my future projects

#

It sounded intresting to me, but I wanted to know if it's worth it. Now I know it's worth it, so thank you so much 🙏

barren iris Jan 30, 2024, 3:06 AM

#

does anyone know a good implementation of Kendall's W concordance coefficient?

eternal bridge Jan 30, 2024, 3:23 AM

#

anyone here used Quarto on VS? I need some assistance with running Quarto

teal lance Jan 30, 2024, 4:41 AM

#

So I was able to compare dxy , spy , vix but I want to use python to build a model using this because well the dxy acted as the open interest while vix declined sending spy up

#

Anybody want to help me also finish a good project it’s turning the cot report into the dmi indicator ? Basically taking the values of the dmi to spit a bias and trend confirmation by acting as a live cot report

teal lance Jan 30, 2024, 4:46 AM

#

teal lance Anybody want to help me also finish a good project it’s turning the cot report i...

Because both have 3 variables

tacit basin Jan 30, 2024, 5:06 AM

#

valid tree yeah just double checked, I was calling type on the wrong thing - pitfall of not...

How is pitfall of a notebook if you can explain that would be great

tacit basin Jan 30, 2024, 5:07 AM

#

valid tree Useful for visual representations of things during exploration, but yeah, makes ...

Restart and re-run all helps a lot

teal lance Jan 30, 2024, 6:31 AM

#

valid tree Jan 30, 2024, 7:44 AM

#

tacit basin How is pitfall of a notebook if you can explain that would be great

It’s user error still, but easier to make mistakes when you might have modified a variable later down in the notebook and you run cells further up

primal agate Jan 30, 2024, 8:33 AM

#

teal lance

Nice bro

teal lance Jan 30, 2024, 8:35 AM

#

primal agate Nice bro

Thank you brother 🤜🏾🤛🏾✅

#

Do you guys think a dmi is able to be used in python to spit a bias out based on the values since they are numerical ? Using the values makes it easier to configure true or false faster and more reliable with out too many outliers tbh

#

hard to take a thinkscript and turn it into python

#

Starting to enjoy it I’m taking my 3 1/2 market experience and just trying to replicate my manual trading

teal lance Jan 30, 2024, 8:52 AM

#

teal lance Starting to enjoy it I’m taking my 3 1/2 market experience and just trying to re...

tacit basin Jan 30, 2024, 9:13 AM

#

valid tree It’s user error still, but easier to make mistakes when you might have modified ...

yes true, restarting ane running all helps if possible, sometimes some cells run a long time, so that may not be best option, but for most cases a good option i think

valid tree Jan 30, 2024, 9:35 AM

#

tacit basin yes true, restarting ane running all helps if possible, sometimes some cells run...

Of course yeah, at the end of the day it’s my mistake

tacit basin Jan 30, 2024, 10:01 AM

#

valid tree Of course yeah, at the end of the day it’s my mistake

i mean the way notebooks work it's easier to do it than say using scripts as in scripts restart and run all is default mode, where in notebooks is only an option. i use notebooks daily and end up in mess often. i need to remind myself to keep notebooks code clean and tidy, but... lol

teal lance Jan 30, 2024, 10:04 AM

#

#

Loving this fetching 🥹🐍🐍🐍

teal lance Jan 30, 2024, 10:29 AM

#

#

final kiln Jan 30, 2024, 10:48 AM

#

almost there

midnight harbor Jan 30, 2024, 10:58 AM

#

can someone give me good advice for getting colab alternative that can run at background (colab pro seem to me kinda expensive what people say on reddit as its computing unit get finished)

so if any one here have good alternative of this cheap and fast please recommend by tagging me

tacit basin Jan 30, 2024, 10:59 AM

#

midnight harbor can someone give me good advice for getting colab alternative that can run at ba...

how long do you need a session to last?

final kiln Jan 30, 2024, 11:02 AM

#

midnight harbor can someone give me good advice for getting colab alternative that can run at ba...

That's exactly what I'm coding rn

#

I'm running ec2 spot instances using GitHub actions

#

I'm getting 10-30% of the original price

midnight harbor Jan 30, 2024, 11:03 AM

#

tacit basin how long do you need a session to last?

in googl colab t4 gpu free plan taking 11 hours to complete but as its free plan it's session get expired

well also i m experimenting but still 11 hours i think would be the one (1x t4 gpu of google)
can run multi gpu if available

tacit basin Jan 30, 2024, 11:05 AM

#

midnight harbor in googl colab t4 gpu free plan taking 11 hours to complete but as its free plan...

i was thinking free paperspace but it's 6hrs session only, for paid ones will be longer, maybe cheaper than colab, you can check their pricing https://www.paperspace.com/pricing

Pricing | Paperspace

Paperspace offers a wide selection of low-cost GPU and CPU instances as well as affordable storage options. Browse pricing.

midnight harbor Jan 30, 2024, 11:05 AM

#

also i m very confuse in what this computing units of colab, like hwo they work

midnight harbor Jan 30, 2024, 11:06 AM

#

tacit basin i was thinking free paperspace but it's 6hrs session only, for paid ones will be...

i rememeber this

#

i once created accoutn on this XD

tacit basin Jan 30, 2024, 11:06 AM

#

each gpu on colab will have different compute units cost, like A100 will be most expensive, t4 cheapest i think

midnight harbor Jan 30, 2024, 11:07 AM

#

many people are saying on reddit like their computing unit end in a day like if units get finished will they get back next day or just the end

midnight harbor Jan 30, 2024, 11:11 AM

#

final kiln I'm running ec2 spot instances using GitHub actions

ec2 from amazon?
and does this has gpu or multiple cpus?

final kiln Jan 30, 2024, 11:11 AM

#

midnight harbor ec2 from amazon? and does this has gpu or multiple cpus?

Has everything you need, it just takes a fair bit of work to setup everything

#

Inputted the wrong parameter for the 1000th time

#

I'm so distracted rn

#

I'm even gonna train this on CPU tbh, what's the problem in it lasting all night if it will only cost me like one dollar

#

Better than having it run in 3h and costing triple

buoyant vine Jan 30, 2024, 11:21 AM

#

think The relative scale of that should not line up

#

typically, the cost of getting enough cpu cores to match the GPU (assuming the actual math that would be done on the GPU is the limiting factor) would result in the cpu cost version being much higher

jovial heath Jan 30, 2024, 11:34 AM

#

heello, i'm working on a CNN project using keras

final kiln Jan 30, 2024, 11:35 AM

#

buoyant vine <:think:882208337196892180> The relative scale of that should not line up

It does when there's a shortage of the cheaper stuff

jovial heath Jan 30, 2024, 11:35 AM

#

My result when I trained the model was val_accuracy 98.06
and my accuracy was 95.11

final kiln Jan 30, 2024, 11:35 AM

#

I keep getting my quota requests wrong too

#

So I'm always operating on a subset of what's available

jovial heath Jan 30, 2024, 11:36 AM

#

but my graph looks like this

#

#

i'm using a small dataset

buoyant vine Jan 30, 2024, 11:37 AM

#

what do your other metrics say? I.e. F1, Recall, Precision?

jovial heath Jan 30, 2024, 11:39 AM

#

i think i dont use

#

my model is like this

#

#

i'm using reduce on plateu too

#

reduce lr on plateau

#

I saw someone saying it's good to use

final kiln Jan 30, 2024, 11:41 AM

#

Try the reverse, starting with small LR, build up for a bit, then exponential decay

versed gulch Jan 30, 2024, 11:57 AM

#

If I have a 2D array of values [[1, 2, 3, 4, 5, 6], [2, 4, 5, 1 , 1, 2]], is there way I can get a minimum of each column such that the values are above 2?

primal agate Jan 30, 2024, 12:02 PM

#

teal lance

It looks so good

#

How much time did you spended

#

Spend*

#

Building this

amber cairn Jan 30, 2024, 12:30 PM

#

Hello all, slightly off topic but I wonder if there's anybody aware of any app out of there, which in a similar manner like Duolingo, can support training in small chunks on data science challenges.
The idea is to get help developing those skillsets without digging into projects of any sort of without managing little scripts creation without any support in confirming the correct implementation

final kiln Jan 30, 2024, 12:43 PM

#

Yeah Duolingo is pretty cool

#

Sometimes wish leetcode was like Duolingo

final kiln Jan 30, 2024, 1:07 PM

#

the experiment is literally gonna take 24h to run

#

waiting on GPU quota again

#

I think aws makes it hard on purpose so that people don't overuse the spot instances

tacit basin Jan 30, 2024, 1:51 PM

#

midnight harbor many people are saying on reddit like their computing unit end in a day like if ...

you get certain number of compute units depends on the subscription, there is colab pro and pro plus i think. the compute is for full months so if you use all in one day, which you can easily do with A100, then you don't have any till next month. you can pay more compute units additionaly each month, either in 100s or in 500s compute units, like 10 and 50 dollar.

midnight harbor Jan 30, 2024, 1:52 PM

#

Thanks @tacit basin

tacit basin Jan 30, 2024, 1:52 PM

#

yeah for A100 gpu you get like 20 something hours per month on most expensive plan pro plus

#

good thing about it it's that they are usually available, on some other clouds that's not the case

teal lance Jan 30, 2024, 3:13 PM

#

primal agate It looks so good

lol just woke up 🤣🤣🤣

primal agate Jan 30, 2024, 3:38 PM

#

It was nice for you

#

How much time did you spend

agile owl Jan 30, 2024, 3:46 PM

#

what is the best library for GANs right now

teal lance Jan 30, 2024, 4:28 PM

#

A good amount of time the next thing I need is to correlate the volume and compare assets to the vix for low volatility or high volatility

odd meteor Jan 30, 2024, 4:48 PM

#

agile owl what is the best library for GANs right now

Do you mean generative AI instead of GANs ?
Can you add more clarity or further details to your original question.

GANs is just like any other type of NN. It's like asking which library is best for RNN or CNN.

agile owl Jan 30, 2024, 4:51 PM

#

I don't see how that's not a valid question

#

there are libraries for RNNs and CNNs

#

but yeah I guess you could say generative AI

odd meteor Jan 30, 2024, 4:55 PM

#

final kiln Yeah Duolingo is pretty cool

I was using this app to learn Deutsch around 2020. It was all fun and nice till they updated the app and introduced gamified style of learning. I don't know if things has changed now

agile owl Jan 30, 2024, 4:56 PM

#

they have admitted that they are not an education company but an entertainment company

serene scaffold Jan 30, 2024, 5:02 PM

#

as a linguist, my professional opinion is duolingo bad

odd meteor Jan 30, 2024, 5:07 PM

#

agile owl I don't see how that's not a valid question

Of course, no question asked here is deemed invalid 😊
What I inferred from your original question was:

/Which library (framework) is best for GANs./

And by "library", if you were referring to Keras, PyTorch, TensorFlow etc... then, I don't think there's a specific framework that's better than the other in that regards.

agile owl Jan 30, 2024, 5:08 PM

#

my favorite foreign language is korean and they had the absolute worst korean lessons that's how I realized it was a scam

#

there's also libraries that have a bunch of networks already implemented like stable baselines 3

#

for use in particular contexts like reinforcement learning

final kiln Jan 30, 2024, 5:11 PM

#

serene scaffold as a linguist, my professional opinion is duolingo bad

Well I love it, specifically the gamification thing, it beats doing 0, which is my most likely situation if I don't use it

odd meteor Jan 30, 2024, 5:11 PM

#

agile owl they have admitted that they are not an education company but an entertainment c...

At least I like the weekly rank competition they have. I hope they've not removed that feature

final kiln Jan 30, 2024, 5:11 PM

#

After some time I'm gonna branch out, voice chat, see movies in German, etc

#

model keeps overfittign

#

what is it about sentiment analysis that makes it so easy to overfit

#

this stuff has been a huge success tho

#

I really just need for aws to for the lvoe of god give me access to that juicy spot gpu already

#

been playing with quotas for almost a month

#

okay, I think I'm going to look for a larger dataset and data augmentation techniques, I refuse to believe that the transformer can't perform this task

final kiln Jan 30, 2024, 5:54 PM

#

https://www.kaggle.com/datasets/kazanova/sentiment140

Sentiment140 dataset with 1.6 million tweets

Sentiment analysis with tweets

#

1.6M samples, I was working with 50k

buoyant vine Jan 30, 2024, 6:02 PM

#

final kiln I really just need for aws to for the lvoe of god give me access to that juicy s...

What instances are you trying to use?

#

We have almost never any issue getting on demand instances, I dont think you'll ever get them on spot though

final kiln Jan 30, 2024, 6:03 PM

#

buoyant vine What instances are you trying to use?

I'd be happy with a p2.xlarge

#

I'm trying spot

final kiln Jan 30, 2024, 6:04 PM

#

buoyant vine We have almost never any issue getting on demand instances, I dont think you'll ...

They do list it tho

buoyant vine Jan 30, 2024, 6:04 PM

#

yeah but most of the time they are never available enough

final kiln Jan 30, 2024, 6:05 PM

#

Imma cry

#

Just spent so much effort to get spot infra thing

buoyant vine Jan 30, 2024, 6:05 PM

#

Normally when we do training runs we have retries on our scripts to spawn instances because we need to check other availability zones for available instances on demand

final kiln Jan 30, 2024, 6:06 PM

#

Okay, I'm gonna search Google for a bit on this GPU shortage thing, see what I can come up with

buoyant vine Jan 30, 2024, 6:07 PM

#

Have you tried some of the TRN / non-cuda instances?

#

does your tooling support it?

final kiln Jan 30, 2024, 6:07 PM

#

It can boot up any ec2 on demand or spot instance

buoyant vine Jan 30, 2024, 6:07 PM

#

nah I mean like your ML lib

#

i.e. PyTorch, since they are what are interacting with the hardware doing the math

final kiln Jan 30, 2024, 6:08 PM

#

Oh, I don't know what you mean by TRN, I assumed it was some instance type

#

Is it like TPU type of thing

buoyant vine Jan 30, 2024, 6:08 PM

#

there is TRN1 which is AWS' tpu thing

final kiln Jan 30, 2024, 6:09 PM

#

I'm using pytorch, don't know if it supports TPU, but I assume it does

buoyant vine Jan 30, 2024, 6:09 PM

#

you might have a better time getting some spot instances on those perhaps

final kiln Jan 30, 2024, 6:09 PM

#

Oh that is clutch

buoyant vine Jan 30, 2024, 6:09 PM

#

and still have a decent speedup

final kiln Jan 30, 2024, 6:10 PM

#

buoyant vine there is TRN1 which is AWS' tpu thing

I'm gonna test my code on TPU using colab, thanks for the tip !

blissful hatch Jan 30, 2024, 7:10 PM

#

hey bro!

#

guess you're quite familiar with tensorflow

#

right?

final kiln Jan 30, 2024, 8:52 PM

#

I got a GPU spot instance

mint palm Jan 30, 2024, 9:21 PM

#

where can i learn low level working of LSTM

#

I know the states/gates and suff

#

but i wanna know how embedding and iterations run though

amber cairn Jan 30, 2024, 9:25 PM

#

serene scaffold as a linguist, my professional opinion is duolingo bad

What would you suggest otherwise, and linguistically speaking to learn a new language?

blissful hatch Jan 30, 2024, 9:26 PM

#

LSTM?

#

what does that mean?

serene scaffold Jan 30, 2024, 9:27 PM

#

amber cairn What would you suggest otherwise, and linguistically speaking to learn a new lan...

you need to understand structurally how the language differs from your native language. and you can build your vocabulary by writing sentences and speaking them aloud, and talking to native speakers.

serene scaffold Jan 30, 2024, 9:27 PM

#

blissful hatch LSTM?

long-short term memory

amber cairn Jan 30, 2024, 9:28 PM

#

serene scaffold you need to understand structurally how the language differs from your native la...

Clear, I was referring mostly to the process as using a supportive app. Do you have any preference, or anything better than Duolingo that you would recommend?

serene scaffold Jan 30, 2024, 9:30 PM

#

amber cairn Clear, I was referring mostly to the process as using a supportive app. Do you h...

I don't have anything in mind--sorry

amber cairn Jan 30, 2024, 9:32 PM

#

serene scaffold I don't have anything in mind--sorry

Thanks anyway

teal lance Jan 30, 2024, 9:45 PM

#

Idea to add on to my script

#

lavish swift Jan 30, 2024, 10:28 PM

#

Does anyone have suggestions for a course on AI and more specifically LLMs? I don't mean creating a model, but topics should include:

Running a model LOCALLY and not just sending data to OpenAI
RAG
How to differentiate and pick a model to implement in the chain
LangChain (or other relevant libraries)
Fine-tuning (maybe?)
Ideally the course would also have a community to ask further questions

I'm doing some of this now, but it mostly feels like guessing. So I'd like to fill in some of my many knowledge gaps.

final kiln Jan 30, 2024, 10:55 PM

#

This is so much work, y so much setup for this I don't get it

#

Y r people publishing 10GB sized images

#

What is life

#

I might've dropped the ball when defining the storage for the last AMI tho

#

Don't matter, might as well now over then now under

#

Storage is supposed to be cheap anyway

#

The instance I managed to catch is AMD based, which is way I'm still on this. Amazon Linux AMIs come with stuff for Nvidia

#

And it failed

#

AMI got corrupted for sure

#

Need to repeat from the original

#

Gonna give it a rest, is getting late

#

But it's a matter of time, tomorrow I'll finally have GPU

undone dust Jan 31, 2024, 1:09 AM

#

hey 2 questions, should I learn how to use pytorch or tensorflow and what's a good video to get me into it?

serene scaffold Jan 31, 2024, 1:27 AM

#

undone dust hey 2 questions, should I learn how to use pytorch or tensorflow and what's a g...

Most people prefer pytorch. But don't think of it as "learning pytorch". You're learning about neural network theory, and applying it with pytorch

undone dust Jan 31, 2024, 1:28 AM

#

serene scaffold Most people prefer pytorch. But don't think of it as "learning pytorch". You're ...

oh ok thanks and is there like a video a lot of people recommend or just start watch everything?

serene scaffold Jan 31, 2024, 1:29 AM

#

undone dust oh ok thanks and is there like a video a lot of people recommend or just start w...

Whatever you watch, keep in mind that you'll learn nothing from just passively watching. Take notes and apply everything.

merry ridge Jan 31, 2024, 2:19 AM

#

Is there a LaTeX or MathJax bot available to render math for this channel?

delicate apex Jan 31, 2024, 2:23 AM

#

.help latex

strange elbowBOT Jan 31, 2024, 2:23 AM

#

Command Help

**```
.latex <query>

*Renders the text in latex and sends the image.*

delicate apex Jan 31, 2024, 2:23 AM

#

helpful embed, but yes - there it is

#

you can experiment with it #sir-lancebot-playground if you like, as well, especially as the resulting images do not have delete or revision features if you have incorrect latex input

serene scaffold Jan 31, 2024, 2:33 AM

#

.latex \latex

#

What

delicate apex Jan 31, 2024, 2:34 AM

#

.latex \LaTeX

strange elbowBOT Jan 31, 2024, 2:34 AM

#

$latex.png$

serene scaffold Jan 31, 2024, 2:34 AM

#

Yay
Now I can be happy

merry ridge Jan 31, 2024, 2:56 AM

#

Thanks

#

So I decided to enroll in a machine learning course focusing on neural networks. I don't know if it's just me, but I thought I was very comfortable with multivariable calculus, and this notation is really killing me. For example they wrote that given a model for a neural network X with depth N, the model

#

.latex $Y^i = F^i(\mathbf{X}) = f( \sum_{i_N} w_{N j_N}^i f ( w_{{N-1}, j_{N-1} }^{j_N} \ldots f(w_{1,j_1}^{j_0} X_{j_0})))$

#

I'm just going to compile it on my side and paste it as an image I guess

#

#

With some loss function:

#

Clearly has that the derivative with respect to the weights depend only on the outer most nested function so that

#

So my main confusion is that I have no idea how people are able to chew through this much notational complexity and just conclude something about the form of the partial derivative so such a blasé manner. Do people just not really care about the fine details? It took me nearly an hour to carefully keep every subscript and subscribe in my head, understood what the equation was trying to do and then apply the chain rule.

#

This is aimed at an upper senior undergraduate level, so it's not exactly ML for babies I guess. But I was kind of expecting a little bit more hand holding with respect to the computation.

odd meteor Jan 31, 2024, 3:18 AM

#

lavish swift Does anyone have suggestions for a course on AI and more specifically LLMs? I d...

This is a good place to start

https://github.com/mlabonne/llm-course

GitHub

GitHub - mlabonne/llm-course: Course to get into Large Language Mod...

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. - GitHub - mlabonne/llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

odd meteor Jan 31, 2024, 3:21 AM

#

undone dust oh ok thanks and is there like a video a lot of people recommend or just start w...

https://youtu.be/Z_ikDlimN6A?si=K_F7SMP_rYdBGvw_

YouTube

Daniel Bourke

Learn PyTorch for deep learning in a day. Literally.

Welcome to the most beginner-friendly place on the internet to learn PyTorch for deep learning.

All code on GitHub - https://dbourke.link/pt-github
Ask a question - https://dbourke.link/pt-github-discussions
Read the course materials online - https://learnpytorch.io
Sign up for the full course on Zero to Mastery (20+ hours more video) - https:/...

▶ Play video

iron basalt Jan 31, 2024, 3:51 AM

#

merry ridge

My eyes.

#

Switch courses.

undone dust Jan 31, 2024, 3:58 AM

#

odd meteor https://youtu.be/Z_ikDlimN6A?si=K_F7SMP_rYdBGvw_

shout out

bold timber Jan 31, 2024, 4:03 AM

#

I have a question about Bidirectional RNN. How does Bidirectional RNN work when there's a sentence like "I am ___ hungry, and I can eat half a pig."? Can Bidirectional RNN be used to fill in the blank?

left tartan Jan 31, 2024, 4:03 AM

#

merry ridge This is aimed at an upper senior undergraduate level, so it's not exactly ML for...

As a CS grad student, I made the mistake of taking a stats class that was for both stats majors and CS majors. Terrible mistake: the CS students were left in the dust in the first week, never keeping up with the high speed notational complexity that the stats folks were comfortable with.

lavish swift Jan 31, 2024, 4:23 AM

#

odd meteor This is a good place to start https://github.com/mlabonne/llm-course

Thanks! I'll take a look! 👍

dense yarrow Jan 31, 2024, 5:02 AM

#

i've been really sad lately because i'm struggling in the math course (probability, linear algebra, discrete math, but mainly probability) in my data science program. i feel inadequate and i was wondering if you guys have any tips or advice on how to improve my understanding and skills in those subjects? are there any youtube videos or anything that can help me understand how different math problems apply to different tasks in data science projects? I think if i understand how i'm going to use them in a job setting, it will help me learn better

#

i think i always struggled a little with probability even when i took stats courses before

wooden sail Jan 31, 2024, 5:28 AM

#

merry ridge This is aimed at an upper senior undergraduate level, so it's not exactly ML for...

sorry to say that engineering maths are baby maths in undergrad :p

#

chain rule is the name of the game

agile owl Jan 31, 2024, 5:55 AM

#

UNLIMITED POWER

agile owl Jan 31, 2024, 5:56 AM

#

left tartan As a CS grad student, I made the mistake of taking a stats class that was for bo...

notational inside baseball is really exposed for the obfuscation it is when you compare papers to their code

merry ridge Jan 31, 2024, 8:35 AM

#

left tartan As a CS grad student, I made the mistake of taking a stats class that was for bo...

I have absolutely no problem with graduate level statistical notation so if this is comfortable for you in CS I’ll just have to get used to it.

It is particularly annoying that this course uses a subscript in some cases and a super script in other cases to denote the same thing such as the index of the current epoch. It is making it very difficult for me to be able to just ignore a symbol that isn’t of interest at the moment because those symbols appear in multiple inconsistent locations.

amber cairn Jan 31, 2024, 11:20 AM

#

Great day everybody.
I would love your opinion in understanding what could be the best approach in determining the impact in web site traffic changes given changes on the page.

I basically have historical data, and I know the point in time when changes occurred.

I don't have confidence the casual impact is the right direction, also because there's no other way to compare/confirm the impact.

What is your take/advice?

jovial heath Jan 31, 2024, 11:50 AM

#

Hello, I need to do work that checks whether the game "beat saber" is being played or not. To do this, I separated some images of him standing still or playing, but the still images are very similar, does this interfere with the model?

#

#

If they are not like this, the database becomes too small

#

mild dirge Jan 31, 2024, 12:34 PM

#

jovial heath Hello, I need to do work that checks whether the game "beat saber" is being play...

It will likely overfit if all images of one category are very similar

#

If they are all in the same position, the model could f.e. get very good guess if it checks just a single pixel in your data

#

Whereas you want it to learn that it is not playing when the position does not change

jovial heath Jan 31, 2024, 12:36 PM

#

Is it better then for me to take similar images even if the database gets smaller?

serene scaffold Jan 31, 2024, 1:59 PM

#

jovial heath Is it better then for me to take similar images even if the database gets smalle...

it's a dataset, not a database.
the size of a dataset is typically measured in the number of instances, not the size of each instance. you can make them smaller if they're still useful for what you're trying to do after that.

agile owl Jan 31, 2024, 3:25 PM

#

all my threads are doing their duty

agile owl Jan 31, 2024, 3:44 PM

#

PPO marches on in its inevitable but lengthy quest for convergence

agile owl Jan 31, 2024, 4:00 PM

#

..w-when will it bend

#

lovely

teal lance Jan 31, 2024, 4:19 PM

#

teal lance Jan 31, 2024, 4:20 PM

#

teal lance

lapis sequoia Jan 31, 2024, 4:27 PM

#

jovial heath Hello, I need to do work that checks whether the game "beat saber" is being play...

it does, you can reduce it by doing label smoothing

#

honestly it still learns

#

people trained models on random labels and they still learn stuff

#

you could also train model and see what images the loss is the biggest on after training and remove those

teal lance Jan 31, 2024, 4:48 PM

#

teal lance

Super proud of myself

final kiln Jan 31, 2024, 5:01 PM

#

agile owl all my threads are doing their duty

what are you doing to that poor computer, monte carlo sims ?

agile owl Jan 31, 2024, 5:09 PM

#

vectorized reinforcement learning envs

#

this computer is living up to its vocation

#

it was given 64 virtual threads to be used

#

I feel bad for all the computers that are never used to their potentials, doing nothing but opening chrome tabs and copying memory around for stupid youtube video browsing

final kiln Jan 31, 2024, 6:53 PM

#

with gpu and stuff

#

it was a lot of work because AMD has very bad ML support on AWS

#

in the end I found an available nvidia instance

#

so I didnt even manage to make the amd stuff work

#

their latest image is outdated, theres no aws ami for it, etc etc

merry briar Jan 31, 2024, 9:11 PM

#

when u get the sus-est error ever

agile owl Jan 31, 2024, 9:22 PM

#

ah that's satisfying

#

does anyone else enjoy learning curve charts

final kiln Jan 31, 2024, 9:30 PM

#

I feel like I'm doing what I was doing b4 but now at an industrial level

#

Can spawn hundreds of training loops in dozens of GPU machines

#

Only limited by AWS quotas

#

And money ofc, even with spot the burn rate can become large

primal agate Jan 31, 2024, 9:38 PM

#

agile owl ah that's satisfying

ME TOO

#

I love data science

teal lance Jan 31, 2024, 9:43 PM

#

teal lance Super proud of myself

#

agile owl Jan 31, 2024, 9:43 PM

#

nice. Iwas long TY today

teal lance Jan 31, 2024, 9:44 PM

#

agile owl nice. Iwas long TY today

TY ?

agile owl Jan 31, 2024, 9:49 PM

#

teal lance TY ?

ten year treasury future

final kiln Jan 31, 2024, 9:50 PM

#

And I can track my loops on the go

#

ML=infra, all else is EDA

#

That's the lesson I'm taking

agile owl Jan 31, 2024, 9:52 PM

#

how do I take a standard normal distribution and transform it into something that looks like a square root or log shape what function can i use

teal lance Jan 31, 2024, 9:52 PM

#

final kiln And I can track my loops on the go

Oh sorry yeah that’s nice 🔥 that’s what I’m working on building in my Script a model that takes dxy , 10y , vix to create the scenario for the market sentiment

final kiln Jan 31, 2024, 9:54 PM

#

teal lance Oh sorry yeah that’s nice 🔥 that’s what I’m working on building in my Script a ...

That's cool

agile owl Jan 31, 2024, 9:54 PM

#

chat gippity to the rescue

final kiln Jan 31, 2024, 9:54 PM

#

I'm very tired rn, that thing took me 2 days to make

agile owl Jan 31, 2024, 9:54 PM

#

we're gonna use boxcox transformation

#

from scipy.stats import boxcox

final kiln Jan 31, 2024, 9:54 PM

#

Didn't even lunch todah

final kiln Jan 31, 2024, 9:54 PM

#

agile owl how do I take a standard normal distribution and transform it into something tha...

That's an interesting question

agile owl Jan 31, 2024, 9:55 PM

#

it's called boxcox

#

lol

final kiln Jan 31, 2024, 9:55 PM

#

You want a transform on a gaussian that transforms it into a sqrt or log shape, I never encountered that problem

agile owl Jan 31, 2024, 9:55 PM

#

It's to make the reward function convex

#

so the agent is risk averse

final kiln Jan 31, 2024, 9:56 PM

#

It's actually easy tho

#

Think point wise

#

You're solving an equation at every point

#

Like you want

#

gauss * f = sqrt

#

f = sqrt / gauss

#

Something like that

agile owl Jan 31, 2024, 9:57 PM

#

yeah I see what you mean but I'd have to pick out points and do the math by hand

#

I'm not that smart

#

this is a great use for chat gpt

final kiln Jan 31, 2024, 9:57 PM

#

No it's literally just dividing the samples from one by the other

agile owl Jan 31, 2024, 9:57 PM

#

easily verifiable

final kiln Jan 31, 2024, 9:57 PM

#

As long as no zeros

agile owl Jan 31, 2024, 9:58 PM

#

the gaussian function is not easy to evaluate in my head

#

lol

final kiln Jan 31, 2024, 9:58 PM

#

I'd just use numpy

#

My brain is v slow rn, I have to sleep

agile owl Jan 31, 2024, 10:00 PM

#

return boxcox((self._get_return() - self.rate / 252) / self.return_volatility)

#

this is the reward function now

#

I expect a better mean variance ratio out of sample with this let's see what happens

candid spruce Jan 31, 2024, 10:02 PM

#

Hi would anyone be willing to teach me ML using python 😄

teal lance Jan 31, 2024, 10:07 PM

#

candid spruce Hi would anyone be willing to teach me ML using python 😄

You good in python already ?

candid spruce Jan 31, 2024, 10:07 PM

#

teal lance You good in python already ?

depends what part of python django no but I know most of the needed stuff

willow pelican Jan 31, 2024, 10:08 PM

#

If I want to go into data science, should I major in CS: ML or stats: data science

crisp raptor Jan 31, 2024, 10:09 PM

#

willow pelican If I want to go into data science, should I major in CS: ML or stats: data scien...

both

willow pelican Jan 31, 2024, 10:26 PM

#

But a double major would be painful

agile owl Jan 31, 2024, 10:27 PM

#

I'm actually using Yeo-Johnson with a lambda of -1 that's basically what I wanted

#

The-Box-Cox-left-and-Yeo-Johnson-right-transformations-for-several-parameters.png

#

I actually don't think it's trivial if it's named after someone tbh

#

will be interesting to see what happens with different values of lambda

primal agate Jan 31, 2024, 10:32 PM

#

I would like to start with ML but I am not enought good at maths yet

#

Only one problem

#

but I am preety enjoying data science

#

its kinda easy

#

and it gives you fun

#

imo

agile owl Jan 31, 2024, 11:06 PM

#

final kiln It's actually easy tho

#

ok bro lol

final kiln Jan 31, 2024, 11:14 PM

#

agile owl

Dude solve it numerically by sampling both functions, dividing one array by the other and cubic splite it

#

Easy

#

those equations are probly the result of the same procedure

#

But with the analytic expressions themselves instead of their samples

#

Which also does not look hard to do

final kiln Jan 31, 2024, 11:21 PM

#

agile owl I actually don't think it's trivial if it's named after someone tbh

I wouldn't say it's trivial, but it's also not hard imo

agile owl Jan 31, 2024, 11:27 PM

#

yea I skipped real analysis sue me

final kiln Jan 31, 2024, 11:30 PM

#

my real analysis was insane

#

the professor decided that he wanted to summarize the entire math field and teach it to 2nd year students

#

The memes were insane

#

Like dude was straight up teaching differential geometry

#

Which was only gonna be useful to the 20% of the class who would've eventually gone to MSc in physics

#

Sorry you triggered me by mentioning real analysis

#

._.

warm copper Jan 31, 2024, 11:39 PM

#

hi fellow data scientists

warm copper Jan 31, 2024, 11:40 PM

#

final kiln my real analysis was insane

hoi fish

final kiln Jan 31, 2024, 11:51 PM

#

I'm eating a snack cuz otherwise I can't sleep

agile owl Feb 1, 2024, 12:08 AM

#

my original idea was to just multiply everything less than 0 by 2 and everything greater than 0 by 0.5

#

which probably would have worked but I never tried it

agile owl Feb 1, 2024, 2:26 AM

#

when ur algorith, makes a scientific breakthru

#

(actually these jumps are just an artefact of convolution smoothing of the episode rewards, the negative outliers make those big depressions)

livid goblet Feb 1, 2024, 2:32 AM

#

odd meteor Tbh I think it's only you that can decide this, because what's interesting to me...

Thank you!!

willow pelican Feb 1, 2024, 4:00 AM

#

going into data science majors, is taking statistics in highschool more vauable than a CS class? I feel like I can easily learn python and other tools outside of school than learning stats on my own

serene scaffold Feb 1, 2024, 4:09 AM

#

willow pelican going into data science majors, is taking statistics in highschool more vauable ...

are you sure you'll actually be majoring in data science? or computer science?

limber mesa Feb 1, 2024, 4:12 AM

#

primal agate I would like to start with ML but I am not enought good at maths yet

See if you can find the start of 30 days of ML on Kaggle.
It starts “basic”ish

willow pelican Feb 1, 2024, 5:01 AM

#

serene scaffold are you sure you'll actually be majoring in data science? or computer science?

ok yeah I've just relized that, I don't have enough knowledge of the industry to understand what I'd like/dislike

#

so, I think I'm oging to play it safe with CS

#

then, If I find that I want to specialize on a certain thing, maybe I'll get a minor in it, or switch to it for my masters?

#

feel like thats the most logical way to go about it at least for now, I like looking at the whole picture, probably don't need too though

agile owl Feb 1, 2024, 6:00 AM

#

anyone have an example of using GANs to generate samples from correlated time series

#

if not I guess that's going to be my next project

final kiln Feb 1, 2024, 9:18 AM

#

Milan has 16gb gpu at 7 cents

#

on aws

#

#

I've been applying LR as a function of the epoch, should I be doing it as a function of the current batch ?

#

the model overfits no matter what I change, culprit is data for sure, tho I think freezing the tokenizer and positional encoding would help a lot

#

increasing the distance between output and the tokenizer seems to help a lot

#

which is not intuitive since the number of parameters grows, so it should overfit more easily

#

my working hypothesis is that making the model grow that way slows down the convergence by a bit, so the final values on the loss val end up being shifted

agile owl Feb 1, 2024, 9:28 AM

#

tfw you're not sure if you're gonna run out of RAM or not

final kiln Feb 1, 2024, 9:29 AM

#

final kiln my working hypothesis is that making the model grow that way slows down the conv...

so the actual graph that I need is a loss/val vs loss/train

#

so there's two paths here

find a larger dataset and use that + data aug
modify the training procedure so that positional encoding is determined analytically and tokenizer is pre-trained

I'm tempted to tinker with the model, but experience has taught me that data is king, there's like a good chance that changing the dataset to higher quality stuff will make loss val converge in 10 nano seconds to the planck scale ._.

peak patio Feb 1, 2024, 9:43 AM

#

Hello,
I have equations like these(32) that I need to solve:
i_6 + i_22 = i_3 + 83
i_12 =i_26 + i_7 - 114
i_16 =i_18 - i_5 + 51
i_30 - i_8 = i_29 - 77
i_20 - i_11 = i_3 - 76
..........................

I have tried to use sympy, but its been 12 hours and the program is still running, am I doing something wrong ?

from sympy import symbols, Eq, solve
symbols_list = ['i_'+str(i) for i in range(32)]
vars_list = symbols(symbols_list)

equations = [
    vars_list[29] - vars_list[5] + vars_list[3] - 70,
    vars_list[2] + vars_list[22] - vars_list[13] - 123,
    .....
    vars_list[1] + vars_list[21] - vars_list[11] - vars_list[18] - 43
]

solution = solve(equations, vars_list)

for var in vars_list:
    print(f"{var}: {solution[var]}")

final kiln Feb 1, 2024, 9:44 AM

#

peak patio Hello, I have equations like these(32) that I need to solve: i_6 + i_22 = i_3 + ...

yes

#

you're trying to solve it symbolically

#

the best approach is to translate the problem into a matrix equation

#

Ax = B

#

then use numpy or scipy to solve it

#

can even solve it by hand

peak patio Feb 1, 2024, 9:47 AM

#

thanks

peak patio Feb 1, 2024, 9:51 AM

#

final kiln Ax = B

Can numpy or somethign else do that for me ? translate equations like ax+b=c+d into ax=c+d-b ?

final kiln Feb 1, 2024, 9:51 AM

#

peak patio Can numpy or somethign else do that for me ? translate equations like ax+b=c+d i...

you can probly do that with sympy

#

but then the solving itself gotta be a numerical approach, there's just to many equations

wooden sail Feb 1, 2024, 9:59 AM

#

the best is doing that yourself on paper

#

you'd have to read the documentation of available solvers and then it's up to you to prepare the problem in a compatible way

peak patio Feb 1, 2024, 10:02 AM

#

wooden sail the best is doing that yourself on paper

💀

wooden sail Feb 1, 2024, 10:04 AM

#

sadly i'm not trolling you 😛 that's why people go learn this in uni

final kiln Feb 1, 2024, 10:04 AM

#

it's easier than it looks, after some practice it will be second nature

#

I reckon most people who studied this can transform it into matrix form right from the equations you wrote without modifying them

wooden sail Feb 1, 2024, 10:06 AM

#

you already have them in matrix form, just gotta move a few coefficients around

#

i really do suggest you grab a pencil and a piece of paper and write it down, it won't take you long

peak patio Feb 1, 2024, 10:06 AM

#

Okay then

#

thanks

empty willow Feb 1, 2024, 10:25 AM

#

Hey whats up guys

#

in the opinion of voices which language works well with python

lapis sequoia Feb 1, 2024, 10:48 AM

#

is it possible to train a model with sine function without using LSTM? from what i see it doesnt work beyond training dataset

final kiln Feb 1, 2024, 10:52 AM

#

lapis sequoia is it possible to train a model with `sine` function without using LSTM? from wh...

Try using Taylor features and a small learning rate

lapis sequoia Feb 1, 2024, 10:58 AM

#

its using single value input and output, how am i supposed to use taylor series

final kiln Feb 1, 2024, 11:00 AM

#

lapis sequoia its using single value input and output, how am i supposed to use taylor series

instead of feeding x, feed [x, x**2, x**3, etc]

#

you should also normalize it

#

y = x % 2pi something of the sorts

lapis sequoia Feb 1, 2024, 11:29 AM

#

lapis sequoia is it possible to train a model with `sine` function without using LSTM? from wh...

CNN will work

#

You will need to send like 100 last values as input

empty willow Feb 1, 2024, 11:53 AM

#

hmm

agile owl Feb 1, 2024, 12:54 PM

#

how do you expect it to learn sine at all just given a single datapoint

tender umbra Feb 1, 2024, 12:59 PM

#

How to host a low traffic deep learning model?
So i want to host a deep learning model, A10 seems good enough for my needs. I am not expecting a lot of traffic, so paying hourly for aws ec2 or ecs doesnt seem like the best way? Can anyone guide to alternatives that charge on per api call basis?

old radish Feb 1, 2024, 1:47 PM

#

um so guys i wanna develop an ai application to detect if the user is looking at his computer what tools should i use and how do i do it?

lapis sequoia Feb 1, 2024, 2:00 PM

#

agile owl how do you expect it to learn sine at all just given a single datapoint

if it can learn linear equations, why not sine

final kiln Feb 1, 2024, 2:19 PM

#

agile owl how do you expect it to learn sine at all just given a single datapoint

I think he means the network f is of the form y=f(x) where x is a single real number as opposed to a vector

true spade Feb 1, 2024, 2:38 PM

#

Hi there, just curious, how do y'all modularize/organize your code in Jupyter notebooks?

For context, I have recently been given 2 problems to solve using different types of machine learning models (i.e. classification and regression models) and I am having difficulty splitting the code into individual functions that can be placed in another Python file (so as to avoid having the Jupyter notebook become too cluttered with long sections of code).

golden hill Feb 1, 2024, 2:53 PM

#

Hi, guys. Can you tell my someone python libraries for beginner developer?

true spade Feb 1, 2024, 2:57 PM

#

golden hill Hi, guys. Can you tell my someone python libraries for beginner developer?

What do you plan on developing using Python? The libraries you might need to use depend on what exactly you are trying to develop

golden hill Feb 1, 2024, 2:58 PM

#

true spade What do you plan on developing using Python? The libraries you might need to use...

data science

true spade Feb 1, 2024, 2:58 PM

#

golden hill data science

I see, that is a category of projects that can be done in Python, are you trying to do some data analysis on a dataset? Or are you trying to do something else?

#

If you are doing data analysis, the following libraries might be useful to you:

numpy
pandas
matplotlib (Used for plotting charts and visualizing data)

#

However, this is just a general list of libraries as I am not sure what exactly in data science you are trying to do

golden hill Feb 1, 2024, 3:00 PM

#

I wanna make ai for sorting flowers

#

for example

#

dataset: roze(500img), sunflower(500img), chamomile(500img)

golden hill Feb 1, 2024, 3:03 PM

#

true spade If you are doing data analysis, the following libraries might be useful to you: ...

thx

#

have a nice day

#

when i will make this, i will tell you about this

true spade Feb 1, 2024, 3:05 PM

#

golden hill thx

No problem

true spade Feb 1, 2024, 3:05 PM

#

golden hill have a nice day

Thanks, same to you

true spade Feb 1, 2024, 3:05 PM

#

golden hill when i will make this, i will tell you about this

Sure

golden hill Feb 1, 2024, 3:05 PM

#

xd

true spade Feb 1, 2024, 3:06 PM

#

true spade Hi there, just curious, how do y'all modularize/organize your code in Jupyter no...

Just bumping this question in case anyone is able to respond to it

teal lance Feb 1, 2024, 3:12 PM

#

teal lance

Here’s how that went ❤️❤️

teal lance Feb 1, 2024, 3:12 PM

#

teal lance Here’s how that went ❤️❤️

lapis sequoia Feb 1, 2024, 3:14 PM

#

final kiln instead of feeding x, feed ` [x, x**2, x**3, etc]`

i tried your suggestion, it fails to train that way

final kiln Feb 1, 2024, 3:15 PM

#

lapis sequoia i tried your suggestion, it fails to train that way

Increase model complexity, add a couple more layers

#

Keep learning rate small, I've done this before and LR was the final bullet

lapis sequoia Feb 1, 2024, 3:18 PM

#

final kiln Keep learning rate small, I've done this before and LR was the final bullet

model = Sequential()
model.add(LSTM(10, input_shape=(n_feat, 1)))
model.add(Dense(10))
model.add(Dense(1))
model.compile(optimizer=Adam(learning_rate=0.001), loss="mse")```
used this, any suggestions?

final kiln Feb 1, 2024, 3:18 PM

#

Also note that the tailor series of sine doesn't have all orders

#

x, x3 and x5, and etc

tidal bough Feb 1, 2024, 3:19 PM

#

omitting even powers is cheating :p