#data-science-and-ml | Python | Page 57

raw compass Apr 14, 2023, 9:13 PM

#

So are they vectors and show the direction where the function increases, so if you take the opposite you can decrease the loss?

#

P.data += minus learning rate * p.grad

wooden sail Apr 14, 2023, 9:14 PM

#

yeah, that's the most basic form of "gradient descent"

raw compass Apr 14, 2023, 9:15 PM

#

wooden sail yeah, that's the most basic form of "gradient descent"

Is my point of view correct?

#

But still don't get how I can calculate the grad for every parameter, based on the chain rule

wooden sail Apr 14, 2023, 9:17 PM

#

same way as always

#

do you know how the chain rule works?

serene scaffold Apr 14, 2023, 9:17 PM

#

raw compass But still don't get how I can calculate the grad for every parameter, based on t...

it's multivariate 😄

raw compass Apr 14, 2023, 9:18 PM

#

wooden sail do you know how the chain rule works?

Second-year maths is not that impressive 😅

#

Just checked the Wikipedia page.

wooden sail Apr 14, 2023, 9:19 PM

#

all right. well, in many cases one doesn't actually need to use the chain rule explicitly, but it can be very helpful to formulate the computation of gradients very generally

#

in essence, a gradient vector is a vector whose elements are the derivatives of a function with respect to each of its parameters, treating one parameter as a variable at a time and treating the rest as constants

#

and then for each one of these derivatives, you apply the chain rule as needed

#

for now, i guess the most important thing for you is: you probably know how to take the derivative of something like f(x). but what about f(g(x))? the chain rule tells you how to do this. and more generally for f(g(h(...(x)...)))

#

and each of those f, g, h, etc is a layer in a network, if you wanna see it that way

raw compass Apr 14, 2023, 9:22 PM

#

so like this.

wooden sail Apr 14, 2023, 9:22 PM

#

tbh i think looking at it connected to code is going to do more harm than good

#

the way this is done in code is actually very different from how it is done conceptually

raw compass Apr 14, 2023, 9:26 PM

#

wooden sail tbh i think looking at it connected to code is going to do more harm than good

so actually if I take the derivative of a function, I'm gonna get its grad?

wooden sail Apr 14, 2023, 9:27 PM

#

the derivative with respect to each parameter

golden granite Apr 14, 2023, 9:27 PM

#

@mild salmon Nice code

wooden sail Apr 14, 2023, 9:27 PM

#

the gradient vector is one generalization of the derivative to the multivariate case

raw compass Apr 14, 2023, 9:28 PM

#

derivate is just a slope which is represent the different between 2 points? or am I totally wrong?

wooden sail Apr 14, 2023, 9:30 PM

#

it is a slope, but it's not the difference between two points

#

not if the function is not a straight line 😛

raw compass Apr 14, 2023, 9:30 PM

#

wooden sail in essence, a gradient vector is a vector whose elements are the derivatives of ...

(btw) could you recommend me any sources, where I can learn about this more efficiently? if you don't mind.

wooden sail Apr 14, 2023, 9:31 PM

#

hmm idk which calculus books are recommended these days

#

maybe stewart's calculus

raw compass Apr 14, 2023, 9:32 PM

#

wooden sail maybe stewart's calculus

these ones?

wooden sail Apr 14, 2023, 9:32 PM

#

yeah

#

ooh gilbert strang has a calculus book

#

i like strang. check this out https://ocw.mit.edu/ans7870/resources/Strang/Edited/Calculus/Calculus.pdf part of MIT's OCW, so it's free

#

chapter 13 has partial derivatives and gradients

#

(you have to learn differential and integral calc in 1 variable before getting to multivariable)

raw compass Apr 14, 2023, 9:35 PM

#

wooden sail i like strang. check this out https://ocw.mit.edu/ans7870/resources/Strang/Edite...

okay, got it. 👍

#

thank you

lapis sequoia Apr 14, 2023, 10:45 PM

#

I would like to read a course by myself in my own pace in data science (free of charge), is there any you guys can recommend to me?

soft badge Apr 14, 2023, 10:49 PM

#

guys what is better website or course to learn machine learning

#

??

raw compass Apr 14, 2023, 10:50 PM

#

soft badge guys what is better website or course to learn machine learning

there is no best course or website, just start with something small.

soft badge Apr 14, 2023, 10:51 PM

#

like programming?

raw compass Apr 14, 2023, 10:51 PM

#

soft badge like programming?

wdym?

#

I mean yeah, you def should have very good python skills.

soft badge Apr 14, 2023, 10:52 PM

#

yeah

#

do you are in this area how many time?

raw compass Apr 14, 2023, 10:53 PM

#

and write things from scratch, after that you can go with pytorch, or tensorflow.

raw compass Apr 14, 2023, 10:53 PM

#

soft badge do you are in this area how many time?

like python?

soft badge Apr 14, 2023, 10:53 PM

#

raw compass like python?

in ML

raw compass Apr 14, 2023, 10:53 PM

#

soft badge in ML

im a beginner.

soft badge Apr 14, 2023, 10:53 PM

#

oh yeah

raw compass Apr 14, 2023, 10:53 PM

#

in machine learning, but have been using python for like 3 years.

soft badge Apr 14, 2023, 10:54 PM

#

oh yeah

lapis sequoia Apr 14, 2023, 10:54 PM

#

i've been starting like 10 different courses but never manage to finish any

#

except one, but forgot most of it after the summer

soft badge Apr 14, 2023, 10:54 PM

#

yeah

#

do you always find a "better course" yeah?

#

and decide start this course*

lapis sequoia Apr 14, 2023, 10:55 PM

#

maybe just "doing" it is best, i asked my professor for past lecture presentations, looking forward to getting them and past assignments, will do them then maybe kaggle

soft badge Apr 14, 2023, 10:56 PM

#

guys the course of udemy are equals course on youtube?

#

In your opinion, are they super basic courses that don't teach you anything?

lapis sequoia Apr 14, 2023, 10:56 PM

#

some courses are paid and i guess are better since they are more structured

#

i'm poor so i dont want it

#

the ways i learned other fields, especially maths, have been by carefully looking at lecture notes, youtube videos, and doing a lot of practice problems

#

so gonna try that

iron basalt Apr 14, 2023, 10:57 PM

#

lapis sequoia maybe just "doing" it is best, i asked my professor for past lecture presentatio...

Practice is always needed.

lapis sequoia Apr 14, 2023, 10:58 PM

#

yeah

iron basalt Apr 14, 2023, 10:58 PM

#

Everything else is preparation for the practice problems so you can solve them.

lapis sequoia Apr 14, 2023, 11:01 PM

#

https://www.datasciencecourse.org/lectures/ bruh

Lectures

This page lists the class lectures plus additional material (slides, notes) associated with each lecture. Recordings of all the classes will available on the course Canvas page. Lectures from a previous offering (Fall 2019) are available on [Panopto](https://scs.hosted.panopto.com/Panopto/Pages/Sessions/List.aspx#folderID="618ea253-ca45-4b14-9...

#

i just found this from CMU

#

they litterally have lecture pdfs, lectures recorded and timestamped

#

seems like a great course

soft badge Apr 14, 2023, 11:03 PM

#

i was doing the course of IBM

#

some people talk about this, and say that is good

lapis sequoia Apr 14, 2023, 11:09 PM

#

i am not sure exactly what kind of transformation this was, but often its because a lot of ml algorithms that are applied assume a normalized dataset

jade bloom Apr 14, 2023, 11:09 PM

#

lapis sequoia i am not sure exactly what kind of transformation this was, but often its becaus...

yeah it was a normalization, sorry for the unclear question, let me rephrase

#

#

here we scale our X_train data into values from 0 to 1, and only X_train to prevent data leakage

#

then here we apply the scaling to X_train

#

but we also do so to X_test

#

jade bloom Apr 14, 2023, 11:12 PM

#

jade bloom

here the min and max of X_train is 0 and 1

jade bloom Apr 14, 2023, 11:13 PM

#

jade bloom

but for X_test the min is -0.014108392024525074 and the max is 1.0186515935232023

#

so my question is 1) why aren't the values 0 to 1

#

and 2) why do we transform the X_test if we wanted to prevent data leakage

#

any help would be greatly appreciated 😄

rugged comet Apr 15, 2023, 12:30 AM

#

I have some duplicate data in my dataframe but with different names. For example, tribal-human and human-tribal are the same thing but with different names. How can I pick one to keep and remove the other? I was thinking something like this

for value in df:
    part_1, part_2 = value.split("-")
    reverse = part_2 + "-" + part_1
    while reverse in df:
        # Remove reverse from df

I'm don't think you're supposed to loop over dataframes like this though.

cold osprey Apr 15, 2023, 12:34 AM

#

Can do a replace

#

Would also put it in a lambda function to use apply() on instead of a for loop

rugged comet Apr 15, 2023, 12:37 AM

#

cold osprey Can do a replace

pandas replace or string replace?

cold osprey Apr 15, 2023, 12:38 AM

#

Iirc there's a way to replace all values in a column to another

#

Maybe like loop through the unique ones of the col, then check whether the reverse exists, if yes then do the replace

rugged comet Apr 15, 2023, 12:46 AM

#

cold osprey Maybe like loop through the unique ones of the col, then check whether the rever...

I'd rather not replace the duplicate values with anything. I'd rather remove them.

cold osprey Apr 15, 2023, 12:53 AM

#

Yeah u can do that too just drop em

rugged comet Apr 15, 2023, 1:37 AM

#

cold osprey Yeah u can do that too just drop em

I tried this.

for tag_name in df["tag_name"].unique():
    print(f"tag_name: {tag_name}")
    reverse = "-".join(reversed(tag_name.split("-")))
    print(f"reverse: {reverse}")
    if reverse in df["tag_name"].unique():
        # print(f"Droping {reverse}")
        mask = df["tag_name"] == reverse
        df.drop(df[mask].index, inplace=True)

But it removes both tribal-zombie and zombie-tribal for example. I think it's because I am iterating through .unique which isn't getting updated as I iterate.

cold osprey Apr 15, 2023, 1:38 AM

#

yep

#

maybe u can drop it from that unique() list too once u iterated over it?

rugged comet Apr 15, 2023, 1:39 AM

#

cold osprey maybe u can drop it from that unique() list too once u iterated over it?

Interesting idea.

cold osprey Apr 15, 2023, 1:39 AM

#

so like assign it to another variable before the for loop

#

then as u drop, update it

#

i have another idea but this one seems the simplest

#

the other one will likely end up for hella if statements

rugged comet Apr 15, 2023, 1:44 AM

#

cold osprey then as u drop, update it

Are you sure we're supposed to remove elements from a list as we iterate through it?

cold osprey Apr 15, 2023, 1:46 AM

#

rugged comet Are you sure we're supposed to remove elements from a list as we iterate through...


unique_list =  df["tag_name"].unique()

for tag_name in unique_list:
  # do stuff here

  if reverse in unique_list:
    # remove the reverse from main df
    # remove reverse from unique_list

#

this was what i had in mind

#

i think it unique_list shud be updating right?

#

so like if the 1st item in the list is tribal-zombie, we remove zombie-tribal too, the for loop should never 'see' zombie-tribal

#

lemme try it haha

rugged comet Apr 15, 2023, 1:49 AM

#

cold osprey ```py unique_list = df["tag_name"].unique() for tag_name in unique_list: # ...

This is exactly what I did too. However, Python doesn't like it when you remove elements from a list as you're iterating through that same list.
Here's an example

nums = [1, 2, 3, 4, 5]
for num in nums:
    nums.remove(num)
print(nums)

[2, 4]

The expected behavior is that all elements get removed but that isn't how it works.

cold osprey Apr 15, 2023, 1:51 AM

#


unique_list = ['a-b', 'b-a', 'c-d']
for item in unique_list:
    # do stuff here
    reverse = "-".join(reversed(item.split("-")))
    print(reverse)
    if reverse in unique_list:
        unique_list.remove(reverse)

#

ur example is diff than mine

#

i got what i expected to get

#

only 2 items gets printed out, b-a which is the reverse of the 1st element a-b

#

and d-c which is reverse of c-d

rugged comet Apr 15, 2023, 2:07 AM

#

hmm

#

I had some incorrect logic in my actual code.

cold osprey Apr 15, 2023, 2:10 AM

#

in ur example, when u delete the num ure currently on, num becomes the next num apparently

#

and when it goes to the beginning of the for loop again, it jumps to the next one

rugged comet Apr 15, 2023, 2:10 AM

#

cold osprey in ur example, when u delete the num ure currently on, num becomes the next num ...

Yeah that's why I thought you weren't supposed to remove elements from a list as you're iterating over that list.

cold osprey Apr 15, 2023, 2:11 AM

#

but for mine, since its deleting not the current one we r on, it should be fine i hope haha

fierce harbor Apr 15, 2023, 2:38 AM

#

Attempting to write a program that partly deals with second implicit derivatives so I worked one out by hand but I keep getting the wrong answer, can anyone spot my error?

#

I have taken the derivative of some function f(x), and got first derivative dy/dx = (-3x^2 - 4xy) / (2x^2 + 8y)

#

What is the second derivative at (0, sqrt(3)) I keep getting -1/2 when it should be -1/16

#

cold osprey Apr 15, 2023, 3:05 AM

#

fierce harbor What is the second derivative at (0, sqrt(3)) I keep getting -1/2 when it should...

how do u know it is -1/16

#

also what was the original y?

#

was ur first derivative correct?

frozen marten Apr 15, 2023, 4:27 AM

#

..

foggy vigil Apr 15, 2023, 4:41 AM

#

Does anyone know where I can find calculus problems of several variables but with solution?

#

But that re a little difficult

queen cradle Apr 15, 2023, 5:01 AM

#

There are lots of books of this sort. Schaum's used to be a brand that did this, I think. Don't know if they're still around.

zealous badger Apr 15, 2023, 7:41 AM

#

how do i convert a column with time series data like this at 5ms interval to rows with 1s interval where the related column is the mean of the value over the whole second?

2023-04-15 00:00:00.050000    2
2023-04-15 00:00:00.100000    1
2023-04-15 00:00:00.150000    2
2023-04-15 00:00:00.200000    3

should be 2023-04-15 00:00:01 2

zealous badger Apr 15, 2023, 7:58 AM

#

apparently df.resample exists

cold osprey Apr 15, 2023, 2:31 PM

#

anyone use wsl for tensorflow-gpu?

#

or yall run linux natively

serene scaffold Apr 15, 2023, 2:40 PM

#

most people do model training on mainframes that run linux. it's not hard to install pytorch on windows if you can find the right wheel for it, though.

cold osprey Apr 15, 2023, 2:54 PM

#

ye for work deffo on servers. was thinking more personal project / smaller scale stuff

frozen marten Apr 15, 2023, 2:55 PM

#

can anyone help me with designing a 3d pspnet model?

#

pspNetModel = sm.PSPNet(
'resnet34',
input_shape = (144, 144, 144, 3),
classes=4,
activation='sigmoid'
)
LR = 0.0001
optim = keras.optimizers.Adam(LR)
pspNetModel.compile(optimizer = optim, loss = total_loss,metrics='accuracy')
pspNetModel.fit(train_img_datagen,
steps_per_epoch=5,
epochs=3,
verbose=1,
validation_data=val_img_datagen,
validation_steps=val_steps_per_epoch,
)
This is giving me a val_loss nan

cold osprey Apr 15, 2023, 3:02 PM

#

what is total_loss

frozen marten Apr 15, 2023, 3:35 PM

#

used for training
wt0, wt1, wt2, wt3 = 0.25,0.25,0.25,0.25
import segmentation_models_3D as sm
dice_loss = sm.losses.DiceLoss(class_weights=np.array([wt0, wt1, wt2, wt3]))
focal_loss = sm.losses.CategoricalFocalLoss()
total_loss = dice_loss + (1 * focal_loss)

#

@cold osprey

frozen marten Apr 15, 2023, 3:38 PM

#

cold osprey what is `total_loss`

any idea why it's showing an nan?

cold osprey Apr 15, 2023, 3:39 PM

#

try other losses first?

#

instead of this total loss

frozen marten Apr 15, 2023, 3:44 PM

#

what can I try?

#

can you please suggest me some?

#

cos the ones which i checked required a model.parameters() as an arg within the loss

#

but the sm.pspnet does not support .parameters()

cold osprey Apr 15, 2023, 3:46 PM

#

cant u use just focal_loss or smth?

#

ive no idea what a pspnet model is fwiw

dire field Apr 15, 2023, 4:05 PM

#

Is this the correct channel to ask questions about pandas/polars or is there a data processing channel I'm not seeing?

serene scaffold Apr 15, 2023, 4:07 PM

#

dire field Is this the correct channel to ask questions about pandas/polars or is there a d...

this is the channel for pandas and polars.

dire field Apr 15, 2023, 4:08 PM

#

serene scaffold this is the channel for pandas and polars.

Cool, thanks.

serene scaffold Apr 15, 2023, 4:09 PM

#

for your general awareness, I can help with most pandas questions, but I typically require a copy-and-pasteable copy of the dataframe, like df.head().to_dict('list')

dire field Apr 15, 2023, 4:10 PM

#

serene scaffold for your general awareness, I can help with most pandas questions, but I typical...

Good to know. I'll try to make a minimal example.

#

I want to join/merge multiple dataframes. The catch is that they don't all share the same columns I want to join on. So I want to do what I am calling a "permissive" join where dataframes are joined based on which join_on columns they share. I think the code below is working how I expect, though I don't have thorough unit tests yet. However before preceding, I was wondering if there is a better way to do this. Ideally, there would be native pandas/polars methods so I could avoiding having to write these custom functions.

import polars as pl
from functools import reduce
from typing import Iterable



def get_shared_elements(iterables: list[Iterable]) -> list[str]:
    return list(reduce(lambda a, b: a & b, [set(s) for s in iterables]))

def join_multiple_dfs(dfs: list[pl.DataFrame], join_on: list[str]) -> pl.DataFrame:
    return reduce(
        lambda left, right: left.join(right, how="inner", on=get_shared_elements(
            iterables=[left.columns, right.columns, join_on])), dfs
        )

def test_join_multiple_dfs():
    df1 = pl.DataFrame({"subjectkey": ["a", "a", "a"], "eventname": ["x", "z", "y"], "var1": [5,6,7]})
    df2 = pl.DataFrame({"subjectkey": ["a", "a", "b"], "eventname": ["x", "y", "y"], "var2": [1, 2, 3]})
    df3 = pl.DataFrame({"subjectkey": ["a", "b", "c"], "var3": ["foo", "bar", "baz"]})

    dfs = [df1, df2, df3]
    df = join_multiple_dfs(dfs=dfs, join_on=['subjectkey', 'eventname'])
    print(df)
    # FIXME need to make expected_output
    # assert df.frame_equal(expected_output)

test_join_multiple_dfs()

In this example df3 does not have the column eventname so I only want to join on subjectkey.

cold osprey Apr 15, 2023, 4:29 PM

#

    df = join_multiple_dfs(dfs=dfs, join_on=['subjectkey', 'eventname'])

should be

    df = join_multiple_dfs(dfs=dfs, join_on=get_shared_elements(dfs))
``` ?

#

hmm wait m confused

main sigil Apr 15, 2023, 4:33 PM

#

I just started with NLP and trying to understand cosine similarly and Euclidean distance.

As cosine similarly takes direction into consideration than magnitude I always feel for all NLP tasks cosine similarly is the best.

But are there any scenario where Euclidean distance works better than cosine similarly for NLP?

cold osprey Apr 15, 2023, 4:36 PM

#

the the join_on parameter necessary? hmm

dire field Apr 15, 2023, 4:36 PM

#

cold osprey ```py df = join_multiple_dfs(dfs=dfs, join_on=['subjectkey', 'eventname']) `...

I want to get the intersection of the columns of the two dataframes that are being joined and the strings passed to join_on in join_multiple_dfs. Technically, you don't need the join_on arg if the dataframes only share the columns you want to join on, but I can't guarantee that for my use case, so the arg guards against this.

cold osprey Apr 15, 2023, 4:36 PM

#

cant run the code rn coz doing some shit with my envs

#

i dont see why join_on is necessary hmmn

#

get_shared_elements returns the shared columns between 2 dataframes, which we use to join

dire field Apr 15, 2023, 4:39 PM

#

cold osprey i dont see why `join_on` is necessary hmmn

Say that df1 has column "foo" and df2 also has column "foo" but I don't want to join on "foo". I only want to join on "subjectkey" and "eventname". That is what the join_on arg is for.

cold osprey Apr 15, 2023, 4:39 PM

#

ahh i see

#

ok its a global whitelist of sorts

dire field Apr 15, 2023, 4:39 PM

#

cold osprey ok its a global whitelist of sorts

yep

cold osprey Apr 15, 2023, 4:40 PM

#

seems fine

#

altho i cant rly brain the reduce lambda in join_multiple_dfs

#

i assume its doing what i think its doing

#

type hinting isnt helping too haha coz i dont use it KEKW

dire field Apr 15, 2023, 4:45 PM

#

cold osprey altho i cant rly brain the reduce lambda in `join_multiple_dfs`

Here is the non-functional version of it, if that helps:

def join_multiple_dfs(dfs, join_on):
  joined_df = dfs[0]
  for df in dfs[1:]
    joined_df = df.join(joined_df, how="inner", on=get_shared_elements(
            iterables=[join_df.columns, df.columns, join_on]))
  return dfs

cold osprey Apr 15, 2023, 4:47 PM

#

Haha ok it's doing what I thought it was

dire field Apr 15, 2023, 4:47 PM

#

main sigil I just started with NLP and trying to understand cosine similarly and Euclidean ...

It depends on your use case. If there is a natural interpretation of euclidean than you might have some motivation to choose that one. But otherwise, you are right, cosine similarity is often preferred in nlp.

short talon Apr 15, 2023, 6:09 PM

#

any reason why my help request would just get closed with no responses?

main sigil Apr 15, 2023, 6:27 PM

#

dire field It depends on your use case. If there is a natural interpretation of euclidean t...

Are there any good resource that explains when to choose which distance metrics. All the resources I referred didn't mention the reasons in depth

dire field Apr 15, 2023, 6:34 PM

#

main sigil Are there any good resource that explains when to choose which distance metrics....

To my knowledge there's no great rule of thumb for choosing metrics. You usually just choose your metric if there is some conceptual motivation to do so. Incidentally, ML researchers have found that learned distance/similarity metrics perform better (for down stream tasks) than metrics chosen explicitly. Look up "metric learning" for more info.

earnest widget Apr 15, 2023, 6:46 PM

#

I am currently trying out feature extraction using RESNET but I want to know if I resize the image to a smaller size, will it get affected in any way better or worse?

fallow frost Apr 15, 2023, 7:21 PM

#

its an API (written on fastapi) that gives recommendations based on the input, before it would query the DB each time, but I suggested to do all of it in memory so we dont need to do a network request to the DB, and since there are only 600k records, not that much.
the point is that I'm constantly querying the DB/dataframe (filtering), mostly with SELECT ... WHERE col IN (...) or pd.DataFrame.isin(...), so I would like to do that as eficciently as possible.
I will try the suggestion from @serene scaffold when I go back to work, but Polars sounds really interesting (credit to @tidal bough ) as I've started using pyarrow quite a bit lately, and its usually very fast for this stuff (and it has a very low memory footprint).

boreal gale Apr 15, 2023, 7:54 PM

#

fallow frost its an API (written on fastapi) that gives recommendations based on the input, b...

does your dataframe/source data change over time?
does your input change over time?
what is the characteristics of col? (e.g. cardinality, data type, unique-ness, skewness/distribution)
what is the characteristics of your input?
what is the current performance you have?
what is the desired performance?

it's worth noting pd.Series.isin could utilise two different algorithm under the hood depending on the characteristics of your series and your input, and isin itself is already quite optimised, in most cases that's the best you can eek out of pandas. (one of two is a hashmap based algo, so using a set in python might be inferior to isin)

cold osprey Apr 15, 2023, 8:21 PM

#

PermissionError                           Traceback (most recent call last)
Cell In[15], line 22
     19     n += 1
     21 im = Image.open("model.png")
---> 22 mlflow.log_image(im, "model.png")

error trace here


PermissionError: [Errno 13] Permission denied: '/c:'

#

Context: fitting a tensorflow model in wsl, using mlflow.tensorflow.autolog() which logs the metrics

queen cradle Apr 15, 2023, 8:23 PM

#

Don't post screenshots. Post text.

cold osprey Apr 15, 2023, 8:23 PM

#

model_plot = utils.plot_model(model, show_shapes=True, show_layer_names=True)
model_plot

model plot which is saved to a model.png file

#


# Train the model
epochs = 200
batch_size = 64

with mlflow.start_run():
    history = model.fit(
        X_train, y_train, batch_size=batch_size, epochs=epochs, validation_data=(X_val, y_val)
    )

    test_metrics = model.evaluate(X_test, y_test)

    n = 0
    for metric in test_metrics:
        if n == 0:
            mlflow.log_metric(("test_loss"), test_metrics[n])
        else:
            mlflow.log_metric(("test_" + metrics[n - 1].name), test_metrics[n])

        n += 1

    im = Image.open("model.png")
    mlflow.log_image(im, "model.png")

then opens the model.png file as a pillow image and logs it as an artifact on mlflow

cold osprey Apr 15, 2023, 8:24 PM

#

queen cradle Don't post screenshots. Post text.

fixed

queen cradle Apr 15, 2023, 8:25 PM

#

PermissionError suggests that something is wrong with model.png.

cold osprey Apr 15, 2023, 8:25 PM

#

its not the reading of the file causing the error

#

its the logging with mlflow thats causing it

#

not familiar with linux, let alone wsl so not sure if theres a way to give it the perms it needs to write to /C: or not

queen cradle Apr 15, 2023, 8:27 PM

#

Okay, I don't know anything about mlflow, so I'm afraid I can't help you.

#

But maybe someone else will come along who can.

soft badge Apr 15, 2023, 10:00 PM

#

In your opinion, will chatGPT or other technologies be the parameter for the development of everything from today? Are so many new artificial intelligences going to use chatGPT in their application?

#

and also in the applications, eg a website that summarizes books, it is no longer necessary to build the whole AI model and training, just integrate the chatGPT, do you think that the creation of new models will be replaced by just an integration with the chatGPT?

agile cobalt Apr 15, 2023, 10:04 PM

#

not at all.
chat gpt only does one thing: Respond to text with text

you can try to go out of your way to engineer ways to transform other tasks into text completion, but it's going to be extremely inefficient if not impossible for a lot of tasks.

#

it may be usable for summarising books, but how would you use it for recommending books? literally ask it directly and recommend whatever it hallucinates?

iron basalt Apr 15, 2023, 10:06 PM

#

soft badge and also in the applications, eg a website that summarizes books, it is no longe...

For ChatGPT's domain, it will probably be dominated by OpenAssistant based models. They have been collecting a lot of samples really fast via public community efforts.

soft badge Apr 15, 2023, 10:11 PM

#

iron basalt For ChatGPT's domain, it will probably be dominated by OpenAssistant based model...

so for example, for creation of AI in systems mainly chatGPT will be used, right, instead of having to develop all this from scratch?

iron basalt Apr 15, 2023, 10:13 PM

#

soft badge so for example, for creation of AI in systems mainly chatGPT will be used, right...

No, ChatGPT is a narrow AI. For the specific task that ChatGPT does, it will be used, although it will probably be replaced soon by OpenAssistant models and/or ChatGPT itself will be trained on the OpenAssistant datasets.

soft badge Apr 15, 2023, 10:16 PM

#

iron basalt No, ChatGPT is a narrow AI. For the specific task that ChatGPT does, it will be ...

but so to create an AI for a specific task, example: AI for playing fortnite, do you think this will be developed from scratch or will it be assisted with chatGPT going forward?

iron basalt Apr 15, 2023, 10:17 PM

#

soft badge but so to create an AI for a specific task, example: AI for playing fortnite, do...

From scratch.

#

If we are to have some generic base from which AIs are created it would have to be some world model trained on the real world and/or simulation. And this is a much more difficult task than downloading a bunch of text, creating prompts, and having people go through them and rate them and such.

#

A text model can then be included to have a better interface with humans.

iron basalt Apr 15, 2023, 10:28 PM

#

iron basalt If we are to have some generic base from which AIs are created it would have to ...

*It could be done though, especially with a public crowd effort like with OpenAssistant.

iron basalt Apr 15, 2023, 10:31 PM

#

iron basalt If we are to have some generic base from which AIs are created it would have to ...

- With a strong enough world model your text model probably does not need to be nearly as good as ChatGPT. Humans probably know less about text than ChatGPT, but that does not matter for them because what they say probably comes mostly from their world model (this becomes especially apparent when you try to get a language model to write code for something it has not seen before, it only has learned from the shadow of reality that is language).

soft badge Apr 15, 2023, 10:34 PM

#

yeah

#

do you guess prompt enginner is next profission, can replace almost every areas, provided that the engineer has knowledge in these areas?

iron basalt Apr 15, 2023, 10:38 PM

#

soft badge do you guess prompt enginner is next profission, can replace almost every areas,...

Last time I checked I did not get to be a "prompt engineer" for using Wikipedia to look up some concept.

fierce harbor Apr 15, 2023, 10:38 PM

#

cold osprey how do u know it is -1/16

The answer key to where I got the problem stated -1/16

fierce harbor Apr 15, 2023, 10:39 PM

#

cold osprey was ur first derivative correct?

it was correct according to the answer key I got it from

iron basalt Apr 15, 2023, 10:39 PM

#

Saying i'm a prompt engineer is like saying i'm a "professional Googler."

#

High paying jobs will be the same as they always have been, having a strong world model with regards to some domain.

soft badge Apr 15, 2023, 10:40 PM

#

i really dont know

soft badge Apr 15, 2023, 10:42 PM

#

iron basalt Saying i'm a prompt engineer is like saying i'm a "professional Googler."

it's really true about google pro but chatGPT is more specific in the answer and parsing question fix that code google has no power to do that.

iron basalt Apr 15, 2023, 10:43 PM

#

soft badge it's really true about google pro but chatGPT is more specific in the answer and...

To get the correct specific answer out of a language model you need to already have a lot of knowledge about the domain. If you already have enough knowledge about the domain Google will work just fine.

#

(Or you probably do not need to Google anything, except a few specific easy to Google things like for example the values of some physical constants)

soft badge Apr 15, 2023, 10:45 PM

#

yes, but the chat interprets your question, while google shows possible answers to your question, but it's not something directed like the chat, you know?

iron basalt Apr 15, 2023, 10:45 PM

#

soft badge yes, but the chat interprets your question, while google shows possible answers ...

The directed chat may be an improvement, but that does not suddenly make a huge difference. It's just a bit more nice.

soft badge Apr 15, 2023, 10:46 PM

#

yeah but think in few years how be to

#

the speed that advances is something unreal

iron basalt Apr 15, 2023, 10:47 PM

#

In a few years we may have that world model I wrote about, when that happens there may not be any prompt engineers either, there may not be any engineers...

queen cradle Apr 15, 2023, 10:47 PM

#

ChatGPT isn't actually significantly more sophisticated than what came before it. It's just bigger.

iron basalt Apr 15, 2023, 10:47 PM

#

Bigger issues to deal with at that point.

#

New versions of language models will not suddenly do something they did not do before. They just do that same thing better.

#

The bigger gains now are probably from plugins, e.g. Wolfram Language.

soft badge Apr 15, 2023, 10:53 PM

#

yeah

iron basalt Apr 15, 2023, 10:56 PM

#

*But even in that case, what is really happening is that it's being used to improve the UX for that thing. The real power is just whatever that plugin was for. Wolfram Language for example has always been amazing at what it does.

#

(Also it had NLP already (for a long time), you could code in English with it, this is just a better version of that)

soft badge Apr 15, 2023, 11:02 PM

#

iron basalt *But even in that case, what is really happening is that it's being used to impr...

yeah, how did you acquire your knowledge in the area of Ai?

iron basalt Apr 15, 2023, 11:03 PM

#

soft badge yeah, how did you acquire your knowledge in the area of Ai?

Trying to make things.

soft badge Apr 15, 2023, 11:04 PM

#

iron basalt Trying to make things.

but did you take a course, or did you follow the path of college?

iron basalt Apr 15, 2023, 11:05 PM

#

soft badge but did you take a course, or did you follow the path of college?

Neither. I was reading papers (including old papers from the 40s, 50s, 60s, 70s, 80s).

#

And books, and things on the internet, following other's work.

#

The whole courses for AI/ML and in colleges is a recent thing. It was there, but kind of like in the dusty corner (relative to now).

soft badge Apr 15, 2023, 11:07 PM

#

oh nice

iron basalt Apr 15, 2023, 11:07 PM

#

Now it has everyone's attention ( 😉 ).

soft badge Apr 15, 2023, 11:07 PM

#

do you have background in what area?

iron basalt Apr 15, 2023, 11:08 PM

#

Programming and mathematics mostly I would say.

soft badge Apr 15, 2023, 11:09 PM

#

oh this is very interesting

#

in question of programming, do you think the time it takes to be a good programmer with all this information overload has become faster, or does it all come down to practice?

iron basalt Apr 15, 2023, 11:11 PM

#

soft badge in question of programming, do you think the time it takes to be a good programm...

It's faster, but it's practice.

soft badge Apr 15, 2023, 11:12 PM

#

but do you think developing new solutions or studying first and then practicing?

#

knowing what to use, how to use it, because sometimes you create a crappy solution, but it works, you know?

iron basalt Apr 15, 2023, 11:12 PM

#

soft badge but do you think developing new solutions or studying first and then practicing?

I just started making things. So I guess you could say it was practice from day one.

soft badge Apr 15, 2023, 11:13 PM

#

in terms of improving as a developer, is your recommendation to read books and source code?

iron basalt Apr 15, 2023, 11:14 PM

#

soft badge in terms of improving as a developer, is your recommendation to read books and s...

Reading source code, yes. And just making things. Just actively programming every day for hours.

soft badge Apr 15, 2023, 11:15 PM

#

iron basalt I just started making things. So I guess you could say it was practice from day ...

when I came into contact with programming 1 year ago it was to manipulate data in the csv, I started without knowing anything, but in 3 weeks I managed to use this script to generate a budget for my father's company that was in the beginning, something that the budget sometimes takes 1 day depending on the size, in the script it took 5 min.

soft badge Apr 15, 2023, 11:15 PM

#

iron basalt Reading source code, yes. And just making things. Just actively programming ever...

oh yeah, of course

iron basalt Apr 15, 2023, 11:18 PM

#

Let me give an example. I would see something i'm interested in, like virtually evolved creatures, then I just started making that (from scratch). Repeat. Each time I would look at my code and realize that it was bad and could have been done in a more simple way. I then keep in mind next time to just directly solve the problem in the most simple way and not over-engineer a solution. I need to constantly remind myself of that or I start over-engineering automatically.

#

I would learn any of the mathematics and such needed for that domain as I tried to make whatever.

soft badge Apr 15, 2023, 11:21 PM

#

uhum... interesting

#

small things big difference

#

this habit make your code better each time

steel reef Apr 16, 2023, 2:44 AM

#

I am trying to create a machine learning model to classify text. Currently I have an accuracy of approximately 90%. Do you guys have any suggestions to help me increase it?

from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.feature_extraction.text import TfidfTransformer
from sklearn.naive_bayes import MultinomialNB

train, test=train_test_split(cw,train_size=0.9999999999999999, shuffle=True)


# Tokenizing text
count_vect = CountVectorizer()
X_train_counts = count_vect.fit_transform(train.Text)


# Term Frequencies
tf_transformer = TfidfTransformer(use_idf=False).fit(X_train_counts)
X_train_tf = tf_transformer.transform(X_train_counts)

# Term Frequency times Inverse Document Frequency
tfidf_transformer = TfidfTransformer()
X_train_tfidf = tfidf_transformer.fit_transform(X_train_counts)
clf = MultinomialNB(alpha=0.1).fit(X_train_tfidf, train.class_label)

checking = pd.read_csv('checkworthy_eval.tsv',sep = '\t')

X_new_counts = count_vect.transform(checking.Text)
X_new_tfidf = tfidf_transformer.transform(X_new_counts)

predicted = clf.predict(X_new_tfidf)

print(predicted)
checking["Category"] = predicted

checking.drop(['Text'], axis=1, inplace=True)
checking.rename(columns = {'Sentence_id':'Id'}, inplace = True)
print(checking)
gfg_csv_data = checking.to_csv('checkworthy_eval_prediction.csv', index = False)```

prisma citrus Apr 16, 2023, 3:21 AM

#

Is it possible to make your own chat bot AI? Like use an already existing ai, give it certain parameters to give it personailty like its background, origin, etc. and then talk with it? If it is possible can one link it to a discord bot

#

Kind of like how Neuro-sama the vtuber AI works

agile cobalt Apr 16, 2023, 3:44 AM

#

I'm pretty sure that that is not how neuro sama works

#

if you are serious about it, look into fine tuning OpenAI's models via their API [medium difficulty] or creating your own LLM from scratch [hard difficulty]
if you are just curious about what it could look like, see https://character.ai

serene scaffold Apr 16, 2023, 3:50 AM

#

soft badge but so to create an AI for a specific task, example: AI for playing fortnite, do...

bots that play video games have essentially nothing to do with what chatgpt does.

sharp crypt Apr 16, 2023, 4:11 AM

#

has anyone done projects with imitation learning? Would love to learn more about it

lapis sequoia Apr 16, 2023, 5:01 AM

#

https://pytorch.org/blog/overview-of-pytorch-autograd-engine/ is it supposed to be "dw/dx, dw/dy." right at the end of the text up top?

steel reef Apr 16, 2023, 6:32 AM

#

steel reef I am trying to create a machine learning model to classify text. Currently I hav...

Hey guys?

lapis sequoia Apr 16, 2023, 7:44 AM

#

steel reef I am trying to create a machine learning model to classify text. Currently I hav...

isn't your train size way too much?

#

I mean... with train data 0.99 and test data hardly 1e-6 or something perc, I'd say even 90 is like... uhm. You know very less data to evaluate.

prisma citrus Apr 16, 2023, 7:49 AM

#

agile cobalt if you are serious about it, look into fine tuning OpenAI's models via their API...

Yeah i managed to find a github program that did exactly what i needed
https://github.com/drizzle-mizzle/CharacterAI-Discord-Bot/wiki/How-to-set-up

GitHub

How to set up

CharacterAI for your Discord server. Contribute to drizzle-mizzle/CharacterAI-Discord-Bot development by creating an account on GitHub.

#

Thanks for introducing characterAI to me! 😁

steel reef Apr 16, 2023, 8:42 AM

#

lapis sequoia I mean... with train data 0.99 and test data hardly 1e-6 or something perc, I'd ...

I'm evaluating data from a different file

compact egret Apr 16, 2023, 12:19 PM

#

Hiya, I have a question for u Keras pros out there, so i have a model where i pass in my training data (with the labels) as a PaddedBatchDataset object, ner_model.fit(train_dataset, epochs=10)

my question then is, in my model call function how do i access the labels, i have been looking all over and cant find any examples for my case

fringe mantle Apr 16, 2023, 12:42 PM

#

Hi guys actually I've been trying to learn how to read scatter plots and how to make sense of the pattern. Can someone share a good resource i can look up for the same!

dire field Apr 16, 2023, 1:22 PM

#

steel reef I am trying to create a machine learning model to classify text. Currently I hav...

If your dataset's vocabulary is a common language (e.g. english) then you can use pretrained word embeddings from a large language model as your features, other wise you can learn embeddings yourself. You could try using word2vec or a BERT-like architecture to learn the embeddings. Typically, learned embeddings perform better than bag of words or tfidf features.

hasty mountain Apr 16, 2023, 2:48 PM

#

All that talk about ChatGPT...and I'm still struggling to make my vanilla Transformer to converge grumpchib

#

There's a paper about using a new parameter for scaling the residual blocks. Apparentely, the residual blocks tends to both stabilize and mess up the model...

Yet my model is indifferent to it. I hope I'm not implementing it correctly

simple tapir Apr 16, 2023, 3:15 PM

#

Hey

#

import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from nltk.tokenize import word_tokenize
from collections import Counter
import pandas as pd

# Load and preprocess the data
data = pd.read_csv('sentiment_dataset.csv')

text = data['Text'].tolist()
labels = data['Label'].tolist()
words = word_tokenize(' '.join(text))
word_counter = Counter(words)
vocab = sorted(word_counter, key=word_counter.get, reverse=True)
word2idx = {word: idx+1 for idx, word in enumerate(vocab)}
text = [[word2idx[word] for word in word_tokenize(sent)] for sent in text]
max_seq_length = max([len(sent) for sent in text])
text = [sent + [0]*(max_seq_length-len(sent)) for sent in text]

# Split the data into train and test sets
train_X, test_X, train_y, test_y = train_test_split(text, labels, test_size=0.2, random_state=1234)

# Define the PyTorch Dataset and DataLoader for the data
class SentimentDataset(torch.utils.data.Dataset):
    def __init__(self, X, y):
        self.X = torch.tensor(X, dtype=torch.long)
        self.y = torch.tensor(y, dtype=torch.float)

    def __len__(self):
        return len(self.X)

    def __getitem__(self, index):
        return self.X[index], self.y[index]

batch_size = 64
train_dataset = SentimentDataset(train_X, train_y)
test_dataset = SentimentDataset(test_X, test_y)
train_loader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True)
test_loader = DataLoader(test_dataset, batch_size=batch_size, shuffle=False)

#and more...

#

Traceback (most recent call last):
  File "c:\Users\Salih Furkan\OneDrive\Masaüstü\Sentiment-analysis-cnn-master\model_in_torch.py", line 20, in <module>
    text = [[word2idx[word] for word in word_tokenize(sent)] for sent in text]
  File "c:\Users\Salih Furkan\OneDrive\Masaüstü\Sentiment-analysis-cnn-master\model_in_torch.py", line 20, in <listcomp>
    text = [[word2idx[word] for word in word_tokenize(sent)] for sent in text]
  File "c:\Users\Salih Furkan\OneDrive\Masaüstü\Sentiment-analysis-cnn-master\model_in_torch.py", line 20, in <listcomp>
    text = [[word2idx[word] for word in word_tokenize(sent)] for sent in text]
KeyError: 'D.H'

#

How can i solve this?

dire field Apr 16, 2023, 3:53 PM

#

simple tapir ``` Traceback (most recent call last): File "c:\Users\Salih Furkan\OneDrive\Ma...

It's telling you that D.H is not a key in the dictionary word2idx. Which means that text has a word in it that is not in your vocab. I like to make my word2indx via something like word2idx = {i: word for i, word in enumerate(text.unique())} (assuming text is a pandas Series where each element is a word) which helps prevents error like this.

simple tapir Apr 16, 2023, 3:57 PM

#

Text is like that

dire field Apr 16, 2023, 4:00 PM

#

simple tapir Text is like that

Then it probably depends on what word_tokenize (I'm not familiar with it) is doing. I would grab the first few lines of that file and play around with word_tokenize to see how it transforms them to make sure it's doing what you think it is.

cold osprey Apr 16, 2023, 4:02 PM

#

simple tapir Apr 16, 2023, 4:02 PM

#

dire field Then it probably depends on what `word_tokenize` (I'm not familiar with it) is d...

uhh I see. Lemme test it. Thanks for the help

#

oh

#

it worked

#

But... what prevented the code from running perfectly? The first couple of lines are okay but what's the obstacle there 🤔

#

oooh

dire field Apr 16, 2023, 4:11 PM

#

simple tapir But... what prevented the code from running perfectly? The first couple of lines...

Plug the line that breaks it into word_tokenizer (i.e. the line with "D.H" in it) and see what's different about that line

simple tapir Apr 16, 2023, 4:19 PM

#

After removing the D.H words, it later gave a keyerror m.j. So I thought that words having 2 lengths cause errors. But now, it gives an error because of a key "DoOrk"

#

Weird 🤔

#

Anyways, thanks for your help! It made me realize the error

forest pollen Apr 16, 2023, 8:58 PM

#

hi i need help on how to calculate the to compute the False positives of a confusion matrix

#

so the code i have so far is this:

def confusionMatrix(classified_data):
    ActualClass = classified_data[1]
    PredictedClass = classified_data[2]
    classes = np.unique(ActualClass)
    confusion_matrix = np.zeros((len(classes), len(classes)))
    for i in range(len(classes)):
        for j in range(len(classes)):
            confusion_matrix[i, j] = np.sum((ActualClass == classes[i]) & (PredictedClass == classes[j]))
    return confusion_matrix
def computeTPs(confusion_matrix): #calculated by getting this diagnals
    tps = []
    total_elem = len(confusion_matrix)
    for i in range (total_elem):
        tps = tps.append(confusion_matrix[i][i]) #confusion_matrix[i][i] will get the diagnals and append them to the tps list.
    return tps


def computeFPs(confusion_matrix):
    fps = []
    for i in range (len(confusion_matrix)):
        for j in range(len(confusion_matrix)):
            sum
    return fps```

#

the fps is essentially the columns but i was just confused on how to go about calculating it if anyone can give some pointers

serene scaffold Apr 16, 2023, 9:08 PM

#

@forest pollen which axis is for predicted and which is for actual

forest pollen Apr 16, 2023, 9:10 PM

#

ah sorry let me also show u the code for the confusion matrix:

#

so row is predicted, and column is actual

serene scaffold Apr 16, 2023, 9:11 PM

#

forest pollen ah sorry let me also show u the code for the confusion matrix:

So each value that isn't along the diagonal is a fp for the predicted class, and a fn for the actual class.

forest pollen Apr 16, 2023, 9:12 PM

#

oh so would the answer be something along the lines of me doing the sum of all the values confusion_matrix[i][j] then minusing that from the tps???

serene scaffold Apr 16, 2023, 9:13 PM

#

You don't need to do any subtraction

#

Remember that each class has its own set of true/false positive/negative values

#

What's the goal? To calculate the precision and recall for the whole system? (Rather than for each class?)

forest pollen Apr 16, 2023, 9:18 PM

#

so we are grabbing the fps, Tps, and fns to calculate recall, precision, fmeasure and accuracy

#

for the whole system

#

e.g this is a function later on:

def computeMacroPrecision(tps, fps, fns, data_size):
    precision = float(tps/(tps+fps))
    return precision```

#

see this image slightly confuses me:

#

because i thought it was the sum of all the columns - the diagnal as it is the TPs

serene scaffold Apr 16, 2023, 9:22 PM

#

forest pollen e.g this is a function later on: ```python def computeMacroPrecision(tps, fps, f...

You don't need to have float( ) in this.

forest pollen Apr 16, 2023, 9:23 PM

#

serene scaffold You don't need to have float( ) in this.

ah i'll change that thank you

serene scaffold Apr 16, 2023, 9:23 PM

#

forest pollen see this image slightly confuses me:

What do you find confusing about it

#

Also, it looks like you're computing micro precision. Because macro precision is the average of the precision for each class.

forest pollen Apr 16, 2023, 9:26 PM

#

it has to be macro average, just reading through the website and i think i understand how to go about doing it

serene scaffold Apr 16, 2023, 9:27 PM

#

If you're calculating the macro precision, recall, and F1, then you need to calculate those individually for each class

#

And then take the average of thiae

#

Those

forest pollen Apr 16, 2023, 9:43 PM

#

ah got it, i'll start working on that, i appreciate the help. felt good getting help for AI grad student haha. Thank you tho!

keen gust Apr 16, 2023, 9:59 PM

#

hi all, so I have the following line chart in streamlit. How could I go about allowing the user to select which years he wants to look at? the underlying data is a pandas df with columns for month/year/location/income

Screen_Shot_2023-04-16_at_17.57.48_PM.png

keen gust Apr 16, 2023, 10:33 PM

#

lol just realized it's literally clickable, guess that will suffice

ember kettle Apr 17, 2023, 12:48 AM

#

Currently using pandas read_csv with chunk, is there a way to start from the last chunk? Chunk starts from 2019 to 2023 but i want the more recent rows

grizzled barn Apr 17, 2023, 12:51 AM

#

Im interested in building a software that can receive a photo of a wild berry, and based on a users given location (where they are in the world), it can determine whether the wild berry is safe to eat or not. I would assume this is a relatively simple concept. Does anyone have any guidance tips on where I should start?

#

^ I'm already at what I'd consider to be an intermediate level with Python, so I'm familiar with the language, just not building photo detection software like this

dire field Apr 17, 2023, 12:55 AM

#

ember kettle Currently using pandas read_csv with chunk, is there a way to start from the la...

If you only want to read in the last n rows you can use the skiprows argument.

dire field Apr 17, 2023, 1:00 AM

#

grizzled barn Im interested in building a software that can receive a photo of a wild berry, a...

Most image prediction tasks involve neural networks these days, so you'd need either pytorch or tensorflow/keras. It's possible that there are models that are already trained on plants/berries, which would make things easier. As for the geo-stuff, I never dealt with geo data, but I've heard good things about geopandas.

grizzled barn Apr 17, 2023, 1:01 AM

#

dire field Most image prediction tasks involve neural networks these days, so you'd need ei...

Gotcha, Ill look into those asap, ty. Do you know if PyTorch or similar libraries tend to take awhile to get familiar with? My priority is creating quality software, of course, just curious if its a multi-month process.

#

The geo stuff could just be the user inputting their location manually tbh. Wouldnt have to make it automatic

steep sluice Apr 17, 2023, 1:06 AM

#

Hi, can anyone help me with a project I am pursuing?

serene scaffold Apr 17, 2023, 1:17 AM

#

steep sluice Hi, can anyone help me with a project I am pursuing?

if you need help, be sure to always ask at least one complete, answerable question.

dire field Apr 17, 2023, 1:27 AM

#

grizzled barn Gotcha, Ill look into those asap, ty. Do you know if PyTorch or similar librarie...

PyTorch has a bit of an initial learning curve, but once you get over that you start to notice that almost every PyTorch project has very similar structure.

violet gull Apr 17, 2023, 1:43 AM

#

why do i increase the number of layers and the number of nodes in a neural net?

dire field Apr 17, 2023, 2:03 AM

#

violet gull why do i increase the number of layers and the number of nodes in a neural net?

It makes the model more flexible. That is, it can fit more complex patterns in the data.

violet gull Apr 17, 2023, 2:03 AM

#

dire field It makes the model more flexible. That is, it can fit more complex patterns in t...

so ur saying if i can train a dataset on 2 nodes in 1 layer its fine?

#

if i can get the loss to near 0

dire field Apr 17, 2023, 2:04 AM

#

violet gull so ur saying if i can train a dataset on 2 nodes in 1 layer its fine?

Yes, there's nothing stopping a model from being very simple and still performing well. It all depends on the data. If simple works, all the better.

violet gull Apr 17, 2023, 2:05 AM

#

dire field Yes, there's nothing stopping a model from being very simple and still performin...

so as long as a network is good enough to get the loss to 0, it is equal to a massive model

#

and the only reason to expand a model is if the loss converges before 0

dire field Apr 17, 2023, 2:06 AM

#

violet gull and the only reason to expand a model is if the loss converges before 0

Pretty much

violet gull Apr 17, 2023, 2:06 AM

#

dire field Pretty much

so what happens if it gets 0 loss but does terribly on the testing data

dire field Apr 17, 2023, 2:07 AM

#

Then your model is overfitting, which usually means it's too flexible and has just memorized the training data and can't generalize to new data (i.e. the test set)

#

Though it would be weird if a model with only two parameters was overfitting

violet gull Apr 17, 2023, 2:13 AM

#

How do I combat over fitting

cold osprey Apr 17, 2023, 2:13 AM

#

dire field PyTorch has a bit of an initial learning curve, but once you get over that you s...

Pytorch Vs tensorflow?

I set up tf GPU with wsl just to have mlflow not work properly. Moving to pytorch now instead ragej

dire field Apr 17, 2023, 2:14 AM

#

cold osprey Pytorch Vs tensorflow? I set up tf GPU with wsl just to have mlflow not work pr...

pytorch all the way

violet gull Apr 17, 2023, 2:14 AM

#

dire field Then your model is overfitting, which usually means it's too flexible and has ju...

Also it’s more likely to overfit the bigger the model is right?

cold osprey Apr 17, 2023, 2:14 AM

#

Do u do ml research by any chance?

violet gull Apr 17, 2023, 2:15 AM

#

PyTorch is better than tensorflow yes

cold osprey Apr 17, 2023, 2:15 AM

#

My ml PhD friend is pro pytorch too

dire field Apr 17, 2023, 2:16 AM

#

violet gull Also it’s more likely to overfit the bigger the model is right?

Typically yes, however people have also discovered that if you massively overfit your data with huge models then somehow models start to work really well again.

cold osprey Apr 17, 2023, 2:16 AM

#

dire field Typically yes, however people have also discovered that if you massively overfit...

Wait what?

dire field Apr 17, 2023, 2:16 AM

#

cold osprey Do u do ml research by any chance?

Yep, I'm an applied ML researcher

cold osprey Apr 17, 2023, 2:16 AM

#

TIL

dire field Apr 17, 2023, 2:18 AM

#

cold osprey Wait what?

The phenomena is called "double descent" because you tend to see the loss curve decrease as you add more parameters then increase as you start to overfit as you would expect, but if you just keep adding parameters eventually the loss starts to decrease again??? kinda magical. Neural nets are weird

cold osprey Apr 17, 2023, 2:24 AM

#

dire field The phenomena is called "double descent" because you tend to see the loss curve ...

V interesting indeed. Will read up on this

violet gull Apr 17, 2023, 2:35 AM

#

@dire field so i should start with a very small model size and if i can get it to 0 loss then there is no reason it shouldnt do well on testing data?

dire field Apr 17, 2023, 2:38 AM

#

violet gull <@1029105140042580018> so i should start with a very small model size and if i c...

That's usually a good approach. Start small and build up from there. You may want to also have a validation set that you can validate your model on while you are tuning the number of parameters before you test your model on the test set.

violet gull Apr 17, 2023, 2:39 AM

#

what is a validation set

dire field Apr 17, 2023, 2:39 AM

#

It is another partition of your dataset that is independent from your training and test set.

#

Typically you train your model on the train set, hyperparameter tuning on the validation set, and model assessment on the test set.

violet gull Apr 17, 2023, 2:42 AM

#

dire field Typically you train your model on the train set, hyperparameter tuning on the va...

how do i make one

dire field Apr 17, 2023, 2:44 AM

#

This is how I usually do it:

from sklearn.model_selection import train_test_split

def split_train_val_test(X, y, val_test_size, random_state):
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=val_test_size, random_state=random_state
    )
    X_test, X_val, y_test, y_val = train_test_split(
        X_train, y_train, test_size=0.5, random_state=random_state
    )
    return {
        "X_train": X_train,
        "y_train": y_train,
        "X_val": X_val,
        "y_val": y_val,
        "X_test": X_test,
        "y_test": y_test,
    }

violet gull Apr 17, 2023, 2:44 AM

#

im not in python

#

i need to know exactly what it is made from

dire field Apr 17, 2023, 2:47 AM

#

The idea is to randomly split your dataset into three partitions. The majority of your data will be the train set (a common heuristic is 80%). Of the remaining 20%, 10% will be your validation set and the other 10% will be your test set.

violet gull Apr 17, 2023, 2:48 AM

#

dire field The idea is to randomly split your dataset into three partitions. The majority o...

how does the validation data differ from the testing set

dire field Apr 17, 2023, 2:53 AM

#

violet gull how does the validation data differ from the testing set

The difference is in the way it is utilized. It is typically used to evaluate the performance of your model while you are in the experimental phase of model development (e.g. tweaking hyperparameters). If you repeatedly evaluate the performance of the model on the test set (and change hyperparameters in response to these evaluations), you risk "data leakage", which means that the model will start to overfit on the test set (just memorizing it). This causes the test set to no longer be a fair evaluation of how the model will generalize to unseen data.

#

The goal of the validation set is to guard you against this.

#

If you only tune the model on the validation set, you can avoid data leakage in the test set

violet gull Apr 17, 2023, 2:55 AM

#

ok i think i understand

#

i run it on 80%

#

i adjust the architecture

#

i run it on 10%

#

i adjust the learning rate

#

i run it on other 10%

#

i win

dire field Apr 17, 2023, 2:56 AM

#

correct

violet gull Apr 17, 2023, 2:57 AM

#

dire field correct

what determines how many convolution layers i need and of what size

dire field Apr 17, 2023, 3:00 AM

#

You can either experiment by tweaking the number of layers/size by hand and see if the performance improves on the validation set or setup a more complex hyperparameter tunning scheme (for example grid search)

train -> validation -> adjust archetecture/hyperparameters -> train -> validation -> adjust archetecture/hyperparameters -> ... -> lastly evaluate on test set.

violet gull Apr 17, 2023, 3:01 AM

#

so conv is complete guessing

dire field Apr 17, 2023, 3:02 AM

#

Often yes. A lot of the time people will just use the same number of layers that other successful projects used.

#

You can do principled guessing if you setup a grid search or use Bayesian optimization, but those can get involved if you don't have a package that implements them for you.

stone oriole Apr 17, 2023, 3:03 AM

#

@raven field Thanks

violet gull Apr 17, 2023, 3:04 AM

#

dire field Often yes. A lot of the time people will just use the same number of layers that...

ok ty im sure ill be back ❤️

mint nexus Apr 17, 2023, 3:29 AM

#

Good Morning friends

#

I need help

cold osprey Apr 17, 2023, 3:35 AM

#

dire field This is how I usually do it: ```python from sklearn.model_selection import trai...

No stratify?

#

Probs doesn't matter if classes are fairly balanced and dataset is big enough

dire field Apr 17, 2023, 3:39 AM

#

cold osprey No stratify?

Yeah, it's usually a good idea to stratify.

wraith escarp Apr 17, 2023, 3:39 AM

#

good to know

#

I am trying to do a multi variable linear regression with batch gradient descent. My initial cost is astronomical... and I was wondering if this is normal for the first iteration?

#

#

My data is mostly floats and the range is quite large. About 58 features and ~2000 samples

#

I was playing around with the init W

dire field Apr 17, 2023, 3:44 AM

#

It's certainly possible if your initial solution is very far from the optimum. You could try changing intial_w and initial_b, but if it's vanilla regression then the problem is convex and should converge to the global optimum no matter where you start. It just might take a little longer to get there if you start far away.

wraith escarp Apr 17, 2023, 3:45 AM

#

dire field It's certainly possible if your initial solution is very far from the optimum. Y...

Thanks, I'll look more into better initial values. My fan is going off real loud 🤣

mint nexus Apr 17, 2023, 4:01 AM

#

guys

#

?

violet gull Apr 17, 2023, 5:29 AM

#

#

explain this, it only got 52% of the testing data right but it got near 0 loss and its a very small model trained on batches

#

so i dont see how it could be over fitting

#

it also got a lot blatantly wrong

#

Expected: [[1.0000, 0.0000]]```

wooden sail Apr 17, 2023, 5:33 AM

#

the training data was not representative of the testing one

crimson patrol Apr 17, 2023, 10:30 AM

#

Guys can anyone pls tell me how to use a GPU for training deep learning models in tensorflow..I have tried literally everything..but no progress yet ...my laptop has GTX 1650

pseudo tide Apr 17, 2023, 11:05 AM

#

What version of tf are u using and what os are u using?

#

Since 2.11 version, tf dropped support for gpus on native-Windows so that may be the case

charred oyster Apr 17, 2023, 11:50 AM

#

Hello, I created a library to easily create bots and take them to porduction. Still early work but if you need features just shoot: https://github.com/momegas/megabots

GitHub

GitHub - momegas/megabots: 🤖 State-of-the-art, production ready bot...

🤖 State-of-the-art, production ready bots made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵 - GitHub - momegas/megabots: 🤖 State-of-the-art, production read...

pseudo moon Apr 17, 2023, 12:17 PM

#

what does min mean in these GAN loss functions?

tidal bough Apr 17, 2023, 12:32 PM

#

I don't know how GAN loss functions specifically work, but generally this notation would mean "minimal value of 𝓛_{join, adv} that can be achieved by varying D_{join, adv}".

pseudo moon Apr 17, 2023, 12:46 PM

#

I see, but what do you mean by varying?

tidal bough Apr 17, 2023, 12:47 PM

#

Like, consider all possible values of D_{join, adv}, and take the minimum value 𝓛_{join, adv} achieves over them all.

pseudo moon Apr 17, 2023, 12:49 PM

#

ahh i see

hasty mountain Apr 17, 2023, 2:21 PM

#

pseudo moon what does min mean in these GAN loss functions?

It's defining the objective for both the discriminator and generator, right?
Then the Discrimintor objective(D on join, adversarial) is to minimize the loss on join, adversarial samples, where this loss is defined by:
loss(join, adv) = Error(D(join, adv(fake_images)²) + Error(D(join, adv(1 - real_images)²)

Below, it's the objective function for the generator.
loss(join, adv) = Error(1 - D(join, adv(fake_images)²)

#

Then you just have to check what "join" and "adv" really mean

tidal bough Apr 17, 2023, 2:31 PM

#

D(join, adv(fake_images)²)
but in the screenshot join, adv is the subscript of D; D isn't a function of two arguments.

hasty mountain Apr 17, 2023, 2:32 PM

#

Uh... I don't know. That's why I said to check what they really mean.
I was thinking it was something like "joined images" and "adversarial images"

#

It certainly isn't a classic GAN... pithink

pseudo moon Apr 17, 2023, 2:45 PM

#

It’s actually a discriminator with two heads, one being adversarial and the other is feature imitation

It certainly isn’t a classic GAN
well I was wondering if you would know what kind of loss function this is since it looks similar to the standard GAN loss function log D(x) + log(1-D(G(z))) except it’s switched between x (real image) and G(z) (fake image) and instead of log it’s exponent

hasty mountain Apr 17, 2023, 2:48 PM

#

pseudo moon It’s actually a discriminator with two heads, one being adversarial and the othe...

Well, that's the thing...the loss for a GAN tends to be quite messy, so it seems that people tend to simply use E instead of something like Binary Cross Entropy.
Some people use Binary Cross Entropy, some use KL-Divergence, some use WGAN-Loss...

Personally, I recommend simply using a Binary Cross Entropy in a logits version(log softmax in the discriminator), or use a relativistic discriminator.

#

Oh yes...there's the relativistic discriminator, which also changes the loss slightly.

#

And to make things even more chaotic...there's a Google paper that says that...in the end, the loss choice doesn't matter that much yert

pseudo moon Apr 17, 2023, 2:50 PM

#

I see

solar seal Apr 17, 2023, 3:28 PM

#

Hi everyone, we’ve been working for a few month on a Dictionary for MLOps that would cover most of the common terms in the field, give some snippets and examples when appropriate and overall cover the missing data engineering, feature store and main principles we believe MLOps is about, we’d love to get feedbacks, augmentation and suggestions !
https://www.hopsworks.ai/mlops-dictionary

The Big Dictionary of MLOps - Hopsworks

Detailed explanations of every MLOps term you need to know. Get examples of essential MLOps terms to streamline your workflow and enhance collaboration.

violet gull Apr 17, 2023, 3:44 PM

#

wooden sail the training data was not representative of the testing one

I split the training data 80-20…..

#

Also what is that giant spike?

novel python Apr 17, 2023, 5:50 PM

#

hey everyone! So, I have some points of data usage for mobile devices for 12 months, I wanted to make a model to predict the % of chance of it being higher or lower than previous month usage, what would be the best approach to that? I thought about maybe a neural network with softmax layer at the end, but not sure if that's the best solution for that because I don't know how I'd set up the previous layers

crimson patrol Apr 17, 2023, 5:55 PM

#

@pseudo tide yes I am using tf version 2.11+ ..I tried using GPU by installing wsl..but I get an error libdevice not found at libdevice.bc

dire field Apr 17, 2023, 6:13 PM

#

novel python hey everyone! So, I have some points of data usage for mobile devices for 12 mon...

Since it sounds like a time series a RNN or LSTM might be a good architecture to use.

novel python Apr 17, 2023, 6:14 PM

#

dire field Since it sounds like a time series a RNN or LSTM might be a good architecture to...

yea, I used RNN before to try the exact prediction, but I'm still not sure how to set it up to give probabilities instead, I'll check if there's the possibility

dire field Apr 17, 2023, 6:15 PM

#

novel python yea, I used RNN before to try the exact prediction, but I'm still not sure how t...

What deep learning framework are you using?

novel python Apr 17, 2023, 6:15 PM

#

tensorflow

#

but i'm trying to move on to pytorch

dire field Apr 17, 2023, 6:18 PM

#

novel python but i'm trying to move on to pytorch

I'm only familiar with pytorch, but I think you can pass the neural net output directly to BCELoss or BCEWithLogitsLoss (no need to convert them to probabilities).

#

There is likely similar functionality in tensorflow

novel python Apr 17, 2023, 6:21 PM

#

dire field There is likely similar functionality in tensorflow

ok I'll check that. Thanks a lot!

violet gull Apr 17, 2023, 6:23 PM

#

is 300 images of 2 classes each not enough? seems like enough

agile cobalt Apr 17, 2023, 6:32 PM

#

depends,

which model are you using?
from scratch or fine tuning an existing?
how different are these two classes?
it might be enough, but if you are using a model with tens of thousands of parameters I'd expect for it to overfit quite hard

(not expecting an answer to these questions, more for you to think about it ; even if you did answer I don't think that I would have any more specific advice)

sleek harbor Apr 17, 2023, 6:33 PM

#

noob question
Which machine learning models are the most "important" for a newbie to know (to get their first job)?

(polynomial & multiple) linear regression
logistic regression
KNN
decision (regression) trees
random forest
support vector machines
k means clustering

Is that enough knowledge of theory to start working on a portfolio and get a first job, or do I need more theory? What have I missed, what else would you recommend learning as a "must know"

dire field Apr 17, 2023, 6:36 PM

#

sleek harbor *noob question* Which machine learning models are the most "important" for a new...

Linear and logistic regression are definitely the most important to know, but just knowing them won't be enough to land a job. If you are just starting out, I'd recommend Introduction to Statistical Learning. It's a book with a free pdf online and a corresponding lecture series on youtube.

sleek harbor Apr 17, 2023, 6:37 PM

#

dire field Linear and logistic regression are definitely the most important to know, but ju...

from Stanford?

dire field Apr 17, 2023, 6:38 PM

#

sleek harbor from Stanford?

Yep

violet gull Apr 17, 2023, 6:53 PM

#

agile cobalt depends, - which model are you using? - from scratch or fine tuning an existing?...

Cnn
Scratch
Elephant vs dog
The model does not have anywhere near 10000 parameters

pseudo tide Apr 17, 2023, 6:54 PM

#

crimson patrol <@360758013415391232> yes I am using tf version 2.11+ ..I tried using GPU by ins...

I haven't used wsl yet, but if u keep on having problems with it, just switch to version < 2.11, u won't lose much

violet gull Apr 17, 2023, 7:01 PM

#

Idk what I’m suppose to change

#

The dense layer section is very small so it shouldn’t over train

#

Plenty of images

#

And the loss is minimized

violet gull Apr 17, 2023, 7:36 PM

#

How low is the loss suppose to get on training?

#

I have it set to stop after 0.01

agile cobalt Apr 17, 2023, 7:38 PM

#

varies depending on what you are doing - the loss isn't very comparable between different projects
it should never actually reach 0 (even if your accuracy reaches 100%, the loss still shouldn't be exactly 0)

violet gull Apr 17, 2023, 7:38 PM

#

I know

#

But how low

agile cobalt Apr 17, 2023, 7:39 PM

#

the loss isn't very comparable between different projects

serene scaffold Apr 17, 2023, 7:39 PM

#

varies depending on what you are doing

agile cobalt Apr 17, 2023, 7:40 PM

#

iirc usually 'when it stops going down significantly' is a good reference

violet gull Apr 17, 2023, 7:40 PM

#

I can’t figure out why it’s not working so I’m trying to dig deep

agile cobalt Apr 17, 2023, 7:40 PM

#

what is not working?

violet gull Apr 17, 2023, 7:40 PM

#

Testing data

#

52% accuracy

agile cobalt Apr 17, 2023, 7:40 PM

#

probably overfit

serene scaffold Apr 17, 2023, 7:40 PM

#

are you sure you should be using accuracy, and not precision/recall?

#

(and by /, I mean and, not division)

violet gull Apr 17, 2023, 7:41 PM

#

agile cobalt probably overfit

The model is very small

agile cobalt Apr 17, 2023, 7:41 PM

#

how small exactly?

serene scaffold Apr 17, 2023, 7:41 PM

#

very small. are you doing 60 instances per class again?

violet gull Apr 17, 2023, 7:41 PM

#

A cnn part and a dense part with the dense having about 300 paramd

agile cobalt Apr 17, 2023, 7:43 PM

#

I'm not ultra experienced with tuning neural networks, but I wouldn't be surprised if that is in fact overfitting.
maybe try using data augmentation if you aren't using it yet?

#

I'd also check some of the misses to make sure it isn't misslabeled or check for patterns like X dog breed is often missclassified or Y photo angle wasn't present in the training so it gets confused about it
~~not sure how actionable that kind of thing is other than "must collect more data" though~~

violet gull Apr 17, 2023, 7:54 PM

#

With this many images and so few classes I couldn’t imagine it being an issue with the data

#

The images are pretty random and from google

#

Should cover everything

#

Also the fact that testing accuracy was essentially 50 50

hasty mountain Apr 17, 2023, 8:32 PM

#

sleek harbor *noob question* Which machine learning models are the most "important" for a new...

It's funny that most machine learning courses teach all of those models...starting from linear regression, going to KNN, decision trees and then to unsupervised models.

#

I'd also add "Neural Networks" at the end of that list. lemon_hyperpleased

sleek harbor Apr 17, 2023, 8:44 PM

#

hasty mountain It's funny that most machine learning courses teach all of those models...starti...

Why is it funny tho? 0.o

hasty mountain Apr 17, 2023, 8:45 PM

#

Because you basically already know the path

rugged comet Apr 17, 2023, 8:47 PM

#

I have a pandas column that contains strings that look like Python sets. I would like to convert the strings to Python sets in the column.
Here is what I tried

archidekt_df["color identity"] = archidekt_df["color identity"].apply(ast.literal_eval)

but I get the malformed node or string error. I know it's possible to use ast.literal_eval for sets because it says so in the docs. What am I doing wrong?

serene scaffold Apr 17, 2023, 9:05 PM

#

rugged comet I have a pandas column that contains strings that look like Python sets. I would...

what happens if you do .apply(eval) (provided that you know this won't execute malicious code)

rugged comet Apr 17, 2023, 9:07 PM

#

serene scaffold what happens if you do `.apply(eval)` (provided that you know this won't execute...

Oh it turns out I had some empty sets in my rows and ast.literal_eval can't handle that.

serene scaffold Apr 17, 2023, 9:11 PM

#

rugged comet Oh it turns out I had some empty sets in my rows and `ast.literal_eval` can't ha...

tfw {1} is a set and {} is not.

rugged comet Apr 17, 2023, 9:12 PM

#

lol

serene scaffold Apr 17, 2023, 9:12 PM

#

python should become perl and make {:} the expression for an empty dict

#

but then ∅ can be the empty set symbol instead of {}

agile cobalt Apr 17, 2023, 9:13 PM

#

serene scaffold python should become perl and make `{:}` the expression for an empty dict

backwards compatibility though 🤷

serene scaffold Apr 17, 2023, 9:13 PM

#

agile cobalt backwards compatibility though 🤷

no.
python must become perl.

nocturne eagle Apr 17, 2023, 9:14 PM

#

lol

rugged comet Apr 17, 2023, 9:18 PM

#

My next problem is I have rows that looks like

deck_id, ..., {card_id: quantity, card_id: quantity, ...}, ...

I need to turn it into a list of tuples like this

[
(deck_id, card_id, quantity),
(deck_id, card_id, quantity),
...
]

I basically need to expand the dictionary containing card_ids and their quantities.
I'm turning it into a list of tuples so that I can insert it into a MySQL database.
My proposed solution was to iterate over the rows of the dataframe and extract the information I need. However, the internet says that you generally shouldn't iterate over a dataframe like this. What would be the correct way to do this?

agile cobalt Apr 17, 2023, 10:18 PM

#

rugged comet My next problem is I have rows that looks like ``` deck_id, ..., {card_id: quan...

if you have a dictionary inside of a dataframe cell, you're already not complying with what you "generally should" do.

take a look at https://stackoverflow.com/questions/67336514/pandas-explode-dictionary-to-rows

Stack Overflow

Pandas explode dictionary to rows

I have a dataframe:
Name Sub_Marks
0 Tom {'Maths': 30, 'English': 40, 'Science': 35}
1 Harry {'Maths': 35, 'English': 30, 'Science': 25}
2 Nick {'Mat...

violet gull Apr 17, 2023, 11:30 PM

#

is 30x30 too small of image size for aminals?

#

my eyes can still identify a very small aminal

#

also i was wrong about my number of trainable parameters

#

i said a couple hundred

#

its actually 3968

#

2 small convolution layers

rugged comet Apr 17, 2023, 11:40 PM

#

agile cobalt if you have a dictionary inside of a dataframe cell, you're already not complyin...

I know I shouldn't have multiple values in one column. That's just how the data was collected. I'm making it normalized in the database though.

#

But anyway. That looks like what I want.

m = pd.DataFrame([*df['Sub_Marks']], df.index).stack()\
      .rename_axis([None,'Subject']).reset_index(1, name='Marks')

out = df[['Name']].join(m)

Could you please explain the parts of this to me. This chained expression is hard to follow.

#

Actually this is what I wanted

[(n, k, v) for (n, d) in df.values for k, v in d.items()]

#

Thanks for the help.

hasty mountain Apr 18, 2023, 1:50 AM

#

Is there an explanation to why a language model would be producing always the same output?

My Transformer tends to always generate spaces ' ' after some training.
Then, I've tried to innovate and make a Text GAN...same result.
I'm now thinking about going for a classic LSTM model...but it seems that the same result is a possibility.
Any hint?

I mean...always ' '? It doesn't generate always the same token, it always converge to generating always space tokens.

violet gull Apr 18, 2023, 3:07 AM

#

dropout layers are OP

#

this is the first time my model is working

agile cobalt Apr 18, 2023, 3:43 AM

#

you had no regularisation before?

#

no wonders it was overfitting

violet gull Apr 18, 2023, 3:55 AM

#

agile cobalt you had no regularisation before?

what mean

agile cobalt Apr 18, 2023, 4:10 AM

#

did you not see that term wherever you read about dropout layers?
https://www.geeksforgeeks.org/regularization-in-machine-learning/

GeeksforGeeks

Regularization in Machine Learning - GeeksforGeeks

A Computer Science portal for geeks. It contains well written, well thought and well explained computer science and programming articles, quizzes and practice/competitive programming/company interview Questions.

violet gull Apr 18, 2023, 4:12 AM

#

agile cobalt did you not see that term wherever you read about dropout layers? https://www.ge...

no, chat gpt did not mention the word regulariszation

agile cobalt Apr 18, 2023, 4:12 AM

#

x-x

violet gull Apr 18, 2023, 4:12 AM

#

anyways

agile cobalt Apr 18, 2023, 4:12 AM

#

did you just add it without looking up what exactly they are | they do?

violet gull Apr 18, 2023, 4:13 AM

#

agile cobalt did you just add it without looking up what exactly they are | they do?

wasnt a complex concept

#

copy pasted my RELU layer with a few modifications

#

anyways

#

my model converged on 80% accuracy unseen data

#

i added a dropout after both denses and increased the number of dense paramaters

#

so now they should not be overfitting in any way

red moon Apr 18, 2023, 8:54 AM

#

im working on a project with chatgpt api and autogpt, please dm if ur interested in help (pair or unpaid)

hasty mountain Apr 18, 2023, 12:05 PM

#

violet gull no, chat gpt did not mention the word regulariszation

Bruh...

#

Like I said...if I were to follow what ChatGPT says without trying to double-check it, I'd be struggling for trying to train a GPT to act like a BERT model

#

||I'm struggling to make my vanilla Transformer work, but still...||

cold osprey Apr 18, 2023, 12:19 PM

#

Chatgpt lel

#

If ure learning something for the first time, I wouldn't use chatgpt at all

#

Maybe to explain concepts and terminologies only

solemn quest Apr 18, 2023, 12:25 PM

#

ChatGPT is a great assistent, but it's better to dig the fundamentals by your own

versed gulch Apr 18, 2023, 12:25 PM

#

Hi, how would I filter out those tuples that contain 0, (i don't really care whether the 0 is in the first index or second index in the tuple)


holes_arr_copy = holes_arr.copy()

black_pxs = np.where(holes_arr == 0)
coords = [*zip(*black_pxs)]
coords

out: 
[(0, 0),
 (0, 1),
 (0, 2),
 (0, 3),
 (0, 4),
 (0, 5),
 (0, 6),
 (0, 7),
 (0, 8),
 (0, 9),..]

sleek harbor Apr 18, 2023, 12:33 PM

#

I find random forests fascinating.. except for the random part 😶

The way I see it, is that since samples and features are chosen randomly, there is a chance, small as it may seem, that at the end of it all, some samples and some features might end up not being used at all, which can lead to skewed and biased final results.

Is there a way to make sure that at the end, throughout the whole forest, all samples and all features would end up being used in trees on average the same amount of times, ensuring that no sample/feature would be left out? #lonely 👉👈

Separate question. When tuning hyperparameters for a random forest with a grid search using kfold cross validation with shuffle enabled, would it make sense to disable bootstrapping entirely (purely for performance reasons, to save some time), since each forest will be getting its own, slightly different dataset as is (and that kinda makes sense¿)? And then enabling bootstrapping when training the actual model with the chosen hyperparameters? Or will having the trees of the forest use the same dataset affect the choice of hyperparameters, meaning that random forests should be tuned with bootstrapping enabled?

gloomy anvil Apr 18, 2023, 1:03 PM

#

Hello y'all, I need a good source for ARIMA and SARIMAX models that I can quote and that ideally displays equations with consistent variables. Any suggestions?

queen cradle Apr 18, 2023, 1:07 PM

#

versed gulch Hi, how would I filter out those tuples that contain 0, (i don't really care whe...

I'm not quite sure what you want, but I think it might be something like:

import numpy as np
arr = np.arange(8).reshape(4, 2)
arr[np.all(arr != 0, axis=1)]

versed gulch Apr 18, 2023, 1:13 PM

#

queen cradle I'm not quite sure what you want, but I think it might be something like: ```py ...

just to get all those coordinates that dont have 0 in them in the coords variable

queen cradle Apr 18, 2023, 1:14 PM

#

sleek harbor I find random forests fascinating.. except for the random part 😶 The way I see...

There is never any guarantee that a feature will be used at all. This is not necessarily a bad thing. For example, if you have two features that are perfectly correlated, then you can get away with just one of them. If the features actually provide distinct information, though, and you construct enough trees, then some tree will use them. The key assumption is that you construct enough trees. If you have three features and you construct three trees, that's very likely not enough.

I know some people have studied non-random methods of constructing forests. My recollection is that there are trade-offs. I don't know if anyone has asked the question you're asking, though.

I'm not sure I understand your second question. But shuffling (however and whenever it's performed) is a different operation from bootstrapping: In bootstrapping, you're allowed to resample the same data point multiple times. Being able to is actually important. So I would be wary of trying to replace a bootstrap by a shuffle operation.

mint palm Apr 18, 2023, 1:14 PM

#

--------d -----------|
|----------------e
--------c -----------|----|
| |
--------b -----------| | ----------f
|
--------a --------------- |

if i am trying to learn similarity between e and f, does it make sense to put loss between e and f and also c and e(representing f in way)?
its not irrelevant to do that for my use case.

cold osprey Apr 18, 2023, 1:15 PM

#

mint palm --------d -----------| |----------------e...

what is this

mint palm Apr 18, 2023, 1:15 PM

#

a representative ppipeline

queen cradle Apr 18, 2023, 1:16 PM

#

versed gulch just to get all those coordinates that dont have 0 in them in the coords variabl...

All the entries of holes_arr that don't have a zero in them? Or all the coordinates that don't have a zero? Or something else?

versed gulch Apr 18, 2023, 1:18 PM

#

queen cradle All the entries of `holes_arr` that don't have a zero in them? Or all the coordi...

im talking about the py coords variable which tells me the coordinates in the py holes_arr where it is 0

cold osprey Apr 18, 2023, 1:20 PM

#

think hes trying to filter coords

queen cradle Apr 18, 2023, 1:21 PM

#

Like

coords = np.array(coords)
coords = coords[np.all(coords !=0, axis=1)]

maybe?

#

But in that case, I would rather filter black_pxs first.

cold osprey Apr 18, 2023, 1:22 PM

#

maybe u need to explain black_pxs and holes_arr

#

we dont know what they are so

cold osprey Apr 18, 2023, 1:25 PM

#

queen cradle Like ```py coords = np.array(coords) coords = coords[np.all(coords !=0, axis=1)]...

this works for sure but seems like u think there could be a more efficient way to do it from then holes_arr or black_pxs var

queen cradle Apr 18, 2023, 1:25 PM

#

I'm still not quite sure what he's asking. I suspect there is a faster way, but I can't tell yet.

cold osprey Apr 18, 2023, 1:26 PM

#

oh i tot OP was the one who sent that msg KEKW

queen cradle Apr 18, 2023, 1:30 PM

#

I guess I'm going to comment that he should probably be using np.nonzero instead of np.where, hope he notices, and leave him to figure the rest out.

errant bison Apr 18, 2023, 3:26 PM

#

heyy. So i am trying to make a model which allows a user to capture an image of his room through camera. And then click on the wall to paint it and it detects the wall and color it. So for this which ai algorithm or opencv modules can i use?

cold osprey Apr 18, 2023, 3:29 PM

#

errant bison heyy. So i am trying to make a model which allows a user to capture an image of ...

https://gprivate.com/64mpq

versed gulch Apr 18, 2023, 3:30 PM

#

queen cradle Like ```py coords = np.array(coords) coords = coords[np.all(coords !=0, axis=1)]...

yes I think that should do it

errant bison Apr 18, 2023, 3:35 PM

#

cold osprey https://gprivate.com/64mpq

hehe this was cute

lapis sequoia Apr 18, 2023, 3:59 PM

#

https://replit.com/@TheStrange-007/DigitsRecognizer

replit

TheStrange-007

DigitsRecognizer

If the replit webview doesn't work just copy and paste the URL on your browser.

cold osprey Apr 18, 2023, 4:01 PM

#

i mean.......

#

lapis sequoia Apr 18, 2023, 4:23 PM

#

it had 94% accuracy

#

so ye it's not perfect

violet gull Apr 18, 2023, 4:33 PM

#

How I make my model go from 80% accuracy to 95%

#

Image size 90 and 600 images of 2 classes each

cold osprey Apr 18, 2023, 4:40 PM

#

lapis sequoia it had 94% accuracy

on test set?

cold osprey Apr 18, 2023, 5:00 PM

#

also i tried multiple drawn 0s and 9s and still got misclassifications

#

so not just cherry picking

serene scaffold Apr 18, 2023, 5:01 PM

#

violet gull How I make my model go from 80% accuracy to 95%

if you ever have a question about how to improve your model, you need to say at the very least what kind of model it is, what it does, and what all the hyperparameters are. Otherwise, you are wasting everyone's time.

#

Image size 90 and 600 images of 2 classes each
so there are 600 images, and every image belongs to two classes? what are all the classes?

violet gull Apr 18, 2023, 5:02 PM

#

serene scaffold > Image size 90 and 600 images of 2 classes each so there are 600 images, and ev...

1200 images 600 dogs 600 elephants

serene scaffold Apr 18, 2023, 5:03 PM

#

violet gull 1200 images 600 dogs 600 elephants

then instead of "600 images of 2 classes each", you would want to say "2 classes with 600 images each". What you said means something else.

violet gull Apr 18, 2023, 5:03 PM

#

Oh

serene scaffold Apr 18, 2023, 5:04 PM

#

how long does it take to train your model currently?

violet gull Apr 18, 2023, 5:04 PM

#

serene scaffold how long does it take to train your model currently?

Time or num epochs

serene scaffold Apr 18, 2023, 5:04 PM

#

violet gull Time or num epochs

time, for the number of epochs you are currently doing.

violet gull Apr 18, 2023, 5:05 PM

#

About 4000 ms

#

Per epoch

serene scaffold Apr 18, 2023, 5:05 PM

#

okay... I'm asking how long it takes total.

violet gull Apr 18, 2023, 5:05 PM

#

Uh

#

I run it until loss < 0.01

#

Last time it took 2500 epochs

serene scaffold Apr 18, 2023, 5:06 PM

#

Please, in your next message, just say how long it takes to train it from start to finish, in minutes.

violet gull Apr 18, 2023, 5:07 PM

#

10000 minutes

#

That can’t be right

serene scaffold Apr 18, 2023, 5:07 PM

#

that would be 166.6 hours.

violet gull Apr 18, 2023, 5:07 PM

#

Hmmm

serene scaffold Apr 18, 2023, 5:08 PM

#

but where I'm going with this is that if it's relatively quick to train a model (less than 20 minutes), you can basically just mess with the hyperparameters and see how that affects the results.

violet gull Apr 18, 2023, 5:09 PM

#

serene scaffold but where I'm going with this is that if it's relatively quick to train a model ...

It’s not 20 mins

#

It’s hours

serene scaffold Apr 18, 2023, 5:10 PM

#

are you using a GPU?

violet gull Apr 18, 2023, 5:10 PM

#

No

#

Gpu interfacing is similar to being water boarded

serene scaffold Apr 18, 2023, 5:10 PM

#

you're using pytorch, right? it's not hard.

#

provided that you have a GPU

violet gull Apr 18, 2023, 5:11 PM

#

Not PyTorch

#

Raw java

serene scaffold Apr 18, 2023, 5:11 PM

#

why are you using java

violet gull Apr 18, 2023, 5:12 PM

#

I like java more than python

serene scaffold Apr 18, 2023, 5:12 PM

#

what is your goal for learning all this, anyway?

violet gull Apr 18, 2023, 5:13 PM

#

Ai is cool

cold osprey Apr 18, 2023, 5:13 PM

#

LEL

violet gull Apr 18, 2023, 5:13 PM

#

Java doesn’t play nicely with gpu

cold osprey Apr 18, 2023, 5:14 PM

#

then dont use java?

violet gull Apr 18, 2023, 5:14 PM

#

It’s already written in java and re writing to rust will take forever

cold osprey Apr 18, 2023, 5:14 PM

#

its like saying 'my car doesnt run well when its flooding'

#

where did rust come from now

serene scaffold Apr 18, 2023, 5:15 PM

#

that's probably the other language they like.

violet gull Apr 18, 2023, 5:15 PM

#

Rust is fast and modern

cold osprey Apr 18, 2023, 5:15 PM

#

LEL

serene scaffold Apr 18, 2023, 5:15 PM

#

violet gull Rust is fast and modern

does it have autograd on a GPU though

cold osprey Apr 18, 2023, 5:15 PM

#

i guess u can try writing pytorch with rust then

#

rust > c++

violet gull Apr 18, 2023, 5:15 PM

#

serene scaffold does it have autograd on a GPU though

I’m not using auto grad

agile cobalt Apr 18, 2023, 5:16 PM

#

violet gull I’m not using auto grad

(which is part of why it's taking so long)

violet gull Apr 18, 2023, 5:16 PM

#

No it’s taking so long because I have a really high drop rate

cold osprey Apr 18, 2023, 5:16 PM

#

agile cobalt (which is part of why it's taking so long)

but he likes the language

serene scaffold Apr 18, 2023, 5:17 PM

#

violet gull No it’s taking so long because I have a really high drop rate

Doubt

violet gull Apr 18, 2023, 5:17 PM

#

0.4 on both dense layers

agile cobalt Apr 18, 2023, 5:17 PM

#

we don't mean just the number of epochs, but also the 4000ms per epoch

violet gull Apr 18, 2023, 5:18 PM

#

It’s not on the gpu

#

And convolutions are expensive

agile cobalt Apr 18, 2023, 5:18 PM

#

exactly, which means that you should try and make it run on a gpu

violet gull Apr 18, 2023, 5:19 PM

#

That involves writing C

#

I don’t like C

agile cobalt Apr 18, 2023, 5:19 PM

#

or using an existing library that does the hard work for you

violet gull Apr 18, 2023, 5:19 PM

#

I’m categorically against libraries

#

That’s why I did everything by hand in the first place

agile cobalt Apr 18, 2023, 5:20 PM

#

not sure how to put it nicely but that's a terrible idea

cold osprey Apr 18, 2023, 5:20 PM

#

i see

serene scaffold Apr 18, 2023, 5:20 PM

#

what's more important here is that if you're going to ask for help in this channel, you should have things set up in such a way that you can action suggestions that are given to you in a reasonable amount of time. And you won't be able to do that if you're doing everything in pure Java.

agile cobalt Apr 18, 2023, 5:20 PM

#

well, doing it once for learning might be good, but if you want actual results, there's no good justification to do it all by hand

violet gull Apr 18, 2023, 5:20 PM

#

So is there no way to remove the guessing?

agile cobalt Apr 18, 2023, 5:20 PM

#

no

serene scaffold Apr 18, 2023, 5:21 PM

#

"guess and check" isn't inherently bad.

violet gull Apr 18, 2023, 5:21 PM

#

And I can’t make it educatedly guess itself?

violet gull Apr 18, 2023, 5:21 PM

#

serene scaffold "guess and check" isn't inherently bad.

It is when it’s blind guessing and it takes 8 hours between guesses

cold osprey Apr 18, 2023, 5:21 PM

#

violet gull And I can’t make it educatedly guess itself?

grid search cv

serene scaffold Apr 18, 2023, 5:21 PM

#

violet gull It is when it’s blind guessing and it takes 8 hours between guesses

that's why we're telling you to stop using Java.

cold osprey Apr 18, 2023, 5:21 PM

#

then narrow down

violet gull Apr 18, 2023, 5:21 PM

#

Rust?

serene scaffold Apr 18, 2023, 5:21 PM

#

violet gull Rust?

if it doesn't run on a GPU, then no.

violet gull Apr 18, 2023, 5:21 PM

#

I think it does

#

It’s machine level

serene scaffold Apr 18, 2023, 5:22 PM

#

but is it machine level on a GPU?

violet gull Apr 18, 2023, 5:22 PM

#

The only reason java doesn’t is it is designed to run on any hardware by creating a cpu level virtual machine

agile cobalt Apr 18, 2023, 5:22 PM

#

by the way, I think that pytorch actually has Java support - or at least it lists "C++ / Java" on the homepage download tab, I haven't really looked into it
cannot say that I recommend it, but might be a reasonable compromise

violet gull Apr 18, 2023, 5:23 PM

#

Installing stuff is hard

cold osprey Apr 18, 2023, 5:23 PM

#

question

violet gull Apr 18, 2023, 5:23 PM

#

Most stuff isn’t a 1 line import like it is in python

cold osprey Apr 18, 2023, 5:24 PM

#

did u write the os ure using discord on by hand?

violet gull Apr 18, 2023, 5:24 PM

#

No

cold osprey Apr 18, 2023, 5:24 PM

#

i see

violet gull Apr 18, 2023, 5:24 PM

#

But it was easy to install

serene scaffold Apr 18, 2023, 5:24 PM

#

agile cobalt by the way, I think that pytorch actually has Java support - or at least it list...

I thought those were mainly for deploying trained models, but I'm not sure.

tidal bough Apr 18, 2023, 5:24 PM

#

looking at the javadoc, I think the Java version might only support deployment?
https://pytorch.org/javadoc/1.9.0/

#

it's kinda barren

violet gull Apr 18, 2023, 5:24 PM

#

There is java libraries for gpu but idk how to install

serene scaffold Apr 18, 2023, 5:25 PM

#

violet gull There is java libraries for gpu but idk how to install

you can use the 8 hours while your model is training to learn bing_shrug

violet gull Apr 18, 2023, 5:25 PM

#

I just hate it so much

tidal bough Apr 18, 2023, 5:25 PM

#

https://github.com/pytorch/java-demo/blob/master/src/main/java/demo/App.java#L11-L20

arctic wedgeBOT Apr 18, 2023, 5:25 PM

#

src/main/java/demo/App.java lines 11 to 20

Module mod = Module.load("demo-model.pt1");
Tensor data =
    Tensor.fromBlob(
        new int[] {1, 2, 3, 4, 5, 6}, // data
        new long[] {2, 3} // shape
        );
IValue result = mod.forward(IValue.from(data), IValue.from(3.0));
Tensor output = result.toTensor();
System.out.println("shape: " + Arrays.toString(output.shape()));
System.out.println("data: " + Arrays.toString(output.getDataAsFloatArray()));```

tidal bough Apr 18, 2023, 5:25 PM

#

example of deployment

serene scaffold Apr 18, 2023, 5:25 PM

#

ahh! make the bad language go away!

agile cobalt Apr 18, 2023, 5:25 PM

#

tidal bough https://github.com/pytorch/java-demo/blob/master/src/main/java/demo/App.java#L11...

also archived repo pithink

tidal bough Apr 18, 2023, 5:25 PM

#

yup

violet gull Apr 18, 2023, 5:25 PM

#

Before I go to gpu which I am willing to attempt

tidal bough Apr 18, 2023, 5:25 PM

#

personally I've dabbled a bit with libtorch from Rust

violet gull Apr 18, 2023, 5:26 PM

#

Is there anything obviously wrong with my current setup?

#

Like my data or model

serene scaffold Apr 18, 2023, 5:26 PM

#

violet gull Is there anything obviously wrong with my current setup?

yes, it's in Java.

violet gull Apr 18, 2023, 5:26 PM

#

serene scaffold yes, it's in Java.

I like my defined variables and my semi colons and my curly brackets

agile cobalt Apr 18, 2023, 5:26 PM

#

the image size might be too small, other than that idk

violet gull Apr 18, 2023, 5:26 PM

#

agile cobalt the image size might be too small, other than that idk

90 is too small?

agile cobalt Apr 18, 2023, 5:27 PM

#

90x90? or 90 pixels total?

violet gull Apr 18, 2023, 5:27 PM

#

90x90

agile cobalt Apr 18, 2023, 5:27 PM

#

probably™️ passable then

cold osprey Apr 18, 2023, 5:27 PM

#

could be small for the task

violet gull Apr 18, 2023, 5:27 PM

#

LeNet uses 28x28

cold osprey Apr 18, 2023, 5:27 PM

#

dogs n elephants

#

😮

agile cobalt Apr 18, 2023, 5:27 PM

#

(as long as they were resized in a reasonable way)

tidal bough Apr 18, 2023, 5:27 PM

#

the fact it's in Java in theory isn't damning, Java is surprisingly fast for a not-really-compiled language; usually only a few times slower than Rust/C/whatever

#

the real problem is no GPU support - that, in any language, is a difference of 10x or more in training times

#

which i have no idea how to do in Java tbh, probably possible though

cold osprey Apr 18, 2023, 5:30 PM

#

https://github.com/Saratii/Saratoga-MK3

#

thats why

#

i admire the effort, but i still ask why

violet gull Apr 18, 2023, 5:31 PM

#

cold osprey i admire the effort, but i still ask why

Why not

cold osprey Apr 18, 2023, 5:31 PM

#

coz i value my time

#

we all value our time differently

violet gull Apr 18, 2023, 5:32 PM

#

This is the project I have learned the most on

#

It wasted wasted time

tidal bough Apr 18, 2023, 5:32 PM

#

huh, that's a lot of work

#

where's the SIMD though

violet gull Apr 18, 2023, 5:32 PM

#

Java doesn’t have simd

tidal bough Apr 18, 2023, 5:33 PM

#

probably should remove it from the readme then

violet gull Apr 18, 2023, 5:33 PM

#

Nobody is going to check

#

I’m submitting it as a final project with the goal of embarrassing everyone else in my entry to comp sci class

tidal bough Apr 18, 2023, 5:33 PM

#

as you can see, I did immediately

#

and if someone looking at it actually knows java (and that it doesn't have simd), they won't just start searching, they'll immediately go "wait, how"

#

I'd probably at least use a linalg library if I decided to write an NN from scratch. Linalg libraries like BLAS are usually decades-old hyperoptimized Fortran with chunks of inline assembly, so even a C custom implementation isn't going to compare, much less a Java one. (And I don't consider it fun to implement matmul.)

violet gull Apr 18, 2023, 5:36 PM

#

Ok I’ll remove that

cold osprey Apr 18, 2023, 5:37 PM

#

kek

violet gull Apr 18, 2023, 5:38 PM

#

I’m not using imports

#

Is it not more impressive to have everything by hand

cold osprey Apr 18, 2023, 5:38 PM

#

depends how u look at it

#

if it was a model that a business wanted to use for deployment, this is not impressive at all

violet gull Apr 18, 2023, 5:38 PM

#

So there is nothing wrong with my data size correct?

cold osprey Apr 18, 2023, 5:39 PM

#

its ambitious at best, idiotic at worst

#

and im being nice 🙂

violet gull Apr 18, 2023, 5:39 PM

#

Can I import vector libraries and do gpu?

#

Or is it one or the other

#

If I switch out my custom Matrix every single line needs to be re written

#

I’ll check it out

#

I’m mainly just here to make sure my model is conceptually correct

#

I can fix performance later

#

Does reinforcement learning use a neural network or is it completely different?

cold osprey Apr 18, 2023, 5:53 PM

#

KEKW

tidal bough Apr 18, 2023, 5:53 PM

#

reinforcement learning is a subset of ML, not necessarily DL, so it might not involve an NN at all.

agile cobalt Apr 18, 2023, 5:53 PM

#

it can use neural networks, but does not necessarily have to, much like regression / classification

tidal bough Apr 18, 2023, 5:53 PM

#

but there does exist deep RL, yes.

violet gull Apr 18, 2023, 5:54 PM

#

So if I was to put a model into an ant in a simulation

#

To give an ant a brain

#

What that use

small kraken Apr 18, 2023, 5:55 PM

#

can someone explain please, how AI are written

#

I mean what language devs use etc

violet gull Apr 18, 2023, 5:55 PM

#

small kraken I mean what language devs use etc

Python

#

PyTorch

#

Tensowflow

#

Jupyter notebook

#

All the tools you need are in numpy + PyTorch and maybe openCV

#

Depending on what type of AI you want ofc, very broad term

raw compass Apr 18, 2023, 7:00 PM

#

I don't understand one thing when I calculate the loss(average negative loglikelihood ), why do I have to use the log function, I mean I know what it does but don't understand why we need this during the loss.

log_likelihood += torch.log(P[ix1][ix2])

tidal bough Apr 18, 2023, 7:02 PM

#

i mean... that's why it's called the log likelihood

#

as for why not use the product of probabilities instead - one reason that comes to mind is that it can be so small or high as to not be representable as a float, which probably won't happen with the log likelihood.

raw compass Apr 18, 2023, 7:03 PM

#

tidal bough as for why not use the product of probabilities instead - one reason that comes ...

so the log likelihood is needed because of the constant "e" -2.71? so is it like a normalization?

tidal bough Apr 18, 2023, 7:05 PM

#

I'm saying that if you use normal likelihood instead, the product of all the output probabilities, it can easily be unrepresentable. say, if they're all 0.5 and there's 10000 of them, that's a product of (1/2)^10000. That's exactly 0 as far as floats are concerned (it's around 5*10^-3011 but floats don't go that low).

#

whereas log likelihood represents that easily - log(1/2)*10000 is only ≈ -6931.5.

raw compass Apr 18, 2023, 7:07 PM

#

hmm okay got it

true narwhal Apr 18, 2023, 7:17 PM

#

I'm trying to set up object detection but when I export from CVAT to a TFrecord it warns me that it exceeds 10% of system memory before saying killed and when I export it to COCO and then convert it using the create_coco_tf_record.py script it gives me an error like "indices[0] not in [0,0]" I'm not sure if its a CVAT problem, a tensorflow problem or a config problem nothing seems to get me any closer to an answer it seems like it should work and the only idea I can think of is reinstalling linux. If anyone knows what might be the problem it would really help I've been stuck on this for a while

errant bison Apr 18, 2023, 7:53 PM

#

how to convert white lines to transparent

keen gust Apr 18, 2023, 8:16 PM

#

Having an issue with pandas loc. It returns an empty df when trying to filter for a column value and I can't see any obvious issues. I've used df.columns to make sure I was writing them as they are but it won't return anything for this specific column. Any ideas?

errant bison Apr 18, 2023, 8:41 PM

#

does anyone know opencv?

serene scaffold Apr 18, 2023, 8:43 PM

#

keen gust Having an issue with pandas loc. It returns an empty df when trying to filter fo...

Hello, there's not enough information here to start answering your question. please do print(df.head().to_dict()) and put the result in the pastebin, and show the code that is not working in this chat.

#

!paste

arctic wedgeBOT Apr 18, 2023, 8:43 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

keen gust Apr 18, 2023, 8:45 PM

#

serene scaffold Hello, there's not enough information here to start answering your question. ple...

no worries, figured it out. My strings were wrapped in " " ...lol. Didn't catch that at first

#

but thank you

errant bison Apr 18, 2023, 8:47 PM

#

How could i remove this white coloured edges?

#

plss help

iron basalt Apr 18, 2023, 9:07 PM

#

violet gull So if I was to put a model into an ant in a simulation

I'm not sure if individual ants can be trained with reinforcement learning. As a whole they can be trained. Most ant simulations have individual ants be very simple.

#

Maybe a tiny amount.

lapis sequoia Apr 18, 2023, 9:49 PM

#

Is viewing jupyter notebooks on github broken for anyone else on their phone, theyre clipped off like I can only see the left half

naive radish Apr 19, 2023, 12:21 AM

#

Has anyone ideas how one could type hint DataFrame contents? https://stackoverflow.com/questions/76038966/type-hinting-pandas-dataframe-content-and-columns

Stack Overflow

Type hinting Pandas DataFrame content and columns

I am writing a function that returns a Pandas DataFrame object. I would like to have some kind of a type hinting what columns this DataFrame contains, outside mere specification in the documentatio...

serene scaffold Apr 19, 2023, 1:10 AM

#

naive radish Has anyone ideas how one could type hint `DataFrame` contents? https://stackover...

there isn't an agreed upon way. you could do things like s: 'pd.Series[str]', where the type annotation is a string that's formatted like a 3.9+ style type hint.

indigo turret Apr 19, 2023, 1:56 AM

#

anyone have any thoughts on this? https://stackoverflow.com/questions/76049775/matplotlib-pixel-grid-not-aligning-exactly-to-pixel

Stack Overflow

Matplotlib pixel grid not aligning exactly to pixel

I'm using matplotlib to generate a pixel grid over an image like so:
from PIL import Image
import matplotlib.pyplot as plt

size = 20
im = Image.open("images/sunflower.jpg") # create PIL ...

#

Not super familiar with matplotlib but I can't find out why this is happening

onyx abyss Apr 19, 2023, 2:20 AM

#

Hi guys in the context of my master thesis i work on such data images for a classication ai algorithm any one knows where i can find a dataset contain these images "Ultrasonic Cscan images"

queen cradle Apr 19, 2023, 2:32 AM

#

indigo turret Not super familiar with matplotlib but I can't find out why this is happening

Oh man, I've looked into things like this before, and you may be in for a bad time.

indigo turret Apr 19, 2023, 2:33 AM

#

it's not too big of a deal if i can't get it to work, i'll just end up using a thicker line width

queen cradle Apr 19, 2023, 2:33 AM

#

The first thing to understand about matplotlib is that it does not use pixels. Never, anywhere, until the very very very end.

#

Any time you think you are drawing pixels, you are wrong.

#

What it's actually doing is drawing monochrome squares.

indigo turret Apr 19, 2023, 2:34 AM

#

yeah i understand that

queen cradle Apr 19, 2023, 2:34 AM

#

Your im is an image. imshow is supposed to display it. It takes each pixel of im and creates a little square whose color is the pixel color.

#

The first thing you have to do is make sure that the edges of the square are where you think they are.

#

IIRC, by default, matplotlib centers the pixels. I.e., (0, 0) in canvas space is the center of the (0, 0) pixel, not the corner.

#

If you want to add grid lines between the pixels, you will have to find the pixels edges.

#

You can do this by comparing the dimensions of the image to its dimensions on the canvas. Once you figure out the size of a pixel, you use a half-pixel offset.

#

That will get things very close.

indigo turret Apr 19, 2023, 2:37 AM

#

i see

queen cradle Apr 19, 2023, 2:37 AM

#

If you output with enough resolution then you probably won't be able to see problems.

#

But they're there.

#

The other thing that obstructs you is that canvas space is not made of pixels. I said earlier that matplotlib only uses pixels at the very end. Until the very end, it's working in canvas space, which is continuous.

#

To convert to pixels, matplotlib has to rasterize somehow. This can introduce subtle one pixel errors.

#

For example, suppose you have a checkerboard pattern with alternating swatches of color each one pixel wide and tall. It is nearly impossible to display this correctly with matplotlib.

#

In order for that to work, you have to get lucky when matplotlib rasterizes the image.

indigo turret Apr 19, 2023, 2:39 AM

#

fortunately i don't need to be super exact but it's just slightly annoying lol

queen cradle Apr 19, 2023, 2:40 AM

#

If you don't get lucky, then at some point, it will round the wrong way, and you will either skip a row or column or see the same row or column repeated.

#

There is a low-level matplotlib command which inserts a picture at the very end. This command is pixel-exact, but because it doesn't work in canvas space, it's very difficult to use correctly.

#

So your best bet is to rasterize at a higher resolution than you actually need to get exact.

indigo turret Apr 19, 2023, 2:41 AM

#

how would I go about doing that?

queen cradle Apr 19, 2023, 2:41 AM

#

Honestly, I don't know.

#

There's a lot about matplotlib that I find mysterious.

#

Usually it works correctly for me. When it doesn't, I have a very hard time figuring out what's wrong.

#

Picture sizes are one of those things that I don't know how to control.

sharp crypt Apr 19, 2023, 3:53 AM

#

how are activations determined for the first hidden layer in a neural network? are they based on weights and biases? if so, how are those determined?

indigo turret Apr 19, 2023, 5:05 AM

#

queen cradle Honestly, I don't know.

wanted to let you know i figured it out, someone on stack overflow did at least

#

it's fixed by setting snap to false in the grid function

queen cradle Apr 19, 2023, 5:06 AM

#

Wow, that's a detail I didn't know about.

#

matplotlib is full of surprises.

indigo turret Apr 19, 2023, 5:08 AM

#

according to the docs:
Snapping aligns positions with the pixel grid, which results in clearer images. For example, if a black line of 1px width was defined at a position in between two pixels, the resulting image would contain the interpolated value of that line in the pixel grid, which would be a grey value on both adjacent pixel positions. In contrast, snapping will move the line to the nearest integer pixel value, so that the resulting image will really contain a 1px wide black line.

#

that's pretty annoying considering i'd have to dig thorugh the matplotlib docs to find out what was going on, odd that's default behavior

iron basalt Apr 19, 2023, 5:12 AM

#

indigo turret that's pretty annoying considering i'd have to dig thorugh the matplotlib docs t...

I'm not sure if this even counts as a hot take, but Matplotlib is terribly designed.

indigo turret Apr 19, 2023, 5:13 AM

#

yeah i try not to work with it but unfortunately it's way too integrated with existing python libraries that not using it hurts even more

queen cradle Apr 19, 2023, 5:14 AM

#

Yeah, I think matplotlib's situation is rather unfortunate. It's well established, but it has a lot of baggage.

iron basalt Apr 19, 2023, 5:14 AM

#

If you try doing anything interesting with it you end up in weird territory like modifying private members (in some animation the standard method is modifying an underscore/private variable).

#

It's also REALLY slow (multiple orders of magnitude).

true scaffold Apr 19, 2023, 6:10 AM

#

Hi all, can anyone help me regarding an issue in deployment of flask nlp app on ec2 instance? Actually this is my first time deploying an app on ec2, and i've followed several tuts, and it does works, but as you know in NLP we use heavy models like BERT having size around 1.5GB, so the endpoint let's say get_predictions takes a lot of time (as my ec2 instance is a t2.large not GPU enabled), so after like 5 minutes or less, the ssh connection disconnects throwing a Broken Pipe error and in postman i get 502 Bad Gateway error in response.

Right now i'm running the server using gunicorn with the following command from my main project directory:

In my main project dir:

/home/ubuntu/project/mlenv/bin/gunicorn -b localhost:8000 app:app --timeout 600

Now when i hit my endpoint using postman, it takes around 4-5 minutes, then the ssh loses its connection due to broken pipe error

Also, i have tested it on my local machine on dev server, it works, but as my machine is a m1 air, it runs on CPU and it takes around 5-6 minutes to give predictions.

Any help would be really appreciated!

cold osprey Apr 19, 2023, 6:32 AM

#

its basically timing out. Not sure if theres a way to change the max timeout duration

#

u can get a better ec2 instance (with gpu)

#

or use a smaller model

pseudo moon Apr 19, 2023, 7:17 AM

#

Could anyone explain to me what the use of concatenating layers is?

wooden sail Apr 19, 2023, 7:29 AM

#

wdym by concatenating layers? you mean using more than one?

pseudo moon Apr 19, 2023, 7:32 AM

#

I mean like the ones used in the UNet

wooden sail Apr 19, 2023, 7:33 AM

#

what about them?

pseudo moon Apr 19, 2023, 7:34 AM

#

i don't really get their purpose?

wooden sail Apr 19, 2023, 7:37 AM

#

there are 2 parts to this. the first is that neural networks get all their power from the usage of several activation functions. it's not always enough to have just 1 activation function, but the usage of several layers gives you the ability to represent any function. the second part is that the type of layer you use enforces a special behavior in a network. particularly for the U-Net, you can think of it as an autoencoder. it uses layers in such a way that the input should be close to the output, but the middle layers have very few parameters. this is the same as saying "this data can somehow be represented/encoded with very few parameters", which is very strong structural knowledge about the data.

pseudo moon Apr 19, 2023, 7:43 AM

#

So in other words, it gives more information to the activation function on how the data should be represented?

wooden sail Apr 19, 2023, 7:44 AM

#

what is "it" in your sentence?

pseudo moon Apr 19, 2023, 7:44 AM

#

concatenating layers

wooden sail Apr 19, 2023, 7:44 AM

#

no

#

just concatenating layers improves the representation power of the network

#

then the specific choice of which layers & activation to use makes the network behave in a certain way

pseudo moon Apr 19, 2023, 7:46 AM

#

hmm i see

#

just to confirm, to concatenate layers we use keras.layers.Concatenate() in keras and torch.cat() in pytorch?

wooden sail Apr 19, 2023, 7:53 AM

#

ah ok you're talking about something else

#

concatenation of this kind is to take several inputs and process them together

pseudo moon Apr 19, 2023, 7:56 AM

#

wooden sail ah ok you're talking about something else

what was the other concatenate that we were talking about?

cold osprey Apr 19, 2023, 7:58 AM

#

adding more layers of a neural network

pseudo moon Apr 19, 2023, 7:58 AM

#

ah

pseudo moon Apr 19, 2023, 8:03 AM

#

wooden sail concatenation of this kind is to take several inputs and process them together

what benefits are there to this?

#

can one concatenate several inputs of different sizes? or they should be of the exact same size?

mild dirge Apr 19, 2023, 8:06 AM

#

A tensor must be homogenous, so the same shape in all dimensions but the one you concatenate them in

wooden sail Apr 19, 2023, 8:08 AM

#

they NEED to share all dimensions except the concatenation dimension. otherwise the math operations on the mare not well defined

wooden sail Apr 19, 2023, 8:09 AM

#

pseudo moon what benefits are there to this?

if one has designed a network in a way that specific layers represent specific things, then it can make sense to process them together

pseudo moon Apr 19, 2023, 8:09 AM

#

i see

hasty mountain Apr 19, 2023, 12:50 PM

#

onyx abyss Hi guys in the context of my master thesis i work on such data images for a clas...

Have you tried taking a look at MedMNIST library? Perhaps there might be something that suits you

#

That library has images from cells in optical microscopy to ultrasound exams.

simple tapir Apr 19, 2023, 12:52 PM

#

Hi

hasty mountain Apr 19, 2023, 12:53 PM

#

pseudo moon i see

Oh, concatenation is usually quite useful for conditioning outputs.

#

People tend to concatenate embedding arrays into certain inputs to condition the output. That's common for Conditional GAN, Diffusion Model(condition output on time_step)

simple tapir Apr 19, 2023, 12:55 PM

#

How can i visualise this? https://pastecord.com/fyqijokugo.properties

lapis sequoia Apr 19, 2023, 12:59 PM

#

Hey guys , can you recommend a good tutorial for TensorFlow with python ! Thank you

simple tapir Apr 19, 2023, 1:00 PM

#

lapis sequoia Hey guys , can you recommend a good tutorial for TensorFlow with python ! Thank ...

I've taken Daniel Bourke's Pytorch tutorial and found it pretty good. You may want to check out his tensorflow tutorials perhaps

lapis sequoia Apr 19, 2023, 1:02 PM

#

simple tapir I've taken Daniel Bourke's Pytorch tutorial and found it pretty good. You may wa...

I don't know witch one is better. As i have understand tensorflow it's faster or am i wrong

simple tapir Apr 19, 2023, 1:04 PM

#

lapis sequoia I don't know witch one is better. As i have understand tensorflow it's faster or...

I didn't mean to compare these two ML libraries. I have been learning PyTorch and used Daniel Bourke's course, which were quality in my opinion and I saw that he has also tensorflow courses. That is why I wanted to suggest you to take a look at them

#

He explains the context pretty well

lapis sequoia Apr 19, 2023, 1:04 PM

#

simple tapir I didn't mean to compare these two ML libraries. I have been learning PyTorch an...

Thanks will check on that for sure !

simple tapir Apr 19, 2023, 1:05 PM

#

lapis sequoia Thanks will check on that for sure !

Anytime, good luck!

lapis sequoia Apr 19, 2023, 1:05 PM

#

Thanks may the force be with you

hoary wigeon Apr 19, 2023, 1:34 PM

#

Hi @everyone!

Has anyone worked on multi touch attribution model using Markov Chain approach?

#

Let me know if anyone has worked on it, that'd be a great help!

cold osprey Apr 19, 2023, 1:49 PM

#

lapis sequoia I don't know witch one is better. As i have understand tensorflow it's faster or...

If ure using windows n gpu, I'd suggest pytorch. Tensorflow 2.11+ doesn't natively support GPU on windows, u would need WSL

lapis sequoia Apr 19, 2023, 1:53 PM

#

cold osprey If ure using windows n gpu, I'd suggest pytorch. Tensorflow 2.11+ doesn't native...

have some VM for this type of problems, but this is good to know, so Thank you !

mint bridge Apr 19, 2023, 3:16 PM

#

How much math knowledge do I need to get into neural networks?

#

And what are some good resources to learn it?

raw compass Apr 19, 2023, 6:03 PM

#

the weights in a neuron is the connected inputs? so if the weight is 4 then its handling 4 input?

mild dirge Apr 19, 2023, 6:06 PM

#

If a neuron has 4 inputs, then there will be 4 weights (or 5 including bias). Bit confused by the way you phrase your question.

#

Each weight value is multiplied with the output of the neuron feeded into the next neuron.

#

raw compass Apr 19, 2023, 6:09 PM

#

mild dirge If a neuron has 4 inputs, then there will be 4 weights (or 5 including bias). Bi...

bias is just a constant which we can add to the product, like a function or distribution?

mild dirge Apr 19, 2023, 6:09 PM

#

As seen here (bit unclear because of back ground srr). Every node is connected to every node in the next layer.

raw compass Apr 19, 2023, 6:09 PM

#

mild dirge

I got this part

mild dirge Apr 19, 2023, 6:09 PM

#

A bias is just a input neuron that is always a value of 1

raw compass Apr 19, 2023, 6:10 PM

#

mild dirge Each weight value is multiplied with the output of the neuron feeded into the ne...

huu what do you mean?

#

can you elaborate?

raw compass Apr 19, 2023, 6:10 PM

#

mild dirge A bias is just a input neuron that is always a value of 1

then what is the point of using them?

#

like 1 * 27 -> 27

#

oh sorry maybe that is addition

mild dirge Apr 19, 2023, 6:11 PM

#

So for linear regression with a single input, if you only have a weight for that input, you can get any line that goes through the origin

agile cobalt Apr 19, 2023, 6:11 PM

#

there are two different kinds of bias we're talking about here
one is the input feature, which has a fixed value of 1
the other one is the one used in the activation of each neuron, which is adjusted during training

mild dirge Apr 19, 2023, 6:11 PM

#

So y = ax where x is the input, and a is the weight value

#

But often the line you want to approximate does not go through the origin

raw compass Apr 19, 2023, 6:12 PM

#

mild dirge So for linear regression with a single input, if you only have a weight for that...

I still don't get it, what kind of origin?

mild dirge Apr 19, 2023, 6:12 PM

#

One sec

#

agile cobalt Apr 19, 2023, 6:12 PM

#

raw compass I still don't get it, what kind of origin?

the "origin" is the (0, 0) point of a 2-d graph

raw compass Apr 19, 2023, 6:12 PM

#

agile cobalt the "origin" is the (0, 0) point of a 2-d graph

yes, but how comes this into the picture?

agile cobalt Apr 19, 2023, 6:13 PM

#

make a random guess for a y = x + b function that can generate a line like this
Now try to make one without b

raw compass Apr 19, 2023, 6:13 PM

#

agile cobalt make a random guess for a `y = x + b` function that can generate a line like thi...

I cannot?

mild dirge Apr 19, 2023, 6:14 PM

#

Without a bias the function you approximate with a model with 1 input is y = a*x but a bias allows you to approximate any function that looks like y = a*x + b

raw compass Apr 19, 2023, 6:14 PM

#

mild dirge Without a bias the function you approximate with a model with 1 input is `y = a*...

hmm

agile cobalt Apr 19, 2023, 6:15 PM

#

raw compass I cannot?

which is why there is the bias feature (value = 1) is added amongst your inputs

raw compass Apr 19, 2023, 6:16 PM

#

okay how you guys understood these things, it is more like maths isn't it?

raw compass Apr 19, 2023, 6:16 PM

#

agile cobalt which is why there is the bias feature (value = 1) is added amongst your inputs

it makes no sense to me

agile cobalt Apr 19, 2023, 6:18 PM

#

let's say that you have a row with ```
x1 | x2 | x3 | y
0 0 0 2

bias is used so that the model can adjust it predictions even on those cases

#

not sure how to explain beyond that - I'll leave it for PcCamel or recommend for you to look up some videos / tutorials explaining how neural networks work

mild dirge Apr 19, 2023, 6:22 PM

#

It is basically linear algebra yeah. Adding a bias allows you to perform an affine transformation, whereas otherwise you can only perform a linear transformation.

#

But the example etrotta showed is pretty good to see why a bias is needed in some cases.

lapis sequoia Apr 19, 2023, 7:42 PM

#

Hello guys
Im new to apache spark
Im currently using python so ill be using pyspark for my project
I wanted some advice on how i would manage a 130GB json file and use apache spark to optimize my file reading and writing so that i can take the dataset and insert it into my mongoDB databases

#

(ping me with your answer)

tough falcon Apr 19, 2023, 7:58 PM

#

few years ago I used to use matplotlib.
is this still used by the majority?

mild dirge Apr 19, 2023, 7:59 PM

#

Yeah matplotlib is pretty popular still.

nocturne eagle Apr 19, 2023, 8:03 PM

#

am I the only one who hates the matplotlib API?

mild dirge Apr 19, 2023, 8:06 PM

#

It's a bit messed up sometimes yes. f.e. plt.xlabel(...) but ax.set_xlabel(...), it's just not always consistent.

#

But for most simple stuff it is easy enough to use. It gets messy when you want to customize a lot

hasty mountain Apr 19, 2023, 8:48 PM

#

Is the key for successful language models simply making them train for many, many, many epochs?
The way their gradients behave is a bit annoying...

1/100
Total Epoch Loss: 4.5432255665461225
Gradients Average: -1.071544155489823e-11
Current output: 1
2/100
Total Epoch Loss: 4.136872115298214
Gradients Average: 9.643897486144581e-11
Current output: 1
[...]
25/100
Total Epoch Loss: 2.6726607568243628
Gradients Average: -1.3268730558735342e-10
Current output: 1
26/100
Total Epoch Loss: 2.618607923324801
Gradients Average: -3.750404500846294e-11
Current output: 4
[...]
100/100
Total Epoch Loss: 0.4769937649512769
Gradients Average: -1.1401899563390216e-10
Current output: 10

#

(I got output 1 for the first 20 epochs. Only after 30 epochs the output became consistently diverse)

#

How sad...it seems that Transformer got it even worse...even with the warmup steps... grumpchib

quick pulsar Apr 19, 2023, 9:52 PM

#

Sorry if I'm using the wrong channel. I have this code with sympy and it returns "TypeError: cannot determine truth value of Relational", what is the reason and how can I fix it?

tidal bough Apr 19, 2023, 10:01 PM

#

quick pulsar Sorry if I'm using the wrong channel. I have this code with sympy and it returns...

Looking at the docs, you don't seem to be passing the arguments in the right format for that solver... and why are you using this solver in the first place? It's for rational inequalities - like, ratio of two polynomials ≥ (or other relationship) 0. Your inequalities all seem to be polynomial ones, so solve_poly_inequalities would do.

#

though unsure if it supports multivariate systems.

#

i think sympy straight up doesn't support nontrivial multivariate systems of equations.

sweet crypt Apr 19, 2023, 11:43 PM

#

We just launched https://thedrive.ai/, a context-aware storage system. If you want a ChatGPT-like system for your files and want to write content based on stored documents, you might want to try it out. I would love to hear how you would use it, and open to any feedback. This python community has been incredibly for me, and I though I would share it here. lmk if I should delete it

lapis sequoia Apr 20, 2023, 1:09 AM

#

Has anyone installed Voice cloning Ai on local hardware?

thorn swift Apr 20, 2023, 1:12 AM

#

has anybody here deployed a tensorflow app in heroku?

#

i can not get my app to work

wanton vessel Apr 20, 2023, 1:36 AM

#

Good evening! Quick question how would I take a sns data plot and have it filter out results from a data frame for a specific year?

#

For example, this code plots the occurrence of age within a df. Within that df there is a column for the year. How would I create different graphs for each year.

dataframe["age"].plot.hist()

#

Sorry as I am still kind of new to Python so if my question is rather simple I apologize 🙂

livid goblet Apr 20, 2023, 4:56 AM

#

Can anyone please suggest me a beginner friendly book on Facial Recognition ?

arctic crown Apr 20, 2023, 5:05 AM

#

please help in a neural network in the hidden layer if the activation function is the same then whats the point of having so many different nodes?

upbeat prism Apr 20, 2023, 7:24 AM

#

Good <time of day>,

Anyone knows a homepage that teaches ML/DL/NLP concepts? E.g. MLP or Transformers or CNN etc. Basically a platform that provides a nice learning environment.

upbeat prism Apr 20, 2023, 7:48 AM

#

arctic crown please help in a neural network in the hidden layer if the activation function i...

Each neuron "represents a different proeprty of your data" play around with https://playground.tensorflow.org/

Tensorflow — Neural Network Playground

Tinker with a real neural network right here in your browser.

lapis sequoia Apr 20, 2023, 11:58 AM

#

Hello everyone,

I am currently learning about Machine Learning and TensorFlow and I am interested in developing an app that could help detect wildfires or identify areas that have a high potential of starting a fire. Specifically, I am looking for sources for satellite thermography images that can be used to train the machine learning model.

As this is a big project, I am looking for others who are also learning about Machine Learning and TensorFlow and would like to collaborate on this project. If you are interested in joining, please let me know. All levels of expertise are welcome!

Thank you for your time

quaint loom Apr 20, 2023, 11:58 AM

#

Is there anyone here who would know how to make a Latex table that is similar to the picture?

mild dirge Apr 20, 2023, 12:23 PM

#

It would probably be easier to make with an online diagram maker tbh. This looks like a nightmare to do with latex.

spark nimbus Apr 20, 2023, 1:21 PM

#

A pandas question:
I have a list of dates with no regular intervals. How do I efficiently get the last date of both the previous and next months?
I've tried using pandas.tseriest.offsets.MonthEnd but that's taking too long on large datasets for my use case

boreal gale Apr 20, 2023, 1:30 PM

#

spark nimbus A pandas question: I have a list of dates with no regular intervals. How do I ef...

could you post what you have tried, and potentially with some example data (e.g. a snippet to create some test data) to make the life of anyone who tries to help easier please?

queen cradle Apr 20, 2023, 1:30 PM

#

quaint loom Is there anyone here who would know how to make a Latex table that is similar to...

I wouldn't do that as a table. The arrows on the right, in particular, would be very hard to do that way. If I had to do this in LaTeX, I'd use TikZ. It looks to me like you'd create nodes for each of the blocks of text. Some would have colored backgrounds and some would have a blue foreground. Most of this is easy. The only thing I don't know how to do is the braces used between the left three columns. If you want actual TeX brace characters then that might be difficult; I'd guess that you'd want nodes containing something like $\left{\vbox to 2in{}\right.$ but aligning the nodes properly would be a mess. On the other hand if you're happy with TikZ drawing the braces, there's probably something easier.

spark nimbus Apr 20, 2023, 1:37 PM

#

boreal gale could you post what you have tried, and potentially with some example data (e.g....

df = pd.DataFrame({'date': pd.date_range(start='31/01/1990', end='31/01/2023', freq='D')})

# Generate offsets; This code currently takes too long on the existing dataset
df['prev_dt'] = df['date'] + np.timedelta64(1, 'D') + pd.tseries.offsets.MonthEnd(-2)
df['next_dt'] = df['date'] - np.timedelta64(1, 'D') + pd.tseries.offsets.MonthEnd(2)

#

the prev_dt line currently takes a little over 24 minutes to run

boreal gale Apr 20, 2023, 1:50 PM

#

how many rows are there? (in your actual dataframe, not the test one)

spark nimbus Apr 20, 2023, 1:51 PM

#

about 77 million

boreal gale Apr 20, 2023, 1:53 PM

#

yikes. that's a lot, are there duplicates?

spark nimbus Apr 20, 2023, 1:53 PM

#

in terms of date yes but the rows are all unique

boreal gale Apr 20, 2023, 2:03 PM

#

the way you are doing it now is already quite efficient, as far as pandas usage goes.

an alternative you could try is to get the unique dates, compute a map from date -> prev_date / next_date
and use df['date'].map(mapping) to generate the new columns, i am unsure this will be faster tbh but worth a shot?

also is it possible to generate this upstream? as in before you even read your data and generate it at the source (e.g. in your SQL or whatever source you are pulling from)

spark nimbus Apr 20, 2023, 2:03 PM

#

an alternative you could try is to get the unique dates, compute a map from date -> prev_date / next_date
and use df['date'].map(mapping) to generate the new columns, i am unsure this will be faster tbh but worth a shot?
Is .map faster than doing a left merge on a dataframe holding the dates? because that's my current attempt

#

also is it possible to generate this upstream? as in before you even read your data and generate it at the source (e.g. in your SQL or whatever source you are pulling from)
Unfortunately that'd be very unlikely, would need to ask my superiors, they'd need to talks to the data factory team, and they'd need to talk to all the other teams handing the data

spark nimbus Apr 20, 2023, 2:05 PM

#

spark nimbus > an alternative you could try is to get the unique dates, compute a map from da...

ok this approach shaved off ~55 minutes

#

I guess that'll be as good as it gets

boreal gale Apr 20, 2023, 2:05 PM

#

55 minutes 🤔 ?

boreal gale Apr 20, 2023, 2:06 PM

#

spark nimbus > an alternative you could try is to get the unique dates, compute a map from da...

Is .map faster than doing a left merge on a dataframe holding the dates? because that's my current attempt
not sure, i would think so.

spark nimbus Apr 20, 2023, 2:06 PM

#

previously it took 28 minutes to generate prev_dt (I did the math wrong before) and I'd expect the same time for next_dt (so 56 mins total)
now it did it in a little over a minute

spark nimbus Apr 20, 2023, 2:07 PM

#

boreal gale > Is .map faster than doing a left merge on a dataframe holding the dates? becau...

I didn't know .map was fast to vectorize, TIL

boreal gale Apr 20, 2023, 2:08 PM

#

spark nimbus previously it took 28 minutes to generate prev_dt (I did the math wrong before) ...

sweet! that sounds good enough to me without resorting to some sort of parallelism hackily

restive tiger Apr 20, 2023, 2:32 PM

#

Hi guys I would like to ask you a few questions about chatgpt I would like to create a similar assistant as my (vedal987 "neuro-sama") for twitch streaming that answers both chat and you personally via microphone. I have both of these functions but unfortunately each separate and I would need to gather them into one. If there is anyone here who understands this I would be very happy.

quaint loom Apr 20, 2023, 3:21 PM

#

queen cradle I wouldn't do that as a table. The arrows on the right, in particular, would be ...

Thank you so much for this.

white reef Apr 20, 2023, 3:49 PM

#

hey can any one help me with my project pls?

spiral peak Apr 20, 2023, 4:46 PM

#

Does pandas no longer allow negative index usage with .iloc?
loljk, turns out there's an issue with the data pipeline, we good

keen gust Apr 20, 2023, 5:12 PM

#

how can I filter for conditions when using a pandas group and apply()?

agile cobalt Apr 20, 2023, 5:18 PM

#

keen gust how can I filter for conditions when using a pandas group and apply()?

usually you'd filter before grouping by if possible, but iirc you can use slicing inside of a lambda function - not 100% sure though

agile cobalt Apr 20, 2023, 5:19 PM

#

spiral peak Does pandas no longer allow negative index usage with `.iloc`? loljk, turns out ...

!e peeposhrug ```py
import pandas as pd
se = pd.Series((1, 2, 3))
print(se.iloc[-1])

arctic wedgeBOT Apr 20, 2023, 5:19 PM

#

@agile cobalt :white_check_mark: Your 3.11 eval job has completed with return code 0.

agile cobalt Apr 20, 2023, 5:19 PM

#

guess it does

hot blade Apr 20, 2023, 5:31 PM

#

hi there, i've got a time series analysis question which asks me to design a model that solves a 'many-to-many sequence prediction problem' -- the only further clarification to this says 'it should expect several time steps for both input and output'

#

i am meant to select my own dataset for this

#

does it imply that the prediction and dataset should be univariate, or is this multivariate?

keen gust Apr 20, 2023, 5:47 PM

#

agile cobalt usually you'd filter before grouping by if possible, but iirc you can use slicin...

I thought about doing it that way but I guess since I've just been self teaching pandas that I'm always wondering if my thought process/solution is the 'ideal' one. But thank you, just wanted to check if there wasn't a method I hadn't yet come across

foggy yarrow Apr 20, 2023, 6:22 PM

#

What are some good resources to start with ML?
I've hear about Andrew Ng but I don't which course..

quartz wigeon Apr 20, 2023, 6:36 PM

#

foggy yarrow What are some good resources to start with ML? I've hear about Andrew Ng but I ...

I think he is on coursera

#

If I'm not mistaken, the course is also for free

#

Posting my question here because someone from the help channels recommended so:

Guidance with AI in Python
Some background before the question:
I'm a beginner at AI in Python and would like to start learning. I have basic understanding of machine learning and deep learning theory (cost function, linear regression, back propagation etc.), but I don't know where to start learning the practical stuff. Specifically, I'm interested in neural networks and reinforcement deep learning. I've watched a bunch of tutorial videos on YouTube, and the libraries used vary from video to video, which includes tensorflow, pytorch and keras.

My questions are:
What are the differences between tensorflow, pytorch and keras?
Most of the time I'm blindly copying code from tutorials and feel like I've learned nothing. Any ideas on the most stripped-down basic deep learning model that I can actually try creating myself? It doesn't matter if the model has no practical application, as long as its a good warm-up exercise for a beginner. (Something other than classifying irises/handwriting like all the tutorials on YouTube)

mint palm Apr 20, 2023, 6:54 PM

#

Cant we train two models parallely on same dataset? I got some weird error and seems its process synchronizaiton error.

bright garden Apr 20, 2023, 6:55 PM

#

quartz wigeon Posting my question here because someone from the help channels recommended so: ...

You should definitely find a project that you're invested in, that's the best way to learn. For me, it was working with NASA's MERRA-2 database because I had access to a ton of training data. You might want to find some cool databases online to get started, but spend a few months working with that data.

You'll learn how to code models, tune hyperparameters, visualise your training through Tensorboard or something similar, and, if you decide to write a report, documenting it.

#

That's how to solve the 'copying from tutorials and learning nothing' problem

#

Keras is basically a very abstract version of Tensorflow that makes it easy to code your first models. If you're interested in pursuing the field, I'd easily recommend either PyTorch or Tensorflow. They both have their fair share of haters and lovers, and personally, I prefer PyTorch because it has great documentation, feels very clean and straightforward to use, and is just nice overall.

#

Of course, they will have many technical and compatibility differences. For example, Tensorflow can train on TPUs, PyTorch can't (to my knowledge). PyTorch has ways to be deployed on mobile devices, etc. but since you're starting out, you can mostly ignore these until you actually get into more professional development

#

If you have any other questions, let me know

quartz wigeon Apr 20, 2023, 7:01 PM

#

bright garden You should definitely find a project that you're invested in, that's the best wa...

Thanks a lot for your help! I'll look into the NASA database you mentioned. Someone from the help channels also mentioned the website kaggle, I'll look into that too.

quartz wigeon Apr 20, 2023, 7:08 PM

#

bright garden If you have any other questions, let me know

Besides working with data, do you have any pointers on reinforcement deep learning? I'm not sure if its the correct term, but I'm looking for something akin to training neural networks by trial and error in a simulated environment. I've seen deep learning tutorials on YouTube recently like balancing a cartpole and it really piqued my interest. Which library would be suitable for that? One youtuber recommends a library called stablebaselines but I've never heard of it prior to that. Preferably, I would like something with ample documentation and community support.

bright garden Apr 20, 2023, 7:10 PM

#

quartz wigeon Besides working with data, do you have any pointers on reinforcement deep learni...

That's definitely an interesting field, but unfortunately, I don't have any experience in it. I've seen people make bots to play games using it.

But I'd reckon that it's definitely gonna be a lot more complex than a simple neural network. You should probably start out with a classifier or a regression problem, move onto a convolutional neural network, maybe try a recurrent network to work with audio or text. If you're interested in that, you could look into LSTMs and Transformers. Once you've built up a little experience with all that, I'm sure that you could try your hand at reinforcement learning or image generation or any of those

#

As for the libraries, it's probably just PyTorch and Tensorflow combined with some simulated environment. Maybe a IO library if it's a game or a 3D physics modelling library for the balancing model

mild dirge Apr 20, 2023, 7:13 PM

#

You may want to look into openai gym @quartz wigeon

#

This contains all kind of games/challenges to be solved using (deep) reinforcement learning

quartz wigeon Apr 20, 2023, 7:16 PM

#

bright garden That's definitely an interesting field, but unfortunately, I don't have any expe...

Thanks for the tip! Unfortunately I'm still a student with very little time on my hands. This will be a very long learning journey for me. I'll try to learn as much as I can during my free time. Thanks a lot, I really appreciate your help!

quartz wigeon Apr 20, 2023, 7:16 PM

#

mild dirge This contains all kind of games/challenges to be solved using (deep) reinforceme...

That's sounds interesting! I'll look into it.

arctic crown Apr 20, 2023, 7:43 PM

#

upbeat prism Each neuron "represents a different proeprty of your data" play around with htt...

hey sorry but I didn't really get anything from that website

novel python Apr 20, 2023, 10:14 PM

#

Guys, I wanted to fill all these NaN with the closest value above, is there an easy and vectorized way to do it in pandas? I only know how to do that with for loops.

tidal bough Apr 20, 2023, 10:15 PM

#

novel python Guys, I wanted to fill all these NaN with the closest value above, is there an e...

Yup: https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.fillna.html#pandas.DataFrame.fillna

novel python Apr 20, 2023, 10:16 PM

#

ohh, now I understand what backfill does lol

novel python Apr 20, 2023, 10:16 PM

#

tidal bough Yup: https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.fillna.html#p...

that was some dumb ass question tbh, thx a lot!

tidal bough Apr 20, 2023, 10:17 PM

#

if you want closest value above, wouldn't that be ffill?

novel python Apr 20, 2023, 10:17 PM

#

yea true, ffill

jovial valley Apr 21, 2023, 5:32 AM

#

Hi guys,

Any user of yolov8 on a mac M chip?
I’m trying to train a model for a instance segmentation task, and i’m trying to use MPS (Metal Performance Shaders) to make use of GPU cores. I’m assigning it to a torch device, then assigning this device to the model to be trained - this part seems to be ok. But when i start to train the model, i see ‘0’ of GPU memory.. it seems the GPU is not being allocated for this task, then my model is taking ages to be trained.

quaint loom Apr 21, 2023, 5:38 AM

#

I am currently trying to make a TikZ table in jupyter notebook and I am not sure how to visualize the table now. Can someone have a look at my code and see what mistakes I am making? https://paste.pythondiscord.com/ikarafifed

mint palm Apr 21, 2023, 6:14 AM

#

when I need sequential embedding from BERT, should I do add_special_tokens=False.
by sequential embedding I mean for input sentence output shape is (batch_size, token_dim, embedding_size). compared to NON-sequential output whose shape is (batch_size, embedding_size)

mild dirge Apr 21, 2023, 8:02 AM

#

How do people normally label their data? Are there programs to efficiently do this?

junior mortar Apr 21, 2023, 8:13 AM

#

Hello hello to everybody i would like someone to help me with some training approaches

#

🤘🤘

haughty pewter Apr 21, 2023, 8:20 AM

#

Could someone explain how they jumped to 512 nodes in the middle 2 rows? It starts from 784 nodes from a 28x28 image at the bottom, but I don't know how it moves to 512 next

#

https://www.tutorialspoint.com/deep_learning_with_keras/keras_creating_deep_learning_model.htm This here also started from 784 -> 512 -> 512 -> 10, but it also doesn't explain where 512 came from

tidal bough Apr 21, 2023, 8:41 AM

#

these are dense layers, so they can be any number of nodes

#

assuming, that is, that the first operation here (the one done on the image) is a dense layer too - usually one does convolution layers first, and 512 would indeed be a strange number of outputs for a convolution

haughty pewter Apr 21, 2023, 8:45 AM

#

I found this guide which seems to allow me to get to 512, except unlike what was written, I had to minus 10 instead of adding it

tidal bough Apr 21, 2023, 8:47 AM

#

You'll note that it says "should", not "is". The number of neurons in hidden layers can be anything you choose.

lapis sequoia Apr 21, 2023, 9:52 AM

#

I currently have some knowledge of pandas and some machine learning algo from the sklearn library. Have been using jupyter notebook exclusively till now. Recently I did a self project that have hundreds of thousands of rows and took hours to do even simple things. How can I get started with using my GPU for ml? How vastly would the code be for using GPU as compared to the one I've been writing till now?

mild dirge Apr 21, 2023, 10:27 AM

#

There are libraries you can use to be able to use the gpu. Libraries like pytorch and tensorflow work with CUDA.

#

Which means that the code will not be much more complicated, just a single line with my_arr.to(device) will do

#

And notebooks are not great for programs that require a lot of memory and computing power

#

Garbage collection is garbage in a notebook (pun intended)

spark nimbus Apr 21, 2023, 10:33 AM

#

I have a dataframe with ~77m rows in the following format:
[key_1, key_2, start_date, end_date]
and I want to convert it to the following format:
[key_1, key_2, date] for each month interval between start_date and end_date.
How would I efficiently accomplish this? Here's the current code:

df['Date'] = df.apply(lambda x: pd.date_range(x['start_date'], x['end_date'], freq='M'), axis=1)
df = df.explode().reset_index(drop=True)
```which takes ~40 minutes. The main concern I have is the .apply function since it wouldn't be vectorized, but I don't see how I would effectively do this since pd.date_range does not accept Series parameters.

mighty patio Apr 21, 2023, 10:33 AM

#

mild dirge And notebooks are not great for programs that require a lot of memory and comput...

this depends on use, for most data science applications notebooks are the norm even if the amount of data is very large

spark nimbus Apr 21, 2023, 10:37 AM

#

mighty patio this depends on use, for most data science applications notebooks are the norm e...

I guess there's a cutoff for that, since where I work we've recently had to make a policy to only use notebooks when not using the full dataset, since some users started using ~100+ GB RAM for not properly deleting unused variables they removed from the code without properly restarting the kernel

mighty patio Apr 21, 2023, 10:42 AM

#

IMO that sounds like user error where ppl do not shut down their notebooks after they are done, but instead leave them running.

boreal gale Apr 21, 2023, 10:45 AM

#

spark nimbus I have a dataframe with ~77m rows in the following format: `[key_1, key_2, start...

have you looked into a similar trick used yesterday?
i.e. get the unique dates, generate the date range with apply, explode, join it with og dataframe

spark nimbus Apr 21, 2023, 10:50 AM

#

unfortunately there's very few duplicate combinations of (start_date, end_date), so it wouldn't work very well

#

we're talking like maybe 100 records in the entire dataset

boreal gale Apr 21, 2023, 10:51 AM

#

understood, let me have a quick think

mint palm Apr 21, 2023, 10:56 AM

#

mild dirge How do people normally label their data? Are there programs to efficiently do th...

usually setup a system to make annotator more efficient in annotation, sometime use multiple annotators and average them/choose median for unbiased annotations

boreal gale Apr 21, 2023, 11:12 AM

#

spark nimbus we're talking like maybe 100 records in the entire dataset

my current best:

import pandas as pd
import numpy as np

df = pd.DataFrame(
    {
        "start_date": pd.date_range("2000-01-01", "2100-01-01", freq='M'),
        "end_date": pd.date_range("2100-01-01", "2200-01-01", freq='M'),
    }
)

full_date_range = pd.date_range(df['start_date'].min(), df['end_date'].max(), freq='M')
df['start_ind'] = np.searchsorted( full_date_range, df['start_date'] )
df['end_ind'] = np.searchsorted( full_date_range, df['end_date'] , side='right')
df.apply(lambda x: full_date_range[x['start_ind']: x['end_ind']],axis=1)

it might or might not be faster for you depending on the actual data you have

#

and to use every core hackily...

import pandas as pd
import numpy as np
import multiprocessing as mp

df = pd.DataFrame(
    {
        "start_date": pd.date_range("2000-01-01", "2100-01-01", freq='M'),
        "end_date": pd.date_range("2100-01-01", "2200-01-01", freq='M'),
    }
)

def get_dem_date_ranges(df_slice):
    return df_slice.apply(lambda x: full_date_range[x['start_ind']: x['end_ind']],axis=1)


full_date_range = pd.date_range(df['start_date'].min(), df['end_date'].max(), freq='M')
df['start_ind'] = np.searchsorted( full_date_range, df['start_date'] )
df['end_ind'] = np.searchsorted( full_date_range, df['end_date'] , side='right')
with mp.Pool(processes=10) as pool:
    res = pd.concat(pool.map(get_dem_date_ranges, np.array_split(df, 10)))

#

at some point it might be better to just shuffle your workload to a MPP framework e.g. spark...

spark nimbus Apr 21, 2023, 11:29 AM

#

I'll forward that suggestion to my boss :)

fallow frost Apr 21, 2023, 11:29 AM

#

does anybody know if Pyarrow's IPC format is faster than parquet when S3 is used as the FileSystem?

spark nimbus Apr 21, 2023, 11:30 AM

#

fallow frost does anybody know if `Pyarrow`'s `IPC` format is faster than `parquet` when S3 i...

I can at least confirm that on high-end servers, parquet is better for local files, not sure about S3

fallow frost Apr 21, 2023, 11:31 AM

#

spark nimbus I can at least confirm that on high-end servers, parquet is better for local fil...

yeah I tested locally all the formats avaiable for ds.dataset() and parquet is the fastest and more compact

#

but I wonder if IPC is faster over the network, as I have this script that reads/writes 6 billion records, and when its its executed locally it takes 3 hours,
but when I run it as an AWS Batch and use S3 as the filesystem, it takes 9 hours!

boreal gale Apr 21, 2023, 11:34 AM

#

fallow frost does anybody know if `Pyarrow`'s `IPC` format is faster than `parquet` when S3 i...

what is your workload when using S3 as fs?

have you looked into S3 select?

fallow frost Apr 21, 2023, 11:35 AM

#

boreal gale what is your workload when using S3 as fs? have you looked into S3 select?

my workload involves reading and writing (write 6 billion records, read 5.4 billion)

#

what is S3 select?

boreal gale Apr 21, 2023, 11:41 AM

#

okay nevermind then, s3 select is basically a service for pushing some of your analytics workload to AWS instead of doing it locally, it only supports a very limited dialect of SQL.

#

https://docs.aws.amazon.com/AmazonS3/latest/userguide/selecting-content-from-objects.html

Filtering and retrieving data using Amazon S3 Select - Amazon Simpl...

Run a specified SQL expression against an object in Amazon S3, and return query results in response.

#

are you using a single parquet or multiple parquet?
did you use multipart upload?

fallow frost Apr 21, 2023, 11:44 AM

#

boreal gale are you using a single parquet or multiple parquet? did you use multipart upload...

I"m using the class from ds.dataset() to handle all the reading and writing, in total everything is splitted in 6k files

#

do you know if there is a better pyarrow format for interacting with S3??

sleek harbor Apr 21, 2023, 11:56 AM

#

When to use criterion='gini' and when to use criterion='entropy' for DecisionTreeClassifier? I'm getting conflicting results when googling.. some say that Gini is just between 0 and 0.5 and will produce the same results as using entropy (which is between 0 and 1), but Gini is more computationally efficient, so Gini should always be used.. but some state otherwise, and if it is so, then why do the other options even exist?

boreal gale Apr 21, 2023, 12:05 PM

#

fallow frost do you know if there is a better pyarrow format for interacting with S3??

i don't know actually.

but to me, at the end of the day, it's just transferring data to and from S3 + CPU time used to (de)compress any data, imo just pick the one with best compression rate (provided you don't care about other characteristics of file format, e.g. predicate pushdown possibility of parquet)

#

i also have no idea what the heck is ds.dataset() 🤔

spark nimbus Apr 21, 2023, 12:05 PM

#

boreal gale my current best: ```py import pandas as pd import numpy as np df = pd.DataFrame...

df.apply(lambda x: full_date_range[x['start_ind']: x['end_ind']],axis=1)
do you happen to know of a way to do this vectorized?

fallow frost Apr 21, 2023, 12:06 PM

#

boreal gale i also have no idea what the heck is `ds.dataset()` 🤔

https://arrow.apache.org/docs/python/generated/pyarrow.dataset.Dataset.html