weary timber Dec 5, 2024, 6:55 PM

#

1,784

serene scaffold Dec 5, 2024, 6:55 PM

#

weary timber 1,784

you have np.matmul(params["W1"], cache["A0"].T)

#

do you see?

weary timber Dec 5, 2024, 6:56 PM

#

so i dont do .t?

#

or i change the untransposed A0 (input)'s shape?

serene scaffold Dec 5, 2024, 6:56 PM

#

@weary timber try it and see.

weary timber Dec 5, 2024, 6:57 PM

#

serene scaffold <@1063131445792612383> try it and see.

one last thing, i do feedforward with all the training data is it a problem?

#

like i do data = load()

#

and do feedforward(data)

serene scaffold Dec 5, 2024, 6:57 PM

#

weary timber one last thing, i do feedforward with all the training data is it a problem?

do you know about training data and testing data?

weary timber Dec 5, 2024, 6:57 PM

#

yes

grizzled verge Dec 5, 2024, 11:59 PM

#

Hey guys, currently working on a text to handwriting letter model with EMNIST but my outputs for this jupyter notebook are really bad and I dont know why. ANy tips to improve it?

#

#

Like is something wrong with my training or

#

or potentially how my image is being saved

warm copper Dec 6, 2024, 12:18 AM

#

serene scaffold do you know about training data and testing data?

Hello! When you have freetime can you check my class project? I just want to get feedback from people who are in the field

warm copper Dec 6, 2024, 2:09 AM

#

https://drive.google.com/file/d/1RiYN9c1dn2V6eyRku1r7jgX83M35nwyh/view?usp=sharing

Google Docs

Sentiment Analysis in the Finance Domain.pdf

pure valve Dec 6, 2024, 5:33 AM

#

no ultralytics_runner found hwo to fix

unkempt apex Dec 6, 2024, 6:11 AM

#

grizzled verge

why you guys use GPT to code

#

just go step by step

#

and you will understand more things

#

rather than directly put the code and ask here

north adder Dec 6, 2024, 8:01 AM

#

Hello so i was working on a computer vision project for first time ever (project is to extract data from Invoices). I saw some YT videos that they extract the text and use Regular expressions. Is this the optimal way? or is there another approach ? Thanks in advance

coarse nacelle Dec 6, 2024, 9:56 AM

#

20,000 samples something

#

I also tried this logic, which is faster than the last. but the validation accuracy still lacks.

def build_image2text_model():
    image_input = Input(shape=(5, 28, 28, 1))  # 5 images per sequence

    # feature extraction with regularization
    conv_features = TimeDistributed(Conv2D(32, (3, 3), activation='relu', padding='same'))(image_input)
    conv_features = TimeDistributed(MaxPooling2D((2, 2)))(conv_features)
    conv_features = TimeDistributed(BatchNormalization())(conv_features)
    conv_features = TimeDistributed(Dropout(0.3))(conv_features)
    conv_features = TimeDistributed(Flatten())(conv_features)

    encoder_output = LSTM(128, return_sequences=True, kernel_regularizer=l2(1e-4))(conv_features)
    encoder_output = Dropout(0.4)(encoder_output)

    decoder_input = RepeatVector(3)(encoder_output[:, -1])

    decoder_output = LSTM(128, return_sequences=True, kernel_regularizer=l2(1e-4))(decoder_input)

    # attention mechanism
    encoder_aligned = Dense(128)(encoder_output)
    attention_output = Attention()([decoder_output, encoder_aligned])

    decoder_combined = Concatenate()([decoder_output, attention_output])

    # final layer
    output = TimeDistributed(Dense(13, activation='softmax'))(decoder_combined)

    # compile the model
    model = Model(inputs=image_input, outputs=output)
    model.compile(optimizer=Adam(learning_rate=1e-3),
                  loss='categorical_crossentropy',
                  metrics=['accuracy'])
    
    return model

# training model

image2text_model = build_image2text_model()
image2text_model.fit(
    X_img, y_text_onehot,
    epochs=40, batch_size=128,
    validation_split=0.1
)

grizzled verge Dec 6, 2024, 10:33 AM

#

unkempt apex why you guys use GPT to code

kinda short on time for my school project sorry

#

I think it was because the emnist letters has 27 classes and my training was trying to use all the possible classes instead of just 27

#

still working on it

untold fable Dec 6, 2024, 5:29 PM

#

Now in those days I really like pythtorch

#

It's code like in real pythonic

#

I really love that

weary timber Dec 6, 2024, 6:01 PM

#

!code

arctic wedgeBOT Dec 6, 2024, 6:01 PM

#

Formatting code on Discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

weary timber Dec 6, 2024, 6:02 PM

#

import numpy as np
import pandas as pd

data = pd.read_csv(r"C:\Users\mehme\OneDrive\Desktop\neural network\train.csv")

data = np.array(data)
m, n = data.shape
np.random.shuffle(data)

data_dev = data[0:1000].T
Y_dev = data_dev[0]
X_dev = data_dev[1:n]
X_dev = X_dev / 255.

data_train = data[1000:m].T
Y_train = data_train[0]
X_train = data_train[1:n]
X_train = X_train / 255.
_,m_train = X_train.shape

def sigmoid(n,deriv=False):
    x = 1 / (1 + np.exp(-n))
    if deriv:
        return x * (x-1)
    return x

def init():
    global params,cache
    params = {"W1":np.random.randn(10,784),"B1":np.random.randn(10,1),"W2":np.random.randn(10,10),"B2":np.random.randn(10,1)}
    cache = {}

def feedforward(input):
    cache["A0"] = input
    cache["Z1"] = params["W1"].dot(cache["A0"]) + params["B1"]
    cache["A1"] = sigmoid(cache["Z1"])
    cache["Z2"] = params["W2"].dot(cache["A1"]) + params["B2"]
    cache["A2"] = sigmoid(cache["Z2"])

def backprop(input,desired):
    m = input.size
    feedforward(input)
    dZ2 = cache["A2"] - desired
    dW2 = 1 / m * dZ2.dot(cache["A1"].T)
    db2 = 1 / m * np.sum(dZ2 , 2)
    dZ1 = params["W2"].T.dot(dZ2) * sigmoid(cache["Z1"],True)
    dW1 = 1 / m * dZ1.dot(cache["A0"].T)
    db1 = 1 / m * np.sum(dZ1)
    return dW1,db1,dW2,db2

def update_params(dW1,db1,dW2,db2,learning_rate):
    params["W1"] -= learning_rate * dW1
    params["B1"] -= learning_rate * db1
    params["W2"] -= learning_rate * dW2
    params["B2"] -= learning_rate * db1

def gradient_descent(x_train,y_train,learning_rate,iterations):
    init()
    for i in range(iterations):
        dW1,db1,dW2,db2 = backprop(x_train,y_train)
        update_params(dW1,db1,dW2,db2,learning_rate)
        if i % 50 == 0:
            print(i)

gradient_descent(X_train,Y_train,0.1,10)
feedforward(X_dev[0][0])

#

can someone with time check this code and tell me how do i fix the error

#

at backprop while calculating the db2 it gives an error saying the dz2 is 2 dimensional so it cant be summed in axis = 2

#

how does that make sense i cant get it

#

and another problem, when it works(when i set the sum to be done in axis 1) the output neurons returns 10x784 array

wheat snow Dec 7, 2024, 3:15 AM

#

Hey, i have a Pandas series ```
session_duration
0 26.625450
17 28.681083
20 35.153633
25 27.660017
28 32.193067


and a df which has a column session. I now want to basiccly get the data in the df that only have theese sessions in it, how would i filter

#

#

multible lines have the same session

serene scaffold Dec 7, 2024, 3:17 AM

#

wheat snow Hey, i have a Pandas series ``` session_duration 0 26.625450 17 28.68...

so you only want to get rows of df where the index is one of those five (0, 17, 20, 25, 28)?
also, what is the name of that series variable?

wheat snow Dec 7, 2024, 3:17 AM

#

its basiccly spotify sessions that i set to be continues listenings sessions

subtle talon Dec 7, 2024, 3:17 AM

#

wheat snow Hey, i have a Pandas series ``` session_duration 0 26.625450 17 28.68...

this looks sigma

wheat snow Dec 7, 2024, 3:18 AM

#

i tried this:

serene scaffold Dec 7, 2024, 3:18 AM

#

subtle talon this looks sigma

if you're just here to troll, kindly refrain.

wheat snow Dec 7, 2024, 3:18 AM

#

session_duration_avg_adapted_ids= session_duration_avg_adapted.index
session_duration_avg_adapted

df_session= [df['session']== session_duration_avg_adapted_ids]

serene scaffold Dec 7, 2024, 3:18 AM

#

wheat snow i tried this:

can you answer my questions before you continue?

subtle talon Dec 7, 2024, 3:19 AM

#

serene scaffold if you're just here to troll, kindly refrain.

im sorry man

#

i didnt mean to🥺

wheat snow Dec 7, 2024, 3:19 AM

#

serene scaffold so you only want to get rows of `df` where the index is one of those five (0, 17...

yes, name is session_duration

serene scaffold Dec 7, 2024, 3:19 AM

#

wheat snow yes, name is session_duration

try df_session.loc[session_duration.index]

serene scaffold Dec 7, 2024, 3:20 AM

#

subtle talon im sorry man

perhaps I misunderstood you. can you explain what it means for something to be sigma?

subtle talon Dec 7, 2024, 3:20 AM

#

a sigma is a cool person

#

that means ur code is really good

#

i think

#

idk how to read code

#

is this the javascript server

serene scaffold Dec 7, 2024, 3:21 AM

#

ah. I thought a "sigma male" was someone undesireable.

subtle talon Dec 7, 2024, 3:21 AM

#

serene scaffold ah. I thought a "sigma male" was someone undesireable.

my friend is saying javascript is better than python

serene scaffold Dec 7, 2024, 3:22 AM

#

subtle talon my friend is saying javascript is better than python

get better friends.

subtle talon Dec 7, 2024, 3:22 AM

#

is javascipt bad

wheat snow Dec 7, 2024, 3:22 AM

#

"session_duration"

#

this has to be a column in my df to work right?

serene scaffold Dec 7, 2024, 3:22 AM

#

No. I'm just being silly. He can have that opinion.

subtle talon Dec 7, 2024, 3:22 AM

#

serene scaffold No. I'm just being silly. He can have that opinion.

oh ok

#

👍

serene scaffold Dec 7, 2024, 3:22 AM

#

wheat snow this has to be a column in my df to work right?

Do exactly the code that I gave you and let me know what happens.

#

actually, do df_session.loc[session_duration_avg_adapted.index]

#

again, do this exactly.

wheat snow Dec 7, 2024, 3:23 AM

#

KeyError: "None of [Int64Index([ 0, 17, 20, 25, 28, 48, 49, 58, 77, 125,\n ...\n 2982, 2987, 2991, 2995, 2998, 3012, 3014, 3023, 3034, 3055],\n dtype='int64', name='ts', length=236)] are in the [index]

#

lemme check dtypes

subtle talon Dec 7, 2024, 3:24 AM

#

my friend say do you prefer tf or pt

wheat snow Dec 7, 2024, 3:24 AM

#

might be string even tho that would be weird

subtle talon Dec 7, 2024, 3:24 AM

#

i prefer subway

serene scaffold Dec 7, 2024, 3:24 AM

#

please run the code that I gave exactly, and if you get an error, show the whole error message

subtle talon Dec 7, 2024, 3:24 AM

#

he said by pt is pie torch

serene scaffold Dec 7, 2024, 3:25 AM

#

subtle talon my friend say do you prefer tf or pt

I don't like pytorch per se, but development of tensorflow is slowing down, so I don't consider it a viable option.

subtle talon Dec 7, 2024, 3:25 AM

#

i like subway do you like subway?

wheat snow Dec 7, 2024, 3:25 AM

#

sorry, i think i have found something important ratehr than the message

subtle talon Dec 7, 2024, 3:26 AM

#

u know what

#

i dont deserve to be here

#

some one ban me

serene scaffold Dec 7, 2024, 3:26 AM

#

wheat snow sorry, i think i have found something important ratehr than the message

okay; if you need help from me again, ping me with the result of the exact code and the whole error message.

serene scaffold Dec 7, 2024, 3:26 AM

#

subtle talon i dont deserve to be here

you do deserve to be here.

subtle talon Dec 7, 2024, 3:26 AM

#

NO

#

BAN

#

BAN

#

BAN

serene scaffold Dec 7, 2024, 3:27 AM

#

@subtle talon you deserve to be here. please do not squander what you deserve by spamming.

subtle talon Dec 7, 2024, 3:27 AM

#

ok man

#

whats a squander

wheat snow Dec 7, 2024, 3:27 AM

#

@serene scaffold i dont think you can work with that but since u insist:

serene scaffold Dec 7, 2024, 3:27 AM

#

subtle talon whats a squander

to recklessly waste

subtle talon Dec 7, 2024, 3:27 AM

#

oh ok

wheat snow Dec 7, 2024, 3:28 AM

#

KeyError                                  Traceback (most recent call last)
Input In [120], in <cell line: 4>()
      1 session_duration_avg_adapted= session_duration[(session_duration>= 26) & (session_duration<=36) ]
      2 session_duration_avg_adapted
----> 4 df_session= df.loc[session_duration_avg_adapted.index]
      5 df_session

File c:\Users\Grr\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\indexing.py:967, in _LocationIndexer.__getitem__(self, key)
    964 axis = self.axis or 0
    966 maybe_callable = com.apply_if_callable(key, self.obj)
--> 967 return self._getitem_axis(maybe_callable, axis=axis)

File c:\Users\Grrr\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\indexing.py:1191, in _LocIndexer._getitem_axis(self, key, axis)
   1188     if hasattr(key, "ndim") and key.ndim > 1:
   1189         raise ValueError("Cannot index with multidimensional key")
-> 1191     return self._getitem_iterable(key, axis=axis)
   1193 # nested tuple slicing
   1194 if is_nested_tuple(key, labels):

File c:\Users\Grrr\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\indexing.py:1132, in _LocIndexer._getitem_iterable(self, key, axis)
   1129 self._validate_key(key, axis)
   1131 # A collection of keys
-> 1132 keyarr, indexer = self._get_listlike_indexer(key, axis)
   1133 return self.obj._reindex_with_indexers(
...
-> 5842     raise KeyError(f"None of [{key}] are in the [{axis_name}]")
   5844 not_found = list(ensure_index(key)[missing_mask.nonzero()[0]].unique())
   5845 raise KeyError(f"{not_found} not in index")

KeyError: "None of [Int64Index([   0,   17,   20,   25,   28,   48,   49,   58,   77,  125,\n            ...\n            2982, 2987, 2991, 2995, 2998, 3012, 3014, 3023, 3034, 3055],\n           dtype='int64', name='ts', length=236)] are in the [index]"

#

so again in normal df we have multible copies of a session value

serene scaffold Dec 7, 2024, 3:30 AM

#

@wheat snow try this

indexer = (session_duration >= 26) & (session_duration <= 36)
df_session = session_duration.loc[indexer]

wheat snow Dec 7, 2024, 3:32 AM

#

worked but i already had that result

#

thats just the pandas series

serene scaffold Dec 7, 2024, 3:33 AM

#

where did session_duration come from?

#

What's the "mother dataframe"?

#

You can do .loc[indexer] on that.

wheat snow Dec 7, 2024, 3:34 AM

#

thing, is session_duration is not a column of the df

#

it is just a value i calculated by grouping miliseconds_played and the session togethe

#

# Assuming df is your DataFrame with 'ts' as the index
# Convert 'ts' to datetime if it's not already
df.index = pd.to_datetime(df.index)

# Sort the DataFrame by timestamp
df = df.sort_index()

# Calculate the time difference between consecutive tracks
df['time_diff'] = df.index.to_series().diff().dt.total_seconds()

# Define a session break threshold (e.g., 30 minutes)
session_break_threshold = 30 * 60  # in seconds

# Identify sessions
df['session'] = (df['time_diff'] > session_break_threshold).cumsum()

# Calculate session duration
session_duration = df.groupby('session')['ms_played'].sum() / 1000 / 60  # Convert to minutes
session_duration= session_duration[session_duration >=1]
``` maybe that helps

wheat snow Dec 7, 2024, 4:22 AM

#

that seemed to work so far @serene scaffold

#

indexer = (session_duration >= 26) & (session_duration <= 36)
df_session = df[df['session'].isin(indexer.index)]

warm copper Dec 7, 2024, 5:02 AM

#

I submitted my paperrrrr

lofty vortex Dec 7, 2024, 7:06 AM

#

How do I show probability distributions like this using matplotlib? X-axis shows for which depth this distribution is, y-axis shows dates

#

This plot was generated by R library

weary timber Dec 7, 2024, 7:40 AM

#

weary timber can someone with time check this code and tell me how do i fix the error

help me with this please

trim saddle Dec 7, 2024, 8:12 AM

#

weary timber help me with this please

Try printing the arrays and their shape between steps, that way you can see were things get messy

radiant rock Dec 7, 2024, 8:38 AM

#

#

Hey guys, do you think this a good confusion matrix?

azure burrow Dec 7, 2024, 11:54 AM

#

It's kinda confusing so at least kinda. I'm pretty sure there are some metrics that you can use to calculate how good your predictions are based on it, though, and estimate that more "objectively".

weary timber Dec 7, 2024, 6:58 PM

#

trim saddle Try printing the arrays and their shape between steps, that way you can see were...

the problem occurs when i set the sum axis to 2 when summing dZ2. if you know how i can fix that that fixes the output being 100arrays too

weary timber Dec 7, 2024, 7:59 PM

#

trim saddle Try printing the arrays and their shape between steps, that way you can see were...

turns out the output arrays is (10,inputSize)

#

like the values of the output gets turned into an array with inputsize amount of the value

slate drum Dec 8, 2024, 3:27 AM

#

weary timber at backprop while calculating the db2 it gives an error saying the dz2 is 2 dime...

db2 = 1 / m * np.sum(dZ2, axis=1, keepdims=True)

#

@weary timber

grand breach Dec 8, 2024, 6:36 AM

#

how can I manage an existing conda env using pixi - should i export conda yaml and convert that to pixi.toml

weary timber Dec 8, 2024, 12:04 PM

#

slate drum db2 = 1 / m * np.sum(dZ2, axis=1, keepdims=True)

i have done it but the calcs are wrong, can you help me out? https://paste.pythondiscord.com/7EYA

slate drum Dec 8, 2024, 4:28 PM

#

weary timber i have done it but the calcs are wrong, can you help me out? https://paste.pytho...

yes, lets try and figure it out together

weary timber Dec 8, 2024, 4:28 PM

#

slate drum yes, lets try and figure it out together

👍🏿

#

dm?

slate drum Dec 8, 2024, 4:29 PM

#

weary timber dm?

sounds good

versed pilot Dec 8, 2024, 10:26 PM

#

lofty vortex How do I show probability distributions like this using matplotlib? X-axis shows...

hi, have a look at Matt Harrison's Essential Statistics course (using Seaborn, sorry!) https://github.com/LinkedInLearning/python-statistics-essential-training-4433355/blob/main/PyStats-Solutions.ipynb

GitHub

python-statistics-essential-training-4433355/PyStats-Solutions.ipyn...

This is a repository for the LinkedIn Learning course Python Statistics Essential Training - LinkedInLearning/python-statistics-essential-training-4433355

lapis sequoia Dec 9, 2024, 1:13 AM

#

Hi I want to build a machine learning model that can detect toxic messages throughout different languages

#

Do you guys have any ideas?

agile cobalt Dec 9, 2024, 1:36 AM

#

lapis sequoia Hi I want to build a machine learning model that can detect toxic messages throu...

Kaggle has some datasets for that, at least for some of the most common languages like English and Portuguese

lapis sequoia Dec 9, 2024, 1:39 AM

#

I need arabic

lapis sequoia Dec 9, 2024, 1:56 AM

#

Just read this:

https://www.researchgate.net/publication/374997490_Arabic_Toxic_Tweet_Classification_Leveraging_the_AraBERT_Model

And it seems like classic Multilingual models like BERT are less effective than language specific. Is it a viable solution to first detect the language then use a model designed for that language specifically?

agile cobalt Dec 9, 2024, 2:06 AM

#

yes

#

in some cases a single model trained in multiple languages could perform better than models trained only in a single language - specially if the languages are similar to each other (e.g. Latin derived European languages) and you don't have a lot of training data for each

for this, just training specifically in one language should be fine though

#

just pay attention to class (in)balance and determine if you'd rather prioritize accuracy or recall

weary timber Dec 9, 2024, 5:13 AM

#

def backprop(DESIRED):
    global cache,params
    m = DESIRED.size
    DESIRED = np.eye(10)[DESIRED].T
    dZ2 = cache["A2"] - DESIRED
    dW2 = 1 / m * dZ2.dot(cache["A1"].T)
    dB2 = 1 / m * np.sum(dZ2,axis=1,keepdims=True)
    dZ1 = params["W2"].T.dot(dZ2) * sigmoid(cache["Z1"],True)
    dW1 = 1 / m * dZ1.dot(cache["A0"].T)
    dB1 = 1 / m * np.sum(dZ1,axis=1,keepdims=True)
    return dW1,dB1,dW2,dB2

isnt this a true implementation?

#

note that the inputs im making are transposed

#

@wooden sail

serene scaffold Dec 9, 2024, 11:13 AM

#

weary timber <@467435887236612106>

You do not have standing permission to ping people who have helped you in the past

keen perch Dec 9, 2024, 1:40 PM

#

What are steps to process data, like when processing data what are the things to consider or do for example: handling missing values duplicates, what else

agile cobalt Dec 9, 2024, 1:42 PM

#

it varies a lot depending on which data you're working with

serene scaffold Dec 9, 2024, 2:30 PM

#

keen perch What are steps to process data, like when processing data what are the things t...

beginner data science material sometimes makes it sound like preprocessing is a well-defined thing with exact steps that you always follow. but it isn't really like that.

weary timber Dec 9, 2024, 4:39 PM

#

serene scaffold You do not have standing permission to ping people who have helped you in the pa...

sorry!

livid locust Dec 9, 2024, 6:00 PM

#

https://discord.com/channels/267624335836053506/1315739531928145980

slate drum Dec 9, 2024, 8:04 PM

#

@livid locust locked now, you still want help?

livid locust Dec 9, 2024, 8:19 PM

#

yeah i’ll come back here after my classes i didn’t realize the time frame

slate drum Dec 9, 2024, 8:21 PM

#

we should just dm or make a group ?

reef eagle Dec 9, 2024, 8:53 PM

#

https://github.com/lucifertrj/Awesome-RAG/

found some useful GitHub repo with multiple notebooks on RAG mainly using Open Source LLMs

GitHub

GitHub - lucifertrj/Awesome-RAG: RAG-VectorDB-Embedings-LlamaIndex-...

RAG-VectorDB-Embedings-LlamaIndex-Langchain. Contribute to lucifertrj/Awesome-RAG development by creating an account on GitHub.

agile cobalt Dec 9, 2024, 11:42 PM

#

as long as it doesn't goes too much into self-advertising

#

post it and we'll say if it's too much I guess

#

is that satire?

#

regardless... I would recommend putting things in GitHub instead of just pasting in the pastebin
and (if you have any) linking your citations or other useful resources

hard vortex Dec 10, 2024, 12:10 AM

#

anyone know how to solve this??? GOING INSANE and cant figure it out

left tartan Dec 10, 2024, 12:42 AM

#

hard vortex anyone know how to solve this??? GOING INSANE and cant figure it out

Maybe open a help thread, and paste the problem as text? Hard to read this. #❓｜how-to-get-help

bleak glacier Dec 10, 2024, 12:46 AM

#

Hello, a question please, what a better approach to training a model, If no improvement is made in this run reload to the last point of improvement (weights wise), or keep on training with decreased accuracy?
Maybe it's a dip but the model still got some parameters which will help it better improve on next runs / long run
For image segmentation if that matters

#

I mean if for example I train the model for x amount of runs (each have y amount of epoches) and after each run it saves the weights, and in one run the accuracy of the validation is worse than the privous run, should I keep training with those weights on the next run or reload the last weights which had better accuracy score

glossy urchin Dec 10, 2024, 3:52 AM

#

im using a simple rnn model

#

however when i plot the predicted vs actual i get

toxic palm Dec 10, 2024, 4:13 AM

#

Hi,
could anyone please check my understanding on this:

Athena vs Redshift
Athena can't run SQL queries on the data that is present inside the warehouse(For ex: Redshift).
Athena works only on the data present inside S3 bucket.

Athena vs Glue
Catalog can be created by athena service / glue service.
Recommended way to generate catalog is by glue. The reason is simple. Athena can prepare catalog only based on the data from s3.
How about preparing the catalog from may sources like S3, SQL DB, No-SQL DB etc ? Only glue can do that.

If you want to correct/add new thing, pls.

Note: I am not literally comparing the above services. They are there for different purpose. My intention is to compare certain aspects which are command between services.

Also, if glue can do more than what athena can do, whats the purpose of Athena?

untold fable Dec 10, 2024, 5:04 AM

#

what sould next

#

i have done a oursse in coursera

#

form andrew ng machine learning spelication

stark kelp Dec 10, 2024, 7:52 AM

#

There should be a channel for Quantitative Finance

ocean hinge Dec 10, 2024, 8:42 AM

#

untold fable form andrew ng machine learning spelication

Deep learning by the same guy.

plain flax Dec 10, 2024, 9:51 AM

#

What GPUs are the best for machine learning? (pytorch)

lapis sequoia Dec 10, 2024, 9:51 AM

#

plain flax What GPUs are the best for machine learning? (pytorch)

TPUs and NPUs

#

If you really wanted the best of the best, then of course you're not going to be using a normal GPU

#

you'd be using an A100 or something

jaunty helm Dec 10, 2024, 10:09 AM

#

lapis sequoia TPUs and NPUs

last time I checked both of those are inferior to just buying something from nvidia tho?

lapis sequoia Dec 10, 2024, 10:09 AM

#

jaunty helm last time I checked both of those are inferior to just buying something from nvi...

The A100 is an Nvidia TPU

#

They're only used for AI

jaunty helm Dec 10, 2024, 10:10 AM

#

esp. training
my memory is fuzzy, but isn't NPU in like a CPU, and TPU you rent from google, and there was another thing that's very tiny which may run image recognition nets

lapis sequoia Dec 10, 2024, 10:10 AM

#

More processing power than a 4090

lapis sequoia Dec 10, 2024, 10:10 AM

#

jaunty helm esp. training my memory is fuzzy, but isn't NPU in like a CPU, and TPU you rent ...

An NPU is a Neural Processing Unit, a TPU is a Tensor GPU

#

The former is for neural networks

#

the latter is used for things such as ChatGPT

jaunty helm Dec 10, 2024, 10:11 AM

#

lapis sequoia The A100 is an Nvidia TPU

last time I checked nvidia called them Tensor Core GPUs
which I don't think is the same as TPUs, those are google products

jaunty helm Dec 10, 2024, 10:12 AM

#

lapis sequoia the latter is used for things such as ChatGPT

former for neural network, latter for ChatGPT
that doesn't even make much sense as the architecture behind ChatGPT is a neural network

lapis sequoia Dec 10, 2024, 10:12 AM

#

jaunty helm last time I checked nvidia called them Tensor Core GPUs which I don't think is t...

nvm

lapis sequoia Dec 10, 2024, 10:13 AM

#

jaunty helm > former for neural network, latter for ChatGPT that doesn't even make much sens...

I was talking about scale

#

ChatGPT is a combination of 10,000 or so Nvidia A100 Tensor GPUs

#

*V100 not A100

#

different things, same purpose

jaunty helm Dec 10, 2024, 10:14 AM

#

anyway; still, your best hardware in terms of AI rn is still just everything nvidia (though the price isn't so attractive)

lapis sequoia Dec 10, 2024, 10:15 AM

#

jaunty helm anyway; still, your best hardware in terms of AI rn is still just everything nvi...

Yeah, the price is never attractive when you're talking about large scale stuff

#

And you did in fact ask for the best, not the most cost friendly

#

lol I just found out the A100 is around 4-5 times as powerful as the V100

jaunty helm Dec 10, 2024, 10:17 AM

#

lapis sequoia Yeah, the price is never attractive when you're talking about large scale stuff

I mean it's not even about large scale but software
pytorch and others were clearly built with CUDA in mind

lapis sequoia Dec 10, 2024, 10:18 AM

#

jaunty helm I mean it's not even about large scale but software pytorch and others were clea...

Everything that's using AI has an Nvidia chip at this point, they're just the best option

jaunty helm Dec 10, 2024, 10:18 AM

#

if you're on windows and have an amd card, well too bad cause pytorch ROCm ain't supported
intel is also here I guess, even worse than amd though

bleak glacier Dec 10, 2024, 10:24 AM

#

bleak glacier I mean if for example I train the model for x amount of runs (each have y amount...

Anyone?

plain flax Dec 10, 2024, 10:43 AM

#

lapis sequoia you'd be using an A100 or something

I didn’t know that those A5000 and so on where called differently, very interesting…

plain flax Dec 10, 2024, 10:45 AM

#

lapis sequoia And you did in fact ask for the best, not the most cost friendly

Yes

#

I’m investigating all of that, might come here later, thank you

lapis sequoia Dec 10, 2024, 10:49 AM

#

plain flax I didn’t know that those A5000 and so on where called differently, very interest...

The A5000 is a normal GPU in comparison to the A100

#

The A just means it's of the Ampere architecture

#

V means the Volta architecture

plain flax Dec 10, 2024, 10:54 AM

#

Very interesting indeed.

#

Do the numbers follow any logic?

#

I heard that CUDA Cores are the most important thing to look into…

jaunty helm Dec 10, 2024, 10:56 AM

#

plain flax I’m investigating all of that, might come here later, thank you

what do you actually plan to do

#

like

What GPUs are the best for machine learning? (pytorch)
what are you gonna do with pytorch? try implementing image recognition nets? run large language models? finetune large language models?

mild dirge Dec 10, 2024, 10:57 AM

#

plain flax I heard that CUDA Cores are the most important thing to look into…

The number of cores, the speed of the cores, and the latency/speed of data transfer to TPU/GPU. Those are the main things to look for.

#

Depending on the task at hand, the priority of these change.

plain flax Dec 10, 2024, 10:58 AM

#

It’s Deepfake technology in real time…

#

https://github.com/hacksider/Deep-Live-Cam

GitHub

GitHub - hacksider/Deep-Live-Cam: real time face swap and one-click...

real time face swap and one-click video deepfake with only a single image - hacksider/Deep-Live-Cam

#

Curious on if using two graphic cards will make that much of a difference

plain flax Dec 10, 2024, 11:00 AM

#

jaunty helm like > What GPUs are the best for machine learning? (pytorch) what are you gonna...

Image related.

jaunty helm Dec 10, 2024, 11:01 AM

#

plain flax https://github.com/hacksider/Deep-Live-Cam

I mean, from the looks of their github, a 3060 alr runs at 20 fps?
so just consumer grade is probably enough

plain flax Dec 10, 2024, 11:03 AM

#

I have a 3060 and it peaks at 20 FPS, usually it goes at around 7 frames (per second)

#

Willing to make an upgrade…

#

Although I can probably wait for the new generation

plain flax Dec 10, 2024, 2:01 PM

#

👏🏻👏🏻👏🏻

rugged sorrel Dec 10, 2024, 2:11 PM

#

Hey, I had an idea for a model that could create Virtual Machines/Environments to run software not typically compatible on the system. I.e. Excel on Linux, but without the need to use Wine, and I also wanted to use it for games.

limpid zenith Dec 10, 2024, 2:12 PM

#

how is this related to DS and AI?

buoyant vine Dec 10, 2024, 2:12 PM

#

2 things, why this channel, and is this not already solved by Steam's Proton

#

ok, 3 things actually, WINE is normally better than a VM if you can use it because it distinctly isn't a VM

rugged sorrel Dec 10, 2024, 2:13 PM

#

Oh, sorry, I forgot to mention I was going to use AI to make the environment.

#

As in, it scans the software and sees what the requirements are.

limpid zenith Dec 10, 2024, 2:15 PM

#

I was going to use AI to make the environment.
like an LLM or what?

rugged sorrel Dec 10, 2024, 2:44 PM

#

Yeah? Something like that. I apologize I should have phrased it better.

#

Well, maybe not a large language model.

agile cobalt Dec 10, 2024, 2:47 PM

#

rugged sorrel As in, it scans the software and sees what the requirements are.

that isn't very realistic I think

what exactly do you have in mind when you say "scans the software"?

buoyant vine Dec 10, 2024, 3:01 PM

#

rugged sorrel As in, it scans the software and sees what the requirements are.

You are describing Proton, but with AI instead of just procedural detection

untold cliff Dec 10, 2024, 3:17 PM

#

Can you explain how nested cross validation is performed or point me to some resource that does so?
The way I understand it so far is:

split the data into k-folds
for i from 1 to k:
- test set = fold[i], outer training set = remaining (k - 1) folds
- let's call the outer training set folds out_folds for example
- for each model that we have:
  - for j from 1 to (k - 1):
    - validation set = out_folds[j], inner training set = remaining (k - 2) folds
    - let's call the inner training set folds in_folds for example
    - train the model on in_folds, evaluate on the validation set
  - report the mean performance of the model on all folds
- Now we should have performance measures for all models on the first round of cross validation on the outer fold, so do we pick the best model, train it on the first set of out_folds and evaluate it on the test set (fold[i])? This way we might report different models as the best one from one outer fold to another right?
  Or we would do the above steps on all combinations of inner folds, and then report only one final model which performs best on all of them, and then apply cross-validation on this model with the outer folds?

weary timber Dec 10, 2024, 4:30 PM

#

is learning reinforcement learning right after cnn a good idea?

unkempt apex Dec 10, 2024, 5:00 PM

#

weary timber is learning reinforcement learning right after cnn a good idea?

RL is bit different , basic knowledge of NN, some begineer level algorithms can be also good

weary timber Dec 10, 2024, 5:01 PM

#

unkempt apex RL is bit different , basic knowledge of NN, some begineer level algorithms can ...

so wdym

#

i dont learn?

unkempt apex Dec 10, 2024, 5:02 PM

#

learn then

#

do you know basics of ML?

#

of course you do!

#

then continue to learn RL

weary timber Dec 10, 2024, 5:08 PM

#

unkempt apex of course you do!

are you being sarcastic with this?

unkempt apex Dec 10, 2024, 5:10 PM

#

weary timber are you being sarcastic with this?

no I was just rechecking

smoky quest Dec 10, 2024, 6:32 PM

#

how do i get started on building a multiagent ai system?

limber spear Dec 10, 2024, 7:47 PM

#

smoky quest how do i get started on building a multiagent ai system?

What is multiagent. I am hearing about ai agent. What is all of that.

smoky quest Dec 10, 2024, 7:49 PM

#

limber spear What is multiagent. I am hearing about ai agent. What is all of that.

Its just ai's that are not fully chatbots its a network of ai's discussing stuff like email automation, coding, data analsyst or researching and you put all theese agents together and they discuss how to make their task succsesful. im also kinda new to it

pastel condor Dec 10, 2024, 9:50 PM

#

I am on 3.11.9 of Python. CUDA 12.6.3.

Need to train a model asap, have a hackathon in 5 hours. Not sure how to install PyTorch.

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu12

I tried this but it gave an error.

tranquil zenith Dec 10, 2024, 10:53 PM

#

I am having massive trouble with reading and merging pandas dataframes. I am getting seemingly contradictory output from code.

I am on MacOS Sonoma 14.6.1, with Apple M2 chip. I have python 3.11.5. I have anaconda python installed.

Problem #1: This assert fails, which it shouldn't on an outer join. I checked, the dtype of truths.id, edges.id, and merged1.id are all int64

import csv
import pandas as pd

#increase size limit because file is too big
csv.field_size_limit(10**9)

truths = pd.read_csv("truths.tsv", 
                     sep="\t", 
                     quotechar='"', 
                     engine="python", 
                     on_bad_lines='skip')

edges = pd.read_csv("truth_hashtag_edges.tsv", 
                     sep="\t", 
                     quotechar='"', 
                     engine="python", 
                     on_bad_lines='skip')

merged1 = truths.merge(edges, on = "id", how = "outer", indicator = True)

for item in merged1.id:
    assert (item in truths.id or item in edges.id)```

Problem #2: When I run this after the above code, directly contradicting the fact that the assert failed
`(merged1.id.isin(truths.id) | merged1.id.isin(edges.id)).all()`

Problem #3: This sequence of asserts passes:
```python
for item in edges.id:
    assert item in truths.id

But edges.id.isin(truths.id) gives a pd.Series with some values True and some values False. If that assert passed, it should be all True.

Also I can't easily manually inspect this stuff because I'm dealing with dataframes with hundreds of thousands of rows.

#

Also the dataset is the Truth Social dataset found at https://zenodo.org/records/7531625

serene scaffold Dec 10, 2024, 11:11 PM

#

tranquil zenith I am having massive trouble with reading and merging pandas dataframes. I am get...

Please do print(merged1.head().to_dict()) and put the result in the paste bin. Please do not post any screenshots.

#

!paste

#

@tranquil zenith nevermind; I downloaded the dataset and ran your code.

#

The problem is that x in series tells you if x is an index for an element in that series. not if it's an element in the series.

#

In [6]: truths.id
Out[6]:
0          703265
1          807614
2          807615
3          807618
4          807619
           ...
739774    1060918
739775    1060919
739776    1060920
739777    1060928
739778    1060937
Name: id, Length: 739779, dtype: int64

In [7]: 1060937 in truths.id
Out[7]: False

#

In [11]: pd.Series.__contains__?
Signature: pd.Series.__contains__(self, key) -> 'bool_t'
Docstring: True if the key is in the info axis
File:      c:\users\17032\appdata\local\programs\python\python312\lib\site-packages\pandas\core\generic.py
Type:      function

In [12]: pd.Series.__iter__?
Signature: pd.Series.__iter__(self) -> 'Iterator'
Docstring:
Return an iterator of the values.

pine snow Dec 11, 2024, 10:28 AM

#

deep veldt Dec 11, 2024, 11:58 AM

#

what's the differences between Constructive loss and triplet loss? and how do i use them in pytorch?

mild dirge Dec 11, 2024, 12:56 PM

#

pine snow

It is likely due to the initialization of the cluster centers. You could try come up with some sort of "error" metric, like average distance to cluster mean, and run the algorithm multiple times with different initializations, and pick the one with the lowest error.

odd meteor Dec 11, 2024, 5:35 PM

#

Is anyone at NeurIPS?

odd meteor Dec 11, 2024, 5:49 PM

#

pastel condor I am on 3.11.9 of Python. CUDA 12.6.3. Need to train a model asap, have a hacka...

It'd have been easier to profer a solution if you had mentioned the specific error message you got .

Meanwhile, I've been using light-the-torch since I discovered it.

pip install light-the-torch

https://github.com/pmeier/light-the-torch

GitHub

GitHub - pmeier/light-the-torch: Install PyTorch distributions with...

Install PyTorch distributions with computation backend auto-detection - pmeier/light-the-torch

past meteor Dec 11, 2024, 6:22 PM

#

pine snow

@mild dirge is right, in practice K-means++ is used a lot. It's same algo with a smarter initialisation

#

It's pretty straight forward to implement https://en.wikipedia.org/wiki/K-means%2B%2B

K-means++

In data mining, k-means++ is an algorithm for choosing the initial values (or "seeds") for the k-means clustering algorithm. It was proposed in 2007 by David Arthur and Sergei Vassilvitskii, as an approximation algorithm for the NP-hard k-means problem—a way of avoiding the sometimes poor clusterings found by the standard k-means algorithm. It ...

past bramble Dec 11, 2024, 6:33 PM

#

i want to run a llama model locally, currently i have installed it with ollama and it runs on its local server port. I want to be able to load the model separately and have more freedom over it, probably like a .keras or .h5 file. Can anyone guide me to what I'm looking for?

serene scaffold Dec 11, 2024, 6:35 PM

#

past bramble i want to run a llama model locally, currently i have installed it with ollama a...

please link the model card for the exact llama model you're talking about (there are several), and say how much GPU RAM you have.

past bramble Dec 11, 2024, 6:35 PM

#

I'm starting with a smaller model, llama3.1:8b

#

I'm not using a GPU for this, it seems to work fine on my laptop

serene scaffold Dec 11, 2024, 6:37 PM

#

past bramble I'm not using a GPU for this, it seems to work fine on my laptop

what are you trying to do with the model?

past bramble Dec 11, 2024, 6:38 PM

#

For now nothing much, a web app for this model. Later on I want to try fine tuning it and much more

agile cobalt Dec 11, 2024, 6:40 PM

#

past bramble i want to run a llama model locally, currently i have installed it with ollama a...

probably use whatever is in these directories? https://github.com/ollama/ollama/blob/main/docs/faq.md#where-are-models-stored

#

alternatively download from another site like hugging face instead of using their cli

past bramble Dec 11, 2024, 6:46 PM

#

let me take a look at that

unkempt wigeon Dec 11, 2024, 10:02 PM

#

Ordered a book on pytorch so the document is in front of me

unkempt wigeon Dec 12, 2024, 2:48 AM

#

May I ask a question for those who have made specific type of neural network? Sorry

serene scaffold Dec 12, 2024, 3:16 AM

#

unkempt wigeon May I ask a question for those who have made specific type of neural network? So...

!mute 868137138091343925 "1 day" You previously agreed to "not apologize for everything and not asking any questions to ask a question", and you have done that. This style of engagement is irritating for other users.

arctic wedgeBOT Dec 12, 2024, 3:16 AM

#

:incoming_envelope: :ok_hand: applied timeout to @unkempt wigeon until <t:1734059787:f> (1 day).

unkempt apex Dec 12, 2024, 7:34 AM

#

serene scaffold !mute 868137138091343925 "1 day" You previously agreed to "not apologize for eve...

here we go again!

unkempt apex Dec 12, 2024, 7:48 AM

#

past bramble i want to run a llama model locally, currently i have installed it with ollama a...

just go with hugging face

#

it is very easy

past bramble Dec 12, 2024, 8:29 AM

#

do you have a documentation or tutorial to use hugging face for this

deep veldt Dec 12, 2024, 10:40 AM

#

deep veldt what's the differences between Constructive loss and triplet loss? and how do i ...

can someone help me with this?

jaunty helm Dec 12, 2024, 1:40 PM

#

past bramble do you have a documentation or tutorial to use hugging face for this

huggingface transformers

#

it's mostly for inference, for finetuning you'll have to find something else
tho finetuning without a gpu sounds like asking for hell

heavy crow Dec 12, 2024, 3:16 PM

#

im trying to do facial keypoint detection but am running into a problem where my model plateaus pretty quickly and seemingly doesnt want to move. ive tried varying LR, dropout, BN, augmentations, model size etc but cant get it to work. Any ideas?

#

this is my current model:

class KeypointModel(nn.Module):
    def __init__(self, hparams):
        super().__init__()
        self.hparams = hparams
        
        self.features = nn.Sequential(
            # First conv layer
            nn.Conv2d(1, 64, kernel_size=5, stride=2, padding=2),  # 96x96 -> 48x48
            nn.ReLU(),
            nn.BatchNorm2d(64),
            
            # Second conv layer
            nn.Conv2d(64, 128, kernel_size=5, stride=2, padding=2),  # 48x48 -> 24x24
            nn.ReLU(),
            nn.BatchNorm2d(128),
            
            # Large pooling to reduce dimensions
            nn.MaxPool2d(kernel_size=4, stride=4),  # 24x24 -> 6x6
        )
        
        # fully connected layers
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(128 * 6 * 6, 1024),
            nn.ReLU(),
            nn.Linear(1024, 512),
            nn.ReLU(),
            nn.Linear(512, 30)
        )

    def forward(self, x):
        if x.dim() == 3:
            x = torch.unsqueeze(x, 0)
        x = self.features(x)
        x = self.classifier(x)
        return x

Keypoints are 15 coordinates (x,y)

#

i was hoping to overfit first but i cant seem to even do that. even with many epochs

mild dirge Dec 12, 2024, 7:44 PM

#

Seems like a pretty small model @heavy crow , maybe try adding more convolutional layers with a stride of 1?

fresh bay Dec 12, 2024, 8:24 PM

#

heavy crow i was hoping to overfit first but i cant seem to even do that. even with many ep...

what is your learning rate?

#

you are on the right path with trying to overfit first

brave sand Dec 12, 2024, 9:08 PM

#

is cupy really much faster than numpy?

agile cobalt Dec 12, 2024, 9:33 PM

#

brave sand is cupy really much faster than numpy?

GPUs are orders of magnitude faster than CPUs for certain operations

Assuming that you have the required hardware and drivers setup, using libraries that execute heavy operations in it will be much faster than using libraries that execute the same operations in the CPU

brave sand Dec 12, 2024, 9:34 PM

#

agile cobalt GPUs are orders of magnitude faster than CPUs for certain operations Assuming t...

im currently working on a project that is implementing algorithms like q learning but with a shared action and state space and it takes forever to run simulations

#

im already using sparse matrices

agile cobalt Dec 12, 2024, 9:37 PM

#

If you are already using numpy efficiently and have a GPU you can use, then it may be worth trying to migrate

If you aren't even using numpy efficiently/correctly, I'd recommend starting there instead of changing libraries though (most libraries are similar to numpy one way or the other)

For example, are you using any for loops to iterate over rows?

brave sand Dec 12, 2024, 9:47 PM

#

agile cobalt If you are already using numpy efficiently and have a GPU you can use, then it m...

yes i am

agile cobalt Dec 12, 2024, 9:47 PM

#

....for the first part or the last question?....

brave sand Dec 12, 2024, 9:47 PM

#

agile cobalt ....for the first part or the last question?....

second part sorry

agile cobalt Dec 12, 2024, 9:48 PM

#

iterating over numpy arrays completely destroys any performance benefits you could hope to gain from using it

same but even worse with GPU libraries

#

if you haven't yet, I very strongly recommend reading the numpy user guide

heavy crow Dec 12, 2024, 11:02 PM

#

fresh bay what is your learning rate?

I'm using a LR (on plateau) scheduler but starting between 1e-3 and -6.

heavy crow Dec 12, 2024, 11:02 PM

#

mild dirge Seems like a pretty small model <@136174338633105408> , maybe try adding more co...

Yes I tried this as well, without much luck

#

It seems random. If I reinitialize the model a few times sometimes it will fit great

#

But not over fit. Val los sinks with training loss. Overall it seems very sensitive to batch size, LR, and initialization. I'm using Adam btw

silver hill Dec 13, 2024, 12:16 AM

#

why is llama so popular?

#

when I read messages in this server, everyone who wants to improve a bot choose llama

serene scaffold Dec 13, 2024, 12:24 AM

#

silver hill when I read messages in this server, everyone who wants to improve a bot choose ...

Technologies like ChatGPT and Llama are generative (large) language models. Do not refer to generative LLMs as bots, as everyone has their own preconceived notion of what a "bot" is, and it often won't be "generative LLMs".

The Llama family of generative LLMs are probably the most capable open-source ones. The GPT family (not including GPTs 1 and 2) are generally more capable, but they're proprietary.

silver hill Dec 13, 2024, 12:24 AM

#

serene scaffold Technologies like ChatGPT and Llama are generative (large) language models. Do n...

thanks for clarifying

serene scaffold Dec 13, 2024, 12:25 AM

#

silver hill thanks for clarifying

Mixtral is another generative LLM that in my experience is very capable.

silver hill Dec 13, 2024, 12:26 AM

#

what I never get is how neural networks work, I tried to make a sentence boundary code and I had issues with the neural network part, I mean, the basic part is more or less easy, (sum(input)*weight)+bias

#

well and the threshold and stuff

#

but I dont have any idea of how to apply does to make a neural network

silver hill Dec 13, 2024, 12:28 AM

#

serene scaffold Mixtral is another generative LLM that in my experience is very capable.

and youtube doesnt rlly help, any good homepage for learning the theory?

serene scaffold Dec 13, 2024, 12:30 AM

#

silver hill and youtube doesnt rlly help, any good homepage for learning the theory?

for learning the theory, starting where?

fresh bay Dec 13, 2024, 12:36 AM

#

heavy crow I'm using a LR (on plateau) scheduler but starting between 1e-3 and -6.

id go to a constant learning rate and do one of like 1e-2 and paste teh graph here

livid locust Dec 13, 2024, 2:26 AM

#

https://datascience.stackexchange.com/questions/130922/why-does-my-lstm-only-print-out-one-day

Data Science Stack Exchange

Why does my LSTM only print out one day?

I made an LSTM for predicting stocks and it only predicts 1 day but I want to predict a range.
def make_new_model(self):
# simple starting model
# builds the lstm model
###############...

#

i posted about my lstm here a few days ago and didnt come back i apologize (6 CS courses are killing me rn)

#

but if anyone could check this out sm appreciated

#

i wanted to show the variance (stuff related to accuracy of my model) and i ran into an issue

untold fable Dec 13, 2024, 6:01 AM

#

where to learn full pytorch

#

does there any course

left tartan Dec 13, 2024, 6:03 AM

#

untold fable where to learn full pytorch

Start here: https://pytorch.org/tutorials/

silver hill Dec 13, 2024, 11:12 AM

#

serene scaffold for learning the theory, starting where?

I guess from the very beginning

knotty jacinth Dec 13, 2024, 11:51 AM

#

Hello everyone, I am using insightface arcface models for face detection and recognition
The model returns embeddings for each face it detects. Embeddings are multidimensional vector 512d ( im flattening the array at the end either ways to perform cosine similarity on them)
so what is the industry standard for storing these vectors and comparing?
currently what I do is I have target_vector, vectors_in_group_photo.
I do a sequential comparison of them with each other using cosine similarity, if the similarity is below threshold. I make a verdict.
but the issue is if I need to find 2 people in the same image the time complexity for this becomes O(n²).
60 people in the image, 30 target people to recognise. Calculating cosine similarity for each is a resource intensive task.
how does one tackle it, is there any better approaches for it. Also would be eager to get information on how to store these vector embeddings once calculated

deep veldt Dec 13, 2024, 1:10 PM

#

how do i get a random image from a ImageFolder dataset class?

serene scaffold Dec 13, 2024, 2:31 PM

#

deep veldt how do i get a random image from a `ImageFolder` dataset class?

have you looked at the methods that are available for that class? and can you link to the docs for them?

deep veldt Dec 13, 2024, 2:31 PM

#

serene scaffold have you looked at the methods that are available for that class? and can you li...

yes i did and still couldnt understand it

serene scaffold Dec 13, 2024, 2:32 PM

#

deep veldt yes i did and still couldnt understand it

can you give the link? cuz I don't even know what that class is. I do nlp.

deep veldt Dec 13, 2024, 2:36 PM

#

serene scaffold can you give the link? cuz I don't even know what that class is. I do nlp.

https://pytorch.org/vision/main/generated/torchvision.datasets.ImageFolder.html

normal spire Dec 13, 2024, 3:13 PM

#

hello everyone! I'm creating a system that performs credit analysis with Django Rest integrated with a ML Algorithm that will make a prediction if the client is a good client or not. I develop all the training and pipeling on a jupyter eviroment, integrate on Django, but I'm getting this error:

TypeError at /api/credit-analysis/3/
ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''

I verified on Jupyter if had any df with NaN value, but it isn't, also verified if the data extracted from the DB had, but isn't as well. Any tips or help?

serene scaffold Dec 13, 2024, 3:14 PM

#

normal spire hello everyone! I'm creating a system that performs credit analysis with Django ...

can you show the whole traceback and the code that causes the error?

#

Read the DM from the @arctic wedge bot, if you got one

normal spire Dec 13, 2024, 3:19 PM

#

https://paste.pythondiscord.com/UBSA

Can you read? There has both the code and the traceback

#

I'm not that system developer, so forgive any silly mistake :p

serene scaffold Dec 13, 2024, 3:24 PM

#

normal spire https://paste.pythondiscord.com/UBSA Can you read? There has both the code and...

what type is xp in if xp.any(xp.isnan(known_values)):? can you show the code where xp = happens?

#

or is xp a module?

normal spire Dec 13, 2024, 3:39 PM

#

I think is a module of sklearn, I found here the code, one moment

#

https://paste.pythondiscord.com/WAZQ

#

I think can be something about the categorical variables that I encoded to fit the ML Algorithm, but on jupyter works well. I will make some tests with these pipelines outside de django project, for tests.

abstract wasp Dec 13, 2024, 5:50 PM

#

hi for those who have experience in mlops, which course do you recommend most?
https://www.udemy.com/course/mlops-course/
https://www.udemy.com/course/azure-machine-learning-mlops-mg/
https://www.udemy.com/course/deployment-of-machine-learning-models/

serene scaffold Dec 13, 2024, 6:01 PM

#

abstract wasp hi for those who have experience in mlops, which course do you recommend most? h...

do you know what Azure is, and do you already know for sure that you need to know how to use it?

silent helm Dec 13, 2024, 6:03 PM

#

can someone help me build a resume 😭 . I dont have any projects. I want to make some projects in Machine Learning or DL or LLMs.

serene scaffold Dec 13, 2024, 6:07 PM

#

silent helm can someone help me build a resume 😭 . I dont have any projects. I want to make...

please don't ask about the same thing in more than one place. you're already asking for help with this in #career-advice

silent helm Dec 13, 2024, 6:08 PM

#

serene scaffold please don't ask about the same thing in more than one place. you're already ask...

Oh sorry

abstract wasp Dec 13, 2024, 6:39 PM

#

serene scaffold do you know what Azure is, and do you already know for sure that you need to kno...

It’s like AWS? For deployment, traffic management, maintenance, etc?

serene scaffold Dec 13, 2024, 6:40 PM

#

abstract wasp It’s like AWS? For deployment, traffic management, maintenance, etc?

yes, it's like AWS. do you know for sure that you will need to use it? because it would be a waste of your time to learn Azure and then get hired by a company that only uses AWS (likely) or on-prem infrastructure.

abstract wasp Dec 13, 2024, 6:42 PM

#

serene scaffold yes, it's like AWS. do you know for sure that you will need to use it? because i...

No idk fs that I’ll use it

abstract wasp Dec 13, 2024, 6:55 PM

#

serene scaffold yes, it's like AWS. do you know for sure that you will need to use it? because i...

So from the other two which one do you recommend?

lapis sequoia Dec 13, 2024, 7:45 PM

#

can anyone help me in dms with my scikit?

serene scaffold Dec 13, 2024, 7:53 PM

#

lapis sequoia can anyone help me in dms with my scikit?

No; ask your question here

serene scaffold Dec 13, 2024, 7:53 PM

#

abstract wasp So from the other two which one do you recommend?

I'll respond when I can

fresh bay Dec 13, 2024, 8:00 PM

#

so I am following a post online for implementing a contrastive loss function in python like this

class ContrastiveLoss(nn.Module):
    def __init__(self, temperature=0.05):
        super(ContrastiveLoss, self).__init__()
        self.temperature = temperature

    def forward(self, z_i, z_j):
        batch_size = z_i.shape[0]
        z = torch.cat([z_i, z_j], dim=0)
        sim_matrix = torch.mm(z, z.T) / self.temperature
        sim_matrix = sim_matrix - torch.eye(batch_size * 2).to(z.device)
        labels = torch.cat([torch.arange(batch_size), torch.arange(batch_size)]).to(z.device)
        loss = nn.CrossEntropyLoss()(sim_matrix, labels)
        return loss

where zi, zj is being generated from

    for images in trainloader:
        images = images[0].to(device)
        images = torch.cat([images, images], dim=0)

        z_i, z_j = model(images).chunk(2, dim=0)

        loss = contrastive_loss(z_i, z_j)
``` and each  images in trainloader is a batch of 256 images 

However this seems wrong no? Shouldnt the images being fed to the model be different in constrastive loss? Where we have one positive pair but the positive pair shouldnt be the same image should it? 

Also I am confused as to how that loss function is actually the contrastive loss function shouldnt it only be comparing the first two images

weary timber Dec 13, 2024, 8:02 PM

#

how do i shuffle datasets while keeping its labels aligned

#

like i have a dataset of [1,2,3]

fresh bay Dec 13, 2024, 8:03 PM

#

are you using pytorch

weary timber Dec 13, 2024, 8:03 PM

#

fresh bay are you using pytorch

nope from scratch 😭

#

im torturing myself

fresh bay Dec 13, 2024, 8:03 PM

#

just define a new index array and shuffle it that way

weary timber Dec 13, 2024, 8:03 PM

#

wdym

fresh bay Dec 13, 2024, 8:05 PM

#

are you letting yourself import libraries?

#

arr1 = np.array([1, 2, 3, 4, 5])
shuffle_array = [4, 3, 2, 0, 1]
arr1[shuffle_array]

#

like that works

#

@weary timber

weary timber Dec 13, 2024, 8:08 PM

#

fresh bay are you letting yourself import libraries?

ofc

#

kk

rich moth Dec 13, 2024, 8:11 PM

#

fresh bay so I am following a post online for implementing a contrastive loss function in ...

I use this in one of my codes, maybe it can help you.

    """
    InfoNCE Loss for contrastive learning with proper dimension handling.
    """
    def __init__(self, temperature=0.07):
        super(InfoNCELoss, self).__init__()
        self.temperature = temperature
        self.criterion = nn.CrossEntropyLoss()

    def forward(self, features_1, features_2, labels):
        """
        Compute InfoNCE loss between two sets of features.
        Args:
            features_1: First set of features (B x D)
            features_2: Second set of features (B x D)
            labels: Tensor of labels
        """
        if features_1.size(0) == 0 or features_2.size(0) == 0:
            return torch.tensor(0.0, device=features_1.device)

        # Project both feature sets to same dimension if needed
        if features_1.size(-1) != features_2.size(-1):
            projection_dim = min(features_1.size(-1), features_2.size(-1))
            features_1 = nn.Linear(features_1.size(-1), projection_dim).to(features_1.device)(features_1)
            features_2 = nn.Linear(features_2.size(-1), projection_dim).to(features_2.device)(features_2)

        # Normalize the features
        features_1 = F.normalize(features_1, dim=1)
        features_2 = F.normalize(features_2, dim=1)

        # Calculate similarity matrix
        logits = torch.matmul(features_1, features_2.T) / self.temperature
        
        # Labels for contrastive loss
        labels = torch.arange(logits.size(0), device=logits.device)
        
        # Calculate loss in both directions
        loss_i = self.criterion(logits, labels)
        loss_t = self.criterion(logits.T, labels)
        
        return (loss_i + loss_t) / 2```

fresh bay Dec 13, 2024, 8:12 PM

#

thanks Ill look at this!

#

@rich moth it does look like this code is taking the same image and comparing it to itself does it not?

#

which seems odd

rich moth Dec 13, 2024, 8:18 PM

#

ya, that seems to be the issue it looks like its comparing the image with itself.

fresh bay Dec 13, 2024, 8:18 PM

#

which is def odd ok thanks!

rich moth Dec 13, 2024, 8:18 PM

#

no worries

lapis sequoia Dec 13, 2024, 8:51 PM

#

what could be causing it to run for so long

#

https://paste.pythondiscord.com/IJCA

proper laurel Dec 13, 2024, 9:01 PM

#

Hey snow

lapis sequoia Dec 13, 2024, 9:01 PM

#

proper laurel Hey snow

hey

proper laurel Dec 13, 2024, 9:02 PM

#

So do u still need help

lapis sequoia Dec 13, 2024, 9:06 PM

#

proper laurel So do u still need help

yes

proper laurel Dec 13, 2024, 9:06 PM

#

Oh ok

rich moth Dec 13, 2024, 9:07 PM

#

Ive been working on a model for crypto price prediction, the idea is to try to accurately forecast future asset returns and also providing uncertainty estimates in its predictions. Ive still gone some more fine tuning in the parameters in the optuna study but im really happy with the results right out of the gate.

proper laurel Dec 13, 2024, 9:07 PM

#

rich moth Ive been working on a model for crypto price prediction, the idea is to try to a...

Can u pls help snow for me

rich moth Dec 13, 2024, 9:07 PM

#

proper laurel Can u pls help snow for me

sure, i can try

proper laurel Dec 13, 2024, 9:08 PM

#

Thanks bro

lapis sequoia Dec 13, 2024, 9:08 PM

#

tyy

rich moth Dec 13, 2024, 9:18 PM

#

lapis sequoia what could be causing it to run for so long

What part of the code seems to be running for so long? How big is the xlxs files? I mean do you have a lot of columns and values? You're preforming a grid search. It can take awhile depending on your specs and data, consider RandomizedSearchCV

#

A good idea is to create more logging and it will help you track down the issue more.

lapis sequoia Dec 13, 2024, 9:19 PM

#

rich moth What part of the code seems to be running for so long? How big is the xlxs file...

can you hop in vc possibly?

rich moth Dec 13, 2024, 9:19 PM

#

vc?

lapis sequoia Dec 13, 2024, 9:22 PM

#

rich moth vc?

voice chat

proper laurel Dec 13, 2024, 9:24 PM

#

Plunder can do it

rich moth Dec 13, 2024, 9:26 PM

#

lapis sequoia voice chat

not right now. maybe later

proper laurel Dec 13, 2024, 9:26 PM

#

I would do it if I could

#

But I cant

#

I'm going to sleep so

gentle wyvern Dec 13, 2024, 9:56 PM

#

rich moth Ive been working on a model for crypto price prediction, the idea is to try to a...

what are you basing the Gaussian distribution on?

mighty lake Dec 13, 2024, 10:17 PM

#

for data science is it better to be a jack of all trades and focus on being able to work in diff languages and frameworks or is it better to focus on specific things and languages? dk if I should focus on python or start learning R. @ me replies

#

my overall python knowledge is decent, have 0 R knowledge though

serene scaffold Dec 13, 2024, 10:18 PM

#

mighty lake for data science is it better to be a jack of all trades and focus on being able...

I've never been asked to use R a single time ever.

quartz karma Dec 13, 2024, 10:19 PM

#

Hi, can python programs using existing ML packages be optimized highly efficient?

#

Like, as efficient as programs written by c++ or so?

serene scaffold Dec 13, 2024, 10:21 PM

#

quartz karma Like, as efficient as programs written by c++ or so?

this question is a lot more complicated than you probably realize. what is your goal for having the answer, and we'll work backwards from there.

mighty lake Dec 13, 2024, 10:21 PM

#

serene scaffold I've never been asked to use R a single time ever.

cool thanks

quartz karma Dec 13, 2024, 10:23 PM

#

serene scaffold this question is a lot more complicated than you probably realize. what is your ...

I'm thinking about the limit of computing capability of ML packages. For example, on a PC if I feed 10^6 training data each having 10^6 features, than before if can give any result will the program collapse ?

serene scaffold Dec 13, 2024, 10:25 PM

#

quartz karma I'm thinking about the limit of computing capability of ML packages. For example...

what does it mean for a program to "collapse"? programs don't just "collapse" if they don't finish after a certain amount of time.

#

@quartz karma if you want to learn about ML, and you only have time to learn one of Python and C++, do not pick C++.

quartz karma Dec 13, 2024, 10:27 PM

#

serene scaffold what does it mean for a program to "collapse"? programs don't just "collapse" if...

like, will the 10^6 features exceed maxiumn num of features allowed which may lead to program terminating?

serene scaffold Dec 13, 2024, 10:28 PM

#

quartz karma like, will the 10^6 features exceed maxiumn num of features allowed which may le...

that's a matter of how much RAM your computer has. which is a hardware question.

quartz karma Dec 13, 2024, 10:30 PM

#

serene scaffold that's a matter of how much RAM your computer has. which is a hardware question.

so the internal variables of python doesnt have length limit or something?

serene scaffold Dec 13, 2024, 10:31 PM

#

quartz karma so the internal variables of python doesnt have length limit or something?

Nope.
It sounds like you're very new to programming. If you're worried about optimizing the memory or runtime of your programs, I would encourage you to stop worrying about that. You don't currently know enough to understand what actually isn't optimal.

quartz karma Dec 13, 2024, 10:34 PM

#

serene scaffold Nope. It sounds like you're very new to programming. If you're worried about opt...

Yeh maybe. I'm just surprised that there's no limitation of feature numbers or other parameters when using python ml pkgs...

serene scaffold Dec 13, 2024, 10:35 PM

#

quartz karma Yeh maybe. I'm just surprised that there's no limitation of feature numbers or o...

all these things are represented in terms of multi-dimensional arrays (also called tensors) and you can have as many of them as you want, as large as you want, as long as you have enough RAM.

serene scaffold Dec 13, 2024, 10:38 PM

#

quartz karma I'm thinking about the limit of computing capability of ML packages. For example...

if you have 10^6 instances with 10^6 features, then that's 1 trillion feature values. And if each one is a 16 bit float, then that's almost 2 terabytes.

quartz karma Dec 13, 2024, 10:40 PM

#

serene scaffold if you have 10^6 instances with 10^6 features, then that's 1 trillion feature va...

yes I agree that's huge amount, on the other hand consider how complex a ML model would be (esp. those with a scale close to LLM models), it's possible that we may need to deal with such huge data right?

serene scaffold Dec 13, 2024, 10:41 PM

#

quartz karma yes I agree that's huge amount, on the other hand consider how complex a ML mode...

yes. and all those LLMs are trained by Python code on machines with huge amounts of hardware resources.

quartz karma Dec 13, 2024, 10:46 PM

#

serene scaffold yes. and all those LLMs are trained *by Python code* on machines with huge amoun...

could it be that the real efficiency are coming from the fact that, on the lower level the computings are mainly carried out by CUDA in graphic memories, which is totally different from python?

serene scaffold Dec 13, 2024, 10:46 PM

#

quartz karma could it be that the real efficiency are coming from the fact that, on the lower...

it's not "could it be". it is.

#

if you rewrote all of it with C++, but you didn't use CUDA, it would be way too slow.
and if you rewrote it in C++ and you still used CUDA, the performance gains would be negligible, but it would have taken longer to write.

quartz karma Dec 13, 2024, 10:49 PM

#

So can I think it like this, that for smaller data/models we can handle them with existing python libs, and for big data size or LLM models, we use those suites from CUDA to achieve acceptable performance?

serene scaffold Dec 13, 2024, 10:50 PM

#

quartz karma So can I think it like this, that for smaller data/models we can handle them wit...

existing python libs still use cuda.

quartz karma Dec 13, 2024, 10:51 PM

#

serene scaffold existing python libs still use cuda.

Could you provide some hints of examples?

serene scaffold Dec 13, 2024, 10:52 PM

#

quartz karma Could you provide some hints of examples?

pytorch.
LLMs can have 7 billion or even >140 billion parameters. but even models with hundreds of parameters can benefit from CUDA.

quartz karma Dec 13, 2024, 10:54 PM

#

serene scaffold pytorch. LLMs can have 7 billion or even >140 billion parameters. but even model...

that's fascinating. On the other hand if i'm using simple models from say sci-kit learn or xgboost, then i'm totally relying on performance from python code?

serene scaffold Dec 13, 2024, 10:56 PM

#

quartz karma that's fascinating. On the other hand if i'm using simple models from say sci-k...

the ones in sklearn will probably use numpy for performance gains. idk about xgboost

rich moth Dec 13, 2024, 11:20 PM

#

gentle wyvern what are you basing the Gaussian distribution on?

@gentle wyvern its being fit to the empirical distribution of the models prediction errrors on the validation set

fresh bay Dec 14, 2024, 12:14 AM

#

can someone take a look at this code and tell if this looks right for implementing contrastive loss?

class ContrastiveLoss(nn.Module):
    def __init__(self, temperature=0.5):
        super(ContrastiveLoss, self).__init__()
        self.temperature = temperature

    def forward(self, projections_1, projections_2):
        z_i = projections_1
        z_j = projections_2
        z_i_norm = F.normalize(z_i, dim=1)
        z_j_norm = F.normalize(z_j, dim=1)
        cosine_num = torch.matmul(z_i, z_j.T)
        cosine_denom = torch.matmul(z_i_norm, z_j_norm.T)
        cosine_similarity = cosine_num / cosine_denom

        numerator = torch.exp(torch.diag(cosine_similarity) / self.temperature)

        denominator = cosine_similarity
        diagonal_indices = torch.arange(denominator.size(0))
        denominator[diagonal_indices, diagonal_indices] = 0
        denominator = torch.exp(torch.sum(cosine_similarity, dim=1))
        loss = -torch.log(numerator / denominator).sum()
        return loss

inland crown Dec 14, 2024, 12:37 AM

#

rich moth Ive been working on a model for crypto price prediction, the idea is to try to a...

That's awesome! I've been working on a trading bot. So far in the testing it's done well!

fading wigeon Dec 14, 2024, 1:58 AM

#

Hey, there's something I'm trying to work out/understand. I'm going over material that proposes one way of choosing which model to use, but then later declares that this is a flawed methodology.

The proposed way to select models was to select which model had the smallest loss in the test set. I can believe it's a flawed strategy, just not sure I understand why.

In the context of the lecture video, it's related to selecting what degree of polynomial model to use and argues that the higher degree polynomials inherently give an overly optimistic amount of loss. This is very intuitive when we're looking at the training set, but how does this carry over when we're evaluating with a test set?

#

Okay, this might be one of those embarassing situations where I should have just kept watching 😄 They're arguing to make a training set, test set, and cross validation set

agile cobalt Dec 14, 2024, 2:17 AM

#

fading wigeon Hey, there's something I'm trying to work out/understand. I'm going over materi...

If you test a large amount of models, eventually you'll find some that perform better in your specific test set by pure ~~change~~ chance - effectively overfitting your architecture to that test set

Less complex models are less sensitive thus more likely to generalize to unseen data

fading wigeon Dec 14, 2024, 2:18 AM

#

Ah, gotcha, super fair. I'm familiar with that sort of thing in statistical analysis. Where if you happen to be testing a ton of hypotheses you have to use much stricter requirements

agile cobalt Dec 14, 2024, 2:18 AM

#

yeah... in many (if not nearly all) cases, the data you collected for tests is not going to be a perfect representation of real world data

fading wigeon Dec 14, 2024, 2:22 AM

#

I guess that's another reason why it's a good idea to have a dev, test, and train set.

lapis sequoia Dec 14, 2024, 4:32 AM

#

can anyone help me fine tune a model?

agile cobalt Dec 14, 2024, 4:33 AM

#

send your actual question, rather than asking if someone is available to only then maybe send it later

#

I'm going to sleep now though, and my only recommendation without more details would be taking a look at Kaggle resources

maybe someone can answer after you ask the actual question, good night

serene scaffold Dec 14, 2024, 4:35 AM

#

@lapis sequoia what etrotta said applies broadly to any time you want to talk to people on the internet. be sure to keep it in mind.

lapis sequoia Dec 14, 2024, 4:37 AM

#

serene scaffold <@456226577798135808> what etrotta said applies broadly to any time you want to ...

I meant in vc coz i wanna screenshare some questions

serene scaffold Dec 14, 2024, 4:38 AM

#

lapis sequoia I meant in vc coz i wanna screenshare some questions

that doesn't go without saying. but this server doesn't really support screenshare calls. if you want to get help, I suggest asking your question as best you can in this channel with code examples.

lapis sequoia Dec 14, 2024, 4:39 AM

#

serene scaffold that doesn't go without saying. but this server doesn't really support screensha...

ok here's my code:
https://paste.pythondiscord.com/DQIA

it's taking hours to process the data (800k entries), and these are the results:

Random Forest Cross-Validation Scores:
R² Scores: [0.52024959 0.61293593 0.6065748 0.60181169 0.45804028]
Mean R²: 0.5599 (+/- 0.1223)

Random Forest Metrics:
RMSLE: 0.07565448909307013
R² Score: 0.6019513442397915
RMSE: 5441924.603108913

#

how could i possibly speed it up and improve performance?

#

test data looks something like this and im trying to predict the prices

#

training data looks something like that

serene scaffold Dec 14, 2024, 4:41 AM

#

@lapis sequoia thank you. do you know what a nominal feature is?

lapis sequoia Dec 14, 2024, 4:41 AM

#

serene scaffold <@456226577798135808> thank you. do you know what a nominal feature is?

yes

serene scaffold Dec 14, 2024, 4:42 AM

#

lapis sequoia yes

I suggest eliminating them from X_train. features like brand and country probably have too many unique values.

#

I would probably also eliminate date_listed and use only year.

lapis sequoia Dec 14, 2024, 4:43 AM

#

so year, hours, manufacturer, model, type, connectivity?

#

condition is always used with very rarely "Good"

#

so i dont think this should be included

serene scaffold Dec 14, 2024, 4:45 AM

#

lapis sequoia so year, hours, manufacturer, model, type, connectivity?

I think you can make that determination. think of it from the computer's perspective: numeric features can inherently be compared to one another. but names/labels cannot. if a feature can only be meaningfully represented as something other than a number (like a country, or a brand name), you want it to have as few unique values as possible. just the essentials.

lapis sequoia Dec 14, 2024, 4:46 AM

#

serene scaffold I think you can make that determination. think of it from the computer's perspec...

thing is i wouldnt mind keep testing

#

but it takes 2-3hours to train once

#

and im runnnign out of time, what could i do to train as fast as possible

#

coz 800k entries is not optimal, but if i start doing like 10k it will not be as accurate

serene scaffold Dec 14, 2024, 4:47 AM

#

lapis sequoia and im runnnign out of time, what could i do to train as fast as possible

select only the bare minimum set of features that you think are relevant to the y data.

#

what is your y data? the price?

lapis sequoia Dec 14, 2024, 4:47 AM

#

yes

#

and this is how accurate i need it to be basically

serene scaffold Dec 14, 2024, 4:48 AM

#

lapis sequoia yes

so try picking only the features that you're sure are relevant for determining the price. maybe train with only 20% of the training data, and see how accurate it is on the test data.

#

then you can start adding more of the training data back in. which will increase training time.

lapis sequoia Dec 14, 2024, 4:49 AM

#

so

X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.8, random_state=42)

instead of

X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=42)

#

ight

serene scaffold Dec 14, 2024, 4:50 AM

#

lapis sequoia so ```py X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.8,...

no. keep the test size at 0.2. but then make X_train and y_train a subset of themselves

lapis sequoia Dec 14, 2024, 4:50 AM

#

oh wait how can i do that?

#

just subset[:100000]?

serene scaffold Dec 14, 2024, 4:50 AM

#

just pick the first n rows, or something simple like that

lapis sequoia Dec 14, 2024, 4:50 AM

#

well it'd be more like 200k

serene scaffold Dec 14, 2024, 4:51 AM

#

the important thing is that the mth row of X_train needs to line up with the mth row of y_train

lapis sequoia Dec 14, 2024, 4:51 AM

#

hmmm

#

so

#

 X_subset = X_train.sample(n=subset_size, random_state=42) 
 y_subset = y_train.loc[X_subset.index]

jaunty helm Dec 14, 2024, 4:52 AM

#

lapis sequoia but it takes 2-3hours to train once

try a model that's faster to train maybe?
e.g. reduce the n_estimators in your rfe

lapis sequoia Dec 14, 2024, 4:52 AM

#

basically that

lapis sequoia Dec 14, 2024, 4:52 AM

#

jaunty helm try a model that's faster to train maybe? e.g. reduce the `n_estimators` in your...

any recommendation?

#

xgboost?

jaunty helm Dec 14, 2024, 4:52 AM

#

lapis sequoia any recommendation?

did I not include in the same message what you can do

serene scaffold Dec 14, 2024, 4:52 AM

#

lapis sequoia ```py X_subset = X_train.sample(n=subset_size, random_state=42) y_subset = y_...

I think that would work; inspect X_subset and y_subset to confirm

lapis sequoia Dec 14, 2024, 4:52 AM

#

jaunty helm did I not include in the same message what you can do

no i meant a recommendation of a faster model

lapis sequoia Dec 14, 2024, 4:53 AM

#

serene scaffold I think that would work; inspect `X_subset` and `y_subset` to confirm

ight ty

jaunty helm Dec 14, 2024, 4:53 AM

#

lapis sequoia no i meant a recommendation of a faster model

...
I said, you can reduce the n_estimators parameter in your random forest regressor to make it train faster

lapis sequoia Dec 14, 2024, 4:53 AM

#

jaunty helm ... I said, you can reduce the `n_estimators` parameter in your random forest re...

yes but you also said try a model thats faster to train, so i just assumed u meant to use a diff model

serene scaffold Dec 14, 2024, 4:54 AM

#

jaunty helm ... I said, you can reduce the `n_estimators` parameter in your random forest re...

I don't think they know what that is. n_estimators appears once when they define param_grid, but that variable is never used.

lapis sequoia Dec 14, 2024, 4:54 AM

#

serene scaffold I don't think they know what that is. `n_estimators` appears once when they defi...

i do, i currently have a grid with 50-100-200

serene scaffold Dec 14, 2024, 4:56 AM

#

lapis sequoia i do, i currently have a grid with 50-100-200

you define param_grid in line 79 of your code, but you never use it.
the only hyperparameter that you specify (if it can even be considered a hyperparam) is the random state (which you set to the right number)

lapis sequoia Dec 14, 2024, 4:57 AM

#

serene scaffold you define `param_grid` in line 79 of your code, but you never use it. the only ...

i just removed it coz someone told me it'd be faster when im testing

#

X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=42)
subset_size = int(len(X_train) * 0.2)
X_subset = X_train.sample(n=subset_size, random_state=42)
y_subset = y_train.loc[X_subset.index]
rf_model.fit(X_subset, y_subset)
rf_predictions = rf_model.predict(X_val)

so basically like that?

jaunty helm Dec 14, 2024, 4:58 AM

#

lapis sequoia yes but you also said try a model thats faster to train, so i just assumed u mea...

bad wording from my part then
anyways, you can turn down the n_estimators or change other params like max_depth
another thing is the HGBR which is still a tree but should be faster
in terms of other gradient boosted trees, lightgbm faster than xgboost faster than catboost (usually)
also, trees would usually take longer than say a ridge or lasso or a simple linear reg; however, then you'd need to deal with categorical variables differently if you have those

lapis sequoia Dec 14, 2024, 4:59 AM

#

okay sounds good

lapis sequoia Dec 14, 2024, 4:59 AM

#

lapis sequoia ```py X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, ran...

or do i nede to change x_val and y_val as well?

deep veldt Dec 14, 2024, 5:11 AM

#

what's the difference between ImageFolder.imgs and ImageFolder.samples ?

weary timber Dec 14, 2024, 8:58 AM

#

can someone tell me how do i work with googles quick draw dataset when its unlabeled

#

i need to shuffle it too

deep veldt Dec 14, 2024, 11:42 AM

#

is it necessary to use sigmoid on the subnet output in a siamese network?

fiery dust Dec 14, 2024, 1:16 PM

#

hey guys

#

is it a bad idea to create a model that is bad predicting, so that you assume the opposite will happen?

#

for example, almost every time my model thinks it's going to rain, it doenst

#

so I kinda know that whenever my model thinks it's gonna rain, it wont haha

weary timber Dec 14, 2024, 1:18 PM

#

fiery dust so I kinda *know* that whenever my model thinks it's gonna rain, it wont haha

smaaaaaaaaaart

fiery dust Dec 14, 2024, 1:18 PM

#

-_-

#

it's a serious question

jaunty helm Dec 14, 2024, 1:19 PM

#

fiery dust so I kinda *know* that whenever my model thinks it's gonna rain, it wont haha

the thing is at that point, how did you even get the "bad" model, sounds to me like you messed up some training data
e.g. if this is a binary classification problems, maybe you accidentally flipped the labels

#

for multiclass, then not really
e.g. if you know it's either sunny, cloudy or rainy, you still wouldn't know if it's cloudy or sunny if your bad model predicts raining

fiery dust Dec 14, 2024, 1:20 PM

#

so even though I described a classification problem, in my case it's not, this was just for an example

#

like what if I'm predicting continuous variables

jaunty helm Dec 14, 2024, 1:20 PM

#

fiery dust like what if I'm predicting continuous variables

then shouldn't you have a continuous objective function and the bad model is just bad no matter how you spin it? I don't get it

#

like say your model never predicts correctly

say your model always predicts too high, then again that sounds like something messed up in training
say your model always predicts either too high or too low, but you don't know which; then it's not very useful cause you wouldnt know to go up or down from the predict

fiery dust Dec 14, 2024, 1:21 PM

#

lets say my model predicts the number of new members python's discord group will have. Let's imagine my model predicts that there will be over X new members today, but that is almost never the case

#

then I know that since my model is bad, almost every time my model predicts there will be lets say, 10 new members, I can say there WONT be 10 new members

jaunty helm Dec 14, 2024, 1:23 PM

#

fiery dust then I *know* that since my model is bad, almost every time my model predicts th...

I guess?

#

seems weird

past bramble Dec 14, 2024, 1:23 PM

#

I have cloned this repo: https://huggingface.co/hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4

and followed the first code over here:
https://huggingface.co/hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4#🤗-transformers

on running it I get this error: ```bash
CUDA extension not installed.
CUDA extension not installed.
Traceback (most recent call last):
File "C:\Users\user\Desktop\LLAMA 2\gpt.py", line 7, in <module>
model = AutoGPTQForCausalLM.from_pretrained(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: AutoGPTQForCausalLM.from_pretrained() missing 1 required positional argument: 'quantize_config'

How do I fix this?

hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4 · Hugging Face

fiery dust Dec 14, 2024, 1:23 PM

#

I know it sounds weird, but in case I try to come up with a good model and I really cant, it would be weird but at least it would work?

#

I mean sure, it makes more sense to come up with GOOD models instead of BAD models, but if you cannot come up with GOOD ones, then just predict the opposite of what your BAD model predicts.

jaunty helm Dec 14, 2024, 1:24 PM

#

I'm not sure why you'd want that tho
like there's infinitely many worse answers than "there will be 10 members"

#

so knowing there isn't 10 members doesn't really help, you still have infinite wrong answers

#

like I can say "pydis's members will not double today"
well, duh, but also, how does that help me

fiery dust Dec 14, 2024, 1:25 PM

#

it's just an example

#

nvm

#

silly question

jaunty helm Dec 14, 2024, 1:26 PM

#

fiery dust I mean sure, it makes more sense to come up with GOOD models instead of BAD mode...

the problem is the opposite isn't really useful ig?
like my example with ...will not double

deep veldt Dec 14, 2024, 1:30 PM

#

deep veldt is it necessary to use sigmoid on the subnet output in a siamese network?

can anyone help me about this?

lapis sequoia Dec 14, 2024, 1:39 PM

#

Random Forest Cross-Validation Scores:
R² Scores: [0.48833798 0.56808773 0.55909636 0.56288745 0.42693348]
Mean R²: 0.5211 (+/- 0.1108)

Fitting Random Forest on all data...

Random Forest Metrics:
RMSLE: 0.07985307785989508
R² Score: 0.5581471021117961
RMSE: 5714076.326474625

This is my model's scores
this is my code
https://paste.pythondiscord.com/K5SA

what can i change to improve the rmsle?

unkempt apex Dec 14, 2024, 1:40 PM

#

deep veldt can anyone help me about this?

no, because generally ReLu works fine

jaunty helm Dec 14, 2024, 1:42 PM

#

lapis sequoia ``` Random Forest Cross-Validation Scores: R² Scores: [0.48833798 0.56808773 0.5...

always depends on the data at hand
check for underfitting, or more likely as you're using trees, overfitting
do more diligent EDA and feature engineering

deep veldt Dec 14, 2024, 1:42 PM

#

unkempt apex no, because generally ReLu works fine

but you need it to calculate the similarity?

lapis sequoia Dec 14, 2024, 1:43 PM

#

jaunty helm always depends on the data at hand check for underfitting, or more likely as you...

yeah but im lost rn im ngl.
date_listed: The date when the device was listed on the platform.

year: The year the device model was manufactured or first introduced to the market.

manufacturer: The company that produced the device.

model: The specific model name or code for the device.

type: The category of the device.

connectivity: The type of connectivity supported by the device.

hours: The total number of hours the device has been used.

brand: The commercial brand associated with the device.

country: The location where the device is listed for sale, mapped to various countries.

condition: The current state of the device.

accessories: The number of additional accessories included with the device.

discount_percentage: The percentage discount applied to the original listing price.

user_rating: A rating from past users or simulated feedback that reflects satisfaction with the device.

warranty_extension: A binary indicator of whether a warranty extension option is available for the device.

price: The original listing price of the device. (target btw)

#

condition is literally 99% "used" and 1% "good"
type has a lot of empty rows

unkempt apex Dec 14, 2024, 1:44 PM

#

deep veldt but you need it to calculate the similarity?

can you explain for what purpose you are using this?>

deep veldt Dec 14, 2024, 1:44 PM

#

unkempt apex can you explain for what purpose you are using this?>

a siamese network

jaunty helm Dec 14, 2024, 1:44 PM

#

lapis sequoia yeah but im lost rn im ngl. date_listed: The date when the device was listed on ...

I mean first of all

for col in df.select_dtypes(include=np.number):
    df[col] = df[col].fillna(df[col].median())

for col in df.select_dtypes(include='object'):
    df[col] = df[col].fillna(df[col].mode()[0])
```surely you can do better than blindly filling every missing with median / mode of the entire dataframe?
not to mention you're leaking due to this

lapis sequoia Dec 14, 2024, 1:44 PM

#

jaunty helm I mean first of all ```py for col in df.select_dtypes(include=np.number): df...

sure, what do you recommend? simple imputer?

unkempt apex Dec 14, 2024, 1:44 PM

#

deep veldt a siamese network

yeah, so in general sigmoid is not required

jaunty helm Dec 14, 2024, 1:45 PM

#

lapis sequoia sure, what do you recommend? simple imputer?

no
I recommend checking your data and finding something that makes more sense

#

and no, I have no idea what would "make more sense" because that 100% depends on the data you have

unkempt apex Dec 14, 2024, 1:45 PM

#

unkempt apex yeah, so in general sigmoid is not required

because sigmoid just maps the output of your layer/neuron into [ 0 to 1 ]

deep veldt Dec 14, 2024, 1:45 PM

#

unkempt apex yeah, so in general sigmoid is not required

do i need it when using tripletloss

#

i just looked at the pytorch example for siamese network and it uses sigmoid to get the similarity

unkempt apex Dec 14, 2024, 1:46 PM

#

deep veldt i just looked at the pytorch example for siamese network and it uses sigmoid to ...

https://towardsdatascience.com/a-friendly-introduction-to-siamese-networks-85ab17522942

Medium

A friendly Introduction to Siamese Networks

You don’t always need a lot of data to train your model, learn how to create a model with a tiny number of images per class

jaunty helm Dec 14, 2024, 1:46 PM

#

jaunty helm and no, I have no idea what would "make more sense" because that 100% depends on...

make some plots and see & feel your data
find patterns, try using it to your advantage, check if it betters the performance

unkempt apex Dec 14, 2024, 1:46 PM

#

deep veldt do i need it when using tripletloss

first, clear your purpose and then learn about it !

lapis sequoia Dec 14, 2024, 1:49 PM

#

jaunty helm make some plots and see & feel your data find patterns, try using it to your adv...

i have a lot of data, i just dont really have any other idea how to find patterns basiccally

jaunty helm Dec 14, 2024, 1:49 PM

#

lapis sequoia i have a lot of data, i just dont really have any other idea how to find pattern...

plot the data and see if you spot anything

lapis sequoia Dec 14, 2024, 1:49 PM

#

ive tried a few combinations of features

jaunty helm Dec 14, 2024, 1:50 PM

#

lapis sequoia i have a lot of data, i just dont really have any other idea how to find pattern...

how much do you mean by "a lot" btw?
trees tend to overfit, so try tuning the parameters

deep veldt Dec 14, 2024, 1:50 PM

#

unkempt apex first, clear your purpose and then learn about it !

do you really have to implement your own Contrastive loss function, because pytorch doesnt have it

past bramble Dec 14, 2024, 1:52 PM

#

past bramble I have cloned this repo: https://huggingface.co/hugging-quants/Meta-Llama-3.1-8B...

Could anyone help me with this?

lapis sequoia Dec 14, 2024, 1:54 PM

#

jaunty helm how much do you mean by "a lot" btw? trees tend to overfit, so try tuning the pa...

what parameters are best to handle overfitting?

jaunty helm Dec 14, 2024, 1:56 PM

#

lapis sequoia what parameters are best to handle overfitting?

smaller n_estimators or lower max_depth immediately comes to my mind
but also, unironically just check sklearn docs

scikit-learn

RandomForestRegressor

Gallery examples: Release Highlights for scikit-learn 1.4 Release Highlights for scikit-learn 0.24 Combine predictors using stacking Comparing Random Forests and Histogram Gradient Boosting models ...

unkempt apex Dec 14, 2024, 2:03 PM

#

deep veldt do you really have to implement your own Contrastive loss function, because pyto...

def criterion(x1, x2, label, margin: float = 1.0):
    """
    Computes Contrastive Loss
    """

    dist = torch.nn.functional.pairwise_distance(x1, x2)

    loss = (1 - label) * torch.pow(dist, 2) \
        + (label) * torch.pow(torch.clamp(margin - dist, min=0.0), 2)
    loss = torch.mean(loss)

    return loss

#

found this on github gist

unkempt apex Dec 14, 2024, 2:04 PM

#

past bramble Could anyone help me with this?

you just want to inference the model right?

#

@jaunty helm can you send that hf link again for that llama models which were 4bit

deep veldt Dec 14, 2024, 2:05 PM

#

unkempt apex ``` def criterion(x1, x2, label, margin: float = 1.0): """ Computes Cont...

isnt TripletWithDistanceLoss the same thing?

unkempt apex Dec 14, 2024, 2:05 PM

#

unkempt apex <@208918673178492929> can you send that hf link again for that llama models whic...

and in other format too

unkempt apex Dec 14, 2024, 2:05 PM

#

deep veldt isnt TripletWithDistanceLoss the same thing?

I don't know man

#

you have to search about it

jaunty helm Dec 14, 2024, 2:07 PM

#

past bramble I have cloned this repo: https://huggingface.co/hugging-quants/Meta-Llama-3.1-8B...

gptq is basically deprecated at this point

#

like the popular formats are either gguf if you need to run some parts on cpu, or exl2 if you don't

unkempt apex Dec 14, 2024, 2:08 PM

#

jaunty helm like the popular formats are either gguf if you need to run some parts on cpu, o...

yeah can you send that gguf link?

#

https://huggingface.co/bartowski/Llama-3.2-1B-Instruct-GGUF

bartowski/Llama-3.2-1B-Instruct-GGUF · Hugging Face

#

it was this I guess

jaunty helm Dec 14, 2024, 2:09 PM

#

unkempt apex it was this I guess

that's not the same model

jaunty helm Dec 14, 2024, 2:09 PM

#

unkempt apex yeah can you send that gguf link?

ig this?

unkempt apex Dec 14, 2024, 2:09 PM

#

he needs 8B model

#

https://huggingface.co/TheBloke/Llama-2-7B-GGUF

TheBloke/Llama-2-7B-GGUF · Hugging Face

#

can also use this

jaunty helm Dec 14, 2024, 2:10 PM

#

llama 2 is at least like 1 year old at this point

unkempt apex Dec 14, 2024, 2:11 PM

#

yeah

#

this https://huggingface.co/QuantFactory/Meta-Llama-3-8B-Instruct-GGUF

jaunty helm Dec 14, 2024, 2:13 PM

#

sure ig
for future reference, you can just go to the original model's page, and check to the side the Quantizations text in the model tree and click that

deep veldt Dec 14, 2024, 2:18 PM

#

does it still count as a siamese network if i use crossentrophyloss

past bramble Dec 14, 2024, 2:30 PM

#

unkempt apex this <https://huggingface.co/QuantFactory/Meta-Llama-3-8B-Instruct-GGUF>

is it necessary every one of these files are needed?

#

cuz they are 3GB+ each

lapis sequoia Dec 14, 2024, 2:37 PM

#

jaunty helm smaller `n_estimators` or lower `max_depth` immediately comes to my mind but als...

just curious about your insight on these.

jaunty helm Dec 14, 2024, 2:39 PM

#

past bramble is it necessary every one of these files are needed?

no
each file is a different level of quantization

lapis sequoia Dec 14, 2024, 2:40 PM

#

lapis sequoia just curious about your insight on these.

coz this means basically no correlation on all these values, so i'd rather stick to the others right?

jaunty helm Dec 14, 2024, 2:41 PM

#

lapis sequoia coz this means basically no correlation on all these values, so i'd rather stick...

potentially (as it may have a non-linear correlation)
looking at the graph I also don't see much interesting other than the 0s which I assume are missing
maybe the year has something but it's all squeezed to the ends cause of the 0s

lapis sequoia Dec 14, 2024, 2:43 PM

#

jaunty helm potentially (as it may have a non-linear correlation) looking at the graph I als...

using .corr()

#

Correlations with Price:
price 1.000000
year 0.039578
discount_percent 0.020176
accessories 0.000480
device_id 0.000047
warranty_extension_available -0.009423
user_rating -0.010529

#

so obv year is best in this, so the rest has to be the other cat data

jaunty helm Dec 14, 2024, 2:43 PM

#

lapis sequoia using .corr()

yes, that calculates linear correlation
I meant that while yes, the correlations look low, there may or may not be non-linear correlation it can't detect

#

one of the easier ways to find out is just compare the models when you include vs. not include them

past bramble Dec 14, 2024, 2:45 PM

#

jaunty helm no each file is a different level of quantization

so any one of those should work i suppose

lapis sequoia Dec 14, 2024, 2:45 PM

#

jaunty helm yes, that calculates linear correlation I meant that while yes, the correlations...

wouldnT just using xgboost manage that?

jaunty helm Dec 14, 2024, 2:46 PM

#

past bramble so any one of those should work i suppose

yes
don't use the QX_0 nor QX_1 ones, they're legacy and do worse
otherwise, the higher the X in QX, the higher bit it is

past bramble Dec 14, 2024, 2:47 PM

#

jaunty helm yes don't use the `QX_0` nor `QX_1` ones, they're legacy and do worse otherwise,...

So Q2_K's good then?

jaunty helm Dec 14, 2024, 2:47 PM

#

past bramble So Q2_K's good then?

Q2_K would be like 2-3 bits, if you're fine with that then sure
the original gptq you showed says INT4, which is 4bits, which is somewhere around Q3 or Q4

jaunty helm Dec 14, 2024, 2:48 PM

#

lapis sequoia wouldnT just using xgboost manage that?

sure, or it could overfit (or you can still just tune the params to fix that)
I'm just saying that, just because .corr() is low, that doesn't mean there isn't some association between two variables

past bramble Dec 14, 2024, 2:48 PM

#

i want to try it out, I'll go to larger ones after playing around with these and getting familiar

jaunty helm Dec 14, 2024, 2:55 PM

#

past bramble i want to try it out, I'll go to larger ones after playing around with these and...

usually the rule of thumb is, Q8 is virtually the same as the original, Q6 is within 99% the same
and for small models like an 8b, you don't want to go below Q4

past bramble Dec 14, 2024, 3:43 PM

#

I have tried with this, but then a new error talking about config.json: ```bash
Traceback (most recent call last):
File "C:\Users\user\Desktop\LLAMA 2\gpt.py", line 6, in <module>
pipeline = transformers.pipeline(
^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\user\Desktop\LLAMA 2.venv\Lib\site-packages\transformers\pipelines_init_.py", line 849, in pipeline
config = AutoConfig.from_pretrained(
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\user\Desktop\LLAMA 2.venv\Lib\site-packages\transformers\models\auto\configuration_auto.py", line 1053, in from_pretrained
raise ValueError(
ValueError: Unrecognized model in ./Meta-Llama-3-8B-Instruct-GGUF. Should have a model_type key in its config.json, or contain one of the following strings in its name: albert, align, altclip, ...

#

I don't see any config files in repo, do I make my own? What information do I provide?

agile cobalt Dec 14, 2024, 3:48 PM

#

past bramble I have tried with this, but then a new error talking about `config.json`: ```bas...

are you following https://huggingface.co/docs/transformers/en/gguf#example-usage ?

GGUF and interaction with Transformers

past bramble Dec 14, 2024, 3:57 PM

#

agile cobalt are you following https://huggingface.co/docs/transformers/en/gguf#example-usage...

I am trying this now, it seems to give this error I am unsure of: ```bash
Traceback (most recent call last):
File "C:\Users\user\Desktop\LLAMA 2\gpt.py", line 6, in <module>
tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\user\Desktop\LLAMA 2.venv\Lib\site-packages\transformers\models\auto\tokenization_auto.py", line 875, in from_pretrained
config_dict = load_gguf_checkpoint(gguf_path, return_tensors=False)["config"]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\user\Desktop\LLAMA 2.venv\Lib\site-packages\transformers\modeling_gguf_pytorch_utils.py", line 278, in load_gguf_checkpoint
reader = GGUFReader(gguf_checkpoint_path)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\user\Desktop\LLAMA 2.venv\Lib\site-packages\gguf\gguf_reader.py", line 94, in init
if self._get(offs, np.uint32, override_order = '<')[0] != GGUF_MAGIC:
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\user\Desktop\LLAMA 2.venv\Lib\site-packages\gguf\gguf_reader.py", line 151, in _get
.newbyteorder(override_order or self.byte_order)
^^^^^^^^^^^^
AttributeError: newbyteorder was removed from the ndarray class in NumPy 2.0. Use arr.view(arr.dtype.newbyteorder(order)) instead.

#

do I uninstall numpy and use a prior version of it? It might cause some other problems I suppose

agile cobalt Dec 14, 2024, 4:32 PM

#

past bramble I am trying this now, it seems to give this error I am unsure of: ```bash Traceb...

that is a bug in the gguf package, looks like a fix was merged yesterady https://github.com/ggerganov/llama.cpp/pull/9772 but they haven't released a new version yet

for now just pip install numpy<2

past bramble Dec 14, 2024, 7:12 PM

#

alright that helped

rich moth Dec 14, 2024, 7:24 PM

#

inland crown That's awesome! I've been working on a trading bot. So far in the testing it's d...

Nice! Its pretty fun to tinker around with. I'm putting the final touches on my backtest engine. What kind of architecture did you end up going with? I used a transformer with multiheaded attention with some additions. I got inspiried from my Prime number predictor project to make the it. Like the uncertainty estimator, it just seemed like a perfect fit for a trading bot.

past bramble Dec 14, 2024, 7:28 PM

#

I have this code, ```py
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "./Meta-Llama-3-8B-Instruct-GGUF"
filename = "Meta-Llama-3-8B-Instruct.Q2_K.gguf"

tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=filename)
def generate_text(prompt, max_length=50):
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_length=max_length)
return tokenizer.decode(outputs[0], skip_special_tokens=True)

prompt = "Once upon a time in a land far, far away"
generated_text = generate_text(prompt)
print(generated_text)

Terminal:```bash
Converting and de-quantizing GGUF tensors...: 100%|████████████████████| 291/291 [03:20<00:00,  1.45it/s]97/291 [02:03<00:57,  1.64it/s]
``` It didnt show any text generation printed, why was that?

#

I tried to save the dequantized model into a folder but it didnt seem to work, the folder is empty: ```py
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "./Meta-Llama-3-8B-Instruct-GGUF"
filename = "Meta-Llama-3-8B-Instruct.Q2_K.gguf"

tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=filename)

model.save_pretrained("./dequantized_model")
tokenizer.save_pretrained("./dequantized_model")
bash
Converting and de-quantizing GGUF tensors...: 100%|█████████████████████| 291/291 [02:45<00:00, 1.75it/s]

fresh bay Dec 14, 2024, 7:47 PM

#

anyone able to tell me if this implementation looks correct for contrastive loss?

class ContrastiveLoss(nn.Module):
    def __init__(self, temperature=0.5):
        super(ContrastiveLoss, self).__init__()
        self.temperature = temperature

    def forward(self, projections_1, projections_2):
        z_i = projections_1
        z_j = projections_2
        z_i_norm = F.normalize(z_i, dim=1)
        z_j_norm = F.normalize(z_j, dim=1)
        cosine_num = torch.matmul(z_i, z_j.T)
        cosine_denom = torch.matmul(z_i_norm, z_j_norm.T)
        cosine_similarity = cosine_num / cosine_denom

        numerator = torch.exp(torch.diag(cosine_similarity) / self.temperature)

        denominator = cosine_similarity
        diagonal_indices = torch.arange(denominator.size(0))
        denominator[diagonal_indices, diagonal_indices] = 0
        denominator = torch.exp(torch.sum(cosine_similarity, dim=1))
        loss = -torch.log(numerator / denominator).sum()
        return loss

inland crown Dec 14, 2024, 8:13 PM

#

rich moth Nice! Its pretty fun to tinker around with. I'm putting the final touches on ...

Well, I started with just a bot that monitored BTC so I could establish the best method to monitor the market. After messing with a few API's I decided to go with the wws feed from coinbase. Much faster and you don't run out of calls. Then I tested something I had AI come up with when I showed it what I look at on the charts. That didn't test well but I was just cutting teeth on Python. Now I have to with a ML module, that decided when to buy and sell on it's own. That's tested really well but my script got too long. I'm now hung on changing it to a modular system so I can make a variety o bots with it by just changing the logic module.

rich moth Dec 14, 2024, 8:13 PM

#

past bramble I tried to save the dequantized model into a folder but it didnt seem to work, t...

Wrap the generation in a try except block, maybe something is going on behind the scene

proper laurel Dec 14, 2024, 8:15 PM

#

Hey guys I'm back

inland crown Dec 14, 2024, 8:15 PM

#

Howdy back, I'm front..

fresh bay Dec 14, 2024, 8:15 PM

#

@rich moth hey if you dont mind me asking does that implementation look correct?

inland crown Dec 14, 2024, 8:20 PM

#

So my script primes as many coins as you feed it with $500 to start. I have it running 4 coins on most tests, BTC, ETH,SHIB and DOGE. It buys low, sells high and has a hold (1 - 0 - -1

#

Alas, currntly stuck at porting it to a modular file. I can't seem to get my starting prime balances to prime anything.

#

I still have to implement the stop loss and actual trading functions but it was testing well before I broke it up.

rich moth Dec 14, 2024, 8:24 PM

#

inland crown Well, I started with just a bot that monitored BTC so I could establish the best...

I used coinbase pro for trading with my first bot, I liked it. Though I didnt have many options being in the US. Lots of exchanges dont want to take you if you're from the US. So you need to design your RiskManger next? Sounds like you're diving in head first though and figuring it out. While you might cringe having to make it more modular, youll thank yourself later.

inland crown Dec 14, 2024, 8:25 PM

#

@rich moth I like how you charted out your plots. I just used a standard graph. I'm un the US, was it about certain coins?

serene scaffold Dec 14, 2024, 8:25 PM

#

inland crown So my script primes as many coins as you feed it with $500 to start. I have it r...

You're asking about crypto? Can you explain the AI angle to this?

inland crown Dec 14, 2024, 8:27 PM

#

Here was on test.

@stelecrcus . Well, I'm using it to write a lot of the base code and then I'm using the Machine LEarning module to automate the trade logic.

fresh bay Dec 14, 2024, 8:27 PM

#

@serene scaffold are you able to take a look at that loss function code and tell me if it looks right?

inland crown Dec 14, 2024, 8:29 PM

#

I'v suggesteed a crypto-bots in python topical chat. Would be great to have a place to talk specifically about crypto bots in PY.

rich moth Dec 14, 2024, 8:31 PM

#

fresh bay anyone able to tell me if this implementation looks correct for contrastive loss...

You have to exponentiate all your similarities before you summing them. You should normalize the projections before you matrix multiplcation.

rich moth Dec 14, 2024, 8:34 PM

#

inland crown <@204385862081970178> I like how you charted out your plots. I just used a stand...

Thanks, I've learned over time how much visualizing your data can really help, numerical metrics only go so far. But in general they wont allow you to register if you click you're from the US on most exchanges i've tried.

fresh bay Dec 14, 2024, 8:35 PM

#

rich moth You have to exponentiate all your similarities before you summing them. You sho...

yea I want cosine similarity for pretty specific reasons in this case hence why I am doing it that way - not sure if it will matter tbh

rich moth Dec 14, 2024, 8:36 PM

#

just curious what are the reasons?

fresh bay Dec 14, 2024, 8:37 PM

#

so I really need the similarity of vectors and want to ignore the total magintude

#

or divide through by it I guess

#

empirically tbh might end up being worse

#

and its an easy enough change tbf

#

thanks for the catch on the exp before summing

#

duh.

rich moth Dec 14, 2024, 8:43 PM

#

I see hmm

fresh bay Dec 14, 2024, 8:44 PM

#

honestly it might not actually make a difference tbf

#

thank you for the set of eyes

#

much appreciated

past bridge Dec 14, 2024, 10:31 PM

#

is there a way to make google colabs runtime less annoying? im dealing with large datasets and a bunch of libraries and nothing is running i have some weird problems with my runtime

#

either nothing runs at all and the runtime suddenly disconnects or it just takes absolute ages to run things that arent even that heavy

serene scaffold Dec 14, 2024, 10:35 PM

#

past bridge is there a way to make google colabs runtime less annoying? im dealing with larg...

the only way is probably to pay for the subscription service.

spring field Dec 14, 2024, 11:48 PM

#

I understand that this might come off as a really weird question, but
do any of you, in the "Data Science" sphere, use stuff like neovim?
I'm just interested in trying out "that stuff", switching to completely terminal-based code editing, however, I do really like some of the features of say PyCharm and VSCode that I'm not sure can be replicated in something like neovim
for instance, all the fun debugging tools in both PyCharm and VSCode, or the matplotlib chart display that's integrated into PyCharm and other simlar features
I understand I can get all of actual text-editing-related functionality (autocomplete, intellisense, stuff like that) in neovim, I'm just unsure about the rest of it, the rest of it that I might miss (like some fun extensions from VSCode and such)

spring field Dec 14, 2024, 11:49 PM

#

past bridge is there a way to make google colabs runtime less annoying? im dealing with larg...

might I suggest using paperspace
it's paid, but last I checked, it was relatively cheap and there are some free options occasionally as well
it's also owned by DigitalOcean now (and I think DigitalOcean has some GPU droplets itself now as well (might be a result of acquiring paperspace I suppose))

fading wigeon Dec 14, 2024, 11:52 PM

#

So it's important for many machine learning methods to operate off of scaled values that have a mean of 0 and a std dev of 1.

My question is, does it matter if you scale or fit a polynomial model first? I'm watching a lecture video and they suggest fitting the polynomial model first and then scaling the values with StandardScaler. I've always done it the other way around.

#

Have I been doing it wrong?

#

And if I've been doing it wrong, what happens when it's done that way?

iron basalt Dec 14, 2024, 11:57 PM

#

spring field I understand that this might come off as a really weird question, but do any of ...

Debugging, yes, Matplotlib, possible in different ways. Ideal may be to just use a tiling window manager (if you are going for the full Linux experience).

#

Notebook-style: https://github.com/luk400/vim-jukit

GitHub

GitHub - luk400/vim-jukit: Jupyter-Notebook inspired Neovim/Vim Plugin

Jupyter-Notebook inspired Neovim/Vim Plugin. Contribute to luk400/vim-jukit development by creating an account on GitHub.

#

Debugging: https://github.com/puremourning/vimspector

GitHub

GitHub - puremourning/vimspector: vimspector - A multi-language deb...

vimspector - A multi-language debugging system for Vim - puremourning/vimspector

#

(Uses the same debugger server as Visual Studio Code for Python)

#

The reason I bring up the tiling window manager is because on Linux, the entire OS is the IDE (Unix). And with tiling windows it feels just like one.

fading wigeon Dec 15, 2024, 12:01 AM

#

Nevermind, I think I misunderstood what I was looking at. They only add polynomial features with sklearn's preprocessing convenience method

#

So that makes sense it would come before the scaling

spring field Dec 15, 2024, 12:15 AM

#

iron basalt The reason I bring up the tiling window manager is because on Linux, the entire ...

That's a neat way to look at it, thanks!

#

and thanks for the resources 🤗

abstract wasp Dec 15, 2024, 1:54 AM

#

abstract wasp hi for those who have experience in mlops, which course do you recommend most? h...

Hi can someone help me w this ty 😃

past bramble Dec 15, 2024, 2:58 AM

#

I am having this code: ```py
import traceback
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "./Meta-Llama-3-8B-Instruct-GGUF"
filename = "Meta-Llama-3-8B-Instruct.Q2_K.gguf"

try:
tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=filename)

tokenizer.save_pretrained('dequantized_model')
model.save_pretrained('dequantized_model')

def generate_text(prompt, max_length=50):
    inputs = tokenizer(prompt, return_tensors="pt")
    print(f"{inputs = }")
    outputs = model.generate(**inputs, max_length=max_length)
    print(f"{outputs = }")
    result = tokenizer.decode(outputs[0], skip_special_tokens=True)
    print(f"{result = }")
    return result

prompt = "Once upon a time in a land far, far away"
generated_text = generate_text(prompt)
print(generated_text)

except Exception as e:
traceback.print_exc()
print(F"{e = }")

The only thing in terminal is: ```bash
Converting and de-quantizing GGUF tensors...: 100%|█████████████████████| 291/291 [02:39<00:00,  1.83it/s]

deep veldt Dec 15, 2024, 5:27 AM

#

do i have to modify the fc layer of resnet to train on custom dataset?

dire kiln Dec 15, 2024, 11:10 AM

#

Good morning

odd meteor Dec 15, 2024, 11:10 AM

#

deep veldt do i have to modify the fc layer of resnet to train on custom dataset?

I think the right answer is, it depends on the variant of transfer learning you wanna use.

There are 2 variants of transfer learning (TL)

a) TL from dataset perspective
b) TL from model perspective

Your question lean more towards b so I'll focus more on that part.

Transfer Learning from model perspective also has 3 variants.

1st Varaint: This variant of transfer learning is where you finetune only the last layer of your ResNet.
step 1: Train whole model on a large dataset (since you're using Resnet which has been pretrained you'll skip this)

step 2: Freeze the model weights
step 3: Replace the last layer which is the output layer.
step 4: After you've done step 3, retrain the model on your custom dataset. (here, only the new output layer which you replaced in step 3 will be updated while performing step 4)

2nd Varaint:

This variant of transfer learning is used to train multiple output layers.

step 1: Train whole model on large dataset (you'll skip this since Resnet is already pretrained)
step 2: Freeze the weights
step 3: replace the last layer
step 4: Train on custom dataset and update all the different fully connected layers in the network.

So in contrast to 1st Varaint, you'll be updating multiple output layers here. This usually tend to perform slightly better than 1st variant.

3rd Varaint:
This variant is where you finetune the whole model.

Step 1: Just like 1st & 2nd Varaint, you train the whole model on large dataset (skip this once again since you're using Resnet).
Step 2: replace the last layer
Step 3: Now, train on custom data and update all layers. That is, instead of only updating the last layer, you'll train the model and update all layers (notice in this variant we don't have to freeze model weights)

This variant can be a bit more expensive because we have to update all the weights. However, depending on the classification task at hand, this might yield better predictive accuracy.

dire kiln Dec 15, 2024, 11:14 AM

#

I have multiple similar videos and I want to detect every instance of a common image. I have a perfect base imagine of what the program should detect, and I need it to, based on that, find every instance of it that looks kinda alike, and then present me with the candidates so that I can point which ones are correct and which aren't so that it can train itself.

deep veldt Dec 15, 2024, 11:16 AM

#

odd meteor I think the right answer is, it depends on the variant of transfer learning you ...

why freeze the layers ? im new

dire kiln Dec 15, 2024, 11:17 AM

#

dire kiln I have multiple similar videos and I want to detect every instance of a common i...

A good example would be a 2D RPG game, where the images are mostly clear with little distortion, except from the magic spells covering them sometimes. So the program would tell me: this square area appears to match 30%, or it looks 30% of this character is visible inside it.

#

Then I would confirm and the program would train itself based on that.

#

Is it possible?

#

If yes, could you guys recommend me some resources? Hints and advice would be great too.

#

I'm not a math person, I just read loads of documentation and test the output on things to know how to put the logic together.

#

Thank you in advance

odd meteor Dec 15, 2024, 11:40 AM

#

deep veldt why freeze the layers ? im new

Okay cool. Let me break it down a bit more. When it comes to transfer learning, we're essentially transferring knowledge from a large dataset.

Since labelled data are typically scarce, in practice, it makes sense to leverage pretrained models (example Resnet, Llama etc) which we can further finetune on our custom dataset.

The key idea behind transfer learning is that we can use these large dataset to learn general feature extraction layers that are generally useful.

Now, this is the part that answers your question more directly.

The essence of freezing the weights is because we want the backbone of the network (in your case, Resnet) to remain untampered with. That is, we don't wanna tamper with the CNN part. We can however, decide to tamper with the MLP or even, just the output layer.

Remember, Resnet has already been pretrained on a large image dataset (ImageNet) and it has learned stuff during that training.

If we now want to use ResNet on our own custom image dataset, say, CIFAR-10, we might not want to update the weights in every layer in ResNet again, hence, the reason one might decide to freeze the weights in those layers.

Of course, this is not always the case in transfer learning. If you want to update all the weights during training, you will use the 3rd variant.

deep veldt Dec 15, 2024, 12:04 PM

#

odd meteor Okay cool. Let me break it down a bit more. When it comes to transfer learning, ...

thanks

#

also how do you calculate the similarity and distance in a siamese network? and how to classify them after?

unborn flower Dec 15, 2024, 4:44 PM

#

is there anyone who is learning python advanced concepts here? am looking for a peer to study together

red canopy Dec 15, 2024, 7:45 PM

#

unborn flower is there anyone who is learning python advanced concepts here? am looking for a ...

I would like to learn advanced concept in both ways, about python and about ML

proper laurel Dec 15, 2024, 7:48 PM

#

Hi guys

rich moth Dec 15, 2024, 11:31 PM

#

proper laurel Hi guys

bah humbuggy

deep veldt Dec 16, 2024, 1:46 PM

#

deep veldt also how do you calculate the similarity and distance in a siamese network? and ...

may someone help me

remote stream Dec 16, 2024, 2:46 PM

#

anyone know if we need linux to utilise gpu or just anaconda is enough

agile cobalt Dec 16, 2024, 2:52 PM

#

remote stream anyone know if we need linux to utilise gpu or just anaconda is enough

Tensor Flow removed support for GPU usage in Windows a while ago, so if you're using their latest version you will need to use Linux (or WSL2)

Nearly any other major tool supports it in both Windows and Linux
For most tools you don't really need of Anaconda either, although it might be easier to setup some using it than without it

remote stream Dec 16, 2024, 2:52 PM

#

agile cobalt Tensor Flow removed support for GPU usage in Windows a while ago, so if you're u...

i thought anaconda navigator had

#

function to utilise gpu?

agile cobalt Dec 16, 2024, 2:52 PM

#

no?

#

not Anaconda itself

remote stream Dec 16, 2024, 2:53 PM

#

hmm

agile cobalt Dec 16, 2024, 2:53 PM

#

some tools you can install using it may utilise your GPU if configured correctly, but for most part, you can install those same tools without using Anaconda and they'll work just as well

remote stream Dec 16, 2024, 2:54 PM

#

agile cobalt some tools you can install using it may utilise your GPU if configured correctly...

https://www.youtube.com/watch?v=r31jnE7pR-g

YouTube

Michael Min

How To Use Your GPU for Machine Learning on Windows with Jupyter No...

A quick guide on how to enable the use of your GPU for machine learning with Jupyter Notebook, Tensorflow, Keras on the Windows operating system.

I researched and tried various methods to get this work, and discovered this to be the easiest and quickest solution.

This will allow you to use your GPU instead of your CPU when training your your n...

▶ Play video

#

is this removed?

agile cobalt Dec 16, 2024, 2:55 PM

#

remote stream https://www.youtube.com/watch?v=r31jnE7pR-g

I would strongly recommend using PyTorch instead of TensorFlow if you do not have any specific reasons to use TensorFlow

remote stream Dec 16, 2024, 2:55 PM

#

agile cobalt I would strongly recommend using PyTorch instead of TensorFlow if you do not hav...

why

agile cobalt Dec 16, 2024, 2:56 PM

#

in the last few years it has greatly overtaken TF in popularity, so most recent research, models and resources will be using it

remote stream Dec 16, 2024, 2:56 PM

#

i c

remote stream Dec 16, 2024, 2:57 PM

#

agile cobalt in the last few years it has greatly overtaken TF in popularity, so most recent ...

also

#

ive been creating lot of environments and install libraries for each of them

#

now my space is full

#

where can i clean them up

agile cobalt Dec 16, 2024, 2:57 PM

#

just delete the environments

#

how depends on what you're using to create them, but usually same place that you went to create in first place

lofty saffron Dec 16, 2024, 8:10 PM

#

Hi, I am currently trying to make a RL, DL rocket league bot that can play in competitive/casual games (online games). There is almost no way to extract data as in speed/trajectory of the ball as it changes memory adress often. I have made some pattern recognition so it can find the ball on my screen and track it and soon getting some more variables for it to see. Is there anything I can do since I am pretty stumped. It doesnt really know where it is relative to the field and the ball on the field, how can I make much more accurate?

serene scaffold Dec 16, 2024, 8:26 PM

#

lofty saffron Hi, I am currently trying to make a RL, DL rocket league bot that can play in co...

You should not make any kind of automation for playing rocket league in competitive games. we will not help you with this.

lofty saffron Dec 16, 2024, 8:27 PM

#

its not for comp its just for concept

#

i was trying to say doing it without the rlbot framework

#

Sorry if I haven't explained fully.

echo yacht Dec 16, 2024, 8:40 PM

#

hello! i'm writing an academic research paper w/ all my data analyses done in jupyterhub & don't know how to cite it in the abstract. usually when i work in R, i can just say R (version 4.1.3) but how would it be for jupyterhub? like are there versions or am i just supposed to cite python itself

agile cobalt Dec 16, 2024, 8:54 PM

#

echo yacht hello! i'm writing an academic research paper w/ all my data analyses done in ju...

seems like there are some open Issues simlar to that in their organizational repos, https://github.com/jupyterlab/frontends-team-compass/issues/144

As far as Jupyter Hub goes, you could just cite the https://github.com/jupyterhub/jupyterhub website

You may want (need?) to cite some of the libraries you are using like numpy, pytorch, polars or whatever you're using... Usually you should be able to find information about it in the GitHub readme, but in practice it's all over the place

agile cobalt Dec 16, 2024, 8:56 PM

#

echo yacht hello! i'm writing an academic research paper w/ all my data analyses done in ju...

you could also try bumping https://github.com/jupyterhub/jupyterhub/issues/2661 or otherwise contacting the repository maintainers

GitHub

Citing JupyterHub? · Issue #2661 · jupyterhub/jupyterhub

Hi folks, Do you have a recommended way to cite JupyterHub in publications? I poked around but didn't find a clear answer beyond citing Jupyter as a whole. Thanks! -Cyrus

#

(for real... there are way too many open issues about this in their repos, https://github.com/jupyterhub/jupyterhub/issues/2039, https://github.com/jupyter/jupyter/issues/190...)

echo yacht Dec 16, 2024, 8:59 PM

#

agile cobalt you could also try bumping https://github.com/jupyterhub/jupyterhub/issues/2661 ...

thank you so much for your response!! i tentatively said "all statistical analyses were performed in Python (version 3.11) in the JupyterHub environment" but i didn't think to cite each of the libraries. i also didn't think to check the readmes, i appreciate your help a ton!!

agile cobalt Dec 16, 2024, 9:02 PM

#

You do not necessarily have to cite every single tool you are using, maybe check with your professor or reviewer

lethal gull Dec 16, 2024, 9:45 PM

#

agile cobalt some tools you can install using it may utilise your GPU if configured correctly...

I dual boot linux and windows switching to ubuntu for programming was the best decision of my life so if u can i would dual boot ubuntu or just switch entirely if you can

fading wigeon Dec 17, 2024, 2:04 AM

#

Hey, I have a question. I know that high bias is associated with underfitting, but what does the term mean? Like... the etymology of the term maybe? high variance makes intuitive sense to me but I'm not sure how to interpret high bias

serene scaffold Dec 17, 2024, 2:09 AM

#

fading wigeon Hey, I have a question. I know that high bias is associated with underfitting, ...

Without actually answering your question: my opinion is that ML terminology is a shit show and you should fully disconnect any ML term from any straightforward meaning that a fluent English speaker would intuitively ascribe to it.

fading wigeon Dec 17, 2024, 2:10 AM

#

lmao fair enough

#

There definitely is some like.... nomenclature variable labeling that just bothers me because it contradicts with other mathematical paradigms

inland crown Dec 17, 2024, 3:53 AM

#

So, can I discuss ML here? Is that considered AI?

serene scaffold Dec 17, 2024, 3:53 AM

#

inland crown So, can I discuss ML here? Is that considered AI?

it's pretty much a subset of AI. and the only subset that people care about.

inland crown Dec 17, 2024, 3:56 AM

#

I use to program in PERL. Done a little c++ and java. Fairly new to PYTHON but I"ve used it a few times. I used gpt to start a script, it suggested to use a ML model. I used the joblib file and "trained" it by running my code. I really don't know enough about what I'm doing so am looking into how I can find out more about the "ML" part of my file. In the general chat, I'm asked questions, I don't know the answer to, but I'm trying to learn, not getting much help...

#

I understand the parts of the code that call it, but I don't really know what "it" is lol...

#

I don't see where my code is calling anything other then joblib.

#

well, the joblib file.

serene scaffold Dec 17, 2024, 3:58 AM

#

inland crown I use to program in PERL. Done a little c++ and java. Fairly new to PYTHON but I...

I can't imagine what joblib would have to do with ML. can you put the joblib file in the paste bin?

#

!paste

arctic wedgeBOT Dec 17, 2024, 3:58 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

inland crown Dec 17, 2024, 3:58 AM

#

sure, it's there...
https://paste.pythondiscord.com/XSDA

#

ok, wait, sorry, that's not the joblib file... brb.

serene scaffold Dec 17, 2024, 4:00 AM

#

nevermind--don't post the joblib file

inland crown Dec 17, 2024, 4:00 AM

#

is there a way to add a file after the fact or do I need to make a new one?

serene scaffold Dec 17, 2024, 4:00 AM

#

it looks like it's just using joblib to save and load a python object

inland crown Dec 17, 2024, 4:00 AM

#

ok, it's small.

#

So, this is the current code... As I said, I was using cgpt to start the code off. I found that it seemed that cgpt was messing with me, but it deleted the ML module probablt 50 times..

serene scaffold Dec 17, 2024, 4:01 AM

#

you have model = joblib.load(MODEL_FILENAME). it looks like model was originally created using the scikit-learn library

inland crown Dec 17, 2024, 4:01 AM

#

WHEW!

#

THANK YOU ALREADY!!

serene scaffold Dec 17, 2024, 4:02 AM

#

inland crown So, this is the current code... As I said, I was using cgpt to start the code of...

if chatgpt gave you this code, but it didn't give you a complete file that you can download to be MODEL_FILENAME, then ChatGPT was totally bullshitting you

inland crown Dec 17, 2024, 4:02 AM

#

I usually just need a lil start to research it, but I was getting nowhere fast.

#

it did. It messed with me for days before I gave up and just went solo.

#

it deleted so many bit of code that worked fine when I didn't ask it to it was crazy..

serene scaffold Dec 17, 2024, 4:03 AM

#

I am a computational linguist--my profession is essentially evaluating the utility of technologies like ChatGPT for different use cases

inland crown Dec 17, 2024, 4:03 AM

#

And now I'm left as a noob, not understanding the implementation of ML in my code, but I want and need to.

serene scaffold Dec 17, 2024, 4:03 AM

#

my professional and very authoritative opinion is that you should only use ChatGPT for "informed questions"

#

which is not currently the case.

inland crown Dec 17, 2024, 4:04 AM

#

I'm not using at all anymore. Havent for quite a while. I installed zencoder and that's helped me hammer bugs.

#

once I started making the script modular, gpt couldn't handle it at all.

#

At lease with zencoder, it can see all of my modules and functions.. But it's still been unable to get my script to run since I ported it.

#

But, Every time I ask a question, I usually figure out the answer as I hit enter.. lol... Cept for ML.. I don't think my script is using it properly and I want to use it for a variety of models to test with.

#

I mean, clearly it's not really using it activally

inland crown Dec 17, 2024, 4:25 AM

#

How did you come to determine that it was the scikit-learn library? I'm guess this is the same as the sklearn?

#

So it's trained with the sklearn.linear_model Now I have to figure out how to retrain it using the new model. The first run used 1 as buy and 0 as sell. My new model uses 1 as buy, 0 as hold and -1 as sell.

#

Im I implementing the ML properly, to generate a joblib file and then that is whats used? Can it be used in a more live sense to dynamically learn?

#

(should read "am I"?)

serene scaffold Dec 17, 2024, 1:59 PM

#

inland crown How did you come to determine that it was the scikit-learn library? I'm guess th...

they're the same. it was evident by the method calls.

serene scaffold Dec 17, 2024, 2:00 PM

#

inland crown Im I implementing the ML properly, to generate a joblib file and then that is wh...

the overwhelming majority of ML programs (ie, models) do not learn while they are being used; all the learning takes place before, during a training phase.

tawdry sundial Dec 17, 2024, 2:21 PM

#

I assume a lot of people here have tried to build or at least learned from building models to predict the stock market which is pretty much "impossible"

#

however something gets me really curious, why do most people lose money trading? isnt it a 50/50?

#

and if we take the assumption that most people lose money trading, then shouldnt a simple python program that opens trades against normal traders be profitable?

#

why wouldnt it?

serene scaffold Dec 17, 2024, 2:34 PM

#

tawdry sundial however something gets me really curious, why do most people lose money trading?...

my guess is that there are people who make enough money from day trading to live, and that that only works because they make a large profit from lots of people making small losses.

dim olive Dec 17, 2024, 4:21 PM

#

tawdry sundial however something gets me really curious, why do most people lose money trading?...

Anecdotally, the majority of people I know who day-trade are trading off of poorly educated decisions, i.e trading on stocks for companies they like, ones they hear about a lot in the news, etc. and have a general disregard for the numbers. I have been able to make a fair amount of money by trading purely based on statistics, though this is not consistent.

In general, from what I have learned, advanced learning algorithms are not better at making trading decisions than classic statistical analysis, so AI is generally just slower at making decisions, requires more serious hardware, and consumes more electricity for mostly the same results as simpler algorithms.

serene scaffold Dec 17, 2024, 4:23 PM

#

woah, Fisher is in this channel

dim olive Dec 17, 2024, 4:24 PM

#

dim olive Anecdotally, the majority of people I know who day-trade are trading off of poor...

With all that said, I think making a simple python trading algorithm could be a fun project, it is not as easy to translate statistics into profit as you might immediately think, I always recommend testing extensively without real money and limit risk for programming errors.

Much of the difficulty is not with analyzing the market, but making decisions based on edge cases and handling the API. I.e I have written in bugs before where a stock was, like 1% away from hitting my sell mark to earn profit, but I missed it and didnt handle the fall correctly and let a share spiral and lose WAY more money than I would have if I was trading without the algo simply because I didnt handle an api error response correctly.

dim olive Dec 17, 2024, 4:24 PM

#

serene scaffold woah, Fisher is in this channel

I saw you here, I had to

serene scaffold Dec 17, 2024, 4:25 PM

#

dim olive I saw you here, I had to

but I'm always in this channel

dim olive Dec 17, 2024, 4:29 PM

#

tawdry sundial why wouldnt it?

Oh, and to answer this portion: It probably would and there are loads of people doing this with many millions or billions of dollars. The problem here is that you are not necessarily competing against my random friend who may know nothing about trading, you are mostly competing in an arms race of other skilled and informed traders/developers to take the money from the randos who are making poor decisions. When you change the goal post, it is easy to see how that could be difficult.

rich moth Dec 17, 2024, 7:00 PM

#

dim olive Anecdotally, the majority of people I know who day-trade are trading off of poor...

I dunno, I made an advanced ML system and I'm pretty happy with the results so far. Looking at the metrics - Sharpe Ratio of 2.78 and Profit Factor of 1.88 it's performing well above average. Its processing 18 different technical features simultaneously while also providing confidence levels for eah predicition. Not something a simple classic system can do.

ripe vortex Dec 17, 2024, 8:13 PM

#

hi

#

i'm doing a program to help an student to do a new activitie based on a book

#

i tried to use the openai API to do that but it's so expansive

#

how mutch it will cost if i decide to train my own ai?

rich moth Dec 17, 2024, 8:34 PM

#

ripe vortex hi

Thats kind of loaded question. Theres a bunch of factors and it can get complicated and expensive quick. Personally if you're serious get a rig with a 4090 and start experimenting. There's some free ones you could use in the mean time like Google Colab they all come withs limits though.

ripe vortex Dec 17, 2024, 9:05 PM

#

I can do this project with something like $2000?

rich moth Dec 17, 2024, 9:40 PM

#

ripe vortex I can do this project with something like $2000?

I think its 2k just for Graphics card these days.

ripe vortex Dec 17, 2024, 11:43 PM

#

ow

#

how much do you think everithing will cost?

lethal gull Dec 18, 2024, 12:58 AM

#

rich moth Thats kind of loaded question. Theres a bunch of factors and it can get complic...

wait for blackwell

rich moth Dec 18, 2024, 1:10 AM

#

lethal gull wait for blackwell

wait for them to sell out and watch people jack up prices? maybe 🙂

rich moth Dec 18, 2024, 1:26 AM

#

It was crypto that caused the shortage a few years ago. Ethereum really took things to the next level, but with the increase price in BTC i foresee another smaller mining boom and a sell out of these things quick.

#

Humans love doing that when things get scarce, remember the whole toilet paper debacle during COVID19? Like why? Lol. People saw it selling out on TV and it created a sense of scarcity. The perfect storm is coming. I bought my 4090 at retail when it first came out, now its like 700-1000 more, it hasnt budged in years

#

So in a way its related to AI/ML because if you want one forr you projects, I'd pull the trigger when they drop.

lethal gull Dec 18, 2024, 2:01 AM

#

rich moth It was crypto that caused the shortage a few years ago. Ethereum really took th...

Mining is pointless now industrial grade factories have taken over in 2021 it was everyone’s last opportunity to mine so they went all in and took advantage but they halved btc and no one can compete now

ripe vortex Dec 18, 2024, 3:15 AM

#

And if I contract a freelancer to train a ia model?

#

How much it will cost?

serene scaffold Dec 18, 2024, 3:20 AM

#

ripe vortex And if I contract a freelancer to train a ia model?

this is like asking "how much would it cost to contract a programmer to make me a program?"

jaunty helm Dec 18, 2024, 3:21 AM

#

tbh if you're not doing this regularly, I heard from others that just renting is cheaper

serene scaffold Dec 18, 2024, 3:22 AM

#

jaunty helm tbh if you're not doing this regularly, I heard from others that just renting is...

renting GPU compute?

jaunty helm Dec 18, 2024, 3:22 AM

#

serene scaffold renting GPU compute?

yes

serene scaffold Dec 18, 2024, 3:23 AM

#

jaunty helm yes

if you value your time, renting GPU compute is cheaper than any roundabout solution one might come up with (but they probably won't come up with one)

jaunty helm Dec 18, 2024, 3:23 AM

#

like runpod or something
it's definitely cheaper than buying a what now, 1k 2k dollars 4090?

serene scaffold Dec 18, 2024, 3:24 AM

#

jaunty helm like runpod or something it's definitely cheaper than buying a what now, 1k 2k d...

at least you can also use it for gaming
but then you can't game while your model trains, which is a non-starter.

ripe vortex Dec 18, 2024, 3:26 AM

#

jaunty helm tbh if you're not doing this regularly, I heard from others that just renting is...

where i can rent a GPU?

serene scaffold Dec 18, 2024, 3:27 AM

#

ripe vortex where i can rent a GPU?

google colab

ripe vortex Dec 18, 2024, 3:28 AM

#

ok

fervent bridge Dec 18, 2024, 4:49 AM

#

https://github.com/RebornBeat/Time-Series-DRL-Rocket-Control-System

GitHub

GitHub - RebornBeat/Time-Series-DRL-Rocket-Control-System: A sophis...

A sophisticated deep learning-based rocket control system that combines time series prediction with real-time trajectory optimization. - GitHub - RebornBeat/Time-Series-DRL-Rocket-Control-System: ...

#

Is any system currently utilizing a Time Series DRL for Rocket Control Systems? I have to refine it but the workflow how does it look?

deep veldt Dec 18, 2024, 10:11 AM

#

How can i caluclate the similarity and distance in a siamese network can someone help me?

toxic palm Dec 18, 2024, 2:14 PM

#

I am running python code in jupyter notebook created by docker with below file
Everything is working fine, just one doubt.
Here i am creating 2 fodlers named ./files/data & ./files in the container right?
When i open the jupyter notebook, only seeing /work folder.
It's just confusing. I never said to create /work folder, but it got created & on the other hand, ./files/data & ./files are not created.

version: '3'
services:
  spark:
    image: jupyter/pyspark-notebook
    user : root
    ports:
      - "8888:8888"  # Jupyter Notebook
      - "4040:4040"  # Spark UI
    volumes:
      - ./files/data:/root/pyspark_simple_end_to_end_project/data
      - ./files:/root/pyspark_simple_end_to_end_project/jupyter_note_book_files
    environment:
      - JUPYTER_ENABLE_LAB=yes
    command: start.sh jupyter lab --NotebookApp.token=''

serene scaffold Dec 18, 2024, 2:23 PM

#

toxic palm I am running python code in jupyter notebook created by docker with below file E...

this is more of a #tools-and-devops question

weary timber Dec 18, 2024, 4:09 PM

#

im trying to make a drawing board that reprocess the image to be centered before feeding forward but cant manage to get it work

#

can someone help me with that?

weary timber Dec 18, 2024, 6:19 PM

#

weary timber im trying to make a drawing board that reprocess the image to be centered before...

pls help me with this

rich moth Dec 18, 2024, 6:45 PM

#

lethal gull Mining is pointless now industrial grade factories have taken over in 2021 it wa...

They took their shot in '21, and now they say it's over? Bah! Crypto's core was about everyone diggin' for treasure, not just the fat cats with their fancy ships. A new tide could come in, a new way to mine. Then we'll see who's laughin'

bitter harbor Dec 18, 2024, 7:33 PM

#

Is there a way to format pandas to_latex by column

table_list = self.df.rename(columns=self.get_headers()).astype("float").to_latex(
  # right align columns, add || between measurement and calculation columns
  column_format="r" * len(self.measurements) + "||" + "r" * len(self.calculations),
  float_format="{:0.3E}".format,
  index=False,
  position="ht"
).split("\n")```
Id like the first three to be floats (without trailing zeros) but id rather not concat the latex

serene scaffold Dec 18, 2024, 8:59 PM

#

bitter harbor Is there a way to format pandas `to_latex` by column ```py table_list = self.df....

there's a formatter kwarg of type dict[column_name, function], so you can have the function be "{:0.3E}".format for some columns and "{:0.2f}".format for others, or whatever you have in mind.

inland crown Dec 19, 2024, 2:26 AM

#

As I understand, you use a ML training model to make a joblib file then you use the joblib file to perform the function after it's been trained. In a crypto trading bot that continuously runs, for days+, can the ML training module be called say, every hour, to re-train on the last hour's data or is the joblib file static and once it's called, can no longer be modified.

left tartan Dec 19, 2024, 3:07 AM

#

inland crown As I understand, you use a ML training model to make a joblib file then you use ...

You build a model and serialize that model (using whatever is appropriate, joblib, or PyTorch.save, etc). You may use the model, or retrain and use a new model (which you'd serialize again either overwriting or creating a new file)

serene scaffold Dec 19, 2024, 3:27 AM

#

@bitter harbor did it work?

bitter harbor Dec 19, 2024, 3:29 AM

#

serene scaffold <@359028858344833025> did it work?

had to change around how i was replacing the headers but yes ty

serene scaffold Dec 19, 2024, 3:29 AM

#

bitter harbor had to change around how i was replacing the headers but yes ty

yay
now I can finally be happy

bitter harbor Dec 19, 2024, 3:37 AM

#

speaking of being happy, im formatting the var names in the headers myself but dont love having a constant max char len, is there a way to get latex to 'evenly' wrap so the table fits the page width?
Like I would be happy with Mass times Lever\\ Arm for m_2 but not Mass times Lever Arm for \\m_2

#

def get_header(self) -> str | None:
        if isinstance(self, Constant):
            return None

        squished_name = self.squash_name()
        uncertainty = rf"\pm{self.uncertainty}" if self.uncertainty is not None else ""
        header_base = rf"{squished_name}\\${self.symbol}{uncertainty}$\\({self.unit})"

        return r"\thead{" + header_base + "}"

# I don't like this, but I couldn't figure out how to do it with latex
def squash_name(self) -> str:
    name: list[str] = []
    words = self.name.split(" ")

    for word in words:
        if len(name) != 0 and len(word + name[-1]) < 10:  # Put words on same line if < 10 chars combined
            name[-1] = name[-1] + " " + word

        else:
            name.append(word)

    return r"\\".join(name)```The other two lines are constant, only thought ive had is to try to minimize the dimensions of the text but that sounds like a lot of work and i dont know how well it would play with the tabular

inland crown Dec 19, 2024, 4:16 AM

#

left tartan You build a model and serialize that model (using whatever is appropriate, jobli...

THANK YOU BB. Alas, the question remains. Can I keep rebuilding, reloading, reusing, relearning, rebuilding, reloading inloop on a continuously running script?

inland crown Dec 19, 2024, 4:19 AM

#

rich moth They took their shot in '21, and now they say it's over? Bah! Crypto's core was...

This reminds me of the actual progression of mining. Individuals dig to find the gold, big companies come in and take over then later, individuals pick it up again when the big guys start overlooking the small pockets and the cycle continues. Main difference, different tools. The fat cats were the ones who could own all the fancy new tools. This time around, the programmers making the tools are using them for themselves. And they become the next fat cats.... There's an algorithm there somewhere 😉

serene scaffold Dec 19, 2024, 4:29 AM

#

inland crown This reminds me of the actual progression of mining. Individuals dig to find the...

if you want to talk about the crypto economy, please go to an off-topic channel.

inland crown Dec 19, 2024, 4:30 AM

#

lol.. exactly what I'm talking about.. I asked a question and got no answer. I engage in someone elses comment, and then you say this.....

#

but to me..

#

lol

#

why not them?

serene scaffold Dec 19, 2024, 4:31 AM

#

inland crown lol.. exactly what I'm talking about.. I asked a question and got no answer. I e...

This channel is not for the discussion of crypto. I'm muting you if this continues.

inland crown Dec 19, 2024, 4:31 AM

#

it;s for data science and AI... So it pertains.

serene scaffold Dec 19, 2024, 4:31 AM

#

you can DM @sonic vapor if you have an issue with our moderation practices

serene scaffold Dec 19, 2024, 4:31 AM

#

inland crown it;s for data science and AI... So it pertains.

crypto is not part of DS/AI

inland crown Dec 19, 2024, 4:32 AM

#

trading it with AI IS!

#

SERIOUSLY!

serene scaffold Dec 19, 2024, 4:32 AM

#

inland crown This reminds me of the actual progression of mining. Individuals dig to find the...

this message is a tangent about the crypto economy.

inland crown Dec 19, 2024, 4:32 AM

#

If I'm using python and AI and ML to make a bot (lets say it's not a trading bot but a research bot) then it absoutly DOES belong here, just becuase I"m researching crypto doesn't make it not pertain!

#

And where did that tangent come from?

#

what did I REPLY too?

#

THAT was relevant?

serene scaffold Dec 19, 2024, 4:34 AM

#

DM @sonic vapor if you have any questions about this. Please make sure that all your subsequent messages in this channel are only about data science and AI.

inland crown Dec 19, 2024, 4:34 AM

#

Yet, no answer for my question though....

#

Is making a bot that monitors crypto to look for specific patterns "data science"? Answer honestly now...

native temple Dec 19, 2024, 6:25 AM

#

Think anyone could give me some guidance on this problem I am facing, with GPT-2 Fine tune.

So I am fine tuning GPT-2 on some conversational data scraped by reddit: https://www.kaggle.com/datasets/jerryqu/reddit-conversations with transformers.
The problem I am facing is with my output sentences being truncated (abruptly cut off),

Prompt: What is your favorite type of food?
Output: I like bananas, but I have a particular dislike for onion rings. I really dislike them. And (take note of the "And")

This is my tokenize function for both training and testing: (I have a feeling the problem is because of this max_length param here, please correct me if I am mistaken.)

def tokenize_data(data, tokenizer, max_length=1024):
    return tokenizer(data, return_tensors="pt", padding=True, truncation=True, max_length=max_length)

This is how I format my data after cleaning:

dialog_data.append(f"<USER> {user}{tokenizer.eos_token} <BOT> {bot}{tokenizer.eos_token}")

Then I pass dialog_data into the tokenize_data function.

Then I pass the tokenized data into get_train_eval_data function:

def get_train_eval_data(tokenized_data, test_size=0.2, random_state=42):
    input_ids = tokenized_data["input_ids"]
    attention_mask = tokenized_data["attention_mask"]

    train_input_ids, eval_input_ids, train_attention_mask, eval_attention_mask = train_test_split(
        input_ids, attention_mask, test_size=test_size, random_state=random_state
    )

    train_dataset = TensorDataset(train_input_ids, train_attention_mask)
    eval_dataset = TensorDataset(eval_input_ids, eval_attention_mask)

    return train_dataset, eval_dataset

Then finally, those get passed into the eval_dataset and train_dataset params of the Trainer class.

Reddit Conversations

Length 3 Comment Sequences from r/CasualConversation

#

And on the testing side, this is my generate call:

def generate_response(prompt, model, tokenizer, temperature=0.7):
    inputs = tokenize_data(prompt, tokenizer).to("cuda")
    
    output = model.generate(
        inputs.input_ids,
        pad_token_id=tokenizer.eos_token_id,
        eos_token_id=tokenizer.eos_token_id,
        do_sample=True,
        temperature=temperature,
    )
    
    return tokenizer.decode(output[0], skip_special_tokens=True)

And my prompt:

prompt = f"<USER> {input("Enter Prompt: ")}{tokenizer.eos_token} <BOT>"
response = generate_response(prompt, model, tokenizer)

print("Response:", response)

bitter oyster Dec 19, 2024, 6:44 AM

#

is this the right channel to ask for help on code for ai speech recognition?

odd meteor Dec 19, 2024, 11:25 AM

#

native temple Think anyone could give me some guidance on this problem I am facing, with GPT-2...

Since you already set padding to True, I think you should also try using the same tokenizer that was used on the pretrained GPT-2 model (if you're not doing so yet)

Once you've loaded and initialized the tokenizer, try to print the max_length of the input tokens to confirm if it's really 1024.

Once you've known the max length, use that same max_length in your utility function for tokenization.

odd meteor Dec 19, 2024, 11:33 AM

#

bitter oyster is this the right channel to ask for help on code for ai speech recognition?

Yes

deep veldt Dec 19, 2024, 12:08 PM

#

what's the keepdim parameter of functional.pairwise_distance?

royal seal Dec 19, 2024, 2:55 PM

#

hello

#

i would like to hear an opinion regarding my planned graduation project

limpid zenith Dec 19, 2024, 3:04 PM

#

royal seal i would like to hear an opinion regarding my planned graduation project

sure what's ur project?

serene scaffold Dec 19, 2024, 3:07 PM

#

royal seal i would like to hear an opinion regarding my planned graduation project

Hello. When you ask a question, be sure to ask your whole question all at once.

untold bloom Dec 19, 2024, 3:11 PM

#

after aggregation, you lose a dimension (over which the aggregation was performed), keepdim lets you put a placeholder dimension of 1 there instead of it disappearing

#

not specific to torch, numpy has that to for appropiate functions and they call it keepdimS

#

why use it: may be you'll do something with the result afterwards that is more convenient if it has that extra dimension

#

example: arr := (N, D) array, arr.sum(0) is of shape (D,), arr.sum(0, keepdims=True) is of (1, D)

rich moth Dec 19, 2024, 3:26 PM

#

inland crown it;s for data science and AI... So it pertains.

I agree with ya. Don't let these guys bother you. I mean my crypto project involves ML.. soo

worn steppe Dec 19, 2024, 5:51 PM

#

Guys, i trying to learn deep learning and machine learning but i need to get a better graphic card, i get a rtx 3060 ti 12GB or a rtx 4060 ti 8gb?

upbeat prism Dec 19, 2024, 5:52 PM

#

worn steppe Guys, i trying to learn deep learning and machine learning but i need to get a b...

you don't need one

worn steppe Dec 19, 2024, 5:52 PM

#

upbeat prism you don't need one

Oh.

upbeat prism Dec 19, 2024, 5:53 PM

#

ofc depends on what oyu wanna do but for just learning basic DL you don't need one, just use your CPU. and if you really need a gpu you can use google colab or some free service

#

if you just wanna train big networks then you need a GPU sure but I mean for learning you don't need it.

serene scaffold Dec 19, 2024, 5:53 PM

#

and if you want to train a "big network", it probably requires an enterprise-tier GPU (not a gaming one)

wild solar Dec 19, 2024, 5:54 PM

#

Hello everyone, I have an idea to write a python package for automating the machine learning process, the process that i have in mind is as follows:

Filling (or dropping) missing values.

Auto Feature Selection

Splitting the data in multiple forms (for multiple training tests)

Training multiple models on the test set

Auto-Hyper parameter tuning

Picking the best performing model

The code would look something like this:

model = AutoML(X, y, **optional_parameters) # AutoML is just a placeholder

X: dataset without the target
y: target

I have the skills to work and finish such a project, but I need your feedback, from my research i found that there are multiple python packages that can do the same thing, like MLJAR AutoML, Auto-sklearn, PyCaret, etc.

But I read multiple reddit threads that complaint about the limitations of these packages, and i am willing to fill these gaps in my package.

So, what do you think? should I commit to this project? or should I put my efforts into something else?

worn steppe Dec 19, 2024, 5:54 PM

#

Ok, thanks guys.

#

Do you have a course to start on neural networks?

upbeat prism Dec 19, 2024, 5:55 PM

#

worn steppe Ok, thanks guys.

if your math is good and you wanna actually learn the theory: https://www.deeplearningbook.org/ otherwise just go and start with the pytorch tutorial ^^ and do some project (classify dogs or whatever)

worn steppe Dec 19, 2024, 5:57 PM

#

upbeat prism if your math is good and you wanna actually learn the theory: https://www.deeple...

Ok, thanks.

#

have a good day.

upbeat prism Dec 19, 2024, 5:58 PM

#

worn steppe Ok, thanks.

I'd try to learn the basic concept. A very nice video for that is https://www.youtube.com/watch?v=aircAruvnKk https://www.youtube.com/watch?v=IHZwWFHWa-w and other videos of that guy.

worn steppe Dec 19, 2024, 5:59 PM

#

i gonna see this when i go to my home, i on the work now, and i need focus, english is not my first language.

#

I have some dificulty

upbeat prism Dec 19, 2024, 6:06 PM

#

worn steppe I have some dificulty

well you'll get better with time 😄 also if the math is way too complex for you: If you really wanna learn it, you can. It's very doable. But there's also a very applied side to deep learning where you don't need much math (but that's veeeery shallow)

royal seal Dec 19, 2024, 6:27 PM

#

limpid zenith sure what's ur project?

I'm doing a recommendation system as a graduation project with my two project partners we want to make a hybrid system that includes a content-based, collaborative and demographic recommendations module while utilizing reinforcement learning to keep up with new users data

as none of us has any experience we thought we would like to turn professional communities for advice

we started preprocessing on the data but have fears toward actually being able to implement everything before the deadline February 12th and if there's any integration limitation that we should know of before starting to train the modules so that we wouldn't have to start from scratch after we finish a module cause we should've done something differently if we were going for an hybrid system

royal seal Dec 19, 2024, 6:29 PM

#

serene scaffold Hello. When you ask a question, be sure to ask your whole question all at once.

Ok, i'm new here and wasn't sure if it's appropriate to ask here

odd meteor Dec 19, 2024, 7:22 PM

#

royal seal I'm doing a recommendation system as a graduation project with my two project pa...

If you're unsure about finishing before the deadline and it's crucial to submit a complete P.O.C., you might want to explore Meta's 2024 ICML paper on recommendation systems. They introduced HSTU, a new architecture designed for high-cardinality, streaming data, achieving state-of-the-art performance and efficiency.

The paper highlights how Generative Recommenders can scale to massive datasets while significantly reducing computational costs.

Check out the paper and the code therein and see if it's something that interests you.

https://arxiv.org/abs/2402.17152

arXiv.org

Actions Speak Louder than Words: Trillion-Parameter Sequential Tran...

Large-scale recommendation systems are characterized by their reliance on high cardinality, heterogeneous features and the need to handle tens of billions of user actions on a daily basis. Despite being trained on huge volume of data with thousands of features, most Deep Learning Recommendation Models (DLRMs) in industry fail to scale with compu...

royal seal Dec 19, 2024, 7:25 PM

#

odd meteor If you're unsure about finishing before the deadline and it's crucial to submit ...

i will look into it

#

also no we don't need a complete P.O.C

#

it's just that me and my team are basically fumbling in the dark without prior experience so i just wanted to seek for guidance

#

thanks for your reply

bitter oyster Dec 19, 2024, 9:03 PM

#

i havent been able to figure out why my code is not working, i'm trying to set up basic voice recognition so that when i say "what time is it", it will respond with the current time. i followed a tutorial on how to do it but it isn't picking up my voice

mighty lake Dec 19, 2024, 9:15 PM

#

anyone have experience with time series analysis/forecasting, arima models, or regression? if so, please give me book or resource recommendations that are easy to understand. I want to learn but the math and wording is so complicated everywhere I look. I suck at reading and understanding the conceptual math most of these books have. @ me replies or go to #1319410226088382576

left tartan Dec 19, 2024, 9:20 PM

#

mighty lake anyone have experience with time series analysis/forecasting, arima models, or r...

There's this pinned message, it's a bit old so let us know if anything is a dead link: #data-science-and-ml message

rich moth Dec 19, 2024, 9:28 PM

#

mighty lake anyone have experience with time series analysis/forecasting, arima models, or r...

perhaps this https://videolectures.net/videos/kdd2019_chen_pechyoni_taylor

Deep Learning for Time Series Forecasting

Time series forecasting is one of the most important topics in data science. Almost every business needs to predict the future in order to make better decisions and allocate resources more effectively. Examples of time series forecasting use cases are: financial forecasting, product sales forecasting, web traffic forecasting, energy demand forec...

untold bloom Dec 19, 2024, 9:33 PM

#

https://otexts.com/fpp3/

severe juniper Dec 19, 2024, 9:50 PM

#

bitter oyster i havent been able to figure out why my code is not working, i'm trying to set u...

Wheres the problem? In speech recognition?

#

What library are you using for it?. I tried smthn similar

bitter oyster Dec 19, 2024, 10:05 PM

#

its not taking microphone input, using libraries pyttsx3 and speech_recognition, heres the code: ```import pyttsx3
import speech_recognition as sr
from datetime import datetime
engine = pyttsx3.init()
engine.say("Hello Sir, how can I help you")
engine.runAndWait()

now = datetime.now()
current_time = now.strftime("%H:%M:%S")

def takecommand():
command = sr.Recognizer()
with sr.Microphone() as source:
print("Listening...")
command.adjust_for_ambient_noise(source)
command.pause_threshold = 1
audio = command.listen(source)

    try:
        print("Recognizing...")
        query = command.recognize.google(audio, language="en-in")
        print("You said", query)
        if 'what time is it' in query:
            engine = pyttsx3.init()
            engine.say(current_time)
            engine.runAndWait()
    except Exception as Error:
        return None
    return query

takecommand()

latent leaf Dec 19, 2024, 10:14 PM

#

How can I learn to make a very very simple ai chatbot?

I have a programmer background very little data science--but I guess I'm trying to figure out how to make my own very very simple AI data models. Or at least learn how it's done

Example: chat bot that only has knowledge of automotives

serene scaffold Dec 19, 2024, 11:05 PM

#

latent leaf How can I learn to make a very very simple ai chatbot? I have a programmer back...

a chat bot that can respond to questions about only one topic, and responds with plausible answers for that one topic domain, is still very advanced and not a good beginner project.

midnight rain Dec 19, 2024, 11:18 PM

#

latent leaf How can I learn to make a very very simple ai chatbot? I have a programmer back...

Start with a simple prediction model

midnight rain Dec 19, 2024, 11:19 PM

#

latent leaf How can I learn to make a very very simple ai chatbot? I have a programmer back...

Not chatbots which includes NLP, DL and LLMs lol

serene scaffold Dec 19, 2024, 11:25 PM

#

midnight rain Not chatbots which includes NLP, DL and LLMs lol

chatbots are part of NLP by definition, but I'm not aware of any question-answering bot techniques that are easy for beginners.

serene scaffold Dec 19, 2024, 11:25 PM

#

midnight rain Start with a simple prediction model

this is good advice.

midnight rain Dec 19, 2024, 11:26 PM

#

serene scaffold chatbots are part of NLP by definition, but I'm not aware of any question-answer...

Idk man i am still learning, my biggest problem was going right into ML and DL while not knowing the statistics fully 😭

serene scaffold Dec 19, 2024, 11:26 PM

#

midnight rain Idk man i am still learning, my biggest problem was going right into ML and DL w...

learn stats
all my homies learn stats

midnight rain Dec 19, 2024, 11:27 PM

#

serene scaffold learn stats all my homies learn stats

Yeah im doing rn, im learning distributions rn… i have a long way to go, i should do it as soon as possible, going to get ready for AI Olympiads

midnight rain Dec 19, 2024, 11:27 PM

#

midnight rain Yeah im doing rn, im learning distributions rn… i have a long way to go, i shoul...

And im so stressed man

#

Have to learn a lot in one month lol

umbral charm Dec 20, 2024, 12:43 AM

#

Hey just a quick question

#

when doing countour plots

#

is it possible to get the value of the countour as long as your x and y cord in when you hover above the interatice plot?

carmine grove Dec 20, 2024, 12:57 AM

#

I'm working on a project with assemblyai with the transcribe live audio streams feature, realized you can't set it to a different language, it only works with english. Is there an alternative to assemblyai that works with spanish? Does anybody know? I can't seem to find anything that works as good as assemblyai.

bitter oyster Dec 20, 2024, 2:12 AM

#

is there a way to have speech recognition use a wake word?

carmine grove Dec 20, 2024, 2:24 AM

#

@bitter oyster To use wake word, you can do something like this. This uses regex to match your wake word, if found gets everything after your wake word plus wake word, you can then pass that to your ai assistant.

def extract_prompt(transcribed_text, wake_word):
    pattern = rf'\b{re.escape(wake_word)}(.*)'
    match = re.search(pattern, transcribed_text, re.IGNORECASE)

    if match:
        prompt = match.group(1).strip()
        return prompt
    else:
        return None

prompt = extract_prompt
if prompt:
    do_stuff()

bitter oyster Dec 20, 2024, 3:17 AM

#

carmine grove <@893702108702986260> To use wake word, you can do something like this. This us...

do i need to change anything to add to my current code? ```import speech_recognition as sr
import pyaudio
import pyttsx3
import time
from datetime import datetime

now = datetime.now()
current_time = now.strftime("%I:%M:")
current_date = now.strftime("%A:%B:%d")

This tells the AI to Speak

def speak(text):
engine = pyttsx3.init('sapi5')
voices = engine.getProperty('voices')
engine.setProperty('voice',voices[0].id)
print("X.E.N.O.N.:" + text + "\n")
engine.say(text)
engine.runAndWait()

This states how to process audio

def takeCommand():
r = sr.Recognizer()
with sr.Microphone() as source:
print("Listening...", end="")
audio = r.listen(source)
query = ""

    try:
        print("Recognizing...",end="")
        query = r.recognize_google(audio,language='en-US')
        print(f"User said: {query}")
        
    except Exception as e:
        print("Exception:"+str(e))
        
return query.lower()

def main():
Talk = True
while Talk == True:
userSaid = takeCommand()
if "hello" in userSaid:
speak("hello sir")
if "bye" in userSaid:
speak("goodbye sir")
if "how are you" in userSaid:
speak("doing well sir")
if "stop" in userSaid:
speak("stopping sir")
break
if "exit" in userSaid:
speak("ending process")
if "what time is it" in userSaid:
speak(current_time)
if "what day is it" in userSaid:
speak(current_date)

    if "open my email" in userSaid:
        speak("Maybe you should finish your program before trying to use it")
        
    time.sleep(1)

main()

weak oxide Dec 20, 2024, 3:36 AM

#

Is there a way to get FRED data without APi? I couldn't find a Internet tutorial on it.

rich moth Dec 20, 2024, 4:05 AM

#

!paste

arctic wedgeBOT Dec 20, 2024, 4:05 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

carmine grove Dec 20, 2024, 4:26 AM

#

@bitter oyster you just need to check for wake word after you get transcript, after calling takeCommand, then continue if wake word, otherwise do nothing

inland crown Dec 20, 2024, 4:59 AM

#

Have you seen NVIDIA's latest mini supercomputer? 70 TRILLION computations a second. In the palm of your hand. 15 watts. $249 It can run python scripts and can run standalone AI models. In this video (I hope it's ok to share this) he uses one with a python script to monitor his driveway camera to identify guests and vehicles. Then he adds llama3.2 to it to interact with his script for the vocal call outs.

Is anyone using any of the Jetson computers or similar to run their intensive code?
https://youtu.be/QHBr8hekCzg

YouTube

Dave's Garage

NVIDIA's $249 Secret Weapon for Edge AI - Jetson Orin Nano Super: D...

FREE GIVEAWAY OF JENSEN-HUANG-SIGNED ORIN NANO SUPER! See Below!
Join Dave as he explores NVIDIA's Jetson Orin Nano Super, a compact AI powerhouse with 1024 CUDA cores and 6 ARM cores for just $249. Learn why this could be the best AI board for your projects in robotics, IoT, or AI development. Free Sample of my Book on the Spectrum: https://a...

▶ Play video

unkempt apex Dec 20, 2024, 5:20 AM

#

untold bloom <https://otexts.com/fpp3/>

this is too good !!, nice

pseudo ledge Dec 20, 2024, 5:48 AM

#

I'm trying to scrap the data below this 'load more' button, but I can't do so. Can someone help with it? code - https://paste.pythondiscord.com/4NUA

#

WhatsApp_Image_2024-12-19_at_12.55.44_85a76240.jpg

grand minnow Dec 20, 2024, 6:07 AM

#

pseudo ledge

what online AI is this and do they have an API that you can use?

pseudo ledge Dec 20, 2024, 6:08 AM

#

grand minnow what online AI is this and do they have an API that you can use?

https://www.motorsport.com/live/, i'm trying to extract commentary data

F1 Live Text Commentaries - Updates from Every Session

Live commentary and updates from every session of every Formula 1 race, written by our team of experts and journalists. Click here for more.

grand minnow Dec 20, 2024, 6:11 AM

#

pseudo ledge https://www.motorsport.com/live/, i'm trying to extract commentary data

Its unfortunate that they don't have an API. Even more unfortunate that their Terms of Service says to not scrape data.

#

So as per the server rules, can't help

#

!rule tos says so

arctic wedgeBOT Dec 20, 2024, 6:11 AM

#

Rules

5. Do not provide or request help on projects that may violate terms of service, or that may be deemed inappropriate, malicious, or illegal.

pseudo ledge Dec 20, 2024, 6:14 AM

#

grand minnow Its unfortunate that they don't have an API. Even more unfortunate that their Te...

Ohh okay, I didn't looked at it, I needed it to train the AI model, Are there any alternatives?

torpid ginkgo Dec 20, 2024, 7:21 AM

#

anyone here familiar with model training for computer vision?

odd meteor Dec 20, 2024, 8:22 AM

#

torpid ginkgo anyone here familiar with model training for computer vision?

Don't ask a question to ask a question. Ask your question with all the details one would need to be able to answer your question right away.

torpid ginkgo Dec 20, 2024, 8:31 AM

#

sorry my bad. I need assistance on how to do code relating model training for computer vision. For face recognition specifically. I've ran my codes but I dont get the output which supposedly have graphs

left tartan Dec 20, 2024, 1:19 PM

#

torpid ginkgo sorry my bad. I need assistance on how to do code relating model training for co...

Open a help thread and share you code and output: #❓｜how-to-get-help . Hard to assist without seeing the specific problem

wraith wagon Dec 20, 2024, 1:45 PM

#

hey, im looking for a mlops tool to track image processing pipeline/steps that were applied, is this possible to achieve with mlflow? as far as Ive checked there are mlflow recipes available, but Im not sure if it tracks steps applied

ocean hinge Dec 20, 2024, 2:24 PM

#

Hello, can anyone recommend me a good tutorial/book/course on data engineering? Specifically Azure if possible. I have started data science recently and would like to get the basic or intermediate info on data engineering.

deep veldt Dec 20, 2024, 2:55 PM

#

Why doesnt pytorch have a builtint contrastive loss?

serene scaffold Dec 20, 2024, 2:57 PM

#

deep veldt Why doesnt pytorch have a builtint contrastive loss?

I'm not sure; do you need help implementing your own loss function?

deep veldt Dec 20, 2024, 2:58 PM

#

serene scaffold I'm not sure; do you need help implementing your own loss function?

yes, i saw a bunch of them online but im not sure if they're the correct implementation

serene scaffold Dec 20, 2024, 2:59 PM

#

deep veldt yes, i saw a bunch of them online but im not sure if they're the correct impleme...

what's the formula, and what is the code for the implementation that you saw?

worldly terrace Dec 20, 2024, 3:00 PM

#

I have decided I'm going to make my Own LLM from Scratch, and then eventually implement higher functions into it to allow it to become AGI.

serene scaffold Dec 20, 2024, 3:00 PM

#

worldly terrace I have decided I'm going to make my Own LLM from Scratch, and then eventually im...

you will never have enough computation power at your disposal to do this.

deep veldt Dec 20, 2024, 3:01 PM

#

serene scaffold what's the formula, and what is the code for the implementation that you saw?

class ContrastiveLoss(torch.nn.Module):
    def __init__(self, margin=2.0):
        super(ContrastiveLoss, self).__init__()
        self.margin = margin

    def forward(self, output1, output2, label):
      # Calculate the euclidean distance and calculate the contrastive loss
      euclidean_distance = F.pairwise_distance(output1, output2, keepdim = True)

      loss_contrastive = torch.mean((1-label) * torch.pow(euclidean_distance, 2) +
                                    (label) * torch.pow(torch.clamp(self.margin - euclidean_distance, min=0.0), 2))


      return loss_contrastive

from here https://datahacker.rs/019-siamese-network-in-pytorch-with-application-to-face-similarity/

Master Data Science

datahacker.rs

# 019 Siamese Network in PyTorch with application to face similarity

worldly terrace Dec 20, 2024, 3:01 PM

#

serene scaffold you will never have enough computation power at your disposal to do this.

Never say never. The dumpster behind Game Stop might have some goodies.

serene scaffold Dec 20, 2024, 3:02 PM

#

worldly terrace Never say never. The dumpster behind Game Stop might have some goodies.

I'm saying never and I mean it. only exceptionally large, wealthy companies have enough computation power to do this. you cannot do it by networking together a bunch of consumer-grade GPUs. if you want to learn more about AI and LLMs, you must pick a more attainable project.

worldly terrace Dec 20, 2024, 3:05 PM

#

serene scaffold I'm saying never and I mean it. only exceptionally large, wealthy companies have...

Oh my Lord every which way I have an asperation there's some smart allik to shoot me down. Who cares if it's under powered or super basic in comparison to what the companies do? Half the biggest names in tech development started in their garage with practically nothing. I'm tired of the pessimism and cynicism. Why don't actually give me something that might help?

wild solar Dec 20, 2024, 3:06 PM

#

worldly terrace Oh my Lord every which way I have an asperation there's some smart allik to shoo...

I like your attitude

serene scaffold Dec 20, 2024, 3:07 PM

#

worldly terrace Oh my Lord every which way I have an asperation there's some smart allik to shoo...

By definition, an AGI cannot be "super basic".
There are things you can do with AI that are attainable with the hardware and resources available to you, and I'm happy to help you with those. I can't help you do things that you won't be able to do.

serene scaffold Dec 20, 2024, 3:09 PM

#

deep veldt ```py class ContrastiveLoss(torch.nn.Module): def __init__(self, margin=2.0)...

I'm still looking at this. one moment..

worldly terrace Dec 20, 2024, 3:09 PM

#

serene scaffold By definition, an AGI cannot be "super basic". There are things you can do with ...

AGI doesn't even have a solid deffinition and doesn't nessasarily mean singularity. Like, computer programs can and have been made to manipulate other programs and they run on the most basic of laptops

#

An LLM with some other higher functions added in is at least a bare bones AGI. It's a baby AGI.

wild solar Dec 20, 2024, 3:12 PM

#

worldly terrace I have decided I'm going to make my Own LLM from Scratch, and then eventually im...

imo in order to achieve what big companies were able to achieve by training LLMs on super computer, you need to invent a new approach to achieve this.

serene scaffold Dec 20, 2024, 3:13 PM

#

@worldly terrace you can potentially fine-tune (but not create from scratch) an LLM with hardware resources that you can obtain or rent. And you can also experiment with task-specific prompt engineering, without having to fine tune.

wild solar Dec 20, 2024, 3:15 PM

#

serene scaffold <@1220815493741543588> you can potentially fine-tune (but not create from scratc...

additionally, @worldly terrace can write multiple functions for the fine tuned LLM can use which gives it the ability to do advanced tasks.

worldly terrace Dec 20, 2024, 3:15 PM

#

wild solar imo in order to achieve what big companies were able to achieve by training LLMs...

My main Idea is to bruit force it with anything and everything I can find in dumpster and garage sales. I don't care if it'd take hundreds or even thousands of components, this is my aspiration in the tech feild. Plus I'd be cleaning up a whole lot of toxic e waste. Even the disposable vapes have LCDs and minor computer parts.

serene scaffold Dec 20, 2024, 3:16 PM

#

worldly terrace My main Idea is to bruit force it with anything and everything I can find in dum...

if you want to frame this as something environmentally-aware, you'll need to take the energy consumption into account

worldly terrace Dec 20, 2024, 3:18 PM

#

serene scaffold if you want to frame this as something environmentally-aware, you'll need to tak...

The e waste cleaning is just a bonus and wasn't really a primary focus smarty pants. Also you know how easy it is to make an AC current generator out of speaker magnets, scrap copper and an old trash bycicle?

#

Literally half if not everything I'd need can be found in the trash

serene scaffold Dec 20, 2024, 3:20 PM

#

worldly terrace The e waste cleaning is just a bonus and wasn't really a primary focus smarty pa...

Please re-read the #code-of-conduct. The way you're engaging here is not appropriate.
I realize that starting with "you will never have enough computation power at your disposal to do this" was stark, but I am actually trying to help you.

worldly terrace Dec 20, 2024, 3:23 PM

#

I'm disengaging. Once again I finally feel like I can achieve something and no one wants to genuienly help me. No one ping me I'm taking a long break.

serene scaffold Dec 20, 2024, 3:25 PM

#

@deep veldt I think their implementation is right, but I re-wrote it to look more like the formula.

def forward(self, output1, output2, label):
    euclidean_distance = F.pairwise_distance(output1, output2, keepdim=True)
    left_term = label * (euclidean_distance ** 2)
    right_term = (1 - label) * torch.clamp((self.margin ** 2) - (euclidean_distance ** 2), min=0.0)
    return torch.mean(left_term + right_term)

odd meteor Dec 20, 2024, 3:25 PM

#

wraith wagon hey, im looking for a mlops tool to track image processing pipeline/steps that w...

Not sure there's a specific recipe in MLFlow intrinsically capable of tracking applied image processing steps w/o creating it yourself.

You can however, customize and log those steps yourself in MLFlow like you would if you wanna track stuff like adding texts, tags, plots, and other artifacts etc.

odd meteor Dec 20, 2024, 3:48 PM

#

deep veldt Why doesnt pytorch have a builtint contrastive loss?

When it comes to Contrastive Learning (a.k.a Relationship Learning) you'd have to write the loss function yourself in PyTorch.

I think this is because there are just so many type of contrastive learning methods available; hence, contrastive learning isn't tied to a single loss function.

For example if you were using SimCLR instead of Siamese network you'd still need to write the InfoNCE loss function yourself.

odd meteor Dec 20, 2024, 3:54 PM

#

worldly terrace I have decided I'm going to make my Own LLM from Scratch, and then eventually im...

Blud you need to have raised a huge amount of funds for you to be able to build your own LLM from scratch

More so, I for one believe we all should refrain from building AGI. Nobody is ready for the consequences that comes from attaining AGI

worldly terrace Dec 20, 2024, 4:00 PM

#

odd meteor Blud you need to have raised a huge amount of funds for you to be able to build ...

I regret having asked at all

jaunty helm Dec 20, 2024, 4:03 PM

#

worldly terrace I regret having asked at all

finetuning is still possible by individuals though
still, your hardware requirements only drops from nigh impossible to very high

worldly terrace Dec 20, 2024, 4:05 PM

#

jaunty helm finetuning is still possible by individuals though still, your hardware requirem...

I am not in the headspace right now.

ocean hinge Dec 20, 2024, 4:24 PM

#

ocean hinge Hello, can anyone recommend me a good tutorial/book/course on data engineering? ...

Anyone?

lapis sequoia Dec 20, 2024, 5:06 PM

#

ok, oh lord I have some questions. OK, with fine tuning bert, when the targets multi-labeled, do you use a differnet optimizer and is AdamW from transformers better than torch.optim.AdamW?

serene scaffold Dec 20, 2024, 5:17 PM

#

lapis sequoia ok, oh lord I have some questions. OK, with fine tuning bert, when the targets m...

AdamW is the optimizer that I have used for that; it shouldn't matter which implementation you use, because the algorithm is the same.

lapis sequoia Dec 20, 2024, 5:18 PM

#

it does not matter if the targets are binary or categorical?

serene scaffold Dec 20, 2024, 5:18 PM

#

I'm saying that it doesn't matter which of the Torch or transformers implementations of AdamW you use

lapis sequoia Dec 20, 2024, 5:18 PM

#

oh

serene scaffold Dec 20, 2024, 5:20 PM

#

it takes a lot of data to fine-tune BERT for multilabel. what does your performance look like currently? can you show a precision-recall-f1-support table?

lapis sequoia Dec 20, 2024, 5:20 PM

#

yeah, atm, I am trying to make a target variable based on wether or not the job posting is fake, so, I will cut that out and just show another example

serene scaffold Dec 20, 2024, 5:21 PM

#

lapis sequoia yeah, atm, I am trying to make a target variable based on wether or not the job ...

what are you training BERT to do?

lapis sequoia Dec 20, 2024, 5:21 PM

#

text classification

serene scaffold Dec 20, 2024, 5:22 PM

#

lapis sequoia text classification

right, but can you be more specific? what are all the possible classes?

lapis sequoia Dec 20, 2024, 5:22 PM

#

    def __init__(self, n_classes):
        super(Bert_Classifier,self).__init__()
        self.bert = BertModel.from_pretrained(MODEL_NAME)
        self.drop = nn.Dropout(p=0.3)
        self.out = nn.Linear(self.bert.config.hidden_size, n_classes)
    

    def forward(self, input_ids, attention_mask):
        outputs = self.bert(
        input_ids=input_ids,
        attention_mask=attention_mask
    )
        pooled_output = outputs.pooler_output
        output = self.drop(pooled_output)
        return self.out(output)



model = Bert_Classifier(2).to(device)
model```

serene scaffold Dec 20, 2024, 5:23 PM

#

!code

arctic wedgeBOT Dec 20, 2024, 5:23 PM

#

Formatting code on Discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

lapis sequoia Dec 20, 2024, 5:23 PM

#

serene scaffold right, but can you be more specific? what are all the possible classes?

the possible classes are "fake" or "real", but the current data cannot classifiy it but there are patterns in the text and I am trying to make it so it is binary depending on the statement

serene scaffold Dec 20, 2024, 5:24 PM

#

lapis sequoia the possible classes are "fake" or "real", but the current data cannot classifiy...

this is a binary classification task. did you think it was multilabel?

lapis sequoia Dec 20, 2024, 5:24 PM

#

I know, I just asked that to see if there was a difference

#

here, I will show what I am trying to do

serene scaffold Dec 20, 2024, 5:26 PM

#

for binary classification, the target for each instance is 0 or 1.
for multiclass (single label) classification with n classes, the target for each instance is a vector of n elements, where all elements are 0, except exactly 1 is 1, ie [0, 1, 0, 0]
for multiclass (multilabel) classification with n classes, the target is a vector that has a 1 for each label that the instance has, ie [0, 1, 1, 0, 1]

lapis sequoia Dec 20, 2024, 5:26 PM

#

df['description'] = df['description'].str.lower()



df['text'] = df['text'].str.lower()

import string

def remove_punctuations(text):
    return text.translate(str.maketrans('','',string.punctuation))

df['text'] = df['text'].apply(remove_punctuations)



sample_txt = " ".join(i for i in df['text'])
target_txt = " ".join(i for i in df['description'])




from nltk import FreqDist
from textblob import TextBlob

blob = TextBlob(target_txt).sentences


most_common_words = FreqDist(blob).most_common(50)
print("top 50 most common words",most_common_words)

phrases = ["earn $5000/week!","contact now at fsmith@hotmail.com."]

df['fraudulent'] = [1 if (X == df['description'] | df['description'] == phrases) else 0 for X in df['fraudulent']]

blob = TextBlob(target_txt).ngrams(5)






df['n_grams'] = df['text'].apply(n_grams_blob)

df['n_grams'] = df['n_grams'].str.lower()



def fake_text(blob):
    tokens = word_tokenize(phrases)
    found_phrases = [phrases for phrase in tokens]
    return " ".join(found_phrases)


df['fake_jobs'] = df['description'].apply(fake_text)
df['fake_jobs'].head(10)


df['fradulent'] = [for phrase in phrases if X == phrases return 1 else 0]

df['fradulent'].value_counts()


for phrase in phrases:
    if df['description'] == phrases:
        df['fradulent'].apply(phrases).astype(int)

#

again, it is extremly messy

serene scaffold Dec 20, 2024, 5:27 PM

#

arctic wedge

be sure to follow this

lapis sequoia Dec 20, 2024, 5:28 PM

#

sorry about that

lapis sequoia Dec 20, 2024, 5:30 PM

#

serene scaffold - for binary classification, the target for each instance is `0` or `1`. - for m...

so it is not just anything with labels > 2 ?

serene scaffold Dec 20, 2024, 5:31 PM

#

lapis sequoia so it is not just anything with labels > 2 ?

multiclass is when there's more than two classes, and multilabel is when it's multiclass, and each instance can belong to more than one class.

lapis sequoia Dec 20, 2024, 5:31 PM

#

ok, the optimizer and/or the loss function change, right?

serene scaffold Dec 20, 2024, 5:32 PM

#

not necessarily. loss functions and optimizers can be applied to any of those.

lapis sequoia Dec 20, 2024, 5:35 PM

#

ok, does the max_length alway matter?

serene scaffold Dec 20, 2024, 5:40 PM

#

lapis sequoia ok, does the max_length alway matter?

where in the code is max_length?

lapis sequoia Dec 20, 2024, 5:44 PM

#

oh, it is not in that code, the code I sent was just myself trying to find a way for the text data to have phrases that are repeated all of the time and then try to make it binary(and numerical) and apply it to a new class in a dataframe, yes binary for that one, I was trying something else with sentiments that is unrelated to the code I posted which is why I asked about multi-class text classification with bert.

serene scaffold Dec 20, 2024, 5:47 PM

#

lapis sequoia oh, it is not in that code, the code I sent was just myself trying to find a way...

I need to see the code that involves max_length to tell you what it is and why it matters.

lapis sequoia Dec 20, 2024, 5:48 PM

#

@serene scaffold do you want me to share a differnet example for that?

serene scaffold Dec 20, 2024, 5:49 PM

#

lapis sequoia <@253696366952316929> do you want me to share a differnet example for that?

whatever the code is that inspired you to ask about max_length, that's what I need to see.

lapis sequoia Dec 20, 2024, 5:49 PM

#

ok, I really rushed this this morning

#

import torch.nn as nn
import matplotlib.pyplot as plt
import numpy as np
import nltk
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
nltk.download("punkt")
nltk.download("stopwords")
nltk.download("wordnet")
from wordcloud import WordCloud
from textblob import TextBlob
from transformers import BertModel,BertTokenizer,AdamW,get_linear_schedule_with_warmup
import pandas as pd
import warnings
warnings.filterwarnings("ignore")




device = ("cuda:0" if torch.cuda.is_available() else "cpu")
print(device)

df.head(10)



df.isnull().sum()
df['Review'] = df['Review'].str.lower()

df.duplicated().sum()
df.drop_duplicates(inplace=True)

df['label'] = [1 if X == "POSITIVE" else 0 for X in df['label']]

MODEL_NAME = 'bert-base-uncased'
tokenizer = BertTokenizer.from_pretrained(MODEL_NAME)
MAX_LEN = 128

class Spotify_Dataset(torch.utils.data.Dataset):
    def __init__(self,Review,targets,tokenizer,max_len):
        self.Review = Review
        self.targets = targets
        self.tokenizer = tokenizer
        self.max_len = max_len
        
    def __len__(self):
        return len(self.Review)
    
    def __getitem__(self,idx):
        Review = str(self.Review[idx])
        target = self.targets[idx]
        
        
        encoding = self.tokenizer.encode_plus(
            Review,
            max_length=self.max_len,
            padding="max_length",
            return_attention_mask=True,
            return_token_type_ids=False,
            add_special_tokens=True,
            return_tensors='pt',
            )
        
        return {
            "Review":Review,
            "attention_mask":encoding['attention_mask'].flatten(),
            "input_ids":encoding['input_ids'].flatten(),
            "targets":torch.tensor(target,dtype=torch.long)
            }

serene scaffold Dec 20, 2024, 5:55 PM

#

lapis sequoia ```import torch import torch.nn as nn import matplotlib.pyplot as plt import num...

when you tokenize a segment of text with BERT, you get a vector where each token is one element of the vector. you can set a max length if you want all the tokens after a given length to get clipped off.

#

it prevents you from blowing out your GPU if you accidentally have an instance that's excessively long

lapis sequoia Dec 20, 2024, 5:57 PM

#

so, do a sample tokenizer.encode_plus() first to see the length of the tokens?

serene scaffold Dec 20, 2024, 5:59 PM

#

lapis sequoia so, do a sample tokenizer.encode_plus() first to see the length of the tokens?

you can get a sense for how many tokens your instances are by counting the number of whitespace spans there are in each instance (though "words" and tokens are not 1:1), and see if you have any outliers.

#

you can also replace padding='max_length' with padding='longest'. do you know what that would do?

lapis sequoia Dec 20, 2024, 6:00 PM

#

serene scaffold you can also replace `padding='max_length'` with `padding='longest'`. do you kno...

yes

lapis sequoia Dec 20, 2024, 6:17 PM

#

I am also pretty new to torch

lapis sequoia Dec 20, 2024, 6:41 PM

#

serene scaffold you can get a sense for how many tokens your instances are by counting the numbe...

I got all of it to work and it passed two epochs and actually worked, thank you!

weak oxide Dec 20, 2024, 9:11 PM

#

import numpy as np 
import matplotlib as plt
from matplotlib import pyplot as plt
import seaborn as sns
import sklearn 
from sklearn import linear_model
import plotly.express as px
print(plt.style.available)

plt.style.use('fivethirtyeight')               
df = pd.read_csv('Five Stocks.csv')    SHY= df['SHY'].values
GIS = df['GIS'].values
DENN=df['DENN'].values
SWPPX=df['SWPPX'].values
SHEL=df['SHEL'].values                
import plotly.express as px
px.scatter(SHY,GIS)```

#

Im unsure why the plotly express doesnt work even after I used pip install plotly

#

as for the csv all I did was take 5 random stocks and made a csv

serene scaffold Dec 20, 2024, 10:25 PM

#

weak oxide ```import pandas as pd import numpy as np import matplotlib as plt from matplot...

you should delete every line where you do x = df['x'].values.
try doing px.scatter(df, x='SHY', y='GIS')

serene scaffold Dec 20, 2024, 10:29 PM

#

weak oxide Im unsure why the plotly express doesnt work even after I used pip install plotl...

be sure to never say that something "doesn't work". that could mean basically anything. you always have to say what you expected to happen, and what actually happened.

weak oxide Dec 20, 2024, 10:30 PM

#

How you guys do the code feature on Discord

serene scaffold Dec 20, 2024, 10:30 PM

#

!code

arctic wedgeBOT Dec 20, 2024, 10:30 PM

#

Formatting code on Discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

weak oxide Dec 20, 2024, 10:30 PM

#

serene scaffold be sure to never say that something "doesn't work". that could mean basically an...

Ok so I expected a plotly graph after using "'px"' function

#

px

#

Oh

serene scaffold Dec 20, 2024, 10:31 PM

#

px is just the module. you have to call px.scatter, which you did

#

fig = px.scatter(df, x='SHY', y='GIS')
fig.show()

see if that works. it would cause a browser tab to open.

weak oxide Dec 20, 2024, 10:32 PM

#

Yeah I'll do that when I get home

serene scaffold Dec 20, 2024, 10:33 PM

#

weak oxide Yeah I'll do that when I get home

sure. to make sure that everyone's time is used effectively, it's best to ask when you're available to actively try things that people suggest.

lapis sequoia Dec 20, 2024, 10:33 PM

#

are people using px more now instead of seaborn and matplotlib

left tartan Dec 20, 2024, 10:38 PM

#

lapis sequoia are people using px more now instead of seaborn and matplotlib

I use plotly for nearly everything

serene scaffold Dec 20, 2024, 10:42 PM

#

lapis sequoia are people using px more now instead of seaborn and matplotlib

matplotlib has the least intuitive API of all time and seaborn is just a wrapper for matplotlib.

serene scaffold Dec 21, 2024, 12:17 AM

#

This isn't related to data science or AI. I think you should look for mental health resources to help you work on these feelings. I would hate for someone on this server to say something to you that would make it worse.

weak oxide Dec 21, 2024, 2:41 AM

#

Sorry for getting late but it worked

weak oxide Dec 21, 2024, 2:42 AM

#

serene scaffold matplotlib has the least intuitive API of all time and seaborn is just a wrapper...

I thought plt.title at least was intuitive

serene scaffold Dec 21, 2024, 2:42 AM

#

No. It's actually illegal for matplotlib to be intuitive.
It's in the bylaws.

#

But I'm glad you got it to work!

thorny geode Dec 21, 2024, 4:00 AM

#

hi, beginner here, does smaller data set cause predictions to capture more noise

serene scaffold Dec 21, 2024, 4:19 AM

#

thorny geode hi, beginner here, does smaller data set cause predictions to capture more noise

can you explain what you mean by "capture noise"?

thorny geode Dec 21, 2024, 4:37 AM

#

serene scaffold can you explain what you mean by "capture noise"?

yes, noise as the randomness of the data that causes irreducible error, i learned the term from Introduction to Statistical Learning Python

serene scaffold Dec 21, 2024, 4:39 AM

#

thorny geode yes, noise as the randomness of the data that causes irreducible error, i learne...

the data that you have available is almost always a (very small) subset of all the actual data.
like if you have data about home sales in the US, you probably don't have data about every single home sale in the US.
and the smaller your dataset, the less representative it might be of the true, complete data.

thorny geode Dec 21, 2024, 8:39 AM

#

serene scaffold the data that you have available is almost always a (very small) subset of all t...

so the smaller the dataset, the error represents the data variation larger, is that true?

deep veldt Dec 21, 2024, 10:24 AM

#

when building a siamese network, do you need fc layers? if yes how many num_classes should it output?

late igloo Dec 21, 2024, 11:34 AM

#

How to learn data analysis fast

#data-science-and-ml

Hi, could anyone please check my understanding on this:

Athena vs Redshift Athena can't run SQL queries on the data that is present inside the warehouse(For ex: Redshift). Athena works only on the data present inside S3 bucket.

If you want to correct/add new thing, pls.

Note: I am not literally comparing the above services. They are there for different purpose. My intention is to compare certain aspects which are command between services.

This tells the AI to Speak

This states how to process audio

Hi,
could anyone please check my understanding on this:

Athena vs Redshift
Athena can't run SQL queries on the data that is present inside the warehouse(For ex: Redshift).
Athena works only on the data present inside S3 bucket.