severe inlet Apr 23, 2024, 6:06 AM

#

im intending to try out a ML project with a commodity prices dataset. im thinking of doing 2 kinds of predictions: next day prediction and 1 week prediction.

may i ask for some advice on how should i proceed? can i start off with simple linear regression model, work towards other variants, maybe LSTM at the end?

#

im not sure how to "start" on the project

river cape Apr 23, 2024, 6:25 AM

#

Isn't Logistic Regression a linear classifier?

#

So do we need feature scaling while dealing with that model?

restive wave Apr 23, 2024, 6:29 AM

#

river cape Isn't Logistic Regression a linear classifier?

Yes

restive wave Apr 23, 2024, 6:30 AM

#

river cape So do we need feature scaling while dealing with that model?

"Feature scaling" is all it does

river cape Apr 23, 2024, 6:31 AM

#

restive wave "Feature scaling" is all it does

So for any linear model we dont have to necessarily apply feature scaling?

#

Because the co-efficients take care of that ?

lofty thorn Apr 23, 2024, 6:42 AM

#

is there any affect on the mean when the data is positively or negatively skewed??

lofty thorn Apr 23, 2024, 6:57 AM

#

nvm

tired lodge Apr 23, 2024, 8:06 AM

#

how would i go about evaluating the position of a connect 4 board?

#

i have a bunch of test cases and their resulting evaluation but i don't know how to go forward from there

#

._.

#

this is their file contents

nimble stag Apr 23, 2024, 8:12 AM

#

If it’s oo, you could make a class for every different level of connectedness (like two pieces, three pieces, etc.) then instantiate it whenever it actually occurs and continue checking for pieces to add from the new instantiations

tired lodge Apr 23, 2024, 8:12 AM

#

nimble stag If it’s oo, you could make a class for every different level of connectedness (l...

wdym "oo"

nimble stag Apr 23, 2024, 8:12 AM

#

Object oriented

tired lodge Apr 23, 2024, 8:15 AM

#

nimble stag Object oriented

yeah im using python which supports OOP. could you explain further what your suggestion is?

nimble stag Apr 23, 2024, 8:16 AM

#

I think ur trying to evaluate if there are chains of pieces that are the same colour, correct me if I’m wrong

tired lodge Apr 23, 2024, 8:16 AM

#

yeah i'd say so

marble spindle Apr 23, 2024, 8:16 AM

#

Anyone know any libraries which would fetch the written text and coverts into pdf formate

tired lodge Apr 23, 2024, 8:16 AM

#

im thinking of a more, manual? approach to this, where i compute each of the 6000 test cases, log their results and thats my tablebase to work off of 😭

tired lodge Apr 23, 2024, 8:16 AM

#

marble spindle Anyone know any libraries which would fetch the written text and coverts into pd...

google_it

#

i remember seeing something online

marble spindle Apr 23, 2024, 8:17 AM

#

tired lodge <:google_it:744347275953700954>

Nah I have succeeded 60per with it, but how can I train model for better results

nimble stag Apr 23, 2024, 8:17 AM

#

So when ur evaluating, for every piece check its neighbours, and if it has a matching colour neighbour instantiate a class called TwoPieces (for example). Then that object will check its neighbours and, finding a matching colour piece, would instantiate ThreePiece (and so on)

marble spindle Apr 23, 2024, 8:18 AM

#

marble spindle Nah I have succeeded 60per with it, but how can I train model for better results

Any platform?

tired lodge Apr 23, 2024, 8:22 AM

#

nimble stag So when ur evaluating, for every piece check its neighbours, and if it has a mat...

ok so ```py
array = [...] # some 2D array of board pieces

def check_pieces():
for i, row in enumerate(array):
for j, cell in enumerate(line):
another_check_func() # this returns a dict of all directions with True or False attached to them, so the instance of the class knows what direction to look

#

how do i get an eval score after that though?

marble spindle Apr 23, 2024, 8:26 AM

#

Anyone with this solution, where there is a paper which is written manually, now im working on a stuff where i need to fetch that written text and check the spelling whether its 'a' or o the model should be trained in such a way where it can fetch those stuff checks the grammar correction and convert those to pdf formate, which libary whould be good for all these any suggestion?

jaunty helm Apr 23, 2024, 8:35 AM

#

I wanted to plot a correlation heatmap in polars, this is what I have:

import polars as pl

df: pl.DataFrame
df = df.corr().select(pl.all().abs())
plt = df.plot.heatmap(height=600, rot=75, yticks=[(i, c) for i, c in enumerate(df.columns)])
display(plt)
```the above snippet yields the below image. 
the problem is that when I hover over a cell, it shows an `index`, where I'd like it to be the actual label (in the case of the image, it should be `GrLivArea`)
any way to modify the code to get the desired effect? (ideally, not turning it into a `pandas.DataFrame`)

#

tired lodge Apr 23, 2024, 8:51 AM

#

nimble stag I think ur trying to evaluate if there are chains of pieces that are the same co...

do i have to check twice for each board position because there's two counters, player 1 and player 2?

#

since with two players, there can be two types of chains

nimble stag Apr 23, 2024, 9:31 AM

#

Implementation details are up to you

#

I would suggest checking the board once and checking certain positions multiple times (if double or triple piece found)

nimble stag Apr 23, 2024, 9:34 AM

#

tired lodge ok so ```py array = [...] # some 2D array of board pieces def check_pieces(): ...

In this case, you would check the positions around you to see if they’re True when you are True and same for false

#

And if nothing more is found you could return the size of the biggest object in that position (assuming oo)

tired lodge Apr 23, 2024, 10:14 AM

#

nimble stag And if nothing more is found you could return the size of the biggest object in ...

so i have this ```py
class OnePiece: # The One Piece is real!!
def init(self, square_indexes: Tuple[int, int], direction: str) -> None:
self.y, self.x = square_indexes
self.change_in_y, self.change_in_x = directions[direction]

    self.next_x = self.x + self.change_in_x
    self.next_y = self.y + self.change_in_y

    try:
        neighbour = array[self.y][self.x]
    except IndexError:
        return False
    
    if neighbour == array[self.next_x][self.next_y]:
        return TwoPiece()

    return False

#

it takes in square_indexes which is like [i][j] ➡️ (i, j)

#

and then a direction as a str from this dict py directions = { 'up': (-1, 0), 'down': (1, 0), 'left': (0, -1), 'right': (0, 1) }

#

it then checks if you can go in that direction and if you can't, just return False

#

if you can go in that direction, call TwoPiece() and return whatever happens in it

#

oh wait, __init__ by default returns None pithink

#

using __new__ works though

river cape Apr 23, 2024, 11:10 AM

#

A linear SVR will have its kernel=linear and a non-linear svm will have its kernel = rbf?

warm trellis Apr 23, 2024, 12:01 PM

#

hey!
I'm trying to learn how to implement models from papers. Is anyone out there willing to mentor me?

tired otter Apr 23, 2024, 12:08 PM

#

in my half-educated opinion, you should learn how to implement forward method for simple nn models in pytorch/tf and let autograd do its magic

gritty vessel Apr 23, 2024, 12:09 PM

#

Yes

sick eagle Apr 23, 2024, 12:09 PM

#

tired lodge so i have this ```py class OnePiece: # The One Piece is real!! def __init__(...

that level is from Master pithink

tired lodge Apr 23, 2024, 12:10 PM

#

sick eagle that level is from Master<:pithink:652247559909277706>

??

sick eagle Apr 23, 2024, 12:10 PM

#

nothing

tired lodge Apr 23, 2024, 12:11 PM

#

sick eagle nothing

cheers mate 🍻

warm trellis Apr 23, 2024, 12:12 PM

#

I'm trying to implement paper "DEEP NON-PARAMETRIC TIME SERIES FORECASTER".. This is so far what I've built though, I'm not sure how this model can do the predictions..

    def __init__(self, h: int, input_size: int,
                 num_hidden_layers: int = 4,
                 hidden_size: int = 24
                ):
        super().__init__()
        self.input_size = input_size
        self.linear_stack = [nn.Linear(in_features=input_size, out_features=hidden_size)]
        self.linear_stack += [nn.Linear(in_features=hidden_size, out_features=hidden_size) for i in range(num_hidden_layers-1)]
        self.final_layer = nn.Linear(hidden_size, input_size)
        self.softmax = nn.Softmax()
    
    def forward(self, z):
        for linear_layer in self.linear_stack:
            z = linear_layer(z)
        sampling_probabilities = self.softmax(self.final_layer(z))
        return sampling_probabilities

sick eagle Apr 23, 2024, 12:12 PM

#

tired lodge cheers mate 🍻

🤝

tired lodge Apr 23, 2024, 12:12 PM

#

sick eagle 🤝

how do i evaluate a board position tho

#

i have some table bases, idk if i should calibrate an algorithm using those

sick eagle Apr 23, 2024, 12:14 PM

#

tired lodge how do i evaluate a board position tho

idk, i don't learn ML but i still learn some librairys in python for learn ML, (sry for my weak english)

tired lodge Apr 23, 2024, 12:15 PM

#

sick eagle idk, i don't learn ML but i still learn some librairys in python for learn ML, (...

👌

lapis sequoia Apr 23, 2024, 12:53 PM

#

anyone has done LLMs evaluation before ? any resources to read ? I need to do evaluation for an assistant and am not sure how !! anything that woulld help me build evaluation process !

#

any ideaa would help pithink

agile cobalt Apr 23, 2024, 1:01 PM

#

@lapis sequoia which sort of assistant? there are a few common metrics and monitoring tools you can use depending on the task, but they're not perfect nor make sense for all cases

cinder schooner Apr 23, 2024, 1:04 PM

#

Hello, i'm trying to implement RepeatedAugmentation for a computer vision project i'm working on and every code sample or example i find on the internet is pairing it with distributed learning. So i thought maybe i misunderstood the concept and maybe i need distributed learning to use RepeatedAugmentation. So my question is: can i use RepeatedAugmentation without distribution and if yes how?

lapis sequoia Apr 23, 2024, 1:16 PM

#

agile cobalt <@456226577798135808> which sort of assistant? there are a few common metrics an...

so I'm working on an assistant that does some ml tasks , the user would provide a dataset and ask questions ( like what's the trend ..) the user doesn't have to be technical or expert in ML , so the assistant should figure out which task based on user request , so I want to evaluate if the assistant is able to understand user's intent

agile cobalt Apr 23, 2024, 1:24 PM

#

you can try creating a compilation of a few prompt - dataset - expected final result combinations and just testing if it works, but overall I would strongly recommend against asking models about things you do not understand yourself if you have no intent or means of verifying if its output is correct or not, and even more so against products that explicitly encourage that practice

#

Even if you got 95% accuracy, the damage that those 5% wrong results could case if your end user is not perfectly aware of the model's limitations is tremendous, and models are not anywhere near reliable enough to expect 100% accuracy yet on an uncontrolled environment

dull radish Apr 23, 2024, 2:51 PM

#

Hello, so I wanna make an AI based tool which can convert let's say VB6 to VB.net or C# or in general can convert these older languages into newer ones and add documentation etc. I'm kinfa new to AI so if someone can tell me how exactly I can go abt this that'll be great thank you

warm trellis Apr 23, 2024, 2:56 PM

#

I am trying to implement the DeepNPTS model, but I'm confused a little bit, especially on how the model will learn part.. Since the model outputs probabilities, but the observation is a real value.. They have described to use Loss: Ranked Probability Score for the loss function, but I'm a little bit lost on this part, how model will learn from probability distribution ?
#data-science-and-ml message here is my draft for the model, I'm not sure if it's correct though.

#

lapis sequoia Apr 23, 2024, 3:05 PM

#

agile cobalt Even if you got 95% accuracy, the damage that those 5% wrong results could case ...

yes true, currently that's still a long term goal "to make it perfect" and usefull for non-technical users, the idea is just to create a more user-friendly approach for autoML ( if that makes sense) .. still researching this tho pithink

thorny zealot Apr 23, 2024, 3:06 PM

#

the squeeze parameter was deleted from pandas, what should I do ?

spring field Apr 23, 2024, 3:26 PM

#

thorny zealot the squeeze parameter was deleted from pandas, what should I do ?

wdym deleted? I don't see anything about it in the docs
https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.squeeze.html#pandas.DataFrame.squeeze

#

wait, parameter? parameter for what?

thorny zealot Apr 23, 2024, 3:27 PM

#

read_csv

spring field Apr 23, 2024, 3:28 PM

#

spring field Apr 23, 2024, 3:29 PM

#

spring field wdym deleted? I don't see anything about it in the docs <https://pandas.pydata.o...

you should use this method instead

river cape Apr 23, 2024, 3:42 PM

#

Do we need feature scaling for polynomial regression?

river cape Apr 23, 2024, 3:59 PM

#

Main question when do we use feature scaling?

craggy agate Apr 23, 2024, 4:06 PM

#

My model gets around a 65% val_accuracy what do I do to increase it? I feel its not reliable and when I give it actual images it get like it doesn't get up to the 65 mark of validation accuracy. Its an expression identifier model btw. I have 5 classes.

#

Here is the code:
`
train_datagen = ImageDataGenerator(
rescale=1./255,
rotation_range=30,
shear_range=0.3,
zoom_range=0.3,
width_shift_range=0.4,
height_shift_range=0.4,
horizontal_flip=True,
brightness_range=[0.8, 1.2],
fill_mode='nearest')

training_set = train_datagen.flow_from_directory(
'C:\Users\yatha\OneDrive\Desktop\CNN Expression identifier\Train',
target_size =(128, 128),
batch_size = 48,
classes = ['Anger', 'Fear', 'Happy', 'Sad', 'Surprise'],
class_mode = 'categorical',
shuffle=True,
)

test_datagen = ImageDataGenerator(rescale=1./255)

test_set = test_datagen.flow_from_directory(
'C:\Users\yatha\OneDrive\Desktop\CNN Expression identifier\Test',
target_size =(128, 128),
batch_size = 48,
classes = ['Anger', 'Fear', 'Happy', 'Sad', 'Surprise'],
class_mode = 'categorical',
shuffle=True,
)
cnn = tf.keras.models.Sequential()
cnn.add(tf.keras.layers.Conv2D(
filters=16,
kernel_size=3,
activation='relu',
input_shape=[128, 128, 3]
))
cnn.add(tf.keras.layers.MaxPool2D(pool_size=2, strides=2))
cnn.add(tf.keras.layers.Conv2D(
filters=16,
kernel_size=3,
activation='relu',
input_shape=[128, 128, 3]
))
cnn.add(tf.keras.layers.MaxPool2D(pool_size=2, strides=2))
cnn.add(tf.keras.layers.Conv2D(
filters=16,
kernel_size=3,
activation='relu'
))
cnn.add(tf.keras.layers.MaxPool2D(pool_size=2, strides=2))
cnn.add(tf.keras.layers.Flatten())
cnn.add(tf.keras.layers.Dense(units = 512, activation = 'relu'))
cnn.add(tf.keras.layers.Dense(units = 512, activation = 'relu'))
cnn.add(tf.keras.layers.Dense(units = 512, activation = 'relu'))
`

spring field Apr 23, 2024, 4:23 PM

#

craggy agate Here is the code: ` train_datagen = ImageDataGenerator( rescale=1./255, ...

if this is supposed to be a densenet then I'm slightly confused why you have all the transition layers and then all the denseblocks after all of those? you'd rather have the initial convolution, then repeat this like 3 times: (a denseblock, then a transition layer)
then go through the linear layer and then use softmax (which ig is built-in to the training set?)

vale parcel Apr 23, 2024, 4:31 PM

#

I'm getting an error. I pip installed keras-rl2 but whatever code I use it on I get this error:

Traceback (most recent call last):
File "/Users/srikanthvattikuti/Downloads/keras-rl-master/examples/dqn_atari.py", line 13, in <module>
from rl.agents.dqn import DQNAgent
File "/Library/Frameworks/Python.framework/Versions/3.12/lib/python3.12/site-packages/rl/agents/init.py", line 1, in <module>
from .dqn import DQNAgent, NAFAgent, ContinuousDQNAgent
File "/Library/Frameworks/Python.framework/Versions/3.12/lib/python3.12/site-packages/rl/agents/dqn.py", line 7, in <module>
from rl.core import Agent
File "/Library/Frameworks/Python.framework/Versions/3.12/lib/python3.12/site-packages/rl/core.py", line 7, in <module>
from rl.callbacks import (
File "/Library/Frameworks/Python.framework/Versions/3.12/lib/python3.12/site-packages/rl/callbacks.py", line 8, in <module>
from tensorflow.keras import version as KERAS_VERSION
ImportError: cannot import name 'version' from 'tensorflow.keras' (/Library/Frameworks/Python.framework/Versions/3.12/lib/python3.12/site-packages/keras/_tf_keras/keras/init.py). Did you mean: 'cxx_version'?

Someone pelase help!! I'm using MacOS Sonoma btw.

craggy agate Apr 23, 2024, 4:53 PM

#

spring field if this is supposed to be a densenet then I'm slightly confused why you have all...

Thanks, I fixed that now just gotta train it 🙂

craggy agate Apr 23, 2024, 5:26 PM

#

spring field if this is supposed to be a densenet then I'm slightly confused why you have all...

For some reason that capped out my val accuracy to 0.27

spring field Apr 23, 2024, 5:30 PM

#

oop

#

also I think in the paper they were using an averagepool instead of a maxpool

#

but how do your layers look now?

craggy agate Apr 23, 2024, 5:32 PM

#

spring field but how do your layers look now?

`# Initial Convolution Layer
cnn.add(Conv2D(filters=16, kernel_size=3, activation='relu', input_shape=[128, 128, 3]))

Dense Block 1

for _ in range(3):
cnn.add(Conv2D(filters=16, kernel_size=3, activation='relu', padding='same'))
cnn.add(MaxPool2D(pool_size=2, strides=2))

Transition Layer 1

cnn.add(Conv2D(filters=16, kernel_size=1, activation='relu'))

Dense Block 2

for _ in range(3):
cnn.add(Conv2D(filters=16, kernel_size=3, activation='relu', padding='same'))
cnn.add(MaxPool2D(pool_size=2, strides=2))

Transition Layer 2

cnn.add(Conv2D(filters=16, kernel_size=1, activation='relu'))

Dense Block 3

for _ in range(3):
cnn.add(Conv2D(filters=16, kernel_size=3, activation='relu', padding='same'))
cnn.add(MaxPool2D(pool_size=2, strides=2))

Flatten Layer

cnn.add(Flatten())

Fully Connected Layers

cnn.add(Dense(units=512, activation='relu'))
cnn.add(Dense(units=512, activation='relu'))

Output Layer

cnn.add(Dense(units=5, activation='softmax'))
`

spring field Apr 23, 2024, 5:34 PM

#

wait, Dense is not a DenseBlock? sigh

#

it's just a linear layer?

spring field Apr 23, 2024, 5:38 PM

#

craggy agate Here is the code: ` train_datagen = ImageDataGenerator( rescale=1./255, ...

cuz then this makes way more sense 😁
though it could maybe do with some batchnorms after the convolutions

craggy agate Apr 23, 2024, 6:42 PM

#

spring field cuz then this makes way more sense 😁 though it could maybe do with some batchn...

I see

fast knoll Apr 23, 2024, 8:06 PM

#

Hey anyone help me for my interview of pandas

past kite Apr 23, 2024, 8:21 PM

#

which is better for starting out in ML. Jupyter Notebook or Google collab

spring field Apr 23, 2024, 8:34 PM

#

Google collab uses jupyter notebooks

fast knoll Apr 23, 2024, 8:34 PM

#

past kite which is better for starting out in ML. Jupyter Notebook or Google collab

Jupyter notebook

spring field Apr 23, 2024, 8:35 PM

#

the question honestly doesn't really make sense to be fair

#

like, you can use collab to run your code on a GPU if you don't have one yourself

#

but you can write the code wherever you find it more comfortable

past kite Apr 23, 2024, 8:36 PM

#

k thanks

fast knoll Apr 23, 2024, 8:37 PM

#

Hey can you tell where can I prepare for my interview of pandas

#

Anyone

#

Help me guys

spring field Apr 23, 2024, 8:39 PM

#

may I suggest writing a small or a couple of small or actually not necessarily small ones, could be something bigger, but projects, essentially the point would be to practice here to get better at the thing you would be practicing, in this case pandas

real whale Apr 23, 2024, 9:01 PM

#

hello whats like a really easy and simple replace function good for column in a data frame, like so if there were many different ways NAN, nan, null, 0, ,n/a had been inputted and I just wanted the one value? Thanks

#

I get I could find out this info very easily but if someone is the sort of person to copy and paste something they have open then I'td make it easier adn more memorable

serene scaffold Apr 23, 2024, 9:18 PM

#

real whale hello whats like a really easy and simple replace function good for column in a ...

so you have a bunch of different strings that represent nan? or do you have a mix proper null values like np.nan, None, and pd.NA, and you want to convert them all to the same one?

real whale Apr 23, 2024, 9:46 PM

#

serene scaffold so you have a bunch of different *strings* that *represent* nan? or do you have ...

In this instance it's string values.

serene scaffold Apr 23, 2024, 9:47 PM

#

real whale In this instance it's string values.

the best solution depends on how you created the dataframe. did you use pd.read_csv, and were those "nan strings" already there when you loaded the df?

real whale Apr 23, 2024, 9:49 PM

#

serene scaffold the best solution depends on how you created the dataframe. did you use `pd.read...

aye pd.read_csv and yes there included as per our coursework to allow us to practice data-cleaning.

#

is it something like that?

serene scaffold Apr 23, 2024, 9:51 PM

#

yes, that should do it.

real whale Apr 23, 2024, 9:51 PM

#

as step one

serene scaffold Apr 23, 2024, 9:52 PM

#

I mean that should be the whole step. if there are any nan values that that list doesn't catch, you should add them to that list.

real whale Apr 23, 2024, 9:52 PM

#

are there any intracescies you could give me a heads up about with regard to it?

serene scaffold Apr 23, 2024, 9:52 PM

#

no? that just tells the pandas csv parser to treat those as nans when it reads the CSV. One and done.

real whale Apr 23, 2024, 9:53 PM

#

oh ok, if I wanted to replace all those null values with one type to make the measier to reference should I want to remove them?

serene scaffold Apr 23, 2024, 9:54 PM

#

because you set na_values as that list, any time those substrings appear in the CSV, they will be replaced with one kind of nan value.

real whale Apr 23, 2024, 9:54 PM

#

ok fantastic that rounds everything up

oblique mirage Apr 23, 2024, 10:44 PM

#

Hi guys, could someone tell me the 5 best libraries for creating graphics in python? I'm creating a machine learning model warning warning warning

serene scaffold Apr 23, 2024, 10:53 PM

#

oblique mirage Hi guys, could someone tell me the 5 best libraries for creating graphics in pyt...

I don't see the connection with ML and graphics.

oblique mirage Apr 23, 2024, 10:56 PM

#

serene scaffold I don't see the connection with ML and graphics.

It would be the projection of machine learning. Using the data (Y axis) and the index (X axis)

serene scaffold Apr 23, 2024, 10:57 PM

#

oblique mirage It would be the projection of machine learning. Using the data (Y axis) and the ...

Oh, so you're asking about data visualizations. You'd use matplotlib.

oblique mirage Apr 23, 2024, 11:17 PM

#

serene scaffold Oh, so you're asking about data visualizations. You'd use matplotlib.

ok, thanks

little arrow Apr 23, 2024, 11:33 PM

#

what methods of feature reduction are there, that keep the feature names? ive tried pca but unfortunately i cant use it to find which features impact my results the most

real whale Apr 23, 2024, 11:56 PM

#

Ok I've run into an error that I think stems from me not understanding the function "duh"

lapis sequoia Apr 24, 2024, 1:48 AM

#

hi;total ML n00b, very interesting stuff tho. i was just wondering to myself. i wonder how much the computers cost to run claude opus. i was looking at hiring costs the other day and i was staggered. they are buying nvidia a100s and hiring them out for 35k a year each! i understand opus is distributed obviously but i was wondering what a really good model, what it takes to run it or what hardware i should get to start being able to do something decent. ive seen rtx20xx listed here and i quite like this look of this, anyone used it ? https://docs.vllm.ai/en/latest/getting_started/installation.html

i woud like to start learning about machine learning, so imma sit here and try and take in some of the unintelligible lingo you guys talk 🙂 ive had pytorch out and got some models going, been on huggingface and messed about with transformers. i have no clue tho, i got some BERT thing going, i tried ollama and then i tried deepseekcoder and my computer shat the bed. the github one, not the ollama one. i dont know why but ollama has the same thing in it and its fast as hell. anyway i have a terrible GPU 1080(8gig) and im just trying to get started. hi everyone.

lapis sequoia Apr 24, 2024, 2:24 AM

#

dull radish Hello, so I wanna make an AI based tool which can convert let's say VB6 to VB.ne...

i am new too, but i can tell you i just updated a program from python 2 to 3 with claude and it did the whole thing in one go perfectly. was pretty impressed. i literally didnt touch the code to update it 🙂 just pasting. ive been poking these things for a while, including looking at some of the stuff on github like sweep ai and was inspired by autodev to plan a new multiple agent idea. thats one of my projects but ive got another main one for ML, thats just comparatively easy trading stuff. what i can say with confidence is that GPT is absolutely awful at code and claude is insanely good. GPT cant remember anything really and claude remembers everything - the dynamic between them changed overnight almost. ai wars going on...

dull radish Apr 24, 2024, 2:42 AM

#

lapis sequoia i am new too, but i can tell you i just updated a program from python 2 to 3 wit...

Ayay I planned on using a pretrained model and fine tune it with the particular languages I want, but ye python 2 to 3 is an easier conversion since they're the same language than something like vb6 to C#

#

I'll definitely give claude a try tho ty for the info

lapis sequoia Apr 24, 2024, 2:47 AM

#

dull radish Ayay I planned on using a pretrained model and fine tune it with the particular ...

yeah its not going to you know do everything but it was a pretty good stab for first attempt. not bad. havent tried to do porting yet, i was going to try python to rust tho 🙂 claude sonnet is ok but opus is the daddy, instead of reading the manuals for stuff i upload the whole program into it and just ask it what i need to know rather than going through every bit. incredibly helpful. i just made a CLI and after i gave it one function it wrote all of the rest of them almost right first time, had to correct few things but yeah it floored me. ive thrown the same things at it and GPT a lot of times and the differences are really interesting, ive been doing gemini as well at the same time. gemini doesnt even know what language you are writing code in 😄 give GPT one or two decent length pastes and its forgotten all code you uploaded. so that requires more focussed stuff. it does take a lot more files easily than mr claude tho 🙂 good luck with it

lapis sequoia Apr 24, 2024, 3:00 AM

#

dull radish I'll definitely give claude a try tho ty for the info

yeh also interested in using models, but the overhead is insane, the electric cost, the cost of the GPU, wow. after using ollama i thought i could run this stuff at home, but that was before i tried anything really meaty. i can see why i pay for it now 🙂 lulz

marble spindle Apr 24, 2024, 3:21 AM

#

Anyone can recommend any similar site like hugging face? Where I can find pre models for testing?

serene scaffold Apr 24, 2024, 3:23 AM

#

marble spindle Anyone can recommend any similar site like hugging face? Where I can find pre mo...

what do you want that you're not finding on hugging face? because hugging face kind of is the place.

marble spindle Apr 24, 2024, 3:31 AM

#

serene scaffold what do you want that you're not finding on hugging face? because hugging face k...

I found a model on Hugging Face and I'm wondering if there's another place like it where I can combine both models.

serene scaffold Apr 24, 2024, 3:38 AM

#

marble spindle I found a model on Hugging Face and I'm wondering if there's another place like ...

if you want to combine two models in some way, it doesn't matter whether they came from the same website or not.

shut girder Apr 24, 2024, 4:37 AM

#

What resources should I use when studying calculus for machine learning? I have not taken a high school calculus class yet, but I want to work my way to being able to understand and apply linear regression as my first project

past meteor Apr 24, 2024, 5:55 AM

#

shut girder What resources should I use when studying calculus for machine learning? I have ...

Sidenote: To use linear regression and interpret the results you don't need to know calculus or linear algebra. Many research focused social science / medicine programs teach all of this but not calculus / lin alg. Even stronger, many of the researchers themselves don't know these but do know how to correctly apply a regression. I'm mentioning this because it shouldn't have to pause your project's progress unless you're really interested in the math.

To answer your question, considering you've not covered them yet in high school I just recommend you pick up a standard textbook and go through that then.

iron basalt Apr 24, 2024, 6:18 AM

#

shut girder What resources should I use when studying calculus for machine learning? I have ...

As an informal introduction to calculus you may like the book: Calculus Made Easy by Silvanus P. Thompson. It's an old book (1910), not really a modern approach, but you might find it useful. It's a... vibe...

lapis sequoia Apr 24, 2024, 6:33 AM

#

Hello, I am creating 2 layer nn that learning xor and it's not learning and I dont really know what to do but i think something is worng with my teaching process

#

    def teach(self, X, Y, iters, learning_rate):
        for _ in range(iters):
            index = random.randint(0, len(X)-1)
            test_sample = X[index]
            test_target = Y[index]
            Z1, Y1, Z2, Y2 = self.forward(test_sample)

            Y2_error = test_target - Y2
            Y2_delta = np.dot(Y1, Y2_error[0]) * self.sigmoid_derivative(Y2)

            Y1_error = np.dot(Y2_delta, self.W2)
            Y1_delta = np.dot(np.reshape(test_sample, (2, 1)), Y1_error[0]) * \
                self.sigmoid_derivative(Y1)

            self.W2 += Y2_error * learning_rate
            self.W1 += Y1_delta * learning_rate
            self.W2b += Y2_error * learning_rate
            self.W1b += Y1_error * learning_rate

    def predict(self, X):
        Z1, Y1, Z2, Y2 = self.forward(X)
        print(Y2)

#

n.predict([1, 1])
n.predict([0, 0])
n.predict([1, 0])
n.predict([0, 1])

#

[0.49372465]
[0.4838573]
[0.48444335]
[0.49253533]

#

I think it can be related to couting error on 1st layer

#

Thanks for any help

tired otter Apr 24, 2024, 7:17 AM

#

In a simple GPT model (Karpathy's nanoGPT for ref), do i understand correctly, the only reason to aggregate every token's (past) neighbors is to increase number of subsamples + teach model to work with data of shorter lengths? So, in theory, we could have only aggregated last token and made a prediction based on that?

agile jackal Apr 24, 2024, 7:47 AM

#

has anybody tried llama3-8B-instruct-KS?
is that even the fastest version

I'm trying to use it with gpt4all
but the bot is way off topic

toxic mortar Apr 24, 2024, 8:24 AM

#

If I have a bunch of these sector categories which I want to feed my neural network with, is it better to categorize them in [0-20] categories, each category with an ID , or I should make binary input fields of every one of the sectors? For example Biotech: Biomedical/Gene : 0|1

tired otter Apr 24, 2024, 8:30 AM

#

usually for classification data is onehot vector encoded

toxic mortar Apr 24, 2024, 8:30 AM

#

tired otter usually for classification data is onehot vector encoded

How about when I have >40 categories for input?

#

Aaa mb you ment vector, not the categorical encoding

#

Sorry I got it

tired otter Apr 24, 2024, 8:31 AM

#

linear algebra does not care how many dimensions you have. i see it as separating/clustering data in higher dimension. but im not a pro at this

tired otter Apr 24, 2024, 8:32 AM

#

toxic mortar Aaa mb you ment vector, not the categorical encoding

https://en.wikipedia.org/wiki/One-hot

#

never used categorical encoding. but i think if its like [1,2,3,4,..], its possible say that some entry is 3.5 = like 3 but also like 4, but you wont find entry thats like 3 and 7 xD

toxic mortar Apr 24, 2024, 8:38 AM

#

tired otter never used categorical encoding. but i think if its like [1,2,3,4,..], its possi...

In real world example, with one-hot encoding, should I store every in column 0|1, or I should have one column where I would store a vector in?

toxic mortar Apr 24, 2024, 8:39 AM

#

tired otter never used categorical encoding. but i think if its like [1,2,3,4,..], its possi...

Ye u right, id lose information due to assumption of an ordinal relationship between categories, which in this case simply is not true

tired otter Apr 24, 2024, 8:40 AM

#

you had a list of categories, ordered by input data order, you represent it encoded format where each entry is a row vector. it is a matrix (N_data, N_categories)

boreal gale Apr 24, 2024, 8:44 AM

#

one hot encoding is definitely one way of making use of this field of categories.
just beware of "curse of dimensionality".
you don't seem to have a lot of data to work with, so i would - on top of just a normal one hot encode - also investigate if there are natural grouping of categories, e.g. all software instead of just one specific software category, to slightly reduce the number of columns you are adding to your dataset.

agile jackal Apr 24, 2024, 8:46 AM

#

#data-science-and-ml message
bump?

calm umbra Apr 24, 2024, 11:17 AM

#

hello, anyone how to get or print the C3 node

bold timber Apr 24, 2024, 11:18 AM

#

Below are two different techniques for implementing code when running a model using PyTorch in training mode:

'CODE 1'
torch.manual_seed(42)

# Set the number of epochs (how many times the model will pass over the training data)
epochs = 100

# Create empty loss lists to track values
train_loss_values = []
test_loss_values = []
epoch_count = []

# 0.Loop through the data
for epoch in range(epochs):
    
    #TRAINING MODE
    # Put model in training mode (this is the default state of a model)
    model_0.train()

    # 1. Forward pass on train data using the forward() method inside 
    y_pred = model_0(X_train)
    # print(y_pred)

    # 2. Calculate the loss (how different are our models predictions to the ground truth)
    loss = loss_fn(y_pred, y_train)

    # 3. Zero grad of the optimizer
    optimizer.zero_grad()

    # 4. Loss backwards
    loss.backward()

    # 5. Progress the optimizer
    optimizer.step()
            

'CODE 2'
epochs = 100
train_cost = []


for i in range (epochs):
      
    #TRAINING MODE
    model.train()
    cost = 0
    for feature, target in trainloader:
        output = model (feature) #feedforward
        loss = criterion(output, target) # calculate the cost
        loss.backward() #backpropagation
        
        optimizer.step() #update weight
        optimizer.zero_grad()
        
        cost += (loss.item() * feature.shape[0]) #total loss
        
    train_cost.append(cost / len(train_set))
    
    print(f'\rEpoch: {i+1:4} / {epochs:4} | train_cost: {train_cost[-1]:.4f}', end = ' ')

As we can see, there's a difference between Code 1 and Code 2 in the training mode:

Code 1: The sequence during training start with feed forward, calculating the cost, zero gradient, backpropagation, and updating weight
Code 2: The sequence during training start with feed forward, calculating the cost, backpropagation, updating wegiths, and zero gradient.

#

Based on both codes above (Code 1 & 2), which one is correct in representing the training mode phase using PyTorch?

calm umbra Apr 24, 2024, 11:22 AM

#

bold timber Below are two different techniques for implementing code when running a model us...

you should do optimizer.zero_grad() first, before backward()

bold timber Apr 24, 2024, 11:23 AM

#

calm umbra you should do optimizer.zero_grad() first, before backward()

can you give me the reason why?

calm umbra Apr 24, 2024, 11:30 AM

#

in most standard training loops, you'll want to start with a clean slate and not accumulate gradients across multiple backward passes. Therefore, you call optimizer.zero_grad() before backward() to zero out the gradients from the previous training step, ensuring that you're computing the gradients only for the current training step.

tidal bough Apr 24, 2024, 11:30 AM

#

calm umbra in most standard training loops, you'll want to start with a clean slate and not...

Does it matter here? It looks to me like zero_grad is called after step, so nothing is accumulated.

calm umbra Apr 24, 2024, 11:30 AM

#

en, yes, you are right

tidal bough Apr 24, 2024, 11:30 AM

#

i guess technically it may cause problems on the very first iteration if the model was used before the loop.

calm umbra Apr 24, 2024, 11:31 AM

#

it doesn't matter

tidal bough Apr 24, 2024, 11:33 AM

#

If I had to choose 1 or 2 I'd say 1 is more correct (it's nicer to clean the gradients right before setting them to new ones), but I'm pretty sure the second one in fact works too...

calm umbra Apr 24, 2024, 11:36 AM

#

yes, just make sure you make a step before the gradient become zero

bold timber Apr 24, 2024, 11:41 AM

#

Does that mean both of them actually can be used? @tidal bough @calm umbra

bold timber Apr 24, 2024, 11:41 AM

#

tidal bough Does it matter here? It looks to me like `zero_grad` is called after `step`, so ...

can you give me more explanation about this?

toxic mortar Apr 24, 2024, 11:42 AM

#

ValueError: pattern contains no capture groups

#

But if the pattern contains no capture group, doesnt that mean that it will return nan?

calm umbra Apr 24, 2024, 11:46 AM

#

bold timber can you give me more explanation about this?

for epoch in range(num_epochs):
for batch in dataloader:
optimizer.zero_grad() # Zero out the gradients
outputs = model(batch) # Forward pass
loss = criterion(outputs, targets) # Compute loss
loss.backward() # Compute gradients
optimizer.step() # Update weightsfor epoch in range(num_epochs):
for batch in dataloader:
outputs = model(batch) # Forward pass
loss = criterion(outputs, targets) # Compute loss
loss.backward() # Compute gradients
optimizer.step() # Update weights
optimizer.zero_grad() # Zero out the gradients

#

both type can be used, it doesn't matter

bold timber Apr 24, 2024, 11:48 AM

#

calm umbra for epoch in range(num_epochs): for batch in dataloader: optimizer.z...

Aah ok, thank you so much!

lapis sequoia Apr 24, 2024, 12:03 PM

#

shut girder What resources should I use when studying calculus for machine learning? I have ...

what resources can i avoid to learn machine learning, i just want to press a button and go really

long canopy Apr 24, 2024, 12:05 PM

#

anyone implement streaming inference? real time inference with data sent over a network. if so any suggestions for libraries or examples of implementations?

agile cobalt Apr 24, 2024, 12:12 PM

#

"real time" is just in a (potentially very small) fixed interval

you can probably find a bunch of examples of classifying things on a camera feed, but as far as libraries go, it should be just the same server you would use for real time non-ML applications + the same ML libraries you would use for normal inference

long canopy Apr 24, 2024, 12:13 PM

#

hm so kafka + pytorch?

agile cobalt Apr 24, 2024, 12:15 PM

#

could be a possibility, see under "Kafka Streams use cases" in https://kafka.apache.org/documentation/streams/

Apache Kafka

Apache Kafka: A Distributed Streaming Platform.

long canopy Apr 24, 2024, 12:16 PM

#

agile cobalt could be a possibility, see under "Kafka Streams use cases" in https://kafka.apa...

thanks a lot for the tips!

agile cobalt Apr 24, 2024, 12:16 PM

#

just be careful not to overcomplicate things

polar zinc Apr 24, 2024, 1:05 PM

#

Does anyone know an easy way to highlight data anomalies in a matplotlib graph?

marble spindle Apr 24, 2024, 1:31 PM

#

Does anyone have a model, library, or code for converting handwritten text to text/PDF? I would be so grateful for any assistance.

agile cobalt Apr 24, 2024, 1:46 PM

#

marble spindle Does anyone have a model, library, or code for converting handwritten text to te...

You could try using Tesseract or Google's Vision API

there are a bunch of other places you can look at depending on which language(s) you are working with, e.g. PaddleOCR or just browse models available on Hugging Face

#

if you're dealing with a relatively small volume of images, GPT4-Vision could also be an option worth considering, but it is relatively expensive

#

maybe try talking a bit with the guy from https://canary.discord.com/channels/267624335836053506/1232681051005128754

light osprey Apr 24, 2024, 1:52 PM

#

agile cobalt maybe try talking a bit with the guy from https://canary.discord.com/channels/26...

I myself am trying to figure it out c,: Based on my research Tesseract and Googles Vision API are indeed the most common tools to use, however, I am trying to find free solution since I need to use it regularly

agile cobalt Apr 24, 2024, 1:52 PM

#

I'm pretty sure that tesseract is free?

#

yeah it's licensed under Apache 2.0

light osprey Apr 24, 2024, 1:54 PM

#

Oh yeah Tesseract OCR right? Its free but specifically for handwritten text I ve read its not as good as it is for the printed text

#

I meant Transkribus and Google Vision API

agile cobalt Apr 24, 2024, 1:56 PM

#

well yeah handwritten text is quite a fair bit harder to classify than printed text to say the least

you can try training/fine-tuning tesseract on your data, specially if it follows a certain format or style, but if you are trying to recognize any and all random person's handwritting, good luck

#

it gets even worse with other languages but I am assuming English formal-ish documents for both of you?

marble spindle Apr 24, 2024, 1:57 PM

#

agile cobalt if you're dealing with a relatively small volume of images, GPT4-Vision could al...

Yeah, Hugging Face would be a better option. I've tried some of their models, but they don't extract some parts correctly. They only extract numbers along with some alphabet characters. If anyone is working on that project or willing to collaborate for a better outcome, I would be immensely grateful.

light osprey Apr 24, 2024, 1:59 PM

#

I truly need some luck😂 because I need it for multiple people handwritings. At first I thought ill use Tesseract for that but then read that TensorFlow has some good results. Im just in the beginning of my research cuz i cant run this goddamn code lmao. Yeah and I need it for other languages 😂😂 Im trying to practice at least for English language tho

agile cobalt Apr 24, 2024, 2:01 PM

#

it may be worth considering forcing these people to type in digital documents instead of going out of the way to recognize their handwriting

(half joking. only half.)

marble spindle Apr 24, 2024, 2:01 PM

#

agile cobalt well yeah handwritten text is quite a fair bit harder to classify than printed t...

Extracting text from printed documents is relatively straightforward due to standardized fonts and formatting. However, handwritten text presents a much greater challenge due to the variability in individual handwriting styles. Training a model to accurately recognize and extract handwritten text would indeed be a substantial task, requiring a large and diverse dataset for training and sophisticated algorithms for recognition. Collaboration and innovation in this area are crucial for advancing the capabilities of such models and making them more reliable and accurate in real-world applications.

light osprey Apr 24, 2024, 2:02 PM

#

They are my clients so they wont lift a finger to make our job easier…

light osprey Apr 24, 2024, 2:03 PM

#

marble spindle Extracting text from printed documents is relatively straightforward due to stan...

Did you check out this source? He also has explanation on youtube. I am trying to test it https://github.com/pythonlessons/mltu/tree/main/Tutorials/04_sentence_recognition

GitHub

mltu/Tutorials/04_sentence_recognition at main · pythonlessons/mltu

Machine Learning Training Utilities (for TensorFlow and PyTorch) - pythonlessons/mltu

agile cobalt Apr 24, 2024, 2:03 PM

#

just make sure they recognise there is no way it will get 100% accuracy otherwise things will end up pretty poorly for both you and your clients

marble spindle Apr 24, 2024, 2:04 PM

#

Is anyone out there interested in collaborating to work on handwritten text recognition? By pooling our efforts together, we could develop a single application that addresses this challenge, potentially leading to significant success in this field.

marble spindle Apr 24, 2024, 2:04 PM

#

light osprey Did you check out this source? He also has explanation on youtube. I am trying t...

yes dude

marble spindle Apr 24, 2024, 2:06 PM

#

agile cobalt just make sure they recognise there is no way it will get 100% accuracy otherwis...

But we can achieve at least a 98-99.9% success rate with it.

agile cobalt Apr 24, 2024, 2:08 PM

#

realistically I'd expect 90% at best for users not in the training data, and that's assuming their handwriting is readable in first place

light osprey Apr 24, 2024, 2:08 PM

#

marble spindle Is anyone out there interested in collaborating to work on handwritten text reco...

I recently got assigned to find the solution for this, so Im down. I spent couple days searching for the solution on the internet. But I am not really experienced in this field nor in python… i did finish machine learning course on udemy tho😅

marble spindle Apr 24, 2024, 2:11 PM

#

light osprey I recently got assigned to find the solution for this, so Im down. I spent coupl...

Great, let's continue learning together, and there's still much more ahead. If possible, we could work on a handwritten to digital text project.

drowsy sleet Apr 24, 2024, 2:24 PM

#

Hey everyone, I'm currently registered in a Exploratory Data Analysis course in our university and they want us to participate in a Kaggle competition which is based off of EDA (Exploratory Data Analysis) followed with prediction model on the dataset. Can someone provide me good resources which can help me to learn for the same. I don't mind if the resource is a website / video. I'm fine with anything I just need to know what all I must learn to do good in the competition while also learning good Data Analysis

lofty thorn Apr 24, 2024, 4:06 PM

#

hi

runic parcel Apr 24, 2024, 4:41 PM

#

what is the meaning of removing non linearity? like when we add a activation function in a convolution cnn model..

agile cobalt Apr 24, 2024, 4:58 PM

#

runic parcel what is the meaning of removing non linearity? like when we add a activation fun...

in a nutshell:

without activation functions, the entire model can be represented as a single linear equation, that multiplies each input by a fixed number then sums them all together

with activation functions, it adds a lot of "if" cases that let the model model more complex cases

runic parcel Apr 24, 2024, 4:59 PM

#

agile cobalt in a nutshell: without activation functions, the entire model can be represente...

what does non linearity means? in a cnn

agile cobalt Apr 24, 2024, 4:59 PM

#

have you seen the curve for the RELU function?

runic parcel Apr 24, 2024, 4:59 PM

#

yes

#

u mean the graph of relu function?

agile cobalt Apr 24, 2024, 5:00 PM

#

yep, literally this

runic parcel Apr 24, 2024, 5:00 PM

#

yes yes i have see

#

no curve

agile cobalt Apr 24, 2024, 5:00 PM

#

it literally means "not a straight line"

runic parcel Apr 24, 2024, 5:00 PM

#

yes so?

agile cobalt Apr 24, 2024, 5:01 PM

#

not being linear lets it model more complex curves

runic parcel Apr 24, 2024, 5:01 PM

#

hm

wooden sail Apr 24, 2024, 5:02 PM

#

nonlinear here does not refer to "not a straight line"

#

it refers to violating the property T(cx + y) = cT(x) + T(y) for two inputs x and y and a scalar c

runic parcel Apr 24, 2024, 5:05 PM

#

wooden sail nonlinear here does not refer to "not a straight line"

like what does it mean to remove non linearity from a convolution image after applying feature detector?

wooden sail Apr 24, 2024, 5:05 PM

#

i need more context

runic parcel Apr 24, 2024, 5:06 PM

#

in a convolutional layer we apply a feacure dectetor to a image

#

and then we use rectifier function on it to remove the non linearity in the image

#

so it makes it easier to read for the neural network

wooden sail Apr 24, 2024, 5:07 PM

#

i'd need to see where you're getting this from because none of that makes sense to me as you've written it

#

that all kinda sounds wrong without all the context

#

you don't do feature detection inside a convolutional layer and the rectifier function introduces nonlinearity

#

if someone has written this explicitly anywhere, they mean it in some special sense that you'd have to share with us to understand it

serene scaffold Apr 24, 2024, 5:08 PM

#

"nonlinearity" is an interesting case of a word that's very to-the-point, but somehow makes it sound like it's more complicated than it is.

runic parcel Apr 24, 2024, 5:09 PM

#

wooden sail if someone has written this explicitly anywhere, they mean it in some special se...

it is actuall from a cours...

wooden sail Apr 24, 2024, 5:09 PM

#

then they must have given the specific definitions that explain what they mean

runic parcel Apr 24, 2024, 5:10 PM

#

wooden sail if someone has written this explicitly anywhere, they mean it in some special se...

i mean to say the image is converted into numericals values and then a feature dector is applied like blur, edge detect

wooden sail Apr 24, 2024, 5:10 PM

#

ok, so you mean they apply a convolution/filter. that's a linear operation

runic parcel Apr 24, 2024, 5:11 PM

#

yes

#

and after that we use a activation function: recitifer, to the convolution layer

wooden sail Apr 24, 2024, 5:11 PM

#

mhm, and that introduces nonlinearity

runic parcel Apr 24, 2024, 5:12 PM

#

yes before

runic parcel Apr 24, 2024, 5:12 PM

#

runic parcel and after that we use a activation function: recitifer, to the convolution layer

then we do this

#

to remove the non linearity

wooden sail Apr 24, 2024, 5:13 PM

#

you just said the convolution happens first

#

which one is it?

runic parcel Apr 24, 2024, 5:14 PM

#

first the convoltion happens

#

so add the filter

#

and there is non linearity, so we use the refifier activation function

#

Screenshot_2024-04-24_at_10.47.38_PM.png

#

after filter

Screenshot_2024-04-24_at_10.47.59_PM.png

#

after rectifier functions

Screenshot_2024-04-24_at_10.48.11_PM.png

runic parcel Apr 24, 2024, 5:18 PM

#

wooden sail which one is it?

tthis si the images from the course

wooden sail Apr 24, 2024, 5:23 PM

#

right. first you convolve (linear), then you rectify (nonlinear)

runic parcel Apr 24, 2024, 5:24 PM

#

yes>?

wooden sail Apr 24, 2024, 5:28 PM

#

which is the opposite of what you had written before

runic parcel Apr 24, 2024, 5:32 PM

#

How

runic parcel Apr 24, 2024, 5:32 PM

#

wooden sail which is the opposite of what you had written before

Show where

wooden sail Apr 24, 2024, 5:35 PM

#

runic parcel and then we use rectifier function on it to remove the non linearity in the imag...

here

runic parcel Apr 24, 2024, 5:40 PM

#

wooden sail here

Its right

#

The image has nonlinearity and the rectifier function removes as it

wooden sail Apr 24, 2024, 5:41 PM

#

you understood nothing of what we just discussed 😛

spring field Apr 24, 2024, 6:09 PM

#

does linear mean that the 1st derivative is a constant and the same at all points??

#

therefore nonlinear would be literally anything else

#

and for relu it's not linear because it includes a condition which changes the derivative when x < 0

runic parcel Apr 24, 2024, 6:10 PM

#

wooden sail you understood nothing of what we just discussed 😛

fuck ig i saw it wrong in the course, let me give a shot and surf thanks man

#

fuck i saw all wong,

#

its used to increase the nonlinearity

wooden sail Apr 24, 2024, 6:13 PM

#

there we go

wooden sail Apr 24, 2024, 6:13 PM

#

spring field does linear mean that the 1st derivative is a constant and the same at all point...

nope

spring field Apr 24, 2024, 6:14 PM

#

is it to do with linear transformations?

runic parcel Apr 24, 2024, 6:14 PM

#

wooden sail there we go

its used to break the nonlinearity and increase it

#

ahhh understood

wooden sail Apr 24, 2024, 6:14 PM

#

spring field is it to do with linear transformations?

this is exactly what it has to do with

#

a transformation is linear if it satisfies the condition i mentioned above

#

T(ax + by) = aT(x) + bT(y) for scalars a and b, and inputs x and y

#

as an example, integration and differentiation with respect to one variable are both linear transformations

spring field Apr 24, 2024, 6:15 PM

#

is T a matrix?

wooden sail Apr 24, 2024, 6:16 PM

#

not in general, no

wooden sail Apr 24, 2024, 6:16 PM

#

wooden sail as an example, integration and differentiation with respect to one variable are ...

here, T is integration or differentiation, for example

spring field Apr 24, 2024, 6:16 PM

#

so, if we take integration and differentiation as functions, the T is a function
so, transforms the input and returns the transformation?

wooden sail Apr 24, 2024, 6:16 PM

#

however, matrices and matrix multiplication are defined precisely so that they represent a linear transformation in a particular input basis and output basis

wooden sail Apr 24, 2024, 6:17 PM

#

spring field so, if we take integration and differentiation as functions, the T is a function...

no, T is integration and x is a function

#

if you integrate a function you get another function

#

this is a linear transformation

spring field Apr 24, 2024, 6:18 PM

#

I was more thinking of integration itself being a function that takes in a function, but I think I gotcha (on some level 😁 )

wooden sail Apr 24, 2024, 6:18 PM

#

that's what i mean too

strong nymph Apr 24, 2024, 6:19 PM

#

I want a code for interactive Box plot for outliers, If someone knows

#

and how do I remove them

wooden sail Apr 24, 2024, 6:20 PM

#

.latex let $T(x)$ be defined as
[
T(x) = \int x(u) du.
]
$T$ has the property that
[
T(ax + by) = \int (ax(u) + by(u)) du = a \int x(u) du + b \int y(u) du = aT(x) + bT(y)
]

strange elbowBOT Apr 24, 2024, 6:20 PM

#

$latex.png$

wooden sail Apr 24, 2024, 6:20 PM

#

@spring field

#

which means integration is linear

spring field Apr 24, 2024, 6:23 PM

#

and x and y can be any function (that can also supposedly be integrated)?

wooden sail Apr 24, 2024, 6:23 PM

#

right

#

(more formally this is done with definite integrals, this is very loose but gets the point across)

long canopy Apr 24, 2024, 6:26 PM

#

why is echo "$(git rev-parse --show-top-level)" returning --show-top-level

spring field Apr 24, 2024, 6:27 PM

#

so
max(ax + by, 0) != a * max(x, 0) + b * max(y, 0)
and then I suppose in this case it's enough for one case where this is True, like say a = 1, b = 2, x = 5, y = -2 where max(1 * 5 + 2 * -2, 0) = 1, but 1 * max(5, 0) + 2 * (max(-2, 0)) = 5 (idk how else this can be proved, analytically?)
for it to fail the linearity condition
pretty cool

wooden sail Apr 24, 2024, 6:28 PM

#

spring field so `max(ax + by, 0) != a * max(x, 0) + b * max(y, 0)` and then I suppose in this...

right, this has to do with quantifiers

#

the definition of linearity has to be evaluated for all a, b, x, y

spring field Apr 24, 2024, 6:28 PM

#

wooden sail (more formally this is done with definite integrals, this is very loose but gets...

is that due to C being different for two indefinite integrals?

wooden sail Apr 24, 2024, 6:28 PM

#

that means that if there exists a single counter example, the function is not linear

runic parcel Apr 24, 2024, 6:28 PM

#

how much does it reduce when you use pooling to a convolution layer, so like from 5x5 to 3x3

spring field Apr 24, 2024, 6:29 PM

#

makes sense, I just wanted to maybe prove it more neatly than just finding a single counter example 😁

wooden sail Apr 24, 2024, 6:29 PM

#

spring field makes sense, I just wanted to maybe prove it more neatly than just finding a sin...

that's already a formal proof

#

https://en.wikipedia.org/wiki/Quantifier_(logic)

#

showing a single counterexample exists is proper proof

long canopy Apr 24, 2024, 6:30 PM

#

spring field makes sense, I just wanted to maybe prove it more neatly than just finding a sin...

proof by contradiction is proof

wooden sail Apr 24, 2024, 6:31 PM

#

spring field is that due to C being different for two indefinite integrals?

the c has to be set to 0 anyway, otherwise you get an affine transformation which is not linear

spring field Apr 24, 2024, 6:31 PM

#

long canopy proof by contradiction is proof

ofc ofc, I just feel as though something more format would be just cooler to show, like in an academia environment for example

wooden sail Apr 24, 2024, 6:33 PM

#

contradiction and counterexample are not the same

long canopy Apr 24, 2024, 6:33 PM

#

spring field ofc ofc, I just feel as though something more format would be just cooler to sho...

the only way to prove that something does not satisfy a condition is to show that it doesn't satisfy the condition

wooden sail Apr 24, 2024, 6:33 PM

#

which you can do either by constructing a counterexample, or by assuming the condition is satisfied and showing it leads to a contradiction

spring field Apr 24, 2024, 6:34 PM

#

runic parcel how much does it reduce when you use pooling to a convolution layer, so like fro...

could you elaborate a bit more on that? there's a formula for calculating the output size given kernel size, stride, and padding: math.floor((input_dim + 2 * padding - kernel_size) / stride) + 1

#

now that I think about it more, is an average pool over an image a linear convolution where's something like max pool would not be linear (is it still a convolution)?

#

in fact, average pool just seems like a box blur/mean blur (given stride = 1)

wooden sail Apr 24, 2024, 6:40 PM

#

correct on all accounts

tired lodge Apr 24, 2024, 6:40 PM

#

spring field could you elaborate a bit more on that? there's a formula for calculating the ou...

can i pls get help? #1232755927380262922

spring field Apr 24, 2024, 6:42 PM

#

that's as random a ping as a ping can be random 😄

full leaf Apr 24, 2024, 6:48 PM

#

Hi guys, I am trying to rename some image files using os for an experiment I am running. It works but something messed up and deleted some files, so I am trying to rewrite the code to check first if the file exists before changing it's name. It works, though it now only runs once and ends.

tired otter Apr 24, 2024, 7:11 PM

#

In a simple GPT model (Karpathy's nanoGPT for ref), do i understand correctly, the only reason to aggregate every token's (past) neighbors is to increase number of subsamples + teach model to work with data of shorter lengths? So, in theory, we could have only aggregated last token and made a prediction based on that?

tired lodge Apr 24, 2024, 7:12 PM

#

spring field that's as random a ping as a ping can be random 😄

can u pls help tho 🙏

#

almighty data science guy

iron basalt Apr 24, 2024, 8:34 PM

#

spring field does linear mean that the 1st derivative is a constant and the same at all point...

Linear from calculus means something else, the linear algebra definition is the more "correct" (within a larger context) and general one.

#

https://en.wikipedia.org/wiki/Linearity

Linearity

In mathematics, the term linear is used in two distinct senses for two different properties:

linearity of a function (or mapping);
linearity of a polynomial.
An example of a linear function is the function defined by

...

#

There is also the physics idea of linear.

toxic mortar Apr 24, 2024, 9:23 PM

#

spring field that's as random a ping as a ping can be random 😄

that moment when random is not random, but a pseudo-random 😄

severe inlet Apr 25, 2024, 12:24 AM

#

could anyone pleaseplease point me in the direction for using autoencoders into lstm for stock price predictions, or an implementation of autoencoders in python

#

im trying to implement my own encoding, but i dont have access to the papers im finding online

marsh hearth Apr 25, 2024, 1:26 AM

#

Currently I'm taking calc 1, and I was wondering how much more math do I need to truly understand how machine learning works. Do I need to go to calc 2, and 3 first, and what else do I need to know?

serene scaffold Apr 25, 2024, 3:51 AM

#

marsh hearth Currently I'm taking calc 1, and I was wondering how much more math do I need to...

you need to know a lot more about derivatives than will be covered in calc 1. integrals (which is mostly calc 2) are less important; especially knowing all the different ways of calculating them by hand.

marsh hearth Apr 25, 2024, 3:56 AM

#

serene scaffold you need to know a lot more about derivatives than will be covered in calc 1. in...

so what classes teach the derivatives required beyond calc 1

serene scaffold Apr 25, 2024, 3:57 AM

#

marsh hearth so what classes teach the derivatives required beyond calc 1

whichever one covers multivariate

marsh hearth Apr 25, 2024, 3:58 AM

#

idk, im gonna take calc 1, 2, 3, 4 hopefully in the next few quarters so that hopefully covers it

serene scaffold Apr 25, 2024, 3:59 AM

#

marsh hearth idk, im gonna take calc 1, 2, 3, 4 hopefully in the next few quarters so that ho...

are you in high school or university or what?

marsh hearth Apr 25, 2024, 3:59 AM

#

high school taking classes at my local college

#

im gonna try and take calc 2 over the summer

serene scaffold Apr 25, 2024, 4:00 AM

#

marsh hearth high school taking classes at my local college

"quarters" don't apply to college/university courses. they devide up the academic year by "semesters"

marsh hearth Apr 25, 2024, 4:01 AM

#

don't know, its a community college that im duel enrolled in with the high school

#

for example, during winter quarter i took precalc, and spring quarter (this one) im taking calc 1

serene scaffold Apr 25, 2024, 4:01 AM

#

interesting.

#

is your goal to pursue ML academically/professionally?

marsh hearth Apr 25, 2024, 4:02 AM

#

potentially yes

#

but right now im more focused on graduating with my assosciates degree in comp sci before i graduate high school then transfer to a 4 year with that

#

and we'll see where i go from there

serene scaffold Apr 25, 2024, 4:06 AM

#

marsh hearth and we'll see where i go from there

it's good not to hyperfocus on only one possibility. if you want to pursue ML, you should add some stats courses to your plan. But I don't know which ones, except "better than the one that I took".

marsh hearth Apr 25, 2024, 4:07 AM

#

haha, ok thank you for the advice ill note that 📝

severe inlet Apr 25, 2024, 4:09 AM

#

serene scaffold it's good not to hyperfocus on only one possibility. if you want to pursue ML, y...

i think basic linear algebra would be good too, its proving its usefulness in my ML projects

hushed quartz Apr 25, 2024, 5:10 AM

#

Can I ask for some suggestions on preprocessing on here?

lofty thorn Apr 25, 2024, 5:14 AM

#

can someone explain this code to me

trim saddle Apr 25, 2024, 5:24 AM

#

lofty thorn can someone explain this code to me

High level: getting a subset of a dataframe and displaying the correlation result in a heatmap?
What exactly do you struggle with?

lofty thorn Apr 25, 2024, 5:26 AM

#

I am a beginner..
and don't know much about this.
is this code complete?

from where can i find the data

#

#

i know that these are nellipses for different numbers

#

and this is a correlation matrix

trim saddle Apr 25, 2024, 5:27 AM

#

lofty thorn I am a beginner.. and don't know much about this. is this code complete? from w...

Cant say exactly, cause it doesnt show what the datasource is in your snippet.

lofty thorn Apr 25, 2024, 5:27 AM

#

yea right

#

wait let me send one more shot

#

in this code it is clearly said to download from a particular site (github)..

but i can't find the data from this link

rn_image_picker_lib_temp_4deca410-ac24-4e9f-99b5-f4e19cac834c.jpg

trim saddle Apr 25, 2024, 5:31 AM

#

The csv path is given. Data should be in the lifesat.csv here
https://github.com/ageron/data/tree/main/lifesat

GitHub

data/lifesat at main · ageron/data

Datasets used in the book Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow - ageron/data

lofty thorn Apr 25, 2024, 5:34 AM

#

but it is not same...is it?

trim saddle Apr 25, 2024, 5:36 AM

#

The path in the script is the raw path for python to ingest the data

lofty thorn Apr 25, 2024, 5:48 AM

#

please tell me ,at what time helpers are more active?
i will come then on discord

trim saddle Apr 25, 2024, 5:59 AM

#

lofty thorn please tell me ,at what time helpers are more active? i will come then on discor...

Why do you need more people? If you have questions just ask them

#

Also the data from the lifesat has nothing to do with the etf data.
So its not the correspending datasource to your initial screenshot

lofty thorn Apr 25, 2024, 6:04 AM

#

trim saddle Also the data from the lifesat has nothing to do with the etf data. So its not ...

sorry for the confusion

lofty thorn Apr 25, 2024, 6:04 AM

#

lofty thorn can someone explain this code to me

right now

lofty thorn Apr 25, 2024, 6:05 AM

#

trim saddle Why do you need more people? If you have questions just ask them

i want to write code for this

#

how do i program this

digital cipher Apr 25, 2024, 7:05 AM

#

What are people using for virtualenv

#

I was trying to avoid conda.

trim saddle Apr 25, 2024, 7:12 AM

#

digital cipher What are people using for virtualenv

Im switching to rye atm, used virtualenvwrapper before

digital cipher Apr 25, 2024, 7:12 AM

#

Do you work on windows?

#

Just tryting to get tensorflow setup and its such a uphill battle 😢

wicked slate Apr 25, 2024, 7:44 AM

#

guys if anyone looking for domains or cloud storage or priavte ip security pls dm me!!

toxic mortar Apr 25, 2024, 8:44 AM

#

in my df, theres a column offset. How to align it with actual start of the data?

#

df = pd.read_csv(file_path)

raw mortar Apr 25, 2024, 9:29 AM

#

digital cipher Just tryting to get tensorflow setup and its such a uphill battle 😢

Just use the docker image

trim saddle Apr 25, 2024, 9:29 AM

#

digital cipher Do you work on windows?

Yes, i use rye or virtualenvwrapper-win

raw mortar Apr 25, 2024, 9:30 AM

#

raw mortar Just use the docker image

https://www.tensorflow.org/install/docker

TensorFlow

Docker | TensorFlow

toxic mortar Apr 25, 2024, 9:34 AM

#

is the 50 features a lot for 40k records neural network?

#

should I reduce features?

digital cipher Apr 25, 2024, 1:55 PM

#

I'll probably just use docker, thank you!

buoyant vine Apr 25, 2024, 2:21 PM

#

toxic mortar is the 50 features a lot for 40k records neural network?

It really depends on the quality of your data and how complicated your model is

vocal cove Apr 25, 2024, 3:50 PM

#

Greetings,

Hope all are well. Would this channel be appropriate for asking regarding jax? I'm trying to parallelize a for loop (non-sequential, meaning it's as if you run a function 100 times, where each iteration is independent), and thought it seemed best to try vmap, but am facing some difficulties.

path_list = [
            self.generate_path(
                self.initial_x,
                self.final_x,
                self.time_steps[0],
                self.time_steps[-1],
                self.bisection_level,
            )
            for _ in range(self.number_paths)
        ]

All parameters are ints.

toxic mortar Apr 25, 2024, 4:02 PM

#

buoyant vine It really depends on the quality of your data and how complicated your model is

What do you mean quality, like distribution across categories,etc...? Or noise from non frequent classes? Also what do you mean by model complexity? Is it how big neural net is or?

vocal cove Apr 25, 2024, 4:06 PM

#

toxic mortar What do you mean quality, like distribution across categories,etc...? Or noise f...

Quality of data means how accurately it captures the system you're trying to imitate. That covers the type of data, the distribution of data, the noise you're getting on the data, the normalization status of the data, etc.

So once you have the dataset, you then take a model to fit a function to this dataset (basically an N+1-dimensional function/distribution), so does your model have enough parameters, does it use the correct activation, etc.

#

Model complexity is a rather broad term, you have the number of layers, the number of neurons per layer, the connectivity between layers, the activation function used for the neurons, and even the type of layers (CNNs, you have convolution and pooling layers, so the pattern and shape which you create the layers would be a measure of complexity).

#

Also, it'd be best if you refer to sheer size as scale instead of complexity.

vivid gust Apr 25, 2024, 7:44 PM

#

would dual 4060 ti cards (16gb VRAM each, so 32GB VRAM total) be any use, considering the value for money
for model training

#

or would a single 3090 be wiser

lapis sequoia Apr 25, 2024, 10:18 PM

#

vivid gust would dual 4060 ti cards (16gb VRAM each, so 32GB VRAM total) be any use, consid...

oh dont bother asking any of that here youll just be blanked

umbral charm Apr 25, 2024, 10:24 PM

#

Erm

#

i have 9.7 million data points

#

takes around ~30 secs to plot

#

using matplotlib

#

any GPU accelarated modules i can use or CUDA?

serene scaffold Apr 25, 2024, 10:32 PM

#

@umbral charm what kind of plot is it?

umbral charm Apr 25, 2024, 10:33 PM

#

serene scaffold <@318024849333288960> what kind of plot is it?

scatter plot, However it takes a signficantly less time just to do plot.plt instead of plt.scatter

serene scaffold Apr 25, 2024, 10:33 PM

#

umbral charm scatter plot, However it takes a signficantly less time just to do plot.plt inst...

a human looking at a scatter plot with 9.7 million points won't be able to take all that in. so you should downsample in some way regardless.

umbral charm Apr 25, 2024, 10:34 PM

#

serene scaffold a human looking at a scatter plot with 9.7 million points won't be able to take ...

I was reading about that

#

what is down sampling tho

#

im guessing it just takes points which are kind of plotted on top of eachother

serene scaffold Apr 25, 2024, 10:34 PM

#

no

umbral charm Apr 25, 2024, 10:34 PM

#

and removes them

#

oh, what does it do

serene scaffold Apr 25, 2024, 10:35 PM

#

you can take a uniform random sample of the points. or if there's a way to aggregate points in a way that's meaningful, like taking the average of every point that represents the same day.

umbral charm Apr 25, 2024, 10:36 PM

#

That is very True

serene scaffold Apr 25, 2024, 10:37 PM

#

everything I say is very true
I'm a very stable genius

umbral charm Apr 25, 2024, 10:37 PM

#

Thats what an unstable genius would say

serene scaffold Apr 25, 2024, 10:38 PM

#

no

umbral charm Apr 25, 2024, 10:39 PM

#

Erm is there a way, on spyder or pycharm

#

to make it use 100% of cpu

serene scaffold Apr 25, 2024, 10:56 PM

#

umbral charm Erm is there a way, on spyder or pycharm

your IDE doesn't control what the code does. it's just there to help you write it

serene scaffold Apr 26, 2024, 1:57 AM

#

vocal cove Greetings, Hope all are well. Would this channel be appropriate for asking rega...

this is the channel for jax btw

left tartan Apr 26, 2024, 2:34 AM

#

serene scaffold you can take a uniform random sample of the points. or if there's a way to aggre...

FYI, since we're on the topic, lttb is phenomenal for line charts.

serene scaffold Apr 26, 2024, 2:35 AM

#

left tartan FYI, since we're on the topic, lttb is phenomenal for line charts.

idk what that is

left tartan Apr 26, 2024, 2:35 AM

#

Im looking for the paper, one sec

#

https://github.com/sveinn-steinarsson/flot-downsample/ is the authors GitHub, https://skemman.is/bitstream/1946/15343/3/SS_MSthesis.pdf is the paper

#

(The technique is highly effective, I've used it for years)

#

Discussion here: https://taipy.io/blog/python-charting-taming-big-data-without-crashing

quasi bramble Apr 26, 2024, 2:59 AM

#

Do most data scientists work on-site or remotely?

craggy coral Apr 26, 2024, 3:19 AM

#

did AI's like chat gpt use python to machine learn

magic dune Apr 26, 2024, 5:21 AM

#

craggy coral did AI's like chat gpt use python to machine learn

yes

vagrant root Apr 26, 2024, 7:34 AM

#

hi

#

can anyone tell me why my model's first run predictions are always 0 or 1

jaunty helm Apr 26, 2024, 7:42 AM

#

How do you guys incorporate one-hot encoding with train-test splitting? (and also cross validating, etc.)
More specifically, I often get stuck with something like this:

steps = [
  ('transform_step_1', ...), 
  ('fill_nulls', ...),
  ('add_more_columns', ...),
  # ('one hot encode', what_to_do),
  ('estimator', ...)
]
pipeline = make_pipeline(steps)
```and I basically have 2 options
1. one-hot encode the entire training set before pipeline
```py
# e.g. one of the below
import pandas as pd
from sklearn.preprocessing import OneHotEncoder
X = pd.get_dummies(X)
X = OneHotEncoder().fit_transform(X)
```the problem is I often need some preprocessing before I want to do one-hot (e.g. fill nulls, maybe add more nominal columns)
2. one-hot as a step in the `sklearn.Pipeline`
```py
steps = [
  ('transform_step_1', ...), 
  ('fill_nulls', ...),
  ('add_more_columns', ...),
  ('one hot encode', OneHotEncoder()),
  ('estimator', ...)
]
```The problem is that I also use `train_test_split`/cross validation
```py
# manual split
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y)
pipeline.fit(X_train, y_train)
y_pred = pipeline.predict(X_test)
print(mean_squared_error(y_test, y_pred))

# cv
from sklearn.model_selection import cross_val_score, KFold
cv = KFold(5, shuffle=True)
print(cross_val_score(pipeline, X, y, cv=cv, scoring=mean_squared_error)
```and sometimes there will be values present in the test split not in the train split, so OHE just fails

#

There's also option 3, but that's to manually type out all possible values in the nominal columns and give it to OneHotEncoder... and I really do not want to do that

boreal gale Apr 26, 2024, 8:16 AM

#

jaunty helm How do you guys incorporate one-hot encoding with train-test splitting? (and als...

have you looked into the documentation of one hot encoder?
there is a handle_unknown arg.

and how come not option 3?
if your dataset already have well defined categories for a specific column, why not transform that column's dtype to categorical when you read/clean your dataset and then use the category information in the now categorical column for the one hot encoder?

jaunty helm Apr 26, 2024, 8:21 AM

#

boreal gale have you looked into the documentation of one hot encoder? there is a `handle_un...

there is a handle_unknown arg.
ah... don't know how I missed that

and how come not option 3? ...
emphasis is I don't want to do it manually
but now that you mention it, I think I have an idea

make a pipeline with steps up to where I'd want to one-hot encode
transform the entire dataset with it
extract all the unique values from the categorical cols and put it in a variable
use that variable as the categories= of the OneHotEncoder

ty for your suggestions!

boreal gale Apr 26, 2024, 8:26 AM

#

jaunty helm > there is a `handle_unknown` arg. ah... don't know how I missed that > and how...

are you using https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html as well?

i was thinking to handle each column to be one hot encoded separately.

(i have no idea if this is how it (the column transformer) works btw, it just reminded me of a collection of tooling i have written for my last job years ago for working with sklearn's pipeline better)

scikit-learn

sklearn.compose.ColumnTransformer

Examples using sklearn.compose.ColumnTransformer: Release Highlights for scikit-learn 1.4 Release Highlights for scikit-learn 1.2 Release Highlights for scikit-learn 1.1 Release Highlights for scik...

#

# read data
# convert to columns to categorical columns where sensible
steps = [
  ('transform_step_1', ...), 
  ('fill_nulls', ...),
  ('add_more_columns', ...),
  ('one hot encode(s)', ColumnTransformer([
      ("ohe1", OneHotEncoder(df.i_am_categorical_column1.dtype.categories), "i_am_categorical_column1")
      ("ohe2", OneHotEncoder(df.i_am_categorical_column2.dtype.categories), "i_am_categorical_column2")
  ]),
  ('estimator', ...)
]
...

(not 100% sure it's .dtype.categories but something to that effect

jaunty helm Apr 26, 2024, 8:40 AM

#

boreal gale ```py # read data # convert to columns to categorical columns where sensible ste...

pretty much, this is what I came up with
(I'm also trying out polars and also have other helper stuff going on, so don't mind the syntax too much)

alltf = pipe.fit_transform(ALL)  # up to where I'd want to one-hot encode
categories = [alltf[col].unique() for col in CATEGORICAL_COLUMNS]

step = ('one hot encode', make_column_transformer([
    (
        OneHotEncoder(categories=categories, sparse_output=False), 
        CATEGORICAL_COLUMNS
    )
]))

... # add `step` to the pipeline

sweet kernel Apr 26, 2024, 8:45 AM

#

Hi, am facing issues in running pyspark in anaconda. I have set all the env. variables correctly but still facing issues. Can someone please help me??

full elbow Apr 26, 2024, 9:08 AM

#

hey guys can anyone help me

#

im tryna think of what i can use to make this work
basically i need to read a line of text from the output, grab a specific line from that output, and then store that line into a variable to be used later on in the process

#

Confidence Threshold:
31

0%


100%

{
  "predictions": [
    {
      "x": 115.5,
      "y": 227,
      "width": 79,
      "height": 88,
      "confidence": 0.374,
      "class": "rotten",
      "points": [```

for example, this is an output. I need to find "confidence": 0.374 and store it into a variable to be used later

#

the thing i sent is from roboflow

full elbow Apr 26, 2024, 9:51 AM

#

twitter is kind of a bad place for that

#

in my opinion

lofty thorn Apr 26, 2024, 11:11 AM

#

can someone please make me understand this code

#

i can't write the missing code

left tartan Apr 26, 2024, 11:33 AM

#

lofty thorn can someone please make me understand this code

Can you explain what youre trying t do?

left tartan Apr 26, 2024, 11:33 AM

#

lofty thorn can someone please make me understand this code

Or open a help thread: #❓｜how-to-get-help

lofty thorn Apr 26, 2024, 11:34 AM

#

want to make a correlation matrix..but annoying this is that the book does not have data but only code

#

i don't know where can i find the data

left tartan Apr 26, 2024, 11:34 AM

#

What are the columns of your data?

#

Pandas corr() will give you the matrix, provided each series is in a separate column (ie: each column is a member of spx, and each row is a date)

lofty thorn Apr 26, 2024, 11:35 AM

#

this is all i have

rn_image_picker_lib_temp_7575bf8b-92d1-4a4b-819d-dc0919c37769.jpg

left tartan Apr 26, 2024, 11:36 AM

#

What's your data

lofty thorn Apr 26, 2024, 11:36 AM

#

i don't have the data

left tartan Apr 26, 2024, 11:37 AM

#

Uh, then how are you going to write code?

lofty thorn Apr 26, 2024, 11:37 AM

#

that is the issue...🤕is it available online??

#

idk

#

all i have is this code and matrix

left tartan Apr 26, 2024, 11:38 AM

#

You could try yfinance

lofty thorn Apr 26, 2024, 11:39 AM

#

what is this

left tartan Apr 26, 2024, 11:40 AM

#

!pypi yfinance

arctic wedgeBOT Apr 26, 2024, 11:40 AM

#

yfinance v0.2.38

Download market data from Yahoo! Finance API

Released on <t:1713302444:D>.

lofty thorn Apr 26, 2024, 11:45 AM

#

left tartan !pypi yfinance

does this code have any data??..i don't think so..
m

rn_image_picker_lib_temp_3784f046-1d60-4ca2-ac0f-30c256e45c18.jpg

spring field Apr 26, 2024, 11:51 AM

#

vagrant root can anyone tell me why my model's first run predictions are always 0 or 1

well, if you could provide use with more of the details, then maybe it could be possible, but currently we have no information whatsoever about what you're doing

spring field Apr 26, 2024, 11:53 AM

#

lofty thorn does this code have any data??..i don't think so.. m

you can use yfinance to get data from the Yahoo! Finance API, I'd suggest maybe looking at their docs to find out how to do that

lofty thorn Apr 26, 2024, 11:54 AM

#

spring field you can use `yfinance` to get data from the Yahoo! Finance API, I'd suggest mayb...

what type of data yfinance has?

left tartan Apr 26, 2024, 11:54 AM

#

lofty thorn what type of data yfinance has?

Finance data... I'm really confused what you're asking. Your original example was financial data, right?

lofty thorn Apr 26, 2024, 11:55 AM

#

wait .

#

rn_image_picker_lib_temp_9f3515cc-8247-4c3f-b4e0-4597642ab19b.jpg

lofty thorn Apr 26, 2024, 11:57 AM

#

left tartan Finance data... I'm really confused what you're asking. Your original example wa...

no

vagrant root Apr 26, 2024, 11:59 AM

#

spring field well, if you could provide use with more of the details, then maybe it could be ...

Fixed now

#

I think it was an overfitting issue

spring field Apr 26, 2024, 11:59 AM

#

lofty thorn no

well, what kind of data is it then?

lofty thorn Apr 26, 2024, 12:02 PM

#

maybe stock market related..idk know..sorry for the misconception

spring field Apr 26, 2024, 12:03 PM

#

I see a positive correlation between stock market related data and finance data

lofty thorn Apr 26, 2024, 12:09 PM

#

positive ?? how

left tartan Apr 26, 2024, 12:42 PM

#

lofty thorn positive ?? how

yfinance gives you stock market data.

runic parcel Apr 26, 2024, 1:17 PM

#

what happens with the units in:
cnn.add(tf.keras.layers.Dense(units=128, activation="relu"))
is it for the number of neurons in the hidden layer?

serene scaffold Apr 26, 2024, 1:37 PM

#

runic parcel what happens with the units in: ```cnn.add(tf.keras.layers.Dense(units=128, acti...

can you show the rest of how cnn is defined?

#

what kind of team?

runic parcel Apr 26, 2024, 1:58 PM

#

serene scaffold can you show the rest of how `cnn` is defined?

what do u want to see in the rest?

#

lofty thorn Apr 26, 2024, 2:01 PM

#

I have this book @serene scaffold ..

it has programs as an e.g. in book..but doesn't show data..

do you know where can i find data of codes written in this book

serene scaffold Apr 26, 2024, 2:03 PM

#

lofty thorn I have this book <@253696366952316929> .. it has programs as an e.g. in book..b...

it probably says somewhere in the introduction where there's a repository with all that.

serene scaffold Apr 26, 2024, 2:06 PM

#

runic parcel

I haven't worked with CNNs or keras before, but the dense layer having 128 "units" probably reflects the output of previous layers. because each input is a (64, 64, 3)-shape tensor, and 128 is 64 * 2.

lofty thorn Apr 26, 2024, 2:17 PM

#

are these all data sets?

serene scaffold Apr 26, 2024, 2:19 PM

#

lofty thorn are these all data sets?

they're CSV files. "dataset" is more abstract. it might be that a few CSV files comprise a dataset.

#

but it's likely that those CSV files are the ones the code examples refer to.

lofty thorn Apr 26, 2024, 2:33 PM

#

serene scaffold they're CSV files. "dataset" is more abstract. it might be that a few CSV files ...

do i have to download all csv files in order to extract the data?

#

https://github.com/gedeck/practical-statistics-for-data-scientists/tree/master/data

this is the link..please guide me

GitHub

practical-statistics-for-data-scientists/data at master · gedeck/pr...

Code repository for O'Reilly book. Contribute to gedeck/practical-statistics-for-data-scientists development by creating an account on GitHub.

past meteor Apr 26, 2024, 2:36 PM

#

lofty thorn https://github.com/gedeck/practical-statistics-for-data-scientists/tree/master/d...

I feel like this book is only good if you've already had a university course on stats and want a refresher

runic parcel Apr 26, 2024, 2:41 PM

#

@serene scaffold what is the use of target_size()?

#

what does it do?

serene scaffold Apr 26, 2024, 2:41 PM

#

runic parcel <@253696366952316929> what is the use of target_size()?

where?

runic parcel Apr 26, 2024, 2:42 PM

#

serene scaffold where?

#

while training and testing the cnn

#

is it for reducing the image size?

past meteor Apr 26, 2024, 2:43 PM

#

The height and width of your image

serene scaffold Apr 26, 2024, 2:43 PM

#

runic parcel ```training_set = train_datagen.flow_from_directory('dataset/training_set', targ...

looks like this is something specific to keras rather than neural networks in general. I don't know.

if you're in a notebook, do train_datagen.flow_from_directory?, with a question mark at the end, as the only code in a new cell.

past meteor Apr 26, 2024, 2:44 PM

#

runic parcel ```training_set = train_datagen.flow_from_directory('dataset/training_set', targ...

Do you know what flow from directory does?

runic parcel Apr 26, 2024, 2:45 PM

#

past meteor Do you know what flow from directory does?

no what does it?

#

reads the images from the direcotry right?

past meteor Apr 26, 2024, 2:47 PM

#

runic parcel reads the images from the direcotry right?

Exactly, it creates a dataset which is an object that is capable of iterating over your files in the directory in a batch. Target size is just the size of each image

runic parcel Apr 26, 2024, 2:48 PM

#

past meteor Exactly, it creates a dataset which is an object that is capable of iterating ov...

ahhh alright, so like i can reduce the size for running my model faster?

past meteor Apr 26, 2024, 2:50 PM

#

runic parcel ahhh alright, so like i can reduce the size for running my model faster?

It's been a while I used Tensorflow/Keras (they change their API a lot) but when I did the canonical way was using "preprocessing layers". Basically it's a layer you add right before your neural net that resizes the images

#

Or does any other thing you want

runic parcel Apr 26, 2024, 2:50 PM

#

past meteor It's been a while I used Tensorflow/Keras (they change their API a lot) but when...

allrightt

runic parcel Apr 26, 2024, 2:51 PM

#

past meteor It's been a while I used Tensorflow/Keras (they change their API a lot) but when...

and do uk what is the use of filters? in layers.conv2d()

past meteor Apr 26, 2024, 2:51 PM

#

runic parcel allrightt

Here you go

https://keras.io/api/layers/preprocessing_layers/image_preprocessing/resizing/

Check out https://keras.io/api/layers/preprocessing_layers/
For the full list

Keras documentation: Resizing layer

Keras documentation: Preprocessing layers

runic parcel Apr 26, 2024, 2:51 PM

#

past meteor Here you go https://keras.io/api/layers/preprocessing_layers/image_preprocessin...

thanksss mannn

past meteor Apr 26, 2024, 2:54 PM

#

runic parcel and do uk what is the use of filters? in layers.conv2d()

Sure, the intuition is that each filter is looking for a feature in your image. In the first few layers the filters are detecting lines, edges and so on. Deeper in the network they're composed into corners, circles and so on. Even deeper they become things that may help for the downstream task. Finally, the dense layers take the extracted features and use them to make a decision.

If you have 64 filters you're looking for 64 features across each position in your image.

The analogy I like using is that the conv layers are about learning how to see and the dense layers are about learning how to decide based on things you've seen.

#

This may not necessarily be true but anthropomorphisizing CNN's helps you understand them faster 😄

spring field Apr 26, 2024, 3:00 PM

#

runic parcel what happens with the units in: ```cnn.add(tf.keras.layers.Dense(units=128, acti...

it appears to be the number of output features for that layer
the input for that layer seems to be 14 * 14 * 32 features?

#

ngl, but the more I see tf being used, the more I see why it's falling out of usage

past meteor Apr 26, 2024, 3:04 PM

#

spring field ngl, but the more I see tf being used, the more I see why it's falling out of us...

It's actually easier than torch tbh

#

The reason why I no longer use it is simple, they change their API too much

#

Constant breaking changes

runic parcel Apr 26, 2024, 3:04 PM

#

past meteor Sure, the intuition is that each filter is looking for a feature in your image. ...

but which feature does it takes? randomly on its own?

spring field Apr 26, 2024, 3:04 PM

#

that's an interesting development choice

runic parcel Apr 26, 2024, 3:04 PM

#

like edge detect, blur, and so on?

past meteor Apr 26, 2024, 3:05 PM

#

runic parcel but which feature does it takes? randomly on its own?

And that's the point of "learning", it learns which features to detect and how to classify them in an end-to-end way

spring field Apr 26, 2024, 3:07 PM

#

mmm, ig I might also be biased towards pytorch because I started with it, but still, like in tf, like the dense layers, you don't have to specify the input feature size apparently? which I find kinda unreadable, lol, I mean, ig that's what makes it simpler to use, but yeah

past meteor Apr 26, 2024, 3:07 PM

#

70% of the deep learning I did in my master's was actually with TF/Keras, 20 % with MATLAB (... lmao) and 10 % torch

past meteor Apr 26, 2024, 3:08 PM

#

spring field mmm, ig I might also be biased towards pytorch because I started with it, but st...

Honestly, needing to specify the input is redundant 9 times out of 10

past meteor Apr 26, 2024, 3:08 PM

#

past meteor 70% of the deep learning I did in my master's was actually with TF/Keras, 20 % w...

Now I've switched and I'd always recommend Torch to folk

spring field Apr 26, 2024, 3:08 PM

#

yay

past meteor Apr 26, 2024, 3:08 PM

#

But if TF were consistent...

#

It's better

#

Most people that prefer torch haven't even used TF

#

But at the end of the day, with all the breaking changes they've converged to being pretty much the same library especially if you add lightning into the mix

spring field Apr 26, 2024, 3:13 PM

#

oh well, guess I might give it a chance at some point
(but at least looking at what I've seen others write with tf, it doesn't seem particularly enticing to me)

past meteor Apr 26, 2024, 3:14 PM

#

If you want to try something else then I'd actually advise you to try out Jax

#

I'd describe Jax as something you use to make a DL framework and not a DL framework at all (because it's mostly JIT, autodiff and so on)

spring field Apr 26, 2024, 3:16 PM

#

noted, I've heard of it as well, but only very little, will check it out though, thanks firNotes

past meteor Apr 26, 2024, 3:16 PM

#

Amazon (sadly) likes MXNet

#

So a lot of the SoTA time series models are done with that

#

That's another contender for "I want to do something different", but it's only if you're doing SoTA time series analysis. Otherwise I'd say the best advice in 2024 is "just stick to torch" 😄

lofty thorn Apr 26, 2024, 3:19 PM

#

i can't find data sets on github..
can anyone help me..
i am stuck from a while

#

please anyone

past meteor Apr 26, 2024, 3:23 PM

#

lofty thorn i can't find data sets on github.. can anyone help me.. i am stuck from a while

https://www.kaggle.com/datasets

Find Open Datasets and Machine Learning Projects | Kaggle

Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More. Flexible Data Ingestion.

lofty thorn Apr 26, 2024, 3:39 PM

#

chatgpt understands me better than humans..🥲

spring field Apr 26, 2024, 3:40 PM

#

it has no ability to "understand"

#

also, if you can't find datasets that are from the book, well, ig you can practice getting datasets from elsewhere and adapting them to the code in the book

lofty thorn Apr 26, 2024, 3:41 PM

#

spring field also, if you can't find datasets that are from the book, well, ig you can practi...

that is what i will do now

finite sierra Apr 26, 2024, 6:43 PM

#

I have 2 arrays:
numbers which consists of integers, and I will multiply every one of them by a certain value
mask which will be an array of True/False values, True for values I don't want to multiply (will cause overflow), and False for values I want to multiply

What is a smart way to perform the multiplication for values in numbers that are FALSE in mask?
I tried doing np.where(mask, nan, numbers * scalar) but that still does the multiplication on all numbers which results in overflow warning.

wooden sail Apr 26, 2024, 6:53 PM

#

finite sierra I have 2 arrays: `numbers` which consists of integers, and I will multiply every...

maybe numbers[np.logical_not(mask)] *= scalar

#

that'll only multiply the numbers where the mask is False

#

!e

import numpy as np
scalar = 100
numbers = np.array([[100_000, 0.1], [0.3, 0.2]])
mask = np.array([[True, False], [False, False]])
numbers[np.logical_not(mask)] *= scalar
print(numbers)

arctic wedgeBOT Apr 26, 2024, 6:56 PM

#

@wooden sail :white_check_mark: Your 3.12 eval job has completed with return code 0.

001 | [[1.e+05 1.e+01]
002 |  [3.e+01 2.e+01]]

wooden sail Apr 26, 2024, 6:56 PM

#

there we go

finite sierra Apr 26, 2024, 8:02 PM

#

wooden sail maybe `numbers[np.logical_not(mask)] *= scalar`

Exactly what I was looking for!

#

but would it be possible to somehow set all numbers to nan that didn't get multiplied in this statement?

wooden sail Apr 26, 2024, 8:03 PM

#

should be able to do the opposite indexing

#

numbers[mask] = np.NAN

finite sierra Apr 26, 2024, 8:03 PM

#

alright

wooden sail Apr 26, 2024, 8:04 PM

#

or whichever other definition of nan you like (e.g. float('nan') is another valid one, i think)

#

you could also mute the warning but i'm not sure that's the best idea

finite sierra Apr 26, 2024, 8:05 PM

#

and can I do this on a new array instead?

#

i.e. don't want to modify existing numbers

#

I could copy array then do that, but is there a smarter way without taking 3 steps of 1. copy 2. not-mask 3. mask

wooden sail Apr 26, 2024, 8:07 PM

#

sure, there are other ways. they all require at least as many steps though

#

you could create an array of 0s or an array of arbitrarily initialized values and then assign into that new array

#

!e

import numpy as np
scalar = 100
numbers = np.array([[100_000, 0.1], [0.3, 0.2]])
mask = np.array([[True, False], [False, False]])

receptacle = np.empty(shape=(2,2)) # 2d array full of random trash
receptacle[:] = np.NAN
receptacle[np.logical_not(mask)] = numbers[np.logical_not(mask)]*scalar
print(receptacle)
``` let's see if this works

arctic wedgeBOT Apr 26, 2024, 8:10 PM

#

@wooden sail :white_check_mark: Your 3.12 eval job has completed with return code 0.

001 | [[nan 10.]
002 |  [30. 20.]]

wooden sail Apr 26, 2024, 8:10 PM

#

looks good

#

idk if receptacle[:] = np.NAN or receptacle[mask] = np.NAN is better. probably doesn't make a big difference unless your array is huge

finite sierra Apr 26, 2024, 8:16 PM

#

ah thanks

buoyant kite Apr 26, 2024, 8:54 PM

#

who knows TTS?

spring field Apr 26, 2024, 9:30 PM

#

wooden sail maybe `numbers[np.logical_not(mask)] *= scalar`

I have a few ideas why, but why didn't you use ~? at least could have mentioned it

wooden sail Apr 26, 2024, 9:31 PM

#

spring field I have a few ideas why, but why didn't you use ~? at least could have mentioned ...

i always forget which operators are overloaded for elementwise operations on numpy arrays, that's about it

serene scaffold Apr 27, 2024, 1:24 AM

#

buoyant kite who knows TTS?

You have to ask an actual question. Don't ask to ask.

buoyant kite Apr 27, 2024, 1:54 AM

#

serene scaffold You have to ask an actual question. Don't ask to ask.

Do you know TTS?

craggy agate Apr 27, 2024, 3:01 AM

#

Hey there, I am beginning work on an ambitious project of an autonomous RC plane, here is what I want it to do, takeoff successfully, make an highspeed over head pass, and successfully touch down and break next to me. The plane will have a Raspberry Pi 4 on board as the onboard computer which will control the plane's movements, it would be made of IMC Carbon Fiber and would have multiple sensors like GPS, Lidar, Barometer, Gyroscopic sensor, AOA, Accelerometer, Camera, Speed Measuring Sensor, Accelerometer, etc. I would be coding the project in Python, it will also have 2, 400W brushless fans. Now my question is, **what DL model should I use? I am currently thinking of using a hybrid architecture of a CNN and LSTM. would that work? Should I implement reenforced learning? ** Also, how do I train the model using simulations? It would be impossible to find flight logs for all the data in .csv format and honestly I don't think that would work... I probably need to simulate a plane and realistic winds and condition with access to all the sensors that I would need. Could anyone maybe give me a sense of direction? I m familiar with DL and ML btw.

serene scaffold Apr 27, 2024, 4:04 AM

#

buoyant kite Do you know TTS?

Please ask your actual question. Asking if anyone knows about a topic just creates an extra step.

severe inlet Apr 27, 2024, 4:45 AM

#

ive trained my autoencoder, but i have problems extracting out the hidden layer encoded output cos i need it for LSTM. ive managed to dot the input and weights, but failed at adding the biases. could anyone pleaseplease help me with this. i have a rough idea but i dont know how to go about fixing this.

#

because i have an array of lists of my input, im thinking that i need to iterate thru the array and add the biases?

#

but my array of lists has shape (10865, 8). how can i batch it into (32,8) for successful addition of biases?

delicate sleet Apr 27, 2024, 9:04 AM

#

Hey there! I’ve seen your interest for neuro-symbolic, explainability, self organizing.. Maybe our project might interest you as well! Feel free to check it out, leave a star if you find it appealing, and share your feedback with me! 🙂 https://github.com/SynaLinks/HybridAGI

GitHub

GitHub - SynaLinks/HybridAGI: The Programmable Neuro-Symbolic AGI t...

The Programmable Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected - SynaLinks/HybridAGI

twilit horizon Apr 27, 2024, 9:07 AM

#

hey

hard nest Apr 27, 2024, 11:56 AM

#

I'm training a model with image classification, I have the image and the mask, should I use the mask to cover everything but the important area or use it to highlight that are, leaving the rest of the image normal?

agile owl Apr 27, 2024, 11:56 AM

#

how can I simplify the process of solving for the intersection so spark can do it quickly without a udf. I was thinking of transforming the curves into histograms of fixed widths to discretize the space but not sure what to do for the intersection

past meteor Apr 27, 2024, 12:02 PM

#

agile owl how can I simplify the process of solving for the intersection so spark can do i...

there's likely smarter ways but the simple one is sampling N points from both, subtracting them and use the bissection method

agile owl Apr 27, 2024, 12:03 PM

#

past meteor there's likely smarter ways but the simple one is sampling N points from both, s...

thx

verbal musk Apr 27, 2024, 12:05 PM

#

"data science", "machine learning", "scientific computing", "artificial intelligence"... which discipline encompasses them all?

past meteor Apr 27, 2024, 12:06 PM

#

verbal musk "data science", "machine learning", "scientific computing", "artificial intellig...

these terms are not standardized

#

there's no formal definitions so nothing "encompasses" all of them

#

maybe computer science, mathematics and statistics

verbal musk Apr 27, 2024, 12:07 PM

#

okay

agile owl Apr 27, 2024, 12:09 PM

#

Some interesting emergent properties of my economy simulation. Changing nothing about the distribution of individual wealth and incomes, and simply changing the number of people:

Loan supply and demand at 6 banks and 100 people, 100,000 people and 10,000,000 people, respectively.

flint plover Apr 27, 2024, 12:11 PM

#

Anyone has experience in Nlp ?

trim saddle Apr 27, 2024, 12:47 PM

#

flint plover Anyone has experience in Nlp ?

Please dont ask to ask.
Ask an actual question.

flint plover Apr 27, 2024, 12:59 PM

#

trim saddle Please dont ask to ask. Ask an actual question.

Actually i am working on a project. I have to identify causality and non causality for genes responsible for some diseases. I have abstracts of some research papers, using that i have to mark it. So i wanted to know how to do this

buoyant kite Apr 27, 2024, 5:01 PM

#

serene scaffold Please ask your actual question. Asking if anyone knows about a topic just creat...

don't yo know TTS?

serene scaffold Apr 27, 2024, 5:02 PM

#

buoyant kite don't yo know TTS?

I'm going to mute you if you continue to ask to ask. "asking to ask" is when you say things like "can I ask a question?" or "does anyone know about x?" instead of asking the question that you actually want help with.

toxic mortar Apr 27, 2024, 5:59 PM

#

Is this free & unlimited API calls model to use https://huggingface.co/facebook/bart-large-cnn

facebook/bart-large-cnn · Hugging Face

spring field Apr 27, 2024, 6:09 PM

#

toxic mortar Is this free & unlimited API calls model to use https://huggingface.co/facebook/...

You would run it locally, so yes, you can use it as much as you possibly can

lyric trail Apr 27, 2024, 6:10 PM

#

hiiii i am working on speech recognition model can somebody help me in this

#

i am getting this error
i tried all these things but not resolving this issue
-Check file extension
-Verify file accessibility
-Test with a different audio file
-Check Whisper installation

Update Whisper

toxic mortar Apr 27, 2024, 6:12 PM

#

spring field You would run it locally, so yes, you can use it as much as you possibly can

This is locally right?

from transformers import pipeline

summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

spring field Apr 27, 2024, 6:18 PM

#

toxic mortar This is locally right? ```py from transformers import pipeline summarizer = pi...

I assume so, I'm not familiar with that particular API, but I'd assume it downloads the checkpoint only once and only if it can't find it on your system
like, it's a pre-trained model, you're only downloading the weights basically and once that is done, you don't have to even be connected to the internet to run this
also there is no clear indication of using some identifying bit of API authentication that could possibly limit your usage (I mean, seems they might be doing a bit of API throttling, but that's probably not particularly relevant for you)

buoyant kite Apr 27, 2024, 6:20 PM

#

lyric trail hiiii i am working on speech recognition model can somebody help me in this

Hello, is it your testing project?

spring field Apr 27, 2024, 6:21 PM

#

lyric trail i am getting this error i tried all these things but not resolving this issue -...

where did you get that path to the file? and are you running that file on the same machine that you got the path from?

lyric trail Apr 27, 2024, 6:21 PM

#

buoyant kite Hello, is it your testing project?

yes i need to covert speech to text

#

can you help me with this

spring field Apr 27, 2024, 6:24 PM

#

spring field where did you get that path to the file? and are you running that file on the sa...

^

lyric trail Apr 27, 2024, 6:27 PM

#

spring field where did you get that path to the file? and are you running that file on the sa...

yes i have that file in my machine in download folder

spring field Apr 27, 2024, 6:27 PM

#

can you send the entire error traceback?

buoyant kite Apr 27, 2024, 6:28 PM

#

lyric trail yes i need to covert speech to text

Yeah , Recently I have developed TTS project

#

can you show me your project?

lyric trail Apr 27, 2024, 6:30 PM

#

https://paste.pythondiscord.com/ZH7A

spring field Apr 27, 2024, 6:31 PM

#

your username is USER?
and can you send your error traceback?

lyric trail Apr 27, 2024, 6:31 PM

#

actually i was unable to send the python file but i send this link is that okay...???

spring field Apr 27, 2024, 6:32 PM

#

yep

buoyant kite Apr 27, 2024, 6:34 PM

#

lyric trail actually i was unable to send the python file but i send this link is that okay...

here may be a missing line to set the device for the model before running inference in your code.

lyric trail Apr 27, 2024, 6:35 PM

#

buoyant kite can you show me your project?

are you able to read my file

buoyant kite Apr 27, 2024, 6:35 PM

#

yep

lyric trail Apr 27, 2024, 6:35 PM

#

buoyant kite here may be a missing line to set the device for the model before running infere...

what exactly are you talking i am not getting this

buoyant kite Apr 27, 2024, 6:35 PM

#

lyric trail are you able to read my file

audio = whisper.load_audio("C:\Users\USER\Downloads\sampleThree.wav") , I thinks this is correct.

lyric trail Apr 27, 2024, 6:35 PM

#

let me check

lyric trail Apr 27, 2024, 6:36 PM

#

buoyant kite audio = whisper.load_audio("C:\\Users\\USER\\Downloads\\sampleThree.wav") , I th...

what exactly are you trying to say

left tartan Apr 27, 2024, 6:37 PM

#

The general idea is interesting, although it's kinda the premise of AWS and cloud computing. I know a company who is trying to make a distributed market of idle manufacturing resources (cnc machines, etc)

lyric trail Apr 27, 2024, 6:38 PM

#

buoyant kite Yeah , Recently I have developed TTS project

can you show me your TTS project

buoyant kite Apr 27, 2024, 6:42 PM

#

lyric trail can you show me your TTS project

This is many sub files. so this is the main algorithm.

spring field Apr 27, 2024, 6:42 PM

#

buoyant kite audio = whisper.load_audio("C:\\Users\\USER\\Downloads\\sampleThree.wav") , I th...

that has the potential to create unintended escape sequences

buoyant kite Apr 27, 2024, 6:42 PM

#

spring field that has the potential to create unintended escape sequences

What do you mean?

spring field Apr 27, 2024, 6:43 PM

#

!e

"\Users"

arctic wedgeBOT Apr 27, 2024, 6:43 PM

#

@spring field :x: Your 3.12 eval job has completed with return code 1.

001 |   File "/home/main.py", line 1
002 |     "\Users"
003 |     ^^^^^^^^
004 | SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 0-1: truncated \UXXXXXXXX escape

spring field Apr 27, 2024, 6:44 PM

#

their path is not incorrectly formatted, it's just wrong

#

is it that path? we don't actually know because @lyric trail is still to provide the entire error traceback

#

!traceback

arctic wedgeBOT Apr 27, 2024, 6:44 PM

#

Traceback

Please provide the full traceback for your exception in order to help us identify your issue.
While the last line of the error message tells us what kind of error you got,
the full traceback will tell us which line, and other critical information to solve your problem.
Please avoid screenshots so we can copy and paste parts of the message.

A full traceback could look like:

Traceback (most recent call last):
  File "my_file.py", line 5, in <module>
    add_three("6")
  File "my_file.py", line 2, in add_three
    a = num + 3
        ~~~~^~~
TypeError: can only concatenate str (not "int") to str

If the traceback is long, use our pastebin.

buoyant kite Apr 27, 2024, 6:44 PM

#

spring field their path is not incorrectly formatted, it's just wrong

this is Mayuresh is path.

spring field Apr 27, 2024, 6:45 PM

#

I was referring to them

buoyant kite Apr 27, 2024, 6:45 PM

#

lyric trail can you show me your TTS project

your project and my project is similar

lyric trail Apr 27, 2024, 6:46 PM

#

but why mine is not working

#

sry i am new to coding what is trace back

lyric trail Apr 27, 2024, 6:47 PM

#

buoyant kite your project and my project is similar

which IDE did you use

buoyant kite Apr 27, 2024, 6:48 PM

#

lyric trail but why mine is not working

VS code

craggy agate Apr 27, 2024, 7:12 PM

#

Hey there, I am beginning work on an ambitious project of an autonomous RC plane, here is what I want it to do, takeoff successfully, make an highspeed over head pass, and successfully touch down and break next to me. The plane will have a Raspberry Pi 4 on board as the onboard computer which will control the plane's movements, it would be made of IMC Carbon Fiber and would have multiple sensors like GPS, Lidar, Barometer, Gyroscopic sensor, AOA, Accelerometer, Camera, Speed Measuring Sensor, Accelerometer, etc. I would be coding the project in Python, it will also have 2, 400W brushless fans. Now my question is, what DL model should I use? I am currently thinking of using a hybrid architecture of a CNN and LSTM. would that work? Should I implement reenforced learning? Also, how do I train the model using simulations? It would be impossible to find flight logs for all the data in .csv format and honestly I don't think that would work... I probably need to simulate a plane and realistic winds and condition with access to all the sensors that I would need. Could anyone maybe give me a sense of direction? I m familiar with DL and ML btw.

spring field Apr 27, 2024, 7:21 PM

#

I would maybe consider starting with an RC car

magic steppe Apr 27, 2024, 8:38 PM

#

anyone know if there's something premade that converts from a generic numpy matrix to something that i can pass to scipy.linalg.solve_banded?

tired lodge Apr 27, 2024, 10:45 PM

#

craggy agate Hey there, I am beginning work on an ambitious project of an autonomous RC plane...

about the csv part. couldnt you just get other file formats and turn those into csv files? or just use the file’s content itself. like an sql file for example

craggy agate Apr 28, 2024, 2:28 AM

#

tired lodge about the csv part. couldnt you just get other file formats and turn _those_ int...

csv was just an example, main problem would be getting that data, I was thinking of maybe using a simulation with RL?

craggy agate Apr 28, 2024, 2:29 AM

#

spring field I would maybe consider starting with an RC car

Exactly my thought but I have opted for using a drone instead of RC car for now.

#

Will try to implement obsicale avoidance using it's front camera

#

Also using YOLO object detection to make it track/follow me

tired lodge Apr 28, 2024, 3:11 AM

#

craggy agate csv was just an example, main problem would be getting that data, I was thinking...

could be a good idea. i dont do anything to do with DS or AI so im just giving what i think would be useful advice for your problem

craggy agate Apr 28, 2024, 3:11 AM

#

tired lodge could be a good idea. i dont do anything to do with DS or AI so im just giving...

I see, thanks!

wintry gyro Apr 28, 2024, 4:51 AM

#

How do I save ml model locally with pyspark.
I am getting this error.

tacit basin Apr 28, 2024, 6:13 AM

#

wintry gyro How do I save ml model locally with pyspark. I am getting this error.

Is that full error message? Can't see this from error message but possibly you need to create this directory first?

severe inlet Apr 28, 2024, 11:25 AM

#

im training a lstm model, shouldnt it be training on my gpu instead of cpu? how do i change this?

cinder jay Apr 28, 2024, 11:28 AM

#

Hey guys, which pytorch image (in docker) should i use if i just want to inference with CPU?

past meteor Apr 28, 2024, 11:28 AM

#

#rules , could you remove this? It's in violation to rule 6 🙂

past meteor Apr 28, 2024, 11:28 AM

#

severe inlet im training a lstm model, shouldnt it be training on my gpu instead of cpu? how ...

With Pytorch? Tensorflow?

severe inlet Apr 28, 2024, 11:29 AM

#

tensorflow

lapis sequoia Apr 28, 2024, 11:33 AM

#

hey guyss, I'm lost , how to generate text data using LLM api and prompting

past meteor Apr 28, 2024, 11:41 AM

#

severe inlet tensorflow

Can you try running thiss? print("Num GPUs Available: ", len(tf.config.list_physical_devices('GPU')))

severe inlet Apr 28, 2024, 11:55 AM

#

past meteor Can you try running thiss? `print("Num GPUs Available: ", len(tf.config.list_phy...

u mind i do this later? half way thru my epochs

severe inlet Apr 28, 2024, 12:17 PM

#

past meteor Can you try running thiss? `print("Num GPUs Available: ", len(tf.config.list_phy...

0 gpu available after running this

past meteor Apr 28, 2024, 12:18 PM

#

severe inlet 0 gpu available after running this

Are you on windows?

severe inlet Apr 28, 2024, 12:21 PM

#

yes

#

i dont have any conda or anaconda environments installed

#

if it matters

past meteor Apr 28, 2024, 12:23 PM

#

Tensorflow isn't available wiith GPU on windows anymore, you'll have to use WSL (windows subsytem for linux)

severe inlet Apr 28, 2024, 12:23 PM

#

ahhhh okay

past meteor Apr 28, 2024, 12:23 PM

#

that's the reason why your GPU is not being found

severe inlet Apr 28, 2024, 12:23 PM

#

i just thought it was a setting i didnt enable or something

#

if thats so then alls g for now

mystic harbor Apr 28, 2024, 3:29 PM

#

@crystal geyser I've deleted your message due to the following reasons:

We do not allow advertisements in this server
Scraping facebook is against their ToS and we do not allow discussions around such topics.

Please re-read our #rules

vagrant root Apr 28, 2024, 4:08 PM

#

severe inlet im training a lstm model, shouldnt it be training on my gpu instead of cpu? how ...

Device = "cuda" torch.cuda.is_available() else "gpu"

#

Then train and test tensors.to(device)

#

Then model.to(device)

lofty thorn Apr 28, 2024, 4:17 PM

#

2024-04-28 21:44:16.128937: I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.
WARNING:tensorflow:From C:\Users\ACER\AppData\Local\Programs\Python\Python311\Lib\site-packages\keras\src\losses.py:2976: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead.

Traceback (most recent call last):
  File "C:\Users\ACER\PycharmProjects\Face recognition\main.py", line 11, in <module>
    imgRGB = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
cv2.error: OpenCV(4.9.0) D:\a\opencv-python\opencv-python\opencv\modules\imgproc\src\color.cpp:196: error: (-215:Assertion failed) !_src.empty() in function 'cv::cvtColor'

INFO: Created TensorFlow Lite XNNPACK delegate for CPU.```

how to fix this

#

import cv2
import mediapipe as mp
import time


cap = cv2.VideoCapture(1)
mpHands = mp.solutions.hands
hands = mpHands.Hands()

while True:
    success, img = cap.read()
    imgRGB = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
    results = hands.process(imgRGB)

    cv2.imshow("WIZARD", img)
    cv2.waitKey(1)```

this is the code

timid kiln Apr 28, 2024, 5:01 PM

#

I need some help on “where to start”. I am working at a production facility where a piece of equipment is producing a byproduct in volumes greater than expected. (FYI this isn’t a chemical reaction.). No one can figure out why this is happening.

What I was hoping to do was load a bunch of operating data into pandas, process it so there’s no empty fields or obviously erroneous values, and then (and this is where the “where do I start” part comes in) through the magic of some python library(s) it will tell me “parameter X, Y, and Z have a noticeable effect on the byproduct”.

I guess what I’m thinking is I need to practice somehow with a data set where it’s obvious to me what the answer is, then test against whatever program I write. I think?

I would appreciate your advice on how/where to begin. Please tag me if you respond. Thank you!

edit: This is all time based data obtained from the control system historian, if that helps.

devout sail Apr 28, 2024, 10:04 PM

#

timid kiln I need some help on “where to start”. I am working at a production facility wher...

Without writing any code, just conceptually, how would you know what effect some record has on the byproduct?

timid kiln Apr 28, 2024, 10:09 PM

#

devout sail Without writing any code, just conceptually, how would you know what effect some...

If I imagined some pre-built program, each column of data would have a header/name. So I'd pick the header/name and select some option that would calculate what independent variables/parameters appear to have an effect on the selected dependent variable/data.

If I were doing this by programming, after the data munging, I imagine I'd have to write code to name the dependent variable in question. But, as I mentioned, I'm at a "where do I start" type of situation so all I know at this point is "how to load data into pandas from a csv". I apologize in advance for my newbiness. 🙂

devout sail Apr 28, 2024, 10:10 PM

#

timid kiln If I imagined some pre-built program, each column of data would have a header/na...

Just to make sure I'm on the same page, are you measuring the amount of byproduct somehow?

timid kiln Apr 28, 2024, 10:10 PM

#

I installed Orange was going to give that a try once I figured out how to bypass Excel and dump data from the historian straight to a CSV. My company locks everything down pretty hard as far as software, connections, permissions, etc.

timid kiln Apr 28, 2024, 10:11 PM

#

devout sail Just to make sure I'm on the same page, are you measuring the amount of byproduc...

Yes. This byproduct has a flow meter on the pipe where it exits. It's a vapor stream.

#

We expect some vapor, but we're getting a lot more than previous, all of a sudden back at the beginning of April, and it's a financial loss if this continues.

devout sail Apr 28, 2024, 10:12 PM

#

So you have a bunch of parameters about the process, and then a column saying how much byproduct was produced, and you want to be able to predict from the parameters how much byproduct you'll get?

timid kiln Apr 28, 2024, 10:12 PM

#

No, not at all.

#

I have a bunch of time-based data measured from pressure, temperature, and flow meters throughout the facility. I want to be able to calculate which of these data appear to have an affect on the byproduct. When the byproduct flow increases, which other data did something at the same time? Same as when it decreases, which temperatures, pressures and/or flows appeared to contribute to that reduction? Right now, we've plotted everything we think has an effect on this vapor stream but visually, we haven't found a correlation. So I'm trying to determine if there's a way, mathematically, to figure this out with the logged data from the historian.

#

The hope is that we can determine "oh, it was [insert parameter here]. We just need to [reduce/increase] that [parameter] to reduce the vapor volume".

timid kiln Apr 28, 2024, 10:19 PM

#

devout sail So you have a bunch of parameters about the process, and then a column saying ho...

Thank you for taking the time to help me work through the details of this opportunity. I appreciate whatever help and guidance you can provide!

buoyant flare Apr 29, 2024, 4:54 AM

#

I was looking through an object detection project and encountered, this error seems to be originating from the BatchNormalization class's call method. The error message I'm getting is:

Using a symbolic `tf.Tensor` as a Python `bool` is not allowed. You can attempt the following resolutions to the problem: If you are running in Graph mode, use Eager execution mode or decorate this function with @tf.function. If you are using AutoGraph, you can try decorating this function with @tf.function. If that does not work, then you may be using an unsupported feature or your source code may not be visible to AutoGraph. See https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/autograph/g3doc/reference/limitations.md#access-to-source-code for more information.

Here's the relevant part of the custom BatchNormalization layer:

import tensorflow as tf

class BatchNormalization(tf.keras.layers.BatchNormalization):
    """
    Make trainable=False freeze BN for real (the og version is sad)
    """

    def call(self, x, training=False):
        if training is None:
            training = False
        training = tf.logical_and(training, self.trainable)
        return super().call(x, training)

I've tried to modify the code to handle the None value for the training argument using tf.cond, like so:

training = tf.cond(tf.equal(training, None), lambda: tf.constant(False), lambda: training)

However, I'm still receiving the same error. Can anyone here help me understand why I'm encountering this error and how to resolve it?

Any help or guidance on resolving this issue would be greatly appreciated! Thanks in advance!

pliant heron Apr 29, 2024, 6:40 AM

#

hey! idk if this is the right place to ask this question, still i'll ask sorry😅
i am self learning data visualization and i want to install matplotlib
i am using python in vs studio (windows 10)
my question is which one .whl file should i download is it pypy or cpy
thankyou for helping me out

ivory quarry Apr 29, 2024, 6:57 AM

#

craggy agate Hey there, I am beginning work on an ambitious project of an autonomous RC plane...

u should probably use something like ros

#

that + gazebo simulations

vocal barn Apr 29, 2024, 8:32 AM

#

Aye how to use opencv for face recognition on pydroid
I have basic python skills tho
And is it free?

spring field Apr 29, 2024, 8:37 AM

#

pliant heron hey! idk if this is the right place to ask this question, still i'll ask sorry😅...

normally you wouldn't install wheels manually like that
use pip:

pip install matplotlib

or

py -m pip install matplotlib

vocal cove Apr 29, 2024, 9:10 AM

#

Greetings there,

Hope all are well. I am looking for some assistance as to how I can enable jax for the code pasted in the link below for running some of the loops in parallel for wall clock efficiency.
https://paste.pythondiscord.com/LMMQ
The reason for this is that for certain instances, it takes significant time to run, so being able to parallelize would help a great amount, and even more if I can enable GPU usage (I have an RTX 3060, so it would be put to some good use here):

num_paths = [100*i for i in range(1, 15)]
values = []

for path in tqdm(num_paths):
    qpi = QPI(initial=np.array([0, 0]), final=np.array([1, 1]), bisection_level=22, number_paths=path, n_filtrations=0)
    probability_amplitude, paths = qpi.calculate_fpi()
    values.append(probability_amplitude)

plt.plot(values)

#

So, one thing that I want to enable parallelization for is the path_list definition in calculate_fpi() method. You can see it's a non-sequential for loop, which is perfect for parallelization using sth like JAX's vmap.

#

So, I would like to enable two things :

Enabling JAX to parallelize the loops.
Enabling GPU for running the code if possible.

#

I immensely appreciate the assistance in advance!

unique patio Apr 29, 2024, 9:38 AM

#

Hi everyone,

I am building an AI/ML/Data Engineering project which is going to help a “user” pick the best choice out of a N car models.

For example user provides us with a 100 Volkswagen models and their specification as PDF’s files (unstandardised format).

When PDF’s finish uploading to a server they can provide a specification they need the car to meet (for example 4x4 and above 200HP) and they describe it just like to an LLM (chat-format more or less or just specific phrases)

What would be the best approach for making such a thing?

OpenAI API is ofc off the table because of it’s broad imagination and lack of context appliance.

I think NLP might do the job, but I don’t think it’s the best choice out here.

RAG based on a vector DB might be decent choice, but what model/technique could do the trick here?

Thanks in advance everyone 🫶🏻

craggy agate Apr 29, 2024, 11:37 AM

#

ivory quarry u should probably use something like ros

Thanks!

craggy agate Apr 29, 2024, 11:47 AM

#

unique patio Hi everyone, I am building an AI/ML/Data Engineering project which is going to ...

BERT is the best for NLP tasks due to it being bidirectional, you could go with GPT as well. RNNs and LSTMs could work but might not be the best with dealing with text. I would say, go with BERT or maybe a hybrid architecture of BERT and ANN.

hasty grail Apr 29, 2024, 11:58 AM

#

For example user provides us with a 100 Volkswagen models and their specification as PDF’s files (unstandardised format).
Do you plan to persist the data, or does it vary between each conversation? I think RAG is the way to go regardless

#

e.g. with LlamaIndex: https://docs.llamaindex.ai/en/stable/use_cases/q_and_a/

#

you can look around and see what best fits your situation

unique patio Apr 29, 2024, 12:08 PM

#

craggy agate Hey there, I am beginning work on an ambitious project of an autonomous RC plane...

First of all I would be cautious with flying above anyones head because as I can Imagine a lot of things can go wrong during this process. As of the training data, maybe buy/build some plane with those sensors built in and collect the data while manually flying? That’s just an idea which came up to my mind. Good luck with the project because it sound cool tho!

unique patio Apr 29, 2024, 12:11 PM

#

hasty grail > For example user provides us with a 100 Volkswagen models and their specificat...

Data will be stored only during the process of user actively using the app (e.g removed after 10 minutes of in-activity)

#

I will take a look into sugguested solutions - thanks a lot for the advice guys.

craggy agate Apr 29, 2024, 12:37 PM

#

unique patio First of all I would be cautious with flying above anyones head because as I can...

Thanks for the ideas, I do have a ground that stays empty for the most part, I will do all my testing there, so I think there is low risk of it falling on someone or smth like that. I do like the manually flying idea, I would have to figure out how I can save that data in preferably csv format.

neat bluff Apr 29, 2024, 12:42 PM

#

craggy agate Thanks for the ideas, I do have a ground that stays empty for the most part, I w...

Oh, now it makes sense. I've thought that "high-speed overpass" meant like literally above someones head in close proximity, ~2 meters above the ground 😆 . Now I understand that that's not You had in mind. Saving that data to CSV would be extremely easy using Raspberry, but I'm not sure how attaching that to a plane will affect the aerodynamics.

#

Or basically anything else similar, but smaller than Raspberry. Size and weight is crucial in this subject - afaik. Also response times are crucial in such case so it has to be considered. I would love to be a part of this project - dm me if You don't mind that and we can talk some more about this.

craggy agate Apr 29, 2024, 12:48 PM

#

Lol, RBP won't affect aerodynamics cause it will be inside the hull.

neat bluff Apr 29, 2024, 12:49 PM

#

Well, technically it shouldn't, but mounting it stable inside of it might be a bit tricky. Also mind the weight distribution.

daring pumice Apr 29, 2024, 12:50 PM

#

Hey everyone, just a short question regarding VS Code, I am trying to build an AI for the game assetto corsa, that’s not the question but, I have made a virtual environment in vs code and when I installed the module for the game, it installed but when I tried to run a simple code to test the running, I get an error saying the said module doesn’t exist. Any help is appreciated. Please keep in mind I have other virtual environments for my other projects too. Thank you in advanced!

craggy agate Apr 29, 2024, 12:50 PM

#

daring pumice Hey everyone, just a short question regarding VS Code, I am trying to build an A...

Did you activate Venv?

neat bluff Apr 29, 2024, 12:50 PM

#

You need to install modules separetly for every virtual env as it is excluded from system env

craggy agate Apr 29, 2024, 12:51 PM

#

neat bluff Well, technically it shouldn't, but mounting it stable inside of it might be a b...

RBP is pretty light it's just the power bank powering it I am concerned about.

craggy agate Apr 29, 2024, 12:51 PM

#

daring pumice Hey everyone, just a short question regarding VS Code, I am trying to build an A...

CD into your Venv and install it there

daring pumice Apr 29, 2024, 12:51 PM

#

craggy agate Did you activate Venv?

Yup ran the command and everything. It didn’t show it in terminal but a couple seconds later I got a pop up saying the environment was activated even thought it doesnt show

#

I used pip list to see if it was even in the list and sure it was right there

daring pumice Apr 29, 2024, 12:53 PM

#

craggy agate CD into your Venv and install it there

I should install it in the venv folder or anything more specific?

neat bluff Apr 29, 2024, 12:54 PM

#

Btw, if You are a begginer and using a virtual env might complicate stuff. It doesn't do that much of a difference and will be easier for You without it.

craggy agate Apr 29, 2024, 12:55 PM

#

daring pumice I should install it in the venv folder or anything more specific?

Yep in the folder your Venv is activated in.

#

Not the Venv files, that's different

daring pumice Apr 29, 2024, 12:55 PM

#

neat bluff Btw, if You are a begginer and using a virtual env might complicate stuff. It d...

I am a beginner, the reason I am resorting to venv is because my python on my laptop got screwed so hard it doesn’t really work anymore and now I am scared to touch that monster

daring pumice Apr 29, 2024, 12:56 PM

#

craggy agate Yep in the folder your Venv is activated in.

Got it

neat bluff Apr 29, 2024, 12:56 PM

#

daring pumice I am a beginner, the reason I am resorting to venv is because my python on my la...

What exactly happened to it? What's the issue? Maybe we will be able to help. Also - reinstalling it might be an option to consider

daring pumice Apr 29, 2024, 12:57 PM

#

neat bluff What exactly happened to it? What's the issue? Maybe we will be able to help. Al...

Tried reinstalling it several times and never fixed the issue the issue is that whenever I am trying to run a code on putting I just get an error saying thing doesn’t exist even though it does

#

So don’t really know what to do anymore

neat bluff Apr 29, 2024, 12:58 PM

#

craggy agate RBP is pretty light it's just the power bank powering it I am concerned about.

Oh, it's a Pico u are going to use? Then it's in fact small and light. Some mini powerbank should be able to power it for 3 minutes of flight easly. Have You thought about connecting directly to planes battery?

neat bluff Apr 29, 2024, 12:59 PM

#

daring pumice Tried reinstalling it several times and never fixed the issue the issue is that ...

Pro tip: You never fully uninstall things unless You use software like Revo Uninstaller

And on the other hand, could You replicate the issue and paste the error output here?

craggy agate Apr 29, 2024, 12:59 PM

#

I have but that might drain the battery too fast. Especially due to the load on the pi

#

It's a model 4 b+

daring pumice Apr 29, 2024, 1:00 PM

#

daring pumice Tried reinstalling it several times and never fixed the issue the issue is that ...

The thing is wherever I tried to uninstall it. It never got fully uninstalled I think. Because everytime I tried to run the installer I just a dialogue box asking if I wanted to modify it or repair it add things

daring pumice Apr 29, 2024, 1:00 PM

#

neat bluff Pro tip: You never fully uninstall things unless You use software like Revo Unin...

Of the python or vs code?

daring pumice Apr 29, 2024, 1:00 PM

#

neat bluff Pro tip: You never fully uninstall things unless You use software like Revo Unin...

Ah got it

neat bluff Apr 29, 2024, 1:01 PM

#

daring pumice Of the python or vs code?

Python issue. Your Python and venv issue might be related to each other.

daring pumice Apr 29, 2024, 1:02 PM

#

Gotcha. Hey if you don’t mind can I friend you mate? I could really use some help with these issues. Of course if you are comfortable

#

I will send the error once I get back. I am currently outside

neat bluff Apr 29, 2024, 1:02 PM

#

craggy agate It's a model 4 b+

So it's definietly not a Pico. 4b+ has PoE - that might be worth to consider.

craggy agate Apr 29, 2024, 1:03 PM

#

Yep

neat bluff Apr 29, 2024, 1:03 PM

#

daring pumice Gotcha. Hey if you don’t mind can I friend you mate? I could really use some hel...

Sure thing. I am no ML/AI expert but I do code in Python for a long time now so maybe I will be able to help You out somehow.

daring pumice Apr 29, 2024, 1:04 PM

#

neat bluff Sure thing. I am no ML/AI expert but I do code in Python for a long time now so ...

Sent and thank you so much mate! Appreciate it a lot.

neat bluff Apr 29, 2024, 1:04 PM

#

daring pumice Sent and thank you so much mate! Appreciate it a lot.

No biggie mate. Gotta give back to the community.

daring pumice Apr 29, 2024, 1:05 PM

#

Haha true

daring pumice Apr 29, 2024, 4:24 PM

#

hey guys!, i have a question, i have python installed but pip is not installed so whenever i try to install any library, i just get hit with an error, can anyone please help me with this, thank you in advanced! i have repaired python multiple times but still no changes

serene scaffold Apr 29, 2024, 4:27 PM

#

daring pumice hey guys!, i have a question, i have python installed but pip is not installed s...

try python -m pip --version

#

show the output as text--not as a screenshot

#

btw, a lot of data science libraries don't support 3.12 yet. you usually want to stay one or two versions behind.

daring pumice Apr 29, 2024, 4:33 PM

#

C:\Users\imohi>python -m pip --version
pip 24.0 from C:\Users\imohi\AppData\Roaming\Python\Python312\site-packages\pip (python 3.12)

daring pumice Apr 29, 2024, 4:34 PM

#

serene scaffold btw, a lot of data science libraries don't support 3.12 yet. you usually want to...

this is the output i got after running your prompt, what do you suggest i do next?

tidal bough Apr 29, 2024, 4:35 PM

#

on windows you can't easily install python without pip. Your output means you do have it, it's just not in PATH and so you can't access it as pip.
You could just do nothing and use pip as python -m pip, that'd work just fine.

#

(If you want to be able to call pip as just pip, you need to add the Scripts folder of your python installation to PATH. There's an installer option for that I believe - "add to environmental variables" or something. Try rerunning the installer, choosing Modify and selecting that option.)

daring pumice Apr 29, 2024, 4:37 PM

#

sure mate will try that but if i just python -m pip, i should be able to use pip and install packages and libraries right? for running codes in vs code

serene scaffold Apr 29, 2024, 4:39 PM

#

daring pumice sure mate will try that but if i just python -m pip, i should be able to use pip...

you can just do python -m pip install numpy (or whatever you're trying to install) and that will work

#

how much calculus, linear algebra, and stats do you know?

daring pumice Apr 29, 2024, 4:40 PM

#

serene scaffold you can just do `python -m pip install numpy` (or whatever you're trying to inst...

alright thank you so much!

#

hey @serene scaffold sorry for the ping and i dont know if its right for me to ask this but can i send you a friend request mate? so that i can ask a question if i am facing any problem, only if you are comfortable of course!

serene scaffold Apr 29, 2024, 4:43 PM

#

daring pumice hey <@253696366952316929> sorry for the ping and i dont know if its right for me...

whenever you have questions, it's better to ask them in the appropriate place on this server, so that whoever happens to be available at that time can read it and start answering.

daring pumice Apr 29, 2024, 4:44 PM

#

sure then!

#

thank you for your time!

serene scaffold Apr 29, 2024, 4:49 PM

#

there are some resources in the pins

past meteor Apr 29, 2024, 4:50 PM

#

I'd say get your feet wet with something that doesn't really require you to train your own stuff

#

Do that for as long as you can, eventually you'll hit a roadblock and then you can dig into the math and stats

craggy agate Apr 29, 2024, 5:26 PM

#

Try learning simple linear regression, multiple linear regression, polynomial linear regression, support vector regression

#

Then maybe get your feet wet in deep learning

runic parcel Apr 29, 2024, 5:34 PM

#

i want to make a cnn model in which, the neural netwrok can scan the image or video and extract the phone number and name from it... how sld i do it?

runic mountain Apr 29, 2024, 7:57 PM

#

hi does anyone here know about LLM?

royal harbor Apr 29, 2024, 7:58 PM

#

Looking to grab audio from a video and change the voice to something better.
Not sure what model to use, been looking on huggingface.

I'm going to extract the audio with ffmpeg -> send it off to change the audio to a different voice then make the video again with ffmpeg.

Do you think it would be easier to extract the caption from the video then use text to speech in stead?

trim saddle Apr 29, 2024, 8:03 PM

#

runic mountain hi does anyone here know about LLM?

Please do not ask to ask. Ask your actual question and if people know, they will answer.

fossil walrus Apr 30, 2024, 6:16 AM

#

Guys i am thinking of learning automation in python , so any one tell where and how to start ? As i am a biggener in python

hasty grail Apr 30, 2024, 6:17 AM

#

fossil walrus Guys i am thinking of learning automation in python , so any one tell where and ...

For general python knowledge, you can take a look at

#

!resources

arctic wedgeBOT Apr 30, 2024, 6:17 AM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

odd meteor Apr 30, 2024, 8:14 AM

#

runic mountain hi does anyone here know about LLM?

Don't ask question to ask question. If you had mentioned specifically what you needed more help with in LLM you probably would have gotten a much quicker response.

Now, someone would have to ask "what do you need help with in LLM" before they'll get a full picture of your question.

odd meteor Apr 30, 2024, 8:20 AM

#

runic parcel i want to make a cnn model in which, the neural netwrok can scan the image or vi...

There are some nice tutorial videos online you could use to practice. Then once you're comfortable with the videos, you can easily adjust it to your use-case.

You can check this out

https://www.kaggle.com/code/sarthakvajpayee/license-plate-recognition-using-cnn

License plate recognition using CNN

Explore and run machine learning code with Kaggle Notebooks | Using data from ai_indian_license_plate_recognition_data

hallow sphinx Apr 30, 2024, 9:03 AM

#

Do people use ipython for anything other than jupyter notebook? Why isn't jupyter notebook written in Cpython (w/ tkinter)?
I don't understand ipython, of how is it different than CPython. All I know is that it has some more features like variable? gives docs of the variable and variable?? pulls up the source code.

rough wadi Apr 30, 2024, 10:02 AM

#

Hello, Based on this https://cv.gluon.ai/build/examples_pose/demo_alpha_pose.html I want to build a fall detection application, is there anyone who can help me?

past meteor Apr 30, 2024, 11:08 AM

#

@wooden sail Given that the output of a recurrent neural net are 2 tensors X' and Z the default case is always taking Z and using that for the basis for downstream tasks.

I made a mistake and actually passed the last time step of X' which is effectively the feature vector at t+1. Interestingly enough, the results are very encouraging of something that's arguably a bug. Have you come across anyone doing this?

wooden sail Apr 30, 2024, 11:10 AM

#

i'd have to see how exactly X' is computed tbh

past meteor Apr 30, 2024, 11:10 AM

#

Standard recurrent neural network

wooden sail Apr 30, 2024, 11:11 AM

#

give me 5 min to learn how they work 😛

#

or send me a paper that uses the same syntax cuz you didn't give them names and wikipedia uses different letters

past meteor Apr 30, 2024, 11:12 AM

#

Honestly, this working is most likely a special case of my task

#

https://pytorch.org/docs/stable/generated/torch.nn.RNN.html

#

I use Z for h

#

And X' for output

wooden sail Apr 30, 2024, 11:15 AM

#

looking at a single layer, it looks like X' depends on the value of Z from the previous layer. the value of Z from the current layer depends on X'

past meteor Apr 30, 2024, 11:15 AM

#

Yes

wooden sail Apr 30, 2024, 11:15 AM

#

in some sense, swapping Z for X' is the same as removing half or one layer from your network, and leaving everything else as is

#

i'd take that to mean you already had too many layers anyway

#

either in the RNN or in however you compute the downstream task

past meteor Apr 30, 2024, 11:16 AM

#

Aha, that makes sense

wooden sail Apr 30, 2024, 11:16 AM

#

removed one level of composition in the last layer, yeah

past meteor Apr 30, 2024, 11:16 AM

#

X' logically contains the temporal context because it depends on Z

#

Yeah okay that makes sense

wooden sail Apr 30, 2024, 11:17 AM

#

and with that cleared, yes, i run into this all the time

#

i work a lot with an algorithm that uses nesterov acceleration

#

in the final iteration, you can always ask whether you want to keep the "safer" gradient step, or keep the nesterov step

#

when you're close to convergence, it doesn't make a difference

past meteor Apr 30, 2024, 11:18 AM

#

Okay yes indeed that's very similar

#

I think the analogies used in ML aren't great

#

Because it's quite obvious if you look at the math

#

Thanks!

calm pagoda Apr 30, 2024, 11:26 AM

#

I want to learn AI/ml/data science.. and as much as I know.. there's no free full content available..
So, can anyone having experience in this field suggest me some courses to buy for this.. So that I can completely learn from there..

If you have something else to tell me... pls...

tidal bough Apr 30, 2024, 12:05 PM

#

there's no free full content available
not sure what you mean by that. What's wrong with https://www.coursera.org/specializations/machine-learning-introduction ?

fair warren Apr 30, 2024, 12:09 PM

#

Do we use this group for scientific programming in general, not necessarily related to DS/AI?

hasty grail Apr 30, 2024, 12:09 PM

#

fair warren Do we use this group for scientific programming in general, not necessarily rela...

For discussion of scientific python, matplotlib, statistics, machine learning and related topics
Yes, as per the channel description (although it can be a bit hard to find on mobile)

rough wadi Apr 30, 2024, 12:40 PM

#

rough wadi Hello, Based on this https://cv.gluon.ai/build/examples_pose/demo_alpha_pose.htm...

someone who can help?

latent girder Apr 30, 2024, 12:42 PM

#

Hi, i recently just bought a course for the purpose of shifting to data science (currently a data analyst). But the course covers the whole python, which feels very slow like a total of 60 hours not including the time ill be spending on learning, practicing, building and etc.

Should i learn the whole python or just the stuffs needed for data science?

For context, i have no programming languages exp besides SQL if you would count it. Tho I have learnt doing for, while, if else, and the basic stuffs

serene scaffold Apr 30, 2024, 12:43 PM

#

latent girder Hi, i recently just bought a course for the purpose of shifting to data science ...

what do you mean by "the whole python"?

latent girder Apr 30, 2024, 12:43 PM

#

serene scaffold what do you mean by "the whole python"?

#

this is the scope of study

tidal bough Apr 30, 2024, 12:44 PM

#

yeah, that tracks, often courses that teach "data science" are meant for people who are studying for something business-related and have no prior programming experience.

latent girder Apr 30, 2024, 12:44 PM

#

i was actually interested in learning the whole of it but it feels very slow since i have current work and cannot commit to more than 2 hrs of studying.

serene scaffold Apr 30, 2024, 12:45 PM

#

the GUI, game, and web development parts look superfluous. but you want to aim to be actually good at python, not just eeking out notebooks that no one else can run.

small ore Apr 30, 2024, 12:45 PM

#

Corey Schaffer of Youtube teaches Python more than enough fro data science. ( A Students opinion)

latent girder Apr 30, 2024, 12:46 PM

#

serene scaffold the GUI, game, and web development parts look superfluous. but you want to aim t...

at first, i was actually thinking of that. But as it gets harder i lost hope.

#

xd sry so now im just interested in learning data science

#

What i mean is, can i actually do those data science stuffs in python with just data science stuffs knowledge?

#

or should i also be able to build a website and stuffs in order to get by

#

idk if im asking the right question, feel free to correct me

serene scaffold Apr 30, 2024, 12:48 PM

#

latent girder or should i also be able to build a website and stuffs in order to get by

I don't know that you need to try building a website right now. but you ultimately should be competent enough as a programmer that you could figure out how to build a basic website if you had to, without much trouble.

small ore Apr 30, 2024, 12:48 PM

#

A little python ( Especially data structures/list comprehensions etc) will do if you already know conditionals, loops and functions. Just have to familiarize with python syntax

latent girder Apr 30, 2024, 12:48 PM

#

serene scaffold I don't know that you need to try building a website right now. but you ultimate...

No no its nto what i mean

odd meteor Apr 30, 2024, 12:48 PM

#

latent girder at first, i was actually thinking of that. But as it gets harder i lost hope.

Please don't give up. You got this 💪💪💪

latent girder Apr 30, 2024, 12:49 PM

#

small ore A little python ( Especially data structures/list comprehensions etc) will do if...

well if im not mistaken, i have learned those on sql and i actually finished the beginner stage in this python course

small ore Apr 30, 2024, 12:49 PM

#

You will also need classes when you are doing custom transformers and stuff

latent girder Apr 30, 2024, 12:49 PM

#

odd meteor Please don't give up. You got this 💪💪💪

thanks, but for now ill just skip to data science and get back to it ig

latent girder Apr 30, 2024, 12:50 PM

#

small ore You will also need classes when you are doing custom transformers and stuff

so?

small ore Apr 30, 2024, 12:50 PM

#

latent girder thanks, but for now ill just skip to data science and get back to it ig

Not a bad idea.

serene scaffold Apr 30, 2024, 12:50 PM

#

latent girder so?

classes are a python language feature, if you didn't already know

latent girder Apr 30, 2024, 12:51 PM

#

well do i need to learn this whole course or just those that covers data science?

serene scaffold Apr 30, 2024, 12:51 PM

#

you don't need to learn GUI, game, or web development. but you do need to be capable with Python in general.

odd meteor Apr 30, 2024, 12:51 PM

#

latent girder thanks, but for now ill just skip to data science and get back to it ig

It appears you purchased a course on Udemy. You can skip the game dev part though.

latent girder Apr 30, 2024, 12:52 PM

#

odd meteor It appears you purchased a course on Udemy. You can skip the game dev part thoug...

would just these 2 suffice?

#

and this

#

ill just youtube for missing stuffs

small ore Apr 30, 2024, 12:52 PM

#

Basic python and then switch to data science from a simpler module like Skikit-learn ( Not Tensorflow directly) and then you can learn python as needed

serene scaffold Apr 30, 2024, 12:52 PM

#

it's hard to know what you'd actually learn from doing whatever they're referring to, without seeing what the actual material/assignments are

small ore Apr 30, 2024, 12:53 PM

#

Imo ignore those Ad like claims and decide as you learn

latent girder Apr 30, 2024, 12:55 PM

#

serene scaffold it's hard to know what you'd actually learn from doing whatever they're referrin...

https://www.udemy.com/course/100-days-of-code/?couponCode=ST2MT43024

odd meteor Apr 30, 2024, 12:56 PM

#

latent girder would just these 2 suffice?

I'd say, skip web development and game for now. You can come to it later.

Alternatively, you can use the topics here https://kaggle.com/learn to filter what to focus on first on the Udemy

Learn Python, Data Viz, Pandas & More | Tutorials | Kaggle

Practical data skills you can apply immediately: that's what you'll learn in these no-cost courses. They're the fastest (and most fun) way to become a data scientist or improve your current skills.

small ore Apr 30, 2024, 12:57 PM

#

Oh. And Pandas, A couple of plotting tools( Matplotlib, seaborn, Plotly etc), some basic numpy, should help.

latent girder Apr 30, 2024, 12:58 PM

#

odd meteor I'd say, skip web development and game for now. You can come to it later. Alter...

you mean i refer to this?

latent girder Apr 30, 2024, 12:58 PM

#

small ore Oh. And Pandas, A couple of plotting tools( Matplotlib, seaborn, Plotly etc), so...

yea the course atleast covers those

small ore Apr 30, 2024, 12:59 PM

#

Meanwhile I have a clustering question

#

I tried kmeans on my data and and the Silhouette score is dropping and dropping. No peaks to be found. I tried DBSCAN and it labels most of the data as -1. What am I doing wrong?

latent girder Apr 30, 2024, 1:02 PM

#

Is this good tho? Ill just buy this if its good

past meteor Apr 30, 2024, 1:29 PM

#

latent girder Is this good tho? Ill just buy this if its good

in my opinion it's good if you want a fast refresher of the content or if you just want to learn how to use the libraries. I think I did this one when I was studying because my labs didn't use Python. I remember being under the impression that it doesn't teach it well enough for people that have 0 background.

latent girder Apr 30, 2024, 1:30 PM

#

past meteor in my opinion it's good if you want a fast refresher of the content or if you ju...

alright, good to hear so its good?

past meteor Apr 30, 2024, 1:30 PM

#

latent girder alright, good to hear so its good?

On the contrary, if this stuff is all new to you I'd say it's not good

latent girder Apr 30, 2024, 1:31 PM

#

past meteor On the contrary, if this stuff is all new to you I'd say it's not good

yea all new

#

still looking for good stuffs

past meteor Apr 30, 2024, 1:31 PM

#

Are you open to picking up a book?

latent girder Apr 30, 2024, 1:31 PM

#

no xd

#

i like interactive learning

odd meteor Apr 30, 2024, 1:31 PM

#

latent girder Is this good tho? Ill just buy this if its good

Jose Portilla and André Nagogie course on Udemy are both great.
I can recommend their course any day.

latent girder Apr 30, 2024, 1:31 PM

#

odd meteor Jose Portilla and André Nagogie course on Udemy are both great. I can recommend ...

yea actually my sql is from jose too and it was very good

buoyant kite Apr 30, 2024, 3:12 PM

#

latent girder yea actually my sql is from jose too and it was very good

Hello, Do you know LLM?

neat bluff Apr 30, 2024, 3:40 PM

#

Hi everyone :) As usual my lack of experience with LLM's proved itself once again (it doesn't make building a huge project any easier tbh) .
I've built a data parser using LangChain and Claude 3 Opus. One of the issues I've encountered is that output limit of a 4096 tokens is very easy to achieve. As far as I've researched the limit is similar in top LLM services and the only exclusion is GPT4 which is pricier at every aspect and also way worse at actually recovering data from text and not making it up.

My question is: Which open-source models have a token output limit similar to GPT4 (8k+) or what could be the solution to my problem here? Thanks in advance.

hallow sphinx Apr 30, 2024, 3:57 PM

#

But I still don't understand what is better in ipython

latent girder Apr 30, 2024, 4:07 PM

#

buoyant kite Hello, Do you know LLM?

No

buoyant kite Apr 30, 2024, 4:20 PM

#

latent girder No

What do you know in ai? I need some help.

sturdy kiln Apr 30, 2024, 4:26 PM

#

can somebody tell me what the fuck am i looking at

#

or a resource that will point me to the right direction on what any of this means

agile cobalt Apr 30, 2024, 4:27 PM

#

sturdy kiln or a resource that will point me to the right direction on what any of this mean...

Have you tried the documentation of the library you're using?

sturdy kiln Apr 30, 2024, 4:28 PM

#

its not really a library

#

im using ARIMA as a model but the summary itself isnt

#

its more or less EDA knowledge rather than python modules

#

for context, im trying to compare baseline ARIMA(1,1,1) to ARIMA(20,1,1) which are two different models, yeah the numbers change but i have no idea how to interpret it

fallen osprey Apr 30, 2024, 5:02 PM

#

Hey i am going to take aids in college can anyone reccomends specification for buying laptop

sturdy kiln Apr 30, 2024, 5:03 PM

#

honestly anything since colab exists, unless you want to do some localized CNN and you need a beefy GPU (probably just go a for a PC at that point) probably one with CUDA support and a hefty CPU along with it

#

(dont take my word for it, might need a second or third opinion on it)

fallen osprey Apr 30, 2024, 5:05 PM

#

sturdy kiln honestly anything since colab exists, unless you want to do some localized CNN a...

I can't get pc coz I I am going to live in hostel so it will be trouble when I am coming back home

sturdy kiln Apr 30, 2024, 5:05 PM

#

also alot of RAM, if your going to do some local stuff it will eat off your RAM really quick

fallen osprey Apr 30, 2024, 5:05 PM

#

sturdy kiln also alot of RAM, if your going to do some local stuff it will eat off your RAM ...

16 will do?

#

32?

sturdy kiln Apr 30, 2024, 5:06 PM

#

i really dont have a baseline but probably 32 is a good number nowadays

fallen osprey Apr 30, 2024, 5:06 PM

#

Ok

#

I will get 16 and upgrade it

#

To 32

sturdy kiln Apr 30, 2024, 5:07 PM

#

sounds good, but if your going to start doing any AI/ML/DL/DSA stuff, then most likely the college offering it will make you use cloud computing anyways

fallen osprey Apr 30, 2024, 5:08 PM

#

sturdy kiln sounds good, but if your going to start doing any AI/ML/DL/DSA stuff, then most ...

You mean buy laptop from college?

sturdy kiln Apr 30, 2024, 5:08 PM

#

cloud services like Google Colab, MS Azure and stuff like that, which all run on the cloud so local specs dont matter that much anyways

#

colab isnt technically an ML centric service but more of a py notebook one that can run ML stuff in average

#

but hey its free, and what i use extensively lol

#

but because its free, its got a lot of limitations

fallen osprey Apr 30, 2024, 5:10 PM

#

sturdy kiln but hey its free, and what i use extensively lol

Ok so college provide cloud

fallen osprey Apr 30, 2024, 5:10 PM

#

sturdy kiln but because its free, its got a lot of limitations

Like?

sturdy kiln Apr 30, 2024, 5:11 PM

#

fallen osprey Ok so college provide cloud

Azure probably, Google is free for all, academic or personal

sturdy kiln Apr 30, 2024, 5:12 PM

#

fallen osprey Like?

the standard free service for Colab has limited compute units, when you run out, you cant do any computations anymore for the day, you're also only given a limited amount of resources, iirc around 12-15gb of RAM, and 80GB of disk storage

fallen osprey Apr 30, 2024, 5:13 PM

#

sturdy kiln the standard free service for Colab has limited compute units, when you run out,...

Oh

sturdy kiln Apr 30, 2024, 5:14 PM

#

but honestly, you wont run out if your not doing that much heavy stuff

#

the only time i ran out of units is when i was doing a 3 day-long CNN session

#

which is extremely computationally heavy

fallen osprey Apr 30, 2024, 5:19 PM

#

Oh

fallen osprey Apr 30, 2024, 5:19 PM

#

sturdy kiln which is extremely computationally heavy

Does it requires gpu

sturdy kiln Apr 30, 2024, 5:20 PM

#

you do know what a cloud compute service is, right?

fallen osprey Apr 30, 2024, 5:21 PM

#

sturdy kiln you do know what a cloud compute service is, right?

Like using someone else pc in my device right?

#

Am I right?

fallen osprey Apr 30, 2024, 5:22 PM

#

sturdy kiln you do know what a cloud compute service is, right?

I am wrong?

small ore Apr 30, 2024, 5:26 PM

#

Cloud are large server stacks like the once that are used to provide internet services/websites etc but here a user is assigned a specific space(Storage) and assigned processor time/memory on one/several of its many CPUs and GPUs. You either submit your job there or you can also connect remotely and operate virtually on that space

#

@fallen osprey

fallen osprey Apr 30, 2024, 5:27 PM

#

small ore Cloud are large server stacks like the once that are used to provide internet se...

Oh I see tyvm

tidal bough Apr 30, 2024, 5:27 PM

#

fallen osprey Like using someone else pc in my device right?

getting to run your code on some (typically virtual) machine, usually either with limits on how much compute you can use, or having to pay for it.

fallen osprey Apr 30, 2024, 5:28 PM

#

tidal bough getting to run your code on some (typically virtual) machine, usually either wit...

Tyvm

#

Btw

fallen osprey Apr 30, 2024, 5:28 PM

#

fallen osprey Hey i am going to take aids in college can anyone reccomends specification for b...

Can reccomend this he said ask 2-3 people and decide

tidal bough Apr 30, 2024, 5:29 PM

#

i am still wondering what "take aids" means

sturdy kiln Apr 30, 2024, 5:29 PM

#

its AI and Data Science

#

a course

#

honestly weird name to call it lol

tidal bough Apr 30, 2024, 5:29 PM

#

can't wait for their next course, HIV (human informational values)

sturdy kiln Apr 30, 2024, 5:30 PM

#

honestly just call it DSA (data science and anayltics) because it all falls under that anyways lol

#

not to be confused with DSA (data structures and algorithms)

tidal bough Apr 30, 2024, 5:30 PM

#

depending on the course I don't think you need a powerful computer? even if it includes some ML you can, yeah, probably get by with collab

fallen osprey Apr 30, 2024, 5:31 PM

#

tidal bough i am still wondering what "take aids" means

Ai and ds

tidal bough Apr 30, 2024, 5:31 PM

#

(also, a laptop which you can do decent ML on is going to be so expensive. you'd need a GPU...)

fallen osprey Apr 30, 2024, 5:32 PM

#

tidal bough (also, a laptop which you can do decent ML on is going to be so expensive. you'd...

If u dont mind can u tell how much it cost just curious?

sturdy kiln Apr 30, 2024, 5:34 PM

#

tidal bough can't wait for their next course, HIV (human informational values)

cant wait to join the Science and Technology Department (STD)

tidal bough Apr 30, 2024, 5:35 PM

#

fallen osprey If u dont mind can u tell how much it cost just curious?

i have no idea where you live, so google it for your country yourself. from a cursory look it seems laptops with a GPU start at like 700$, and if you want 6-8GB VRAM (which is what it'd take to fit a sizable model there) then it's more like 1000$

#

collab would likely be a better idea

small ore Apr 30, 2024, 5:36 PM

#

VRAM?

fallen osprey Apr 30, 2024, 5:36 PM

#

tidal bough collab would likely be a better idea

Does it exist in India?

tidal bough Apr 30, 2024, 5:37 PM

#

small ore VRAM?

video. RAM of the GPU.

small ore Apr 30, 2024, 5:37 PM

#

Something like cache or external cards?

tidal bough Apr 30, 2024, 5:37 PM

#

fallen osprey Does it exist in India?

i'd be surprised if no, but also, i mean, if the answer was "no", the easy solution would be buying a VPN and accessing it anyway

sturdy kiln Apr 30, 2024, 5:38 PM

#

pretty sure if you have google, you have colab

fallen osprey Apr 30, 2024, 5:38 PM

#

sturdy kiln pretty sure if you have google, you have colab

How can i check it?

small ore Apr 30, 2024, 5:38 PM

#

fallen osprey How can i check it?

Google it 😛

sturdy kiln Apr 30, 2024, 5:38 PM

#

if youd like to know how computationally heavy ML/DL stuff are, people have built server farms and supercomputers just for it lol

tidal bough Apr 30, 2024, 5:39 PM

#

small ore Something like cache or external cards?

not sure what you mean. all GPUs have some integrated memory - which is pretty much exactly like normal RAM, but built into the GPU. For ML it matters a lot since you'd want to fit your model there for optimal performance.

fallen osprey Apr 30, 2024, 5:39 PM

#

Is it ai digital labs?

tidal bough Apr 30, 2024, 5:39 PM

#

i have no idea what that is

sturdy kiln Apr 30, 2024, 5:40 PM

#

go to https://colab.research.google.com

Google Colab

small ore Apr 30, 2024, 5:40 PM

#

CPUS have cache memory on the chip. So trying to know if it is something like that or a 'card' outside the chip

fallen osprey Apr 30, 2024, 5:41 PM

#

Yeah it's available

sturdy kiln Apr 30, 2024, 5:41 PM

#

VRAM is a dedicated memory space for the GPU itself

fallen osprey Apr 30, 2024, 5:41 PM

#

Ty

small ore Apr 30, 2024, 5:41 PM

#

sturdy kiln VRAM is a dedicated memory space for the GPU itself

I get that

sturdy kiln Apr 30, 2024, 5:41 PM

#

yeah you dont need a very expensive laptop to run colab on

fallen osprey Apr 30, 2024, 5:41 PM

#

tidal bough i have no idea what that is

It says Microsoft launches ai digital labs in india

fallen osprey Apr 30, 2024, 5:41 PM

#

sturdy kiln yeah you dont need a very expensive laptop to run colab on

Ok

tidal bough Apr 30, 2024, 5:42 PM

#

small ore CPUS have cache memory on the chip. So trying to know if it is something like th...

Ah, I see what you mean. Nah, that's unrelated - GPUs also have cache, but it's not usually mentioned, whereas VRAM is important. E.g. https://www.techpowerup.com/gpu-specs/geforce-rtx-3050-8-gb.c3858

sturdy kiln Apr 30, 2024, 5:42 PM

#

small ore I get that

its embedded in the GPU itself if your asking, its nearby the chip itself but its not cache

#

cache and RAM are two different memory types

fallen osprey Apr 30, 2024, 5:43 PM

#

I have one last question are there jobs for ai and ds?

sturdy kiln Apr 30, 2024, 5:43 PM

#

fallen osprey I have one last question are there jobs for ai and ds?

very

#

although if youd want to excel in that field, youd have to excel in your knowldege of it

#

lots of any programming/data science jobs dont rely on degrees

#

but more of what you can do instead

fallen osprey Apr 30, 2024, 5:44 PM

#

I need to learn lot of maths?

sturdy kiln Apr 30, 2024, 5:44 PM

#

hence the computer science slander lol

#

lots of people took CS thinking its a free job because of the degree, not knowing youd have to get a goob internship, good recommendations or a good background to even get something barely good back

tidal bough Apr 30, 2024, 5:45 PM

#

data science is basically applied statistics. hence, yes, "lot of maths".

fallen osprey Apr 30, 2024, 5:45 PM

#

tidal bough data science is basically applied statistics. hence, yes, "lot of maths".

Ok

small ore Apr 30, 2024, 5:46 PM

#

Okay. I am trying to find what the VRAM on this laptop is like

fallen osprey Apr 30, 2024, 5:46 PM

#

sturdy kiln lots of people took CS thinking its a free job because of the degree, not knowin...

I will start trying for interns in second semester

tidal bough Apr 30, 2024, 5:47 PM

#

small ore Okay. I am trying to find what the VRAM on this laptop is like

it's typically mentioned in the GPU's name, if the laptop has a discrete GPU at all

sturdy kiln Apr 30, 2024, 5:47 PM

#

small ore Okay. I am trying to find what the VRAM on this laptop is like

most laptops without a DGPU have APUs, which have an integrated GPU inside their CPU, their VRAM isnt sometimes physical memory but often times "virtual" memory

#

or i might be misremembering

tidal bough Apr 30, 2024, 5:48 PM

#

I think "APU" is an AMD-only term

sturdy kiln Apr 30, 2024, 5:48 PM

#

ah well its a CPU with an IGPU inside anyways

neat bluff Apr 30, 2024, 5:48 PM

#

sturdy kiln most laptops without a DGPU have APUs, which have an integrated GPU inside their...

A part of RAM becomes the GPU'S VRAM if there is no dedicated VRAM

tidal bough Apr 30, 2024, 5:48 PM

#

yeah, it's how they call their new CPUs which have an unusually good iGPU

sturdy kiln Apr 30, 2024, 5:49 PM

#

neat bluff A part of RAM becomes the GPU'S VRAM if there is no dedicated VRAM

^ hence "virtual" because a part of the RAM acts as a virtual VRAM

sturdy kiln Apr 30, 2024, 5:49 PM

#

tidal bough yeah, it's how they call their new CPUs which have an unusually good iGPU

unusually good? 🤔

neat bluff Apr 30, 2024, 5:49 PM

#

Back in the the integrated GPU's sucked ass

#

That's what unusually good probably refeers to

sturdy kiln Apr 30, 2024, 5:50 PM

#

yeah i get that IGPU sucked ass, but i didnt know they stopped sucking ass now lol

tidal bough Apr 30, 2024, 5:51 PM

#

looks like I'm not mistaken and AMD integrated graphics are significantly more powerful than intel's: https://www.tomshardware.com/features/amd-vs-intel-integrated-graphics
EDIT: this might be outdated, found parity in a 2023 article

small ore Apr 30, 2024, 5:51 PM

#

All I can get

tidal bough Apr 30, 2024, 5:51 PM

#

(but still like 2x worse than a real GPU)

neat bluff Apr 30, 2024, 5:52 PM

#

New Ryzen's iGPU can kick ass of a 5 years old GTX desktop card

tidal bough Apr 30, 2024, 5:53 PM

#

hmm, not sure how to fact-check that

neat bluff Apr 30, 2024, 5:53 PM

#

small ore All I can get

That is definietly too old to do anything "AI/ML" related

sturdy kiln Apr 30, 2024, 5:53 PM

#

or anything modern related lol

small ore Apr 30, 2024, 5:54 PM

#

neat bluff That is definietly too old to do anything "AI/ML" related

Of course. I can do several beginner things which aint a DNN

#

Learner things I mean

neat bluff Apr 30, 2024, 5:55 PM

#

sturdy kiln or anything modern related lol

Yup

sturdy kiln Apr 30, 2024, 5:55 PM

#

this chat is probably getting off-topic now but whatevs

#

i have figured out what the top rows does, specifically the Information Criterion metrics, but still have no idea what the parameters or the residuals do

#

like what the fuck is Ljung-Box or heteroskedasticity

neat bluff Apr 30, 2024, 5:56 PM

#

Heterodeskadiscitiy? WTF

#

XDDDDD

small ore Apr 30, 2024, 5:56 PM

#

No one has an answer to my original question on clustering though?

neat bluff Apr 30, 2024, 5:57 PM

#

I haven't seen it. Can You tag it?

neat bluff Apr 30, 2024, 5:58 PM

#

sturdy kiln VRAM is a dedicated memory space for the GPU itself

Also much faster btw

small ore Apr 30, 2024, 5:58 PM

#

small ore Meanwhile I have a clustering question

This one

sturdy kiln Apr 30, 2024, 5:58 PM

#

yeah most modern stuff now run on GDDR6

#

which is extremely fuckin fast

neat bluff Apr 30, 2024, 5:59 PM

#

small ore I tried kmeans on my data and and the Silhouette score is dropping and dropping....

Unfortunately I am no expert. Did You code it Yourself? I suppose there might be a bug inside of logic / reward system

neat bluff Apr 30, 2024, 6:00 PM

#

sturdy kiln yeah most modern stuff now run on GDDR6

I think GDDR6X is a thing now, altough I am not 100% sure

small ore Apr 30, 2024, 6:00 PM

#

neat bluff Unfortunately I am no expert. Did You code it Yourself? I suppose there might be...

No. Just using skikit-learn

neat bluff Apr 30, 2024, 6:00 PM

#

It is actually, 4000 series RTX use them

sturdy kiln Apr 30, 2024, 6:00 PM

#

commerically available?

#

oh hmm didnt know that

neat bluff Apr 30, 2024, 6:01 PM

#

Gigabyte GeForce RTX 4080 SUPER WINDFORCE OC V2 16GB GDDR6X (GV-N408SWF3V2-16GD)

#

For the price of 1200$

sturdy kiln Apr 30, 2024, 6:01 PM

#

sheesh thats one expensive GPU