#data-science-and-ml

1 messages · Page 20 of 1

storm kelp
#

The actual concepts of datascience I'm very familiar with from using R, but I don't know how difficult it will be to translate that into Python

agile cobalt
#

I mean, pandas for one is literally a dataframe library, that data structure sounds familiar?

storm kelp
#

sure - I just want to make sure I'm using/learning the language in the most efficient manner

#

for example I spent 2 years doing datascience in R without learning about the tidyverse, which I guess is the R equivalent of pandas

lapis sequoia
#

can someone recommend me soft or basic introduction to machine learning videos/tutorials?

agile cobalt
#

you can check sklearn's MOOC or Andrew Ng's Machine Learning Specialisation course on Coursera, or look at our other !resources on the website

lapis sequoia
#

!resources

arctic wedgeBOT
#
Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

verbal venture
#

what's the relevance of the intercept method in linear regression

coarse bronze
#

Hello there I'm looking for some datasets for some reinforcement learning models I'm working on, does anyone know any sites that have some datasets I could get? paid resources are fine I'm just looking for some places to download datasets for different models

agile cobalt
#

idk if there are many specifically for reinforcement learning, but do check out Kaggle if you haven't yet

tacit basin
misty flint
#

my friends

#

this just came out recently

#

super duper valuable if your job responsibilities include deploying models

tacit basin
wise pelican
#

More of a general question, but I'm trying to find the usage differences between numpy's vectorize and frompyfunc
From what I'm seeing, ufncs are numpy functions that operate element by element for an entire array
vectorize is a ufunc-based class that applies some given ufunc across the provided iterables
and frompyfunc is a function to create a vectorized ufunc
Is that all correct?

soft viper
#

hi there. I am building a search engine based on tf-idf and want to use cosine similarity for the query. Based on my understanding, say that a query "apple and orange", the cosine wants me to have a vector of tf-idf for each word in the query. What I don't get is what vector of tf-idf do I need to get for the documents I want to compare? Just the word "apple", "and", "orange" or the whole words that the document have.

violet gull
#

Why do I need auto grad for a NN? For back prop I’m just doing a couple gradients but they are easily done by hand and it’s not like they are ever changing so I can just hard code 4 and not ever need it

#

and what is tensorflow used for in a real neural net

tacit basin
# violet gull Why do I need auto grad for a NN? For back prop I’m just doing a couple gradient...

you can, but what if you modify you network and make it very complex. a lot to calculate by hand. check this great video from andrej karpathy https://www.youtube.com/watch?v=VMj-3S1tku0

This is the most step-by-step spelled-out explanation of backpropagation and training of neural networks. It only assumes basic knowledge of Python and a vague recollection of calculus from high school.

Links:

▶ Play video
violet gull
#

it is still using the same actiations

tacit basin
#

Sure don't use it. It's not mandatory 😜

violet gull
tacit basin
#

Autograd keeps track of operations performed and will auto calculate gradients needed for backpropagation to adjust the weights.

violet gull
#

so will a simple class declarion

tacit basin
#

If you have time watch Andrejs video.

inland notch
#

Can anyone recommend a library for AI?

soft viper
silver sleet
#

For data preprocessing, im trying to convert the date-format
dd/mm/yy to dd-mm-yy in one particular column in excel using py.
I tried using the .replace([‘/‘],’-‘)
But this isn’t working. What else can I do.

unique flame
#

I got by with that book. Van der Plas is was one of the core dev's of Scikit-klearn.

neon imp
#

Learn Python the hard way was good. Someone will need to reconfirm.

inland notch
#

also can someone know some simple projects to start learning about ai?

storm kelp
#

I downloaded data science from scratch but it seems a lot of people say it's only good for learning why libraries work the way they do, and doesn't actually tell you about how to get the most out of them

lean topaz
#

Hey guys.

I'm making a model to predict the weight of oxen using a neural network. I made the input as follows to simulate the image and the real weight of the animal.

I'm having trouble creating the model, I've already tried some close to what I wanted but I didn't have results.

X_train = np.random.randint(256, size=(10, 720, 1280))
Y_train = np.random.randint(100, size=(10))

X_test = np.random.randint(256, size=(10, 720, 1280))
Y_test = np.random.randint(100, size=(10))


X_train = X_train.reshape(X_train.shape[0], 1, 720, 1280)
X_test = X_test.reshape(X_test.shape[0], 1, 720, 1280)
X_train = X_train.astype('float32')
X_test = X_test.astype('float32')
X_train /= 255


model = keras.models.Sequential(
    [
        # Dense(32, input_shape=(1, 720, 1280)),
        # Activation('relu'), 
        # Flatten(), 
        # Activation('relu'),
        keras.layers.Dense(units=1, input_shape=(1,720,1280))
    ]
)

model.compile(optimizer='sgd', loss='mean_squared_error', metrics=['accuracy'])

model.fit(X_train, Y_train, batch_size=1, epochs=100, verbose=1, validation_data=(X_test, Y_test))
#

The model is wrong, because I'm just showing the structure of the input for you to understand..

noble zealot
#

Could anyone explain neural networks to me? (ive tried googling it before lol, understood nothing.)

shell crest
#

Did you watch 3b1b's NN video?

noble zealot
#

preferably using common english vocabulary, I'd like to know a scenario where it would be useful, how its better than another option, and the step by step process it works through

noble zealot
#

actually that thumbnail looks very familiar

#

i might have.

#

ill watch it

#

actually thanks @shell crest this one image just explained the whole thing to me XD

#

lmao

#

here's what i got from that one image, say it's recognizing the number 2, it could start by seeing it's characteristics. like it has a flat line, it has a flatish curve at the top, and it has a diagonal line, it could go into those 3 nodes which would be algorithm, and then maybe slowly get closer to a number, like look at how 5 and 7 and 2 have a flat line

#

and how 7 2 4 and 8 have diagonal lines

#

and then it can slowly say "no it's not that number

#

cause thinking how we understand the 3, it's essentually, a line with 2 curves, so the program would have to comprehend that data to classify it as a 3

shell crest
#

Visualising a NN is actually hard

#

By that I mean the 'edge detection' things

noble zealot
#

wdym by that?

shell crest
#

I think you could transform of input weights into whether a neuron is doing edge detection, but that IMO is not at all obvious

noble zealot
#

input weights

#

so you're saying that it actually goes through all the nodes and has like a score on how much it resembles each characteristic?

#

maybe?

shell crest
#

there's no 'characteristic'

#

it's all numbers

noble zealot
#

okay i know im using the wrong word

#

just pretend im using the right word lol

#

i think i know where a neural network could be useful though

#

lets say you're trying to identify a person, and they are like a black male average height

#

would a neural network process that like this?
white- 0.01 short - 0.5
black- 0.99 tall-0.5

#

or something like that

#

to eventually figure out what describes a certain thing, in this case a person, and in the videos case numbers, the best?

shell crest
#

You're asking for NN to be interpretable

#

Guess what - it's not.

noble zealot
#

Alright, I still think it is in some weird abstract way, but thanks for telling me about the video

#

thanks, ill check this out.

#

oh nice it seems my guess was exactly correct on how it works

shell crest
#

Features and Activations don't explain interpretability

#

The problem with traditional high-performing NN is that you actually don't know what feature the NN is/has found

noble zealot
#

also @shell crest explanations, i dont think they should always be this specific and precise because if you go into so much advanced detail then it gets confusing. I think it's better to focus on giving an understandable comprehendable base to start from, and then just allow the brain to build off this basic concept and explore into more advanced things over time.

shell crest
#

It's not about detail or not

noble zealot
#

It's much easier to start from a basic concept

shell crest
#

I think the cvpr paper is quite readable, and 3b1b's video quite accessible

#

NNs are made of activations and features, but it is not known what corresponds to what

#

That's why you can't say

white- 0.01  short - 0.5 
black- 0.99  tall-0.5

because this demands an interpretable NN

noble zealot
#

Let me give you an example, even though he is doing stuff like explaining each and every detail of how the nueral networks, just a simple singular image was enough to get me started and have an idea of how they work, and now i can go onto thinking about how i could use it in a different context, explanations are just best done in 1-2 simple sentences that anyone can understand, and then letting the brain build off from there.

And I'm sure of this because it's actually how I learned how to swim, roller skate, and... something else, i forgot lol, except those learning expiriences were just 2 words, even though it had no detail and i could have completely messed it up and did it the wrong way, just building off that basic knowledge, it went much farther than any swimming class could take me.

shell crest
#

I think what I have said so far is short

noble zealot
#

Well I think the main issue where i didn't understand is the uncommon words.

shell crest
#

You can ask about those words. I'm not an expert of NN math and I don't intend to be

noble zealot
#

like you said demands an interpretable NN, what demands an interpretable NN?

shell crest
#

Right

noble zealot
shell crest
#

Interpretable means we know what the neural network is looking for. So if we see certain things like noses, earrings, etc. - that's how humans identify people

#

But it is not clear if artificial neural networks look for the exact same things. This has issues even if the artificial neural network 'does better' than humans. - if you don't understand your tool, can you really use it?

noble zealot
#

you make it look for what you desire

shell crest
noble zealot
#

in order to reach your desired outcome

shell crest
noble zealot
#

uh.

#

If that was your entry for most confusing sentence

shell crest
#

most of what I just said doesn't matter, what I'm saying is that you don't actually decide what goes into each neuron

noble zealot
#

gives 1st place medal

shell crest
#

you don't design each neuron

#

(typically)

noble zealot
shell crest
#

as I said, it's unsolved lol

noble zealot
#

what's unsolved?

shell crest
#

Why does a neural network work? welll, I don't think any honest answer can say 'we know'

#

How do we make a neural network such that we know what it is looking for?

noble zealot
shell crest
shell crest
hasty mountain
shell crest
noble zealot
shell crest
noble zealot
#

so doesn't it just decide yes or no, for a lot of possiblities

shell crest
noble zealot
#

or was it f11

shell crest
#

indeed, and even after you do so, what's the point?

#

can you confidently do that for every input to the network?

noble zealot
noble zealot
#

so you can have a desire for what they will do

shell crest
#

well the idea is that eventually we'll find some neural networks that are explainable, and have features we relate to

noble zealot
#

and an expectation

#

but the end goal is for the computer to reach your expectations and process it in the desired pattern

shell crest
shell crest
noble zealot
shell crest
#

in image classification in 3b1b, the end goal is to correctly classify all digits

shell crest
noble zealot
#

so you're telling me it just magically appears

shell crest
shell crest
# noble zealot https://tenor.com/view/huh-rabbit-cute-gif-15676652

welcome to AI, as I said, you're looking for https://en.wikipedia.org/wiki/Explainable_artificial_intelligence

Explainable AI (XAI), or Interpretable AI, or Explainable Machine Learning (XML), is artificial intelligence (AI) in which humans can understand the decisions or predictions made by the AI. It contrasts with the "black box" concept in machine learning where even its designers cannot explain why an AI arrived at a specific decision. By refining t...

noble zealot
#

isn't each neuron like an algorithm that outputs a result of possibilities

shell crest
#

Most AI/ML we do today aren't really explanable. Something works, and someone does the thing that works. Most (all?) people can't explain why

shell crest
shell crest
noble zealot
#

No, that's torture

#

XD

hardy kernel
#

hey can anyone help me replicating a GIMP feature in openCV?

I want to replicate the functionality of the "curves" tool provided by GIMP to make this modification on an image. If opencv is not the correct tool what should I use instead. Any suggestions?

shell crest
#

lol then there's no point

noble zealot
#

Me and articles don't get along well unless it gets straight to the point

#

like Mozillas js documentation, best documentation ive read in my life

shell crest
#

the article does get straight to the point

#

in 2 sentences

#

ignore the rest

noble zealot
#

and w3 schools... their docs are alright, could use improvement, but surely good.

shell crest
#

Actually just read the first-2 sentences in each paragraph

noble zealot
#

okay here's what i got from the first 2-3

#

this is how humans think...

#

"HUH!? HOW IS THAT A 539 IT'S OBVIOUSLY A 2 YOU DUMMY"

shell crest
noble zealot
#

and eventually it works after being refined enough right?

unique flame
shell crest
# noble zealot this is how humans think...

no, humans think if a specific part of the image curves in a certain way, it's a 3. (see the 2 curves in 3?)
if there is a certain orientation an image could be 6. Flip it and you get 9.

hardy kernel
#

ye no worries, I asked like 3 times already ill ask again later

shell crest
#

machines are just led to best performing outcomes, even though we don't know the path they are taking to those outcomes

hardy kernel
noble zealot
#

kind of like plotting averages on a dot graph

shell crest
#

uh, I will not comment on that

noble zealot
#

an while a computer cant easily understand pixels in 8 possible different spots relative to another, if you convert it into a graph function it can understand it

#

and that's why on something i clicked on earlier

#

i saw math.

hardy kernel
#

awesome ill look into it tysm

civic fog
#

Hello, today at work I overhead some coworkers talking about a browser extension that uses a ML language model (bert?) to highlight/summarise websites. It was so early in the morning that I didn't register what they said until several hours later xDD and tbh I dont even remember who they were. Does anyone know about it? I find it quite interesting and might want to give it a try

spice mountain
simple jacinth
#

hey im trying to divide a series by another series using pandas

#

TypeError: unsupported operand type(s) for /: 'str' and 'str'

#

a.div(bc)

#

example of what 'a' is: a = t_data.iloc[0]

#

how do I fix this error

untold bloom
#

a = pd.to_numeric(a) should work maybe...

#

in case it errors as non-convertable, you can do

num_a = pd.to_numeric(a, errors="coerce")
print(a[num_a.isna()])
```to see which values were nonnumeric, and take action dependingly...
#

with the extra errors argument, nonconvertables will become NaN at the end; then we check from the original series which values caused NaN.

empty furnace
#

Hii, can somebody explain me when should I use tf and when should I use sklearn?

agile cobalt
#

depends on what you are trying to do
sklearn supports more traditional, relatively simple models
tensorflow/pytorch supports neural networks

empty furnace
#

Noted, thanks!

little dagger
#

I have a dataset thats very imbalanced. How would I go about fixing this?

agile cobalt
#

just multiclass classification?
you can test some different metrics (e.g. balanced accuracy instead of accuracy)

little dagger
#

yeah 7 classes

#

i wasnt sure if you could use oversampling and undersampling here

ruby depot
lapis sequoia
#

im doing very basic ML decision tree: is 60% accuracy rate any "good"?

#

what percent rate is the threshold for sufficient or viable?

serene scaffold
serene scaffold
velvet birch
#

I've been learning Linear Regression and came across an interesting topic of interaction term

#

Anyone has ideas on how to identify the columns with interactions??

glass estuary
#

Hello could someone who has some experience with tesseract dm me, I need some help, thanks.

gloomy anvil
#

ON ANOTHER NOTE:
I can turn nonstationary timeseries into stationary data by detrending or differencing. But can I turn stationary data into non-stationary data? I know this is a stupid question, but cointegration of timeseries can only occur between nonstationary data. So if I have a stationary timeseries is there a (VALID!) way to turn it into nonstationary data and perform a cointegration test like johansen coint?

velvet birch
#

Guys am just...frustrated with Data Science cause I feel like am just making zero progress

#

Am thinking of taking a break from it for a while and learning something else

little dagger
#

what are you frustrated with?

velvet birch
#

I just don't know

#

Some might say that it's because I don't have the adequate mathematical knowledge but the things I've studied so far didn't seem to have anything too complex yet

#

For now I feel like I'll just continue on with my degree and learn all the maths bit that can help in DS through it and mean while try something new in computer science

little dagger
#

i wouldnt give up tho

#

frustration is normally a sign of progress

velvet birch
#

Yh I shouldn't too ig

#

I'll follow ISLR as one last attempt

#

Would just read every topic in it, do every exercise and try to avoid complex shit for a while

little dagger
#

ngl i hate reading stuff

velvet birch
#

Just stick to the basic stuff ISLR provides

little dagger
#

i just practise by doing them

#

if that makes sense

velvet birch
#

Maybe it'll give me an insight

little dagger
#

for me, following a book gives me pain

velvet birch
#

Like I still genuinely think about model building as "isn't this just doing the same old shit again and again?"

little dagger
#

rather watch videos and do projects that involve the topics

velvet birch
velvet birch
#

Videos are easier to follow

little dagger
#

yeah, i just cant read books as good as others - wish i could

velvet birch
#

But the problems with videos is that...it's just the same basic shit again and again

#

I kinda want to grow out of this

little dagger
#

have you looked into mlops?

velvet birch
#

Uptil now no video felt to teach me something new, ISLR however did teach a lot of new stuff about Linear Regression itself

velvet birch
#

What is it?

#

I have only heard about it but never researched about it cause of just "lack of knowledge"

little dagger
#

its about once you built a model, how are you going to make it scalable, how are you going to deploy it, set up CI/CD ... etc

#

its another side of skills needed for machine learning

velvet birch
little dagger
#

so you could learn that stuff and take a break on the stuff youre doing

little dagger
#

for example

velvet birch
#

I guess that's where the AI bit lies

little dagger
#

once you built a model

#

how are you going to deploy it for use and how are you gonna monitor the model

#

im currently preparing for the google cloud certification for ML, you might be interested in this - teaches you a complete picture for ML

velvet birch
#

How is Andrew Ng's coursera course?

#

Heard a lot about it

little dagger
#

never done it

#

but people say its really good

#

and he is the founder of coursera i believe

#

so it has to be good right

unique flame
velvet birch
little dagger
#

how much is the course?

velvet birch
unique flame
velvet birch
#

For my country it's about $130 which is a lot

little dagger
#

ah

#

you have a link to the course?

velvet birch
little dagger
#

the first one?

velvet birch
#

Yh the first one

little dagger
#

only 42 pounds a month for me

velvet birch
#

Yh it's around the same for me too

little dagger
#

i think its worth if youre completely beginner

#

or beginner

velvet birch
#

I've heard you can get financial aid most of the times

pastel sleet
#

can anyone suggest me any good Ml or Ai project?

serene scaffold
#

this isn't really specific enough for us to help you. what is the data? which part are you trying to predict?

#

so far, we've established that you're trying to do classification. unless you tell us what the data is (what are the columns, and what do they represent), we can't really help you

#

there's no one-size-fits-all solution for classification. it depends on what the data is.

#

which is the one you want to predict?

wooden sail
#

that kinda sounds like something what would take way longer to solve with ML than by hand

#

unless you have several hundreds of files

#

ok, now we're talking.

#

and they all have the same kind of data, with the wrong headers?

#

??

serene scaffold
#

@paper rover I know you're new to this space, but we're finding it difficult to get information from you. it would help if you say exactly what your data is. What is every single column, what kind of data is in it, and what is missing?

paper rover
#

i need to google... thanks everyone

verbal venture
#

hey guys, is it possible to find out which month has the highest death rate for clinic 1 or 2 from these data sets? first dataset is monthly deaths, 2nd is yearly deaths

#

the 2nd data set mentions the clinic, the first does not. So is it possible to infer on which month the death rate was the highest in each clinic?

serene scaffold
ruby depot
gloomy anvil
#

Hello friends! I need help with this Error:
LinAlgError: 27-th leading minor of the array is not positive definite
I am trying to fit a VAR model and I am unable to understand what this means or what I have to change?

#

I am really at a loss here on how to debug this. I can post full traceback, code and data if that helps and if someone here is willing to help me figure this one out?

wooden sail
verbal venture
#

also, dumb question but does anyone know the difference between these two? subset_one.deaths.mean() #27 - incorrect subset_one.deaths.sum() / subset_one.births.sum() * 100 #10.52

serene scaffold
# verbal venture can you elaborate on this

in the second dataframe, you have (year, clinic) pairs. if there's two clinics, then each year can appear at most twice. but if a year appears only once, then that means that all the data you have about that year is for only one of the two clinics.

verbal venture
serene scaffold
#

because if data is present for both clinics for every year, then you're kind of SOL.

verbal venture
#

nah present for both

serene scaffold
#

so, for every year in the second dataframe, there's always a value for both clinics?

verbal venture
#

yeah

serene scaffold
#

then there's really no way to work backwards. you have no idea for which month the death rate is highest for each clinic, and there's no way to figure it out.

#

wait, what is pct_deaths?

verbal venture
#

the monthly avg of the deaths

serene scaffold
#

you mean, the percentage of births or deaths for that month that were deaths?

#

that doesn't really help either, unfortunately.

#

So is it possible to infer on which month the death rate was the highest in each clinic?
did someone tell you that it is?

verbal venture
#

What is the highest monthly death rate in clinic 1 compared to clinic 2?

#

Which clinic has a higher proportion of deaths?

serene scaffold
verbal venture
#

questions on this program I was doing

#

yup

serene scaffold
#

oh

#

that's different than what I thought you were asking

#

I can help in like ten minutes

#

@verbal venture so df2 tells you how many deaths happened in each clinic each year. so take the sum of both clinics' deaths for each year. those numbers are going to equal the sum of the number of deaths for each year in df1

#

do you follow?

#

hmm, come to think of it, I think you still have to assume that the distribution of deaths between the two clinics is the same within a given year

#

does the question say that you're allowed to make that assumption?

verbal venture
serene scaffold
#

can you think of what to do next?

viscid crag
#

Can someone help me with pyqt? there #help-lemon PLS!!!!

worldly dawn
#

It doesn't even relate to this channel

viscid crag
#

@worldly dawnwhere can I ask for help?

viscid crag
viscid crag
worldly dawn
# viscid crag I really need an answer)

I understand and appreciate your situation. However this is a volunteer driven community. So no one is due anything. Sometimes people may not be around or may not know the answer or may be busy or not interested in answering a specific question.
This is not a reason to go around in random channels and spamming people. If anything it will motivate them to not help you

worldly dawn
#

patience is key

verbal venture
#

Does anyone know the difference between these averages? subset_one.deaths.mean() #27 - incorrect b = subset_one.deaths.sum() / subset_one.births.sum() * 100 #10.52

#

the question was # What was the average percentage of monthly deaths after handwashing was made obligatory?

stuck socket
#

how do i get the wights fro the weights or group_weights parameters of Pool()?

stuck socket
#

wowwwwww

serene scaffold
stuck socket
stuck socket
serene scaffold
#

people like to talk about GPT-* like it's the be-all-end-all of nlp
and it's not.

stuck socket
#

u.u

iron basalt
#

(Although idk if it's just me, but search engines seem to have gotten worse when it comes to finding actually useful information)

serene scaffold
sage gust
#

Hey guys

#

anyone knows how to predict different depended variables on the basis of one independent variable

trail niche
#

Hi guys, is it possible to use K-nearest neighbors algorithm for real-time object detection? I'm searching the internet but can't find anything about it.

I'm thinking of training a model using KNN for image classification and will just manually create an algorithm for the sliding box to detect such objects on a image, but I'm thinking that can be very lousy solution (or maybe I'm wrong). Thanks!

rapid raft
#

im trying to implement kmeans clustering from scratch to seperate the lesion from the the image using 2 k clusters. .but the black corners keep getting added to the cluster with the lesion

#

can anyone guide me about how i can remove the black corners and only get the lesion

trail niche
rapid raft
#

due to this being a homework task i cant use the functions from opencv
have to manually manipulate the image using pixels

hasty mountain
#

Guys, quick question about Conv2Ds:
If I use kernels that are too big, will I have a greater chance of having vanishing gradients problem?

languid shard
#

Hey, is there a way to plot the decision boundary for Logistic regression which has more than 2 coefficients?

hasty mountain
#

I've noticed that most NNs use kernels of size around 3, 4... I've been testing a neural network with kernels like 51, 101 because I didn't want to add that much layers.
And exploding gradients have been giving me quite a headache. Even when I use residual blocks

trail obsidian
#

Im reading the book called "Introduction to Machine Learning with python"

Input:
prediction = knn.predict(X_new) print("Prediction: {}".format(prediction)) print("Predicted target name: {}".format( iris_dataset['target_names'][prediction]))

#

Output:
Prediction: [0] Predicted target name: ['setosa']

#

what does 0 means in here and how code knows the name setosa

#

can someone explain

hasty mountain
#

The code tries to predict the correct index for your class. And then you simply matches the index the model predicted with your class list

#

If you have something like

classes = ("Class1", "Class2", "Class3")

If you use print(classes[1]), it'll return "Class2".

If you would use print(classes[predicted]), since predicted = 0, you would get "Class1"

#

The same goes for the iris dataset, which is in a pandas DataFrame, if I remember it right

trail obsidian
unique flame
valid wind
#

hey I have a question

#

so I'm training an MLP classifier with 6000 images, but originally I set my batch size to 64, which made the model not even finish 1 epoch in 3 hours

#

what would be a reasonable batch size for this

#

I am using adam solver and relu activation

#

and learning rate of .001

hasty mountain
#

Maybe you should use smaller images

#

Or review your architecture

#

6.000 isn't that much, and 64 as batch size isn't that big, depending on your GPU

rugged comet
#
import tensorflow as tf
print(f"TensorFlow version: {tf.__version__}")

I just installed CUDA Toolkit 11.7 but I still get this warning.

2022-10-02 20:58:06.071163: W tensorflow/stream_executor/platform/default/dso_loader.cc:60] Could not load dynamic library 'cudart64_110.dll'; dlerror: cudart64_110.dll not found
2022-10-02 20:58:06.085819: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.

I have a CUDA-enabled NVidia GPU on my PC so I'd like to use it.
What did I do wrong?

valid wind
novel python
#

hi guys, so I'm trying to deal with this project and I'm not so sure what's the best approach:

I have the monthly internet usage of a variety of mobile phones for 7 months only and I want to apply a model that will predict the usage for the next month based on what I have.
The thing is, I don't think it makes sense to consider every row to train a model, since each row is a complete different client and they have complete different actions. But also, I can't run a model on a single row for every row in the dataset (not sure if that's possible or would be optimal, since there would be no way to split into different sets)

Right now the only way I can think of a solution is to use the first 6 months as X, and the last month as y to see how accurate a model would be. But still, it doesn't make sense to run the model on the whole dataset.

#

Any suggestions?

hasty grail
tacit basin
sinful latch
#

What should I use to put NaN values ​​at Literacy(%) and lines like this?

supple wyvern
#

my values are flipped around

woeful falcon
#
import keras
import numpy as np

input1 = keras.layers.Input(shape=(1,))
input2 = keras.layers.Input(shape=(1,))

merged = keras.layers.Concatenate(axis=1)([input1, input2])

dense1 = keras.layers.Dense(2, input_dim=2, activation=keras.activations.sigmoid, use_bias=True)(merged)
dense2 = keras.layers.Dense(1, activation=keras.activations.relu, use_bias=True)(dense1)

model = keras.models.Model(inputs=[input1, input2], outputs=dense2)

model.compile(optimizer='sgd', loss='mean_squared_error')

model.summary()

x1 = np.array(
    [5,6,7], dtype=float)
x2=np.array(
    [2,3,4], dtype=float)

xarr = np.stack([x1, x2], axis=1)

#y = 2x + 3y

ys = np.array(
    [16,21, 26], dtype=float)

print(ys)

model.fit(xarr,ys, epochs=100, verbose=0)


print(model.predict([3,3])[0][0] )

print(model.predict([2,3])[0][0])

#

Simple NN to predict linear formula y = 2x + 3y

#

i am trying to give two inputs to the model, but it is throwing out error saying that only one input is given

#

Someone Familiar with tensorflow/keras please help

drowsy viper
#

Okay what do you think your input is here? as far as I can tell, there is no comma around the input

stuck schooner
#

How hard it is right now to get a part time / remote job as a first experience with a Data Analyst / DS role (aside an internship) in Europe ? Is US job market accessible ? (Either as internship or contract) (looking to work with my competence aside study)

serene scaffold
#

also, you need to show the dataframe again in order to see if your line had any effect.

steady basalt
#

Anyone know how to translate sql to nosql query

#

Or experience replicating it including “joins”

#

Best idea to select entire tables and join w pandas?

fossil ivy
#

hello, has anyone here ever used SimPy?

serene scaffold
serene scaffold
serene scaffold
fossil ivy
# serene scaffold it's best to ask your actual questions, rather than ask if anyone knows about a ...

I have already coded a full simulation, but it does not run after a specific state, but always takes different time steps (so the weather in my model is checked after a full execution of a task which for example takes 27 hours)
In reality, the weather should be checked continuously while the task is executed. I do not know how to implement that and found SimPy so I was wondering if it allows you to keep on doing stuff while constantly checking for something else on the side

#

I don't have alot of CS or python knowledge, so I can't figure a way to do that in my current code

wooden sail
#

sounds more like you need multiprocessing or multithreading

wooden sail
#

you said you want to do two things at the same time on the side

steady basalt
#

😅😅

#

Dynamodb

#

I’m using boto3 atm

#

But the current plan requires having local scripts scheduled to run to a) move data to s3 and b) download data to dashboard and data processing scripts which then run locally at scheduled hours to refresh the dash

#

This sucks. How can I get live integration to the data?

#

I’m not a backend dev

weary folio
#

Why is NumPy vectorize function flattening my array? I want to run some function per every element in my 4D NumPy array. This array contains multiple 3D elements I want to run my function on.

serene scaffold
sly cairn
#

Free code camp maybe?

#

Does anybody generate XML inside files in python? If so, do you have any libraries/VS Code extensions you recommend to make the process less tedious, or do you do it manually?

serene scaffold
sly cairn
sly cairn
#

It's a bit specific, so it will probably die buried in general help

#

I'll ask there though

desert oar
sly cairn
#

That nobody replies and gets closed in a few minutes

#

whooo

weary mountain
#

Me

desert oar
# sly cairn That nobody replies and gets closed in a few minutes
  1. the timeout is 15 minutes.

  2. you are welcome to re-post if you didn't get an answer. it's often the case that nobody who knows the answer sees your question. it's always a matter of luck to some extent, as in any help forum or channel. there is no harm or shame in re-posting if you don't get an answer the first time.

  3. often questions don't get answered because they are hard to answer. maybe they don't contain enough information (see https://stackoverflow.com/help/minimal-reproducible-example) or contain too much information so that it's difficult for someone to cut to the relevant parts of the question. the #❓|how-to-get-help post contains a lot of advice for asking questions that are easier to answer, including a link to a whole webpage that this community maintains.

  4. you aren't entitled to anyone's help; everyone here is helping on a volunteer, best-effort basis. lack of an answer is one channel does not give you any right to post your question in channels where it's off-topic.

serene scaffold
grave frost
#

No wonder there's a huge research effort and entire communities around it. It's not transformers per se, but all the subtle mathematical ways via which they can be manipulated and rewritten.

grave frost
merry pike
#

is YOLOv7 good for image recognition(1 task) or only object detection?

iron basalt
# grave frost haha, that's a very subjective take though 😉 How goes your research? Any intere...

Yeah, I like very non-incremental improvements and things that are very different (so if you have some wacky ideas / math, I would like to know about it). Research is progressing well, one of the main things is just yet again realizing how good ART is (and the many variants / things based on it, I think it's fair to call it its own branch as this point like how deep learning is its own thing because of how many there are now) and we have been getting results (for world modelling and RL) on a CPU, that are close to that achieved with a Nvidia DGX Station A100 (and also way less time spent training, also it's online learning). I can't say more details without giving away identifying information (or well, making it too easy) and I would prefer to stay anonymous for now.

iron basalt
#

So yeah, ART = good (like really good, and it's still way too unexplored).

#

Other really interesting things include grid cells (still), and Triadic Memory.

grave frost
# iron basalt Yeah, I like very non-incremental improvements and things that are very differen...

ART? I think some GOFAI people mentioned it - d'you have any handy links?
Honestly, for world modelling and RL, I feel like most approaches are far from being state-of-the-art or general enough for me to appreciate its significance for AGI. World modelling, Diffusion is all the rage - and in all honesty, the results from some of the latest models like Make-A-Video and Dreambooth are quite fantastic with an unexpected amount of fidelity.

As for RL, GATO really showed how powerful Decision Transformers are - and how potentially game changing they can be for RL especially if they demonstrative strong positive transfer at scale. The amount of tasks a simple 1B model can do is just insane to me.

On the Mathy-y side, the Anthropic papers are quite the rage. There's been a huge shift towards intepretabilty, phase transitions in transformers and Grokking (very close to the famous Double decent phenomena) which defy traditional statistical wisdom. And ofc, perhaps hinting how inefficiently we're using these architectures 😉

I feel you're locked in some sort of a competition between some other rival lab and want to minimize and leaked ideas 😄 can't blame you tbh.

merry pike
iron basalt
# grave frost ART? I think some GOFAI people mentioned it - d'you have any handy links? Honest...

DL is definitely using way more than it needs to, but for now they continue to just throw more hardware and more data at it, with some improvements in the more with less department. Their end results still make for good targets/goals though if you are trying to to do more with less. And in inference they are not bad in terms of speed after sparsification, but they are static then, and we are interest in life-long learning (hence online learning via stuff like ART). But they can definitely be used to initialize / give zero shot to an online learner, so we do value them, but for other reasons than usual usage (also the math). We are also heading towards explainable AI, although we find ours already much easier to reason with than DL, but we think we can take it further. We also use the term "designable AI" which is loosely the goal of being to have more exact control over how it behaves without constraining it in a way that locks it into local minimum (too much or maybe even on purpose) and it's also part of it becoming explainable. https://en.wikipedia.org/wiki/Adaptive_resonance_theory https://www.amazon.com/Conscious-Mind-Resonant-Brain-Makes/dp/0190070552

Adaptive resonance theory (ART) is a theory developed by Stephen Grossberg and Gail Carpenter on aspects of how the brain processes information. It describes a number of neural network models which use supervised and unsupervised learning methods, and address problems such as pattern recognition and prediction.
The primary intuition behind the A...

#

Grossberg and Carpenter are the ART people, we have been in contact and showed them our stuff.

#

"defy traditional statistical wisdom." - this has been coming up for us for a long time now and it's pretty hard to make some well engineered thing when the math is not already all there yet (DL has this problem too, it's just not as bad as when one makes some very different idea and may not even be sure where to start with the math), but on the other hand, we do enjoy math, so more math to play around with and an excuse to do so.

#

So stuck between wanting to make something very different, but drifting back towards the more familiar because the math is there.

little dagger
#

would anyone like to review my project (its not long at all) and suggest improvements I could've done

desert oar
# little dagger https://github.com/kjw009/Forest-Cover-Classification

nice little project! part 1:

  • you should include a detailed description of how to obtain this cover_data.csv file (presumably somewhere from the USFS website?), ideally including a script of some kind that downloads and processes it into its current form

  • good job describing each variable in your data set, including units

  • use pandas Categorical dtype for things like class labels: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Categorical.html. internally it stores the data as int, and you can easily access the int values for putting them into a machine learning model.

  • use pandas boolean dtype for yes/no binary values (soil types)

  • normally i would say that pie charts are bad, but this "proportion of classes in dataset" chart is an okay use of it

  • it would have been good if you also displayed this "distribution of 7 forest types" the other way around in addition to what you created: one "pie" for each wilderness area, with 7 "slices" for each forest type. consider also a "small multiples" 2x2 grid of bar charts as an alternative that has more fidelity at smaller sizes: https://www.displayr.com/what-are-small-multiples/

  • excellent use of violin plots!

  • i would strongly question the interpretation of the extremes of the elevation distribution as outliers. this is just a distribution with a long tail. consider that elevation is bounded at the bottom. by comparison, if you found one or two isolated aspen stands at sea level, that would be an outlier. i see that you actually dropped these outliers from the data in your pre-processing, which i think is a big mistake, and reflects that you are over-relying on mechanical procedures. domain knowledge always supersedes textbook recipes.

#

part 2:

  • # We subract 1 from every class value to include 0 as a label for the softmax you forgot to actually subtract 1. that said, you could use pandas Categorical here and not worry about it.

  • it might be interesting to see how well it compares to xgboost, as well as to a smaller neural network with fewer layers/parameters, and even linear regression. it's important to establish baselines!

  • good job with the heatmap of the confusion matrix.

  • did you validate these results manually? i'm surprised to see better accuracy in the test set than in the training set.

  • i don't know if there is currently any research into this kind of thing with neural networks, but in traditional statistics i would be strongly tempted to add some kind of "spatial autocorrelation" into this model: the tree cover type in any cell is almost certainly correlated with the tree cover type in surrounding cells, and the rate of change in tree cover type between cells is probably correlated with the rate of change of other features (elevation, etc). however if the accuracy really is > 90%, then perhaps the gains to adding this to your model would be marginal and only useful if you needed to improve it further (e.g. i see that there is a lot of "confusion" in your model between spruce/fir and lodgepole pine). this will depend a lot on your real-world application. perhaps this could be encoded as a CNN over several "layers" of features (equivalent to RGB in image models, called "channels"), but i'd have to do some research to see what's the established best practice here.

  • consider also that the rate of change of elevation (hillside/mountainside vs flat) might itself be a useful feature.

  • it might also be interesting to look into bayesian machine learning to obtain a probability distribution over tree cover types.

wary breach
#

Hey, I'm new to machine learning (learning by myself). I'm looking into StandardScaling (and scaling in general). I do have a question though regarding this. Let's say my dataframe looks something like this (as an example): py name John Marie AyAyRon age 27 22 69 sex 0 1 0 eyes 2 0 11 In this case, columns "sex" and "eyes" refer to the group they belong to (i.e. 0 for male, 1 for female, different integers for eye colors).

My question is: Is it necessary to scale these categorical/group information or is it only necessary on large data values? (@ me on reply if possible)

lapis sequoia
heavy crow
#

This is a screenshot of my tensorflow profiling session. Is there a way for me to have lower memory fragmentation? (red line is memory fragmentation)

#

also is there a way to reduce the peak memory usage? looks like only ~2% of the time i am using 95+% of heap memory

lavish kraken
# sly cairn whooo

Post it on Reddit, this discord would hardly get you any help you help

lavish kraken
# wary breach Hey, I'm new to machine learning (learning by myself). I'm looking into Standard...

Many machine learning algorithms perform better when numerical input variables are scaled to a standard range. The two most popular techniques for scaling numerical data prior to modeling are normalization and standardization. Normalization scales each input variable separately to the range 0-1, which is the range for floating-point values where we have the most precision. Standardization scales each input variable separately by subtracting the mean (called centering) and dividing by the standard deviation to shift the distribution to have a mean of zero and a standard deviation of one

lavish kraken
lavish kraken
young granite
#

someone searching for a study budy with focus on data science (kaggle)?

supple wyvern
#

lol

little dagger
#

gonna go through your review once i get home

#

thank you once again @desert oar

lavish kraken
supple wyvern
lavish kraken
supple wyvern
#

in youtube

#

I'm pretty sure it was the neural networks module

#

if you can't find it still, I'll get you the link

lavish kraken
supple wyvern
#

sorry, I've got no idea, I was just folowing the tutorial lol

lavish kraken
#

Image similarities distance .i did some project for a client in Uk

lavish kraken
desert oar
# wary breach Hey, I'm new to machine learning (learning by myself). I'm looking into Standard...

scaling isn't only for reducing the magnitude of "big" values. putting everything on roughly the same scale can be an aid to interpretation and can make it easier for the optimization algorithm to function. scaling age makes sense in this case.

scaling the binary representation of sex is maybe a good idea, but it actually makes the interpretation a little more difficult: you are now talking about hypothetical "deviations from a probability", instead of two actual categories.

under no circumstances should you attempt to scale categorical data with more than one category, like eye color. those integer values are completely misleading, because they don't actually represent numbers with a set ordering, nor do numerical operations on them like taking differences or multiplication have any meaning. ideally you should avoid encoding categorical data as numbers in order to avoid making this mistake! pandas has good support for doing this.

lavish kraken
desert oar
desert oar
lavish kraken
#

sometimes assuming i want to do some predictive analysis on a category columnmor somthing

desert oar
#

i was more talking about how to encode the data "at rest", starting when you first load it from a file, and before you put the data into any kind of machine learning model

#

that's where text and categorical data types are useful

#

onehot encoding and such are all techniques for encoding categorical data numerically for use in a model of some kind

little dagger
#

as the cause for the train set acc being lower than the test set?

desert oar
#

note also that usually it's not a great idea to use the "validation" set from your model fitting process as your final accuracy. in this case it's ok, but if you perform any parameter tuning or feature selection against that data, you've already "burned" that data so to speak and it's no longer a valid approximation of out-of-sample data

inland zephyr
#

hello everyone

#

i want to ask for references that i could read for week-of-month forecasting, since its very scarce cases for this one

desert oar
inland zephyr
#

in weekly cycles, but the week cycle i used is not a week in year, but week of month

#

said i have data from 1st week of January and i want to forecast information for 2nd week of January

#

and when the data reached end of month (last week of January) it will be used to forecast 1st week of February

desert oar
#

it sounds like maybe you want two "levels" of cycles, both weekly and monthly

inland zephyr
#

i have considered that problem

#

maybe i will try an weekly approach from daily data then using two levels forecasting

worthy hollow
#

but now i want to add to the figure

worthy hollow
# worthy hollow

something alike (putting a circle over the squares) and highlight every degrees shown above
0
45
90
etc
is it doable on python, if so any help appreciated as i have not a single clue how to make this up

grave frost
# iron basalt DL is definitely using way more than it needs to, but for now they continue to j...

hmm. that sounds an awful lot like a Hopfield network on steriods. You say it performs well for RL?
Also, atleast from the wiki page I can't help but draw all the similarities to attention - the comparison field can be represented as a scaled matrix product, the lateral inhibition created by the recognition field is similar to a softmax operation, the vigilance parameter are the final attention matrices (where that threshold is applied to the FF layer after) - the search procedure is what stands out though.

This is interesting as this line of work alone could be used to spin of variations of attention itself.

Lastly, that search procedure stage has quite some similarity with gated mixture-of-experts style architectures - except we don't explictly code s.t the gradients affect particular subnetworks depending upon the confidence level (i.e we don't weigh some threshold, after which its decided that some relatively untouched subnetwork gets assigned this weird class - that routing mechanism is learnt, which may be a factor for improving performance compute wise with explicit hardcoding)

I had a variation off MoEs of my own - its quite a lengthy and convoluted operation, but it makes sense theoretically. I'll post it someday if I get down to actually write it properly rather than disjointed discord threads 🙂

verbal venture
#

can anyone explain sytnax + ML wise what's going on with these first 3 code bits?

#

data += 0.5 * noise data[::2] += 5 * noise[::2] data[::4] += 20 * noise[::4]

grand breach
wooden sail
#

presumably there exists a noise vector which is scaled by 0.5 and added to the data. next, dada[::2] is using slice notation to make changes to every other sample, so all even samples are modified to now have 5.5 x the noise (since we add 5 x the noise on top of the original 0.5)

#

lastly, slice notation is used again to add even more noise every 4 samples

#

for some kinds of noise, this type of multiplication in front is equivalent to changing the noise variance

#

like so:

In [32]: import numpy as np

In [33]: import matplotlib.pyplot as plt

In [34]: x = np.arange(0,100,dtype=float)

In [35]: y = 3*x + 5

In [36]: noise = np.random.normal(loc=0, scale=4, size=len(x))

In [37]: y += 0.5*noise

In [38]: plt.plot(y)
Out[38]: [<matplotlib.lines.Line2D at 0x1e2ce5bbd30>]

In [39]: y[::2] += 3*noise[::2]

In [40]: plt.plot(y)
Out[40]: [<matplotlib.lines.Line2D at 0x1e2ce5e8070>]

In [41]: y[::4] += 6*noise[::4]

In [42]: plt.plot(y)
Out[42]: [<matplotlib.lines.Line2D at 0x1e2ce581e50>]

In [43]: plt.show()
#

you can see there are clear noise spikes here and there following a particular pattern

verbal venture
wooden sail
#

i have no clue, because you gave no further context. this alone means nothing

#

it's just a particular noise distribution with a special covariance matrix

#

the importance comes from the interpretation. even though the quantity is random and called noise, just from those 3 lines you can't even tell if the target is to estimate the noise or to denoise the data. there is no significance in the data itself, but rather in the context around it

pure plover
#

Trying to rename my headers for a 1D dataframe but the numerical headers are staying. I don't see anything wrong. Here is my output from print(df.head().to_dict('list')) and the code to rename my headers

#

{'N-protein spiked lysate - Input': [25564], '40uL + Aptamer flth A': [12132], '40uL + Aptamer flth B': [12587], '40uL - Aptamer flth A': [19856], '40uL - Aptamer flth B': [20392], '60uL + Aptamer flth A': [10432], '60uL + Aptamer flth B': [10117], '60uL - Aptamer flth A': [17904], '60uL - Aptamer flth B': [18839], '80uL + Aptamer flth A': [8260], '80uL + Aptamer flth B': [7885], '80uL - Aptamer flth A': [16364], '80uL - Aptamer flth B': [10854], '40uL + Aptamer wash #1 A': [649], '40uL + Aptamer wash #1 B': [575], '40uL - Aptamer wash #1 A': [807], '40uL - Aptamer wash #1 B': [769], '60uL + Aptamer wash #1 A': [1315], '60uL + Aptamer wash #1 B': [594], '60uL - Aptamer wash #1 A': [1921], '60uL - Aptamer wash #1 B': [514], '80uL + Aptamer wash #1 A': [524], '80uL + Aptamer wash #1 B': [515], '80uL - Aptamer wash #1 A': [834], 25564: [689], 12132: [288], 12587: [190], 19856: [220], 20392: [167], 10432: [97], 10117: [77], 17904: [185], 18839: [49], 8260: [190], 7885: [182], 16364: [106], 10854: [184], 649: [39293], 575: [29479], 807: [3982], 769: [3478], 1315: [55517], 594: [47794], 1921: [8238], 514: [3414], 524: [45480], 515: [61068], 834: [10099], 689: [15114]}

#

My code for renaming the headers is

#
           '25564': '80uL - Aptamer wash #1 B',
           '12132': '40uL + Aptamer wash #2 A',
           '12587': '40uL + Aptamer wash #2 B',
           '19856': '60uL + Aptamer wash #2 A',
           '20392': '60uL + Aptamer wash #2 B',
           '10432': '40uL - Aptamer wash #2 A',
           '10117': '40uL - Aptamer wash #2 B',
           '17904': '60uL - Aptamer wash #2 A',
           '18839': '60uL - Aptamer wash #2 B',
           '8260': '80uL + Aptamer wash #2 A',
           '7885': '80uL + Aptamer wash #2 B',
           '16364':'80uL - Aptamer wash #2 A',
           '10854':'80uL - Aptamer wash #2 B',
           '649':'40uL + Aptamer Elute A',
           '575':'40uL + Aptamer Elute B',
           '807':'40uL - Aptamer Elute A',
           '769':'40uL - Aptamer Elute B',
           '1315':'60uL + Aptamer Elute A',
           '594':'60uL + Aptamer Elute B',
           '1921':'60uL - Aptamer Elute A',
           '514':'60uL - Aptamer Elute B',
           '524':'80uL + Aptamer Elute A',
           '515':'80uL + Aptamer Elute B',
           '834':'80uL - Aptamer Elute A',
           '689':'80uL - Aptamer Elute B'}
df.rename(columns = headers,
            inplace=True)```
#

I truncated the part that occurred before the problem but that part of the code looked and functioned fine

#

I'm working with Pandas

verbal venture
wooden sail
#

can you post what you're doing?

#

::n downsamples by a factor of n and it is computed automatically, it shouldn't give any errors

verbal venture
#

playing around in the compiler to see what it's doing

#

a = [i for i in range(1, 101)]
b = [i for i in range(1, 20)] #increment by 4

#

a[::4] += 20 * b[::4]

wooden sail
#

that fails because they're not the same length. the problem is not the slicing

#

you can't assign an array of one length to an array of a different length

verbal venture
#

if I make them the same length it throws me the same error

wooden sail
#

show

verbal venture
#

a = [i for i in range(1, 101)]
b = [i for i in range(1, 101)] #increment by 4

#

a[::4] += 20 * b[::4]

#

ValueError: attempt to assign sequence of size 5025 to extended slice of size 25

wooden sail
#

!e

a = [i for i in range(1, 101)] 
b = [i for i in range(1, 101)] #increment by 4
a[::4] += 20 * b[::4]
arctic wedgeBOT
#

@wooden sail :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 3, in <module>
003 | ValueError: attempt to assign sequence of size 525 to extended slice of size 25
verbal venture
#

I got the same error - 5025 was from other code

wooden sail
#

i do wonder what this is doing, since these are lists and not numpy arrays as were presumably used both in your original example and in the example i gave you

verbal venture
#

yeah I'm just testing it out so I guess it operates differently between list and numpy arrays?

wooden sail
#

super differently

#

for one, 20*b[::4] is not scalar multiplication, it makes 20 copies of the list b[::4]

#

that's where the length of 525 comes from

verbal venture
#

haha yeah

wooden sail
#

i never remember what exactly lists do when you call +=, if it makes a copy or not, but anyway, this is entirely different from the matrix operations that numpy does when using the same syntax

verbal venture
#

it adds it to the end of the list

wooden sail
#

you can probably never append to a slice of a list

verbal venture
#

but anyway, so in numpy it downsizes by the step provided in the slice, and replaces it ?

wooden sail
#

!e

a = [1,2]
a[::2] += [1]
arctic wedgeBOT
#

@wooden sail :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 2, in <module>
003 | ValueError: attempt to assign sequence of size 2 to extended slice of size 1
cerulean lantern
#

www.mlpredict.net Here I share these artificial intelligence models trained by me, with sets of data taken from Kaggle. I try to deploy them for easy use, so that anyone can put their data in the different models and get a clear result. So you have a small taste of the potential of handling the data with machine learning algorithms. for the moment the models only have a didactic use since the data sets are few to give highly reliable results.

iron basalt
# grave frost hmm. that sounds an awful lot like a Hopfield network on steriods. You say it pe...

You can think of it as a sort of "hard" attention mechanism. However, there are variants that soften it a bit too (ART is more of an umbrella term at this point). One can be make a transformer-like system with it (if you are interested and end up making one, please let me know). What we have now though we find better than transformers (only works well with the stability given by something like ART and sparsity) (it's not a recurrent network either, it's our own unique thing, but I can say that if you read Schmidhuber's papers for alternatives to recurrent, you might stumble upon a similar idea). The reason it's "hard" is to enforce sparsity ("hard sparsity" (it's never dense at any point, even in training)) of some amount and network stability so it can do incremental/online/life-long learning (see stability-plasticity dilemma & catastrophic interference), which also has the side effect of making it run very fast (ART's hidden layer(s) is a sparse representation, and later variants make it a sparse distributed representation (SDR), which has very nice properties (nice math)). It does perform well for RL, including online RL, but this is ongoing for us (when it comes to online RL (or online tasks in general), I have not seen better). It also has been scaling well for us (both up AND down (runs on embedded / good for robotics, but also on a bunch of GPUs)).

#

TLDR: ART is unexplored and can be related back to "attention". If you read Grossberg's book, it covers many of these sorts of insights (not a connection to transformers specifically, but ideas like attention in general) (has been working on it for decades and it's biologically plausible / very biologically inspired / book covers biological implementations).

#

*Transformers being as widely applicable as they are makes them kind of relate to everything, they get a lot right (closer to biological systems (directly, but also emergent (the better they perform the more they tend to mimic biology in many ways (e.g. mimicking grid cells (which the neocortex may be doing too))))).

#

"I had a variation off MoEs of my own - its quite a lengthy and convoluted operation, but it makes sense theoretically. I'll post it someday if I get down to actually write it properly rather than disjointed discord threads" - I would love to read it.

grave frost
# iron basalt You can think of it as a sort of "hard" attention mechanism. However, there are ...

ART does sound promising in that respect; I'd have to read that book sometime - it seems quite interesting. I definitely agree with you there about the applicability of ART towards transformers - a hybrid can allow for multiple properties. However, I still doubt its expressiveness is on par with transformers, atleast for now. It would definitely learn basic patterns quickly and effectively, but that is an advantage boasted by multiple biologically inspired architectures. I suppose a more geometric approach can be taken with ART (if it hasn't been done already) the additional interpretability here is quite nice though.

grave frost
# iron basalt *Transformers being as widely applicable as they are makes them kind of relate t...

Definitely - I take you say Jean Remi-King's work in comparing brain activations and DL models' activations? I wouldn't go as far as to say that these models are definitely mimicking the brain or other biological mechanisms - yet it is quite an unexpected and surprising phenomena; One'd would think models so diverged from their biological counterparts wouldn't show any similarities at all.

iron basalt
# grave frost ART does sound promising in that respect; I'd have to read that book sometime - ...

The original ART algorithm is very old and the systems built with ART / variations of it / the main idea of it are much better and can learn more than basic patterns. The convergent behavior of biological systems and non-biological is a hypothesis with some evidence to show it, but holds more when it's already more similar to the biology, like Transformers. I do not expect it to mimic all of biology, both because there are parts that are just an implementation detail due to constraints of how the biological systems were made and also just because it's not the same thing, just similar in part.

#

(It can make for a nice way to find out what some good universal and simple ways of doing things are (when both some artificial system and biology converge to the same/similar thing (it's often something more fundamental drawing them both to it, something in the math / physics) (e.g. grid cells way of encoding position))

#

(e.g. grid cells relation to Gaussian processes)

#

(And Kolmogorov complexity)

grave frost
# iron basalt "I had a variation off MoEs of my own - its quite a lengthy and convoluted opera...

I can give you a tl;dr here - basically, its a bunch of stuff together to alleviate multiple problems simultaneously. MoEs suffer from heavy gradient issues, especially when training from scratch where experts selected may disrupt gradient flow for other experts who may be slightly better at their task thus inhibiting them.

My idea in a nutshell is for w warmup steps, one shares parameters of every corresponding expert on the dataset s.t they aren't independent until they have a better initialization much closer to a local optima and are similar to the ideal manifold; The core idea however is to model n experts in L layers as modelling a Gaussian distribution over n*L trajectories. Thus, rather than choosing an independent subnetwork at each layer whose representations may or may not align with other experts, you choose a certain sequence of experts across the whole network. Thus if you chose that 'trajectory' again, you're effectively picking the same network again with no difference - allowing for a much smoother gradient flow throughout the different trajectories, which don't have to worry about their experts being replaced.

So you have only a single router in the entire network to route through the trajectory of experts it sees fit.

The Gaussian's parameters would have to be trainable ofc; and you sample top-k trajectories as normal.
Now here comes the next part - every often so steps, you 'sync' parameters. i.e, every m steps you accumulate the gradients you calculate and average them.

#

Then, you apply those gradients to the experts weighed by the router distribution (The Gaussian) Effectively, its to protect against catastrophic forgetting. By averaging gradients, you do the naive trick where trajectories most commonly called upon become more general (as they're weighted in a manner to allow for a stronger averaged update) while less common trajectories (those specialized to tasks who benefit little from generalist knowledge) don't have their precious parameters disrupted to a large degree.

Those m accumulated gradients would be decayed by some hyperparameter to ensure that as the training goes own, experts become incentivized to actually specialize. I don't expect experts to be as discrete, as one for mathematics and one for handling Reddit shitposts, but that demarcation allows for implicit data filtering - namely 4chan shitposts would be forced upon a few experts preventing other specialized experts from going too far from their distribution.

So in a nutshell, providing resistance to catastrophic forgetting, introducing implicit data filtering, allowing for more specialized experts, and encouraging well behaved gradients (to an extent) and allowing experts to collectively pool their knowledge and divide it into parts to aid transfer learning (* to an extent, but atleast beter than vanilla MoEs) all in one stroke.

It's not going to be the next AGI 😉 but I feel like its going to be a decent first paper when I start my undergrad. I suppose that's what all freshmen like to think anyway.

iron basalt
# grave frost Ok, that was a little long for a tl;dr, but atleast you get the gist. Also, htt...

We did something very similar to pathways years ago (and prior to that, pretty much exactly pathways). We no longer use it, but if I were still into purely DL, I think it would be the best way forward. It's a very good first paper (and maybe think about what happens if you were to use ART to choose the trajectories, since ART is stable (same inputs, same trajectories) and learns very fast, especially in the beginning when it populates all the initial categories... wink).

#

(Choosing what parts to train (and which to leave alone to be stable / non-interference) is kind of what ART is all about (answering the question of how the brain learns so many tasks without disrupting existing knowledge))

iron basalt
#

*The explainability of routing methods is another potential paper.

storm kelp
#

My life would have been so much easier if I took maths as an A level instead of History

misty flint
#

Theme is vector search with arXiv papers dataset and redis vector database

shell crest
#

Shot in the dark here but does anyone know when will NeuRIPS 2022 papers be announced/released/etc?

#

Author I'm looking at says 'accepted' but no online version sadge

#

I wonder if I can find the paper on openreview, but I don't think NeuRIPS sends its current stuff there?

rugged comet
#

Getting an error when running the following simple code.

import tensorflow as tf

fashion_mnist = tf.keras.datasets.fashion_mnist

(train_images, train_labels), (test_images, test_labels) = fashion_mnist.load_data()
arctic wedgeBOT
#

Hey @rugged comet!

You either uploaded a .txt file or entered a message that was too long. Please use our paste bin instead.

rugged comet
#

Anyone know why urlopen doesn't seem to know https?

mint palm
#

i am using ssh
i ran my code on small sampled data and it ran fine.
now on complete data, the terminal says killed after some time, what can be the possible reasons?

#

i was checking ram using free -g and it was fluctuating up and down continuously

wooden sail
#

is there a system log you can check? could be too much memory or cpu/gpu usage

mint palm
wooden sail
#

if the event was logged, sure

mint palm
wooden sail
#

you might need to contact your admin then

mint palm
wooden sail
#

well, several events are logged there indeed, i can't guarantee this one should show up though

#

did you sudo dmesg?

mint palm
#

i probably dont have the permission to sudo

wooden sail
#

dmesg requires elevation so that probably won't work, then

mint palm
#

i just tried and it asked for password
i remember keeping it blank and only pressing enter when it was asked, but it says sorry try again when i just press enter

lapis sequoia
#

how come im getting 2 very different values when i try to calculate RMSE?

hushed stratus
#

yo if i was to ask, on the premise of black box learning, what's the practical application or use cases for calculus math in machine learning.

unique flame
lapis sequoia
#

interesting, thank you

grave frost
dusky abyss
#

how can i get accurate floats in python

#

i want to make a program for my numerical computing course, since typing numbers out on my calculator takes a while and there's a possibility of error i want to make a program to check if i've done it correctly

#

i need scientific calculator level float accuracy

steady basalt
#

Google how to return float of x digits

wooden sail
#

double precision (which python uses by default) and the decimal library should carry you pretty far, and symbolic operations with sympy/symengine for the rest

dusky abyss
#

what are symbolic ops

wooden sail
#

the "CAS" you see in calculator names and pay extra for that allows you to manipulate symbolic expressions, like factoring x^2 - 9, for example

dusky abyss
#

i have an older scientific calc that doesnt help me factor :^)

#

did not help with those crazy factorizations at school

iron basalt
# grave frost I agree, but the hard, discrete nature of ART routing would be at odds with the ...

In the biological version the winner is applying a Gaussian activation/inhibition to its neighbors (Gaussian weights) (https://en.wikipedia.org/wiki/Self-organizing_map 😉) (It will take more than just using the plain old original ART implementation from the 80s).

A self-organizing map (SOM) or self-organizing feature map (SOFM) is an unsupervised machine learning technique used to produce a low-dimensional (typically two-dimensional) representation of a higher dimensional data set while preserving the topological structure of the data. For example, a data set with

    p

...

#

Something to consider, but I would not go down this route for your first paper. Your proposed method with the Gaussian is similar to ART, it's more indirectly implemented and soft (routing methods in general, but this version in particular) (the more one goes towards ART, the less catastrophic forgetting (but DL being the way it is, can't go all in on that)).

frigid lion
#

hey guys i want to start learning about data science and machine learning im looking for udemy course specifically jose portilla ones any1 knows the diffrence between his masterclass course and bootcamp one can maybe you know other ones that you would recommend more?

little dagger
#

@desert oar can you review my friends project? Its not that long dw

serene scaffold
devout sail
#

Why do you need the shape to be uniform?

#

Is this really a couple, or are we talking about more

robust jungle
#

I may be stupid, but what is y train for in model.fit()

#

I legitimately cannot find this information anywhere

little dagger
#

how do you post word documents

serene scaffold
#

!paste

arctic wedgeBOT
#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

lavish kraken
#

compare the accuracy of this two results

little dagger
#

Here is the project

#

the above is exploratory data analysis of the dataset

quartz grotto
#

What is the best library if I were looking to make an AI based language translation?

serene scaffold
pseudo wren
#

I’m about to start university and will have a calc class. I feel like my highschool years (especially because I graduated in 2020) were a chaotic time to learn, and as such I feel worried about retention. My major is data science and AI. I am also watching professor Leonard’s to supplement this learning.

For any of you who went through a college education for data science and Ai, how did you handle/study the math

hasty mountain
serene scaffold
hasty mountain
#

Why people tend to make audio and NLP way more complicated than it is? grumpchib

serene scaffold
hasty mountain
#

Sometimes make a multiplication to make all values positive...

serene scaffold
#

I would make tutorials, but I think I'd spend most of each tutorial telling people not to do things that I hate. (CC @misty flint)

hasty mountain
#

But even Nvidia's codes make audio preprocessing way more complicated than OpenAI's Guided Diffusion(which uses RGB images)

#

(Comparing Tacotron 2 and Guided Diffusion)

hasty mountain
#

When I post my codes into GitHub, I kinda try to explain each step, so it's kinda tutorial...

#

But it can be quite confusing as I usually try to do everything in a single file 👍
(Though I find this easier to understand)

serene scaffold
#

you can do as much as you want in one file, as long as it's well-structured bing_shrug

hasty mountain
#

It probably isn't py_guido

#

But then... this audio preprocessing thing even made me give up trying to make audio GANs...until I realized that audios are just 2D arrays...just like a grayscale image.

serene scaffold
#

everything in neural networks are arrays

hasty mountain
#

Now I just need to learn some small details in NLP... and I'll be able to have my own voiced AI VTuber hyperlemon

serene scaffold
#

good luck with that.

hasty mountain
#

Thanks, heh

serene scaffold
#

don't make a model of your voice, though

hasty mountain
#

Nah, I won't

#

It's just sad that I'll probably need to use a pretrained model. I don't feel like labeling more than 1 hour of audio.

misty flint
#

bam. ok credit me at the bottom

serene scaffold
misty flint
#

bam

#

see you already have one

lapis sequoia
#

how to interpret r-squared in running linear regression models? what values or range of values is deemed good or strong?

tacit basin
rugged comet
wooden sail
#

no, but it makes the math nicer. it changes how the gradient behaves and also lets you use the trained model on different data, as long as it's scaled correctly

rugged comet
#

Interesting, thank you.

spark ginkgo
#

Best options for hosting datasets? (Currently using GitHub)

I'm thinking that maybe GitHub is not the best option for the long term. I just uploaded a 60MB JSON Lines file, and I got the large-file warning.
Kaggle might be good, but they're no longer supporting organizations, just individual accounts. And I'd like to publish datasets under my startup company's name.
Is there anything else out there? I'm thinking I might need to build something myself: a simple front-end backed by S3 or IPFS.
Or ... I'm thinking, maybe I will stay with Git and either GitHub and GitLab: but I'll figure out ways to divide up the data into smaller files.

wicked wing
#

anyone know how to stop importlib from "escaping" a virtual environment that it's located in?

#

I'm in a Jupyter notebook and trying to run the following cell:

import matplotlib
%matplotlib widget
from matplotlib import pyplot as plt
#

I'm getting the following error message

#
---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
Cell In [1], line 3
      1 import matplotlib
----> 2 get_ipython().run_line_magic('matplotlib', 'widget')
      3 from matplotlib import pyplot as plt

File ~/.cache/pypoetry/virtualenvs/poetry-venv-py3.10/lib/python3.10/site-packages/IPython/core/interactiveshell.py:2309, in InteractiveShell.run_line_magic(self, magic_name, line, _stack_depth)
   2307     kwargs['local_ns'] = self.get_local_scope(stack_depth)
   2308 with self.builtin_trap:
-> 2309     result = fn(*args, **kwargs)
   2310 return result

File ~/.cache/pypoetry/virtualenvs/poetry-venv-py3.10/lib/python3.10/site-packages/IPython/core/pylabtools.py:359, in activate_matplotlib(backend)
    354 # Due to circular imports, pyplot may be only partially initialised
    355 # when this function runs.
    356 # So avoid needing matplotlib attribute-lookup to access pyplot.
    357 from matplotlib import pyplot as plt
--> 359 plt.switch_backend(backend)
    361 plt.show._needmain = False
    362 # We need to detect at runtime whether show() is called by the user.
    363 # For this, we wrap it into a decorator which adds a 'called' flag.

File ~/.cache/pypoetry/virtualenvs/poetry-venv-py3.10/lib/python3.10/site-packages/matplotlib/pyplot.py:264, in switch_backend(newbackend)
    261         rcParamsOrig["backend"] = "agg"
    262         return
--> 264 backend_mod = importlib.import_module(
    265     cbook._backend_module_name(newbackend))
    266 canvas_class = backend_mod.FigureCanvas
    268 required_framework = _get_required_interactive_framework(backend_mod)

File /usr/lib/python3.10/importlib/__init__.py:126, in import_module(name, package)
    124             break
    125         level += 1
--> 126 return _bootstrap._gcd_import(name[level:], package, level)

ModuleNotFoundError: No module named 'ipympl'
#

basically, IPython is trying to import ipympl from within the Jupyter notebook using importlib, but for some reason it uses the importlib from my system Python, rather than the one found in my poetry venv

cinder schooner
#

Greetings, I'm working on an art installation where I want to add some AI and realtime processing. I will be filming at some point a video of the person and I want to process that video so that I can change the person's face in the video or maybe add like a wink to the person or something like that and he needs to see it at the end so there's no time to have someone editing the video, it needs to be automated like with a deepfake model or something. Any idea where I can start?
I will be working with touch designer at some point so I can add python code and I know enough AI engineering to start this, I just don't know where to look.

lapis sequoia
modern heron
#

i'm trying to get an output for something like f a game is selected from the dataset at random what is the probability of it's score being 'awful'. trying to display the data from a csv file using pandas

desert oar
fast rivet
#

commands used:
conda skeleton pypi sentence-splitter
conda-build sentence-splitter

error from 2nd command:
Downloading source to cache: sentence-splitter-1.4_3d1d773d07.tar.gz
INFO:conda_build.source:Downloading source to cache: sentence-splitter-1.4_3d1d773d07.tar.gz
Downloading https://pypi.io/packages/source/s/sentence-splitter/sentence-splitter-1.4.tar.gz
INFO:conda_build.source:Downloading https://pypi.io/packages/source/s/sentence-splitter/sentence-splitter-1.4.tar.gz
Error: HTTP 404 NOT FOUND for url https://pypi.io/packages/source/s/sentence-splitter/sentence-splitter-1.4.tar.gz
Elapsed: 00:00.357711

An HTTP error occurred when trying to retrieve this URL.
HTTP errors are often intermittent, and a simple retry will get you on your way.
WARNING:conda_build.source:Error: HTTP 404 NOT FOUND for url https://pypi.io/packages/source/s/sentence-splitter/sentence-splitter-1.4.tar.gz
Elapsed: 00:00.357711

An HTTP error occurred when trying to retrieve this URL.
HTTP errors are often intermittent, and a simple retry will get you on your way.
Could not download https://pypi.io/packages/source/s/sentence-splitter/sentence-splitter-1.4.tar.gz

is there someone able to get the pypi sentence-splitter through conda?
if yes, then how?

desert oar
fast rivet
#

But the conda skeleton pypi command did not give any errors

desert oar
fast rivet
#

Ooh

grave frost
desert oar
fast rivet
compact star
#

hey, just looking at creating a neural network. how should I represent a neural network in python,. using numpy arrays

inner heath
#

I have questions about stt. is there some way to use the speech recognition library without internet (recognize_google only works online). or is it possible to combine / mix models in the vosk library (e.g. polish + english)?

serene scaffold
inner heath
#

yeah but speech recognition can detect language

#

but its only work online. is possible to somehow download google recognize?

#

to use this offline

serene scaffold
#

and I doubt Google will let you download their models. it's possible to train your own if you have a lot of annotated data.

verbal mulch
#

Any data science student?
I have a question

serene scaffold
verbal mulch
#

Sorry, I am new to discord, so I was a bit skeptical.
Do you know about distplot?

serene scaffold
verbal mulch
#

I provided values from 0 to 5 but it is showing me negative values as well on the x-Axis. Why is it so?

serene scaffold
#

!code

arctic wedgeBOT
#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

serene scaffold
#

^ Here's how to paste code. Remember to never ask people to read screenshots of code.

verbal mulch
#

Sns.distplot([0, 1, 2, 3, 4, 5])

desert oar
#

it's fitting a gaussian kde to that data, which is unbounded at both ends

serene scaffold
#

so, this is with seaborn

desert oar
serene scaffold
inner heath
verbal mulch
serene scaffold
strong sedge
#

how is the value selected in a knn regressor, I understood how its done for a classifier, you take the k nearest points and choose the most common label, but just averaging in case of a regressor wouldnt really be accurate ?
is it done thru some form of extrapolation ?

inner heath
serene scaffold
#

you don't know how, or you're constrained?

agile cobalt
# strong sedge how is the value selected in a knn regressor, I understood how its done for a cl...

from what I'm understanding, it seems like they use the distance to each point as a feature, so yes for the last question, but I'm not 100% sure
https://scikit-learn.org/stable/auto_examples/neighbors/plot_regression.html#sphx-glr-auto-examples-neighbors-plot-regression-py

inner heath
serene scaffold
strong sedge
#

second question, why am I learning all these algos when I can just use deep learning for everything ?
is there a type of problem where these algos (linear, clustering etc) shine at ??

desert oar
desert oar
# verbal mulch Yeah, I know it's deprecated but first I'll have to learn about Gaussian KDE in ...

In statistics, kernel density estimation (KDE) is the application of kernel smoothing for probability density estimation, i.e., a non-parametric method to estimate the probability density function of a random variable based on kernels as weights. KDE is a fundamental data smoothing problem where inferences about the population are made, based o...

inner heath
agile cobalt
strong sedge
agile cobalt
#

depends on what you want to do

#

you can probably just gloss over though

strong sedge
agile cobalt
#

in that case all I have to say is "good luck", but you'll probably need to have at least a moderate understanding of most basic models you may get asked about in interviews

strong sedge
wooden sail
#

there are tons of classical algorithms that work just as well or better, with less data and less complexity for many tasks

#

deep learning is usually a data-driven approach to optimizing those same classical algos, so you anyway need to know them for DL

strong sedge
wooden sail
#

valid if you meet the conditions for it to be applicable in the first place 😛

strong sedge
#

dont tell me deep learning also has a problem with correlation :(

#

serious question again, kaggle is good and all, but are there any easier website where I can practice ml ??

wooden sail
#

the most restrictive ones are having enough data, and having that data be representative of what you wanna infer. and yeah, cleaning up the data and the exploratory analysis you need to do BEFORE you can do DL requires some stats, commonly uncluding correlation

strong sedge
#

kaggle competitions are a bit high level for a starter

wooden sail
#

it can literally be the difference between having a model that doesn't work at all vs one that works amazing

#

at the end of the day, ML (including DL) is math, so 😛 have fun

strong sedge
wooden sail
#

that i wouldn't know. why not grab a book though?

compact star
#

does anyone know a good explanation of how I could implement the NEAT algorithm in python?

strong sedge
wooden sail
#

uni library

strong sedge
#

he has a couple of videos going over all the parts of NEAT

#

this is just the first video

strong sedge
strong sedge
brisk apex
#

at how many years of experience are you expected to learn about machine learning as data analyst/engineer? (assuming you start your job at 0 years of previous experience)

inner heath
desert oar
#

as well as general python programming and scripting skills of course

#

some data visualization skills too

#

"machine learning" as a discipline usually doesn't require a lot of specialized knowledge. it's all the other stuff i mentioned that makes it possible to deploy "machine learning" techniques in real world applications

iron basalt
# grave frost if its similar to ART, that's a good thing - right? Its a decent way to combat s...

Yeah, catastrophic forgetting is one of the main issues with DL, and this is not a hard fix like (the original) ART, but any fix is better than just not addressing it (also it seems that to some extent some DL models can learn to work around their own catastrophic forgetting (e.g. Transformers), but only so much (and it requires a lot of training time)). The proposed method is somewhere in between the hard fix (ART, no forgetting at all), and not having any fix, while still working within DL and having nice(ish) gradients.

mint palm
#

I am using ssh
I have realised i cannot get access to check logs.
What can probably be thè reason my process is getting killed.
My code only has to extract feature using resnext3d101 architecture on each video one by one.

young granite
desert oar
young granite
#

hahahaha

#

u are a wise rock

desert oar
#

i try to be

young granite
#

but tbh im a lil bit scared of my thesis

desert oar
#

if it makes you feel better, that's how my brain looks pretty much every day @serene scaffold

desert oar
#

(although you should also get comfortable with asking your advisor for help)

#

(don't be like me: make sure you have a thesis advisor before you start writing your thesis)

young granite
#

am a bit afraid of the expectations in my thesis, since i have no benchmark and i actually don't come from the field of data science

#

It is not the general fear of a thesis, but the fear of being in a subject area where you may not fit in properly.
I am currently working my way a little into the subject and realize that a lot of it is really unclear to me.

#

I mean, ok, most of the publications I read are from PhD students but still

#

i am a little afraid of the uncertain 🗿

iron basalt
desert oar
young granite
#

but thesis is mat science

desert oar
# young granite yes and in chemistry

then it's critically important to set expectations with your advisor. ask them what they think you're capable of, and have frequent check-ins to make sure that you are on track. a sensible undergrad program should have this built-into the system. but it can't hurt to make sure and be proactive.

grave frost
iron basalt
young granite
young granite
iron basalt
#

One can create a memory database of SDRs which can be transferred perfectly.

desert oar
iron basalt
#

(Also see fusion ART)

young granite
iron basalt
grave frost
#

basically, a KNN lookup for nearest QKV vectors and integrating them into the model

iron basalt
#

Sort of, but we work with SDRs because they are better than the regular old (dense) vectors used in most language models.

grave frost
#

It allows for insanely large context lengths - but the transfer learning ability has to be baked into the model to be able to leverage its external memory efficiently

#

Right - the ones Numenta proposed, I take? Sparse, and certain operations which work nicely with sparse matrices

iron basalt
#

Numenta uses SDRs too. That is because biology uses them.

#

(Which is based on Numenta's thing)

#

(Although Numenta has moved on to grid cells... 😉 )

#

(Still using SDRs, because they are one of the best known ways to represent things in ML)

grave frost
#

As I understand, they're quite fundamental in how the brain passes information around

iron basalt
#

Yeah, which why one can also transfer knowledge with them and even form a database out of them (like vector search).

#

(And ART can produce them)

grave frost
#

Fusion ART seems more focused on obtaining symbolical representations, which confuses me - since the main goal was to push towards more biologically plausible mechanisms?

iron basalt
#

SDRs have set operations.

#

Taking their intersection for example, is meaningful.

#

And from this, one can do GOFAI-like stuff.

grave frost
#

Interestingly, from my quick Wikipedia skim

.. ARAM learns a predictive model (encoded by the recognition nodes in F2 that associates combinations of key features to their respective classes ... https://en.wikipedia.org/wiki/Fusion_adaptive_resonance_theory#Adaptive_resonance_associative_map

Fusion adaptive resonance theory (fusion ART) is a generalization of self-organizing neural networks known as the original Adaptive Resonance Theory models for learning recognition categories (or cognitive codes) across multiple pattern channels. There is a separate stream of work on fusion ARTMAP, that extends fuzzy ARTMAP consisting of two fuz...

#

It seems really similar to contrastive learning

iron basalt
#

(FALCON is how RL comes into play btw)

#

(at bottom of page)

grave frost
#

just perform similarity and surface level operations

#

I don't imagine transformations play well with SDRs 🤔

iron basalt
#

Yeah, which is where more works come into play (including our internal stuff).

#

The search for the one representation to rule them all (and be able to do all the operations we want to do, and for that, we continue to look at biology for answers).

grave frost
#

but I don't really get the obsession with controlling representations. One can't effectively manipulate biological representations - why would simulated ones succeed?

#

DL models have interpretable representations to an extent - atleast for smaller counterparts. https://twitter.com/neelnanda5/status/1559060507524403200

I've spent the past few months exploring @OpenAI's grokking result through the lens of mechanistic interpretability. I fully reverse engineered the modular addition model, and looked at what it does when training. So what's up with grokking? A 🧵... (1/17) https://t.co/AutzPTjz6g

Likes

1241

Retweets

210

#

But when you scale it up - it all breaks down. It's simply too complex to be reverse engineered - yet.

iron basalt
#

Well it's not just that they can be manipulated and understood for explainability / designability, it's understanding how biology is stable, distributed and such.

#

The main thing is that even if we can't manipulate it, it def. acts as a good way to pass around knowledge.

split compass
#

How can I get LaTeX font on my plots? 🙂

iron basalt
#

It just so happens that SDRs have set operations, which was not the goal from the beginning.

#

But now that we are here, and SDRs work as well as they do, we might as well ask the question of how explainable / designable / etc we can go.

#

(As long as it does not negatively affect the rest / limit it)

grave frost
#

Hm. Sounds like a much different question to me. I for one, doubt we'll ever be able to actually interpret such models without high performance penaltiies atleast

iron basalt
grave frost
#

The representation is simply too information dense to be effectively interpreted. And if you simplify those, like GOFAI, then they won't really represent that information well.

iron basalt
#

With no downside yet.

grave frost
#

in all fairness, there wasn't any theoretical results arguing against it either.

#

Making DL sparse has been quite an old field for a lot of time

iron basalt
#

My guess is that for explainbility some system that interprets the SDRs would have to explain it for us.

grave frost
#

They work terrifically well for inference. Absolutely fail for training due to gradient issues.

grave frost
#

It's hard to develop a truly universal (and simple) framework around it

iron basalt
grave frost
#

tbf, it isn't a totally useless approach. FSD companies use GANs and autoencoder to project their latent space as close as possible to the original image distribution. They found that the learned latent actually ignores a lot of useless information

iron basalt
#

Yeah that is related to why sparsity works (sparse latent).

grave frost
#

This is a pretty old image I found - but it demonstrates it quite nicely. notice how the reconstruction below ignores useless details like the signboard, focusing on lanelines and vehicles in immediate vicinity

#

Also, the traffic barrier is visualized, but the colors of the cars isn't carried through

iron basalt
#

Yup. However, SDRs can capture most of the information in there, even if it's not as important.

#

We have such reconstruction experiments.

#

One way to see this is that with an SDR as your latent, because it's sparse, it can be a massive vector.

grave frost
#

I'm not arguing why sparse vectors are bad; but rather that using another system to interpret a black-box is only applicable to a certain degree.

#

Sparse latents on their own can provide insane speedups especially on CPU.

iron basalt
#

Yes, it can only be applied to so far, but SDRs certainly make it a lot better.

grave frost
#

Well, gradient issues with SDR's is huge problem

#

but if someone solves it, that's pretty much a Turing-award level breakthrough to me

iron basalt
#

For example, we can pick certain "atoms" in the SDR latent space and add in certain objects / parts of objects into the scene.

#

And a human can do that by hand.

#

A human can view an SDR and learn what symbols are in it by playing around with it.

iron basalt
iron basalt
tacit helm
#

Hello, does anybody know where I can get examples or exercise for event in progress problems in python ?
I am trying to build a function that gets the peak number of views, given a collection of view times.

iron basalt
grave frost
desert oar
#

actually that section has less info than i thought

#

you can select multiple columns at a time by passing a list of column names

#
x = data[['a', 'b', 'c']]
y = data['label']

for example

snow helm
#

Would anyone be able to help me organize a dataset column in descending order

desert oar
#

yes, a MLP is just a NN with fully-connected layers, what keras calls Sequential

#

at least... for the most part? there are probably some historical specifics in the terminology here

desert oar
desert oar
#

you'll have to check the documentation, but from what i remember scikit-learn does not have a CNN implementation. MLP usually does not mean CNN or anything fancier than fully-connected/dense/sequential layers.

tired bay
#

i'm trying to make a multi billion AI can you help me

#

a bot that can solve google recapcha

novel python
#

I want to apply RNN to a dataset I have of internet data usage over a period of 6 months for a variety of clients. My problem is that I have no idea how to approach that case because every row is a whole different "trend", such as:

#

those are 5 different clients, for example

#

I'm having trouble finding the best solution how to separate this dataset into test/train to predict for the next month

desert oar
#

i suggest reading the docs for these. you should be able to answer questions like this if you're expected to build such models

#

"multi-layer perceptron" is generally synonymous with feedforward, densely-connected neural networks

#

do you know what a convolutional layer is?

#

you can't really accidentally use one. you would know if you did.

spare briar
#

what is this class where they teach you cnns before you know what a mlp is

desert oar
#

it's kind of amazing how easy keras and pytorch are, and how little you do actually need to know in order to use them. but that is more of a testament to how freakishly effective deep learning is, and how well designed those libraries are. it doesn't change the situation.

#

that's ridiculous

#

if you need decent self-study materials, look into the fast.ai course

#

it's free online and the material seems to be good quality from what i've seen

#

and the people who made the course actually do know something about deep learning

spare briar
robust cliff
#

what framework should i use for multi label image recognition?

lapis sequoia
#

how can I sort the values inside a group

fast rivet
strange kindle
#

Good day everyone,
I tried creating a virtual environment in my Linux terminal with ' conda create --prefix ./env pandas matplotlib numpy'

When I tried importing pandas and co, it returns error ' module not found '

wooden sail
#

did you activate the environment?

desert oar
#

.apply(lambda df: df.sort_values(...))

desert oar
fast rivet
# desert oar i'm pretty sure both URLs worked when i tested it, and one redirected to the oth...

could there be something else wrong with my meta.yaml?

{% set version = "1.4" %}

package:
  name: "{{ name|lower }}"
  version: "{{ version }}"

source:
  url: "https://pypi.io/packages/source/{{ name[0] }}/{{ name }}/{{ name }}-{{ version }}.tar.gz"
  sha256: 3d1d773d07cc733ca2955aa87d0fa1c0a7274c6bdeec1daac5c5e92efb512f63

build:
  number: 0
  script: "{{ PYTHON }} -m pip install . -vv"

requirements:
  host:
    - pip
    - python
    - regex >=2017.12.12
  run:
    - python
    - regex >=2017.12.12

test:
  imports:
    - sentence_splitter
  requires:
    - pytest
    - pytest-runner

about:
  home: "https://github.com/berkmancenter/mediacloud-sentence-splitter"
  license: "GNU Lesser General Public v3 (LGPLv3)"
  license_family: LGPL
  license_file: 
  summary: "Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder"
  doc_url: 
  dev_url: 

extra:
  recipe-maintainers:
    - your-github-id-here
fast rivet
fast rivet
lapis sequoia
#

Hey Hii I am new to this platform

strange kindle
wooden sail
#

are you using pycharm or vscode by any chance?

#

in both of these you need to specify which venv to use as well

lapis sequoia
#

So basically I want to make my carrier in data science . Please anyone here can give me advise what step to be followed to become a data scientist

#

🙏

strange kindle
wooden sail
#

pycharm will require you to activate the venv from its terminal

strange kindle
wooden sail
#

you sure you activated it from inside pycharm and not on a terminal outside?

strange kindle
desert oar
desert oar
#

but tldr it should work unless the installation itself failed, but then you would see an error message

#

try invoking the env python directly, without activating

#

./env/bin/python or whatever

strange kindle
fast rivet
desert oar
fast rivet
desert oar
#

there might be some jank in how it follows (or doesn't follow) redirects

lilac geyser
#

Hello
I'm trying to create a neural network from scratch.
So i have choosen to create a neural network that can approximate square function i.e, if input is 2 then the output of the neural network should be 4.
As I'm coding it from scratch, I have choosen to keep a 1 input neuron and 2 hidden layers with 2 neurons and 1 output layer.
Can I build a network something like above?
Or should I need to plan building something else?
Thank you

desert oar
lilac geyser
#

Yes exactly @desert oar

desert oar
#

you absolutely can do that from scratch. people used to do it like that for many years before tensorflow came out. you will need to work through the backprop calculations by hand, which i think is also a valuable excercise.

#

it's probably best to try doing it with numpy instead of "plain" python, since numpy is so ubiquitous in scientific computing and data science

#

numpy will also give you a lot more power to scale up to bigger neural networks more easily

lilac geyser
#

Yes I'm using numpy

#

Im stuck actually

desert oar
#

i suggest starting with 0 hidden layers. just input and output layers, compute the gradient, and run sgd. this should be equivalent to a traditional linear model from statistics, and you should test your implementation by verifying that the results are the same as e.g. scikit-learn or statsmodels.

#

scale that up from 1 input to maybe 5 or 10, and from 1 output to maybe 5 or 10. then you can graduate to one hidden layer.

#

that is the progression that i recommend

#

it will force you to gradually increase both the level of complexity and generality

#

start small, build intuition and understanding, then scale up gradually. it's a tried and true method for learning things and solving problems.

#

if you can't do it with 1 input and 1 output and 0 hidden layers, how can you expect to do it with 30 inputs and 5 outputs and 2 hidden layers?

#

also the process of figuring out how to generalize from e.g. 1 input to 15 inputs, and from 1 output to 5 outputs, will i think be enlightening

lilac geyser
#

Ok I'll start from scratch again
May be I directly jumped to 2 hidden layers.
Will try to code for 1 hidden layer with 1 input and 1 output

#

Thank you @desert oar

lilac geyser
desert oar
lilac geyser
#

So how do I understand then?
Like what should i call them?

wooden sail
#

you understand them as a composition of affine and nonlinear transformations

desert oar
#

and a basic "feedforward" neural network is just feeding the output of one linear model into another

#

each "layer" is basically just a linear regression (apart from the nonlinear activations)

lilac geyser
#

Ok thank you!

wooden sail
#

i don't think regression is a great term there though

copper patio
#

What is the best module for starting with machine learning?

wooden sail
#

regression usually refers to finding params, not the application of a (part of a) model

fast rivet
desert oar
fast rivet
#

ooh

desert oar
wooden sail
#

oof

wooden sail
fast rivet
desert oar
#

i suspect that should be handled by http redirects. maybe conda is failing to follow them properly

#

good catch, that's probably affected me before as well

odd meteor
copper patio
#

Is pytorch any good?

odd meteor
# copper patio Is pytorch any good?

Have you any knowledge on ML already? If yes, then I'd presume you're interested in getting into Deep Learning. PyTorch is one of the popular deep learning frameworks currently. Just think of it as React vs. Vue of Software Development but for Machine Learning.

Is PyTorch any good? Yes, it is.

copper patio
#

Thx

serene scaffold
#

Also, all my data science helper homies are in the chat today

#

(edd, salt, and emyrs)

wooden sail
#

it's important to keep in mind jax is just numpy with jit, parallelization, and autodiff. if you want to do ML with it, you need other stuff on top or you build it from the ground up

serene scaffold
wooden sail
#

yes

#

smart indexing is kinda painful

#

smart? fancy? i forget the term numpy uses

spare briar
#

this is a feature 🙂

serene scaffold
#

Where you use masks?

wooden sail
#

using masks is ok. manipulated a mask part of an array isn't, since that is of an arbitrary size

#

XLA no likey when you do that

#

iirc there's an open issue regarding masking methods

copper patio
#

Any suggested tutorial series for pytorch - preferably available for free

austere swift
#

pytorch has several guides and examples on their website

#

Don't know of much else though

gritty stone
#

Hello,

I'm not sure if this is the correct channel for that question. I need some advice what technology/libraries to use.

I have a dataset containing words in two languages. I want to create and train a model to be able to reconstruct a word in language2 when given a word in language1 (and preferably vice versa). The languages are closely related so the model would be trained to do slight changes like sound shifts, palatalization etc. Are there any libraries that could potentially help me with this or are there any works that tackle such a problem?

serene scaffold
gritty stone
#

text

serene scaffold
#

Then sound shifts and palatalization don't matter. Models that deal with text don't know what the words sound like.

#

By the way, when looking for AI solutions, don't look for libraries. Look for implementations of techniques

#

People don't make libraries that solve specific ai problems.

#

What are the two languages?

#

Ukrainian and Russian?

gritty stone
#

Polish and Proto-Slavic

serene scaffold
#

I see

gritty stone
#

I have a dataset of reconstructed words in Proto-Slavic but it's incomplete. I want to incorporate machine learning model to do the job

serene scaffold
#

(I'm a computational linguist. But I haven't worked on proto language reconstruction.)

#

Is the problem that you don't know what the rules are?

gritty stone
#

I see. I considered the approach of simply making a set of rules and applying that to a word to reconstruct it, I think that would totally work. I was just really curious if this could be done using AI as this would both:

  • potentially learn irregular development in a few cases
  • be doable from a perspective of someone who doesn't know the rules
gritty stone
#

I might, though, have a problem with picking a correct order of applying these rules

serene scaffold
#

Even if one doesn't personally know what the rules for the sound changes are, someone does, because proto languages are unknown and are only based on applying sound changes that some linguist believes account for the differences in the child languages.

#

Forgive me if this isn't news to you.

#

You might look into sequence to sequence models

#

If you can encode each phoneme as one symbol (what it is doesn't matter as long as you can decode it later), it should be pretty straightforward. As sequence to sequence goes.

gritty stone
#

I'm aware that we don't really know what a reconstructed language sounded like. After all it's a reconstruction. I just assume those reconstructed words are correct.

#

Thank you, I'll look into that

serene scaffold
novel python
#

what is actually happening here? overfitting?