lapis sequoia May 8, 2022, 12:01 AM

#

I am sorry. But there's no internet on my pc rn. Let me try to generate a hotspot

serene scaffold May 8, 2022, 12:04 AM

#

did you try using the KFold generator?

>>> from sklearn.model_selection import KFold
>>> X = np.array([[1, 2], [3, 4], [1, 2], [3, 4]])
>>> y = np.array([1, 2, 3, 4])
>>> kf = KFold(n_splits=2)
>>> print(kf)
KFold(n_splits=2, random_state=None, shuffle=False)
>>> for train_index, test_index in kf.split(X):
...     print("TRAIN:", train_index, "TEST:", test_index)
...     X_train, X_test = X[train_index], X[test_index]
...     y_train, y_test = y[train_index], y[test_index]

lapis sequoia May 8, 2022, 12:04 AM

#

will this work for you?

serene scaffold May 8, 2022, 12:04 AM

#

lapis sequoia will this work for you?

you can't copy and paste it as text in a code block?

lapis sequoia May 8, 2022, 12:04 AM

#

from sklearn.utils import shuffle
col_num = X.shape[1]

new_Ind = []
cur_MaxScore = 0
col_Ind_Random = shuffle(range(0,col_num), random_state=1)

for cur_f in range(0, col_num):
    new_Ind.append(col_Ind_Random[cur_f])
    newData = X.iloc[:, new_Ind]
    clf=DecisionTreeClassifier(max_features="sqrt", max_depth=4, min_samples_split=22, min_samples_leaf=9, random_state=0)
    cur_Score = cross_val_score(clf, X, targets, cv=5).mean()
    if cur_Score < cur_MaxScore:
        new_Ind.remove(col_Ind_Random[cur_f])
    else:
        cur_MaxScore = cur_Score
        print("Score with " + str(len(new_Ind)) + " selected features: " + str(cur_Score))```

#

that would not show show you outputs

lapis sequoia May 8, 2022, 12:06 AM

#

serene scaffold did you try using the KFold generator? ```py >>> from sklearn.model_selection im...

Oh. Should I use a k-fold generator and loop through it and do feature scaling before each iteration?

serene scaffold May 8, 2022, 12:07 AM

#

you can copy and paste that as well. remember that you're more likely to get help when you make it as easy as possible for answerers, and text is easier to work with than screenshots/pictures.

anyway, try using the KFold generator to make the folds, and then evaluate during each iteration of the KFold generator.

serene scaffold May 8, 2022, 12:07 AM

#

lapis sequoia Oh. Should I use a k-fold generator and loop through it and do feature scaling b...

sounds like you understand what I'm saying 😄

lapis sequoia May 8, 2022, 12:08 AM

#

I studied k fold generator. I just forgot about it

#

Is it not possible to scale the whole dataset though?

#

sc.fit_transform(X)

serene scaffold May 8, 2022, 12:09 AM

#

lapis sequoia Is it not possible to scale the whole dataset though?

you could do that before starting the kfold train/eval loop, I suppose. I've never actually worked with decision trees.

lapis sequoia May 8, 2022, 12:10 AM

#

Then there would be no need to do the loop at all

serene scaffold May 8, 2022, 12:10 AM

#

lapis sequoia Then there would be no need to do the loop at all

do you know the difference between fit and fit_transform?

lapis sequoia May 8, 2022, 12:10 AM

#

cross_val_score(clf, X, targets, cv=5).mean()

lapis sequoia May 8, 2022, 12:10 AM

#

serene scaffold do you know the difference between fit and fit_transform?

no

#

I just discovered feature scaling today

#

I believe it doesn't matter in decision trees though.

serene scaffold May 8, 2022, 12:12 AM

#

lapis sequoia no

fit and transform are two operations. fit adjusts the parameters of an sklearn object, and transform modifies data in some way. whereas predict passes X data through a model and returns the predictions.

#

so, fit_transform isn't going to be a method of the decision tree classifier, because it's a model.

lapis sequoia May 8, 2022, 12:13 AM

#

This what I found on YouTube

#

Aren't both returning scaled data?

serene scaffold May 8, 2022, 12:15 AM

#

lapis sequoia Aren't both returning scaled data?

the decision tree classifier is intended to return decisions

lapis sequoia May 8, 2022, 12:16 AM

#

There's no mention of classifier as of now in code

#

Just transformation

serene scaffold May 8, 2022, 12:17 AM

#

but you have clf=DecisionTreeClassifier(max_features="sqrt", max_depth=4, min_samples_split=22, min_samples_leaf=9, random_state=0)

lapis sequoia May 8, 2022, 12:18 AM

#

lapis sequoia ```py from sklearn.utils import shuffle col_num = X.shape[1] new_Ind = [] cur_M...

Could you solve this problem first. Why isn't my accuracy getting improved even by doing feature selection.

serene scaffold May 8, 2022, 12:21 AM

#

lapis sequoia Could you solve this problem first. Why isn't my accuracy getting improved even ...

what are the features

lapis sequoia May 8, 2022, 12:22 AM

#

Index(['age', 'anaemia', 'creatinine_phosphokinase', 'diabetes',
'ejection_fraction', 'high_blood_pressure', 'platelets',
'serum_creatinine', 'serum_sodium', 'sex', 'smoking', 'time']

serene scaffold May 8, 2022, 12:22 AM

#

can you show me a few lines of the CSV as text?

lapis sequoia May 8, 2022, 12:24 AM

#

75,0,582,0,20,1,265000,1.9,130,1,0,4,1
55,0,7861,0,38,0,263358.03,1.1,136,1,0,6,1
65,0,146,0,20,0,162000,1.3,129,1,1,7,1
50,1,111,0,20,0,210000,1.9,137,1,0,7,1

#

works?

serene scaffold May 8, 2022, 12:24 AM

#

yes

#

where is it that you're doing feature selection?

lapis sequoia May 8, 2022, 12:25 AM

#

lapis sequoia ```py from sklearn.utils import shuffle col_num = X.shape[1] new_Ind = [] cur_M...

in this code

serene scaffold May 8, 2022, 12:27 AM

#

I don't get how it's doing feature selection

lapis sequoia May 8, 2022, 12:27 AM

#

It's taking in a single feature at a time. Checking if it's increasing the accuracy of the model. If yes, we take it in. If no then we discard it.

serene scaffold May 8, 2022, 12:28 AM

#

well, it doesn't look like your newData variable ever gets used.

lapis sequoia May 8, 2022, 12:30 AM

#

Oh shoot

#

Looks like I fucked up the code

serene scaffold May 8, 2022, 12:30 AM

#

now you know how I feel every day

lapis sequoia May 8, 2022, 12:30 AM

#

Man! That's what happens when you try to reuse someone else's code 😭😭😭

serene scaffold May 8, 2022, 12:30 AM

#

this just goes to show that anyone can betray you

#

anyone

#

you can never let your guard down

lapis sequoia May 8, 2022, 12:30 AM

#

I thought I would just copy it from the professors notebook and modify it

#

Oh god

serene scaffold May 8, 2022, 12:31 AM

#

professors can't write python code if their life depended on it.

lapis sequoia May 8, 2022, 12:32 AM

#

Btw. I will modify it. Back to feature scaling. If I don't do train test split. And do fit_tranform(X) and pass that transformed X in cross_val. Will it be right?

#

X_transformed=sc.fit_transform(X) 
cross_val_score(clf, X_transformed, targets, cv=5).mean()```

serene scaffold May 8, 2022, 12:33 AM

#

only one way to find out 😄

lapis sequoia May 8, 2022, 12:34 AM

#

lapis sequoia ```py X_transformed=sc.fit_transform(X) cross_val_score(clf, X_transformed, tar...

Like this

lapis sequoia May 8, 2022, 12:34 AM

#

serene scaffold only one way to find out 😄

on it

serene scaffold May 8, 2022, 12:34 AM

#

https://tenor.com/view/i-dont-know-lets-run-this-daniel-shiffman-clueless-im-not-sure-gif-17373301

Tenor

lapis sequoia May 8, 2022, 12:35 AM

#

Worked

#

https://c.tenor.com/WN2_SLl77acAAAAM/clap-tom.gif

#

Good job kolv

#

Kolv and stelercus partners
https://c.tenor.com/7Ypq9_9najcAAAAM/thumbs-up-double-thumbs-up.gif

serene scaffold May 8, 2022, 12:36 AM

#

Stelercus. Ste-ler-cus

#

or is it stel-er-cus

#

idk

lapis sequoia May 8, 2022, 12:37 AM

#

What's Is/eum

serene scaffold May 8, 2022, 12:37 AM

#

"he/him" but in latin

lapis sequoia May 8, 2022, 12:39 AM

#

Okay. So i corrected the code. It's still giving me lower accuracy than what I got by selecting all features.

serene scaffold May 8, 2022, 12:46 AM

#

🔥 🔥 🔥 🔥

#

it might be that decision trees can learn to ignore features that aren't helpful.

#

(though it would likely take additional computation time to figure out which ones those are.)

lapis sequoia May 8, 2022, 12:50 AM

#

tysm

serene scaffold May 8, 2022, 12:50 AM

#

yw

lapis sequoia May 8, 2022, 12:51 AM

#

serene scaffold it might be that decision trees can learn to ignore features that aren't helpful...

copying that to put it my report. Haha

misty flint May 8, 2022, 2:31 AM

#

serene scaffold professors can't write python code if their life depended on it.

fax

#

kekHands

languid stratus May 8, 2022, 2:42 AM

#

Anyone good with TFX?

lapis sequoia May 8, 2022, 3:20 AM

#

I am getting an accuracy boost taking p=2 instead of p=3 in knn. In the minkowski distance. But what effect does increasing p would bring in generalisation of model. Does taking p with the highest validation score suffice?

#

The boost is 78.94 to 79.60

lofty basin May 8, 2022, 4:56 AM

#

https://www.codechef.com/problems/THREEBOX

lofty basin May 8, 2022, 4:56 AM

#

lofty basin https://www.codechef.com/problems/THREEBOX

can any one can help me with this

#

output=[]
for i in range(int(input(""))):
a,b,c,d=map(int,input("").split(" "))
if (a+b+c)<=d:
output.append(1)
elif (d*2)>=(a+b+c)>=d:
output.append(2)
else:
output.append(3)
for i in output:
print(i)

lofty basin May 8, 2022, 4:57 AM

#

lofty basin output=[] for i in range(int(input(""))): a,b,c,d=map(int,input("").split(" ...

that's my program and code chef is not accepting it

lapis sequoia May 8, 2022, 5:44 AM

#

Can anyone help me on this

#

#

Anyone

worldly dawn May 8, 2022, 5:54 AM

#

lapis sequoia Anyone

Is this a test?

lapis sequoia May 8, 2022, 5:58 AM

#

No

worldly dawn May 8, 2022, 6:01 AM

#

lapis sequoia No

Ok. I will still wait 1h to help, just in case 😉

lapis sequoia May 8, 2022, 6:01 AM

#

Why?

#

It’s unlimited tries

#

Get it wrong doesn’t affect me@

worldly dawn May 8, 2022, 6:02 AM

#

if it's not a test, then there is nothing wrong waiting a bit

lapis sequoia May 8, 2022, 6:02 AM

#

Just help me it’s not a test

worldly dawn May 8, 2022, 6:02 AM

#

lapis sequoia Just help me it’s not a test

I will help you in 1h

lapis sequoia May 8, 2022, 6:03 AM

#

Okay u will give me the answer in 1 hour?

worldly dawn May 8, 2022, 6:03 AM

#

I will help you figure it out in 1h.
It's better to help you understand it than spoon feed you an answer that you don't understand

lapis sequoia May 8, 2022, 6:03 AM

#

I’m not waiting 1 hour for that

#

I’ll ask in a different server thanks anyways

worldly dawn May 8, 2022, 6:04 AM

#

!rule 8

arctic wedgeBOT May 8, 2022, 6:04 AM

#

Rules

8. Do not help with ongoing exams. When helping with homework, help people learn how to do the assignment without doing it for them.

lapis sequoia May 8, 2022, 6:04 AM

#

It’s not an exam

#

What makes u think that?

#

Like I said it’s unlimited tries

#

I just don’t have hours to wait@for 1 problem

#

It’s 12 o clock where I live

safe elk May 8, 2022, 6:33 AM

#

serene scaffold professors can't write python code if their life depended on it.

Had an experience with that he asked me to code in Matlab lmao after I helped him extend his PHD thesis with a Python script...other profs knew only Matlab so Matlab it was..

zenith hawk May 8, 2022, 6:38 AM

#

Ok very quick question. So i have like 10 graphs of same event but they differ slightly. I wanted to somehow use graph neural network to get approximation of fitting function, is it a right approach? Maybe some libs for that exist alr? Thanks.

#

Or it will be easier to get all functions from each graph and get median?

wooden sail May 8, 2022, 7:03 AM

#

that depends on what knowledge you have about the graphs. if they are different because they are afflicted with 0-mean noise, you can simply average them. if the noise is impulsive, median filtering could work. if you already know a model for what the graph should look like but not the parameter models, you can optimize for parameters (e.g. with machine learning or classical gradient/newton methods)

#

if you know the type of noise but not the model, you could try and make a denoising network that takes in several noisy versions of a signal you made yourself synthetically, and give the network knowledge of the clean version, too

cyan sierra May 8, 2022, 12:09 PM

#

Hello. I was wondering, do we typically only use cross_val_score on training data?

serene scaffold May 8, 2022, 12:18 PM

#

cyan sierra Hello. I was wondering, do we typically only use cross_val_score on training dat...

do you understand what cross validation involves?

cyan sierra May 8, 2022, 12:19 PM

#

Yes, it is already splitting the data

unique flame May 8, 2022, 12:56 PM

#

Is there someone here who is currently exploring Object detection in python?

misty flint May 8, 2022, 1:24 PM

#

safe elk Had an experience with that he asked me to code in Matlab lmao after I helped hi...

kekHands

#

one of our profs "preferred" people to do their projects in matlab

#

everyone ended up choosing python instead

#

kekHands

safe elk May 8, 2022, 1:25 PM

#

Nice go python

woven coral May 8, 2022, 2:50 PM

#

#

anyone knows how to fix this???

lapis sequoia May 8, 2022, 3:20 PM

#

Is that a test??

lapis sequoia May 8, 2022, 4:29 PM

#

Hi everyone, my pytorch ai wont work and i've narrowed its issue down to the fact that it doesnt respond to reward, i've attatched the py file of which i use to calculate its next move and how it uses reward for that

#

https://paste.pythondiscord.com/hekedehepe

#

and im wondering what i can do to make it respond to reward and calculate a move based on its reward

bronze flume May 8, 2022, 4:40 PM

#

does anyone used ACO,PSO algorithms for classification ?

odd meteor May 8, 2022, 5:44 PM

#

misty flint everyone ended up choosing python instead

😂 The kind of rebellion I love to see

echo summit May 8, 2022, 5:52 PM

#

Guys, im new to data science and im currently out of ideas, so need help 😰 . I found a solution with iterrows, but the script will running for 2 weeks, so i need better solution

# DataFrame A
# ID | NAME |   CITY   |  STATE  |
# 1  | Jeff | New York |    ""   |
# 2  |Harold|  Dallas  |         |

#DataFrame B
# ID |   CITY   | STATE |
# 1  | New York |  NYC  |
# 2  |  Houston |  TX   |
# 3  |  Dallas  |  TX   |

I need to check for every row if CITY from DataFrame A is in CITY of DataFrame B if it matches then return value from STATE column in matched row and assign to the DataFrame A

so in the end it would be

# DataFrame A
# ID | NAME |   CITY   |  STATE  |
# 1  | Jeff | New York |   NYC   |
# 2  |Harold|  Dallas  |   TX    |

atm i have

dfA = dfA.assign(state=dfA["CITY"].isin(dfB["CITY"]).astype(str))

but it only returns boolean. How would i combine it to return those states?

agile cobalt May 8, 2022, 6:03 PM

#

echo summit Guys, im new to data science and im currently out of ideas, so need help 😰 . I ...

that is just a left inner join (in sql terms), you can use merge or join method for that

#

!d pandas.merge

arctic wedgeBOT May 8, 2022, 6:03 PM

#

pandas.merge


pandas.merge(left, right, how='inner', on=None, left_on=None, right_on=None, left_index=False, right_index=False, sort=False, suffixes=('_x', '_y'), copy=True, indicator=False, validate=None)```
Merge DataFrame or named Series objects with a database-style join.

A named Series object is treated as a DataFrame with a single named column...

odd meteor May 8, 2022, 6:03 PM

#

woven coral

Confirm if there's any word token in your corpus. If your corpus list isn't empty, then the problem is from your wordcloud code

agile cobalt May 8, 2022, 6:03 PM

#

!d pandas.DataFrame.join

arctic wedgeBOT May 8, 2022, 6:03 PM

#

pandas.DataFrame.join


DataFrame.join(other, on=None, how='left', lsuffix='', rsuffix='', sort=False)```
Join columns of another DataFrame.

Join columns with other DataFrame either on index or on a key
column. Efficiently join multiple DataFrame objects by index at once by
passing a list.

echo summit May 8, 2022, 6:12 PM

#

😮 i'll check it, brb

paper oak May 8, 2022, 6:23 PM

#

I want to start learning AI ML, so from where should I start? Suggest some good resources to start with

lapis sequoia May 8, 2022, 6:26 PM

#

paper oak I want to start learning AI ML, so from where should I start? Suggest some good ...

do you have a solid basis in numpy ?

paper oak May 8, 2022, 6:28 PM

#

lapis sequoia do you have a solid basis in numpy ?

nope I have knowledge of python, so from here... in what direction should i move? I mean what should be my learning path?

#

would be great , if u give some advice

lapis sequoia May 8, 2022, 6:29 PM

#

paper oak nope I have knowledge of python, so from here... in what direction should i move...

in short: numpy -> pandas -> matplotlib (optionally seaborn & more) -> AI ML

#

have you seen the resources section ?

#

!resources

arctic wedgeBOT May 8, 2022, 6:29 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

paper oak May 8, 2022, 6:30 PM

#

lapis sequoia have you seen the resources section ?

nope I'll check it out thankss 🙂

lapis sequoia May 8, 2022, 6:43 PM

#

Hi everyone, my pytorch ai wont work and i've narrowed its issue down to the fact that it doesnt respond to reward, i've attatched the py file of which i use to calculate its next move and how it uses reward for that
https://paste.pythondiscord.com/hekedehepe
and im wondering what i can do to make it respond to reward and calculate a move based on its reward

#

Hello, I just created a bar plot with the following picture, it turns out that there is a text hit the top spine(the red sign), and I want the bar is close to the side spines(the green sign). Does anyone know the solution for these problem? Thank you

#

This is the code

lapis sequoia May 8, 2022, 7:01 PM

#

lapis sequoia This is the code

can you share the code as text and (preferably) the data ?

arctic wedgeBOT May 8, 2022, 7:13 PM

#

Hey @lapis sequoia!

It looks like you tried to attach file type(s) that we do not allow (.zip). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a, .csv, .json.

Feel free to ask in #community-meta if you think this is a mistake.

#

Hey @lapis sequoia!

You either uploaded a .txt file or entered a message that was too long. Please use our paste bin instead.

lapis sequoia May 8, 2022, 7:19 PM

#

lapis sequoia can you share the code as text and (preferably) the data ?

https://paste.pythondiscord.com/guqinuxapi.py

#

📎 Winer_Olympic_Medals.csv

#

all set-up

mild dirge May 8, 2022, 7:50 PM

#

lapis sequoia Hello, I just created a bar plot with the following picture, it turns out that t...

increase ylim and decrease* xlim

lapis sequoia May 8, 2022, 7:51 PM

#

mild dirge increase ylim and decrease* xlim

yeah that should work (though I haven't been able to sofar)

lapis sequoia May 8, 2022, 7:53 PM

#

mild dirge increase ylim and decrease* xlim

Oh yeah, i forgot about that, thank youu:D

mild dirge May 8, 2022, 7:53 PM

#

and try not to hardcode the limits, just add like a proportion from the highest bar

lapis sequoia May 8, 2022, 7:58 PM

#

lapis sequoia Oh yeah, i forgot about that, thank youu:D

lmk if it works, I kept getting 'int object is not callable', even when using an example straight from the matplotlib docs

bronze fiber May 8, 2022, 8:07 PM

#

can anyone suggest any full length pandas tutorial which goes from beginner to advanced topics?

zenith panther May 8, 2022, 8:15 PM

#

hi, i had this code that can scrape data from multiple pages, i tried to run it on google colab but it couldnt cause of the RAM somehow i tried to upgrade it but still couldnt run it so i decided to run the same code in Jupyter notebook and it's still running for 4 hours now . i didnt understand the issue, can someone help me out with it ? thank you in advance.

tacit basin May 8, 2022, 8:28 PM

#

zenith panther hi, i had this code that can scrape data from multiple pages, i tried to run it ...

How fast it supposed to run?

lapis sequoia May 8, 2022, 8:29 PM

#

lapis sequoia lmk if it works, I kept getting 'int object is not callable', even when using an...

it's worked, I added this
plt.xlim(xmin=-0.8, xmax=22)
plt.ylim(ymax = 45)

zenith panther May 8, 2022, 8:30 PM

#

tacit basin How fast it supposed to run?

well it depends on the code and the output but usually it takes seconds and maybe minutes

tacit basin May 8, 2022, 8:30 PM

#

So hours doesn't seem right

lapis sequoia May 8, 2022, 8:30 PM

#

does anyone know how to set the linewidth of the spines?

tacit basin May 8, 2022, 8:30 PM

#

Where did you run it when it took minutes?

zenith panther May 8, 2022, 8:31 PM

#

tacit basin Where did you run it when it took minutes?

not the same code tho ... at first i tried on google collab but then the RAM wasnt sufficient so it didnt run and now on jupyter it takes hours

tacit basin May 8, 2022, 8:32 PM

#

zenith panther not the same code tho ... at first i tried on google collab but then the RAM was...

How do you know it should be minutes?

#

If it's not the same code

#

Difficult to investigate this with so little details

zenith panther May 8, 2022, 8:33 PM

#

tacit basin How do you know it should be minutes?

i did extract data but it was from one page

#

so when i try to do the pagination it stuck

lapis sequoia May 8, 2022, 8:36 PM

#

zenith panther hi, i had this code that can scrape data from multiple pages, i tried to run it ...

maybe there is a problem in your code

zenith panther May 8, 2022, 8:38 PM

#

lapis sequoia maybe there is a problem in your code

well yeah it can be but in this case it should run and get error right ?

lapis sequoia May 8, 2022, 8:43 PM

#

infinite loop can be the problem too

lapis sequoia May 8, 2022, 8:43 PM

#

lapis sequoia does anyone know how to set the linewidth of the spines?

set_linewidth so

plt.gca().spines[:].set_linewidth(4)
``` to have a slightly thick border around the graph (all the spines)

zenith panther May 8, 2022, 8:45 PM

#

lapis sequoia infinite loop can be the problem too

right it does make sense.. thanks

lapis sequoia May 8, 2022, 8:49 PM

#

lapis sequoia set_linewidth so ```py plt.gca().spines[:].set_linewidth(4) ``` to have a slight...

thankss

brave osprey May 8, 2022, 10:08 PM

#

hi i want to develop a fintech do you ahve some code

#

can anyone help me with a bayesian hidden markov chain in rule?

#

in chain is the thingh

#

with stan

fiery dust May 8, 2022, 11:41 PM

#

json.decoder.JSONDecodeError: Expecting property name enclosed in double quotes: line 1 column 2 (char 1)

#

thonk

#

ok apparently if I send a json in this format:

{'symbol': 'AVAXUSDT',  'timeframe': '1',  'side': 'SHORT',  'order_type': 'Market',     'order_price': '52.535',    'tp_min': '52.535',  'tps': '[52.515, 52.51, 52.5, 52.475, 52.45]',  'sl': '52.565', 'rr': '2.8483333333'}

the error appears, but if I send the json like this:

{
   "symbol":"AVAXUSDT",
   "timeframe":"1",
   "side":"SHORT",
   "order_type":"Market",
   "order_price":"52.535",
   "tp_min":"52.535",
   "tps":"[52.515, 52.51, 52.5, 52.475, 52.45]",
   "sl":"52.565",
   "rr":"2.8483333333"
}

the error doesnt appear

echo summit May 8, 2022, 11:51 PM

#

agile cobalt that is just a `left inner join` (in sql terms), you can use `merge` or `join` m...

nice, i had some troubles with duplicates, but finally it worked, so thanks!

vestal gale May 9, 2022, 1:37 AM

#

Getting started with ML and AI, would love to see what you guys use for predicting/forecasting sales. Anyone able to help?

misty flint May 9, 2022, 1:39 AM

#

is this real or one of those fake news articles https://www.techradar.com/news/your-mechanical-keyboard-isnt-just-annoying-its-also-a-security-risk

TechRadar

Your mechanical keyboard isn't just annoying, it's also a security ...

This website is all ears

#

bc idk if you can train a feasible model using speech recognition and word probabilities

#

with NLP

#

like

#

it sounds like the accuracy wouldnt be good but thats just my intuition

safe elk May 9, 2022, 1:46 AM

#

misty flint is this real or one of those fake news articles https://www.techradar.com/news/y...

Sounds feasible different areas on the keyboard does yield different sounds...maybe even fft and signal processing can do it without resorting to nn..try tapping your keyboard an listen while drinking coffee lmao

misty flint May 9, 2022, 1:47 AM

#

Pika

#

kekHands

#

bruh

#

really?

#

wouldnt you need really good mic quality

delicate apex May 9, 2022, 1:49 AM

#

misty flint wouldnt you need really good mic quality

like the kind people are getting for their computers so they can work remotely?

safe elk May 9, 2022, 1:51 AM

#

In paper they say detect 41.8% of keystrokes and 27% of typed words correctly in such a noisy environment---even without user specific training. To investigate the pote

#

https://dl.acm.org/doi/10.1145/3328916

#

Be noisy lmao

delicate apex May 9, 2022, 1:55 AM

#

yeah, keyboards are rather awful security wise, but the vast majority of exploits require physical access, and if a bad guy can touch your computer, the keyboard is not your biggest problem

misty flint May 9, 2022, 2:25 AM

#

CLe_MonkaChrist

iron basalt May 9, 2022, 2:47 AM

#

There are more fun physical hacks, like using a laser to shine into a printer's LED through the window and uploading a virus.

#

(LEDs are inputs too)

#

It's a non issue though. Just an interesting research project.

misty flint May 9, 2022, 3:20 AM

#

RunFail RunFail

#

~~everything is a vulnerability~~

arctic wedgeBOT May 9, 2022, 7:44 AM

#

Hey @barren wedge!

You either uploaded a .txt file or entered a message that was too long. Please use our paste bin instead.

barren wedge May 9, 2022, 7:45 AM

#

How to make my model run faster in google colab?
https://paste.pythondiscord.com/pagorozeyi

upper spindle May 9, 2022, 7:55 AM

#

how do I locate my ipynb file, my os is windows 10

lapis sequoia May 9, 2022, 8:22 AM

#

upper spindle how do I locate my ipynb file, my os is windows 10

You can copy paste it

lapis sequoia May 9, 2022, 8:34 AM

#

fiery dust ok apparently if I send a json in this format: ``` {'symbol': 'AVAXUSDT', 'time...

Maybe because json doesn't allow single quote

odd meteor May 9, 2022, 10:07 AM

#

upper spindle how do I locate my ipynb file, my os is windows 10

import os
os.getcwd()

This will tell you your current work directory. If the iPython-Notebook file you're looking for is different from the one you're currently working on, just search on your pc / try looking in the right folder.

mighty spoke May 9, 2022, 10:14 AM

#

anyone know how to do this

supple leaf May 9, 2022, 10:18 AM

#

Hello, how do I reach the hours in this column?

#

df['hour'] = [x[11:13] for x in df.index]

It doesnt work.. it says this

#

odd yoke May 9, 2022, 10:42 AM

#

You must slice the timeperiod column, not the index of the dataframe

supple leaf May 9, 2022, 10:42 AM

#

so it should say df.Timeperiod?

odd yoke May 9, 2022, 10:45 AM

#

Yes

upper spindle May 9, 2022, 10:47 AM

#

does this mean they're in zip files now?

unique flame May 9, 2022, 11:07 AM

#

Anyone here familiar with YOLO? I've used inference in DarkNet and OpenCV on the same image, but the inference on OpenCV has some slightly more detections. Wonder if anyone has experienced something similar.

odd meteor May 9, 2022, 11:39 AM

#

supple leaf Hello, how do I reach the hours in this column?

You could easily convert the column to datetime object and extract the hour.

pd.to_datetime(df['time_period']).dt.hour

You could also try using the apply function + lambda on the column to extract the hour as well.

df['time_period'].apply(lambda x: x[10:12])

serene scaffold May 9, 2022, 11:56 AM

#

odd meteor You could easily convert the column to datetime object and extract the hour. ``...

always the one that uses dt 😄

#

also wouldn't df['time_period'].str[10:12] be more idiomatic?

odd meteor May 9, 2022, 12:00 PM

#

serene scaffold always the one that uses dt 😄

That's the one I mostly use when working with a time column 😃

serene scaffold May 9, 2022, 12:00 PM

#

odd meteor That's the one I mostly use when working with a time column 😃

well, if they have timestamps encoded as strings, they should really convert those to a proper datetime

supple leaf May 9, 2022, 12:06 PM

#

Big thanks guys, will try it now

urban lance May 9, 2022, 1:38 PM

#

Hey, thanks for your answer. Sorry I haven't replied till now.
I am having some trouble with the .rank() function. Can't seem to figure it out. also the code you shared is not filtering the first 2 distinct IDs but its only returning 1 unique value

urban lance May 9, 2022, 1:47 PM

#

urban lance Hey, thanks for your answer. Sorry I haven't replied till now. I am having some ...

I'll try to use dense instead of min

#

seems to have a better result but it's still not quite right

tough bolt May 9, 2022, 2:20 PM

#

How do I approach explaining Graph Neural Networks to a group of people who have absolutely no idea about neural networks let alone programming?

wooden sail May 9, 2022, 2:27 PM

#

if they're good at math, you can explain the math

urban lance May 9, 2022, 3:16 PM

#

tough bolt How do I approach explaining Graph Neural Networks to a group of people who have...

compate it to our brain?

fervent flicker May 9, 2022, 3:16 PM

#

if you're* good at math you can explain the math

#

sorry to butt in

urban lance May 9, 2022, 3:17 PM

#

fervent flicker if you're* good at math you can explain the math

but what if they're not good at math

#

they won't understand a thing either 😛

fervent flicker May 9, 2022, 3:17 PM

#

you just explain all the necessary prereqs

#

if you haven't practiced explaining things enough times it will be hard to explain it all concisely enough

#

and that's why people love cheatsheets

#

there's one on graph theory for nns, i bet

#

@tough bolt have you seen this site: https://distill.pub/2021/gnn-intro/ condensing it for nonexperts may be difficult

#

and i agree with @urban lance's idea that the human brain is a good analog for graph nns because we have compartmentalized brains that operate like them.

pseudo wren May 9, 2022, 3:41 PM

#

does anyone here have the time to review a brief colab notebook

#

like i'm talking 20 cells

#

half of which are just things you can ignore like setting up the dataset

#

i just need feedback and to see what i'm doing right/wrong

#

https://colab.research.google.com/drive/1sWE7lCxYEu0v0K0kStOu1duu5jApJSZD?usp=sharing

Google Colaboratory

#

notebook in question

candid pollen May 9, 2022, 3:45 PM

#

anyone familiar with lstm? i have a question

tough bolt May 9, 2022, 3:58 PM

#

fervent flicker <@136879578839777280> have you seen this site: https://distill.pub/2021/gnn-intr...

I haven't but it might be a good starting point. Thank you

tough bolt May 9, 2022, 3:58 PM

#

urban lance compate it to our brain?

Not a bad idea

spiral iris May 9, 2022, 4:32 PM

#

Any have to read book about DS, AI or smth?

#

To understand the basics,to extend knowledge, to see how to implement effective algorithms, projects for AI on Python?

manic heron May 9, 2022, 4:38 PM

#

recently did some citizen datascience using polars

#

https://www.dolthub.com/blog/2022-05-06-the-most-expensive-hospitals/

Why nonprofit hospitals can be so damn expensive

One of the most expensive hospitals in America may actually be a nonprofit. Insurers pay the hospital, Mary Lanning Healthcare in Nebraska…

#

here is the notebook, if anyone would like to give feedback

#

https://github.com/alecstein/dolt_datascience/blob/master/hospitals_v3/2022-05-06-prices-compared-with-medicare.ipynb

GitHub

dolt_datascience/2022-05-06-prices-compared-with-medicare.ipynb at ...

notebooks used to analysis projects. Contribute to alecstein/dolt_datascience development by creating an account on GitHub.

#

i'm not a pro data scientist, but i'm doing my best

lapis sequoia May 9, 2022, 5:04 PM

#

Is calculus really necessary to understand ml

#

?

spiral iris May 9, 2022, 5:05 PM

#

Calculus is necessary in programming after all

#

Not everything but some parts like integrals are very useful

#

And matrix math

lapis sequoia May 9, 2022, 5:06 PM

#

Oh

spiral iris May 9, 2022, 5:06 PM

#

As far as I know

#

Those are easy parts of calculus

lapis sequoia May 9, 2022, 5:06 PM

#

Okay thanks

#

Linear algebra is the basic for ml right?

lapis sequoia May 9, 2022, 5:10 PM

#

manic heron https://github.com/alecstein/dolt_datascience/blob/master/hospitals_v3/2022-05-0...

Can polars be used as an alternative for pandas? Nice project BTW

manic heron May 9, 2022, 5:11 PM

#

absolutely. i think of it as the spiritual successor to pandas

#

it's the future imo

#

i like pandas fine, but i find polars cleaner as an API and faster

#

and the code that i write is just way more elegant and readable

lapis sequoia May 9, 2022, 5:16 PM

#

Oh

serene scaffold May 9, 2022, 5:17 PM

#

manic heron i like pandas fine, but i find polars cleaner as an API and faster

I'm looking at your notebook. the most obvious difference is col, which must involve some delayed execution. it's an interesting solution to all the self-referencing indexing one often sees in pandas.

manic heron May 9, 2022, 5:18 PM

#

serene scaffold I'm looking at your notebook. the most obvious difference is `col`, which must i...

yep. polars expressions are awesome

#

imo

serene scaffold May 9, 2022, 5:18 PM

#

manic heron yep. polars expressions are awesome

are there any other similarly significant differences?

spiral iris May 9, 2022, 5:19 PM

#

lapis sequoia Linear algebra is the basic for ml right?

Sure is

manic heron May 9, 2022, 5:19 PM

#

well expressions make a huge difference but:

#

groupby/agg is cleaner

serene scaffold May 9, 2022, 5:19 PM

#

also, does it support sets?

manic heron May 9, 2022, 5:19 PM

#

select/filter are cleaner

lapis sequoia May 9, 2022, 5:19 PM

#

spiral iris Sure is

Okay thanks 🙂

manic heron May 9, 2022, 5:19 PM

#

i mean these are judgments

#

but what i'd recommend is just rewriting one of your existing notebooks in polars

#

i had a "wow" moment where i was like holy shit, this is just a better way

serene scaffold May 9, 2022, 5:20 PM

#

I actually avoid notebooks as much as possible 👀

manic heron May 9, 2022, 5:20 PM

#

if you want to comment here https://www.reddit.com/r/Python/comments/ululk1/i_used_a_new_dataframe_library_polars_to_wrangle/

r/Python - I used a new dataframe library (polars) to wrangle 300M ...

0 votes and 1 comment so far on Reddit

manic heron May 9, 2022, 5:21 PM

#

serene scaffold are there any other similarly significant differences?

i can basically give you examples, if you want specifics

spiral iris May 9, 2022, 5:21 PM

#

manic heron https://github.com/alecstein/dolt_datascience/blob/master/hospitals_v3/2022-05-0...

Read through it,understood nothing,can you explain what is this you write code in?

manic heron May 9, 2022, 5:21 PM

#

assuming other people have the same questions as you

serene scaffold May 9, 2022, 5:21 PM

#

thanks for letting me know!

spiral iris May 9, 2022, 5:24 PM

#

manic heron assuming other people have the same questions as you

Yeah looks like a deep dark forest lemon_angrysad

#

Can u explain for us in plain words which instruments did you use?

misty flint May 9, 2022, 5:28 PM

#

serene scaffold I actually avoid notebooks as much as possible 👀

kekHands

#

thats fair

#

theyve def been abused

#

but they do have their use cases

#

such as rapid experimentation

serene scaffold May 9, 2022, 5:35 PM

#

misty flint such as rapid experimentation

once the rapid experimentation is done, it all has to get composed into py files, and the notebook must be destroyed

spiral iris May 9, 2022, 5:37 PM

#

U say that python > jupyter?

#

Then why there are a lot more jupyter ai projects,then python?

wooden sail May 9, 2022, 5:42 PM

#

jupyter runs python code. it's just that it also allows you to put in tex, markdown, and images all in the same place, too, but you could just as easily make a script in a file (or a few) and run it from the terminal or your favorite IDE. jupyter is one way of doing that, too

spiral iris May 9, 2022, 5:43 PM

#

Oh so jupyter isn't necessary

wispy coyote May 9, 2022, 5:43 PM

#

serene scaffold once the rapid experimentation is done, it all has to get composed into py files...

Out of curiosity why destroy the notebook? I use the markdown in it heavily as a notebook, and I find it good to refer back to

spiral iris May 9, 2022, 5:43 PM

#

And I can use a pure python

#

Community version is enough for AI stuff, or I'l have to buy a professional?

wooden sail May 9, 2022, 5:44 PM

#

community version of? python is free

spiral iris May 9, 2022, 5:44 PM

#

Pycharm

wooden sail May 9, 2022, 5:45 PM

#

you don't need pycharm either. the community version would be fine, if you like it. you can pay for the pro version if it has anything you like. technically, all you need is notepad or any other text editor, and to download python

spiral iris May 9, 2022, 5:45 PM

#

There ide better then pycharm as far as I know

spiral iris May 9, 2022, 5:46 PM

#

wooden sail you don't need pycharm either. the community version would be fine, if you like ...

So embedded notepad for editing is enough?

wooden sail May 9, 2022, 5:47 PM

#

sure. i don't think windows' notepad supports syntax highlight though, which is imo the core thing you'd want. you could use, for instance, notepad++ or something like that

#

or spyder, if you like it. many people like vscode

spiral iris May 9, 2022, 5:48 PM

#

I'm on linux py_guido

#

Pycharm isn't a bad choice imo

wooden sail May 9, 2022, 5:49 PM

#

on linux, your default text editor supports syntax highlighting

#

or you can use vim, emacs, or (the one i use) micro

spiral iris May 9, 2022, 5:55 PM

#

You also are on linux?😎

#

I heard that vim takes ages to learn how to use it properly and to get accustomed to it's features

serene scaffold May 9, 2022, 6:09 PM

#

wispy coyote Out of curiosity why destroy the notebook? I use the markdown in it heavily as a...

Because I'm a destructive person

wispy coyote May 9, 2022, 6:10 PM

#

serene scaffold Because I'm a destructive person

I probably should take some of that energy.. *Stares at years old py2.7 files I refuse to delete, but never use*

manic heron May 9, 2022, 6:40 PM

#

spiral iris Can u explain for us in plain words which instruments did you use?

in plain words: i used this dataframe library

#

https://pola-rs.github.io/polars/py-polars/html/reference/

manic heron May 9, 2022, 6:41 PM

#

serene scaffold once the rapid experimentation is done, it all has to get composed into py files...

curious as to how you would do data science without notebooks

serene scaffold May 9, 2022, 6:44 PM

#

manic heron curious as to how you would do data science without notebooks

Regular py files and running them from the command line.

manic heron May 9, 2022, 6:45 PM

#

serene scaffold Regular py files and running them from the command line.

what's the advantage there?

serene scaffold May 9, 2022, 6:46 PM

#

Linear execution order, being able to import stuff, modularity

#

Notebooks are fine for exploratory analysis. But they're bad for production.

#

I mostly use an IPython repl for quick experimentation though.

manic heron May 9, 2022, 6:49 PM

#

you can run a notebook in order too

#

you can import stuff

serene scaffold May 9, 2022, 6:50 PM

#

I mean being able to import stuff that you create

#

if you define a function in a notebook, how do you import that into something else?

manic heron May 9, 2022, 6:51 PM

#

idk. it seems like you have a strong opinion about this, and i won't try to change your mind, but it is possible to import functions defined in notebooks

#

we used notebooks in a pretty serious cross-lab collaboration, it didn't seem to be an issue tbh

#

but yea, everyone has their tastes

serene scaffold May 9, 2022, 6:52 PM

#

how do you import functions defined in a notebook?

manic heron May 9, 2022, 6:52 PM

#

i mean i just googled this: https://duckduckgo.com/?q=importing+functions+from+another+jupyter+notebook&t=newext&atb=v312-1&ia=web

importing functions from another jupyter notebook at DuckDuckGo

DuckDuckGo. Privacy, Simplified.

#

if that was a serious, non-rhetorical question

agile cobalt May 9, 2022, 6:53 PM

#

I for one use Atom+Hydrogen, which allows for me to run python code somewhat like IPython but in normal .py files - that way I can test stuff while still keeping it well formatted

manic heron May 9, 2022, 6:53 PM

#

yea you can do that with vscode as well

#

it's dope

agile cobalt May 9, 2022, 6:53 PM

#

I admit that importing stuff can be a pain though, so breakpoints + to/from clipboard is useful

lapis sequoia May 9, 2022, 7:18 PM

#

agile cobalt I for one use Atom+Hydrogen, which allows for me to run python code somewhat lik...

What's hydrogen

#

I use atom. And ran it earlier in the regular python console. Now I do it with vscode.

agile cobalt May 9, 2022, 7:18 PM

#

https://nteract.io/atom

nteract

Take your computing experience to the next level.

nteract is a desktop application that allows you to develop rich documents that contain prose, executable code, and images.

lapis sequoia May 9, 2022, 7:19 PM

#

Damn. Looks like it is a transformation of atom into vscode

#

What's better in life 😁

echo vigil May 9, 2022, 7:50 PM

#

urban lance Hey, thanks for your answer. Sorry I haven't replied till now. I am having some ...

!e
import pandas as pd df = pd.DataFrame([[1, 1, 'a'], [1, 2, 'b'], [1, 3, 'c'], [1, 4, 'd'], [2, 7, 'e'], [2, 10, 'f'], [3, 0, 'g']], columns = ['user_id', 'timestamp', 'auxillary']) print(f"Before:\n {df}") df['rank'] = df.groupby('user_id')['timestamp'].rank(method='min', ascending=True) df = df[df['rank'] <= 2] print(f"After:\n {df[['user_id', 'timestamp', 'auxillary']]}")

arctic wedgeBOT May 9, 2022, 7:50 PM

#

@echo vigil :white_check_mark: Your eval job has completed with return code 0.

001 | Before:
002 |     user_id  timestamp auxillary
003 | 0        1          1         a
004 | 1        1          2         b
005 | 2        1          3         c
006 | 3        1          4         d
007 | 4        2          7         e
008 | 5        2         10         f
009 | 6        3          0         g
010 | After:
011 |     user_id  timestamp auxillary
... (truncated - too many lines)

Full output: https://paste.pythondiscord.com/xajiseride.txt?noredirect

echo vigil May 9, 2022, 7:51 PM

#

Correct me if I'm wrong but that seems to be working fine -- what should be different about the output on this test case?

urban prism May 9, 2022, 9:10 PM

#

Is there way I can make changes on installed libraries on Kaggle?

misty flint May 9, 2022, 9:25 PM

#

serene scaffold once the rapid experimentation is done, it all has to get composed into py files...

if i had to do this in a prod environment, i would use a tool that allows you to track model data

desert oar May 9, 2022, 9:26 PM

#

serene scaffold once the rapid experimentation is done, it all has to get composed into py files...

nbconvert 🙂

desert oar May 9, 2022, 9:26 PM

#

misty flint if i had to do this in a prod environment, i would use a tool that allows you to...

dvc is an excellent tool for this kind of thing

#

https://dvc.org

Data Version Control · DVC

Open-source version control system for Data Science and Machine Learning projects. Git-like experience to organize your data, models, and experiments.

misty flint May 9, 2022, 9:43 PM

#

desert oar dvc is an excellent tool for this kind of thing

ive heard great things about it. the creator spoke about it on a podcast once lol

desert oar May 9, 2022, 9:44 PM

#

misty flint ive heard great things about it. the creator spoke about it on a podcast once lo...

i can highly recommend it, I've used it quite a bit

#

I can't speak to how it would scale across a large team, but I've used it in a small team and it worked pretty well

#

It also made sharing data and trained models pretty easy, less time wasted re-computing things

fervent flicker May 9, 2022, 9:45 PM

#

i can't wait for https://mlem.ai/

#

it's a project related to dvc. a trained model database, if i understand correctly

nocturne wigeon May 9, 2022, 9:51 PM

#

can someone help with an exe of a bot i built please?

serene scaffold May 9, 2022, 9:52 PM

#

nocturne wigeon can someone help with an exe of a bot i built please?

an exe of a bot. doesn't sound like a data science question

nocturne wigeon May 9, 2022, 9:52 PM

#

channels quotes "ai"

#

and it is ai

serene scaffold May 9, 2022, 9:52 PM

#

okay, so it's not a discord bot? what is the problem?

nocturne wigeon May 9, 2022, 9:52 PM

#

no not a discord bot

#

a chess bot

#

trying to figuring out it's language, i tried like 3 decompilers

#

maybe its py

#

not sure tho

#

its an program but i forgot what language it is written in

serene scaffold May 9, 2022, 9:54 PM

#

if the question is really just about decompiling, try a general help channel. see #❓｜how-to-get-help

nocturne wigeon May 9, 2022, 9:54 PM

#

mhmh well not what im looking exactly for but i'll give a shot ok

eager cloak May 9, 2022, 10:58 PM

#

How would I fix this?

#

The code: ```py
from bitcoin import *
import bitcoinlib

def new():

public_key=privtopub(private_key)

private_key = random_key()
public_key=private_key.to_public()
address=public_key.to_address('P2PKH')
P2WPKH=public_key.to_address('P2WPKH')
print(f'Priv key>{private_key}\n Pub key> {public_key}\n Address>{address}\n P2WPKH> {P2WPKH}')

new()

lapis sequoia May 9, 2022, 11:12 PM

#

Not sure if this would be the right channel but

#

from ib_insync import *
from numpy import *
import yfinance as yf
import time
import pandas as pd 

ib = IB()

#vars
contract = Stock('MSFT', 'SMART', 'USD')
nextOperation = True #True = buy, False = sell
#Buys the asset if its price decreased by more than the threshold.
dipThreshold = -2.25
#Buys the asset if its price increased by more than the threshold.
upTrendThreshold = 1.5
#Sells the asset if its price has increased above the threshold since bought.
profitThreshold = 1.25
#Sells if its price has decrease by more than this threshold to stop loses.
stopLossThreshold = -2
def returnCurrentPrice():
    stock = yf.Ticker('MSFT')
    return stock.info["regularMarketPrice"]
def returnHistoricalPrice():
    stock = yf.Ticker('AMD')
    return stock.history(period='7d', interval='1d')
lastOpPrice = 90
buyOrder = LimitOrder('BUY', 1, returnCurrentPrice())
sellOrder = LimitOrder('SELL', 1, returnCurrentPrice())
def returnBalance():
    accountBalanceString = ib.accountSummary()
    for a in accountBalanceString:
        if a.tag=="CashBalance":
            return double(a.value)

#funcs
def attemptToMakeTrade():
    precentageDiff = (returnCurrentPrice() - lastOpPrice)/lastOpPrice*100
    if nextOperation == True:
        tryToBuy(precentageDiff)
    else:
        tryToSell(precentageDiff)

#

def placeBuyOrder():
    print("Placing buy order")
    ib.placeOrder(contract, buyOrder)
    lastOpPrice = returnCurrentPrice()

def placeSellOrder():
    print("Placing sell order")
    ib.placeOrder(contract, sellOrder)
    lastOpPrice = returnCurrentPrice()


def tryToBuy(precentageDiff):
    if precentageDiff >= upTrendThreshold or precentageDiff <= dipThreshold:
        placeBuyOrder()
        nextOperation = False
        print("Next operation is", nextOperation)

def tryToSell(precentageDiff):
    if precentageDiff >= profitThreshold or precentageDiff <= stopLossThreshold:
        placeSellOrder()
        nextOperation = True
        print("Next operation is", nextOperation)

def startBot():
    print("Starting...")
    ib.connect('127.0.0.1', 7497, clientId=1)
    lastOpPrice = returnCurrentPrice()
    nextOperation = True
    while(1 > 0):
        attemptToMakeTrade()
        time.sleep(30)

startBot()

#

Its expected to change the nextOperation bool but instead just repeats the tryToBuy and placeBuyOrder

#

and it just repeats that non stop

#

sorry if its a bit of a text wall

arctic wedgeBOT May 10, 2022, 12:11 AM

#

:incoming_envelope: :ok_hand: applied mute to @lapis sequoia until <t:1652142105:f> (9 minutes and 59 seconds) (reason: duplicates rule: sent 4 duplicated messages in 10s).

lapis sequoia May 10, 2022, 12:13 AM

#

lmao

arctic wedgeBOT May 10, 2022, 1:03 AM

#

pytorch_grad_cam/base_cam.py line 62

def forward(self,```
`pytorch_grad_cam/base_cam.py` line 74
```py
outputs = self.activations_and_grads(input_tensor)```
`pytorch_grad_cam/activations_and_gradients.py` line 39
```py
def __call__(self, x):```

urban prism May 10, 2022, 1:03 AM

#

This bot is amazing.

arctic wedgeBOT May 10, 2022, 1:06 AM

#

No, you are lemon_starstruck

strange stag May 10, 2022, 2:12 AM

#

looking for some help with ray and custom environments
code is bare-bones and very very simple, and the environment is as simplest ive ever seen
entire traceback is also included
https://bpa.st/F5SA

strange stag May 10, 2022, 2:50 AM

#

hmm, seems i needed to follow the directions on the ray website and add the env_config param to the env __init__

#

however, getting some numpy errors now 😕

novel python May 10, 2022, 3:09 AM

#

Guys, how do you change the figsize for jointplots and pairplots with Seaborn? I tried going for the usual plt.figure(figsize=(12,6)) but it clearly doesn't work, couldn't find anywhere how.

wind jay May 10, 2022, 3:18 AM

#

im trying to figure out how i would get started with artificial intelligence, say for example with text. how might i go about doing that?

#

like is there any good tutorials, websites anything reallly

#

would you mind expanding on that?

tacit basin May 10, 2022, 3:33 AM

#

wind jay im trying to figure out how i would get started with artificial intelligence, sa...

This is great notebook, which you can run on GPU for free on kaggle https://www.kaggle.com/code/jhoward/getting-started-with-nlp-for-absolute-beginners
fastai courses are great for deep learning as well course.fast.ai

Getting started with NLP for absolute beginners

Explore and run machine learning code with Kaggle Notebooks | Using data from U.S. Patent Phrase to Phrase Matching

tacit basin May 10, 2022, 3:38 AM

#

wind jay would you mind expanding on that?

So this is one of the ways and it's a practical way, you learn stats, and all else on a learn as you need basis. So you can start playing the real game, NLP in your case as soon as possible. With the current libraries like hugging face you can get a lot done with minimal code or stats. Then you learn as you go.

#

Hugging face NLP course is great too https://huggingface.co/course/chapter1/1

Introduction - Hugging Face Course

tacit basin May 10, 2022, 5:19 AM

#

is python 3.10 supported by major DS libs? like pandas, scki-kit, matplotlib, pytorch, etc. ?

left hazel May 10, 2022, 6:10 AM

#

Sadly not yet by pytorch (soon!), but most others do.

eager cloak May 10, 2022, 6:26 AM

#

Heya, so this code is giving me the error belowpy from bitcoin import * import bitcoinlib def new(): private_key = random_key() public_key=private_key.to_public() address=public_key.to_address('P2PKH') P2WPKH=public_key.to_address('P2WPKH') print(f'Priv key>{private_key}\n Pub key> {public_key}\n Address>{address}\n P2WPKH> {P2WPKH}') new()
Error: Desktop>python main2.py Traceback (most recent call last): File "main2.py", line 12, in <module> new() File "main2.py", line 8, in new public_key=private_key.to_public() AttributeError: 'str' object has no attribute 'to_public'

#

How can I fix that?

raven torrent May 10, 2022, 7:06 AM

#

what is the best ML library that is user friendly that is not tensorflow or keras

#

@eager cloak what is the best ML library that is user friendly that is not tensorflow or keras

low bronze May 10, 2022, 7:10 AM

#

hi

#

hi, I get a warning when i do this but it works, I dont know the right way to do this

for col in  df.columns[1:]:
    df[col] = pd.to_numeric(df[col], errors='coerce')

I get this warning. Could not find a way to solve this on stack over flow

/local_disk0/tmp/1652161177867-0/PythonShell.py:147: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
self.io.write(decoded)

#

please @ me

celest vine May 10, 2022, 7:14 AM

#

I have a supervised classification problem (yes,no), but the data is highly imbalanced. Yes - 5% and No - 95%.
Is it possible to make a good prediction model using this data?

tacit basin May 10, 2022, 7:33 AM

#

raven torrent what is the best ML library that is user friendly that is not tensorflow or kera...

pytorch with fastai

urban lance May 10, 2022, 7:34 AM

#

echo vigil Correct me if I'm wrong but that seems to be working fine -- what should be diff...

I don't understand why it is deleting the 3rd and 4th row 🤔

#

this is the result I'm looking for:

a | 1.0
a | 1.0
a | 1.0
b | 2.0
c | 3.0
c | 3.0
d | 4.0

a count of distinct values

eager cloak May 10, 2022, 7:34 AM

#

raven torrent <@566669435872477184> what is the best ML library that is user friendly that is ...

why u asking me lol

tacit basin May 10, 2022, 7:34 AM

#

eager cloak Heya, so this code is giving me the error below```py from bitcoin import * impor...

seems like random_key returns string, did you expect something else?

tacit basin May 10, 2022, 7:35 AM

#

celest vine I have a supervised classification problem (yes,no), but the data is highly imba...

use metric for imbalanced data, you could also use training techniques for imbalanced data, like undersampling, oversampling.

eager cloak May 10, 2022, 7:36 AM

#

tacit basin seems like `random_key` returns string, did you expect something else?

i mean.. no, tho why is it giving an error?

celest vine May 10, 2022, 7:44 AM

#

tacit basin use metric for imbalanced data, you could also use training techniques for imbal...

Okay, thank you!

desert pine May 10, 2022, 8:13 AM

#

Hi, I have a question. Is that possible to create product recommender systems based on their phone number instead of their username on website since
my sales dataset mostly came from whatsapp

urban lance May 10, 2022, 8:43 AM

#

urban lance this is the result I'm looking for: ``` a | 1.0 a | 1.0 a | 1.0 b | 2.0 c | 3.0 ...

I think I got it to work

sudden egret May 10, 2022, 9:16 AM

#

Have anyone worked on Time Series Classification? I am working on a problem now where I have to forecast the change in insurance carrier.

The inputs I have now so far is the Insurance Carrier Name, Companies and Premium amount of each company.

So the goal is to build a model that takes these inputs and forecast whether the company would change their carrier in future or not.

Still now I tried windowing techniques but I feel like Im missing out somewhere. There are almost 99 different companies.

#

Any help or tidbits to get started on this would be highly appreciated.

#

This is how the sample data looks.

tawny vine May 10, 2022, 9:21 AM

#

how to create an ai voice assistant

urban lance May 10, 2022, 9:26 AM

#

tawny vine how to create an ai voice assistant

ask siri

low bronze May 10, 2022, 9:37 AM

#

desert pine Hi, I have a question. Is that possible to create product recommender systems ba...

yes, you can use the phone number like an id

robust charm May 10, 2022, 10:07 AM

#

Hi, has anyone here used stablebasline3?

tawny vine May 10, 2022, 11:43 AM

#

urban lance ask siri

using py bruh

tawny vine May 10, 2022, 11:44 AM

#

robust charm Hi, has anyone here used stablebasline3?

yes

robust charm May 10, 2022, 12:05 PM

#

tawny vine yes

Hi im training a chess AI using self play. I have a couple questions about rewards and training if you have the time

lapis sequoia May 10, 2022, 12:10 PM

#

hey guys, How to combine multiple hyperspectral images into one? ```py

Example: Iterate over data set

for sample in dataset:
datacube, labelmap = sample
print(datacube.shape, labelmap.shape)
output -
(240,520,16)(240,520)
(240,520,16)(240,520)
(240,520,16)(240,520)
(240,520,16)(240,520)

charred spear May 10, 2022, 1:39 PM

#

https://www.asiaone.com/digital/army-robots-and-zero-human-workers-will-build-dam-china?utm_source=tldrnewsletter

AsiaOne

'An army of robots' and zero human workers will build a dam in China

China is using artificial intelligence (AI) to effectively turn a dam project on the Tibetan Plateau into the world's largest 3D printer, according to scientists involved in the project.The 180 metre (590 feet) high Yangqu hydropower plant will...

queen torrent May 10, 2022, 1:44 PM

#

sudden egret Have anyone worked on Time Series Classification? I am working on a problem now ...

Hi, I'm currently working on Time Series. I can look into this if you provide me a few more details.

sudden egret May 10, 2022, 1:45 PM

#

queen torrent Hi, I'm currently working on Time Series. I can look into this if you provide me...

Sure what other information you need? or else you want me to DM?

queen torrent May 10, 2022, 1:46 PM

#

sudden egret Sure what other information you need? or else you want me to DM?

yes you can send the info over DM.

inland zephyr May 10, 2022, 1:55 PM

#

does anyone have good references for document verification methods? i wonder if non-CNN approach is available somewhere

serene scaffold May 10, 2022, 1:57 PM

#

inland zephyr does anyone have good references for document verification methods? i wonder if ...

document verification. what are you trying to verify?

inland zephyr May 10, 2022, 2:00 PM

#

just if a word or sentence is in the document

#

like let said the sender name should be inside the document

#

i know there is possibility that the name in the document will be abbreviated, so i will try to check the similarity of the phrase in the document with the sender name

odd meteor May 10, 2022, 2:33 PM

#

inland zephyr like let said the sender name should be inside the document

If you already knew the specific words/sentence you wanna search for in the document, you could do something like this


list_of_words = ['Money Laundering', 'Police', 'Weed', 'Cocaine', 'Drugs', 'Gun']

df.loc[df["document_message"].str.contains('|'.join(list_of_words), na=False)] 

'''
If you wanna flag 🚩 document where such words appeared you could do this
'''

df['flag'] = np.where( (df['document'].str.contains('|'.join(list_of_words)) == True) 1, 0)

misty flint May 10, 2022, 3:14 PM

#

nice word list

#

kekHands

serene scaffold May 10, 2022, 3:23 PM

#

odd meteor If you already knew the specific words/sentence you wanna search for in the docu...

why not

df['flag'] = df.loc[df["document_message"].str.contains('|'.join(list_of_words), na=False)].astype(bool)

odd meteor May 10, 2022, 3:31 PM

#

serene scaffold why not ```py df['flag'] = df.loc[df["document_message"].str.contains('|'.join(l...

The one that comes to mind first. This is shorter and it works as well.

lapis sequoia May 10, 2022, 3:50 PM

#

!docs

arctic wedgeBOT May 10, 2022, 3:50 PM

#

All inventories (`41` total)

• aiohttp
• arcade
• arrow
• asciimatics
• asyncpg
• attr
• black

lapis sequoia May 10, 2022, 3:50 PM

#

!resources

arctic wedgeBOT May 10, 2022, 3:50 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

lapis sequoia May 10, 2022, 3:50 PM

#

can anyone give me ML docs

serene scaffold May 10, 2022, 3:56 PM

#

lapis sequoia can anyone give me ML docs

for what library?

urban prism May 10, 2022, 4:02 PM

#

Does someone here have experience with OpenMMLabs?

inland zephyr May 10, 2022, 4:35 PM

#

odd meteor If you already knew the specific words/sentence you wanna search for in the docu...

will take note this shipit 👍

mental bane May 10, 2022, 5:44 PM

#

This might be a dumb question but can someone please explain me the use of 'passthrough' in this code and in general?

tacit basin May 10, 2022, 6:13 PM

#

urban prism Does someone here have experience with OpenMMLabs?

A bit. What's your question

urban prism May 10, 2022, 6:17 PM

#

Do you know how img_metas should be formed? I'm using another library and it sends the image to mmdet without that, which causes an error. I changed some lines in the libraries to get past it but now I have to manually declare it in the library's code

#

Planing to declare it here https://github.com/open-mmlab/mmdetection/blob/73b4e65a6a30435ef6a35f405e3474a4d9cfb234/mmdet/models/detectors/base.py#L112
Right now img_metas is NoneType

arctic wedgeBOT May 10, 2022, 6:20 PM

#

mmdet/models/detectors/base.py line 112

def forward_test(self, imgs, img_metas, **kwargs):```

gray orchid May 11, 2022, 12:56 AM

#

inland zephyr i know there is possibility that the name in the document will be abbreviated, s...

Try paddle ocr, light but powerful

lapis sequoia May 11, 2022, 2:06 AM

#

serene scaffold for what library?

Python machine learning

urban prism May 11, 2022, 2:07 AM

#

What ML library? There are many of them @lapis sequoia

serene scaffold May 11, 2022, 2:22 AM

#

lapis sequoia Python machine learning

There are a lot of those. You have to know which library you want before you can look for the docs of it.

lapis sequoia May 11, 2022, 2:22 AM

#

oh idk about ml lib let me check it out

serene scaffold May 11, 2022, 2:23 AM

#

It sounds like there's a misunderstanding here. What are you trying to do? Just learn about machine learning?

lapis sequoia May 11, 2022, 2:26 AM

#

serene scaffold It sounds like there's a misunderstanding here. What are you trying to do? Just ...

i want to learn machine learning

#

im confused

#

um these?

#

can u suggest which one should i use? irdk

neon imp May 11, 2022, 2:31 AM

#

Not sure this is the channel, but anyone done any quantum computing work?

serene scaffold May 11, 2022, 2:37 AM

#

neon imp Not sure this is the channel, but anyone done any quantum computing work?

We don't really have a channel for that. You can try a general help channel, but ask your actual question.

serene scaffold May 11, 2022, 2:38 AM

#

lapis sequoia can u suggest which one should i use? irdk

Sounds like you should read a machine learning book. All those libraries solve different problems. Learning individual libraries is not a viable way to learn ml

lapis sequoia May 11, 2022, 2:38 AM

#

okay

misty flint May 11, 2022, 2:46 AM

#

@spare briar i remember you were into bandits https://eugeneyan.com/writing/bandits/

eugeneyan.com

Bandits for Recommender Systems

Industry examples, exploration strategies, warm-starting, off-policy evaluation, and more.

neon imp May 11, 2022, 2:48 AM

#

Yeah I was just wondering. Quantum has a lot of overlap with Data Science stuff in my head, was wondering if some people here did it.

#

Just because Quantum is 100% about state following a probability distribution.

median moat May 11, 2022, 2:57 AM

#

Everywhere I go I see his(rex) face.

misty flint May 11, 2022, 2:57 AM

#

sadblob

median moat May 11, 2022, 2:57 AM

#

No don't leave

plucky harness May 11, 2022, 4:53 AM

#

I need some help. I have created a handwritten data of 1000 png images and now I'm stuck with the part of pre-processing and feature engineering. I mean I am facing problems with how to import, load, and process the data for model training with scikit-learn.

mighty bloom May 11, 2022, 5:52 AM

#

someone help me in finding the syntax error in this?

tacit basin May 11, 2022, 6:46 AM

#

mighty bloom someone help me in finding the syntax error in this?

How do you know you have syntax error here?

tacit basin May 11, 2022, 6:47 AM

#

plucky harness I need some help. I have created a handwritten data of 1000 png images and now I...

Whats your end goal? Image classification?

mighty bloom May 11, 2022, 6:53 AM

#

tacit basin How do you know you have syntax error here?

i run the code n i got the error

mighty bloom May 11, 2022, 6:53 AM

#

tacit basin How do you know you have syntax error here?

is there no error in the code?

tacit basin May 11, 2022, 6:57 AM

#

mighty bloom is there no error in the code?

What's the error message?

mighty bloom May 11, 2022, 6:59 AM

#

tacit basin What's the error message?

SYNTAX ERROR : invalid syntax in line 129 which is the 1st line of "Output(...)"

tacit basin May 11, 2022, 7:26 AM

#

mighty bloom SYNTAX ERROR : invalid syntax in line 129 which is the 1st line of "Output(...)"

is this plotly?

#

from plotly website i see this should be the syntax?:

@app.callback(
    Output(component_id='body-div', component_property='children'),
    Input(component_id='show-secret', component_property='n_clicks')
)

mighty bloom May 11, 2022, 7:37 AM

#

tacit basin is this plotly?

yup

mighty bloom May 11, 2022, 7:37 AM

#

tacit basin from plotly website i see this should be the syntax?: ```py @app.callback( O...

oh thanks a lot

gray swallow May 11, 2022, 7:39 AM

#

Guys can you help me

main geode May 11, 2022, 7:39 AM

#

Hey I am looking for some amazing people to colab on a project, anyone's interested.

gray swallow May 11, 2022, 7:40 AM

#

I'm using SVM classifier

#

I used two columns as outputs (Y) for my model

main geode May 11, 2022, 7:40 AM

#

I am a beginner in ML and wanted to begin with some good project if we could make up a team or someone could add to his project would love to help

gray swallow May 11, 2022, 7:41 AM

#

main geode I am a beginner in ML and wanted to begin with some good project if we could mak...

Me too buddy. Add me to your team

main geode May 11, 2022, 7:42 AM

#

gray swallow Me too buddy. Add me to your team

That' cool, let just others reply than we form a team

#

@gray swallow y ( the label) should be a 1dimension array

eager cloak May 11, 2022, 7:43 AM

#

How can I turn a P2PKH BTC address into a P2SH address?

tacit basin May 11, 2022, 7:43 AM

#

gray swallow I'm using SVM classifier

do you want to use multi output ? https://scikit-learn.org/stable/modules/multiclass.html

scikit-learn

1.12. Multiclass and multioutput algorithms

This section of the user guide covers functionality related to multi-learning problems, including multiclass, multilabel, and multioutput classification and regression. The modules in this section ...

gray swallow May 11, 2022, 7:43 AM

#

tacit basin do you want to use multi output ? https://scikit-learn.org/stable/modules/multic...

Thanks let me check it

tacit basin May 11, 2022, 7:44 AM

#

gray swallow Thanks let me check it

but figure out first if you want to use multioutup or sigle output is sufficient for you

gray swallow May 11, 2022, 7:44 AM

#

main geode <@781868298626662401> y ( the label) should be a 1dimension array

I used list of tuples

main geode May 11, 2022, 7:45 AM

#

what dataset are you using

gray swallow May 11, 2022, 7:45 AM

#

Here is the dataset. I combined risk_score and default values as outputs . So I zipped them into a list of tuples and did the same for the X values

📎 financial_data.csv

#

import pandas as pd
import matplotlib.pyplot as plt

import seaborn as sn

import numpy as np

from sympy import together

import matplotlib.pyplot as pyplot
import pickle
from matplotlib import style
from sklearn.model_selection import train_test_split
from sklearn import svm
from sklearn import metrics

import data

dataset = pd.read_csv('financial_data.csv')

now we start the Exploratory Data Analysis process

print(dataset.head())

response = dataset["default", "risk_score"]

response values

default_values = dataset["default"]
risk_score_values = dataset["risk_score"]
place_holders = np.zeros( len(risk_score_values))

print(place_holders)

exit()

Y_values is a combination of risk score and default values

Y_values = list(zip(risk_score_values, default_values, place_holders, place_holders, place_holders, place_holders, place_holders, place_holders))

Y_values = list(zip(risk_score_values, default_values))

X values defined here

age = dataset["age"]
pay_schedule = dataset["pay_schedule"]
home_owner = dataset["home_owner"]
income = dataset["income"]
months_employed = dataset["months_employed"]
years_employed = dataset["years_employed"]
has_debt = dataset["has_debt"]
amount_requested = dataset["amount_requested"]

zippping X values together

X_values = list(zip(age, pay_schedule, home_owner, income, months_employed, years_employed, has_debt,amount_requested ))

drop columns

dataset = dataset.drop(columns=["default","entry_id", "risk_score"])

X_train,X_test, y_train, y_test = train_test_split(dataset, Y_values, test_size=0.2, random_state=0)

clf = svm.SVC()

clf.fit(X_train, y_train)

y_predict = clf.predict(X_test)

acc = metrics.accuracy_score(y_test, y_predict)

print(acc)

#

That was the code

main geode May 11, 2022, 7:47 AM

#

No bro that's now how svm works, It requires a label of 1d array not combination of all the col

#

choose one feature as your label and then make predictions

gray swallow May 11, 2022, 7:48 AM

#

main geode No bro that's now how svm works, It requires a label of 1d array not combination...

I didn't know about that. Thanks bro.

main geode May 11, 2022, 7:48 AM

#

Here i can see Risk_score is the label here, therfore y = dataset["risk_score"]

#

Your code would work more nice if you use jupyter notebook or google colab

#

you can do the process step by step though jupyter notebook

gray swallow May 11, 2022, 7:50 AM

#

Ok thanks bro

#

I'm creating a system to predict whether any applicant would default or not.

#

Will it work if I use default as my label here

main geode May 11, 2022, 7:57 AM

#

yes definitely

barren wedge May 11, 2022, 8:43 AM

#

How to solve this?

RuntimeError: CUDA out of memory. Tried to allocate 5.56 GiB (GPU 0; 11.17 GiB total capacity; 6.59 GiB already allocated; 3.88 GiB free; 6.73 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

eager cloak May 11, 2022, 8:47 AM

#

All the text is styled in yellow, is there a way for me to seperate it?
Like make the unblurred text yellow and the other text green

This is essentially the code```py
from bitcoinaddress import Wallet
info=input(f"{bcolors.RESET}Private key>{bcolors.YELLOW} ")
wallet = Wallet(info)
print(wallet)

plush glacier May 11, 2022, 9:30 AM

#

does anyone here know how i could add a tensor of shape (3,4) to a specific part of a larger tensor for example a tensor with shape (40,10) and add it to the point of (20:3,3:4) of the large tensor
with tensorflow
it is a small part of a school assingment the entire assingment is making a tetris ai and this is just for the tetris game part

#

pls ping me if you reply to me

wooden sail May 11, 2022, 9:41 AM

#

plush glacier does anyone here know how i could add a tensor of shape (3,4) to a specific part...

what do you mean by the point (20:3,3:4)? you mean (20:23,3:7)?

tacit basin May 11, 2022, 9:45 AM

#

barren wedge How to solve this? RuntimeError: CUDA out of memory. Tried to allocate 5.56 GiB...

you are trying to fill more stuff into your gpu that it can handle. use smaller model, smaller batch size, if these are images, lower image size...

hasty mountain May 11, 2022, 9:58 AM

#

Hey guys, can someone give me some help with Gym-Retro? I didn't edit not even a single thing in the files, yet I still get this error.

#

This is (literally) the entire code(excluding imports):

env = retro.make(game="StreetFighterIISpecialChampionEdition-Genesis")
obs = env.reset()

env.render(close=True) # habilita o fechamento da janela
env.close()

checkpoint = CheckpointCallback(save_freq=100000, save_path="D:/Python/Projects/Hakisa")

model = PPO2(policy="CnnPolicy", env=env, gamma=0.99, n_steps=64, learning_rate=3e-9, vf_coef=0.5, verbose=1)
start = time.time()
model.learn(total_timesteps=1000000, log_interval=100, reset_num_timesteps=True, callback=checkpoint)
end = time.time()

print("Duration: ", (end-start)/3600)

#

PS: Ping me if you can help.

lapis sequoia May 11, 2022, 10:00 AM

#

hasty mountain Hey guys, can someone give me some help with Gym-Retro? I didn't edit not even a...

hi, if noone here knows you could keep an eye on this thread post https://github.com/openai/large-scale-curiosity/issues/8 (the one from yesterday)

hasty mountain May 11, 2022, 10:03 AM

#

hasty mountain Hey guys, can someone give me some help with Gym-Retro? I didn't edit not even a...

Oh yes, I've looked in retro_env.py and there's nothing unusual there. There's a self.em attribute correctly assgined, etc. There's only a garbage collector to ensure there'll be only a single emulator running at a time.

#

Unless that garbage collector is deleting the current emulator I'm running, but this shouldn't be happening, right?

barren wedge May 11, 2022, 10:08 AM

#

tacit basin you are trying to fill more stuff into your gpu that it can handle. use smaller ...

this is my code

inputs = self.tokenizer([first_token] + name.tolist(),
max_length=self.max_length,
truncation=self.truncation,
padding=self.padding,
return_tensors='pt').to(device)

attention = inputs.attention_mask

outputs = self.model(**inputs)

len of the name = 121401

tropic pecan May 11, 2022, 10:46 AM

#

Hi Yall, I'm working on a project where we are trying to optimize some cooling patterns for a specific manufacturing process and the simulation result will be similar to this image and I also have the data with coordinates and temperatures at those certain points during that process. So I was wondering what sort of ML algorithms I would need to optimize the cooling patterns

plush glacier May 11, 2022, 11:02 AM

#

wooden sail what do you mean by the point (20:3,3:4)? you mean (20:23,3:7)?

no i mean he second one i dont think that i thought much when i said it so (20:23,3:7)

wooden sail May 11, 2022, 11:04 AM

#

all right. well the main issue is that tensorflow likes keeping things const

plush glacier May 11, 2022, 11:05 AM

#

ik so is there a way to add it to it

wooden sail May 11, 2022, 11:05 AM

#

one work around is to circumvent it entirely: cast your tensors to numpy nd arrays, do the math there, and then make a new tensorflow tensor and assign it to the original one

#

alternatively, you can make tensorflow modify the entries of the tensor, but i don't recall how to do that in any easy way

plush glacier May 11, 2022, 11:06 AM

#

wouldn't it be possible to pad the tensor that is added

#

to match the size

wooden sail May 11, 2022, 11:06 AM

#

sure, that's also an option, but that's very inefficient

plush glacier May 11, 2022, 11:06 AM

#

well it is only for very small tensors so memory isn't a issue

#

it would probably be like 10x faster than the numpy solution

wooden sail May 11, 2022, 11:07 AM

#

probably, yeah

plush glacier May 11, 2022, 11:07 AM

#

because the data doesn't have to be moved from gpu to cpu and then back to the gpu

#

not that that would be a solution that i would want because i dont know how it will pad the image

wooden sail May 11, 2022, 11:11 AM

#

how about this

#

https://www.tensorflow.org/api_docs/python/tf/tensor_scatter_nd_update

TensorFlow

tf.tensor_scatter_nd_update | TensorFlow Core v2.8.0

Scatter updates into an existing tensor according to indices.

#

https://www.tensorflow.org/api_docs/python/tf/tensor_scatter_nd_add

TensorFlow

tf.tensor_scatter_nd_add | TensorFlow Core v2.8.0

Adds sparse updates to an existing tensor according to indices.

#

this works, for example:

import tensorflow as tf
import numpy as np

x = tf.constant( [[1,2,3],
                [4,5,6],
                [7,8,9]])

y = tf.constant( [[1,1],
                [3,3]] )

indices = np.zeros((2,2,2), dtype=int) #2 x num rows in y x num cols in y
indices[0,:,:] = np.arange(0,2).reshape(-1,1) #row indices
indices[1,:,:] = np.arange(1,3).reshape(1,-1) #col indices
indices = indices.reshape(2,-1).T #reshaped into size #inds x 2
print(indices)

print(tf.tensor_scatter_nd_add(x, indices, tf.reshape(y, [4])))

#

pretty convoluted though, and does not add in-place. it makes a new tensor

proud pond May 11, 2022, 11:36 AM

#

Hellow everyone
I tried a lot of times to learn reinforcement learning, but I always find myself lost or I don't know what's the next step or what to learn.
I would like to put a plan for myself to learn so I don't find myself one day lost and without a task.
Can you guys help me with that, would appreciate it.
Like with resources, a link, a plan or anything.
Thank you pixels_snek_2

tacit basin May 11, 2022, 11:40 AM

#

barren wedge this is my code inputs = self.tokenizer([first_token] + name.tolist(), ...

what's your batch size?
you get this error when you train or make prediction?

plush glacier May 11, 2022, 11:56 AM

#

wooden sail this works, for example: ```py import tensorflow as tf import numpy as np x = t...

i was thinking more something like https://www.tensorflow.org/api_docs/python/tf/pad

TensorFlow

tf.pad | TensorFlow Core v2.8.0

Pads a tensor.

#

it seems to support a bit more things

#

and no numpy needed

#

although i dont know if i could make a solution that jit compiles well

wooden sail May 11, 2022, 11:59 AM

#

no numpy was needed in what i did either. this thing also makes the tensor sparse, so the 0 entries don't wase more memory

#

what i did was just a makeshift cartesian product. there are other ways of doing it without numpy

#

and it was only to make the indices, not the actual tf tensors

plush glacier May 11, 2022, 12:00 PM

#

oh to place the thing at the right place i think it will be just some simple math

wooden sail May 11, 2022, 12:01 PM

#

probably the same math i put there 😛

#

numpy and tensorflow (probably) should have a meshgrid function that does the same thing

plush glacier May 11, 2022, 12:01 PM

#

https://www.tensorflow.org/api_docs/python/tf/meshgrid

TensorFlow

tf.meshgrid | TensorFlow Core v2.8.0

Broadcasts parameters for evaluation on an N-D grid.

wooden sail May 11, 2022, 12:01 PM

#

there you go

#

but this will make the index array in gpu, which i thought was kinda pointless

plush glacier May 11, 2022, 12:02 PM

#

i dont think that a index array would be possible because each game will be slightly different

wooden sail May 11, 2022, 12:03 PM

#

you need the index array both for padding and for the function i suggested

#

otherwise you don't know where in the larger tensor to add the smaller one

plush glacier May 11, 2022, 12:04 PM

#

it is always at the same place at the start

wooden sail May 11, 2022, 12:04 PM

#

sure, and then it will presumably change as time goes on. you have to track it some how

#

either in gpu or memory

plush glacier May 11, 2022, 12:04 PM

#

well my solution for that was to make it 0.5 instead of 1

wooden sail May 11, 2022, 12:04 PM

#

make what 0.5?

plush glacier May 11, 2022, 12:05 PM

#

the value of the one that can move

wooden sail May 11, 2022, 12:05 PM

#

idk what you mean by that. if it works for you, cool 😛

#

but in any case, you always need to know WHERE to add the smaller tensor, otherwise the operation is not well defined. neither in math nor in code

plush glacier May 11, 2022, 12:06 PM

#

ik and i also know that i should try to avoid stupid mistakes like #data-science-and-ml message

barren wedge May 11, 2022, 12:45 PM

#

tacit basin what's your batch size? you get this error when you train or make prediction?

Make prediction in model(**inputs)
Throw me error

safe moss May 11, 2022, 1:09 PM

#

is this the channel to talk about pandas in?

rose agate May 11, 2022, 1:09 PM

#

the animal? no. the package? yes

safe moss May 11, 2022, 1:17 PM

#

excellent

#

i have a query,

i had three dataframes that i believe i correctly processed so that their columns were identical and any that were not present in all three were removed

#

as you can see they were all reduced to 116

#

but i used the concatenate command on them, expecting them all to merge seamlessly and remain at 116 columns but i have experienced this unexpected result and i am not sure why

#

( i added one extra column)

#

the display comes out with 233 columns and i am not sure why

rose agate May 11, 2022, 1:26 PM

#

safe moss i have a query, i had three dataframes that i believe i correctly processed so ...

I don't remember how concat works but I think it might be because the column names aren't the same

safe moss May 11, 2022, 1:26 PM

#

rose agate I don't remember how concat works but I think it might be because the column nam...

but i think i used the column names to filter them out initially

rose agate May 11, 2022, 1:27 PM

#

what is the contents of q_set?

safe moss May 11, 2022, 1:27 PM

#

so basically all three data frames had a number of columns, i then reduced them down so that the only columns left in each were columns that were present in all three dfs

#


for df in df_array:    
    for question in df.loc[0]:
        map[question] = map.get(question, 0) + 1

q_set = set()



for key, val in map.items():

    if val == 3:

        q_set.add(key)

print(len(q_set))

116

rose agate May 11, 2022, 1:32 PM

#

Are there null values in the columns of the final frame?

rose agate May 11, 2022, 1:58 PM

#

You might need to specify axis=1

safe moss May 11, 2022, 1:58 PM

#

i assume you mean in the newly created df?

rose agate May 11, 2022, 1:58 PM

#

Other than that I'm not sure

#

Yeah

safe moss May 11, 2022, 1:59 PM

#

one moment

rose agate May 11, 2022, 1:59 PM

#

Also I'd try if set(df1.columns) == set(df2.columns)

safe moss May 11, 2022, 2:02 PM

#

what will that do, sorry my brain is a bit fried atm from something else

#

creating two sets and comparing them?

rose agate May 11, 2022, 2:03 PM

#

Yes, just testing that the columns in each of the data frame are the same

#

Also I'd try adding axis=1 in your statement and see if that fixes it

safe moss May 11, 2022, 2:03 PM

#

rose agate Are there null values in the columns of the final frame?

rose agate May 11, 2022, 2:04 PM

#

Either your problem is that the columns aren't the same names or something else I don't know

safe moss May 11, 2022, 2:08 PM

#

rose agate Also I'd try adding axis=1 in your statement and see if that fixes it

this actually increases the columns from 233 to 351, but 351 is 3*117 so that makes some kind of sense i suppose

#

ill try the set

safe moss May 11, 2022, 2:10 PM

#

rose agate Also I'd try if set(df1.columns) == set(df2.columns)

i guess the problem is here somewhere

rose agate May 11, 2022, 2:12 PM

#

Looks like it, try getting q_set by doing the intersections of those sets

#

Not sure if I'm remembering right, but I think it may only allow you to do 1 intersection at a time

versed gulch May 11, 2022, 2:21 PM

#

Hi does anyone know how to overlay 2 images over one another in python i.e. the mask onto the original image using the numpy arrays?

tacit basin May 11, 2022, 2:23 PM

#

barren wedge Make prediction in model(**inputs) Throw me error

What's the model size? And input size? Does these both fit into GPU memory?

wooden sail May 11, 2022, 2:39 PM

#

versed gulch Hi does anyone know how to overlay 2 images over one another in python i.e. the ...

you could, for instance, use matplotlib to plot one of the images first, then plot the other on top with transparency and a possibly different color. alternatively, you could use the images in gray-scale to define different color layers of an RGB(a) image

safe moss May 11, 2022, 2:40 PM

#

rose agate Looks like it, try getting q_set by doing the intersections of those sets

lemon_grumpy

rose agate May 11, 2022, 2:42 PM

#

safe moss <:lemon_grumpy:754441880158339132>

Looks like there's only 31 common columns. Is concat working now?

versed gulch May 11, 2022, 2:42 PM

#

wooden sail you could, for instance, use matplotlib to plot one of the images first, then pl...

with transparency and different colour, how would I do this?

safe moss May 11, 2022, 2:42 PM

#

do you mean doing concat as per before or changing the formulae

wooden sail May 11, 2022, 2:43 PM

#

versed gulch with transparency and different colour, how would I do this?

instead of me coding it, you can check out this example on stackoverflow. i think it shows exactly what you want: https://stackoverflow.com/questions/31877353/overlay-an-image-segmentation-with-numpy-and-matplotlib
namely, the accepted response

rose agate May 11, 2022, 2:43 PM

#

I think axis=1 was wrong, try without

versed gulch May 11, 2022, 2:44 PM

#

wooden sail instead of me coding it, you can check out this example on stackoverflow. i thin...

thanks

rose agate May 11, 2022, 2:44 PM

#

Make sure you update the frames with the function you wrote before to drop everything not in the set

lapis sequoia May 11, 2022, 3:04 PM

#

So i have to switch rows and columns without using the transpose function. This code is somehow not working, and is giving errors because I use matrix[y] and matrix[x][y]. I dont get why, does someone know how to fix this code?

wooden sail May 11, 2022, 3:04 PM

#

what error do you get?

#

also, what type is "matrix"? list of lists? np array?

lapis sequoia May 11, 2022, 3:05 PM

#

#

matrix is a pandas dataframe

wooden sail May 11, 2022, 3:06 PM

#

ah. sadly i've never used pandas :x hopefully someone else can help you out

lapis sequoia May 11, 2022, 3:07 PM

#

thank you anyways!

serene scaffold May 11, 2022, 3:10 PM

#

lapis sequoia So i have to switch rows and columns without using the transpose function. This ...

hello, I might be able to help, but I won't look at screenshots of code. If you have a numpy array named arr, arr.T should be sufficient to transpose it most of the time. There are other options if you have more than two dimensions and you don't just want to reverse them.

lapis sequoia May 11, 2022, 3:11 PM

#

for this function i'm not allowed to use df.T. I am trying to do it with a for loop but it's giving errors

serene scaffold May 11, 2022, 3:12 PM

#

lapis sequoia for this function i'm not allowed to use df.T. I am trying to do it with a for ...

why are you not allowed to use .T? is this part of an assignment?

lapis sequoia May 11, 2022, 3:12 PM

#

yes it's an assignment

serene scaffold May 11, 2022, 3:12 PM

#

alright. Can you show the code as text in a markdown block?

#

!code

arctic wedgeBOT May 11, 2022, 3:12 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

serene scaffold May 11, 2022, 3:12 PM

#

^ this is the only format I will accept. I will also not accept screenshots of error messages.

lapis sequoia May 11, 2022, 3:14 PM

#

def transpose(matrix):
    transposed = pd.DataFrame()
    
    for y in range(len(matrix)):
        for x in range(len(matrix[y])):
            transposed[y][x] = matrix[x][y]
    

    return transposed

wooden sail May 11, 2022, 3:15 PM

#

my impression is that, instead of a double loop, you'd wanna iterate through the rows or columns only. pick one, not the other. read the lists, and put them into another list of lists. then make a new dataframe from that. i don't know the exact functions to do that though

serene scaffold May 11, 2022, 3:16 PM

#

There are a few problems here:

You would need to initialize the DataFrame with empty cells for the desired shape
You need to also transpose the columns and indices of the original DataFrame and put those in the transposed DataFrame
You index Dataframes with the iloc accessor, which means "index location". See the next point:
DataFrames are one data structure. They are not a list of lists. Expressions like transposed[y][x] should be transposed.iloc[x, y] because you are indexing the one data structure, not two.

#

Keep in mind that you would never do this in a real situation, and this assignment is almost completely pointless.

#

any "real" code that involves allocating empty space in a DataFrame and then putting stuff into it later is wrong.

#

oh, you also need to be using .iloc

lapis sequoia May 11, 2022, 3:19 PM

#

okay, i guess that should help me for now, thank you! It's indeed a really annoying assignment since df.T would be so much easier and faster ;/

serene scaffold May 11, 2022, 3:19 PM

#

lapis sequoia okay, i guess that should help me for now, thank you! It's indeed a really annoy...

as long as you understand why you should never actually write code like this, I suppose it might be useful for helping you understand what all .T does.

brazen totem May 11, 2022, 3:19 PM

#

Sorry to barge in but I was told I should ask here instead of the general question channel

Let's say I'm trying to predict a category (decided using arbitrary statistical cutoffs) using these same cutoffs with other additional statistics. How do I determine how well my regression predicts a category without arbitrarily assigning them beforehand?

Or is it just impossible?

serene scaffold May 11, 2022, 3:21 PM

#

brazen totem Sorry to barge in but I was told I should ask here instead of the general questi...

I'm not sure I follow. do you mind making your question less abstract? what are the categories, and what are the "statistical cutoffs"?

brazen totem May 11, 2022, 3:23 PM

#

I'm working on a project on categorizing poker player types. I use 3 statistics VPIP, PRF, and Limping to decide which player is which category. If these 3 values are within certain ranges they are marked as that player type. Then I run an aggregate on each category onto a larger set of statistics. Then I use both these aggregate statistics and the 3 statistics I used to sort category to predict player category

#

I'm trying to see if this makes sense or I'm doing something really wrong

serene scaffold May 11, 2022, 3:25 PM

#

so you are trying to decide what category a poker player belongs to. in data science we call this "classification". so you care trying to predict the classes of poker players.

brazen totem May 11, 2022, 3:25 PM

#

Yes

serene scaffold May 11, 2022, 3:26 PM

#

well, I guess it might not be classification per se if the class is determined by properties of the player that are always known

#

anyway

wooden sail May 11, 2022, 3:26 PM

#

it does make sense. you could take the statistics as a vector of numbers input into your classifier, and the output is the class the player belongs to (encoded in some way. probabilities, most likely)

serene scaffold May 11, 2022, 3:27 PM

#

nothing about it immediately jumps out at me as terrible

brazen totem May 11, 2022, 3:27 PM

#

The idea is to first create broad categorizations of poker players to create strong counter strategies

and then try to predict which category a random player is given their stats (to choose the proper strategy)

wooden sail May 11, 2022, 3:28 PM

#

the only thing is that depending on how the aggregates are computed, some might be redundant. not a huge problem if it's not that many

brazen totem May 11, 2022, 3:28 PM

#

I'm trying to do this in python but I'm not sure how exactly I would structure this

wooden sail May 11, 2022, 3:31 PM

#

you're essentially trying to pair up vectors of these statistics with labels. how you do that is a bit more freehand, though. for example, the labels could be encoded as a single number or as a vector. i also wouldn't know what the best architecture for this is. a support vector machine can split up the parameter space with hyperplanes, but you probably don't know a priori that this is the best way of doing it

brazen totem May 11, 2022, 3:31 PM

#

it feels like it would just predict with 100% accuracy if I include the 3 main stats I used to categorize in the first place

wooden sail May 11, 2022, 3:32 PM

#

if you generate the numbers yourself perfectly and without any noise, and this model perfectly describes what you are analyzing, yes, that would be the case (and also if the phenomenon can be measured without noise). this is why results that are entirely synthetic should be taken with a grain of salt if you don't put some work into noise statistics

brazen totem May 11, 2022, 3:33 PM

#

so I need to add some randomization into how I categorize?

wooden sail May 11, 2022, 3:34 PM

#

you'd probably wanna contaminate the measured numbers on purpose, or as you say, label some data incorrectly, or something like that. if you're making everything up and there is no noise or anything, and you know the model perfectly, there's hardly any need to use ML in the first place

#

you could use classical optimization methods on your known model to learn the parameters and that's all

brazen totem May 11, 2022, 3:35 PM

#

wooden sail you'd probably wanna contaminate the measured numbers on purpose, or as you say,...

there's a lot of noise in poker statistics

#

since it takes an obnoxious amount of hands to be certain about stats

wooden sail May 11, 2022, 3:36 PM

#

that's not really noise, just missing info. the thing is, if you are making up a definition yourself in which you assign a label by hand to some set of vectors using something simple like inequalities, the problem is very easy and you don't need ML 😛

brazen totem May 11, 2022, 3:37 PM

#

so id just use a regression?

wooden sail May 11, 2022, 3:37 PM

#

if you only have 3 parameters, you wouldn't even need that

brazen totem May 11, 2022, 3:38 PM

#

so you think I should go the opposite direction and see how the aggregate stats NOT included in the 3 parameters predicts class?

wooden sail May 11, 2022, 3:38 PM

#

but for the sake of argument, let's say these 3 numbers are indeed noisy with unknown noise stats. and that maybe the labelling process is a bit more complicated. then it makes sense to use ML.

#

well, why would you not include the 3 params?

#

the question i'm asking you is, is the problem so easy that you don't really need ML? there's no point in hiding the parameters "just because"

brazen totem May 11, 2022, 3:39 PM

#

wouldn't they perfectly predict class? since those are the only inputs

wooden sail May 11, 2022, 3:39 PM

#

if there is no noise, yes

brazen totem May 11, 2022, 3:39 PM

#

there is lots of noise

wooden sail May 11, 2022, 3:40 PM

#

so my question goes kind of like this

#

let's say we have numbers a and b in a vector, [a,b]. if a > c_0 and b > c_1, then we assign class A

#

so on and so forth. this gives us 4 classes.

#

next we receive the numbers [a,b] from some scenario we've never seen before. do we know anything about a and b a priori? can we trust the numbers, or do they have mistakes?

brazen totem May 11, 2022, 3:41 PM

#

these numbers have mistakes

wooden sail May 11, 2022, 3:41 PM

#

and going a step further, do we know c_0 and c_1, or not?

brazen totem May 11, 2022, 3:41 PM

#

we define c_0 and c_1

wooden sail May 11, 2022, 3:44 PM

#

all right. then it does make sense. it's a probability estimation problem. how certain one is that the underlying a > c_0 given only a noisy a, and having learned noise statistics from previous examples

brazen totem May 11, 2022, 3:45 PM

#

alright so how would I go about coding that?

#

should I use sample size to determine how noisy a/b are expected to be?

#

and then take into account that noise when assigning class?

#

so instead of saying this player is 100% class Y or class Z

#

we say there's a 40% chance of them being class Y and a 60% chance of them being class Z?

wooden sail May 11, 2022, 3:47 PM

#

i don't think you have an appropriate model to say that though, unless you have good reason to make a probability distribution dependent on those 3 numbers

#

if most of the data is labelled correctly, it should be ok as is

brazen totem May 11, 2022, 3:48 PM

#

poker standard deviation is generally agreed upon? although depending on playstyle it ranges

wooden sail May 11, 2022, 3:48 PM

#

i can't comment on that, idk

brazen totem May 11, 2022, 3:48 PM

#

but if it isn't needed I won't bother with it lol

brazen totem May 11, 2022, 3:49 PM

#

wooden sail if most of the data is labelled correctly, it should be ok as is

so how do we decide when the model incorrectly labels a player?

#

vs correctly

#

yes there;s noise in the input stats but if the input stats are all we use to classsify wouldn't it necessarily be 100% accurate?

wooden sail May 11, 2022, 3:52 PM

#

the only way is to have some ground truth you labeled by hand, then

#

since the labelled data using only thresholds can be incorrect

brazen totem May 11, 2022, 3:53 PM

#

this feels like a chicken and egg problem

wooden sail May 11, 2022, 3:53 PM

#

you can't very it because you don't know the ground truth

#

this is more a clustering problem than a classification one, then

#

there is nothing to very against

brazen totem May 11, 2022, 3:53 PM

#

yeah actual player type is impossible to be certain about

#

we can only have a level of certainty over the stats we have

wooden sail May 11, 2022, 3:54 PM

#

then maybe approach is as clustering instead

brazen totem May 11, 2022, 3:55 PM

#

clustering would be letting the data catogorize itself?

wooden sail May 11, 2022, 3:55 PM

#

pretty much

#

though in fairness, even if the data has a few mistakes in the labelling, it should still work

brazen totem May 11, 2022, 3:56 PM

#

what if it just says nothing though? that there's too much noise to be certain that any player type exists

wooden sail May 11, 2022, 3:56 PM

#

that's a result in itself, then, isn't it?

brazen totem May 11, 2022, 3:56 PM

#

yeah but that's pretty obviously incorrect

wooden sail May 11, 2022, 3:57 PM

#

what i mean is that it would mean the data has no inner structure, and the categories were arbitrary from the get-go

#

if that's the case, there is no difference from simply doing a set of inequalities yourself, no network needed

brazen totem May 11, 2022, 3:57 PM

#

wooden sail though in fairness, even if the data has a few mistakes in the labelling, it sho...

what do you mean by this?

brazen totem May 11, 2022, 3:58 PM

#

wooden sail if that's the case, there is no difference from simply doing a set of inequaliti...

alright Ill do this as well I guess

wooden sail May 11, 2022, 3:58 PM

#

don't worry about it. go ahead and try it out

#

label the data somehow and see if the network learns anything interesting

brazen totem May 11, 2022, 3:59 PM

#

so first id
label player type based on those 3 stats
let machine algorithm try to create its own player types

#

see the correlation between the 2?

#

alright that's something I probably know how to do

#

in that case which clustering method should I use

#

I plan on having 5 player types

wooden sail May 11, 2022, 4:02 PM

#

that's one thing. the other is to also treat it as a classifier problem and see what the network does. you'd expect it modifies the threshold values in some sense. idk how well it will work out with so few dimensions though.

brazen totem May 11, 2022, 4:02 PM

#

wooden sail that's one thing. the other is to also treat it as a classifier problem and see ...

how would I do that

#

Ill just do both tbh

wooden sail May 11, 2022, 4:02 PM

#

that would be the usual thing of taking an image and saying it's a dog, take another image and say it's a cat

#

and feed them to the network

#

i would say to try both approaches, yeah

#

see if you find anything interesting

brazen totem May 11, 2022, 4:04 PM

#

so id include the aggregate statistics into the machine learning model right

#

alongside the main 3 classifiers?

wooden sail May 11, 2022, 4:04 PM

#

you can if you want, why not. you can also test if including them makes any difference, too

brazen totem May 11, 2022, 4:05 PM

#

yeah tbh the one thing ive learned in my economectrics class is that data science is basically trying a bunch of random shit and seeing if it tells you anything

#

😂

wooden sail May 11, 2022, 4:05 PM

#

especially with this type of black box ML you're dealing with here, where the model is based on... not much

#

it gets pretty artistic

#

there are other problems where the architecture and target functions have very good foundations, but i don't think this is one of them 😛 play around and see if anything interesting comes out

brazen totem May 11, 2022, 4:06 PM

#

yeah I'm just trying to see if I can get something useful using hud stats

#

and potentially classify players on the fly

#

using limited/noisy information

misty flint May 11, 2022, 4:10 PM

#

wooden sail especially with this type of black box ML you're dealing with here, where the mo...

causal stats is making a rise since its more explainable than DL

#

blobhyperthink

#

however you usually need an SME for such methods

brazen totem May 11, 2022, 4:11 PM

#

what's an SME?

misty flint May 11, 2022, 4:11 PM

#

subject matter expert

#

for the domain you are working in

#

this is crucial for making sure you are looking at the right features and your assumptions are correct

brazen totem May 11, 2022, 4:12 PM

#

ahhh Id say I know SME

#

I know a player that has gone really deep into this shit

wooden sail May 11, 2022, 4:12 PM

#

right, that's why i was asking you so much annoying stuff about the stats

#

if you could incorporate that, it'd be better

brazen totem May 11, 2022, 4:13 PM

#

how would I incorporate that though?

wooden sail May 11, 2022, 4:13 PM

#

you could turn it into a parametric estimation problem where you update your priors on the distributions

brazen totem May 11, 2022, 4:13 PM

#

use a SME to manually classify each player?

wooden sail May 11, 2022, 4:13 PM

#

basically have ML-powered bayesian estimation

#

if you could assign a parametric family to how each of the numbers you feed the network are distributed, that'd be a good start

brazen totem May 11, 2022, 4:14 PM

#

ah but it depends on the sample size

#

say you have 500 hands vs 50,000 hands

wooden sail May 11, 2022, 4:14 PM

#

you mentioned variance earlier, but my guess is that that is precisely... a guess. that the numbers were assumed to be gaussian distributed

brazen totem May 11, 2022, 4:14 PM

#

the variance will be immensely different

wooden sail May 11, 2022, 4:15 PM

#

the sample variance, yes

#

the true variance of the underlying distribution, no

brazen totem May 11, 2022, 4:15 PM

#

ah I use poker variance calculator

wooden sail May 11, 2022, 4:15 PM

#

the idea is to find the parametric model that best explains the observed data

brazen totem May 11, 2022, 4:15 PM

#

ahhh

#

how would I do that

wooden sail May 11, 2022, 4:16 PM

#

you can read into maximum likelihood and bayesian estimation layer

#

for now i'd suggest to try what we discussed now, and see if it does what you wanted

brazen totem May 11, 2022, 4:17 PM

#

these stats are generally regarded to be normally distributed

#

but I'm not sure why

wooden sail May 11, 2022, 4:18 PM

#

wave your hands wildly while yelling central limit theorem

#

then the name of the game is hyperparameters

#

each of your 3 statistics is contaminated by noise, and so it has its own mean and variance for what you say is usually regarded a normal distribution

brazen totem May 11, 2022, 4:19 PM

#

ah yep

wooden sail May 11, 2022, 4:19 PM

#

the idea is to have the network estimate the mean somehow

#

and then to realize that both the mean and variance have their own... mean and variance

#

if the variance is low, you can trust the result (these are the so-called "error bars" and are related to p values, as some people call them)

#

then you can attach a certainty to your classification

brazen totem May 11, 2022, 4:20 PM

#

ahhhhh

#

that sounds exactly what I wanted to do

wooden sail May 11, 2022, 4:21 PM

#

you can do that with usual classification btw, but that does not consider the statistics of the measured parameters explicitly

#

i would still recommend you begin with what we discussed earlier because this other stuff can get hairy. if you have time constraints, gotta use that time wisely

brazen totem May 11, 2022, 4:22 PM

#

yeah I have literally 24 hours

wooden sail May 11, 2022, 4:22 PM

#

yeah, no

#

do the other stuff

#

do it like a classical image classification problem

brazen totem May 11, 2022, 4:22 PM

#

but this 3rd thing seems super interesting

#

how would I go about doing it? ill do it last if I have time

wooden sail May 11, 2022, 4:24 PM

#

you would need to make a custom cost function based on the maximum likelihood estimation. since this is gaussian, funnily enough, it's the same as doing least squares. however, the knowledge that each parameter is gaussian distributed instead of normal distributed means there is an extra term multiplied to it (added, after taking the log-likelihood) that depends on the statistics of each parameter

brazen totem May 11, 2022, 4:26 PM

#

yeah I have zero idea how to do that lol

#

I've learned about maximum likelihood estimation but I thought it was for binary prediction

wooden sail May 11, 2022, 4:28 PM

#

i would more call it a "philosophy" or solution approach. make a cost function that searches for the parameters that maximize the probability of observing the given data

#

it can be done in many different settings

brazen totem May 11, 2022, 4:29 PM

#

shit if only I had an extra week to work on this

#

ah well I can always do it as like a curiosity after the due date

brazen totem May 11, 2022, 4:47 PM

#

and for labeling I should only have one column for it right?

#

rather than 5 different columns with binary 0/1?

wooden sail May 11, 2022, 4:49 PM

#

either is ok. the difference is whether you use categorical or sparse categorical cross entropy

brazen totem May 11, 2022, 4:49 PM

#

what does that mean

wooden sail May 11, 2022, 4:50 PM

#

the cost function to minimize is different, but the problems are equivalent

brazen totem May 11, 2022, 4:50 PM

#

ah if they can have multiple classes at once?

#

ah nvm

#

they legit are just the same

wooden sail May 11, 2022, 4:51 PM

#

yep

brazen totem May 11, 2022, 4:51 PM

#

im lazy so id rather have 1 column only

full oar May 11, 2022, 5:11 PM

#

Hey everyone do anyone know how to implement this kind node chart in python in gui

#

Screenshot_2022-05-11-22-06-52-43_40deb401b9ffe8e1df2f1cc5ba480b12.jpg

mighty spoke May 11, 2022, 5:33 PM

#

Hi i'm trying to create an algorithm to bin my data but i'm not sure how to build it, like i could create a for loop and an if statement for each but that would be very long

versed gulch May 11, 2022, 5:36 PM

#

Does anyone know if Python has any deconvolution algorithms for image preprocessing/processing and if so could you share the links to them. As I've only found the weiner and richard_lucy ones?

eager cloak May 11, 2022, 5:39 PM

#

Heya!

#

How can I make it so that it just prints the response? (in this case, Hi, my friend, what can I do for you today?)

wooden sail May 11, 2022, 5:45 PM

#

mighty spoke Hi i'm trying to create an algorithm to bin my data but i'm not sure how to buil...

if you already have the values of t in a vector and have chosen T, you can compute all the values of tau. then, once you have the tau values, apply to all of them the procedure described in the text. in reality, splitting tau into bins follows the same formula that one uses to find the "phase" of the observations. having chosen M, you divide the period T by M, let's call this w = T/M. this is the width of each bin. then to find the bin index, you take tau/w and round down, i.e. floor(tau/w). if you have the values of t and/or tau in a vector, this requires no loops at all if using numpy arrays, for example.

cursive quest May 11, 2022, 5:45 PM

#

eager cloak How can I make it so that it just prints the response? (in this case, `Hi, my fr...

The response is a dictionary, so to refer to the part of the response you want to print you would refer to it as response.json()['cnt']

eager cloak May 11, 2022, 5:46 PM

#

cursive quest The response is a dictionary, so to refer to the part of the response you want t...

okay thank you

#

❤️ kiss_void

cursive quest May 11, 2022, 5:47 PM

#

pepoThumb

mighty spoke May 11, 2022, 5:52 PM

#

wooden sail if you already have the values of t in a vector and have chosen T, you can compu...

Hi so I have the values of tau(phases) in a array and I tried using pandas to bin it but i'm not sure its working right

Tp=1/fr#time period
print(Tp)
phases = foldAt(t, Tp, T0=0)

plt.figure()
#data is the intensity


x7, y7 = zip(*sorted(zip(phases, data)))#ensures x and y values correspond to each others in pairs when sorted

plt.plot(x7, y7)

df4 = pd.DataFrame({'X' : x7, 'Y' : y7})  #we build a dataframe from the data
M=20#no of bins

bins=np.arange(0, max(phases), step=Tp/M)


categorical_object = pd.cut(x7, bins)
count=pd.value_counts(categorical_object)
grp = df4.groupby(by = categorical_object)        #we group the data by the cut
ret = grp.aggregate(np.mean)```

wooden sail May 11, 2022, 5:55 PM

#

pd.cut looks likei t should do what you ask it to, yeah

#

pd is what you imported pandas as, right? should it be count = x7.value_counts(...) ?

native ibex May 11, 2022, 5:59 PM

#

I am new to data science and ai and I am a self taught learn.
What are basic topic to learn

wooden sail May 11, 2022, 6:00 PM

#

you wanna check out stuff like numpy, pandas, tensorflow, pytorch, jax on the python side

#

and at least some multivariable calculus, linear algebra, statistics, and optimization on the math side

mighty spoke May 11, 2022, 6:05 PM

#

wooden sail pd is what you imported pandas as, right? should it be count = x7.value_counts(....

when i do that it says ``` count=x7.value_counts(categorical_object)

AttributeError: 'tuple' object has no attribute 'value_counts'```

mighty spoke May 11, 2022, 6:05 PM

#

wooden sail pd.cut looks likei t should do what you ask it to, yeah

its giving wiered values like just 2's in a small amount of bins and zero in others

wooden sail May 11, 2022, 6:08 PM

#

seems about right. it'll depend on your actual data though, you'd have to plot it and see

#

try df4.value_counts, then? i'm not sure what a good way to do this with pandas is, tbh. i'd just do it with numpy

mighty spoke May 11, 2022, 6:10 PM

#

wooden sail seems about right. it'll depend on your actual data though, you'd have to plot i...

yeah still gives an error

mighty spoke May 11, 2022, 6:10 PM

#

wooden sail try df4.value_counts, then? i'm not sure what a good way to do this with pandas ...

how would i do it with numpy

wooden sail May 11, 2022, 6:12 PM

#

just do the math operations on the phases. or easier, use numpy histogram, since it seems that's what you're trying to make

mighty spoke May 11, 2022, 6:12 PM

#

wooden sail just do the math operations on the phases. or easier, use numpy histogram, since...

but i still would have to put the values into the bins right

wooden sail May 11, 2022, 6:13 PM

#

https://numpy.org/doc/stable/reference/generated/numpy.histogram.html

#

this would do everything for you

woven coral May 11, 2022, 6:32 PM

#

How to use TF IDF vectorizer with GRU in Keras Python

#

anyone knows???

serene scaffold May 11, 2022, 6:38 PM

#

woven coral How to use TF IDF vectorizer with GRU in Keras Python

to do what?

woven coral May 11, 2022, 6:38 PM

#

fake news detection

#

#

#

??

odd meteor May 11, 2022, 6:53 PM

#

native ibex I am new to data science and ai and I am a self taught learn. What are basic top...

!Resources

arctic wedgeBOT May 11, 2022, 6:53 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

odd meteor May 11, 2022, 6:55 PM

#

Regression, Classification, Dimensionality Reduction (LDA, PCA), Clustering. Those are the basic topics

odd meteor May 11, 2022, 7:07 PM

#

woven coral How to use TF IDF vectorizer with GRU in Keras Python

You'd need to feed your tokenized text to your NN. So apply TF-IDF before feeding it to GRU. Or better still, use the tokenizer function in Keras directly.

from keras.preprocessing.text import Tokenizer
tokenizer = Tokenizer()
tokenizer.fit_on_texts('pass your input text here')

If you want to pad the sequence (text to sequence), then you need to also import the pad_sequence function from Keras

woven coral May 11, 2022, 7:41 PM

#

thank u so much

#

#

#

why error????

odd meteor May 11, 2022, 7:51 PM

#

woven coral why error????

Sorry, I omitted 's'. It should be fit_on_texts... It's been fixed now.

woven coral May 11, 2022, 7:55 PM

#

let me check

brazen totem May 11, 2022, 7:56 PM

#

should I use density based hierchal or partional clustering

#

when I expect the different groups to have some overlap

woven coral May 11, 2022, 7:56 PM

#

#

error

odd meteor May 11, 2022, 7:59 PM

#

woven coral error

What's your X?

woven coral May 11, 2022, 8:00 PM

#

vector

#

#

usig tf idf vectorizing

#

using

#

i think this method is not working on tf idf

#

its working on word2vec

frigid elk May 11, 2022, 8:27 PM

#

what's a good resource for how to best structure my feature set for a binary classification problem of predicting an event 2 months in the future? .. not sure how my features should be built . ... one column for rolling 12 month average for sales vs 12 columns, one for every month in the past 12 months. ... or a mixture of both. ... looking for some examples

odd meteor May 11, 2022, 8:28 PM

#

Replace where you have X with a random text, say, "I love cats" to see how it works. The problem is from your 'X'

frigid elk May 11, 2022, 8:28 PM

#

obviously, other measures than sales, but looking for structure

odd meteor May 11, 2022, 8:33 PM

#

woven coral vector

It ought to be a text or list of texts.

#

https://www.tensorflow.org/api_docs/python/tf/keras/preprocessing/text/Tokenizer

TensorFlow

tf.keras.preprocessing.text.Tokenizer | TensorFlow Core v2.8.0

Text tokenization utility class.

median moat May 11, 2022, 8:41 PM

#

What are Data Science projects that are good for a resume?

odd meteor May 11, 2022, 8:43 PM

#

median moat What are Data Science projects that are good for a resume?

The one you'll definitely have lots of fun in doing. Personalised projects are the best

lapis sequoia May 11, 2022, 9:19 PM

#

ayo what up anyone got any resources on cnn?

brazen totem May 11, 2022, 9:40 PM

#

how do you determine cluster std for kmean

arctic cliff May 11, 2022, 9:54 PM

#

So in normalization we should subtract the dataset by it's mean then divide by the standard deviation and here's a code

x -= x.mean(axis=0)
x /= x.std(axis=0)

where std from the documentation is simply applying the standard deviation formula with subtracts included:

std = sqrt(mean(x)), where x = abs(a - a.mean())**2.

The question is why should I subtract with the mean before dividing by the standard deviation?

brazen totem May 11, 2022, 9:55 PM

#

you mean why can't you do it in one line?

arctic cliff May 11, 2022, 9:56 PM

#

Nope
since the std formula already subtract by the mean
Why should I still subtract by the mean before dividing by the standard deviation?
Shouldn't I just do x /= x.std(axis=0) instead?

brazen totem May 11, 2022, 9:56 PM

#

isn't x the value from the random sample?

arctic cliff May 11, 2022, 9:56 PM

#

x is the sample

brazen totem May 11, 2022, 9:56 PM

#

yeah

#

so you subtract mean to figure out how much this specific value differs from the mean
then divide it by the standard deviation to see how extreme that deviation is

arctic cliff May 11, 2022, 9:58 PM

#

😳

#

It all makes sense now

brazen totem May 11, 2022, 9:58 PM

#

yeah it's very logical

arctic cliff May 11, 2022, 9:58 PM

#

Thanks a lot!

brazen totem May 11, 2022, 9:58 PM

#

np lol

vagrant trench May 11, 2022, 10:09 PM

#

hello guys , hope you doing well . pls I need help !

#

I need how to retrieve the number of clusters for the kmeans with elbow curve

serene scaffold May 11, 2022, 10:13 PM

#

vagrant trench I need how to retrieve the number of clusters for the kmeans with elbow curve

you pick how many clusters there are, yes?

vagrant trench May 11, 2022, 10:13 PM

#

yes

serene scaffold May 11, 2022, 10:14 PM

#

did you use sklearn to do it?

vagrant trench May 11, 2022, 10:38 PM

#

serene scaffold did you use sklearn to do it?

no, i used " elbow courbe " for that but that oblige me to read the curve to return it

odd meteor May 11, 2022, 10:52 PM

#

vagrant trench no, i used " elbow courbe " for that but that oblige me to read the curve to...

You'd have to get the number of clusters from the plot. And because of that, we'll only be able to know how many clusters are present in your data if you post the Elbow Plot.

vagrant trench May 11, 2022, 10:54 PM

#

odd meteor You'd have to get the number of clusters from the plot. And because of that, we'...

alright , I thought that it was another way to return it , thanks

#

another qst pls , how to plot the fitness curve vs Kmeans ?

odd meteor May 11, 2022, 11:03 PM

#

vagrant trench alright , I thought that it was another way to return it , thanks

There's actually another method to find the number of cluster(s) but Elbow plot is kinda more popular. You can get the number of clusters using KElbowVisualizer. You could do something like this

from sklearn.cluster import KMeans
from yellowbrick.cluster import KElbowVisualizer
print('Elbow Method to determine the number of clusters to be formed:')
Elbow_ = KElbowVisualizer(KMeans(), k=10, timings=False)
Elbow_.fit(train_df)
Elbow_.show()

If you don't fancy this method, you can use Silhouette plot as well.

What I usually do is, first use Elbow Plot, then validate the number of clusters using KElbowVisualizer.

odd meteor May 11, 2022, 11:06 PM

#

vagrant trench another qst pls , how to plot the fitness curve vs Kmeans ?

I'm not sure I understand this Ques. Care to elucidate?

vagrant trench May 11, 2022, 11:07 PM

#

it makes error : No module named 'yellowbrick' !

odd meteor May 11, 2022, 11:07 PM

#

vagrant trench it makes error : No module named 'yellowbrick' !

You need to pip install the yellowbrick package on your machine first.

orchid carbon May 11, 2022, 11:10 PM

#

So.. I used this dataset I don't remember from where of weather pictures (literal sunshine, cloud, etc.)
What I did was use the KNN algorithim to classify the weather type. I am getting near 79% accuracy
The "features" I used (don't know exatcly how it works yet) was the medium of the R, G, and B proportions of every image pixels.
So all Red values of the pixels summed and divided by the amount of pixels.
Does this make sense?
It is analyzing the color context of the image, so as much as it ain't capable of identifying "clouds" or "sunrays" it can identify a color scheme of a "sunshine colored image" per say, I think

vagrant trench May 11, 2022, 11:12 PM

#

odd meteor You need to pip install the yellowbrick package on your machine first.

aaaah okay , thaaanks

vagrant trench May 11, 2022, 11:13 PM

#

orchid carbon So.. I used this dataset I don't remember from where of weather pictures (litera...

I have no idea about KNN 😅

orchid carbon May 11, 2022, 11:13 PM

#

:p

vagrant trench May 11, 2022, 11:14 PM

#

any one here did a genetic algorithm for clustering ?

odd meteor May 11, 2022, 11:21 PM

#

orchid carbon So.. I used this dataset I don't remember from where of weather pictures (litera...

Since you didn't used NN to do the image classification, what it means is, KNN used the pixels in the image as features. RGB with/without their specific location in each respective image for each class was used as a feature.

orchid carbon May 11, 2022, 11:23 PM

#

Yeah. By NN you mean Neural Networks right

odd meteor May 11, 2022, 11:24 PM

#

orchid carbon Yeah. By NN you mean Neural Networks right

https://tenor.com/view/rq-yay-rq-yes-cat-rq-rq-yes-yes-yes-gif-20226294

Tenor

safe moss May 12, 2022, 12:28 AM

#

Can anyone help, I have three dataframes that I want to perform difflib.get_close_matches on before merging them into one single dataframe but I am a bit stuck

vagrant trench May 12, 2022, 12:41 AM

#

@odd meteor can u help me also this time pls 🥺 I want to plot time execution vs clusters number ( time execution in Y axe and number of clusters in X axe ) I don't know how

odd meteor May 12, 2022, 12:50 AM

#

vagrant trench <@519319496868233227> can u help me also this time pls 🥺 ...

from sklearn.cluster import KMeans
num_clusters = range(1, 11)
inertia = []

for i in num_clusters:
    model = KMeans(n_clusters = i, random_state=42, init='k-means++')
    model.fit(train_df)
    inertia.append(model.inertia_)
    
plt.figure(figsize=(10,6))
plt.plot(n_clusters, inertia, marker='o')
plt.title('Elbow Plot: Number of Clusters vs. Inertia')
plt.ylabel('Inertia')
plt.xlabel('Number of Clusters (K)')
plt.show()

hoary wigeon May 12, 2022, 4:52 AM

#

max_temp = weather_df.groupby(['Station.State'])[['Station.City', 'Data.Temperature.Max Temp']]

I want to fetch the only city with max temperature

celest vine May 12, 2022, 6:28 AM

#

Is it possible to scrape only those twitter profiles which have a certain profile pictures?
Suppose I want to scrape profiles with Bored Apes as there profile pic.

celest vine May 12, 2022, 6:31 AM

#

hoary wigeon ```py max_temp = weather_df.groupby(['Station.State'])[['Station.City', 'Data.Te...

max_temp = weather_df[weather_df['temperature'] == max(weather_df['temperature']]```

hoary wigeon May 12, 2022, 6:40 AM

#

celest vine ``` max_temp = weather_df[weather_df['temperature'] == max(weather_df['temperatu...

first i want to group by state

#

then i want to find the city with max temperature in that state

celest vine May 12, 2022, 6:48 AM

#

So, you want to find city with mac temperature for each state?

hoary wigeon May 12, 2022, 6:51 AM

#

celest vine So, you want to find city with mac temperature for each state?

yes

rose agate May 12, 2022, 6:57 AM

#

hoary wigeon ```py max_temp = weather_df.groupby(['Station.State'])[['Station.City', 'Data.Te...

try weather_df.groupby('Station.State')['temperature'].max()

brazen totem May 12, 2022, 7:09 AM

#

how do you run dbscan machine learning with a ton of Nan values

#

I'm trying to use several columns in my prediction so naturally there's gonna be a lot of observations with at least 1 Nan

#

do I just restrict the model to columns where I expect Nan to be the lowest and drop em or is there a better way to deal with missing values

#

replacing everything with 0 will fuck up a lot of stuff

#

ill just restrict the model to columns where I expect Nan to be lowest and drop Nan for now but hopefully someone comes up with a better solution lol

#

what did I do wrong here lol

hoary wigeon May 12, 2022, 7:25 AM

#

rose agate try `weather_df.groupby('Station.State')['temperature'].max()`

city with max temperature for each state

brazen totem May 12, 2022, 7:28 AM

#

nvm figured out the graph

rose agate May 12, 2022, 7:31 AM

#

hoary wigeon city with max temperature for each state

ah, maybe weather_df.loc[weather_df.groupby('Station.State')['temperature'].idxmax()]['city']

safe moss May 12, 2022, 7:41 AM

#

is anyone able to tell me how i would be able to keep the 0 index when i am filtering like this please?

#

i would like to keep all the questions intact

jagged hornet May 12, 2022, 7:43 AM

#

I cant understand the error

jagged hornet May 12, 2022, 7:43 AM

#

jagged hornet I cant understand the error

@safe moss

brazen totem May 12, 2022, 7:46 AM

#

safe moss is anyone able to tell me how i would be able to keep the 0 index when i am filt...

maybe add "or year of the answer == 'Year'" in the inequality?

#

you have to explicitly single out the first row that's the only way I think

hoary wigeon May 12, 2022, 7:50 AM

#

rose agate ah, maybe `weather_df.loc[weather_df.groupby('Station.State')['temperature'].idx...

That helped me thank you 😊

safe moss May 12, 2022, 8:04 AM

#

brazen totem maybe add "or year of the answer == 'Year'" in the inequality?

for some reason it does this i have no idea why but i agree with your premise

brazen totem May 12, 2022, 8:05 AM

#

nono OR

#

not and

safe moss May 12, 2022, 8:05 AM

#

ah yes

#

i have tried OR and '|' but neither has worked

#

same result

#

wait maybe i need to reload the data

#

thank you that worked, with | rather than or

brazen totem May 12, 2022, 8:07 AM

#

ah np

worthy phoenix May 12, 2022, 8:26 AM

#

is there any way to update sklearn models to the newest version of sklearn?, so it works with the newer versions?

brazen totem May 12, 2022, 8:43 AM

#

yeah you should use the -U

worthy phoenix May 12, 2022, 8:44 AM

#

i mean i have the model dumped already, im just trying to load it with pickle and gives error

worthy phoenix May 12, 2022, 8:44 AM

#

brazen totem yeah you should use the -U

i dumped it with, 0.19.0 and im now on 1.0.2

#

ModuleNotFoundError: No module named 'sklearn.feature_extraction.dict_vectorizer'

brazen totem May 12, 2022, 8:46 AM

#

you tried uninstalling and reinstalling already?

worthy phoenix May 12, 2022, 8:46 AM

#

i mean im on a separate work environment rn so ye i had to install everything from scratch

brazen totem May 12, 2022, 8:48 AM

#

in what ive seen online it might just be misnamed

#

you checked the folder to make sure you're typing it right?

worthy phoenix May 12, 2022, 8:49 AM

#

ye, thats not the issue, basically the issue is that when there is sklearn version mismatch. For example, trying to deserialize a sklearn(>= 0.22.X) object dumped with another sklearn version < 0.22.X. since sklearn introduced a change between those version

brazen totem May 12, 2022, 8:49 AM

#

ahhh so you're trying to use old code with new functions

#

and it breaks down

worthy phoenix May 12, 2022, 8:49 AM

#

yes

brazen totem May 12, 2022, 8:50 AM

#

other than redoing the code the new way idk what fixes there are

worthy phoenix May 12, 2022, 8:50 AM

#

pain, i dont have the dataset to train again

#

;-;

#

dont even remember how i found it

brazen totem May 12, 2022, 8:51 AM

#

or maybe you can like manually install an old version of the packages

#

after uninstalling the new one

#

if you remember which version it was that is

worthy phoenix May 12, 2022, 8:51 AM

#

ye it was 0.19.0

#

i remember that

#

but i need some dependencies that need higher versions of scikit-learn

#

so i cant even downgrade it

brazen totem May 12, 2022, 8:52 AM

#

if this has any solution it's gonna be hella bullshit is all I know

worthy phoenix May 12, 2022, 8:53 AM

#

ima google around more to see if there is a way

brazen totem May 12, 2022, 8:53 AM

#

@wooden sail
you got any clue how to interpret this?

#

this is optics modeling of my poker data using 5 different variables normalized

worthy phoenix May 12, 2022, 8:58 AM

#

very cool stackoverflow 👍
https://stackoverflow.com/questions/62283893/scikit-learn-upgrade-from-0-19-1

Stack Overflow

scikit learn upgrade from 0.19.1

I trained some data science models with scikit learn from v0.19.1. The models are stored in a pickle file. After upgrading to latest version (v0.23.1), I get the following error when I try to load ...

brazen totem May 12, 2022, 8:58 AM

#

LOL

#

yeah that's what I would expect

worthy phoenix May 12, 2022, 8:59 AM

#

guess gotta find the training data again, rip, they should have maintained model persistancy for older versions as well

#

;-;

brazen totem May 12, 2022, 9:00 AM

#

they dont care lmao

safe moss May 12, 2022, 9:01 AM

#

can anyone tell me what the point of doing something like .fillna(0) to get rid of NaN is because as soon as I do something else to the dataframe the NaNs just come back?

worthy phoenix May 12, 2022, 9:03 AM

#

brazen totem they dont care lmao

LMAO, i hate my brain got an idea

#

ima just edit the pickle with the newer module names

brazen totem May 12, 2022, 9:04 AM

#

hopefully that works ?

worthy phoenix May 12, 2022, 9:04 AM

#

idk, thats the only hack which comes in my mind rn

#

anddddd it workeddddd

#

bruhhhhhhh

brazen totem May 12, 2022, 9:12 AM

#

nice

odd meteor May 12, 2022, 9:13 AM

#

safe moss can anyone tell me what the point of doing something like .fillna(0) to get rid ...

You need to save it once you've done so. You can either use inplace = True or df = df.fillna(0)

safe moss May 12, 2022, 9:14 AM

#

odd meteor You need to save it once you've done so. You can either use `inplace = True` or ...

thanks for the reply bro ill use that

safe moss May 12, 2022, 9:34 AM

#

is anyone able to assist with seaborn

sterile rivet May 12, 2022, 11:44 AM

#

Any idea how to convert a 1gb json file to csv?

tidal bough May 12, 2022, 11:51 AM

#

sterile rivet Any idea how to convert a 1gb json file to csv?

!pypi json-stream You can use something like that to not have to hold it all in memory

arctic wedgeBOT May 12, 2022, 11:51 AM

#

json-stream v1.3.0

Streaming JSON decoder

frigid elk May 12, 2022, 11:57 AM

#

sterile rivet Any idea how to convert a 1gb json file to csv?

also, don't know your situation, but it may be worth looking into parquet or some filetype with a splittable compression algorithm

sterile rivet May 12, 2022, 11:58 AM

#

I have converted it using to_csv how do I download it?

sleek tapir May 12, 2022, 1:23 PM

#

is anyone doing andrew ngs

#

course (again next month)

rotund ledge May 12, 2022, 1:39 PM

#

I'm trying to get an animation of filling in a 3d line (to visually represent a line integral)
Effectively this: https://pythonguides.com/matplotlib-fill_between/#Matplotlib_fill_between_animation
It's not working though

import matplotlib.pyplot as plt
import matplotlib.patches as mpatches
import matplotlib.animation as mtA
import numpy as np
from mpl_toolkits.mplot3d.art3d import Poly3DCollection

nsteps = 10000
t_init = 0.0
t_final = 20.0
t=np.linspace(t_init,t_final, nsteps)

rectangles=30
val = np.empty(rectangles + 1)
ts = np.empty(rectangles + 1)
tp = np.empty(rectangles + 1)

#parameterization of curve
def x_param ():
    #x=np.cos(t)
    x=t
    return x

def y_param ():
    #y = np.sqrt(t)
    y=t
    return y

def z_param ():
    z=t
    return z

def reinman ():
    pass

def main ():
    fig =plt.figure()
    ax=fig.add_subplot(projection='3d')
    #line, = ax.plot([], [], [])
    ax.plot(x_param(),y_param(),z_param(),color="orange")

    set01xtoline = [x_param(), y_param(), np.zeros(len(t))]
    set1 = [x_param(), y_param(), z_param()]
    zZ=np.zeros(len(t))

    def animate(i):
        x3=x_param()
        y3=y_param()
        z3=z_param()
        xA=x3[0:(i-1)]
        yA=y3[0:(i-1)]
        zA=z3[0:(i-1)]
       
        p = fill_between_3d(ax, xA,yA,zZ, xA,yA,zA, mode=1, c="C0")
        return p

    ani = mtA.FuncAnimation(fig,animate, frames=1000, interval=50000)
    #fill_between_3d(ax, *set01xtoline, *set1, mode=1, c="C0")
    plt.show()

if __name__ == '__main__':
    main()```

#

Fill 3d function I'm using

def fill_between_3d(ax, x1, y1, z1, x2, y2, z2, mode=1, c='steelblue', alpha=0.4):
    if mode == 1:

        for i in range(len(x1) - 1):
            verts = [(x1[i], y1[i], z1[i]), (x1[i + 1], y1[i + 1], z1[i + 1])] + \
                    [(x2[i + 1], y2[i + 1], z2[i + 1]), (x2[i], y2[i], z2[i])]

            ax.add_collection3d(Poly3DCollection([verts],
                                                 alpha=alpha,
                                                 linewidths=0,
                                                 color=c))

    if mode == 2:
        verts = [(x1[i], y1[i], z1[i]) for i in range(len(x1))] + \
                [(x2[i], y2[i], z2[i]) for i in range(len(x2))]

        ax.add_collection3d(Poly3DCollection([verts], alpha=alpha, color=c))```

tacit basin May 12, 2022, 1:56 PM

#

sleek tapir is anyone doing andrew ngs

Andrew has more than one course :) which one you are referring to?

vagrant trench May 12, 2022, 2:00 PM

#

hello guys , I want to plot time execution ( in y label ) vs clusters number ( in x label ) , how can I do it pls ? PS : I have the time execution and clusters number

distant horizon May 12, 2022, 2:14 PM

#

safe moss is anyone able to assist with seaborn

Might be able to, what the question

misty flint May 12, 2022, 2:20 PM

#

https://gfxspeak.com/2022/02/28/makes-movie-scientists/