#data-science-and-ml | Python | Page 313

velvet thorn May 20, 2021, 11:48 AM

#

one thing that helped

#

was like

#

thinking about taking slices

#

across axes

#

and what shapes the results would be

lapis sequoia May 20, 2021, 12:03 PM

#

Hey anyone got a chance to play around with DataSpell?

sly salmon May 20, 2021, 12:06 PM

#

I feel like pandas is much more manageable for labeled data compared to numpy, but I heard that pandas is at times 20x slower

tall zinc May 20, 2021, 12:06 PM

#

Woo, finally got around to rebuilding the main solve loop for my Keep Talking and Nobody Explodes bot, so it can now solve multiple concurrent modules at once rather than having to do them all one at a time
(Seemed relevant what with the vast majority of what it does being computer vision)
https://www.youtube.com/watch?v=ZSGCO4eFRJE

YouTube

slurpleslixie

all the morse

▶ Play video

lapis sequoia May 20, 2021, 12:08 PM

#

sly salmon I feel like pandas is much more manageable for labeled data compared to numpy, b...

#

but at the end you will just execute a cell and continue working with edited variable

#

hence you won't feel it as much

sly salmon May 20, 2021, 12:09 PM

#

tall zinc Woo, finally got around to rebuilding the main solve loop for my Keep Talking an...

that looks so rad!

tall zinc May 20, 2021, 12:09 PM

#

Thanks :)

sly salmon May 20, 2021, 12:09 PM

#

tall zinc Thanks :)

how did you end up achieving that? what libraries did you use? that's so impressive

tall zinc May 20, 2021, 12:10 PM

#

OpenCV for all the Computer Vision stuff, the overlay is from wxPython, pynput for sending mouse input and mss for the screen capture

#

And tesseract for the OCR (at least when I couldn't be bothered to hack something more accurate together with opencv)

#

It does all the other (non-needy) modules too, though it takes a long time with the one with lots of words because OCR is slooow and it has to do a few passes to make sure it's right
https://www.youtube.com/watch?v=DvTNRo8tCqo

YouTube

slurpleslixie

KCANE AI vs 8 modules

Keep Cheating and Nobody Explodes bot trying out 8 modules.

Still need to update a bunch of the drawing and text and its display in general, but I think the non-needy modules are all basically finished at this point

All written in Python, using OpenCV for the image processing/computer vision parts and a bit of wxPython for the display window ...

▶ Play video

#

Still working on making the drawing output more often and more informatively

tall zinc May 20, 2021, 12:19 PM

#

tall zinc OpenCV for all the Computer Vision stuff, the overlay is from wxPython, pynput f...

The vast majority of what opencv is being used for is calling inRange() on hsv images and finding contours. And some eroding and dilating but most all of it is just using that. The keypad with symbols on it uses matchShape (and a ShapeContextDistanceExtractor when matchShape is lying) but otherwise it's largely just testing for certain colours and sizes of resulting contours and then using handwritten logic to work out what that means

lapis sequoia May 20, 2021, 12:40 PM

#

what does the word prior mean in terms of ML? I'm reading a paper in which they say that

The construction phase is prior-driven, not data-driven—-data comes in only at the learning phase.

Please let me know if anyone can help and if more detail is needed. Thanks.

desert oar May 20, 2021, 1:01 PM

#

It didn't change for me, so I think it's read-only.

desert oar May 20, 2021, 1:01 PM

#

lapis sequoia what does the word `prior` mean in terms of ML? I'm reading a paper in which the...

They mean the "prior probability" as in Bayesian statistics

cyan lantern May 20, 2021, 1:02 PM

#

anyone here familiar with sparse matrices and how they work

lapis sequoia May 20, 2021, 1:02 PM

#

desert oar They mean the "prior probability" as in Bayesian statistics

yeah that's what i thought so. but what do we mean by prior-driven and data-driven.
I've just talked with a friend of mine, and what he suggested is instead of giving data to our model to learn we give certain properties for granted.
(btw in this approach we use very less data so above suggestion made kinda sense to me.)

desert oar May 20, 2021, 1:03 PM

#

yes i think your friend's interpretation is reasonable

#

from Bayes' theorem we have P(θ|Y) ∝ P(Y|θ) * P(θ) where θ is our model parameter and Y is our data. if we don't have a lot of data, our estimates of P(θ|Y) will depend more strongly on our assumptions of P(θ)

desert oar May 20, 2021, 1:05 PM

#

cyan lantern anyone here familiar with sparse matrices and how they work

it's better to just ask your question, then if someone has an answer they can just answer without extra back-and-forth

#

"don't ask to ask", is the saying

cyan lantern May 20, 2021, 1:06 PM

#

well im running into issues with predictions after building a model

lapis sequoia May 20, 2021, 1:07 PM

#

desert oar from Bayes' theorem we have `P(θ|Y) ∝ P(Y|θ) * P(θ)` where `θ` is our model para...

yeah that makes a lot more sense now. Thanks a lot for answering : )

cyan lantern May 20, 2021, 1:07 PM

#

my training data after preprocessing have different number of features to the test data, which raised an error during prediction stage

#

both test and training datasets were preprocessed the same way

#

Screen_Shot_2021-05-20_at_7.11.54_pm.png

desert oar May 20, 2021, 1:13 PM

#

this doesn't appear to be relevant to sparse matrices btw

#

can you show your full code?

#

at least how clf is defined

#

my training data after preprocessing have different number of features to the test data
basically, this should not ever happen

cyan lantern May 20, 2021, 1:16 PM

#

https://stackoverflow.com/questions/67619930/sparse-matrices-dimensions-different-after-performing-the-same-preprocessing

Stack Overflow

Sparse matrices dimensions different after performing the same prep...

I am confused with how my test and training sparse matrices have different number of features after performing the same preprocessing
this is preventing me from predicting my test data
def vectoriz...

#

I posted this whole part on stackoverflow

#

the model worked during validation

#

so this works fine (predicting the training data)

clf.predict(X)

cyan lantern May 20, 2021, 1:18 PM

#

desert oar > my training data after preprocessing have different number of features to the ...

that is what I thought as well

desert oar May 20, 2021, 1:19 PM

#

You are not using sklearn correctly

#

you are creating new and separate transformers for each split

#

you don't want to do that

cyan lantern May 20, 2021, 1:19 PM

#

oh

desert oar May 20, 2021, 1:19 PM

#

after .fit-ing a transformer, it keeps the fitted state internally, then you just .transform on the other datasets

cyan lantern May 20, 2021, 1:21 PM

#

so the vectorizer function is not working correctly?

desert oar May 20, 2021, 1:21 PM

#

it cannot work correctly as-written

#

vectorizer = CountVectorizer(stop_words = 'english')
classifier = LogisticRegression(C = 0.01, max_iter = 1000000, penalty = 'l2')

x_train = vectorizer.fit_transform(data_train)
clf.fit(x_train, y_train)
pred_train = clf.predict(x_train)

x_test = vectorizer.transform(data_test)
pred_test = clf.predict(x_test)

#

your code should look something like this

#

better yet, use a "pipeline" to automate the sequence of preprocessing and classifier fitting

from sklearn.pipeline import make_pipeline

clf = make_pipeline(
    CountVectorizer(stop_words = 'english'),
    LogisticRegression(C = 0.01, max_iter = 1000000, penalty = 'l2'),
)

clf.fit(data_train)

pred_train = clf.predict(data_train)
pred_test = clf.predict(data_test)

cyan lantern May 20, 2021, 1:24 PM

#

and does that work with both text features and numerical features?

#

because right now, I am vectorizing each text feature on its own first (though not done correctly), then hstacking them with the numerical features (in np.array form)

#

but what you are showing is that I just preprocess the whole dataset together?

#

or am I misunderstanding it?

desert oar May 20, 2021, 1:27 PM

#

if you only want to apply the CountVectorizer to some dataframe columns but not all, use https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer

#

and here are the Pipeline docs https://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.html#sklearn.pipeline.Pipeline

#

https://scikit-learn.org/stable/modules/compose.html
https://scikit-learn.org/stable/modules/classes.html#module-sklearn.compose
https://scikit-learn.org/stable/modules/classes.html#module-sklearn.pipeline

cyan lantern May 20, 2021, 1:28 PM

#

thank you

desert oar May 20, 2021, 1:29 PM

#

I added an answer on SO as well

lavish bison May 20, 2021, 1:30 PM

#

Hi guys, can anyone help me with this?

warm basin May 20, 2021, 1:35 PM

#

Please I am stuck here. The task is on the picture. Trying to create an API for an image search. I am done with the ml part and the trained model has been assigned to a variable model

warm basin May 20, 2021, 1:43 PM

#

lavish bison Hi guys, can anyone help me with this?

Check this https://datacarpentry.org/python-ecology-lesson/09-working-with-sql/index.html

This is db not sql file

lavish bison May 20, 2021, 1:46 PM

#

warm basin Check this https://datacarpentry.org/python-ecology-lesson/09-working-with-sql/i...

thank you!!!!

warm basin May 20, 2021, 1:50 PM

#

lavish bison thank you!!!!

I hope it help

thorn bobcat May 20, 2021, 1:53 PM

#

yo

#

I'm trying to work on my own style GAN encoder and would like to learn the basics leading upto this do I learn OpenCV, Tensorflow or Keras first?

grave frost May 20, 2021, 2:01 PM

#

is there any specific reason just to build an encoder?

thorn bobcat May 20, 2021, 2:02 PM

#

grave frost is there any specific reason just to build an encoder?

I want to create Cartoons from images and videos.

#

Want to do motion detection and face recognition.

#

also want to turn images into videos.

#

maybe make my own anime

#

pithink i have alot of things I want to do

grave frost May 20, 2021, 2:08 PM

#

that's a whole GAN - I thought you meant encoder seperately

thorn bobcat May 20, 2021, 2:08 PM

#

this is an example of something i want to pull of my own
https://github.com/yuval-alaluf/restyle-encoder

GitHub

yuval-alaluf/restyle-encoder

Official Implementation for "ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement" https://arxiv.org/abs/2104.02699 - yuval-alaluf/restyle-encoder

#

@grave frost I want to work on this and improve it too.

#

this is almost one of the things I'd like to achieve.

cyan lantern May 20, 2021, 2:11 PM

#

desert oar your code should look something like this

somehow it is still not working

desert oar May 20, 2021, 2:21 PM

#

cyan lantern somehow it is still not working

show your code

#

always show your code

cyan lantern May 20, 2021, 2:23 PM

#

Screen_Shot_2021-05-21_at_12.23.23_am.png

#

Screen_Shot_2021-05-21_at_12.23.44_am.png

thorn bobcat May 20, 2021, 2:23 PM

#

thorn bobcat I'm trying to work on my own style GAN encoder and would like to learn the basic...

can i get advice on this?

cyan lantern May 20, 2021, 2:28 PM

#

desert oar show your code

am I not meant to hstack them?

desert oar May 20, 2021, 2:28 PM

#

if you share your code as text it's a lot easier to read than as a screenshot

#

!paste

arctic wedgeBOT May 20, 2021, 2:28 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pydis.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

cyan lantern May 20, 2021, 2:28 PM

#

ah sorry yeah

desert oar May 20, 2021, 2:29 PM

#

hm, the vectorizer might still not be emitting the full array

#

it shouldnt though

#

make sure you restart your notebook in case there are any typos or something

cyan lantern May 20, 2021, 2:30 PM

#

https://paste.pythondiscord.com/ukocaviwiw.makefile

desert oar May 20, 2021, 2:30 PM

#

you can hstack them although i do think ColumnTransformer will be easier to work with

#

that code you wrote looks like it should work

cyan lantern May 20, 2021, 2:33 PM

#

X_test is 10000x8718 and X is 40000x31765

cyan lantern May 20, 2021, 2:34 PM

#

desert oar you can hstack them although i do think `ColumnTransformer` will be easier to wo...

yeah I decided to hstack because dont understand how column transformer work

desert oar May 20, 2021, 2:34 PM

#

cyan lantern X_test is 10000x8718 and X is 40000x31765

yeah this definitely is not what you'd want

#

i would normally expect this to work...

#

let me see if there's a missing flag or something

cyan lantern May 20, 2021, 2:34 PM

#

thanks for helping out btw

desert oar May 20, 2021, 2:42 PM

#

https://replit.com/@maximum__/count-vectorizer#main.py
yeah, i can't reproduce the problem

repl.it

maximum__

count vectorizer

A Python repl by maximum__

#

this works as expected

cyan lantern May 20, 2021, 2:43 PM

#

hmmm

#

even without hstack it is not right

Screen_Shot_2021-05-21_at_12.45.47_am.png

desert oar May 20, 2021, 2:46 PM

#

right, hstack isn't the problem here

cyan lantern May 20, 2021, 2:47 PM

#

I may have found the problem

thorn bobcat May 20, 2021, 2:47 PM

#

anyone here worked with GAN's?

cyan lantern May 20, 2021, 2:47 PM

#

if I do this instead

#

it works fine

topaz epoch May 20, 2021, 2:47 PM

#

Hey guys , how do I start learning ds

cyan lantern May 20, 2021, 2:48 PM

#

so I guess the order matters

topaz epoch May 20, 2021, 2:48 PM

#

Does ds include ai and ml

desert oar May 20, 2021, 2:48 PM

#

cyan lantern so I guess the order matters

what do you mean by that?

#

you shouldn't even be able to run that code with those lines commented out

#

restart your damn notebook

#

and use [''] to get dataframe columns, don't use .

#

(what happens if you have a column called map?)

cyan lantern May 20, 2021, 2:51 PM

#

desert oar you shouldn't even be able to run that code with those lines commented out


other_features = ["n_steps", "n_ingredients"]
features = df_train[other_features]
test_features = df_test[other_features]

name = vectoriser.fit_transform(df_train.name)
test_name = vectoriser.transform(df_test.name)

steps = vectoriser.fit_transform(df_train.steps)
test_steps = vectoriser.transform(df_test.steps)

ingr = vectoriser.fit_transform(df_train.ingredients)
test_ingr = vectoriser.transform(df_test.ingredients)

X = hstack([steps,ingr, name])
X_test = hstack([test_steps, test_ingr, test_name])
y = df_train.duration_label

#

now it works

desert oar May 20, 2021, 2:51 PM

#

restart your notebook anyway

#

it's highly likely that you just had some other variable name hanging around due to a typo

#

OH

#

that's the problem

#

facepalm

#

you need a separate vectorizer for each set of features...

cyan lantern May 20, 2021, 2:52 PM

#

yeah hahahaha

desert oar May 20, 2021, 2:52 PM

#

don't re-use it

#

other_features = ["n_steps", "n_ingredients"]
features = df_train[other_features]
test_features = df_test[other_features]

name_vectoriser = CountVectorizer()
name = name_vectoriser.fit_transform(df_train.name)
test_name = name_vectoriser.transform(df_test.name)

steps_vectoriser = CountVectorizer()
steps = steps_vectoriser.fit_transform(df_train.steps)
test_steps = steps_vectoriser.transform(df_test.steps)

ingr_vectoriser = CountVectorizer()
ingr = ingr_vectoriser.fit_transform(df_train.ingredients)
test_ingr = ingr_vectoriser.transform(df_test.ingredients)

X = hstack([steps,ingr, name])
X_test = hstack([test_steps, test_ingr, test_name])
y = df_train.duration_label

#

columntransformer will be really useful here

cyan lantern May 20, 2021, 2:53 PM

#

yeah I will try to learn that and redo it

#

thank you for the help

desert oar May 20, 2021, 2:57 PM

#

from sklearn.compose import make_column_transformer
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

clf = make_pipeline(
    make_column_transformer(
        ('passthrough', ['n_steps', 'n_ingredients']),
        (CountVectorizer(), 'name'),
        (CountVectorizer(), 'steps'),
        (CountVectorizer(), 'ingr'),
    ),
    LogisticRegression(C=0.01, max_iter=1000000, penalty='l2'),
)

@cyan lantern something like this

sly salmon May 20, 2021, 2:57 PM

#

when using np.mean for rows (axis=1), the array has to be flattened.
So, when using np.mean for columns (axis=0), I'd assume array has to be modified (not flattened - but... stood up?)
Is there a term for this?

desert oar May 20, 2021, 2:57 PM

#

why does it have to be flattened?

sly salmon May 20, 2021, 2:58 PM

#

that's what it does by default

desert oar May 20, 2021, 2:58 PM

#

numpy knows how long each column is, so it can "step over" the right number of elements in the underlying flat array to do the computation

sly salmon May 20, 2021, 2:58 PM

#

desert oar numpy knows how long each column is, so it can "step over" the right number of e...

aah okay, makes sense. Thanks

desert oar May 20, 2021, 2:58 PM

#

yeah it won't internally re-allocate memory for the array

#

that'd be very inefficient

warm basin May 20, 2021, 3:00 PM

#

Please I am stuck here. The task is on the picture. Trying to create an API for an image search. I am done with the ml part and the trained model has been assigned to a variable model

digital aurora May 20, 2021, 3:21 PM

#

Guys, can anybody tell me what all to study under pandas.

#

Like what all are important attributes and functions.

sly salmon May 20, 2021, 3:30 PM

#

does np.percentile sort the array implictly?

#

well, I guess it has to.

sharp herald May 20, 2021, 5:55 PM

#

anyone know what that notation for matrix M means?

rigid tendon May 20, 2021, 5:56 PM

#

sharp herald anyone know what that notation for matrix M means?

wdym notation?

sharp herald May 20, 2021, 5:56 PM

#

that representation

#

because P and Q are both matrices too

heavy bay May 20, 2021, 7:38 PM

#

Hi, so I want to make a program which predicts crypto prices, so what python library should I use for that?. (I am relatively new to ml, I've made a few simple ml projects)

heady tide May 20, 2021, 8:05 PM

#

#

Tensorflow throws an error IndexError: tuple index out of range when I add weights to my validation data

#

but if you see the last two code cellls

#

the shapes are correct for both validation and training

limpid saddle May 20, 2021, 8:39 PM

#

SVM_clf_counts = Pipeline([('vect', CountVectorizer()),
                   ('clf', LinearSVC(C=0.1, max_iter=3000)),
                  ])
SVM_clf_counts.fit(X_train, y_train)
SVM_cnt_pred_tr = LR_clf_counts.predict(X_train)
SVM_cnt_pred_val = LR_clf_counts.predict(X_val)
SVM_cnt_pred_tst = LR_clf_counts.predict(X_test)


print("precision on training: ",precision_score(y_train, SVM_cnt_pred_tr, average='micro'))
print("precision on validation: ",precision_score(y_val, SVM_cnt_pred_val, average='micro'))
print("precision on testing: ",precision_score(y_test, SVM_cnt_pred_tst, average='micro'))```

#

#

I don't understand what the error is in this code, can someone help

desert oar May 20, 2021, 9:24 PM

#

@sharp herald this is "block matrix" notation

#

P stands for "all the elements of P"

#

0 stands for "fill with 0s up to the correct dimensions"

velvet linden May 20, 2021, 9:25 PM

#

so i have this program that checks for the input in the csv file, and then if it is not there, then it writes, but the write part doesn't work for some reason. I also am not getting any errors

desert oar May 20, 2021, 9:26 PM

#

limpid saddle ```py SVM_clf_counts = Pipeline([('vect', CountVectorizer()), ...

you're making predictions with LR_clf_counts, but you fitted SVM_clf_counts, so the LR_clf_counts is probably un-fitted. that's what the error means: you haven't fitted the vectorizer yet, so it has no vocabulary stored.

desert oar May 20, 2021, 9:26 PM

#

velvet linden so i have this program that checks for the input in the csv file, and then if it...

'w' mode means "overwrite the file if it exists". use 'a' mode to add lines to the end of the file.

#

also do not open a file for both reading and writing at the same time. you will make a big mess

velvet linden May 20, 2021, 9:27 PM

#

desert oar also do not open a file for both reading and writing at the same time. you will ...

wait so how do I not open it twice?

sharp herald May 20, 2021, 9:27 PM

#

desert oar <@!336904049121296385> this is "block matrix" notation

thanks

desert oar May 20, 2021, 9:28 PM

#

velvet linden wait so how do I not open it twice?

store the rows in a list, close the file for reading, modify the list as needed, then overwrite the file.

velvet linden May 20, 2021, 9:29 PM

#

sorry

#

I might be a bit dumb here but how do I store the rows in a list

desert oar May 20, 2021, 9:31 PM

#

import csv

uuu = input('user: ')
uu = input('pass0: ')
u = input('pass1: ')

with open('test1.csv') as fp:
    rows = list(csv.reader(fp))

new_rows = []
for row in rows:
    if row == [uuu, uu]:
        print('nogood')
    else:
        new_rows.append([uuu, uu])
        print('end')
rows.extend(new_rows)
del new_rows

with open('test1.csv', 'w') as fp:
    csv.writer(fp).writerows(rows)

#

admittedly i don't understand what this code is supposed to do, but it looks more or less like what you wrote, but without the chance of messing up the files

#

note that i do not .append to rows - i .append to a new list. this is because you should never mutate something that you are iterating over

velvet linden May 20, 2021, 9:33 PM

#

desert oar note that i do _not_ `.append` to `rows` - i `.append` to a _new_ list. this is ...

but it still doesn't work

desert oar May 20, 2021, 9:33 PM

#

what does "doesn't work" mean?

#

what happened, and what were you expecting?

#

note that this is also untested code written by a stranger on the internet, so it could be buggy or incorrect

velvet linden May 20, 2021, 9:34 PM

#

ok

#

so i want it to first take an input, then i want it to read the csv, and if it is not in the csv then write it

#

but its not writing it

desert oar May 20, 2021, 9:36 PM

#

try (uuu, uu) instead of [uuu, uu]

#

i can't remember if csv rows are returned as tuples or lists. probably tuples, so use () and not [].

velvet linden May 20, 2021, 9:37 PM

#

@desert oar that didn't make a difference

desert oar May 20, 2021, 9:38 PM

#

does it never print nogood?

#

what actually does happen

#

and how is it different from your expectations?

#

it might help if you used https://repl.it and posted your code along with an example csv that shows the problem

replit

The collaborative browser based IDE

Replit is a simple yet powerful online IDE, Editor, Compiler, Interpreter, and REPL. Code, compile, run, and host in 50+ programming languages.

#

for the sake of the demonstration, you should save to a different filename so i can see both the inputs and outputs

velvet linden May 20, 2021, 9:39 PM

#

desert oar does it never print `nogood`?

no it doesn't print nogood

velvet linden May 20, 2021, 9:40 PM

#

desert oar for the sake of the demonstration, you should save to a different filename so i ...

what do you mean by this?

desert oar May 20, 2021, 9:40 PM

#

save to test2.csv instead of test1.csv, so that when i run your repl.it post i can re-run it as many times as i want, without overwriting the original file

velvet linden May 20, 2021, 9:40 PM

#

ok

#

so it is alrady on replit

#

@desert oar https://replit.com/@27jkpatel/csv#test1.py

repl.it

27jkpatel

csv

A Python repl by 27jkpatel

lapis sequoia May 20, 2021, 10:40 PM

#

guys, using this api

#

https://pypi.org/project/Google-Images-Search/

PyPI

Google-Images-Search

Search for image using Google Custom Search API and resize & crop the image afterwords

#

How can i know the methods?

#

i havent found documentation anywhere

lapis sequoia May 20, 2021, 10:52 PM

#

lapis sequoia https://pypi.org/project/Google-Images-Search/

https://github.com/arrrlo/Google-Images-Search

GitHub

arrrlo/Google-Images-Search

Search for image using Google Custom Search API and resize & crop the image afterwords using Python - arrrlo/Google-Images-Search

#

where is the docs?

haughty jackal May 20, 2021, 11:09 PM

#

are there any recommendations for resources to use for getting started with a.i and machine learning

limpid saddle May 20, 2021, 11:32 PM

#

id_train, X_train, y_train = ftrain_preprocessed['SentenceId'], ftrain_preprocessed['Phrase'], ftrain_preprocessed['Sentiment']
id_test, X_test, = ftest_preprocessed['SentenceId'], ftest_preprocessed['Sentiment']```

#

I keep getting this error

#

KeyError                                  Traceback (most recent call last)
/opt/conda/lib/python3.7/site-packages/pandas/core/indexes/base.py in get_loc(self, key, method, tolerance)
   3079             try:
-> 3080                 return self._engine.get_loc(casted_key)
   3081             except KeyError as err:

pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc()

pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc()

pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()

pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()

KeyError: 'Sentiment'

The above exception was the direct cause of the following exception:

KeyError                                  Traceback (most recent call last)
<ipython-input-97-bf964073328e> in <module>
      1 id_train, X_train, y_train = ftrain_preprocessed['SentenceId'], ftrain_preprocessed['Phrase'], ftrain_preprocessed['Sentiment']
----> 2 id_test, X_test, = ftest_preprocessed['SentenceId'], ftest_preprocessed['Sentiment']

/opt/conda/lib/python3.7/site-packages/pandas/core/frame.py in __getitem__(self, key)
   3022             if self.columns.nlevels > 1:
   3023                 return self._getitem_multilevel(key)
-> 3024             indexer = self.columns.get_loc(key)
   3025             if is_integer(indexer):
   3026                 indexer = [indexer]

/opt/conda/lib/python3.7/site-packages/pandas/core/indexes/base.py in get_loc(self, key, method, tolerance)
   3080                 return self._engine.get_loc(casted_key)
   3081             except KeyError as err:
-> 3082                 raise KeyError(key) from err
   3083 
   3084         if tolerance is not None:

KeyError: 'Sentiment'```

tidal bough May 20, 2021, 11:33 PM

#

Sentiment is not a key in ftest_preprocessed then

limpid saddle May 20, 2021, 11:33 PM

#

what do i need to fix in the code?

tidal bough May 20, 2021, 11:33 PM

#

You're trying to get a nonexisting column of a dataframe.

lapis sequoia May 20, 2021, 11:33 PM

#

i think understanding what u are doing first will help tho

limpid saddle May 20, 2021, 11:34 PM

#

Ahh I see, I'll try to see why it isn't there even tho it's supposed to be

#

thank you

lapis sequoia May 20, 2021, 11:34 PM

#

u can print ftrain_preprocessed.keys()

slate hollow May 21, 2021, 12:27 AM

#

recurrent = keras.models.Sequential([
    keras.layers.SimpleRNN(1, input_shape=(None, 1))
])
recurrent.compile(loss='mse', optimizer='nadam')
recurrent.fit(X_train, y_train, validation_data=(X_valid, y_valid), epochs=40)```is this supposed to take forever

#

winged stratus May 21, 2021, 2:06 AM

#

slate hollow

40 epochs * ~88 seconds/epoch = 3520 seconds ~ almost 1 hour

#

oof

lapis sequoia May 21, 2021, 2:46 AM

#

= for ever xd

slate hollow May 21, 2021, 3:46 AM

#

import numpy as np
import tensorflow as tf

keras = tf.keras


# returns batch_size number of sequences, each of length len_
def gen_time_series(num_instances: int = 32, len_: int = 64):
    freq1, freq2, offset1, offset2 = np.random.rand(4, num_instances, 1)
    time = np.linspace(0, 1, len_)
    series = 0.5 * np.sin((time - offset1) * (freq1 * 10 + 10))
    series += 0.2 * np.sin((time - offset2) * (freq2 * 20 + 20))
    series += 0.1 * (np.random.rand(num_instances, len_) - 0.5)
    return series.reshape(series.shape + (1,)).astype(np.float32)


seq_len = 50
instance_num = 10 ** 5
train_amt = int(instance_num * 0.6)
val_amt = int(instance_num * 0.2)
raw_data = gen_time_series(instance_num, seq_len + 1)  # +1 for the instance to predict
X_train, y_train = raw_data[:train_amt, :-1], raw_data[:train_amt, -1]
X_valid, y_valid = raw_data[train_amt:val_amt, :-1], raw_data[train_amt:val_amt, -1]

linear = keras.models.Sequential([
    keras.layers.Flatten(input_shape=(seq_len, 1)),
    keras.layers.Dense(1)
])
linear.compile(loss='mse', optimizer='nadam')
linear.fit(X_train, y_train, validation_data=(X_valid, y_valid), epochs=40)
```does anyone know why even when i'm providing the validation data, it isn't showing?

near cosmos May 21, 2021, 6:29 AM

#

I'm on the hunt for a nice image annotation tool. Any recommendations?

old grove May 21, 2021, 6:30 AM

#

Hi, I have recently started learning data science and have a doubt in pandas. Whats does the describe give.. I mean 25th,50th and 75th one basically i didnt understand...The rest i understood...just those 3 i didnt get ?

near cosmos May 21, 2021, 6:34 AM

#

old grove Hi, I have recently started learning data science and have a doubt in pandas. Wh...

They are percentiles. Sort the data, find the value 25% into the list, that's the 25% percentile

old grove May 21, 2021, 7:00 AM

#

near cosmos They are percentiles. Sort the data, find the value 25% into the list, that's th...

These are sorted values..okay...so 25th perc will be (28+29)/2 i.e 29.5 correct

old grove May 21, 2021, 7:02 AM

#

old grove Hi, I have recently started learning data science and have a doubt in pandas. Wh...

So what having 25th perc as 29.5...what does that mean compared to 29.5 ??

#

i mean values will be around,less or more than percentiles in Age..?

near cosmos May 21, 2021, 7:05 AM

#

Another term for them is quartiles. They are cut points in the distribution such that a quarter of the values are below the 1st quartile (25%), half the values are below the 2nd quartile (50%), and so on

old grove May 21, 2021, 7:07 AM

#

near cosmos Another term for them is quartiles. They are cut points in the distribution such...

so lets say my 25th perc id 29.5 and 25th perc comes as 2.75 so i can say that either 2 or 3 values to left of 50th perc will be less than 25th perc value i.e 29.5 correct and same applies for 50 and 75

#

so ok this are percentiles not percent 😃

#

goti it

#

thanks a lot @near cosmos 👍 😃

steel hill May 21, 2021, 9:00 AM

#

#

Would anyone know how to make it so the graphs are transparent? I haven't been able to figure out a way to do this and any help would be highly appreciated. Thank you.

#

oops, my own heatmap didnt upload

#

import os
import json
import sys
import matplotlib
matplotlib.use('TkAgg')
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np
import matplotlib.image as mpimg 
import sqlite3
import seaborn as sns
map = sys.argv[1]
file = f"./{map}.csv"
df = pd.read_csv(file, header=None, usecols=[0,1])
print(df)
map_img = mpimg.imread(f'{map}.png') 
hmax = sns.kdeplot(df[0], df[1], cmap="Reds", shade=True, bw=.15)
hmax.collections[0].set_alpha(0)
if 'metalworks' in map:
    xmin = -3034
    xmax = 3374 
    ymin = -6699
    ymax = 4939
elif 'product' in map:
    xmin = -2859
    xmax = -171
    ymin = -3668
    ymax = 3776
elif 'process' in map:
    xmin = -5222
    xmax = 5216 
    ymin = -3146
    ymax = 3128
plt.imshow(map_img, zorder=0, extent=[xmin, xmax, ymin, ymax],resample=False)
plt.savefig(f'{map} heatmap.png', dpi=1200, transparency=True)
plt.show()```
here is the relevant code btw

desert oar May 21, 2021, 11:33 AM

#

@steel hill you might have to define your own colormap that has transparency, or otherwise need to find a way to set the "alpha" channel for the colors to something less than 1

lapis sequoia May 21, 2021, 12:14 PM

#

import tensorflow as wtf

#

Hi guys, I want to do time series model prediction but I am wondering how I can treat the skewed data here?

desert oar May 21, 2021, 12:47 PM

#

lapis sequoia Hi guys, I want to do time series model prediction but I am wondering how I can ...

is log(UnitPrice) less skewed? that's usually a good place to start. you can also consider the more general family of box-cox transformations, or a transformation using the inverse hyperbolic sine function (https://en.wikipedia.org/wiki/Inverse_hyperbolic_functions#Inverse_hyperbolic_sine), arcsinh(t*x)/t, where t=1 is the standard arcsin function

lapis sequoia May 21, 2021, 12:50 PM

#

lapis sequoia Hi guys, I want to do time series model prediction but I am wondering how I can ...

for the box-cox transformation you can refer to the scikit learn library.

#

https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.PowerTransformer.html

lapis sequoia May 21, 2021, 2:12 PM

#

old grove Hi, I have recently started learning data science and have a doubt in pandas. Wh...

percentiles

#

numbers that leave x% data to the left side and the rest on the right

marble citrus May 21, 2021, 3:19 PM

#

how can I use matplotlib in vs code

tidal bough May 21, 2021, 3:20 PM

#

marble citrus how can I use matplotlib in vs code

Like in anything else? Not sure what you're asking.

marble citrus May 21, 2021, 3:20 PM

#

I am getting module not found when running my code @tidal bough

#

@tidal bough this is what I mean

lapis sequoia May 21, 2021, 3:42 PM

#

marble citrus <@!266216750876459008> this is what I mean

are you running on a virtual env? if so check your installed packages. if not just install using pip.

marble citrus May 21, 2021, 3:42 PM

#

lapis sequoia are you running on a virtual env? if so check your installed packages. if not ju...

no I dont think so

#

is there any way I can check if I am on a virtual environment?

lapis sequoia May 21, 2021, 3:45 PM

#

that status bar below will show your current interpreter.

marble citrus May 21, 2021, 3:47 PM

#

lapis sequoia May 21, 2021, 3:50 PM

#

I think your using the default. I would suggest use notebooks instead when learning. (colab notebook/kaggle/jupyter notebook/jupyterlab).

#

install matplotlib using pip install matplotlib.
then import matplotlib.pyplot as plt

marble citrus May 21, 2021, 3:53 PM

#

lapis sequoia install matplotlib using pip install matplotlib. then import matplotlib.pyplot ...

i have already installed matplotlib (but my environment is the default one)

marble citrus May 21, 2021, 3:54 PM

#

lapis sequoia I think your using the default. I would suggest use notebooks instead when learn...

do i run that in the command pallete?

lapis sequoia May 21, 2021, 3:59 PM

#

One of the downside of using the default is mixing up packages from your other projects that may result to some conflict.

lapis sequoia May 21, 2021, 3:59 PM

#

marble citrus do i run that in the command pallete?

yes run it in the terminal

#

pip install matplotlib

#

import it in your python script using import matplotlib.pyplot as plt

marble citrus May 21, 2021, 4:03 PM

#

lapis sequoia One of the downside of using the default is mixing up packages from your other p...

i have some of my other projects which I made in the normal python environment, will that cause some problems?

charred skiff May 21, 2021, 4:05 PM

#

Good evening has anyone read google mu zero paper here?

marble citrus May 21, 2021, 4:09 PM

#

lapis sequoia I think your using the default. I would suggest use notebooks instead when learn...

this is not working

grave frost May 21, 2021, 4:16 PM

#

lapis sequoia *import tensorflow as wtf*

ayyy don't abuse ma boi like that

lapis sequoia May 21, 2021, 4:36 PM

#

who?

#

tensorflow as your boi?

#

ew

fresh zenith May 21, 2021, 4:41 PM

#

is there a way

#

where i can make a constantly updating graph

#

in matplotlib?

tidal bough May 21, 2021, 4:42 PM

#

yes; just plot to the same figure repeatedly.

fresh zenith May 21, 2021, 4:44 PM

#

can you give me a basic example or somthing to read?

tidal bough May 21, 2021, 4:48 PM

#

fresh zenith can you give me a basic example or somthing to read?

https://stackoverflow.com/a/33050617
This gives some important info, and here's a working example:

import time
import matplotlib.pyplot as plt
fig = plt.figure()
for i in range(100):
    plt.scatter(i,i,figure=fig)
    time.sleep(0.1)
    plt.pause(0.01)

fresh zenith May 21, 2021, 4:52 PM

#

ok thanks

#

this is what i have so far

#

import matplotlib.pyplot as plt
import random

y_data = []
average = 0

for i in range(0, 60):
    y_data.append(random.randint(0, 100))


for i in y_data:
    average += i

print(y_data)
print((average/60))

plt.plot(y_data)
plt.ylabel("Jason's Gay Percentage")
plt.xlabel("Seconds")
plt.show()

grave frost May 21, 2021, 4:52 PM

#

lapis sequoia tensorflow as your boi?

kill this heretic! Those who don't believe in the gospel of Google are condemned to the worst depths of pytorch! 👺

lapis sequoia May 21, 2021, 4:53 PM

#

to classify almost 1000 classes, how many images do i need per class?

grave frost May 21, 2021, 5:06 PM

#

lapis sequoia to classify almost 1000 classes, how many images do i need per class?

1

#

theoretically

#

practically? as much as you can store

desert oar May 21, 2021, 5:10 PM

#

lapis sequoia to classify almost 1000 classes, how many images do i need per class?

number of images itself isn't that important. what you need is various inputs for each class. if you have 100 data points for class 532, but they are all very similar to each other, that isn't much better than having just 1 data point for that class.

#

you also need to worry about overall class imbalance - if some classes are much more common than other classes, the model can get pretty good accuracy by simply never predicting the rare classes

lapis sequoia May 21, 2021, 5:11 PM

#

isnt amount of images = various inputs per class?

#

ah, u mean how different are between each other?

grave frost May 21, 2021, 5:12 PM

#

@desert oar you forgot data quality, noise, model architecture, gpu memory, bank account

lapis sequoia May 21, 2021, 5:12 PM

#

desert oar you also need to worry about overall class imbalance - if some classes are much ...

i am building my own data set with google image search api, so i was wondering for a number

desert oar May 21, 2021, 5:13 PM

#

there is no specific number. more is better. if you have a lot of features in your data, you will need more data points to cover the feature space.

desert oar May 21, 2021, 5:13 PM

#

lapis sequoia ah, u mean how different are between each other?

yeah, this is what i meant

#

with image classification, you can use data augmentation to help with this somewhat

grave frost May 21, 2021, 5:14 PM

#

ist just me, or is finding people with knowledge in multiple domains difficult AF?

#

or is signal processing + AI a niche job in general?

lapis sequoia May 21, 2021, 5:15 PM

#

desert oar with image classification, you can use data augmentation to help with this somew...

ye i know

#

also... is there any argument in keras to, lets say, augment data coloring it?

#

like if i have a red car, apart from rotating scaling flipping etc, pait it blue? or at least add it blue color? or something?

grave frost May 21, 2021, 5:17 PM

#

check out imgaug lib. it has enough augmentations to last you a lifetime

grave breach May 21, 2021, 5:17 PM

#

@lapis sequoia So, you need to peek the red car and make it blue?

lapis sequoia May 21, 2021, 5:18 PM

#

grave frost check out `imgaug` lib. it has enough augmentations to last you a lifetime

from keras?

grave frost May 21, 2021, 5:18 PM

#

lapis sequoia from keras?

github - not integrated directly in keras

lapis sequoia May 21, 2021, 5:18 PM

#

grave breach <@456226577798135808> So, you need to peek the red car and make it blue?

yeah like if i have an image, multiply it by some color

grave frost May 21, 2021, 5:18 PM

#

lapis sequoia yeah like if i have an image, multiply it by some color

im pretty sure that's not how colored filters work - or do they?

lapis sequoia May 21, 2021, 5:19 PM

#

i think yes

grave frost May 21, 2021, 5:19 PM

#

I would think scaling pixels to values for a particular color

#

like if blue is 0-10, then all image values would be scaled in that range

desert oar May 21, 2021, 5:19 PM

#

im sure opencv has this stuff

#

that said, the imgaug library does have this functionality already @lapis sequoia https://imgaug.readthedocs.io/en/latest/source/overview/color.html

lapis sequoia May 21, 2021, 5:20 PM

#

maybe photoshop "color" blending mode is what i want

hallow orbit May 21, 2021, 5:20 PM

#

did someone need me

desert oar May 21, 2021, 5:20 PM

#

no i tagged the wrong person, sorry

hallow orbit May 21, 2021, 5:20 PM

#

oki

grave breach May 21, 2021, 5:21 PM

#

lapis sequoia yeah like if i have an image, multiply it by some color

I suggest you to go with something more advanced (not necessary more complex) than python

#

Like Mathematica

lapis sequoia May 21, 2021, 5:21 PM

#

https://gyazo.com/1bd34c4c37e3d7937f71d9d475f0a97e

Gyazo

grave breach May 21, 2021, 5:21 PM

#

It has a function that does what you need

lapis sequoia May 21, 2021, 5:21 PM

#

this is how i wanted to augment the images

#

okey, ill take a look at imgaug

#

but since keras provides a generator u can pass to the fit method after

#

can i pass the analogue of imgaug?

desert oar May 21, 2021, 5:24 PM

#

i imagine keras gives you some way to write your own generator

lapis sequoia May 21, 2021, 5:27 PM

#

keras ImageDataAugmentation class has

#

brightness_range=None,

#

can i somehow add the color one?

grave breach May 21, 2021, 5:28 PM

#

lapis sequoia keras ImageDataAugmentation class has

Seriously, I suggest you to write a short wolfram script that augments your data, put it in a folder, and then do all the ml in keras

lapis sequoia May 21, 2021, 5:29 PM

#

i dont wanna write on disk all the augmented images

#

i only wanna have the basic ones, and during the train, provide the augmented ones, like u normally do with keras

grave breach May 21, 2021, 5:29 PM

#

You can keep them on memory and then pass all the data to python via mathlink

lapis sequoia May 21, 2021, 5:30 PM

#

                              validation_data = validataion_gen,```

grave breach May 21, 2021, 5:30 PM

#

But, if you do the ml part on wolfram (faster and easier than keras) I think that you can do data augmentation on the fly

lapis sequoia May 21, 2021, 5:30 PM

#

validataion_gen = data_gen.flow_from_directory

desert oar May 21, 2021, 5:30 PM

#

grave breach Seriously, I suggest you to write a short wolfram script that augments your data...

im not sure why learning mathematica is a better option than using a library in python if you're already using python for other stuff 🤷‍♂️

lapis sequoia May 21, 2021, 5:30 PM

#

                              horizontal_flip = True,
                              vertical_flip = False,
                              brightness_range = (0.5, 1.6),
                              rotation_range = 11,
                              validation_split = 0.17)```

#

in the end, what u provide to fit, is a generator

desert oar May 21, 2021, 5:30 PM

#

cool that you can do neural networks in mathematica though

#

definitely a powerful tool

grave breach May 21, 2021, 5:31 PM

#

desert oar im not sure why learning mathematica is a better option than using a library in ...

Mathematica has the stability and the coherence (sorry for spelling) that no other has (mathematica get designed by the ceo since the 80')

desert oar May 21, 2021, 5:31 PM

#

i remember i had a license for it when i was an undergrad, through my school. but i didnt really have a use for it then and didn't have the patience to learn the language

#

you spelled everything correctly

lapis sequoia May 21, 2021, 5:31 PM

#

the ideal was to have exact the same thing as this, but with an extra option saying add_color = (255,0,0) or something xD

#

oh

#

look what i found

grave breach May 21, 2021, 5:33 PM

#

@desert oar By the way, I think that mathematica is better than python when doing research or training models, but I also think that the best comes when you take what you researched on mathematica and take it to python (or other languages for production)

lapis sequoia May 21, 2021, 5:33 PM

#

from imgaug import augmenters as iaa

seq = iaa.Sequential([
    iaa.Fliplr(0.5), # horizontally flip
    # sometimes(iaa.AdditiveGaussianNoise(loc=0, scale=(0.0, 0.05), per_channel=0.5)),
    iaa.OneOf([
        iaa.Sharpen(alpha=(0, 1.0), lightness=(0.75, 1.5)),
        iaa.Emboss(alpha=(0, 1.0), strength=(0, 2.0)),
        # iaa.Noop(),
        iaa.GaussianBlur(sigma=(0.0, 1.0)),
        # iaa.Noop(),
        iaa.Affine(rotate=(-10, 10), translate_percent={"x": (-0.25, 0.25)}, mode='symmetric', cval=(0)),
        # iaa.Noop(),
        # iaa.PerspectiveTransform(scale=(0.04, 0.08)),
        # # iaa.Noop(),
        # iaa.PiecewiseAffine(scale=(0.05, 0.1), mode='edge', cval=(0)),
        
    ]),
    sometimes(iaa.ElasticTransformation(alpha=(0.5, 3.5), sigma=0.25)),
    # More as you want ...
], random_order=True)

datagen = ImageDataGenerator(preprocessing_function=seq.augment_image)

grave frost May 21, 2021, 5:33 PM

#

I don't know why there are people in weird places encouraging niche languages to newbies for no good reason

lapis sequoia May 21, 2021, 5:34 PM

#

can i somehow keep the default augment params?

desert oar May 21, 2021, 5:34 PM

#

lapis sequoia can i somehow keep the default augment params?

what do you mean by this?

grave frost May 21, 2021, 5:34 PM

#

like the guys at one server trying to get someone to write a NN in FORTRAN and x86

lapis sequoia May 21, 2021, 5:35 PM

#

lapis sequoia ```py from imgaug import augmenters as iaa seq = iaa.Sequential([ iaa.Flipl...

i think here, when u call datagen, only the changes u wrote, like gaussian blur, sharpen, etc, will be applied to images

grave breach May 21, 2021, 5:35 PM

#

grave frost I don't know why there are people in weird places encouraging niche languages to...

Mathematica is not niche, it is used in the largest universities and research institutions, it also has a completely different approach, that makes it easy to learn and fast (to use)

lapis sequoia May 21, 2021, 5:35 PM

#

lapis sequoia ```data_gen = ImageDataGenerator(rescale = 1./255, ...

this ones are gone

grave breach May 21, 2021, 5:35 PM

#

grave breach Mathematica is not niche, it is used in the largest universities and research in...

But I think that's not the right channel to talk about

grave frost May 21, 2021, 5:36 PM

#

grave breach Mathematica is not niche, it is used in the largest universities and research in...

oh yeah, newbies should use mathematica instead of an already defined, well-maintained lib with a single language??

lapis sequoia May 21, 2021, 5:37 PM

#

brb, gtg

grave breach May 21, 2021, 5:37 PM

#

grave frost oh yeah, newbies should use mathematica instead of an already defined, well-main...

Mathematica is well defined since the 90s, and get maintained by an extremely large and professional team

#

it is also used in large scale production

#

(even alexa is in part powered by mathematica)

grave frost May 21, 2021, 5:38 PM

#

I think that mathematica is better than python when doing research or training models, but I also think that the best comes when you take what you researched on mathematica and take it to python (or other languages for production)
most research uses JAX and python tho? if it is indeed maintained by such a big team, it definitely doesn't convince many in research or in Applied ML

grave frost May 21, 2021, 5:38 PM

#

grave breach (even alexa is in part powered by mathematica)

most products use a mixture of multiple languages 🤷

grave breach May 21, 2021, 5:38 PM

#

Oh no... Religious wars...

#

By the way, mathematica neural network framework is powered by mxnet

#

That is heavily used in production and research

#

Also many of fortune 500 companies actively uses mathematica for research

grave frost May 21, 2021, 5:39 PM

#

what? mxnet?

grave breach May 21, 2021, 5:39 PM

#

Yes

grave frost May 21, 2021, 5:40 PM

#

is it even maintained?

grave breach May 21, 2021, 5:40 PM

#

Yes

#

But I think that wolfram has it's own branch

#

I also think that in order to have a productive conversation you should take a look about who they are at Wolfram Research

#

What they did, what mathematica can do, etc.

grave frost May 21, 2021, 5:43 PM

#

I mean, I don't even have to argue how much of the industry uses mxnet

grave breach May 21, 2021, 5:43 PM

#

Sorry, but I don't even remember the point of the conversation

#

I'll make you a recap:
Mathematica is the world's fastest language (not performance, speed of coding)
It (with MatLab) is the industry standard for research
Top universities, companies and institutions uses it
It powers large scale productions systems

desert oar May 21, 2021, 5:50 PM

#

i think the point is that this is a python server, and most newbies here can barely use python, let alone do serious machine learning or understand the math that goes into it, so recommending that they use mathematica instead os not really helpful to those people

grave breach May 21, 2021, 5:50 PM

#

desert oar i think the point is that this is a python server, and most newbies here can bar...

I think you're right

desert oar May 21, 2021, 5:50 PM

#

its definitely an interesting topic though

#

i know there are people who really love mathematica

#

where have you seen it used in industry? finance?

grave breach May 21, 2021, 5:51 PM

#

I'm a physics enthusiast

#

I mainly use it for it

desert oar May 21, 2021, 5:52 PM

#

i know it has some very powerful symbolic math capabilities

#

i definitely used it to try and figure out homework answers in college

#

didn't always though...

grave breach May 21, 2021, 5:53 PM

#

desert oar i definitely used it to try and figure out homework answers in college

Now we have wolfram|alpha for that 😉

desert oar May 21, 2021, 5:58 PM

#

yup, very handy tool

#

it was also useful for quick plotting when i needed intuition about how a function ought to work

grave breach May 21, 2021, 5:59 PM

#

Definitely

late shell May 21, 2021, 6:03 PM

#

Hello, I'm trying to code a multiple linear regression model myself without using any libraries except numpy. But even after a lot of epochs, my accuracy is stuck at 38%. Using sklearn's linear regression gives 94%. I'm guessing that while moving down the cost function using gradient descent, my algorithm is stuck at some local minima. Any way I can confirm that and if it turns out to be true how can I get out of that local minima and move towards global minima? Thanks

near cosmos May 21, 2021, 6:33 PM

#

grave breach I'll make you a recap: Mathematica is the world's fastest language (not performa...

fwiw, you could mostly make the same arguments for python (or several other environments)

grave breach May 21, 2021, 6:34 PM

#

Well, python alone is a small, fast and flexible language

#

So, without package it cannot do as many things as mathematica

#

the problem is

#

That since the developers of the packages are different

#

Many package wouldn't perfectly fit

#

So, while python is better for production

#

Mathematica is better for research

#

They have different purposes

near cosmos May 21, 2021, 6:37 PM

#

late shell Hello, I'm trying to code a multiple linear regression model myself without usin...

calculate loss on a grid instead of doing a gradient search to look at the shape. or change your gradient descent parameters and try again and look for changes

tidal bough May 21, 2021, 6:38 PM

#

If you have few points, you can also exactly calculate the correct (best fit) parameters by solving the normal equation.

near cosmos May 21, 2021, 6:39 PM

#

grave breach They have different purposes

True enough. My point was that your last three arguments also are arguments for python 1) it is an industry standard, 2) top places use it, 3) it is used in production systems

grave breach May 21, 2021, 6:40 PM

#

Oh yes, I think I did not explained myself correctly

#

I meant that with mathematica has born for doing so

#

So it is designed to be more powerful in research

#

It also comes with a big set of tools

late shell May 21, 2021, 6:42 PM

#

tidal bough If you have few points, you can also exactly calculate the correct (best fit) pa...

yeah, the best fit parameters calculated using sklearn are slightly off from my custom coded regression

late shell May 21, 2021, 6:42 PM

#

near cosmos calculate loss on a grid instead of doing a gradient search to look at the shape...

I'm sorry I don't know what a grid search is, haven't reached that part yet, still a beginner

near cosmos May 21, 2021, 6:43 PM

#

late shell yeah, the best fit parameters calculated using sklearn are slightly off from my ...

the point is that you don't need gradient descent. you can find the best fit parameters analytically. but my sense was that you were trying to understand gradient descent

tidal bough May 21, 2021, 6:44 PM

#

Basically make up a whole bunch of parameters (say, equally distributed on a grid) and for each set of them, calculate the error

#

that'll allow you to look at how the error looks like depending on the params

late shell May 21, 2021, 6:45 PM

#

near cosmos the point is that you don't need gradient descent. you can find the best fit par...

yes, exactly

late shell May 21, 2021, 6:45 PM

#

tidal bough Basically make up a whole bunch of parameters (say, equally distributed on a gri...

oh okay, lemme try this, thanks

desert oar May 21, 2021, 6:57 PM

#

late shell Hello, I'm trying to code a multiple linear regression model myself without usin...

strong +1 recommendation to start with least squares

obtuse spindle May 21, 2021, 7:10 PM

#

can someone explain the road map or share a useful link to learn data science and machine learning . I am total beginner with no proper guide.I know c++ and python. Should i learn django network & otherskills?

main kernel May 21, 2021, 7:22 PM

#

obtuse spindle can someone explain the road map or share a useful link to learn data science an...

study some math is a good idea, data science is to undestande and build better models

tidal bough May 21, 2021, 7:28 PM

#

obtuse spindle can someone explain the road map or share a useful link to learn data science an...

https://www.coursera.org/learn/machine-learning is a very popular free course. Sadly, it's not in Python (it uses Octave, basically free Matlab), but it mostly focuses on the internals of some common algorithms, so the language doesn't matter much.

#

(it also teaches you some of the linear algebra required if you don't know it)

late shell May 21, 2021, 7:42 PM

#

desert oar strong +1 recommendation to start with least squares

umm, can you please elaborate a bit more.

lapis sequoia May 21, 2021, 8:08 PM

#

so guys, how can i add another transformation to ImageDataGenerator from keras?

wanton bobcat May 21, 2021, 8:10 PM

#

Hello? Can someone help me in basic python console?

desert oar May 21, 2021, 8:30 PM

#

late shell umm, can you please elaborate a bit more.

i recommend starting by implementing least squares regression. it's easier to program, you get an exact result rather than a local optimum, and it's a good excuse to dig into the linear algebra and optimization problem a bit more deeply than gradient descent

steel hill May 21, 2021, 8:47 PM

#

desert oar <@!125788547939696640> you might have to define your own colormap that has trans...

would you have any idea how to get started on that? or perhaps maybe a repository of color maps that would have transparency?

silent current May 21, 2021, 10:32 PM

#

Suggestions for building a dashboard? Bokeh? Plotly? Something else?

teal wadi May 21, 2021, 10:36 PM

#

hey guys there is someone who can help me with python pandas

silent current May 21, 2021, 10:38 PM

#

Just ask your question

teal wadi May 21, 2021, 10:38 PM

#

i need to fix my timestamp on pandas i get wrong value when i do pd.to_date

#

i convert timestamp to date UTC

#

and it doesnt work well when i do other thing its just gives me an error

desert oar May 21, 2021, 11:20 PM

#

@teal wadi it helps if you share your code and the specific errors or unexpected output

#

!paste

arctic wedgeBOT May 21, 2021, 11:22 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pydis.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

lapis sequoia May 21, 2021, 11:56 PM

#

hi guys, i have a question about keras
keras ImageDataGenerator class provides some of the basic transformations to increase data amount, but i am wondering how can i add my own transformation
In this case, i wanna change the color. Ive seen imgaug library has it, but i dont know how to use it with keras
Can someone help?

arctic wedgeBOT May 22, 2021, 2:39 AM

#

Hey @native ginkgo!

Uh-oh! It looks like your message got zapped by our spam filter. We currently don't allow .txt attachments, so here are some tips to help you travel safely:

• If you attempted to send a message longer than 2000 characters, try shortening your message to fit within the character limit or use a pasting service (see below)

• If you tried to show someone your code, you can use codeblocks
(run !code-blocks in #bot-commands for more information) or use a pasting service like:

https://paste.pythondiscord.com

native ginkgo May 22, 2021, 2:39 AM

#

    import pyaudio
ModuleNotFoundError: No module named 'pyaudio'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "c:/Users/Vandana/Desktop/vansh coding/discord bot/alex.py", line 45, in <module>
  File "c:/Users/Vandana/Desktop/vansh coding/discord bot/alex.py", line 28, in commandlistener
    with sr.Microphone() as source:
  File "C:\Users\Vandana\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.8_qbz5n2kfra8p0\LocalCache\local-packages\Python38\site-packages\speech_recognition\__init__.py", line 79, in __init__
    self.pyaudio_module = self.get_pyaudio()
  File "C:\Users\Vandana\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.8_qbz5n2kfra8p0\LocalCache\local-packages\Python38\site-packages\speech_recognition\__init__.py", line 110, in get_pyaudio
    raise AttributeError("Could not find PyAudio; check installation")
AttributeError: Could not find PyAudio; check installation
PS C:\Users\Vandana> pip install PyAudio
Collecting PyAudio
  Using cached PyAudio-0.2.11.tar.gz (37 kB)
Building wheels for collected packages: PyAudio
  Building wheel for PyAudio (setup.py) ... error```

#

it is saying that pyaudio is not found on comp

#

and when i am trying to pip install

#

it is also showing error

winged stratus May 22, 2021, 2:51 AM

#

late shell Hello, I'm trying to code a multiple linear regression model myself without usin...

also for linear regression the loss space has only one minima, so you have some bug in the training loop

late shell May 22, 2021, 4:24 AM

#

winged stratus also for linear regression the loss space has only one minima, so you have some ...

oh yeah, hpw could i forget this. Thanks a lot.

languid falcon May 22, 2021, 4:55 AM

#

anyone free for me to message them? I have some general questions about a few topics regarding data science and the types of way to do them in python

lapis sequoia May 22, 2021, 5:31 AM

#

languid falcon anyone free for me to message them? I have some general questions about a few to...

Go on

polar stag May 22, 2021, 8:06 AM

#

any data science course you people recommend? i can see one in pinned messages from columbia's ML

oak violet May 22, 2021, 8:33 AM

#

hey guys, im new in data science and im having a hard time getting the newest googlebot string.. could someone help me with that please?

oblique raft May 22, 2021, 9:28 AM

#

hi! can someone recommend me a good tutorial for generating text with keras ?

lapis sequoia May 22, 2021, 10:51 AM

#

i need help with translating one of the old crypto hash function algorithms from C code to Python... can anyone help?

teal wadi May 22, 2021, 11:41 AM

#

i need to fix my timestamp on pandas i get wrong value when i do pd.to_date
i convert timestamp to date UTC
and it doesnt work well when i do other thing its just gives me an error

#

DM me if someone can help me with it i can share my code and hope for help

hard hound May 22, 2021, 11:42 AM

#

ValueError: shapes (1,3) and (4,4) not aligned: 3 (dim 1) != 4 (dim 0)

#

Is the error I am getting again and again I have checked my code and I know hat it means But I am ubale to solve it

hard hound May 22, 2021, 11:50 AM

#

polar stag any data science course you people recommend? i can see one in pinned messages f...

Try MIT's Intro to deep learning http://introtodeeplearning.com/

MIT Deep Learning 6.S191

MIT's introductory course on deep learning methods and applications.

lapis sequoia May 22, 2021, 12:34 PM

#

hi guys, i have a question about keras
keras ImageDataGenerator class provides some of the basic transformations to increase data amount, but i am wondering how can i add my own transformation
In this case, i wanna change the color. Ive seen imgaug library has it, but i dont know how to use it with keras
Can someone help?

#

actually

#

https://stepup.ai/custom_data_augmentation_keras/

Step Up AI

Tutorial: Custom Data Augmentation in Jeras

Learn how to implement your custom preprocessing function and integrate it into Keras data augmentation pipeline. Follow along on the Colab notebook.

#

is this what i want?

#

when u extend ur custom class from ImageDataGenerator, does it still have the augmenatations from ImageDataGenerator?

#

Yes, right?

upper spade May 22, 2021, 12:59 PM

#

hey guys

#

so im planning to pick up ai and machine learning

#

but i have 0 experience with pandas whatsoever

#

or any other library needed

#

what book should i read?

somber prism May 22, 2021, 1:04 PM

#

guy i have this Salary dataset , it has like 30 samples and only one feature , so its shape is (30, 2) . i took last 2 samples as test data . i tried fitting it and when i see the score for training i get 94. but when i see the score for testing i get -131. can someone explain me why ?

somber prism May 22, 2021, 1:05 PM

#

upper spade so im planning to pick up ai and machine learning

start with ml for stanford in coursera , then watch tutorials on yt

upper spade May 22, 2021, 1:06 PM

#

oh i see

#

thanks man

frail oak May 22, 2021, 1:09 PM

#

Hello! I want to try do a sentiment analysis project, I found this tutorial https://github.com/bentrevett/pytorch-sentiment-analysis
Can anyone familiar with the subject look at the tutorial and tell me if it's any good? Thanksies ❤️

GitHub

bentrevett/pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis. - bentrevett/pytorch-sentiment-analysis

polar stag May 22, 2021, 1:10 PM

#

hard hound Try MIT's Intro to deep learning http://introtodeeplearning.com/

okay, thanks. will check it out

grave frost May 22, 2021, 1:23 PM

#

frail oak Hello! I want to try do a sentiment analysis project, I found this tutorial http...

it has 3k stars 🤷 so must be good

#

Brain wrenching question 🧠 : For calculating attention on a multi-sequence input (as in rather than having a single 1D sequence of tokens, we have a 2D/3D array of tokens that all are considered to be a single sequence) is there any such method/technique/research that has been done into this? I can't seem to find relevant stuff.

sterile stream May 22, 2021, 1:47 PM

#

I am trying to learn Machine learning, and I am confused about which path to go down. There is a Coursera course by Andrew Nag, but it does not teach it in python. Then there is a playlist on machine learning by Sentex (the YouTuber).

https://youtube.com/playlist?list=PLQVvvaa0QuDfKTOs3Keq_kaG2P55YRn5v

Other than that, there are machine learning modules like TensorFlow, Keras, PyTorch, etc. I am not sure which path to choose for moving ahead. You all are experienced than me, what do you suggest?

Note: I am not learning this as a hobby. I want to get a master’s degree in Robotics, and my college does not teach any of this stuff (I am an Undergraduate currently).

YouTube

Machine Learning with Python

#

The playlist is symbolic

lapis sequoia May 22, 2021, 1:50 PM

#

sterile stream I am trying to learn Machine learning, and I am confused about which path to go ...

tensorflow is like assembly, keras like c, and pytorch like python 🙂

#

is like, different levels

somber prism May 22, 2021, 2:03 PM

#

lapis sequoia tensorflow is like assembly, keras like c, and pytorch like python 🙂

this didnt made any sense , can you explain more ?

#

guy i have this Salary dataset , it has like 30 samples and only one feature , so its shape is (30, 2) . i took last 2 samples as test data . i tried fitting it and when i see the score for training i get 94. but when i see the score for testing i get -131. can someone explain me why ?

regal trail May 22, 2021, 2:05 PM

#

Hello, I just started learning some basic ML using sklearn. I was wondering if I could make a similar app to the one in the documentation ( https://scikit-learn.org/stable/tutorial/text_analytics/working_with_text_data.html), but with my own dataset ? : )

sterile stream May 22, 2021, 2:07 PM

#

lapis sequoia tensorflow is like assembly, keras like c, and pytorch like python 🙂

That didn't exactly answer my question

old grove May 22, 2021, 2:33 PM

#

Hi Guyz, This is my dataset and i need to get new dataset as decades but i want to add all colum values in that . Like say 1960-1970 is decade so in 1960 as first value in first colum and in second colum i need the sum of all values from 1960-1969... same for value 1970 as first colum value and next colum will have sum of all values from 1970-1979.

#

I tried groupby

#

googled but i am not getting any tresults

#

is there any inbult method

#

Like This.... Like in vehicle theft 0 index has sum of values from 1960-1969..Like This

fierce hill May 22, 2021, 2:36 PM

#

You can use . sum() at the end of your line

#

To add the values

lapis sequoia May 22, 2021, 2:36 PM

#

rip fcb

old grove May 22, 2021, 2:36 PM

#

tried groupby.sum

#

but it gives me corresponding value for that value and not sum

lapis sequoia May 22, 2021, 2:37 PM

#

Use for loop in range

#

firstly find the index of last year of each decade using .loc then do sum within for loop

#

that should easily solve your problem

old grove May 22, 2021, 2:41 PM

#

lapis sequoia firstly find the index of last year of each decade using .loc then do sum within...

You mean index slice those 10 values and take its sum ?

lapis sequoia May 22, 2021, 2:42 PM

#

Yes

#

Make another dataframe for it

old grove May 22, 2021, 2:42 PM

#

ok... but any ib method using groupby or agg.. anyone knows ?

#

will write a loop..but in genral asking ?

lapis sequoia May 22, 2021, 2:42 PM

#

Why do you need to use that?

#

well you can use agg function with lambda

#

but basically it’s same

#

for simple line you could do agg + lambda + for in one line

south burrow May 22, 2021, 2:44 PM

#

hi yall , i'am dealing with covid data analysis , i 'm using this dataset https://raw.githubusercontent.com/beoutbreakprepared/nCoV2019/master/covid19/data/clean-outside-hubei.csv but i get this result (i dont understand why the shape it like with one column?) py sys:1: DtypeWarning: Columns (1,2,6,7,9,10,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,30,31,32) have mixed types. Specify dtype option on import or set low_mcify dtype option on import or set low_memory=False. (2676313, 1)

#

import pandas as pd
df = pd.read_csv('data.tar\latestdata.tar').shape
print(df)

old grove May 22, 2021, 2:48 PM

#

lapis sequoia Why do you need to use that?

Oh yeah loop is giving me summed values for 1960 to 1969 and then i will append this summed value to index 1960 :sum(1960-1969) and so on for years....

#

but i have to create other frame and append the years and summed values for ranges and append... Thanks anyways 🙂

upper spade May 22, 2021, 3:36 PM

#

hey guys having abit of a roadblock using pandas here

#

print(df.sort_values(['Name','Speed']).iloc[0:15])

#

its sorting Name but not Speed

#

can i know why?

#

lapis sequoia May 22, 2021, 3:39 PM

#

god this cuda annd cudnn are killing me

#

3-5 months ago i tried installing and using them

#

i changed my drivers with them

#

i failed but my drivers were still cuda

#

my c drive storage died after installing them

#

and now when i try to use them i realize they are not updated and i need to reinstall

#

aghhh

lapis sequoia May 22, 2021, 3:41 PM

#

old grove but i have to create other frame and append the years and summed values for rang...

Yup definitely not an efficient way of doing it but so long as it works, it’s fine lol

old grove May 22, 2021, 3:44 PM

#

lapis sequoia Yup definitely not an efficient way of doing it but so long as it works, it’s fi...

yeah man i too wanted to avoid that..thats why asked any ib method or one liner method.. but yeah not everything comes inbulit.. sometimes we toj have to implement our own logic 😊😅

lapis sequoia May 22, 2021, 4:04 PM

#

using Google image search api, how can i search for more than 1 param?

#

'imgType': 'lineart|photo',

#

this sais it is not allowed

#

but 1 by 1 i can

fierce hill May 22, 2021, 4:50 PM

#

upper spade hey guys having abit of a roadblock using pandas here

Try using ascending = true in the brackets

serene scaffold May 22, 2021, 5:02 PM

#

upper spade its sorting Name but not Speed

in what way is it not sorting speed? I'm pretty sure what it's supposed to do is sort by name, and then sort by speed only when two values in Name are equal. All your names are different.

lapis sequoia May 22, 2021, 5:04 PM

#

Yo I always think on weekdays that “I’m so excited to do my side project on the coming weekends” only to realize that I’m a lazy squidward lying on the bed on weekends

raven quiver May 22, 2021, 5:04 PM

#

quick question.. does anyone here use lightfm at all ? I am trying to use the beta distribuition as a normalization(https://www.reddit.com/r/statistics/comments/4svy2e/how_would_i_normalize_product_review_ratings/d5daucj/) and I am not really understanding the beta and alpha in this case

r/statistics - Comment by u/Pandanleaves on ”How would I normalize ...

5 votes and 4 comments so far on Reddit

serene scaffold May 22, 2021, 5:05 PM

#

@upper spade

>>> df
  letter  number
0      a       5
1      z       6
2      a      10
3      a       1
>>> df.sort_values(['letter', 'number'])
  letter  number
3      a       1
0      a       5
2      a      10
1      z       6

raven quiver May 22, 2021, 5:05 PM

#

lightfm allows an alpha and user_alpha, which are L2 penalties.. but I don't quite get how to get the alpha and beta described in the post

lapis sequoia May 22, 2021, 5:06 PM

#

serene scaffold <@!424867508722597889> ```py >>> df letter number 0 a 5 1 z ...

You know what that reminds me of how simple and efficient python is. That sort by multiple criteria stuff is not that simple in vba

#

or even in excel formula it needs something like RANK()+SUMPRODUCT()

serene scaffold May 22, 2021, 5:07 PM

#

lapis sequoia You know what that reminds me of how simple and efficient python is. That sort b...

this is pandas-specific functionality, though the key parameter for list.sort and sorted works similarly if you pass a tuple.

lapis sequoia May 22, 2021, 5:08 PM

#

Ooh

strong dock May 22, 2021, 5:20 PM

#

hello everyone!
I wanted to implement github repo : RankIQA based on Caffe
I am facing trouble in installing it in windows 10
I installed using Anaconda but im unable to import caffe
Can I install Caffe on Google Colab
Pls guide me as I sense the Caffe community is not much active, I commented on issues of the official repo but got no replys.

serene scaffold May 22, 2021, 5:32 PM

#

strong dock hello everyone! I wanted to implement github repo : RankIQA based on Caffe I am ...

why are you using anaconda?

strong dock May 22, 2021, 5:32 PM

#

I thought it would be easier

#

I dont have much experience

#

to make it from source

serene scaffold May 22, 2021, 5:33 PM

#

strong dock I thought it would be easier

that might have been true in the past, but there's a lot more community support available if you don't use anaconda at all.

strong dock May 22, 2021, 5:33 PM

#

should I dual boot ubuntu

#

?

serene scaffold May 22, 2021, 5:34 PM

#

strong dock should I dual boot ubuntu

the data science ecosystem accomodates linux much better than Windows, but you should be able to get by on Windows if you have the C++ build tools installed.

#

!build

arctic wedgeBOT May 22, 2021, 5:34 PM

#

Microsoft Visual C++ Build Tools

When you install a library through pip on Windows, sometimes you may encounter this error:

error: Microsoft Visual C++ 14.0 or greater is required. Get it with "Microsoft C++ Build Tools": https://visualstudio.microsoft.com/visual-cpp-build-tools/

This means the library you're installing has code written in other languages and needs additional tools to install. To install these tools, follow the following steps: (Requires 6GB+ disk space)

1. Open https://visualstudio.microsoft.com/visual-cpp-build-tools/.
2. Click Download Build Tools >. A file named vs_BuildTools or vs_BuildTools.exe should start downloading. If no downloads start after a few seconds, click click here to retry.
3. Run the downloaded file. Click Continue to proceed.
4. Choose C++ build tools and press Install. You may need a reboot after the installation.
5. Try installing the library via pip again.

strong dock May 22, 2021, 5:34 PM

#

thanks will try it

stark zenith May 22, 2021, 7:36 PM

#

I think you can still pip install packages on colab.

lapis sequoia May 22, 2021, 9:04 PM

#

guys, since google api forbides too many requests per day to it, can any of u help me? I am trying to create a big dataset

raven quiver May 22, 2021, 9:30 PM

#

does anyone here work with reccomendations engines? I have been trying to increase the fit of my model with various normalizations and it doesn't seem to do anything, was wondering if someone could help me out quick

rotund lily May 22, 2021, 11:45 PM

#

hey i need help making a racial detection robot in python

exotic maple May 22, 2021, 11:55 PM

#

rotund lily hey i need help making a racial detection robot in python

This is...oddly specific

frank dock May 23, 2021, 12:00 AM

#

Hello! Here's my bachelor's thesis on privacy-preserving federated learning on decentralized data. I have made it open source now, and I would love it if anybody here could try it, give feedback, or contribute in any way. The goal is to make an open source library for doing secure federated learning using different privacy-preserving algorithms in an easy and efficient way.
It was written for Norwegian University of Science and Technology as a part of my degree in Computer Science.
Contact me if you want to know more about the research, and please ⭐ the project if you find it interesting!
https://github.com/dilawarm/federated

GitHub

dilawarm/federated

Bachelor's Thesis in Computer Science: Privacy-Preserving Federated Learning Applied to Decentralized Data - dilawarm/federated

humble nest May 23, 2021, 12:24 AM

#

i made a ethereum and dogecoin comparison graph but ethereum seems to be very linear since it has such a high value compared to dogecoin so its hard to compare the both together. what potential improvements should i make on this graph? thanks for the feedback if so

#

also if youre wondering, this is purely experimental, i just wanted something to plot and decided to analyse cryptocurrencies

exotic maple May 23, 2021, 12:30 AM

#

humble nest i made a ethereum and dogecoin comparison graph but ethereum seems to be very li...

you could try minmax-scaling each crypto

#

to their corresponding ranges

#

but what exactly do you want to compare? Rate of change per time period?

humble nest May 23, 2021, 12:32 AM

#

i wanna compare their value throughout the past month

humble nest May 23, 2021, 12:32 AM

#

exotic maple but what exactly do you want to compare? Rate of change per time period?

yeah basically

exotic maple May 23, 2021, 12:32 AM

#

then compute that change

drowsy stag May 23, 2021, 12:32 AM

#

humble nest May 23, 2021, 12:33 AM

#

exotic maple then compute that change

wdym

exotic maple May 23, 2021, 12:33 AM

#

the change is simply change = x - x(-1)

humble nest May 23, 2021, 12:33 AM

#

i plot the data i just need to scale it

exotic maple May 23, 2021, 12:33 AM

#

or percent cahange

#

think carefully about what you want to do, what exactly do you want to plot

humble nest May 23, 2021, 12:33 AM

#

how do i apply that to my code

#

yeah sorry im just a beginner to data visualization

exotic maple May 23, 2021, 12:34 AM

#

that's ok, but the most important question is -what- is what you want to see

#

everything builds from there

humble nest May 23, 2021, 12:34 AM

#

yep

humble nest May 23, 2021, 12:34 AM

#

exotic maple you could try minmax-scaling each crypto

what syntax do i use for that

exotic maple May 23, 2021, 12:35 AM

#

but the most important question is -what- is what you want to see

humble nest May 23, 2021, 12:35 AM

#

i wanna see the movement of the crypto

#

when it goes down and when it goes up

exotic maple May 23, 2021, 12:35 AM

#

in relation to what?

humble nest May 23, 2021, 12:35 AM

#

because i cant have it going at constant rate up

exotic maple May 23, 2021, 12:35 AM

#

to itself? to other?, etc

humble nest May 23, 2021, 12:36 AM

#

in relation to doge

#

because if i plot it individually

#

it shows a lot of movement

#

but when its in relation to doge it just goes up at a constant rate

#

shall i show a depiction of what i mean?

exotic maple May 23, 2021, 12:37 AM

#

yes please because im not sure i get it

#

im thinking what you want is a common standard regression but i might be wrong

humble nest May 23, 2021, 12:39 AM

#

one sec

#

oh my bad

#

i meant dogecoin

#

ethereum is always linear, even when plotted individually

#

#

heres dogecoin when its plotted by itself

exotic maple May 23, 2021, 12:42 AM

#

so you just want to plot them...together? no specifics analysis or anything?

humble nest May 23, 2021, 12:43 AM

#

i mean i do want analysis which is why i want to make it so i can see it going up or down

#

instead of it just going at a straight line

exotic maple May 23, 2021, 12:43 AM

#

your Y axis is a mess in that plot

#

why is it being generated like that

humble nest May 23, 2021, 12:43 AM

#

nah its meant to be like that

#

because dogecoins value is pretty small

slate hollow May 23, 2021, 12:44 AM

#

http://surpriselib.com/ so looking at this framework, it doesn't use gpu right?

Surprise

Home

A simple Python library for building and testing recommender systems.

exotic maple May 23, 2021, 12:44 AM

#

Id say your best is to calculate the pct_change for each crypt

#

pct_change from x0 to x1 for each crypto, since that would be standarized

humble nest May 23, 2021, 12:44 AM

#

so i have to plot in the y axis value manually?

exotic maple May 23, 2021, 12:44 AM

#

no?

#

just calculate pct_change on each (instead of values) and plot that

humble nest May 23, 2021, 12:45 AM

#

i got it from a csv file

exotic maple May 23, 2021, 12:45 AM

#

you need to calculate pct change then

humble nest May 23, 2021, 12:45 AM

#

dang discord is not loading

#

or i could just analyse another set of data

#

because ethereum has a huge value difference

exotic maple May 23, 2021, 12:46 AM

#

humble nest because ethereum has a huge value difference

just normlaize your data

#

its pretty trivial -> pct-change = (current_value - previous_value) / previous_value

humble nest May 23, 2021, 12:47 AM

#

load

#

aight

#

so i just plot the averages?

exotic maple May 23, 2021, 12:47 AM

#

average has nothing to do with pct_change

humble nest May 23, 2021, 12:48 AM

#

dude i told you i am new

#

so no need to have high expectations of me

exotic maple May 23, 2021, 12:49 AM

#

I'm not, i'm just wondering why you mentioned something unrelated

#

as i said, I think I havent quite understood WHAT is what you want to see

humble nest May 23, 2021, 12:50 AM

#

i just needed to know how to make it so i can see 2 different datasets normally without it being too linear

exotic maple May 23, 2021, 12:50 AM

#

humble nest i just needed to know how to make it so i can see 2 different datasets normally ...

your plot only looks linear because your Y axis is strange

humble nest May 23, 2021, 12:50 AM

#

makes sense

exotic maple May 23, 2021, 12:50 AM

#

if your Y axis was bottom-top you would see ETHs movement

#

like a normal plot...

#

humble nest May 23, 2021, 12:51 AM

#

thing is

exotic maple May 23, 2021, 12:51 AM

#

but your chart is a weird thing that goes from 2.7k to.... 2.7k?

humble nest May 23, 2021, 12:51 AM

#

and dogecoin has a pretty low price

exotic maple May 23, 2021, 12:51 AM

#

at least ive never seen that

humble nest May 23, 2021, 12:51 AM

#

im analysing dogecoins price

exotic maple May 23, 2021, 12:51 AM

#

do you experience with financial analysis?

#

covariances? correlations? Betas?, etc?

Python expertise is not related to subject matter expertise.

#

if your financial expertise is solid then you just need to think of what kind of visualization you need for your analysis

humble nest May 23, 2021, 12:52 AM

#

sorry discord is being slow rn

humble nest May 23, 2021, 12:52 AM

#

exotic maple do you experience with financial analysis?

not really im just a beginner hopping onto the data analysis rabbit hole

exotic maple May 23, 2021, 12:53 AM

#

humble nest not really im just a beginner hopping onto the data analysis rabbit hole

then you need to get some subject matter expertise before attempting to code it

humble nest May 23, 2021, 12:53 AM

#

humble nest also if youre wondering, this is purely experimental, i just wanted something to...

i said this for a reason

#

not meant to be financial i just needed something to analyse

#

and decided to use cryptocurrencies as an example

exotic maple May 23, 2021, 12:53 AM

#

Oh so you're just using financial data to learn how to plot in Python? is that correct?

humble nest May 23, 2021, 12:54 AM

#

yup

exotic maple May 23, 2021, 12:54 AM

#

that would have been a lot faster to say lol

humble nest May 23, 2021, 12:54 AM

#

im just learning how to manipulate and plot csv files

exotic maple May 23, 2021, 12:54 AM

#

ok 1st recommendation

#

Learn Pandas

#

what plotting library are you using? Matplotlib?

humble nest May 23, 2021, 12:54 AM

#

yea

#

using matplotlib

exotic maple May 23, 2021, 12:55 AM

#

Do you know Pandas?

humble nest May 23, 2021, 12:55 AM

#

and pandas as well

#

i just used pandas to make it read the csv file so i can manage it

exotic maple May 23, 2021, 12:55 AM

#

ok perfect

#

make a copy of your dataframe so you dont have to read it again in case of errprs

#

good, so, before going to visualization, try tampering a bit with your data

#

df2 = df.copy()

#

on that 2nd dataframe, try calculating pct_change for each crypto

#

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.pct_change.html

#

and plot that, NOT the values

#

you can also find their correlation, etc

#

maybe fit a linear model and calculate the RMSE (root mean squared error) of trying to predict ETH through Doge

humble nest May 23, 2021, 12:58 AM

#

most of the time i was just trying to find the proper csv file to use

#

and i am actually going to do predictions soon but thats for another time

#

for now im doing something as simple as plotting 2 datasets together

exotic maple May 23, 2021, 12:58 AM

#

ok ignore regression

#

do the pct_change thing

#

and plot that

humble nest May 23, 2021, 12:59 AM

#

alright

#

but what percentage should i change it to

exotic maple May 23, 2021, 12:59 AM

#

?

humble nest May 23, 2021, 12:59 AM

#

like

exotic maple May 23, 2021, 12:59 AM

#

you dont set it manually lol

#

you calculate it

humble nest May 23, 2021, 12:59 AM

#

oh

#

then i plot the proportion of the value?

exotic maple May 23, 2021, 1:00 AM

#

no...the pct_change

exotic maple May 23, 2021, 1:00 AM

#

humble nest then i plot the proportion of the value?

wha does this mean?

humble nest May 23, 2021, 1:01 AM

#

like i plot a percentage of the value so it doesnt have a huge value difference

#

so i can actually see the graph move better

#

but i feel like that wouldnt be such a great idea anyways

humble nest May 23, 2021, 1:01 AM

#

exotic maple no...the pct_change

thing is how do i use that syntax

#

what information do i need to put in it

#

actually i wont bother you and try to read the documentations to get a more vivid understanding

exotic maple May 23, 2021, 1:03 AM

#

humble nest actually i wont bother you and try to read the documentations to get a more vivi...

that's the correct approach for any code question

#

if something about the documentation is not clear, then you can specifically ask that and it will be much easier to help you 🙂

humble nest May 23, 2021, 1:04 AM

#

aight

#

sounds great

#

shall we dm just in case because i feel like we've flooded the chat way too much

#

anyways gonna go afk

#

oh btw sorry about the confusion, the reason why the y axis is messed up is because its values are made up of the ethereums prices, which is why it keeps going straight

#

guess i might need to set some index on the y axis

grim patrol May 23, 2021, 2:09 AM

#

How do you add a softmax regression layer to an RNN model with autoencoder? using TensorFlow

#

I implemented the class to encode/decode and call

#

but I don't really understand how to take that output and add a layer on top of it

near cosmos May 23, 2021, 4:50 AM

#

exotic maple and plot that, NOT the values

I generally recommend always looking at the raw values also. You'll learn things even if that's not the conventional view of the data.

median ember May 23, 2021, 5:48 AM

#

how can I use numpy.isin for 2d arrays? like if I have a [[0,1], [1,2]] and I want to search for [[0,1]] so that it returns [True, False]?

exotic maple May 23, 2021, 5:50 AM

#

near cosmos I generally recommend always looking at the raw values also. You'll learn things...

Raw values should be observed? Not sure what this means aside from that

#

Obv you must always checl your raw data first 😋

near cosmos May 23, 2021, 6:44 AM

#

exotic maple Obv you must always checl your raw data first 😋

Yes, that's what I mean. For a newbie, maybe it's not obvious

gray plover May 23, 2021, 6:55 AM

#

can i block a response from the package in chatbotAI like it keeps asking about my family n i wanna remove that response

minor grove May 23, 2021, 7:26 AM

#

I have a basic question but not sure exactly how to ask but here goes. I have 2 data sets one set is a set of successful transactions and the others are where I had a failure. The failures don't seem to have a pattern that i can spot visually. How could I figure out what combinations of factors and the order of that combination that led to the failure programmatically or using ML?

minor grove May 23, 2021, 7:44 AM

#

Can anyone help?

velvet thorn May 23, 2021, 7:48 AM

#

median ember how can I use numpy.isin for 2d arrays? like if I have a [[0,1], [1,2]] and I wa...

use ==

#

!e

import numpy as np

a = np.array([[0, 1], [1, 2], [2, 3]])
b = np.array([0, 1])

print((a == b).all(axis=1))

arctic wedgeBOT May 23, 2021, 7:49 AM

#

@velvet thorn :white_check_mark: Your eval job has completed with return code 0.

[ True False False]

velvet thorn May 23, 2021, 7:50 AM

#

grim patrol but I don't really understand how to take that output and add a layer on top of ...

just call that output with another layer

velvet thorn May 23, 2021, 7:50 AM

#

minor grove I have a basic question but not sure exactly how to ask but here goes. I have 2 ...

do you have any ML experience

minor grove May 23, 2021, 7:51 AM

#

I am trying to get some experience with it, so this was basically my first project that I chose to try and understand.

#

Mainly trying to find out where to start looking to be able to solve this problem.

velvet thorn May 23, 2021, 7:52 AM

#

minor grove I am trying to get some experience with it, so this was basically my first proje...

so that's a no?

#

i.e. you don't know how to perform EDA, preprocess data, model relationships, etc.

minor grove May 23, 2021, 7:52 AM

#

I am trying to go through a coursera course on ML, but yes the answer is no

velvet thorn May 23, 2021, 7:53 AM

#

hm

#

then I would suggest

#

you work on said course

minor grove May 23, 2021, 7:56 AM

#

That is what I am trying to do

midnight charm May 23, 2021, 8:13 AM

#

Hey, is it possible to use a stacking algorithm on just 3 inputs?

#

Like, I have 3 inputs(predictions) [0.999854,0.9894, 0.97802734375] and somehow I need to get a better prediction that the best one, in this case 0.999854

eager timber May 23, 2021, 9:15 AM

#

dnd ||read status||

#

hehehehehe

#

gotcha

hoary wigeon May 23, 2021, 9:55 AM

#

any good data analyst here?

#

I'm working with RFM Analysis

#

-------------------------------------
| Quantity |   UnitPrice  | Invoice |
-------------------------------------
|    6     |      10.05   |   I105  |
|   -2     |      10.05   |   I105  |
|    3     |      12.36   |   I107  |
|   -1     |      12.36   |   I107  |
-------------------------------------

hoary wigeon May 23, 2021, 10:18 AM

#

It seems I105 and I107 has returend their order

#

so i must count them in monetary analysis ?

lapis sequoia May 23, 2021, 10:30 AM

#

hey anyone knows a good docker containerfor data-science?

grave frost May 23, 2021, 11:47 AM

#

I don't know why I always manage to find weird use-cases that no one ever implements 😦

#

Anyone know anything about computing attention on multi-dimensional sequences?

vital lodge May 23, 2021, 12:05 PM

#

i have a doubt on lstm neural networks.

#

From what I saw lstm is a great algorithm for time series forecast prediction.

#

But when dealing with something like the stock market we can't be sure when the market might go down or go up, then how come lstm makes such accurate stock prices predictions?

grim patrol May 23, 2021, 12:34 PM

#

velvet thorn just call that output with another layer

How is that done in practice?
Currently my autoencoder looks like this:

class AnomalyDetector(Model):
    def __init__(self):
        super(AnomalyDetector, self).__init__()
        self.encoder = tf.keras.Sequential([
            layers.Dense(64, activation="relu"),
            layers.Dense(32, activation="relu"),
            layers.Dense(16, activation="relu"),
            layers.Dense(8, activation="relu")])

        self.decoder = tf.keras.Sequential([
            layers.Dense(16, activation="relu"),
            layers.Dense(32, activation="relu"),
            layers.Dense(64, activation="relu"),
            layers.Dense(79, activation='sigmoid')
        ])

    def call(self, x):
        encoded = self.encoder(x)
        decoded = self.decoder(encoded)
        return decoded

Do I just make the last decoder softmax activation? The article I'm implementing describes this as a separate step from the autoencoder

arctic wedgeBOT May 23, 2021, 1:18 PM

#

Hey @blissful heath!

It looks like you tried to attach file type(s) that we do not allow (). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a.

Feel free to ask in #community-meta if you think this is a mistake.

blissful heath May 23, 2021, 1:18 PM

#

How do I turn this Text into a table? I opened TXT with Pandas, but it organized in a standard way, the big problem that this file does not have a delimiter. I thought of a logic according to the position where the characters of each line start (example: the main column comes from lines that have their first character in position 8, while the secondary column comes from lines where the characters start in position 15. But I am not able to develop the logic, can someone help me?

#

My goal

dusky granite May 23, 2021, 1:29 PM

#

I have converted the model to a .tflite but am unable to predict using it
this is what i used to convert the model

#

converter = tf.lite.TFLiteConverter.from_keras_model(newmodel)
tflite_model = converter.convert()

with open('numeric_values-model.tflite', 'wb') as f:
  f.write(tflite_model)```and this is how i predicted in colab
```to_predict=tf.constant(np.array([[2.0,2.0,6.0,6.2]]))
predictions=newmodel.predict(to_predict)
SPECIES = ['Setosa', 'Versicolor', 'Virginica']
for prediction in predictions:
  index=np.argmax(prediction)
  print(SPECIES[index],prediction[index])```want to be able to use it with tflite
don't know how to do so

raven knoll May 23, 2021, 1:37 PM

#

Does anyone seen this error before. I am using dask and Im trying to use the countvectorizer but when I fit_transform the data I get an error and I tried a lot of stuff to solve it but no luck

lapis sequoia May 23, 2021, 2:37 PM

#

smokingDeaths = fatalities[(fatalities['ICD10 Diagnosis'] == "All deaths which can be caused by smoking") & (fatalities['Sex'].isnull() != True)]
smokingDeathsMaleYears = []

for each in smokingDeaths[smokingDeaths['Sex'] == 'Male']['Year']:
    smokingDeathsMaleYears.append(each)

    
smokingDeathsFemaleYears = []
for each in smokingDeaths[smokingDeaths['Sex'] == 'Female']['Year']:
    smokingDeathsFemaleYears.append(each)
    

smokingDeathsMaleValues = []
for each in smokingDeaths[smokingDeaths['Sex'] == 'Male']['Value']:
    smokingDeathsMaleValues.append(each)
    
smokingDeathsFemaleValues = []

for each in smokingDeaths[smokingDeaths['Sex'] == 'Female']['Value']:
    smokingDeathsFemaleValues.append(each)
plt.plot(smokingDeathsMaleYears, smokingDeathsMaleValues, label = "Male")
plt.plot(smokingDeathsFemaleYears, smokingDeathsFemaleValues, label = "Female")```
This is the code I've used to plot the graphs

#

#

but the lines are not getting plotted on the same scale for some reason

humble nest May 23, 2021, 2:53 PM

#

oh yeah thats similar to what happened to me

#

i still havent found a solution to it

#

the values in the y axis arent in proper order

lapis sequoia May 23, 2021, 2:55 PM

#

@humble nest I found the problem with mine though

#

the Y values were actually strings

#

the problem was fixed when I converted them into integers

humble nest May 23, 2021, 3:00 PM

#

OH

#

no wonder the code wasnt detecting it as being numbers

#

makes sense

#

thanks for the solution btw

boreal summit May 23, 2021, 3:55 PM

#

Hello house, anyone has link to a telegram chatbot code? I was asked to build one for something, so I was hoping I could just edit the code and stuff.

#

Thanks. 🙏🏿

serene scaffold May 23, 2021, 6:01 PM

#

lapis sequoia ```py smokingDeaths = fatalities[(fatalities['ICD10 Diagnosis'] == "All deaths w...

as a matter of code quality, I don't believe there's any benefit in this case to copying each data point over to a Python list. Simply passing expressions like smokingDeaths[smokingDeaths['Sex'] == 'Female']['Year'] to plt.plot should be sufficient, though you can also do list(smokingDeaths[...]['Year']) to get the same effect as your append for loops.

lapis sequoia May 23, 2021, 6:03 PM

#

humble nest thanks for the solution btw

Of course

lapis sequoia May 23, 2021, 6:04 PM

#

serene scaffold as a matter of code quality, I don't believe there's any benefit in this case to...

Thanks, I was thinking of doing it like that as well, but I had to change it back to storing them in variables to see what was causing the error after it didn't work out initially

serene scaffold May 23, 2021, 6:05 PM

#

lapis sequoia Thanks, I was thinking of doing it like that as well, but I had to change it bac...

smoking_death_female_year = smokingDeaths[smokingDeaths['Sex'] == 'Female']['Year'] would work

lapis sequoia May 23, 2021, 6:07 PM

#

Right,

#

that does look more readable

#

But yeah I'm just trying to make something work at the moment to reach the deadline,

#

considering the fact that my code is probably not going to be read at all

dapper halo May 23, 2021, 6:14 PM

#

is there a way to index what rows have the same values for X columns and dropping them from a dataframe?

Everything im seeing uses a loop which looks messy and can take ages depending on how large the df is

lapis sequoia May 23, 2021, 6:17 PM

#

I think there is

#

Can't remember what though, but there should be a function for that in Pandas

serene scaffold May 23, 2021, 6:17 PM

#

dapper halo is there a way to index what rows have the same values for X columns and droppin...

Can you provide me with a csv as text (no screenshot) that I can copy and explain in more detail what you're trying to do?

#

Please ping me when you do this or I will not know that you have done it.

serene scaffold May 23, 2021, 6:24 PM

#

dapper halo is there a way to index what rows have the same values for X columns and droppin...

I'm not sure if you're still here. If you know how to use masks in pandas, this will help you figure it out: https://stackoverflow.com/questions/22701799/pandas-dataframe-find-rows-where-all-columns-equal

Stack Overflow

Pandas Dataframe Find Rows Where all Columns Equal

I have a dataframe that has characters in it - I want a boolean result by row that tells me if all columns for that row have the same value.

For example, I have

df = [ a b c d

0 'C' 'C...

arctic wedgeBOT May 23, 2021, 6:26 PM

#

Hey @dapper halo!

It looks like you tried to attach file type(s) that we do not allow (.csv). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a.

Feel free to ask in #community-meta if you think this is a mistake.

dapper halo May 23, 2021, 6:26 PM

#

ah pooo it didnt like it

lapis sequoia May 23, 2021, 6:26 PM

#

💀

serene scaffold May 23, 2021, 6:26 PM

#

dapper halo ah pooo it didnt like it

You have to copy and paste it as text into the chat

dapper halo May 23, 2021, 6:26 PM

#

probably why you said as text

#

yeah lmao

lapis sequoia May 23, 2021, 6:26 PM

#

How big is the file

serene scaffold May 23, 2021, 6:27 PM

#

I only need a sample

lapis sequoia May 23, 2021, 6:27 PM

#

but you can always just take like 10 rows

#

work with it

#

apply it on the whole

#

yeah

serene scaffold May 23, 2021, 6:27 PM

#

whatever print(df.head().to_csv()) prints out basically

lapis sequoia May 23, 2021, 6:28 PM

#

Unless you have 300 columns per row for some f'ed up reason

dapper halo May 23, 2021, 6:29 PM

#

Nh,Redshift,Metallicity,Density,N_SiII,N_SiIII,N_SiIV
15,0.25,-2,-1,12,14,13.5
15.5,0.25,-1.5,-2,12,12,13.5
16,0.25,-2,-2.5,12,12,12
16.5,0.25,-3,-1.5,13.75,13,14

@serene scaffold

serene scaffold May 23, 2021, 6:29 PM

#

dapper halo Nh,Redshift,Metallicity,Density,N_SiII,N_SiIII,N_SiIV 15,0.25,-2,-1,12,14,13.5 1...

while I saw this, adding a mention to a message after the fact does not trigger a ping.

dapper halo May 23, 2021, 6:30 PM

#

Man my incompetence is shining today

serene scaffold May 23, 2021, 6:30 PM

#

dapper halo Man my incompetence is shining today

Don't worry about it. So, which are the columns in question?

dapper halo May 23, 2021, 6:30 PM

#

the N_Six

lapis sequoia May 23, 2021, 6:30 PM

#

dapper halo Man my incompetence is shining today

Happens to the best of us

#

N_six?

dapper halo May 23, 2021, 6:31 PM

#

id like to ignore Nh, Redshift, metallicity, and density.

Only focus on the last 4 or just any grouping

lapis sequoia May 23, 2021, 6:31 PM

#

Which one's N_six

#

what?

serene scaffold May 23, 2021, 6:31 PM

#

dapper halo the N_Six

look into DataFrame.eq
pick one arbitrary column and see if the other two are equal to it. since they all have to be the same, it doesn't matter which you pick

dapper halo May 23, 2021, 6:31 PM

#

yeah. I'll set the N_Six to some threshold of 12 or whatever. If all of those columns have the same value the network cant train on em

serene scaffold May 23, 2021, 6:32 PM

#

also look into .all

dapper halo May 23, 2021, 6:32 PM

#

thank ya thank ya. Ill check em out

lapis sequoia May 23, 2021, 6:33 PM

#

?

#

No idea what you guys just talked about but I hope it works out for you

dapper halo May 23, 2021, 6:34 PM

#

lapis sequoia Which one's N_six

the last three were N_Six...one is Silicon II, Silicon III, Silicon IV

lapis sequoia May 23, 2021, 6:34 PM

#

Oh

#

three of the columns make up N_six?

#

I see

dapper halo May 23, 2021, 6:35 PM

#

x just meant which state of silicon. but yes

lapis sequoia May 23, 2021, 6:37 PM

#

I see

#

Makes sense

#

@serene scaffold Is it just me or do you sound like you've worked as a professional recruiter once in your life

late shell May 23, 2021, 6:51 PM

#

Why do people say that one can learn ML/AI without being good at math. That seems absolute BS. I'm trying to understand what the hell maximum likelihood is for logistic regression and it's taken me hours and I still can't quite wrap my head around it.

lapis sequoia May 23, 2021, 6:51 PM

#

Not to be too blunt, but yeah you do sound kinda Pro

lapis sequoia May 23, 2021, 6:51 PM

#

late shell Why do people say that one can learn ML/AI without being good at math. That seem...

Because StackOverFlow my brother

#

Also, I think everybody's good at math

#

they just might not know how to apply it though

#

If you can do Multiplication, Division, Subtraction, Addition

#

your brain's pretty capable of applying mathematical concepts to solve problems

serene scaffold May 23, 2021, 6:56 PM

#

lapis sequoia <@!253696366952316929> Is it just me or do you sound like you've worked as a pro...

A professional recruiter? No.

grave frost May 23, 2021, 7:24 PM

#

late shell Why do people say that one can learn ML/AI without being good at math. That seem...

whoever says that doesn't know ML/AI at all then 🤷 easy way to weed out the scammers

lapis sequoia May 23, 2021, 7:25 PM

#

grave frost whoever says that doesn't know ML/AI at all then 🤷 easy way to weed out the sca...

StackOverflow my brother

#

if you find yourself having to deal with some math in ML/AI

#

just post it on SOF

stark zenith May 23, 2021, 8:25 PM

#

Knowing everyone looks stuff up on SO makes me feel better about having to look up stuff on SO. But it also makes me worry that I do not truly understand what I am doing.

exotic maple May 23, 2021, 8:40 PM

#

late shell Why do people say that one can learn ML/AI without being good at math. That seem...

Most people who say that already have a background in math and feel its somewhat trivial, or they're just bullshitters lol

#

but aside from that, most things in the math (that i've seen) is just keeping a cool mind and figure out the logic of how things should be

#

for example for backpropagation, which is just partial derivatives to minimize the cost function at the end of the NN. Conceptually it's "simple" but applying or building that for M layers of N neurons...ugh

cedar sun May 23, 2021, 9:01 PM

#

hi guys, if any of u is interested in helping me making a data set pls ping me. Google api doesnt allow too many requests per user, So it will be faster if someone of u cooperate. the image data set will be about pokemons. I have an script already

serene scaffold May 23, 2021, 9:04 PM

#

cedar sun hi guys, if any of u is interested in helping me making a data set pls ping me. ...

what kind of dataset are you trying to create? maybe there's a way to request that information from Google

cedar sun May 23, 2021, 9:05 PM

#

yeah, but u cant do infinite per day

#

using Google Image Search api

lapis sequoia May 23, 2021, 9:18 PM

#

Hi, I'd like to get back in to programming for the purpose of being able to scrape data from websites. I've done a full semester of Intro to Compsci with Python five years ago. What's a good book or course that I can jump into?

cedar sun May 23, 2021, 9:23 PM

#

serene scaffold what kind of dataset are you trying to create? maybe there's a way to request th...

btw, do u mean contacting with google?

bronze skiff May 23, 2021, 9:25 PM

#

lapis sequoia <@!253696366952316929> Is it just me or do you sound like you've worked as a pro...

gotta work first before one recruits

serene scaffold May 23, 2021, 9:40 PM

#

bronze skiff gotta work first before one recruits

There's no reason to expect that they'd know anything about my employment history.

cedar sun May 23, 2021, 9:47 PM

#

steler ignores me

stuck swallow May 23, 2021, 9:49 PM

#

is there any ai library that allows you to generate images from a dataset? Similar to how https://thispersondoesnotexist.com/ does it

This Person Does Not Exist

bronze skiff May 23, 2021, 9:50 PM

#

stuck swallow is there any ai library that allows you to generate images from a dataset? Simil...

build a generative model and produce those images

stuck swallow May 23, 2021, 9:50 PM

#

what library is the best for that? i heard opencv is only for rigid objects is that fine?

cedar sun May 23, 2021, 10:28 PM

#

@serene scaffold:(

solar nest May 23, 2021, 10:28 PM

#

hi all!

serene scaffold May 23, 2021, 10:29 PM

#

cedar sun <@!253696366952316929>:(

Yes? I'm waiting to hear back from a friend who works at Google to ask if you can request data from them.

cedar sun May 23, 2021, 10:29 PM

#

ah lol

#

okey okey

#

tyvm tho lol

lapis sequoia May 23, 2021, 10:30 PM

#

Hi guys, quick question as i am a bit confused. I want to predict the UnitPrice which is my target, should I use standardscaler and then do one hot encoding?

solar nest May 23, 2021, 10:31 PM

#

trying to plot some data with python for the first time. it's a CSV file with 4 columns: datetime, a, b and c. i already learned how to work with 2 columns: just add "squeeze=True". but with 4 columns, how can i treat this as a time series and get a line plot with three different lines (which, to make it worse, have totally different maximum values)?

tiny flax May 23, 2021, 10:36 PM

#

You load it into pandas split the columns into series now you have 4 variables each for one column
Plt.plot(date,a)
plt.plot(date,b)
…

#

Plt.show()

grave frost May 23, 2021, 10:38 PM

#

stuck swallow what library is the best for that? i heard opencv is only for rigid objects is t...

as pastafish said, you need a generative model for that. It's not something that can be done with OpenCv - generating human faces is not easy

solar nest May 23, 2021, 10:41 PM

#

@tiny flax
i'm interpreting that to mean i should write something like

series = read_csv('data.csv', header=0, index_col=0, parse_dates=True)
pyplot.plot(series[0], series[1])
pyplot.show()

which unfortunately gives KeyError: 0

tiny flax May 23, 2021, 10:42 PM

#

No thats not a series its a dataset

stuck swallow May 23, 2021, 10:42 PM

#

grave frost as pastafish said, you need a generative model for that. It's not something that...

Is generative model a library? Or what libraries support it

tiny flax May 23, 2021, 10:42 PM

#

Series is 1D

#

Im on my phone honestly typing code is hard

solar nest May 23, 2021, 10:43 PM

#

yeah i noticed 😛

#

it's alright perhaps someone else will come along

#

ah wait now i'm getting somewhere

#

index_col=0 means that i can just say .plot(data['a'])

grave frost May 23, 2021, 10:46 PM

#

stuck swallow Is generative model a library? Or what libraries support it

no - generative models are a flavour of Neural Networks. it's not a library. if you want to understand them in-depth, I recommend you get started with ML (the pinned messages provide a good starting point)

tiny flax May 23, 2021, 10:46 PM

#

solar nest `index_col=0` means that i can just say `.plot(data['a'])`

Yeah

solar nest May 23, 2021, 10:51 PM

#

each of the .plot() calls returns a Line2D object, which does not understand set_ylim() ... how is the latter now accessible?

tiny flax May 23, 2021, 10:51 PM

#

I turned on my laptop yay

solar nest May 23, 2021, 10:52 PM

#

🙂

tiny flax May 23, 2021, 10:59 PM

#

solar nest each of the `.plot()` calls returns a `Line2D` object, which does not understand...

yeah set y limit is not for line2D its used in axes

solar nest May 23, 2021, 11:01 PM

#

but then how do you access set_ylim after having done line = plt.plot(a)?

tiny flax May 23, 2021, 11:04 PM

#

#

@solar nest

#

plt.ylim()

solar nest May 23, 2021, 11:05 PM

#

that works but is not specific to each line

#

problem is, a goes from 0 to 10, b goes from 0 to 12000 and c goes from 0 to 4096

tiny flax May 23, 2021, 11:08 PM

#

solar nest problem is, a goes from 0 to 10, b goes from 0 to 12000 and c goes from 0 to 409...

in that case if I want to limit it I would select the series and select all values greater than a number

#

its a hassle but easier than to set a y limit

#

I think

solar nest May 23, 2021, 11:09 PM

#

huh

#

nono i can't drop them

#

i need to scale them

#

there must be a way

tiny flax May 23, 2021, 11:11 PM

#

like bigger step values?

#

in the graph i mean

solar nest May 23, 2021, 11:12 PM

#

no, multiple, independent y axes ..

lapis sequoia May 23, 2021, 11:13 PM

#

Anybody know any similar discord servers for R?

exotic maple May 23, 2021, 11:13 PM

#

perhaps you can try using subplots?

solar nest May 23, 2021, 11:14 PM

#

ah! found something! https://stackoverflow.com/questions/46011940/how-to-plot-two-pandas-time-series-on-same-plot-with-legends-and-secondary-y-axi

Stack Overflow

How to plot two pandas time series on same plot with legends and se...

I want to plot two time series on the same plot with same x-axis and secondary y-axis. I have somehow achieved this, but two legends are overlapping and is unable to give label to x-axis and second...

#

@exotic maple no, i also later need to group it by day and put each day in a subplot.

near cosmos May 24, 2021, 12:48 AM

#

solar nest <@!263491859173736449> no, i also later need to group it by day and put each day...

Seaborn is the easiest way to do faceting like that in python

slate hollow May 24, 2021, 2:15 AM

#

https://stackoverflow.com/questions/31593201/how-are-iloc-and-loc-different

Stack Overflow

How are iloc and loc different?

Can someone explain how these two methods of slicing are different?
I've seen the docs,
and I've seen these answers, but I still find myself unable to understand how the three are different. To me,...

#

so this thread covers loc vs iloc

#

is there a similar thread for those two vs the raw index operator

velvet thorn May 24, 2021, 2:49 AM

#

slate hollow is there a similar thread for those two vs the raw index operator

that just gives you columns

slate hollow May 24, 2021, 2:54 AM

#

oh ok

tiny flax May 24, 2021, 3:52 AM

#

How do you get the random_state in sklearn?

#

Coz on one random state I see 5% better accuracy so I wanted to get it in a variable

#

Like after training the model

strange oriole May 24, 2021, 4:02 AM

#

i will setup the server rn wait a sec

#

but

simple epoch May 24, 2021, 4:02 AM

#

🗿

strange oriole May 24, 2021, 4:02 AM

#

if you type #tweet

#

it should @ you and say undefined coz i dont have the python server up

#

made a tweet generator

#

#tweet

random aurora May 24, 2021, 4:06 AM

#

#tweet hello

strange oriole May 24, 2021, 4:06 AM

#

@random aurora undefined

#

@random aurora undefined

random aurora May 24, 2021, 4:06 AM

#

😢

#

#tweet hello

strange oriole May 24, 2021, 4:07 AM

#

@random aurora undefined

#

@random aurora undefined

random aurora May 24, 2021, 4:07 AM

#

bruh it not working @strange oriole

strange oriole May 24, 2021, 4:07 AM

#

lol

#

try again

random aurora May 24, 2021, 4:07 AM

#

ok

#

#tweet hello

strange oriole May 24, 2021, 4:07 AM

#

@random aurora undefined

#

@random aurora undefined

random aurora May 24, 2021, 4:08 AM

#

#tweet hello

strange oriole May 24, 2021, 4:08 AM

#

@random aurora helloge man all when I us dorn tweet

dog

gay

conspiracy posting

prayn hfw

#

@random aurora undefined

random aurora May 24, 2021, 4:08 AM

#

YOO!!

#

ITS WORKING

#

cool @strange oriole

dim olive May 24, 2021, 4:11 AM

#

#tweet hello

strange oriole May 24, 2021, 4:11 AM

#

@dim olive hello are ifto you sent her frears esees is mo in make of Keemstar seesing a 12-tee.

#

@dim olive undefined

dim olive May 24, 2021, 4:12 AM

#

!ban 846208222225891329 selfbotting is against discord ToS

arctic wedgeBOT May 24, 2021, 4:12 AM

#

:incoming_envelope: :ok_hand: applied ban to @strange oriole permanently.

dim olive May 24, 2021, 4:13 AM

#

!ban 518944568302108712 it appears you are only here to help your friend with a selfbot. This is against ToS, we do not want this in our community.

arctic wedgeBOT May 24, 2021, 4:13 AM

#

failmail :ok_hand: applied ban to @random aurora permanently.

dapper halo May 24, 2021, 4:14 AM

#

👋

lapis sequoia May 24, 2021, 4:15 AM

#

👋

#

What's selfbotting

#

Are they pretending to be bots?

dim olive May 24, 2021, 4:17 AM

#

it is when you automate your user account in discord. It is against ToS

lapis sequoia May 24, 2021, 4:18 AM

#

?

#

Why would that be against ToS? sounds random as hell

#

Maybe because then it'd be easier to spam though

dapper halo May 24, 2021, 4:19 AM

#

What would automating your user account be useful for?

#

outside of just developing a bot....which its probably not their primary user account so thats just all it is for

inland zephyr May 24, 2021, 5:38 AM

#

Hello i need your suggestion about my previous question about siamese neural network for multiple person face recognition

#

The one idea that fly to my mind is instead using SNN for directly make the similarity calculation, I use the NN feature (from n-variation of image) then store it on database. Then the new image come, feature extracted then i check the similarity based on stored feature in paralleled?

#

Since as far as i know, general NN also good to create the feature

coral kindle May 24, 2021, 7:35 AM

#

I usually use selenium and BeautifulSoup for webscraping

#

Idk how Scrapy is different

jade chasm May 24, 2021, 7:51 AM

#

Hey guys, we are using Pytorch. After a while, all class probabilities converge to close to 0.99, making the model a random number generator. Anyone know any ways to deal with this?

#

We have tried adding L1 norm by adding the number of parameters to the loss, we have used batch normalization in the linear/convolutional layers and we have added gradient norm clipping

boreal mulch May 24, 2021, 9:53 AM

#

ok

abstract moon May 24, 2021, 10:47 AM

#

Hello Everyone. I am new to python. Have been coding in java and C++ and mainframes uptill now. I am facing some issues using the Pandas package in python. I know how to do data manipulation in java and C++ through loops. But in python it takes a lot of time. So i switched to Pandas and it is great!!!. I have 31 rows and 5 columns in excel sheet. I want to divide the 15th row data by 1st row and so on uptill 30th row data is divided by 15th row. And write the output in same file in the next columns or even next sheet would do. Could you please help me out.

ripe forge May 24, 2021, 10:55 AM

#

The simplest way that's Also stupid fast, I'd say, is to create "shifted" columns in pandas. Take a look at shift method

#

Then once shifted columns are created, just divide

#

No iteration needed, no loops needed, and then you can save the output as you prefer

abstract moon May 24, 2021, 11:00 AM

#

Thanks @ripe forge for yourinput. If i understand correctly. I will have to make a copy of my data and shift that to 15 columns down and then divide the value of one dataframe with other.

ripe forge May 24, 2021, 11:04 AM

#

Yep exactly. There's a .shift method in pandas that makes this easy

cedar sun May 24, 2021, 11:04 AM

#

Guys, just one thing

#

If i have a pretrained model

#

But i dont have the data set it was trained with

#

But i wanna train it with more augmented data, can i with my own dataset?

ripe forge May 24, 2021, 11:06 AM

#

Yes, that sounds a bit like what we do in transfer learning in any case.

#

The only caveats is, if this original data was also directly relevant to you, then each iteration with the new dataset may erode some of the learnings specific to older dataset.

cedar sun May 24, 2021, 11:07 AM

#

Yeah, i would like to use the same dataset

ripe forge May 24, 2021, 11:07 AM

#

You can mitigate this somewhat by freezing the initial layers but I'd suggest very few iterations and freezing both.

cedar sun May 24, 2021, 11:07 AM

#

But i havent it

#

Freezing which ones?

ripe forge May 24, 2021, 11:08 AM

#

Maybe freeze everything except the last layer to begin

cedar sun May 24, 2021, 11:08 AM

#

Lol

#

Mm okey i will try

#

The model is inception

ripe forge May 24, 2021, 11:09 AM

#

Oh. Then the original data isn't directly relevant to you is it?

#

What's the task for your model? Ie what are you trying to predict

cedar sun May 24, 2021, 11:11 AM

#

It is

#

I was trying to predict pokemons

#

But sadly, if the pokemon is colored, it fails to predict. So i made a generator that extends the keras one wich adds different colors to the img, so i pretend the nn to focus on the shape too

ripe forge May 24, 2021, 11:13 AM

#

And the original inception data for your model is also on Pokémon?

cedar sun May 24, 2021, 11:13 AM

#

Yes

ripe forge May 24, 2021, 11:13 AM

#

Ah OK

cedar sun May 24, 2021, 11:13 AM

#

Sec, let me see if i find it

ripe forge May 24, 2021, 11:14 AM

#

I'm confused then, how'd you create modified images if you don't have original data?

cedar sun May 24, 2021, 11:15 AM

#

With my own images from the pokemons

#

:D

ripe forge May 24, 2021, 11:15 AM

#

Oh ok. OK then got it, all my original statements apply. Your model performance may deteriorate if you overdo this

#

Maybe you could consider an alternative, convert your input to greyscale and see if the model is able to predict Pokémon from that as is. I suppose this depends on how the original model was trained

#

Ie instead of having the model deal with coloured images, have a preprocessing that deals with it such that the model doesn't have to.

cedar sun May 24, 2021, 11:17 AM

#

Nah, gray scale it fails

#

I tried

ripe forge May 24, 2021, 11:17 AM

#

Fair enough.

cedar sun May 24, 2021, 11:17 AM

#

I bet original data was colored pokemons

ripe forge May 24, 2021, 11:17 AM

#

Then the original must be using colours

cedar sun May 24, 2021, 11:17 AM

#

So it pays attention to colors

#

So i thought about 2 ways

#

The first is retraining with this color modification augmented data

ripe forge May 24, 2021, 11:18 AM

#

It would be ideal if you had original data

cedar sun May 24, 2021, 11:18 AM

#

And the second, modifying the input layer to recieve the mask of the pokemon aswell

#

As the "shape" of the pokemon

#

I have another nn which returns the mask of an image. It is called u2net

#

Cuz when i was child, i remember pokemon had something like "who is this pokemon?" And only the shape was showm

#

And i was able to guess it

#

So maybe a nn can too :D

modern beacon May 24, 2021, 11:28 AM

#

is there a module for generating responses to input based on training data?

upper spade May 24, 2021, 11:33 AM

#

yo guys i just finished learning pandas

#

took alot of my brainpower

#

but not sure if i really get it yet

#

is there any sort of project or wtv that i can do

#

to know if i really get it

eager timber May 24, 2021, 12:07 PM

#

hehehe the first thing i notice

#

in #ot0-psvm’s-eternal-disapproval

cedar sun May 24, 2021, 12:14 PM

#

@serene scaffold hello dude, did u get any reply?

cedar sun May 24, 2021, 1:36 PM

#

I have one question... idk if it will be possible but

#

I am trying to make a pokemon classifier

#

and i have 898 classes

#

but there are pokemons such as Primal or what ever, which are the same but different shape, w/e

#

The thing is i downloaded a model

#

with a 928 classes output, cuz for it, kyogre != primal kyogre

#

so it is on a different class

#

Can i remove the last output layer of this pretrained model and reduce it to 898 classes???

#

no, right?

novel elbow May 24, 2021, 1:45 PM

#

cedar sun Can i remove the last output layer of this pretrained model and reduce it to 898...

yes, you can remove the last layer and add a new one with 898 outputs