#data-science-and-ml | Python | Page 408

wooden sail Jun 3, 2022, 8:57 PM

#

.latex $\begin{bmatrix} u_1 & u_2 & \dots & u_N \end{bmatrix} \begin{bmatrix} v_1 \ v_2 \ \vdots \ v_N \end{bmatrix} = \sum{n=1}^N u_n v_n$

strange elbowBOT Jun 3, 2022, 8:57 PM

#

Failed to render input.

View Logs

wooden sail Jun 3, 2022, 8:57 PM

#

this is not as helpful as the texit bot

#

.latex $\begin{bmatrix} u_1 & u_2 & \dots & u_N \end{bmatrix} \begin{bmatrix} v_1 \ v_2 \ \vdots \ v_N \end{bmatrix} = \sum{n=1}^N u_n v_n$

strange elbowBOT Jun 3, 2022, 8:58 PM

#

$latex.png$

wooden sail Jun 3, 2022, 8:58 PM

#

this is terrible, it doesn't update on edit. i'm sorry about the spam

#

.latex $\begin{bmatrix} u_1 & u_2 & \dots & u_N \end{bmatrix} \begin{bmatrix} v_1 \ v_2 \ \vdots \ v_N \end{bmatrix} = \sum_{n=1}^N u_n v_n$

strange elbowBOT Jun 3, 2022, 8:58 PM

#

$latex.png$

wooden sail Jun 3, 2022, 8:59 PM

#

@scenic tulip finally, here we go. i just did this several times. after you've seen it enough times, you can intuit the operations in your head for simple transformations

scenic tulip Jun 3, 2022, 9:00 PM

#

@wooden sail yeah that's sweet. I've never heard of latex but it allows you to post calculations in an image that is somehow formatted?

wooden sail Jun 3, 2022, 9:01 PM

#

it's for formatting pdf documents in general, but it's famous for allowing you to nicely typeset equations and diagrams

mild dirge Jun 3, 2022, 9:02 PM

#

It's good for formal stuff like research papers

scenic tulip Jun 3, 2022, 9:04 PM

#

wow i've never heard of this but yeah....wow that's awesome stuff

serene scaffold Jun 3, 2022, 9:47 PM

#

scenic tulip <@467435887236612106> yeah that's sweet. I've never heard of latex but it allow...

a lot of people think of latex as the "math formatting language", but it's really for general-purpose typesetting. think "microsoft Word but code"

#

like, you can even set variables and stuff.

tidal bough Jun 3, 2022, 9:57 PM

#

it's like word, except you actually have an idea what's going on with your document

serene scaffold Jun 3, 2022, 10:30 PM

#

tidal bough it's like word, except you actually have an idea what's going on with your docum...

except for when you get unexpected behavior

#

I was using a macro that unexpectedly added exclamation points, and that isn't even what the macro is specified to do.

tidal bough Jun 3, 2022, 10:31 PM

#

I actually learned just today that you're supposed to, in align, put & right before the alignment point, not after

#

I was aligning a ton of shit by spaces. 😔

serene scaffold Jun 3, 2022, 10:31 PM

#

I thought the & was the alignment point

tidal bough Jun 3, 2022, 10:32 PM

#

all I know is that if you do

x =& 5\\
y =& 10

the spaces after = get smaller than they should be

#

&= is the right way

serene scaffold Jun 3, 2022, 10:33 PM

#

BingShrug

plush jungle Jun 3, 2022, 11:36 PM

#

does anyone know why my tensorflow isn't detecting any gpus on my pc? I've got an rtx 3080

serene scaffold Jun 3, 2022, 11:39 PM

#

plush jungle does anyone know why my tensorflow isn't detecting any gpus on my pc? I've got ...

how did you install tensorflow

#

and how do you know it's not detecting your gpu

mild dirge Jun 3, 2022, 11:46 PM

#

@plush jungle ?

serene scaffold Jun 3, 2022, 11:52 PM

#

also, when I say "how do you know it's not detecting your gpu", I'm not asking "are you sure that it's not ...".

misty flint Jun 4, 2022, 1:44 AM

#

Sip

lapis sequoia Jun 4, 2022, 3:29 AM

#

@misty flint

#

shipit

#

Can I befriend you

#

I code in python

misty flint Jun 4, 2022, 3:36 AM

#

umm

#

i would hope you do

#

since we are in a python server

#

kekHands py_sun

fierce loom Jun 4, 2022, 3:42 AM

#

Is there any AI developer community of python

sinful spire Jun 4, 2022, 5:08 AM

#

hey everybody, is there an app or something which can be able to fix your code while programming, I'm doing my project, I mean is this thing existing before?

tacit basin Jun 4, 2022, 5:17 AM

#

sinful spire hey everybody, is there an app or something which can be able to fix your code w...

Sourcery extension to vscode for example can refactor the code for you

edgy agate Jun 4, 2022, 7:06 AM

#

heyy guys //

#

i am working on a dataset .. but having some problem . please help me out

#

these are my datasets

📎 sample_submission_2zvVjBu.csv 📎 train_wn75k28.csv 📎 test_Wf7sxXF.csv

#

https://colab.research.google.com/drive/1zTxsOwwDOkNYptUdQKYjKaJtA6ltelbJ?usp=sharing and this the link of my Notebook

Google Colaboratory

gray orchid Jun 4, 2022, 7:39 AM

#

and where is your problem

serene scaffold Jun 4, 2022, 7:45 AM

#

edgy agate https://colab.research.google.com/drive/1zTxsOwwDOkNYptUdQKYjKaJtA6ltelbJ?usp=sh...

if we click the link, we have to request access. but it's easier if you create a minimal example of your problem and an explanation of what you want to have happen instead.

edgy agate Jun 4, 2022, 7:55 AM

#

serene scaffold if we click the link, we have to request access. but it's easier if you create a...

You are provided with the leads data of last year containing both direct and indirect leads. Each lead provides information about their activity on the platform, signup information and campaign information. Based on his past activity on the platform, you need to build the predictive model to classify if the user would buy the product in the next 3 months or not. ....... this is what i want to do

edgy agate Jun 4, 2022, 7:56 AM

#

gray orchid and where is your problem

here only .. above msg

serene scaffold Jun 4, 2022, 7:56 AM

#

edgy agate You are provided with the leads data of last year containing both direct and ind...

are you asking us to do it for you? what part are you having trouble with?

arctic wedgeBOT Jun 4, 2022, 7:58 AM

#

Hey @edgy agate!

It looks like you tried to attach file type(s) that we do not allow (.ipynb). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a, .csv, .json.

Feel free to ask in #community-meta if you think this is a mistake.

fiery adder Jun 4, 2022, 8:41 AM

#

Hello! I am thinking of an idea for research on the topic of parameter optimisation viewed as a language problem. Here is what I mean by that - There are already multiple big pre-trained language models such as CodeBERT which can generate good contextual embeddings for source code. So if they're used as a baseline and built upon, we can create a supervised learning pipeline that predicts code parameters which satisfy desired outcomes. For example if we have the function def f(x): return 2 + 2 * x - x*x we can ask the model to maximise it and to find that the desired x is 1. At the beginning we expect to be able to solve such simple optimisation problems, but with time we may derive methods which are able to solve for more parameters and complicated functions and probably even have such a model to optimise parameters for other ML models in the future. If achieved this approach may replace or work together with traditional hyper-parameter tuning solutions like Bayesian optimisation (which are computationally expensive since they require testing the function itself with multiple parameters).

#

One approach will be to take the problem purely as a language task and replace the desired parameter(s) with a masked token and then train a model (fine-tune pre-trained BERT-like model) to predict such tokens given desired outcomes.

#

Another approach will be to take advantage of the pre-trained NL-PL models to generate embeddings for the source code, but then use these representations in a separate regression model. In this case it might be a good idea to built some meta learning environment to better generalise to different functions and then take few-shot approach by first providing a few examples of input-result pairs and then asking for predicted parameters given a desired outcome.

#

What do you think about the idea as a whole and the proposed approaches? Do you think they're feasible and if not - why? Do you think such a study will be pointless and if so - do you have better ideas in this direction?

serene scaffold Jun 4, 2022, 8:55 AM

#

@fiery adder this there a tldr for this?

mild dirge Jun 4, 2022, 10:13 AM

#

I think this whole setup seems kinda vague, trying to make a language model predict the outcome of some given formula seems like an inefficient and likely bad way to optimize parameters

#

How would the language model even know what good parameters are?

wooden sail Jun 4, 2022, 10:34 AM

#

generating data to train this seems ghastly. either you need to check out basically all the machine learning everyone has ever done and the learned parameters, or you'd need to somehow make it self supervised and each example will involve solving a whole machine learning problem. or did you have some idea on how to circumvent this?

loud cove Jun 4, 2022, 11:10 AM

#

Hi, I'm doing KMean clustering on a article texts under the same category to get subcategories.
I'm only getting one major cluster, can someone tell me what I'm doing wrong?
I tried with lemmatizing and without,
with original text and and with cleanup.
with max features at 8k and without setting max features.

https://github.com/MAmr21/EGYFWD/blob/main/KO/Article Classification/articles classifier.ipynb

river maple Jun 4, 2022, 12:09 PM

#

can someone explain how the code for the gradient descent is theta = theta - alpha * (1/m) * (X' * ((X * theta)-y));

#

this is the formula

wooden sail Jun 4, 2022, 12:10 PM

#

you want an explanation of the math or how to code the math?

river maple Jun 4, 2022, 12:11 PM

#

how to code the math..

#

why is there no summation in that code?

wooden sail Jun 4, 2022, 12:12 PM

#

i think you need to escape some asterisks in what you wrote with a \

#

but at any rate, it looks like the expression you wrote is in terms of matrices and vectors

#

matrix-vector multiplication is itself a sum of products, just like the image you showed

#

.latex \boldsymbol{Ax} = \begin{bmatrix} \boldsymbol{A_{1,:} x} \ \boldsymbol{A_{2,:} x} \ \vdots \ \boldsymbol{A_{m,:} x} \end{bmatrix}

strange elbowBOT Jun 4, 2022, 12:14 PM

#

Failed to render input.

View Logs

wooden sail Jun 4, 2022, 12:14 PM

#

oof

#

#

here

#

you can think of a vector as an n x p matrix with p = 1

#

then you see the multiplication is indeed a sum following that definition

river maple Jun 4, 2022, 12:17 PM

#

hmm makes sense

#

but for the cost function i had to use the sum function

#

J = (1/(2 * m)) * sum(((X * theta)-y).^2)

#

oh i get it

#

Thanks for the help

young granite Jun 4, 2022, 12:33 PM

#

in row/col 3/3 i want to plot 2 x axis but i dont know how i can achieve that in the grid, im able to plot a second y-axis but x doesnt work...

scenic tulip Jun 4, 2022, 2:24 PM

#

@wooden sail you on rn?

#

Maybe someone else knows this. So I'm writing out arrays of results, containing 20 elements to a file. When it writes the output comes out as this :

#

 [  7   8   8 ...   2  -2   1]
 [ -7 -13 -14 ...  -2   5   3]
 ...
 [ -1  -3  -2 ...  -8  -6   2]
 [  2   4  15 ...   8   3   0]
 [ -2  -2  -9 ...  -1  -4  -2]]```

#

How can I view all of the in between data

mild dirge Jun 4, 2022, 2:28 PM

#

https://stackoverflow.com/questions/1987694/how-to-print-the-full-numpy-array-without-truncation

Stack Overflow

How to print the full NumPy array, without truncation?

When I print a numpy array, I get a truncated representation, but I want the full array.

Is there any way to do this?

Examples:

numpy.arange(10000)
array([ 0, 1, 2, ..., 99...

wooden sail Jun 4, 2022, 2:35 PM

#

scenic tulip ```[[ 2 10 -2 ... 0 4 1] [ 7 8 8 ... 2 -2 1] [ -7 -13 -14 ....

this shouldn't really matter, you never want to look at the entirety of a large matrix with your eyes (for the most part)

#

you could write the contents as a csv if you like

lapis sequoia Jun 4, 2022, 2:37 PM

#

Hi, I am going through deep minds RL slides by David Silver, and I have a question on moving mean and how it forgets past data.
in chapter 4, for model free RL, there is a topic on monte-carlo method that that uses incrementing mean with running average

V(St) ← V(St) + α (Gt − V(St))

here, α is supposed to be the one thing that represents a moving mean/running average. what I don't understand is how would the formula forget the past values of V(St) when we keep using it iteratively.

wooden sail Jun 4, 2022, 2:39 PM

#

if you do a couple of iterations, it might become more clear. let's replace this with a simpler nomenclature first. say, y <- y + a(x - y)

#

we can rearrange that into (1 - a)y + ax. and you probably have a condition like 0< a < 1

#

at the next iteration, instead of x, we have some other value. let's call it z.

#

then we get (1-a) [(1-a) y + ax] + az

#

we expand into (1-a)^2 y + a(1-a)x + az

#

as the sequence continues, y will get mutliplied by increasingly high powers of (1-a), and the previous values of the updates Gt too (but with a lower exponent than y)

#

since (1-a) is also between 0 and 1, the more you repeat this, the smaller the value of y, and also of the old updates

#

i wrote it that way so that you can kinda see that the algorithm produces a weighted sum at every iteration. the higher the iteration number, the smaller the weights of the older quantities

lapis sequoia Jun 4, 2022, 2:43 PM

#

thanks for taking the time to answer Edd, just give me a minute to process this

lapis sequoia Jun 4, 2022, 2:44 PM

#

wooden sail we can rearrange that into (1 - a)y + ax. and you probably have a condition like...

for this part, is the condition 0 < a < 1 often the case?

#

is it because of the a(x-y)

wooden sail Jun 4, 2022, 2:44 PM

#

it should be the case, yes

lapis sequoia Jun 4, 2022, 2:45 PM

#

ohhh, i think im getting it

#

wait, is a less than 1 because of the idea of iterative mean?

#

like the formula before α was 1/N(t), but for non-moving average

wooden sail Jun 4, 2022, 2:47 PM

#

i would have to see how your book defines this stuff, i would call it either "momentum" from the ML perspective or "convex combination" from the linalg standpoint

lapis sequoia Jun 4, 2022, 2:48 PM

#

oh, im using the RL slides from deep ai, the 2015 one, should I share the link? i think im gettting the idea tho

wooden sail Jun 4, 2022, 2:49 PM

#

but the idea, if you look at V and G as vectors, is that this operation yields a vector pointing from V to G and passing through V. this is the parametric equation of a line joining two points in N dimensional space. if alpha is equal to 0, you stay exactly at V

#

if alpha becomes 1, you move all the way to G

#

for values in between, you land on the line segment connecting them

#

setting alpha = 0 means "no change", while alpha = 1 means "forget the previous stuff entirely and just move to G"

lapis sequoia Jun 4, 2022, 2:50 PM

#

thats a lot of linear algebra words 😄

#

but im getting the idea, Ill have to dig deeper into it

#

thank you @wooden sail , I thought I would have to wait a while to get help

wooden sail Jun 4, 2022, 2:52 PM

#

glad it helps. i'm not familiar with those slides, so if you could share the link, that'd be cool. it's not like i'm a mathematician or anything either, but i've learned most of the stuff this way thanks to uni

lapis sequoia Jun 4, 2022, 2:53 PM

#

https://www.deepmind.com/learning-resources/introduction-to-reinforcement-learning-with-david-silver
its from this link, the chapter 4

Introduction to Reinforcement Learning with David Silver

Interested in learning more about reinforcement learning? Follow along in this video series as DeepMind Principal Scientist, creator of AlphaZero and 2019 ACM Computing Prize Winner David Silver, gives a comprehensive explanation of everything RL.

#

for linear algebra, did you use the "mathematics for machine learning" ?

#

I have a copy but its just sitting there cause I thought I had just enought linear algebra

wooden sail Jun 4, 2022, 2:55 PM

#

i've checked some of linear algebra done right by axler and linear algebra done wrong by treil, and also gilbert strang's linear algebra. just straight up math books

#

and then several papers and books on optimization, signal processing, etc

lapis sequoia Jun 4, 2022, 2:56 PM

#

lots of really great tips, thank you kindly

wooden sail Jun 4, 2022, 2:56 PM

#

i learned about machine learning as an application of maths, really very late into the game 😛 i don't know most of the pop nomenclature

lapis sequoia Jun 4, 2022, 2:57 PM

#

you have a very strong foundation tho, coming from math

wooden sail Jun 4, 2022, 2:57 PM

#

i'm comfortable with mangling indices and wiping my tears while staring at a piece of paper, yes

misty flint Jun 4, 2022, 2:57 PM

#

kekHands

#

my friend who also has a background in math is my go-to when i dont understand a new algorithm

#

ID_BoomKek

#

hes also very good at solving problems irl too

lapis sequoia Jun 4, 2022, 2:59 PM

#

i tried getting into linear algebra with 3b1b,

#

i guess that is way below the barrier

wooden sail Jun 4, 2022, 2:59 PM

#

actually

#

this is my hot take, but 3b1b linalg is not good to learn from

#

it's GREAT to review concepts, but NOT to learn

lapis sequoia Jun 4, 2022, 3:00 PM

#

what about khan academy?

wooden sail Jun 4, 2022, 3:00 PM

#

it's presented from the standpoint that you already learned the concepts (somewhat) or have at least heard about them

lapis sequoia Jun 4, 2022, 3:00 PM

#

i had a hard time learning from there

wooden sail Jun 4, 2022, 3:00 PM

#

khan academy is usually solid for practicing concrete problems. grinding through a few can build intuition

lapis sequoia Jun 4, 2022, 3:01 PM

#

really? I had a really hard time there, felt like the talk about determinants was different from 3b1b

#

i thought of trying gilbert strang but 3b1b 16 video playlist looked from enticing.

wooden sail Jun 4, 2022, 3:01 PM

#

did they hit you with a laplace expansion

lapis sequoia Jun 4, 2022, 3:02 PM

#

they hit me with a basic 3 equation thingy

wooden sail Jun 4, 2022, 3:02 PM

#

i think gilbert strang's book is pretty good. it won't go into more abstract stuff though

lapis sequoia Jun 4, 2022, 3:02 PM

#

hmmm, im motivated now, ill try the video playlist first tho

#

oh, if you dont mind, I have a another question on RL

#

about temporal difference

wooden sail Jun 4, 2022, 3:04 PM

#

mhm?

lapis sequoia Jun 4, 2022, 3:05 PM

#

V(St) ← V(St) + α (Rt+1 + γV(St+1) − V(St))

do you happen to know this formula?

#

for temporal difference, I have a question on it thats bothering me

wooden sail Jun 4, 2022, 3:06 PM

#

looks familiar

lapis sequoia Jun 4, 2022, 3:07 PM

#

its supposed to be used for model free RL, when we can't step into the state of next time step St+1

#

but the formula has the recursive V(St+1) in it,

#

wait, I think I am making the question more complicated

#

so, to restart, if we have the model, we could recursively call V(St+1) from V(St) which in turn calls V(St+2) from V(St+1)

#

thats what I got for a model based RL

#

but temporal difference is an algorithm thats used for model free

#

and it has the V(St+1) being used as a part of the formula to find V(St)

#

im confused on how a model free algorithm can do this

wooden sail Jun 4, 2022, 3:15 PM

#

i'm not really sure, the nomenclature in the slides is all weird to me 😛

lapis sequoia Jun 4, 2022, 3:16 PM

#

im looking for the "pain" reaction lol

#

I guess that problem is for the tomorrow me

lapis sequoia Jun 4, 2022, 4:18 PM

#

what kind of visualisation can I do to show this data

#

I am thinking of a scatterplot with equal distances on x axis for each country. With 2 coloured dots at each x denoting the value of administered vaccines for each date. With legend denoting colour of each date.

glacial sparrow Jun 4, 2022, 4:34 PM

#

anyone familiar with sklego's RBF here?

scenic tulip Jun 4, 2022, 4:57 PM

#

@wooden sail writing as csv did it...thank you!!

wooden sail Jun 4, 2022, 4:58 PM

#

cool

#

if you just need it stored for later but don't need to actually look at the matrix, consider also .npy or npz

lapis sequoia Jun 4, 2022, 5:01 PM

#

what might be the issue?

river maple Jun 4, 2022, 5:14 PM

#

why is a column of ones added to the data matrix after feature normalization?

main fox Jun 4, 2022, 5:53 PM

#

lapis sequoia what might be the issue?

You use boxplots to show how a numerical variable varies within a category.

nova matrix Jun 4, 2022, 6:18 PM

#

Guys is standard matplotlib and seaborn enough for visualisations or should we know some advanced visualisation libraries like cuff links

mild dirge Jun 4, 2022, 6:19 PM

#

that obviously depends on how complicated you need stuff to be

#

But matplotlib can do a whole lot, have never been limited so far, except for maybe 3d stuff

fiery adder Jun 4, 2022, 6:32 PM

#

wooden sail generating data to train this seems ghastly. either you need to check out basica...

I am also not sure how data can be generated efficiently. But it turns out that HPO has already been tested as a sequence problem with Transformers. https://arxiv.org/abs/2205.13320

arXiv.org

Towards Learning Universal Hyperparameter Optimizers with Transformers

Meta-learning hyperparameter optimization (HPO) algorithms from prior
experiments is a promising approach to improve optimization efficiency over
objective functions from a similar distribution....

fiery adder Jun 4, 2022, 6:33 PM

#

fiery adder Hello! I am thinking of an idea for research on the topic of parameter optimisat...

So if the community here can suggest any feasible way of generating a dataset for the described approach or if data exist for something similar?

wooden sail Jun 4, 2022, 6:40 PM

#

you see they discuss there usage of vast amounts of HPO data

#

which at google they certainly have. idk how easy it is to get that in the wild, though

#

you rely on people all over the world having solved enough problems to make this trainable

misty flint Jun 4, 2022, 6:51 PM

#

nova matrix Guys is standard matplotlib and seaborn enough for visualisations or should we k...

it literally depends on what your use case is like camel mentioned

#

but matplotlib/seaborn is pretty robust for quick visualizations

#

my personal favorite is plotly

#

theres also specific data viz software like tableau/powerBI/looker/etc.

#

but that tends to be more in the business context where you are creating something for business stakeholders

#

i.e. you need to create a dashboard showing X, Y, Z for someone in a specific business unit/function

#

if that is your world, then i highly recommend "storytelling with data" by cole knaflic

#

ok_handbutflipped

rich merlin Jun 4, 2022, 6:55 PM

#

I'm relatively new to pycharm and pandas,
does anyone have a minute to help me figure out where to start and how to make assessments on trends?

fleet musk Jun 4, 2022, 7:13 PM

#

helo friends, i am getting a warning in pandas, did some reading on stack overflow, unable to fully grasp it

#

ticker["candle"] = np.array(range(len(ticker)))%25 + 1
__main__:1: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy

#

#

how to fix it?

lapis sequoia Jun 4, 2022, 8:25 PM

#

main fox You use boxplots to show how a numerical variable varies within a category.

Yeah but that's what my column is, isn't it?

main fox Jun 4, 2022, 8:42 PM

#

lapis sequoia Yeah but that's what my column is, isn't it?

Do you have NaN values?

loud cove Jun 4, 2022, 8:56 PM

#

lapis sequoia what kind of visualisation can I do to show this data

i suggest using a map visual with these details.
and a bar chart for the top and bottom countries if you care about that.

loud cove Jun 4, 2022, 8:57 PM

#

loud cove Hi, I'm doing KMean clustering on a article texts under the same category to get...

anyone got an idea?

lapis sequoia Jun 4, 2022, 9:34 PM

#

main fox Do you have NaN values?

yeah

main fox Jun 4, 2022, 9:39 PM

#

lapis sequoia yeah

You need to drop those

lapis sequoia Jun 4, 2022, 9:43 PM

#

main fox You need to drop those

hmm worked now

lapis sequoia Jun 4, 2022, 11:08 PM

#

I code

#

hbu

misty flint Jun 5, 2022, 1:56 AM

#

if anyone's interested in RecSys, there's a series by the great chip huyen this month; starting tomorrow at 10a PT!

#

Praise

dreamy phoenix Jun 5, 2022, 1:56 AM

#

Hello all. I am having a lot of fun messing around with pyplot and I need a bit of some help.

misty flint Jun 5, 2022, 1:57 AM

#

3.5 sessions, ending with a big RecSys ML System Design Session

runic crystal Jun 5, 2022, 1:57 AM

#

Hey everyone! Can anyone confirm if we can change color of seborn catplots based on conditional statements

dreamy phoenix Jun 5, 2022, 1:58 AM

#

#

I am trying to draw a line graph with formatted percentages on the y-axis. Currently, these are formatted strings. The formatted strings are not ordered correctly, trying to sort them gives me a squiggle.

#

I think what I would need to do is find a way to format the floating point numbers as they're displayed instead of converting them to a string and formatting that.

runic crystal Jun 5, 2022, 2:01 AM

#

And bar_label doesn't seem to work with catplots either. Any Idea about it ?

dreamy phoenix Jun 5, 2022, 2:04 AM

#

okay thank you

misty flint Jun 5, 2022, 2:05 AM

#

misty flint 3.5 sessions, ending with a big RecSys ML System Design Session

dreamy phoenix Jun 5, 2022, 2:06 AM

#

from isolation import isolate_total_stub, isolate_age_stub
import matplotlib.pyplot as plt
from matplotlib.ticker import (MultipleLocator,
                               FormatStrFormatter,
                               AutoMinorLocator)

# very simple extraction, drop some columns and check some data
cdc_data = pd.read_csv('CDC_Delay_of_Care_Data.csv')
cdc_data = cdc_data.drop(columns=['INDICATOR','FLAG','UNIT'])


# do you have good data?
data_types_valid = type_check_numeric_columns(cdc_data)
acceptable_null_threshold = compare_nulls_against_threshold(cdc_data)


# separate the categories of delayed care
delay_of_medical_care = cdc_data[cdc_data.PANEL == 'Delay or nonreceipt of needed medical care due to cost']

# isolate the totals stub
total_delay_of_medical_care = isolate_total_stub(delay_of_medical_care)

x_axis = total_delay_of_medical_care.YEAR
y_axis = total_delay_of_medical_care.ESTIMATE
fig, ax = plt.subplots()

ax.plot(x_axis, y_axis)
plt.show()

runic crystal Jun 5, 2022, 2:07 AM

#

dreamy phoenix Jun 5, 2022, 2:07 AM

#

I am not using the ticker library imports at this time

#

oh sorry I thought you were talking to me. excuse me

runic crystal Jun 5, 2022, 2:09 AM

#

this is what wrote for the colors

#

the commented lines

runic crystal Jun 5, 2022, 2:31 AM

#

I gave that a try and it did not work. I am now certain that my data is wacky. I have repeated values that are true for some year and false for other years. And I was plotting year-wise graphs from my data. Those values being true for some and false for others is toasting up the library. I might just break my data into separate files rater than them being in a single file. That should do the job. Thanks anyway!

lapis sequoia Jun 5, 2022, 3:50 AM

#

dreamy phoenix I am trying to draw a line graph with formatted percentages on the y-axis. Curr...

What does the data look like

fleet sleet Jun 5, 2022, 4:20 AM

#

Hey I am a beginner ,
trying to automate data from MySQL database to spread sheet and I have all the basic libraries required, sheets api is also enabled.. created credentials for the same on GCP
Have given the right path to the credentials.json file and everything still I seem to go nowhere
Can someone please help me out ?

fleet sleet Jun 5, 2022, 8:19 AM

#

The debug log is

#

PS C:\Users\conta\OneDrive\Desktop\Workspace> & 'C:\Python310\python.exe' 'c:\Users\conta.vscode\extensions\ms-python.python-2022.6.3\pythonFiles\lib\python\debugpy\launcher' '51612' '--' 'c:\Users\conta\OneDrive\Desktop\Workspace\pyautomation\sheetsNew.py'
There is an Exception in credsLogin Function : 'module' object is not callable
Authentication DONE !
C:\Python310\lib\site-packages\pandas\io\sql.py:761: UserWarning: pandas only support SQLAlchemy connectable(engine/connection) ordatabase string URI or sqlite3 DBAPI2 connectionother DBAPI2 objects are not tested, please consider using SQLAlchemy
warnings.warn(
MID PID merchant_name locality city
0 b'242307' b'1418703' b'Ruchi Curry Point' b'Manikonda' b'Hyderabad'
1 b'243056' b'1418703' b'Ruchi Curries' b'Madhapur' b'Hyderabad'
2 b'650871' b'1418703' b'Ruchi Curries' b'Nizampet' b'Hyderabad'
3 b'1235155' b'1418703' b'Ruchi Curry Point' b'Nizampet' b'Hyderabad'
4 b'1318633' b'1418703' b'Ruchi Curry Point, Nizampet' b'Nizampet' b'Hyderabad'
Deleting Google Sheet...
There is an Exception in clearGoogleSheet Function : Could not automatically determine credentials. Please set GOOGLE_APPLICATION_CREDENTIALS or explicitly create credentials and re-run the application. For more information, please see https://cloud.google.com/docs/authentication/getting-started
Writing Google Sheet...
There is an Exception in writingGoogleSheet Function : Could not automatically determine credentials. Please set GOOGLE_APPLICATION_CREDENTIALS or explicitly create credentials and re-run the application. For more information, please see https://cloud.google.com/docs/authentication/getting-started
Part 1 Completed !

Google Cloud

Getting started with authentication | Authentication | Google C...

cinder matrix Jun 5, 2022, 10:03 AM

#

Hi guys, i've built a model which takes keywords and generates narratives. However, i find the bleu and rouge evaluation isn't appropriate for my case.

So instead am thinking of evaluating by how much the user input keywords is present in the generated text. Would this be a proper way of evaluating how much keywords permeated in the text? Does such a metric or better exists? If not, how would i proceed? Thanks and please @ so i get notified when replying

mint palm Jun 5, 2022, 10:32 AM

#

i want to simulate transfer learning

#

how do i do it?

#

i have trained my model

#

now i wanna check how it will fine tune on deployment

loud cove Jun 5, 2022, 10:49 AM

#

dreamy phoenix I am trying to draw a line graph with formatted percentages on the y-axis. Curr...

https://stdworkflow.com/269/matplotlib-solves-the-problem-that-x-axis-values-are-not-sorted-by-array

Matplotlib: solves the problem that X axis values are not sorted by...

Problem Description¶
Just look at the title. Let me show you the picture first.

The code and data corresponding to this figure are as follows. …

tacit basin Jun 5, 2022, 10:59 AM

#

mint palm i want to simulate transfer learning

What do you mean fine tune on deployment?

mint palm Jun 5, 2022, 11:14 AM

#

tacit basin What do you mean fine tune on deployment?

i mean normal fine tune

mint palm Jun 5, 2022, 11:15 AM

#

tacit basin What do you mean fine tune on deployment?

retraining sort of

spring marsh Jun 5, 2022, 11:45 AM

#

Can someone please help me on how to setup my GPU for deep learning on tensorflow

arctic wedgeBOT Jun 5, 2022, 1:26 PM

#

Hey @wooden sail!

It looks like you tried to attach file type(s) that we do not allow (.ipynb). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a, .csv, .json.

Feel free to ask in #community-meta if you think this is a mistake.

mortal cairn Jun 5, 2022, 2:02 PM

#

#

Hi, i'm trying to find the intersection point of to sets of data. Neither line cannot be defined by a mathematical function and has each about 21450 values of x and y. Any ideas of functions or libraries i can use?

serene scaffold Jun 5, 2022, 2:14 PM

#

mortal cairn Hi, i'm trying to find the intersection point of to sets of data. Neither line c...

in what format is the data? arrays? dataframes?

mortal cairn Jun 5, 2022, 2:15 PM

#

they're series I read from a csv using pandas

#

Someone mentioned using shapely so I'm trying that now

serene scaffold Jun 5, 2022, 2:16 PM

#

mortal cairn they're series I read from a csv using pandas

you can see if there are two rows where the values are the same. or you can take the difference of the two Series and see which index has the smallest difference

mortal cairn Jun 5, 2022, 2:17 PM

#

Ah yea. That's true. Thanks for the idea

wooden sail Jun 5, 2022, 2:19 PM

#

it doesn't seem like they have the same domain, so you'll have to do some padding. otherwise, that seems the easiest way (arg min (abs(diff)))

serene scaffold Jun 5, 2022, 2:20 PM

#

wooden sail it doesn't seem like they have the same domain, so you'll have to do some paddin...

is this lisp?

#

tangerine_think

pliant pewter Jun 5, 2022, 2:20 PM

#

No, it's just math

wooden sail Jun 5, 2022, 2:20 PM

#

no, that's just math

serene scaffold Jun 5, 2022, 2:20 PM

#

I was making a joke

#

also Aurendil is Edd's alt confirmed

pliant pewter Jun 5, 2022, 2:20 PM

#

Lisp would have more parentheses

wooden sail Jun 5, 2022, 2:20 PM

#

aurendil tried to joke with me before and also failed

serene scaffold Jun 5, 2022, 2:21 PM

#

!otn s lisp

arctic wedgeBOT Jun 5, 2022, 2:21 PM

#

Query results

• python-is-not-lisp

pliant pewter Jun 5, 2022, 2:24 PM

#

Are there any successful NLP joke/sarcasm detectors out there?

serene scaffold Jun 5, 2022, 2:25 PM

#

pliant pewter Are there any successful NLP joke/sarcasm detectors out there?

that's a notoriously difficult task, as sarcasm is sometimes difficult for humans to detect, and even when we can, it's relies heavily on world knowledge

pliant pewter Jun 5, 2022, 2:30 PM

#

It's kind of hard just from a language point of view, yeah. But I've noticed that lots of animals seem to have a concept of play/joking, and you can see it in their facial expression. Probably just need more information than just words.

wooden sail Jun 5, 2022, 2:32 PM

#

btw, if any of you are interested, i'm preparing this short intro to jax. specifically, looking at jit, vectorization, and automatic differentiation of functions f:C^n -> R^m (cr or wirtinger calc). the final example does something that could be understood as some form of "deep unfolding"/self supervised training/hyper parameter optimization or whatever you wanna call it. the target is undergrad people with knowledge of linalg and optimization https://github.com/3ddP/jax_example/blob/master/examples.ipynb

#

any comments and/or feedback are welcome. analytic solutions are used to corroborate the jax results, but the math isn't explained. it's expected the students will already know it

misty flint Jun 5, 2022, 2:51 PM

#

pliant pewter Are there any successful NLP joke/sarcasm detectors out there?

id be interested if you found anything

#

Sip

serene scaffold Jun 5, 2022, 3:11 PM

#

the twitter API gives you their own sentiment scores, if I remember correctly. what are you trying to do?

#

you want to get the sentiment score of individual words? I've never heard of that

#

sentiment scores will reflect the sentiment of the whole tweet

misty flint Jun 5, 2022, 3:20 PM

#

http://projector.tensorflow.org/

Embedding projector - visualization of high-dimensional data

Visualize high dimensional data.

#

pretty nifty

misty flint Jun 5, 2022, 3:21 PM

#

misty flint http://projector.tensorflow.org/

serene scaffold Jun 5, 2022, 3:21 PM

#

that sounds good

#

did you get tweepy set up?

hollow sentinel Jun 5, 2022, 3:57 PM

#

#

why does printing the head of the dataframe in thonny look like that

#

it looks gross lol

serene scaffold Jun 5, 2022, 3:59 PM

#

hollow sentinel why does printing the head of the dataframe in thonny look like that

head is a method

hollow sentinel Jun 5, 2022, 4:00 PM

#

yeah but is there a cleaner way to look at the dataframe

serene scaffold Jun 5, 2022, 4:00 PM

#

you can see that it says "bound method of ..."

#

did you try print(df.head()), where you call the method?

hollow sentinel Jun 5, 2022, 4:00 PM

#

yep that's why

serene scaffold Jun 5, 2022, 4:01 PM

#

but you're just using pandas' native printing functionality. I don't know if thonny does anything like pycharm's dataframe viewer thing

hollow sentinel Jun 5, 2022, 4:02 PM

#

i can't even open anaconda-navigator on my mac anymore

#

soooo no more uploading ipynbs to my github

#

gonna stick out like a sore thumb 💀

bold timber Jun 5, 2022, 5:41 PM

#

The column of dropoff_site have some label. How to do replacing the missing value in load_weight when dropoff_site is 'MRF'?

tacit basin Jun 5, 2022, 5:55 PM

#

mint palm i mean normal fine tune

with fastai that would be as simple as using pretrained model and using fine_tune method on learner https://docs.fast.ai/callback.schedule.html#Learner.fine_tune

Hyperparam schedule

Callback and helper functions to schedule any hyper-parameter

chilly helm Jun 5, 2022, 6:03 PM

#

can you freelance as a data scientist?

copper tinsel Jun 5, 2022, 6:10 PM

#

chilly helm can you freelance as a data scientist?

Yeah y not

mint palm Jun 5, 2022, 6:13 PM

#

tacit basin with fastai that would be as simple as using pretrained model and using fine_tun...

but i dont have the data for fine tuning

#

i want to generate that too

elder falcon Jun 5, 2022, 6:43 PM

#

misty flint http://projector.tensorflow.org/

Bro you are amazing. Can I ask with which technology you visualised data in the form of dashboard. Do reply @misty flint

fallen crane Jun 5, 2022, 7:15 PM

#

1-Is it possible to build a new programming language from scratch, as it is called 0101? Is there any knowledge currently available that helps to do that?

misty flint Jun 5, 2022, 7:17 PM

#

elder falcon Bro you are amazing. Can I ask with which technology you visualised data in the ...

bro this isnt me. this is google's tensorflow

fallen crane Jun 5, 2022, 7:17 PM

#

2- When I review some visual and read sources, all I find is a theoretical explanation of 0101's supposed work steps from the beginning, but if I can ask, how was 0101 introduced into the electronic circuit, using any technology and any knowledge?

misty flint Jun 5, 2022, 7:17 PM

#

kekHands

#

also update: chip huyen's RecSys series is off to a great start

#

1000

fervent vale Jun 5, 2022, 7:32 PM

#

Hi guys

tacit basin Jun 5, 2022, 7:49 PM

#

mint palm but i dont have the data for fine tuning

For supervised training fine tuning you need labeled data

tacit basin Jun 5, 2022, 7:54 PM

#

misty flint also update: chip huyen's RecSys series is off to a great start

I've started too many courses tbh, i wish i finished half of them 😂

misty flint Jun 5, 2022, 7:56 PM

#

tacit basin I've started too many courses tbh, i wish i finished half of them 😂

rip. it helps if you have an end-goal. for me, i might use these concepts at work possibly creating a RecSys protoype. it also helps that it's only 3.5 sessions

#

3.5 hrs total. 10a PT on sundays

#

also im super interested so im def planning on completing this one

#

and this one is less of a lecturer-student style and more of a self-study group style where peeps share more of their experiences/learnings

#

so i like that format more since its interactive

burnt island Jun 5, 2022, 8:00 PM

#

anyone with a good knowledge of SARIMAX and ARIMAX models or resources on time series forecasting.

I'm working on a personal project which has to do with crypto price modelling, I want to use SARIMAX or ARIMAX before CNN to model

tacit basin Jun 5, 2022, 8:03 PM

#

I will quit my job one day and do all these Udemy courses lol

quaint wave Jun 5, 2022, 8:05 PM

#

Hi guys, I'm currently writing up a project on the use of neural networks in detecting football tactics and stumbled across a paper which I don't understand. Would anyone be willing to help? I'll dm you the pdf

tacit basin Jun 5, 2022, 8:06 PM

#

quaint wave Hi guys, I'm currently writing up a project on the use of neural networks in det...

You could ask the question here and if someone knows the answer they will help

quaint wave Jun 5, 2022, 8:10 PM

#

tacit basin You could ask the question here and if someone knows the answer they will help

I don't even know how to frame any questions because I don't understand what the paper is saying tbh. I have an understanding of how neural nets work but this is too complicated for me. Am I allowed to upload a file here?

mint palm Jun 5, 2022, 8:13 PM

#

tacit basin with fastai that would be as simple as using pretrained model and using fine_tun...

Yup but can i somehow simulate it??

misty flint Jun 5, 2022, 8:14 PM

#

tacit basin I will quit my job one day and do all these Udemy courses lol

https://youtu.be/UvqN3bAv0pM

YouTube

Tina Huang

Why you keep quitting online courses (and then buy more)

Head to http://brilliant.org/TinaHuang/ to get started for free with Brilliant's interactive lessons. The first 200 people will also get 20% off an annual membership.

✉️ NEWSLETTER: https://tinahuang.substack.com/
It's about learning, coding, and generally how to get your sh*t together c:

In this video, I talk about why you keep quitting you...

▶ Play video

#

kekHands

#

she has some good points

#

that i feel is very relevant for people studying the topics in this channel

fleet musk Jun 5, 2022, 8:15 PM

#

helo, the help channel is very slow to help with problems sometimes

#

can i ask here

misty flint Jun 5, 2022, 8:15 PM

#

only if its related to #data-science-and-ml

#

and sometimes people cant help you here either; it just depends on the problem

fleet musk Jun 5, 2022, 8:16 PM

#

misty flint only if its related to <#366673247892275221>

helo rex,, remember me?

misty flint Jun 5, 2022, 8:16 PM

#

/availability

#

oh you are the guy that was stuck with pycharm

fleet musk Jun 5, 2022, 8:16 PM

#

ok. it is pandas related

fleet musk Jun 5, 2022, 8:16 PM

#

misty flint oh you are the guy that was stuck with pycharm

hehe.

misty flint Jun 5, 2022, 8:16 PM

#

just ask it

fleet musk Jun 5, 2022, 8:17 PM

#

tacit basin Jun 5, 2022, 8:17 PM

#

mint palm Yup but can i somehow simulate it??

Simulate what?

misty flint Jun 5, 2022, 8:17 PM

#

eww financial data RunFail

fleet musk Jun 5, 2022, 8:17 PM

#

using spyder IDE
i have extracted stock data and put it into a dataframe

fleet musk Jun 5, 2022, 8:17 PM

#

misty flint eww financial data <:RunFail:793712787692060723>

😦

mint palm Jun 5, 2022, 8:18 PM

#

For example for making a model that allocate resources based on parameters, can we simulate those condition??

fleet musk Jun 5, 2022, 8:19 PM

#

i ask miwojo then

mint palm Jun 5, 2022, 8:19 PM

#

tacit basin Simulate what?

But i think now that if we can simulate manually then whats the need of nn@misty flint

misty flint Jun 5, 2022, 8:19 PM

#

why did you ping me

#

kekHands

mint palm Jun 5, 2022, 8:20 PM

#

Mistake sir, pardon my dust

tacit basin Jun 5, 2022, 8:20 PM

#

mint palm For example for making a model that allocate resources based on parameters, can ...

Which conditions you want to simulate? Can you explain your goal a bit?

mint palm Jun 5, 2022, 8:20 PM

#

Network slicinf

fleet musk Jun 5, 2022, 8:20 PM

#

fleet musk

@tacit basin in this pic, last 3 columns are of interest
i have column "candle" and i need to calculate mean of values of Candle 10, 15, 20 etc only if they belong to same date
there are 22 different dates
what do?

misty flint Jun 5, 2022, 8:21 PM

#

my worst nightmare

#

monkaCHRIST

mint palm Jun 5, 2022, 8:21 PM

#

We allocate embb mmtc or urllc based on speed quantity of data etc

fleet musk Jun 5, 2022, 8:21 PM

#

helo melio. rex is bullying me. halp plez

misty flint Jun 5, 2022, 8:21 PM

#

was attacked by one the other day

#

kekHands

mint palm Jun 5, 2022, 8:21 PM

#

Everything is cardinal

misty flint Jun 5, 2022, 8:22 PM

#

yeah melio, you can help stardust; idk how im bullying stardust tho Oopsies

fleet musk Jun 5, 2022, 8:22 PM

#

misty flint yeah melio, you can help stardust; idk how im bullying stardust tho <:Oopsies:79...

im kidding frend :}

tacit basin Jun 5, 2022, 8:23 PM

#

fleet musk <@490342783572246538> in this pic, last 3 columns are of interest i have column...

groupby by date and candle?

fleet musk Jun 5, 2022, 8:23 PM

#

tacit basin groupby by date and candle?

hory shitto. lemme try

misty flint Jun 5, 2022, 8:24 PM

#

groupby ftw

#

that

#

and json_normalize

#

pretty up there on my pandas fave functions

#

kekHands

mint palm Jun 5, 2022, 8:27 PM

#

Wait tell me whats the point of transfer learning

#

How do we fine tune

fleet musk Jun 5, 2022, 8:27 PM

#

is this correct?

#

ticker is df

#

dataframe

tacit basin Jun 5, 2022, 8:29 PM

#

mint palm Wait tell me whats the point of transfer learning

For example you have nn trained on imagenet 1 million images. Then you have labeled images from your domain. You fine tune (transfer learning) nn on these images

mint palm Jun 5, 2022, 8:30 PM

#

mint palm Wait tell me whats the point of transfer learning

If pretraining is done on 500 000 example how big should fine tune data be

fleet musk Jun 5, 2022, 8:30 PM

#

getting this error, this wasnt there before i added the groupby line

mint palm Jun 5, 2022, 8:30 PM

#

To learn and fit well enough

#

Deoends?

tacit basin Jun 5, 2022, 8:33 PM

#

Yep depends on how similar are ptetrained data to your domain data

tacit basin Jun 5, 2022, 8:34 PM

#

fleet musk is this correct?

df.groupby(['feat1', 'feat2']).mean()

fleet musk Jun 5, 2022, 8:34 PM

#

ok ok

#

why do i need mean?

tacit basin Jun 5, 2022, 8:35 PM

#

I though you said mean

fleet musk Jun 5, 2022, 8:36 PM

#

for example, make group by dates, so 22 groups, then i take candle number from candle column,

for that candle number, i need to take Candle "close" price from another column

tacit basin Jun 5, 2022, 8:36 PM

#

Reread again. You said mean that's why I guess

fleet musk Jun 5, 2022, 8:36 PM

#

ok. ill omit mean for now

tacit basin Jun 5, 2022, 8:37 PM

#

groupby returns groups and if you specify aggregate mathod it will calculate that on group

fleet musk Jun 5, 2022, 8:37 PM

#

ill need to read on aggregate

#

ticker.groupby("date",axis=1)

#

is this what i do, for date grouping

fleet musk Jun 5, 2022, 9:24 PM

#

groupby didnt work, not suited here
i used numpy split, now need to find a way to perform operations on split portions of each dataframe

solid urchin Jun 5, 2022, 10:17 PM

#

!dice_1

#

dice_1

pseudo wren Jun 5, 2022, 11:55 PM

#

what is a good way to account for date while working on a model

#

the date is relevant to my dataset but it is in a format the python interpreter cannot understand

#

the date is important for me to keep because i need it to record trends in this dataset, but i'm not sure what the best way of separating this data is

errant onyx Jun 6, 2022, 12:19 AM

#

Hello people, I am thinking about picking up either "An Introduction to Statistical Learning (with applications in R)" or "Hands-on Machine Learning with Scikit-Learn, Keras & TensorFlow"

#

Do you have any experience with these, which one would you recommend?

serene scaffold Jun 6, 2022, 12:28 AM

#

@errant onyx we're going to be partial to the second one, because those three things are python libraries. R is a separate language, ie not python

errant onyx Jun 6, 2022, 12:29 AM

#

I know that's the case, it's also the reason I asked the question identically in the R discord

#

But I sort of wanted to know if you guys think it was good

serene scaffold Jun 6, 2022, 12:30 AM

#

errant onyx I know that's the case, it's also the reason I asked the question identically in...

I've never read either. The book I recommend to beginners is "data science from scratch"

errant onyx Jun 6, 2022, 12:30 AM

#

I'm sort of an R person but it seems most ML things are done in Python in industry

#

That one seems good too

#

Saw it being recommended too

serene scaffold Jun 6, 2022, 12:30 AM

#

errant onyx I'm sort of an R person but it seems most ML things are done in Python in indust...

I work in the AI department of my company, and I don't know anyone who uses R. We just do everything in python

muted vector Jun 6, 2022, 12:31 AM

#

(srry to interrupt convo but how would u guys recommend starting learn AI with python?)

errant onyx Jun 6, 2022, 12:31 AM

#

serene scaffold I work in the AI department of my company, and I don't know anyone who uses R. W...

That's what I feared

#

I'm in academia so most things are done in R here

serene scaffold Jun 6, 2022, 12:31 AM

#

muted vector (srry to interrupt convo but how would u guys recommend starting learn AI with p...

See the book I just recommend a few messages ago

serene scaffold Jun 6, 2022, 12:31 AM

#

errant onyx I'm in academia so most things are done in R here

The one R user I know is a linguistics post doc

errant onyx Jun 6, 2022, 12:31 AM

#

haha, of course he/she's from academia

serene scaffold Jun 6, 2022, 12:32 AM

#

Oh, I know another. Also in academia. I could ask him for advice about how he switched to Python

muted vector Jun 6, 2022, 12:32 AM

#

this? :>

serene scaffold Jun 6, 2022, 12:32 AM

#

muted vector this? :>

!resources data science

arctic wedgeBOT Jun 6, 2022, 12:32 AM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

serene scaffold Jun 6, 2022, 12:32 AM

#

It's on there.

muted vector Jun 6, 2022, 12:33 AM

#

ty!

errant onyx Jun 6, 2022, 12:33 AM

#

I've read Python for data analysis and python crash course, and done some coding in Python in general

#

But there's a world of books/resources, such a difficult choice

serene scaffold Jun 6, 2022, 12:35 AM

#

errant onyx But there's a world of books/resources, such a difficult choice

Do you work for a university currently?

#

Or attend one?

#

They might give you an ORiley subscription. In which case you can try any book without fear of commitment

errant onyx Jun 6, 2022, 12:35 AM

#

I'm doing a phd, should ideally be finishing in 1,5 years

#

so I have that time to learn more data science basically

#

I'm not super worried about getting a book, I'll probably read it cover to cover in any case

#

Just you know, I wanna hit the sweet spot when it comes to a book

#

Not too basic, not too theoretical

#

anyway I'm gonna look up the data science for scratch book

#

thanks

serene scaffold Jun 6, 2022, 12:37 AM

#

What is your PhD in

errant onyx Jun 6, 2022, 12:37 AM

#

Medical sciences

#

so yeah, very different

serene scaffold Jun 6, 2022, 12:37 AM

#

BTW I'm at a wedding so I might disappear if someone makes me dance

#

But I don't wanna

errant onyx Jun 6, 2022, 12:38 AM

#

Yeah, I know the feeling

#

Been there

serene scaffold Jun 6, 2022, 12:38 AM

#

errant onyx Medical sciences

Are you hoping to work as a data scientist in a medical related area?

errant onyx Jun 6, 2022, 12:38 AM

#

quite a few times too

#

I think I would be able to contribute the most there, but I don't feel I should be constrained to only the medical field-

#

What I mean is that that is probably the best goal for me in a perfect world

#

but you never know the opportunities that might pop up

#

I work a bit with bioinformatics right now though

#

But I have poor knowledge of the underlying algorithms I'd say

#

I also am on week 5 of Andrew Ng's machine learnign course

#

He's gonna replace it with a new course though, kind of typical

hollow sentinel Jun 6, 2022, 12:58 AM

#

where do i find apis for data science?

serene scaffold Jun 6, 2022, 1:00 AM

#

hollow sentinel where do i find apis for data science?

Apis that do what

hollow sentinel Jun 6, 2022, 1:02 AM

#

muted vector (srry to interrupt convo but how would u guys recommend starting learn AI with p...

provide health diagnoses for patients

#

or i could do my first project with web scraping

#

oh i didn't mean to ping whoever that was

#

https://machinelearningmastery.com/start-here/ i'd recommend this guy's blog

Machine Learning Mastery

Start Here with Machine Learning

Your guide to getting started and getting good at applied machine learning with Machine Learning Mastery.

#

i find it hard to think of projects that are personally applicable to me

#

so i get frustrated when i think of projects

#

it's tough

misty flint Jun 6, 2022, 1:06 AM

#

ben rogajan introduced an API to me and im using that one for a DE project

#

DoggoKek

#

https://developer.nytimes.com/apis

hollow sentinel Jun 6, 2022, 1:07 AM

#

i think fitness might be an idea

misty flint Jun 6, 2022, 1:08 AM

#

Is there an API call limit?
Yes, there are two rate limits per API: 4,000 requests per day and 10 requests per minute. You should sleep 6 seconds between calls to avoid hitting the per minute rate limit. If you need a higher rate limit, please contact us at code@nytimes.com.

#

up to you. as long as you find the problem interesting, youre more likely to finish it

#

Oopsies

hollow sentinel Jun 6, 2022, 1:09 AM

#

https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx?BeginYear=2021

#

ah yes

#

data i cannot access

#

we love

misty flint Jun 6, 2022, 1:10 AM

#

errant onyx I'm sort of an R person but it seems most ML things are done in Python in indust...

i have a DS interview tomorrow. that company uses R + Microsoft tooling

#

lots of peeps in the bioinformatics/pharmaceutical space use R. CDC exclusively uses R as well

#

im still biased towards python tho since if youre going to deploy models, its going to be in python

#

sdk's for R are very uncommon

misty flint Jun 6, 2022, 1:18 AM

#

hollow sentinel we love

hahaha rip. i forgot SAS is also what they use

#

absolutely tragic

#

🕯️

misty flint Jun 6, 2022, 1:19 AM

#

serene scaffold BTW I'm at a wedding so I might disappear if someone makes me dance

dance with me stel

#

dumb_dance

muted vector Jun 6, 2022, 1:30 AM

#

is matplotlib a good thing to plot stuf with?

royal crest Jun 6, 2022, 1:34 AM

#

Yes

#

it's the standard

lapis sequoia Jun 6, 2022, 1:35 AM

#

misty flint <a:dumb_dance:804726812785115166>

https://c.tenor.com/xJ_mJ01nxmUAAAAM/yay-yes-yes-yes.gif

hollow sentinel Jun 6, 2022, 1:43 AM

#

that and seaborn

#

seaborn is nice

misty flint Jun 6, 2022, 1:45 AM

#

p l o t l y

#

RunFail

royal crest Jun 6, 2022, 2:11 AM

#

no one talks about pyCairo

#

;-;

misty flint Jun 6, 2022, 3:06 AM

#

royal crest no one talks about pyCairo

if i cant switch out the default matplotlib backend for it, i def have never heard of it

#

kekHands

serene scaffold Jun 6, 2022, 3:07 AM

#

@errant onyx as I was going to say earlier, if you get a PhD in something that isn't data science in itself, but you can also do data science, I would say that puts you in a good position. Also my cousin's wife is very angry at me for refusing to dance with her.

misty flint Jun 6, 2022, 3:08 AM

#

misty flint if i cant switch out the default matplotlib backend for it, i def have never hea...

oh wait jk

#

cairo is an option

misty flint Jun 6, 2022, 3:09 AM

#

serene scaffold <@182849136960208897> as I was going to say earlier, if you get a PhD in somethi...

why did you refuse

royal crest Jun 6, 2022, 3:09 AM

#

serene scaffold <@182849136960208897> as I was going to say earlier, if you get a PhD in somethi...

PhD in something that isn't data science in itself, but you can also do data science
that's me, and it really is valuable

misty flint Jun 6, 2022, 3:09 AM

#

weddings make you do mandatory things unless you hide

#

kekHands

#

ZoomEyes

#

oh yeah?

misty flint Jun 6, 2022, 4:12 AM

#

#

blobhyperthink

#

image_41a21cd1-0e70-481e-a9cf-e781efe9ab3220220605_231201.jpg

#

blobpoll

worldly dawn Jun 6, 2022, 4:16 AM

#

misty flint

that's the same for pretty much every type of engineer

#

Being able to get shit done, but being an expert in 1 or 2 areas

lapis sequoia Jun 6, 2022, 5:41 AM

#

I am a dot

plush jungle Jun 6, 2022, 5:48 AM

#

i'm trying to run stylegan2 ada
https://github.com/johndpope/stylegan2-ada
but I keep getting this error
RuntimeError: Could not find MSVC/GCC/CLANG installation on this computer. Check compiler_bindir_search_path list in "C:\python\stylegan2-ada-main\stylegan2-ada-main\dnnlib\tflib\custom_ops.py".

#

the file it's talking about has this code

def _prepare_nvcc_cli(opts):
    cmd = 'nvcc ' + opts.strip()
    cmd += ' --disable-warnings'
    cmd += ' --include-path "%s"' % tf.sysconfig.get_include()
    cmd += ' --include-path "%s"' % os.path.join(tf.sysconfig.get_include(), 'external', 'protobuf_archive', 'src')
    cmd += ' --include-path "%s"' % os.path.join(tf.sysconfig.get_include(), 'external', 'com_google_absl')
    cmd += ' --include-path "%s"' % os.path.join(tf.sysconfig.get_include(), 'external', 'eigen_archive')

    compiler_bindir = _find_compiler_bindir()
    if compiler_bindir is None:
        # Require that _find_compiler_bindir succeeds on Windows.  Allow
        # nvcc to use whatever is the default on Linux.
        if os.name == 'nt':
            raise RuntimeError('Could not find MSVC/GCC/CLANG installation on this computer. Check compiler_bindir_search_path list in "%s".' % __file__)
    else:
        cmd += ' --compiler-bindir "%s"' % compiler_bindir
    cmd += ' 2>&1'
    return cmd```

worldly dawn Jun 6, 2022, 5:49 AM

#

plush jungle the file it's talking about has this code ```py def _prepare_nvcc_cli(opts): ...

is nvcc installed?

plush jungle Jun 6, 2022, 5:50 AM

#

msvc is installed, I don't know about nvcc

#

is that part of cuda?

#

cause I already installed cuda-toolkit

worldly dawn Jun 6, 2022, 5:51 AM

#

plush jungle cause I already installed cuda-toolkit

sounds like it's trying to call nvcc

plush jungle Jun 6, 2022, 5:51 AM

#

when I google "download nvcc" it just directs me to download cuda-toolkit

worldly dawn Jun 6, 2022, 5:52 AM

#

is it in your path?

#

I would recommend to dig into the content of _find_compiler_bindir() and see what is it looking for

plush jungle Jun 6, 2022, 5:53 AM

#

yeah I looked into that actually

#

there are actually two versions from two different github forks

#

patterns = [
'C:/Program Files (x86)/Microsoft Visual Studio//Professional/VC/Tools/MSVC//bin/Hostx64/x64',
'C:/Program Files (x86)/Microsoft Visual Studio//BuildTools/VC/Tools/MSVC//bin/Hostx64/x64',
'C:/Program Files (x86)/Microsoft Visual Studio//Community/VC/Tools/MSVC//bin/Hostx64/x64',
'C:/Program Files (x86)/Microsoft Visual Studio */vc/bin',
'C:/Program Files/Microsoft Visual Studio/2022/Community/VC/Auxiliary/Build/vcvars64.bat',
]
def _find_compiler_bindir():
    for compiler_path in patterns:
        if os.path.isdir(compiler_path):
            return compiler_path
    return None```

#

this is one

#

this is the other

#

def _find_compiler_bindir():
    hostx64_paths = sorted(glob.glob('C:/Program Files (x86)/Microsoft Visual Studio/*/Professional/VC/Tools/MSVC/*/bin/Hostx64/x64'), reverse=True)
    if hostx64_paths != []:
        return hostx64_paths[0]
    hostx64_paths = sorted(glob.glob('C:/Program Files (x86)/Microsoft Visual Studio/*/BuildTools/VC/Tools/MSVC/*/bin/Hostx64/x64'), reverse=True)
    if hostx64_paths != []:
        return hostx64_paths[0]
    hostx64_paths = sorted(glob.glob('C:/Program Files (x86)/Microsoft Visual Studio/*/Community/VC/Tools/MSVC/*/bin/Hostx64/x64'), reverse=True)
    if hostx64_paths != []:
        return hostx64_paths[0]
    vc_bin_dir = 'C:/Program Files (x86)/Microsoft Visual Studio 14.0/vc/bin'
    if os.path.isdir(vc_bin_dir):
        return vc_bin_dir
    return None```

worldly dawn Jun 6, 2022, 5:54 AM

#

oh bo

#

y

plush jungle Jun 6, 2022, 5:54 AM

#

so I figured out that it's looking for the c complier in visual studio

worldly dawn Jun 6, 2022, 5:55 AM

#

do any of these directories exist for you?

plush jungle Jun 6, 2022, 5:55 AM

#

no. instead my MSVC is located here

C:/Program Files/Microsoft Visual Studio/2022/Community/VC/Tools/MSVC/14.32.31326/bin/Hostx64\x64```

#

so i did this

#

def _find_compiler_bindir():
    return 'C:/Program Files/Microsoft Visual Studio/2022/Community/VC/Tools/MSVC/14.32.31326/bin/Hostx64\x64'
    for compiler_path in patterns:
        if os.path.isdir(compiler_path):
            return compiler_path
    return None```

#

and I got this

#

RuntimeError: NVCC returned an error. See below for full command line and output log:

nvcc "C:\Users\Alex\AppData\Local\Programs\Python\Python310\lib\site-packages\tensorflow\python\_pywrap_tensorflow_internal.lib" --gpu-architecture=sm_86 --use_fast_math --disable-warnings --include-path "C:\Users\Alex\AppData\Local\Programs\Python\Python310\lib\site-packages\tensorflow\include" --include-path "C:\Users\Alex\AppData\Local\Programs\Python\Python310\lib\site-packages\tensorflow\include\external\protobuf_archive\src" --include-path "C:\Users\Alex\AppData\Local\Programs\Python\Python310\lib\site-packages\tensorflow\include\external\com_google_absl" --include-path "C:\Users\Alex\AppData\Local\Programs\Python\Python310\lib\site-packages\tensorflow\include\external\eigen_archive" --compiler-bindir "C:/Program Files/Microsoft Visual Studio/2022/Community/VC/Tools/MSVC/14.32.31326/bin/Hostx64d" 2>&1 "C:\python\stylegan2-ada-main\stylegan2-ada-main\dnnlib\tflib\ops\fused_bias_act.cu" --shared -o "C:\Users\Alex\AppData\Local\Temp\tmp2gk5m51p\fused_bias_act_tmp.dll" --keep --keep-dir "C:\Users\Alex\AppData\Local\Temp\tmp2gk5m51p"

'nvcc' is not recognized as an internal or external command,
operable program or batch file.```

#

so my current working theory is that nvcc is installed with cuda-toolkit but it's not in my path

worldly dawn Jun 6, 2022, 5:57 AM

#

yeah, sounds like it can't find nvcc

plush jungle Jun 6, 2022, 5:57 AM

#

these are in my path

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\bin
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\libnvvp```

#

but nothing about nvcc

worldly dawn Jun 6, 2022, 5:58 AM

#

is nvcc in either of these directories?

plush jungle Jun 6, 2022, 5:58 AM

#

let me check

#

nvcc is in the first one

#

it's an exe

worldly dawn Jun 6, 2022, 5:59 AM

#

ok then that's weird

plush jungle Jun 6, 2022, 5:59 AM

#

is it possible that by doing

def _find_compiler_bindir():
    return 'C:/Program Files/Microsoft Visual Studio/2022/Community/VC/Tools/MSVC/14.32.31326/bin/Hostx64/x64'

i've given it a bad path somehow?

#

like that it can find nvcc but it's thrown off by the path i'm giving it?

worldly dawn Jun 6, 2022, 6:00 AM

#

then your assumption would be that the actual error does not match the error message

plush jungle Jun 6, 2022, 6:00 AM

#

yeah

worldly dawn Jun 6, 2022, 6:00 AM

#

which is fair, but would have to be proven

#

you should be able to see how nvcc is called exactly and either see what is being returned or being able to call the same thing manually yourself

plush jungle Jun 6, 2022, 6:02 AM

#

oh yeah

#

yeah when i type it into terminal it says the same thing

worldly dawn Jun 6, 2022, 6:03 AM

#

what if you type just "nvcc" ?

plush jungle Jun 6, 2022, 6:03 AM

#

that's what I did

worldly dawn Jun 6, 2022, 6:03 AM

#

ok, I haven't used windows in years. But do the .exe matter at the end? like in nvcc vs nvcc.exe ?

plush jungle Jun 6, 2022, 6:04 AM

#

no, typically you don't put the .exe on the end

worldly dawn Jun 6, 2022, 6:04 AM

#

ok, then something is wrong with your path or installation

#

you should at the very least get an nvcc error

#

not a system error about the executable

#

and the fact that just calling nvcc without arguments give you such error does mean that it's not about your compiler argument

plush jungle Jun 6, 2022, 6:06 AM

#

it's gotta be the path. someone on stackoverflow had the same issue in 2017

#


/Developer/NVIDIA/CUDA8.0.61/bin
As indicated in the install guide, the correct path is:

/Developer/NVIDIA/CUDA-8.0.61/bin
                      ^```

#

but that's not what my path looks like in the year of our lord 2022

#

mine looks like this

#

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\bin

worldly dawn Jun 6, 2022, 6:07 AM

#

then get your path out of the matrix into the current year

cursive walrus Jun 6, 2022, 6:13 AM

#

hi guys i am new to python programming i am having this trouble i have this code that detect plant and i am getting this error: IndexError: tuple index out of range please help me

worldly dawn Jun 6, 2022, 6:20 AM

#

cursive walrus hi guys i am new to python programming i am having this trouble i have this code...

something is trying to reach something out of range

cursive walrus Jun 6, 2022, 6:20 AM

#

worldly dawn something is trying to reach something out of range

i can send you the code can you look at it

worldly dawn Jun 6, 2022, 6:21 AM

#

cursive walrus i can send you the code can you look at it

it's getting late here and I don't do DMs. Better to paste it here

cursive walrus Jun 6, 2022, 6:21 AM

#

ok

#

import cv2
import os
#Cascade
cascade = cv2.CascadeClassifier('./golden_pothos_cascade.xml')
#Reading Image
capture = cv2.VideoCapture(0)
while True:
success, img =capture.read()
#Converting to Gray Image
gray_Image = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
#Adding Gaussian Blur
blur=cv2.GaussianBlur(gray_Image,(13,13),cv2.BORDER_DEFAULT)
#Detecting Plant
detection_result, rejectLevels, levelWeights =cascade.detectMultiScale3(blur, scaleFactor=1.0485258, minNeighbors=6, minSize=(30,30),outputRejectLevels = 1)
greaterweightindex = 0
currentweight = levelWeights[0]
#Area with Heighest Confidence
for (weight) in levelWeights:
if weight > currentweight:
greaterweightindex = greaterweightindex+1
currentweight = weight
#Highest Confidence Area
x = detection_result[greaterweightindex][0]
y = detection_result[greaterweightindex][1]
w = detection_result[greaterweightindex][2]
h = detection_result[greaterweightindex][3]
#Modifying Cofidence
confidence= round(currentweight[0], 2)
finalconfidence= confidence * 100
#Drawing Rectangle
cv2.rectangle(img,(x,y), (x+w, y+h), (0,0,255), thickness=2)
cv2.rectangle(img,(x,y-35), (x+w, y), (0,0,255), thickness=-1)
#Adding Text
cv2.putText(img, str(f"Golden Pothos {finalconfidence}%"), (x,y-5), cv2.FONT_HERSHEY_COMPLEX, 0.6, (255,255,255), thickness=2)
#Displaying Image
cv2.imshow("Detected Plant",img)
#Adding Wait
if cv2.waitKey(1) == 13:
break
cv2.waitKey(1)

#

the error is at currentweight = levelWeights[0]

plush jungle Jun 6, 2022, 6:27 AM

#

cursive walrus > import cv2 > import os > #Cascade > cascade = cv2.CascadeClassifier('./golden_...

my theory is that it's not detecting anything, so it's returning an empty tuple or something

#

and that's why it's out of range

#

run the code again but before the line that throws the error put

print(levelWeights)```

#

@cursive walrus

cursive walrus Jun 6, 2022, 6:33 AM

#

plush jungle my theory is that it's not detecting anything, so it's returning an empty tuple ...

#

this is what i am getting

plush jungle Jun 6, 2022, 6:33 AM

#

yep, it's as I expected, an empty tuple

#

ok try this

#

greaterweightindex = 0
if not levelWeights:
    continue
currentweight = levelWeights[0]```

cursive walrus Jun 6, 2022, 6:39 AM

#

plush jungle yep, it's as I expected, an empty tuple

now i am getting this error

plush jungle Jun 6, 2022, 6:40 AM

#

do this and tell me what it prints

greaterweightindex = 0
if not levelWeights:
    continue
print(levelWeights)
currentweight = levelWeights[0]```

cursive walrus Jun 6, 2022, 6:44 AM

#

plush jungle do this and tell me what it prints ```py greaterweightindex = 0 if not levelWeig...

[-1.06755358]

plush jungle Jun 6, 2022, 6:45 AM

#

who wrote this code?

#

cause this looks like a mistake

confidence= round(currentweight[0], 2)```

cursive walrus Jun 6, 2022, 6:47 AM

#

plush jungle who wrote this code?

i took it from github

plush jungle Jun 6, 2022, 6:47 AM

#

currentweight isn't a list or a tuple, so of course this will throw an error

#

what happens if you do this

#

confidence= round(currentweight, 2)```

cursive walrus Jun 6, 2022, 6:49 AM

#

plush jungle who wrote this code?

omg it worked thanks man you helped me a lot.

#

thank you

plush jungle Jun 6, 2022, 6:50 AM

#

I am the duck

#

hey @worldly dawn how did you get to be a helper?

#

do you have to defeat one in single combat?

worldly dawn Jun 6, 2022, 7:06 AM

#

plush jungle do you have to defeat one in single combat?

it does involve some intense training

plush jungle Jun 6, 2022, 7:07 AM

#

walk uphill both ways through the snow in the heat of summer while row reducing a matrix?

fierce pine Jun 6, 2022, 8:09 AM

#

@plush jungle hello

plush jungle Jun 6, 2022, 8:09 AM

#

yo

fierce pine Jun 6, 2022, 8:15 AM

#

plush jungle yo

I am having a data set but it is in txt file. Idk how to load it and i want to do it using linear regression..

#

Also sorry for pinging you like this

plush jungle Jun 6, 2022, 8:16 AM

#

the issue is just that it's in a txt file?

fierce pine Jun 6, 2022, 8:18 AM

#

plush jungle the issue is just that it's in a txt file?

Yes but the data is also not properly arranged.

plush jungle Jun 6, 2022, 8:18 AM

#

what type of data is it

#

and how is it supposed to be arranged

errant onyx Jun 6, 2022, 8:19 AM

#

serene scaffold <@182849136960208897> as I was going to say earlier, if you get a PhD in somethi...

Thanks for the response. 🙂 Appreciate it

arctic wedgeBOT Jun 6, 2022, 8:21 AM

#

Hey @fierce pine!

You either uploaded a .txt file or entered a message that was too long. Please use our paste bin instead.

plush jungle Jun 6, 2022, 8:21 AM

#

!past

arctic wedgeBOT Jun 6, 2022, 8:21 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

fierce pine Jun 6, 2022, 8:27 AM

#

https://paste.pythondiscord.com/ivumiruhaz

#

Its half the data

wooden sail Jun 6, 2022, 8:29 AM

#

looks like you could read the file line by line, split the strings based on the spaces, and pick the columns you're interested in afterwards

#

or make a pandas dataframe and ask it for a column

plush jungle Jun 6, 2022, 8:31 AM

#

yeah or you could use regex

wooden sail Jun 6, 2022, 8:32 AM

#

also, regression is kind of a loose term. do you mean fit a first order polynomial to a sequence of data? fit a general curve to a sequence of data? have the input be vector-valued?

fierce pine Jun 6, 2022, 8:37 AM

#

wooden sail also, regression is kind of a loose term. do you mean fit a first order polynomi...

Idk much i am at initial stage. What data should i predict using multiple regression? Any suggestions?

wooden sail Jun 6, 2022, 8:38 AM

#

no clue 😛

#

if you've never done any of this before, i'd see if you can predict a future value in one of the columns given the past values

#

plot the data of the column first to see if it has some behavior you can recognize, pick a model function based on that, and fit its parameters

fierce pine Jun 6, 2022, 8:47 AM

#

wooden sail if you've never done any of this before, i'd see if you can predict a future val...

Oh ohkkk, so in this data which column's value should i predict? I am thinking to do it using multiple variable linear regression model from pandas, but what shld i predict as per ur opinion?

wooden sail Jun 6, 2022, 9:03 AM

#

you could try to predict a full row

#

lemme get back home and tex somwthing up

#

lol

fierce pine Jun 6, 2022, 9:05 AM

#

wooden sail you could try to predict a full row

Yessss pleasee and thankss

wooden sail Jun 6, 2022, 9:16 AM

#

aight

#

we're gonna look at a short time window linear predictor

#

our assumption is that, over a reasonable small time period, the data behaves like a straight line. pretty much a loose form of taylor's theorem

#

so we wanna set up a model that captures this and learn its parameters

#

we first recall that a linear equation looks as follows (gonna write instead of tex in the end)

#

#

which we've conveniently written in matrix form on the right. notice we have 2 unknowns, m and b, because we observe y and x in the data

#

we need at least as many observations of y and x as the number of parameters we want to find

#

now, in your case, we don't just have one value x, but several measurements (temperatures and other stuff). and we want to use old data to predict those quantities, so we also don't have just one value of y

#

#

which we can all arrange into a single matrix vector equation

#

#

that's for a single row of data. but we need several rows to compute all of the parameters in M (n^2 + n of them). that means we need at least n different columns in x and y

#

and the whole point of this is: those columns are the rows of data in your file

wooden sail Jun 6, 2022, 9:48 AM

#

#

the matrix M you get from this is a linear predictor of y

#

in particular, a predictor that only looks at the previous row of data. you can change this by changing the shape of M and giving X a block toeplitz structure

violet gull Jun 6, 2022, 9:55 AM

#

Import "tensorflow.keras.optimizers" could not be resolved

#

help

#

yes tensorflow is installed

#

ive tried both 2.8 and 2.9

#

2.7 apparently is non existent

#

ping me with response because this chat is so dead id get bored staring at it

hasty grail Jun 6, 2022, 10:09 AM

#

violet gull ping me with response because this chat is so dead id get bored staring at it

you may need to import the base library tensorflow first

violet gull Jun 6, 2022, 10:10 AM

#

violet gull Jun 6, 2022, 10:11 AM

#

hasty grail you may need to import the base library `tensorflow` first

no

hasty grail Jun 6, 2022, 10:12 AM

#

hmm no idea then

#

maybe try importing keras first

violet gull Jun 6, 2022, 10:16 AM

#

hasty grail maybe try importing keras first

nop :C

hasty grail Jun 6, 2022, 10:16 AM

#

did keras successfully import?

#

if not then you should install keras

violet gull Jun 6, 2022, 10:17 AM

#

hasty grail did keras successfully import?

yes

hasty grail Jun 6, 2022, 10:18 AM

#

weird

violet gull Jun 6, 2022, 10:19 AM

#

very sadge

#

tensorflow cringe

hasty grail Jun 6, 2022, 10:21 AM

#

try reinstalling it maybe

#

are you using conda?

violet gull Jun 6, 2022, 10:23 AM

#

hasty grail try reinstalling it maybe

already did

violet gull Jun 6, 2022, 10:23 AM

#

hasty grail are you using conda?

no

hasty grail Jun 6, 2022, 10:25 AM

#

... or venv

#

Chances are, setting up a clean environment would resolve installation issues

violet gull Jun 6, 2022, 10:25 AM

#

idk how to make a venv right

#

and the tutorials are bad

#

the one i made was on wrong version of python

wooden sail Jun 6, 2022, 10:42 AM

#

violet gull no

one simple workaround is not not import the optimizers like that and call them by the full name when you need them

violet gull Jun 6, 2022, 10:43 AM

#

i dont need workaround i need the intended way to work like its suppose to

#

and if the normal imports wont work then those wont work either

wooden sail Jun 6, 2022, 10:45 AM

#

can you at least try? many people on google complain they get the same error you do, but it still works when importing tf and keras, and then calling keras.optimizers

violet gull Jun 6, 2022, 10:45 AM

#

can u give example of what u mean

wooden sail Jun 6, 2022, 10:47 AM

#

import tensorflow as tf
optim = tf.keras.optimizers.Adam()

like so

violet gull Jun 6, 2022, 10:47 AM

#

i think that worked

wooden sail Jun 6, 2022, 10:47 AM

#

other than that, people suggest to use tensorflow.python.keras.etc , with that extra python in the name

#

well if that works, that's good enough. seems to be an IDE problem

violet gull Jun 6, 2022, 10:48 AM

#

ok ty

violet gull Jun 6, 2022, 12:01 PM

#

i switched to a jupyter note book and the tensor imports are still broken

#

the devs of tensorflow deserve a cactus up their bum

#

and jupyter deserves cactus up bum for giving useless error messages

arctic wedgeBOT Jun 6, 2022, 12:03 PM

#

Hey @violet gull!

You either uploaded a .txt file or entered a message that was too long. Please use our paste bin instead.

violet gull Jun 6, 2022, 12:04 PM

#

https://paste.pythondiscord.com/tiqoyimevo this is the error message for the first "fix" from tensorflow.python.keras.model import Sequential

#

https://paste.pythondiscord.com/isiladubik heres one from just trying to install tensorflow

#

someone save me from this cringeness i just want to do coding and tensor flow makes me want to commit hate crimes its so terrible

somber burrow Jun 6, 2022, 12:16 PM

#

Hi guys, i have a problem wit exporting a .txt file on .csv using pandas, and writed in columns, can someone help me ?

serene scaffold Jun 6, 2022, 12:23 PM

#

@somber burrow try explaining what the problem is

#

Don't "ask to ask"

somber burrow Jun 6, 2022, 12:34 PM

#

#

i have a problem, i tried to read a .txt file and exported to .csv and separating the lines using a delimiter by colums using categories.

file.txt is like this

[groups]
admins = user1,user2,user3
users_network = user4,user5
users_m4s = user6,user7,user8,user9
and the .csv file should be

groups

user1 = admins
user2 = admins
user3 = admins
user4 = users_network
user5 = users_network
user6 = users_m4s ... for the rest of element of category line

#

``import pandas as pd
import numpy as np

df = pd.read_table("D:\GIT-files\Automate-Stats\SVN_sample_files\sample_svn_input.txt" , sep='=',engine='python')
print(df)

df.to_csv("D:\GIT-files\Automate-Stats\SVN_sample_files\sample_svn_input_update.csv" , index=None)

df = pd.read_table ("D:\GIT-files\Automate-Stats\SVN_sample_files\sample_svn_input_update.csv" , sep='=',engine='python')
print(df)``

#

but its not displaying and exporting right

#

practicaly the lines form the txt files , on the left of " = " its the group and after its the elements of that group

#

i want to display for each element the group separatly

serene scaffold Jun 6, 2022, 12:52 PM

#

@somber burrow

[groups] 
admins = user1,user2,user3 
users_network = user4,user5 
users_m4s = user6,user7,user8,user9

this is not a csv. csv is strictly comma-separated values on individual lines. you would need a more sophisticated parser for this.

#

you might need to write your own regular expression

cinder schooner Jun 6, 2022, 1:06 PM

#

hello, I'm a software engineer and I have been trying to specialize in AI for a year now. I was used when I was into software to preparing for interviews at big tech by preparing coding interviews and system design interviews. There's plenty of ressources about that on the internet. But now that i'm into AI i've been wondering what do I need to prepare in order to do great at interviews for Machine learning or AI positions? Are coding problems still relevant? how to prepare for system design for AI? what do big tech ask for this kind of positions? Thank you for your answers, i'm really grateful for being part of this discord community.

serene scaffold Jun 6, 2022, 1:13 PM

#

cinder schooner hello, I'm a software engineer and I have been trying to specialize in AI for a ...

does your current company have AI-related positions, and would they support you in making a lateral move to that? because that's going to be the easiest way. also, for how long have you been a SWE?

when I interviewed for AI positions, I presented on research I had done for my university.

cinder schooner Jun 6, 2022, 1:14 PM

#

I'm sorry if I didn't explain as I should I have a software engineering degree and then got to a masters degree in data science. I'm not currently working in software. @serene scaffold

#

and I did work as a software engineer for like a year and a half but they were all part time jobs

#

I'm also working on a lot of personal projects in AI etc but I'm really trying to know if it make sense to get back at preparing coding interviews and if not what to prepare

serene scaffold Jun 6, 2022, 1:19 PM

#

I only have experience with interviews for career starters, so I should let someone else comment. but I would at least be prepared to talk about anything you worked on during your masters. did you publish?

gilded flame Jun 6, 2022, 1:40 PM

#

What caused this to skip to the next column index?

serene scaffold Jun 6, 2022, 1:41 PM

#

also @cinder schooner try asking in #career-advice as well

#

I thought that was where we were lemon_sweat

serene scaffold Jun 6, 2022, 1:41 PM

#

gilded flame What caused this to skip to the next column index?

show code

gilded flame Jun 6, 2022, 1:42 PM

#

serene scaffold show code

uploading

serene scaffold Jun 6, 2022, 1:42 PM

#

!code

arctic wedgeBOT Jun 6, 2022, 1:43 PM

#

Hey @gilded flame!

You either uploaded a .txt file or entered a message that was too long. Please use our paste bin instead.

gilded flame Jun 6, 2022, 1:44 PM

#

serene scaffold !code

https://paste.pythondiscord.com/eyofufijid

serene scaffold Jun 6, 2022, 1:44 PM

#

gilded flame https://paste.pythondiscord.com/eyofufijid

I don't have time to dive into this, but hopefully someone can help.

gilded flame Jun 6, 2022, 1:46 PM

#


                    
                    cursor = cnx.cursor()
                    cursor.execute(QUERY)
                    df = pd.DataFrame(cursor.fetchall())
                    

                    if alldf is not None:
                       if not df.empty:
                           alldf = pd.concat([alldf,df],axis=0)
                    else:
                        alldf = df
                 
                
                    print(df)
                    field_names = [ i[0] for i in  cursor.description]
                    print(field_names)
                        
                    xlswriter = pd.ExcelWriter('{}/{}.xls'.format(type,loc),engine='openpyxl')


                    if not df.empty:
                        df.columns = field_names  
                      
                        df.to_excel(xlswriter,index=false)

                        xlswriter.save()
                    else:
                        cnx.close()```

#

def saveToExcel(query,filename):

    xlswriter = pd.ExcelWriter("%s.xls"%(filename),engine='openpyxl')
    queryDatas = executor(query)
    print(queryDatas)
    export = queryDatas
    export.to_excel(xlswriter)
    xlswriter.save()


    print("succes savetoExcel")```

#

using pandas.concat([],axis=0) to stack the dataframes vertically but won't stack vertically?

fierce pine Jun 6, 2022, 3:00 PM

#

wooden sail in particular, a predictor that only looks at the previous row of data. you can ...

So which of my column is dependent dataset and which ones are independent? Can u tell by looking at the data i sent please

wooden sail Jun 6, 2022, 3:01 PM

#

the way i wrote it, all columns are both dependent and independent 😛 since the idea is to take a full row (data from all columns) and use it to try to predict the next full row of data. anything with numeric values, let's say

fierce pine Jun 6, 2022, 3:24 PM

#

wooden sail the way i wrote it, all columns are both dependent and independent 😛 since the ...

And it will predict next row using what? Date? Time? Precipitation?

wooden sail Jun 6, 2022, 3:25 PM

#

all of it

#

it will use all the previous rows of data to predict the next row of data, as long as you can convert the data to numerical values in some way

#

i would say that, since the sensor data is gathered at a regular interval, you can ignore the date and time

vital lodge Jun 6, 2022, 3:36 PM

#

hi

#

does anyone know sort of video classification

#

like using audio and image features for classification

hollow sentinel Jun 6, 2022, 4:00 PM

#

sounds like deep learning

#

CNN/RNN

serene scaffold Jun 6, 2022, 4:13 PM

#

vital lodge does anyone know sort of video classification

what videos into what classes?

hollow sentinel Jun 6, 2022, 5:00 PM

#

has anyone used the mysql workbench with a mac

#

i was thinking of doing some kind of exploratory data analysis project

#

with power BI

#

honestly why do that when python exists

normal moth Jun 6, 2022, 5:15 PM

#

Hello guys, so I am trying to learn Data Science from ground up
I have fairly decent amount of exposure to Python but don't know anything related to Data Science.
Are there any good sources, courses and/or YT channels which I can refer to for learning about Data Science

#

If anyone could help I would be grateful!

serene scaffold Jun 6, 2022, 5:21 PM

#

normal moth Hello guys, so I am trying to learn Data Science from ground up I have fairly de...

!resources data science

arctic wedgeBOT Jun 6, 2022, 5:21 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

normal moth Jun 6, 2022, 5:26 PM

#

serene scaffold !resources data science

Thank you!

hallow panther Jun 6, 2022, 6:29 PM

#

adventures in overfitting

wooden sail Jun 6, 2022, 7:15 PM

#

dunno if i'd call that overfitting

tacit basin Jun 6, 2022, 7:32 PM

#

wooden sail dunno if i'd call that overfitting

How would you call it?

wooden sail Jun 6, 2022, 7:33 PM

#

that looks like underfitting instead, since it's not close to describing the data, let alone the noisy data

#

the model hasn't been trained enough or cannot represent the data correctly

tacit basin Jun 6, 2022, 7:35 PM

#

With training train gets better and valid worse. That's a definition of overfilling, isn't it?

wooden sail Jun 6, 2022, 7:35 PM

#

ah wait, what is the plot showing

#

since the axes are not labelled, i assumed this was data and predictions

#

is it the loss?

#

if so, then yes

arctic quail Jun 6, 2022, 7:41 PM

#

hello 🙂

#

quick question concerning the approx_fprime function

https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.approx_fprime.html

#

if i have for instance a function with 3x parameters, which i want to approxmitate, how can i pass these 3x parameters into the approx_fprime function ?

untold smelt Jun 6, 2022, 10:27 PM

#

anyone would be willing to help me with a basic quiz in AI?

plush jungle Jun 6, 2022, 10:59 PM

#

torch.cuda.is_available()```
is always returning false

#

the internet says to upgrade your nvidia drivers, so I did that and it's still happening

subtle grotto Jun 7, 2022, 12:09 AM

#

Hi I have a question about one of the debugging exercises. In the Arguments, Paramaters, and Debugging section, 7. Debugging Functions - 1st screenshot

this is telling me there is a problem on line 21, but the actual problem is up on line 13- 2nd screenshot

Can anyone please explain to me how this debugging error would point me to find the “correct” error? Thanks.

#

long locust Jun 7, 2022, 1:08 AM

#

subtle grotto Hi I have a question about one of the debugging exercises. In the Arguments, Par...

Always read tracebacks from the bottom up, and pay attention to the ^, does it point out anything that might be missing?

subtle grotto Jun 7, 2022, 1:10 AM

#

there were two issues with the code...line 13 is missing the ":", and down in the 'def mean' section...'sm_list/len_list' - supposed to be sum_list.

oak olive Jun 7, 2022, 1:11 AM

#

Hi!

brave sand Jun 7, 2022, 1:11 AM

#

does anyone have any experience with shapley values?

oak olive Jun 7, 2022, 1:11 AM

#

Is this a good enough binarization?

#

#

#

May a OCR recognise the number?

topaz prairie Jun 7, 2022, 1:47 AM

#

In tensorflow, which metric tracks how confident a categorical CV model is with it's predictions while training? Similar to accuracy, but I'm trying to see an average of how confident my model is with it's predictions.
I'm basically looking for the mean confidence I guess?
What's the metric called for something like this? I'm using softmax activation on my output layer, if that matters.

weary ridge Jun 7, 2022, 2:26 AM

#

is there any seperate servers for image processing in python?

wooden sail Jun 7, 2022, 3:45 AM

#

topaz prairie In tensorflow, which metric tracks how confident a categorical CV model is with ...

any vector p-norm with p >= 1 will do this. the larger the value of the norm, the more confidence the model has. the degenerate case is the infinity norm, which just takes the largest value of the vector. note that this tells you nothing about whether the predictions are correct 😛 if you set p = 1, the output will always be 1 though, thanks to how softmax works. so pick p >= 2

topaz prairie Jun 7, 2022, 3:48 AM

#

I'll be blunt. I have no idea what you just said.

wooden sail Jun 7, 2022, 3:54 AM

#

what i'm saying is "that's a bad metric if you use it alone" and "use mean squared error between the output of the softmax and a vector of zeros" (this second one is why the metric is bad)

topaz prairie Jun 7, 2022, 3:55 AM

#

"that's a bad metric if you use it alone"
I agree, that's not the intent though. Just learning, to be honest.
use mean squared error
👍

I understand what you said about p-norms also, I took stats ^^ Thanks for the assistance.

wooden sail Jun 7, 2022, 3:56 AM

#

oh, what was it you didn't understand then?

topaz prairie Jun 7, 2022, 3:57 AM

#

Which statistic metric to use. You clarified with "use mean squared error."

#

I understand most concepts, but I'm very poor with names (also reflects in human names, and just names in general).

#

So just takes me a bit to remember which thing is which lol

wooden sail Jun 7, 2022, 3:58 AM

#

all right. MSE is the p norm with p = 2 between two vectors. since all you want is to study the prediction vector, it's the same as MSE between the softmax output and a vector of zeros. you'd wanna maximize it.

topaz prairie Jun 7, 2022, 3:58 AM

#

Ok got it, thanks.

wooden sail Jun 7, 2022, 3:59 AM

#

if you don't need it to be differentiable because you won't optimize with respect to this, all you need is to look at the maximum element in the softmax output. the closer this is to 1, the better

weary ridge Jun 7, 2022, 4:58 AM

#

can someone suggest me sources on where i can read about text recognition from an image

#

online sources d be highly useful

plush jungle Jun 7, 2022, 5:00 AM

#

what are you looking to learn about, just how it works?

plush jungle Jun 7, 2022, 5:01 AM

#

weary ridge can someone suggest me sources on where i can read about text recognition from a...

or are you looking for sources on how to do it in python?

weary ridge Jun 7, 2022, 5:01 AM

#

yes

#

like using pytesseract and opencv

#

i have some use cases but i dont know how to implement them using codes

#

so i wanna learn about it

plush jungle Jun 7, 2022, 5:02 AM

#

https://nanonets.com/blog/ocr-with-tesseract/

weary ridge Jun 7, 2022, 5:15 AM

#

how to find a given sentence in the inputted image?

#

"you are good" in a image

#

is there any approach to solve this problem

plush jungle Jun 7, 2022, 5:17 AM

#

weary ridge is there any approach to solve this problem

pytesseract should return a string when run

#

you can use regex on that string to match with the text you want

weary ridge Jun 7, 2022, 5:18 AM

#

plush jungle pytesseract should return a string when run

text = pytesseract.image_to_string(img)

#

this will generate string of the text

weary ridge Jun 7, 2022, 5:18 AM

#

plush jungle you can use regex on that string to match with the text you want

but the thing is what if the text is complicated?

#

like some random characters installed in between due to foreign languages

plush jungle Jun 7, 2022, 5:19 AM

#

regex can handle that

weary ridge Jun 7, 2022, 5:20 AM

#

S) l\infected.html > @) Search, Pr @

¥

ka Mail - Knox Portal @iNinfected.htmi

You are infected!

om | O Jype here to search t F g A , AIC O Bl F va 4

weary ridge Jun 7, 2022, 5:20 AM

#

plush jungle regex can handle that

ohh

plush jungle Jun 7, 2022, 5:22 AM

#

what are you searching for in this string

weary ridge Jun 7, 2022, 5:22 AM

#

you are infected

#

wait lemme run with regex

plush jungle Jun 7, 2022, 5:22 AM

#

https://rubular.com/r/CI5qU8WMCbBPcy

weary ridge Jun 7, 2022, 5:23 AM

#

what link is this?

plush jungle Jun 7, 2022, 5:23 AM

#

rubular is a website that lets you test regexes in real time

#

so you don't have to run a python script every single time you want to tweak your regex

weary ridge Jun 7, 2022, 5:24 AM

#

plush jungle rubular is a website that lets you test regexes in real time

ohh

#

how the code in python looks like for using this regex?

#

how to comment these selected lines

#

at once

#

should we have to # all the time for each lines?

plush jungle Jun 7, 2022, 5:27 AM

#

don't post code in images

#

post it like this

#

!code

arctic wedgeBOT Jun 7, 2022, 5:27 AM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

plush jungle Jun 7, 2022, 5:28 AM

#

also what variable represents the string you just posted?

#

is it b?

weary ridge Jun 7, 2022, 5:29 AM

#

plush jungle don't post code in images

ohkkk

#

test = pytesseract.image_to_string(img)
if(test.find("You are infected!")!=-1):
    print("Match Found")
else:
    print("Match not Found")

#

i got the code without using regex

#

🥲

weary ridge Jun 7, 2022, 5:30 AM

#

plush jungle also what variable represents the string you just posted?

no b is just an iterable

plush jungle Jun 7, 2022, 5:30 AM

#

yeah if you know there won't be any letters in between you don't need regex

weary ridge Jun 7, 2022, 5:30 AM

#

ohh

#

what s mean by letters inbetween

#

can you give any example?

plush jungle Jun 7, 2022, 5:31 AM

#

but what regex can do is detect strings like this

you a8re in4fesecte$d```

weary ridge Jun 7, 2022, 5:31 AM

#

ohh

#

thats interesting

plush jungle Jun 7, 2022, 5:31 AM

#

if you run into that problem, remember that regex is the solution

weary ridge Jun 7, 2022, 5:31 AM

#

but while doing image to text, why will some random letters come inbetween

plush jungle Jun 7, 2022, 5:32 AM

#

ocr, like all machine learning, is probabilistic

#

the computer just makes educated guesses

#

sometimes those guesses are wrong

weary ridge Jun 7, 2022, 5:34 AM

#

yeahh

#

you are right

#

https://www.w3schools.com/python/python_regex.asp

W3Schools offers free online tutorials, references and exercises in all the major languages of the web. Covering popular subjects like HTML, CSS, JavaScript, Python, SQL, Java, and many, many more.

#

this link will work right?

plush jungle Jun 7, 2022, 5:42 AM

#

yeah that's a good source to learn regex

weary ridge Jun 7, 2022, 5:43 AM

#

@plush jungle I have a followup qn too

#

how to find the coordinates of the box enclosing the sentence You are infected!

plush jungle Jun 7, 2022, 5:45 AM

#

import pytesseract
from pytesseract import Output
import cv2
img = cv2.imread('image.jpg')

d = pytesseract.image_to_data(img, output_type=Output.DICT)
n_boxes = len(d['level'])
for i in range(n_boxes):
    (x, y, w, h) = (d['left'][i], d['top'][i], d['width'][i], d['height'][i])
    cv2.rectangle(img, (x, y), (x + w, y + h), (0, 255, 0), 2)

cv2.imshow('img', img)
cv2.waitKey(0)```

#

something like this

weary ridge Jun 7, 2022, 5:48 AM

#

ohh i ll try once

worldly dawn Jun 7, 2022, 5:50 AM

#

plush jungle ```py import pytesseract from pytesseract import Output import cv2 img = cv2.imr...

that was quick. Did you have that ready?

plush jungle Jun 7, 2022, 5:51 AM

#

worldly dawn that was quick. Did you have that ready?

that's my secret captain, I always copy paste from stackoverflow answers

worldly dawn Jun 7, 2022, 5:51 AM

#

(or copy/pasted from a sample?)

weary ridge Jun 7, 2022, 5:52 AM

#

some people are pro in searching questions in stackoverflow

worldly dawn Jun 7, 2022, 5:52 AM

#

They are called senior engineers

weary ridge Jun 7, 2022, 5:52 AM

#

whereas people like me dont get answer to single questinos

#

😵‍💫

weary ridge Jun 7, 2022, 5:52 AM

#

worldly dawn They are called `senior engineers`

i see

plush jungle Jun 7, 2022, 5:52 AM

#

btw, recursive, I know this is off topic for this channel but do you have any idea why this is giving me strange values

target = (math.cos(math.radians(self.angle)), math.sin(math.radians(self.angle)))```

#

if self.angle is 90

#

it should give (0,1)

weary ridge Jun 7, 2022, 5:53 AM

#

actually pi/2 radian is different from 90degree

#

like pi/2 is irrational

plush jungle Jun 7, 2022, 5:53 AM

#

but I'm doing math.radians

weary ridge Jun 7, 2022, 5:53 AM

#

so degree to radian conversion is not exact

#

its approximate

#

am i correct?

plush jungle Jun 7, 2022, 5:54 AM

#

but it's not even close

#

it's giving me
(6.123233995736766e-17, 1.0)

#

oh wait

#

that is close

worldly dawn Jun 7, 2022, 5:55 AM

#

In [8]: (math.cos(math.pi / 2), math.sin(math.pi / 2))
Out[8]: (6.123233995736766e-17, 1.0)

#

yeah, looks like a float representation issue

plush jungle Jun 7, 2022, 5:56 AM

#

oh i'm dumb

#

I had this

worldly dawn Jun 7, 2022, 5:56 AM

#

In [9]: (math.cos(math.pi), math.sin(math.pi ))
Out[9]: (-1.0, 1.2246467991473532e-16)

plush jungle Jun 7, 2022, 5:56 AM

#

        self.x += self.target_vector[0]/100
        self.x += self.target_vector[1]/100```

#

I need to lay off the copy pasting

#

anyway, this is tangentially related to machine learning

#

cause I'm making a reinforcement learning bot

worldly dawn Jun 7, 2022, 5:57 AM

#

numerical precision does matter even in ml

plush jungle Jun 7, 2022, 5:57 AM

#

by reverse engineering a flappy bird reinforcement learner

#

and retrofitting it for a top down pygame shooter

worldly dawn Jun 7, 2022, 5:58 AM

#

plush jungle and retrofitting it for a top down pygame shooter

that would be an interesting blog post

plush jungle Jun 7, 2022, 5:58 AM

#

worldly dawn that would be an interesting blog post

yeah if I get it working I might do that

worldly dawn Jun 7, 2022, 5:59 AM

#

plush jungle yeah if I get it working I might do that

even if you do not.
Learning from failures is as valuable (if not more) than learning from the success

plush jungle Jun 7, 2022, 5:59 AM

#

amen to that

worldly dawn Jun 7, 2022, 5:59 AM

#

there is a demotivator about it too

#

(not a meme channel but https://despair.com/collections/demotivators/products/mistakes?variant=4376100306965)

weary ridge Jun 7, 2022, 6:03 AM

#

plush jungle ```py import pytesseract from pytesseract import Output import cv2 img = cv2.imr...

can you send me the link of this stackoverflow site>?

plush jungle Jun 7, 2022, 6:03 AM

#

sure

#

https://stackoverflow.com/questions/20831612/getting-the-bounding-box-of-the-recognized-words-using-python-tesseract

plush jungle Jun 7, 2022, 6:05 AM

#

worldly dawn (not a meme channel but <https://despair.com/collections/demotivators/products/m...

yeah I mean unironically it's something to be proud of if you think about it

#

marie curie discovered both radium and the fact that being around radium kills you

worldly dawn Jun 7, 2022, 6:10 AM

#

plush jungle yeah I mean unironically it's something to be proud of if you think about it

it's a bit too late here to get into these kind of debates

plush jungle Jun 7, 2022, 6:10 AM

#

sorry, I tend to wax philosophical in the late hours of the night

worldly dawn Jun 7, 2022, 6:11 AM

#

np, it's still an interesting question

fleet musk Jun 7, 2022, 6:32 AM

#

hi guys, so I am stuck in a problem, and found something that might help me out on stackoverflow

#

https://stackoverflow.com/questions/44967805/pandas-how-to-find-a-particular-pattern-in-a-dataframe-column

Stack Overflow

Pandas: How to find a particular pattern in a dataframe column?

I'd like to find a particular pattern in a pandas dataframe column, and return the corresponding index values in order to subset the dataframe.

Here's a sample dataframe with a possible pattern:

#

.
my question is, instead of integers, can I use a range?
like [100-105, 110-115, 120-125]
.

urban lance Jun 7, 2022, 7:50 AM

#

for some reason it gives me a future warning when I do on date 🤔

#

var = (df.set_index('date').groupby("user")).rolling('14D')

#

it doesn't throw the warning if I set the index to the date 🤷‍♂️

urban lance Jun 7, 2022, 8:15 AM

#

and also 😬

#

Does anyone have experience with the df.rolling function?

serene scaffold Jun 7, 2022, 10:31 AM

#

@urban lance try giving a minimal reproducible example that can be copied and pasted exactly.

haughty pewter Jun 7, 2022, 11:07 AM

#

are there any usually reasonable ways in general to find a correlation analysis between 2 columns if there's already 100,000 rows to use

pliant pewter Jun 7, 2022, 11:43 AM

#

If you gave the points like 20% opacity, it would be easier to visualize the density of them

#

But a priori, I do not see any strong correlations in that plot, lol!

serene scaffold Jun 7, 2022, 11:44 AM

#

it would be difficult to imagine a weaker correlation

pliant pewter Jun 7, 2022, 11:48 AM

#

Without better density information, it's hard to say. Maybe there's a very dense straight line in the middle, with lots of outliers

serene scaffold Jun 7, 2022, 11:49 AM

#

or, maybe there isn't lemon_angrysad

haughty pewter Jun 7, 2022, 11:54 AM

#

there's really not a lot i can work with regarding trying to make it dense so would "there is no correlation between Age and the Final Score" suffice
unless i do something like "check if age > insertNumberHere use row", if it's possible, would that be fine too

pliant pewter Jun 7, 2022, 11:56 AM

#

I mean, you can compute the correlation coefficient easily. It's gonna be small. And then you can say there's no useful (linear) correlation.

flat hollow Jun 7, 2022, 12:49 PM

#

This is a top-down view of my surface plot. The z axis shows the speed of fluid flow. The structure is a big vial full of water (big circle) with a tube going through the middle (small circle). I'm trying to figure out how to extract the velocity from the middle and find the peak velocity, ideally without having to manually label which pixels to grab the values from because I have a lot of these images and the position of these things in the x-y plane changes slightly. Any ideas?

haughty anvil Jun 7, 2022, 1:11 PM

#

Hi, has anyone here used SciPy before? What are some example projects that one can build with SciPy? I'm trying to get a better understanding of it. Also, what is the difference between SciPy and NumPy? Thank you!

serene scaffold Jun 7, 2022, 1:27 PM

#

haughty anvil Hi, has anyone here used SciPy before? What are some example projects that one c...

scipy is mostly statistics stuff that you can do to numpy arrays

#

numpy is pretty much at the foundation of everything

#

that said, there aren't projects you can do specifically in terms of one data science library

#

it's not like "build a website with django".

haughty anvil Jun 7, 2022, 1:32 PM

#

Hi @serene scaffold ! Ok, thank you! So with SciPy it sounds like there are things you can do with the data? Like if you did some speech recognition stuff and get a text transcript back. Then are there things one can do with SciPy on with that text?

serene scaffold Jun 7, 2022, 1:36 PM

#

haughty anvil Hi <@253696366952316929> ! Ok, thank you! So with SciPy it sounds like there are...

Like if you did some speech recognition stuff and get a text transcript back
no, you can't do that with scipy. scipy is for doing math.

#

Then are there things one can do with SciPy on with that text?
probably not. try spaCy.

haughty anvil Jun 7, 2022, 1:37 PM

#

oooh

#

Ok, gotcha! Thank you!

urban lance Jun 7, 2022, 1:47 PM

#

serene scaffold <@424603301103796234> try giving a minimal reproducible example that can be copi...

I decided to not use rolling anymore

hallow panther Jun 7, 2022, 1:51 PM

#

Does anyone have experience with Google's MT5 text model?

urban lance Jun 7, 2022, 1:51 PM

#

I have a df with session IDs, I'd like to group information by session to pass through a function but each group has to have the rows of previously processed groups as well. Is groupby able to do this?

serene scaffold Jun 7, 2022, 2:34 PM

#

urban lance I have a df with session IDs, I'd like to group information by session to pass t...

pandas doesn't effectively support iterative operations where previous iterations matter.

serene scaffold Jun 7, 2022, 2:35 PM

#

hallow panther Does anyone have experience with Google's MT5 text model?

remember to always ask your actual questions. don't try to filter people by what they think they know before you've said what you really need help with.

brave sand Jun 7, 2022, 2:45 PM

#

how can I use Shapley values to design utility and payoff for multi agent reinforcement learning?

haughty topaz Jun 7, 2022, 2:58 PM

#

how can I exactly predict tomorrows stock price? Trynna get a bag quick

serene scaffold Jun 7, 2022, 2:59 PM

#

haughty topaz how can I exactly predict tomorrows stock price? Trynna get a bag quick

what do you mean "trynna get a bag quick"?

#

it sounds like you might have unrealistic expectations. you can't predict the future, let alone exactly. you can only forecast it.

novel elbow Jun 7, 2022, 3:40 PM

#

urban lance I have a df with session IDs, I'd like to group information by session to pass t...

You can pass a class instead of a function to aggregate the groupby, that way you store the intermediate results in the class

prime finch Jun 7, 2022, 4:50 PM

#

Hello everyone, may I ask, do you have any references about RecommenderNet algorithm?

hollow sentinel Jun 7, 2022, 5:19 PM

#

serene scaffold what do you mean "trynna get a bag quick"?

it means he's tryna get a bag quick stel

misty flint Jun 7, 2022, 5:48 PM

#

ID_BoomKek

#

bruh

serene scaffold Jun 7, 2022, 6:06 PM

#

hollow sentinel it means he's tryna get a bag quick stel

well, if any of us could exactly predict the stock market, a few of us would be rich and we wouldn't hang out in this Discord BingShrug

plush jungle Jun 7, 2022, 6:09 PM

#

any machine learning tool you have access to, wall street investment bankers also have access to. if there was a way to predict stocks accurately, they'd still be richer than you because they'd use the same tool but with better data and more expertise

#

and more seed capital

plush jungle Jun 7, 2022, 7:36 PM

#

deep Q learning is short term, right? it only ever looks at which actions have immediate benefits given a current state?

#

so it's not going to be able to patterns that take longer delays between the action and the reward?

#

I'm trying to repurpose this deep Q learning code that teaches a bot to play flappy bird and have it learn to play a top down shooter game

#

#

the blue dot tries to shoot the red dot by deciding to change the angle of its laser sight, do nothing, or shoot

#

it's 186,000 turns in, and it's really not getting noticeably better

#

the code that updates the neural net's weights is as follows:

#

        minibatch = random.sample(replay_memory, min(len(replay_memory), model.minibatch_size))

        # unpack minibatch
        state_batch = torch.cat(tuple(d[0] for d in minibatch))
        action_batch = torch.cat(tuple(d[1] for d in minibatch))
        reward_batch = torch.cat(tuple(d[2] for d in minibatch))
        state_1_batch = torch.cat(tuple(d[3] for d in minibatch))

        # get output for the next state
        output_1_batch = model(state_1_batch)

        # set y_j to r_j for terminal state, otherwise to r_j + gamma*max(Q)
        y_batch = torch.cat(tuple(reward_batch[i] if minibatch[i][4]
                                  else reward_batch[i] + model.gamma * torch.max(output_1_batch[i])
                                  for i in range(len(minibatch))))

        # extract Q-value
        q_value = torch.sum(model(state_batch) * action_batch, dim=1)

        # PyTorch accumulates gradients by default, so they need to be reset in each pass
        optimizer.zero_grad()

        # returns a new Tensor, detached from the current graph, the result will never require gradient
        y_batch = y_batch.detach()

        # calculate loss
        loss = criterion(q_value, y_batch)```

#

I don't entirely understand what y_batch and q_value are, but as far as I can tell, nothing in this does anything that would track the long term benefits of a move

#

which means if it takes 50 turns for a bullet to reach the target, the model will never learn how to aim

iron basalt Jun 7, 2022, 10:47 PM

#

plush jungle deep Q learning is short term, right? it only ever looks at which actions have ...

No.

plush jungle Jun 7, 2022, 10:48 PM

#

iron basalt No.

I'm not sure I understand how it makes long term connections between an action (like firing a bullet) and a delayed reward (like the bullet hitting its target 50 moves later)

#

this minibatch code is the only part where it does gradient descent, so somewhere in the code I posted must be the long term learning you're talking about

#

could you give me a hint as to how this works?

unborn inlet Jun 7, 2022, 10:55 PM

#

how many images are good for an ML database of dogs?

#

also, if im trying to detect something, do i need a database of stuff that is what im trying to detect and a database of stuff im not trying to detect?

#

if that make sany sense

agile cobalt Jun 7, 2022, 10:59 PM

#

depends on which model you are using, what is the purpose of the model, and which kind of pictures you'll feed it later and probably a few dozen other factors I do not even know

if you want to accurately identify all dog breeds, from any angle, and tell apart not-a-dog as well, that 1.000.000 joke might not even have been all that far-fetched

if you just want to tell if a picture of a front-facing dog is a Shiba Inu or a Chihuahua, a few dozens or hundreds would suffice

unborn inlet Jun 7, 2022, 11:00 PM

#

i basically want to say if its a dog or not a dog

#

im using MLPClassifier

iron basalt Jun 7, 2022, 11:01 PM

#

plush jungle I'm not sure I understand how it makes long term connections between an action (...

Do some q-learning by hand with a q-table in a small simple maze (such as a T-maze).

agile cobalt Jun 7, 2022, 11:01 PM

#

"not a dog" can be literally anything, or just one specific kind of thing?

unborn inlet Jun 7, 2022, 11:01 PM

#

agile cobalt "not a dog" can be literally anything, or just one specific kind of thing?

like if i send an image of a house it shoudl say no dog detected

#

yk?

agile cobalt Jun 7, 2022, 11:01 PM

#

https://xkcd.com/1425/

xkcd: Tasks

#

disclaimer: I have never personally worked with classifying images

you may be able to make it work using a HuggingFace or fast.ai pre-trained model and potentially fine-tune to which kinds of dogs your data will actually include, but it might be trickier than it sounds

#

that said, if you want to do it with your own dataset, without using a pre-trained model, I don't really have any ideas of how to help other than "good luck"

unborn inlet Jun 7, 2022, 11:08 PM

#

im gonna try a different approach actually

#

thank you for the help tho

pseudo wren Jun 8, 2022, 2:55 AM

#

I'm doing a time series model based off the collapse of WireCard

#

the model is based on the stock prices for that time

#

my graphs are looking a little fucked though

#

so i'm not totally sure what to do with it

#

#

unsure what i'm doing wrong, but the graph is real....wonky looking

proven pier Jun 8, 2022, 3:49 AM

#

Yall have any good books you recommend for DSP/data science? @ me since I turn off all notifications lol

plush jungle Jun 8, 2022, 5:22 AM

#

I'm trying to understand the code and concepts behind Q learning, as explained here

#

https://www.toptal.com/deep-learning/pytorch-reinforcement-learning-tutorial

#

but I'm stuck on how the Q learning algorithm predicts future payoffs, not just the payoffs that will occur at t+1

#

it uses replay memory, and randomly selects 32 previous examples of turns

#

but in that replay memory the only information is the state, action, reward and image

#

there's nothing linking any given turn to its future reward

iron basalt Jun 8, 2022, 5:25 AM

#

plush jungle I'm trying to understand the code and concepts behind Q learning, as explained h...

It seems your goal is to understand Q-learning. Adding deep learning into it is trying to tackle two problems at the same time. Split up the problem into multiple sub problems and do those separately. In this case that is understanding Q-learning without deep learning, and then how deep learning comes into play.

plush jungle Jun 8, 2022, 5:26 AM

#

ok so if we take the maze example you suggested

#

I think I get how it works

iron basalt Jun 8, 2022, 5:26 AM

#

Using tabular methods for Q-learning (RL in general) makes it really obvious, since they can even be done by hand for very simple toy problems.

plush jungle Jun 8, 2022, 5:26 AM

#

because the reward is given based off of immediate success or not

#

well actually wait

#

with a bigger maze

iron basalt Jun 8, 2022, 5:27 AM

#

Follow the Q-learning algorithm for a simple maze and see how the Q-table is updated.

plush jungle Jun 8, 2022, 5:27 AM

#

ok so each state has a value associated with each action

#

and that makes up the table

#

so each square of the maze that the player could occupy is a state

#

and eventually the correct path is produced in the table

#

through rewards updating the table values

iron basalt Jun 8, 2022, 5:29 AM

#

Yes. Although it may not take it exactly depending on the choice of exploration vs exploitation.

plush jungle Jun 8, 2022, 5:29 AM

#

right, I think I understand that too

#

but it all falls apart when you go from like 100 states to millions

#

because in my code, the states are vectors representing the image

iron basalt Jun 8, 2022, 5:30 AM

#

Yup.

#

Tabular only works for simple things. Its space complexity is bad.

plush jungle Jun 8, 2022, 5:31 AM

#

I want the agent to learn that firing the bullet will yield a powerful reward, but not immediately. the neural network that influences what action the agent chooses is trained on minibatchs

iron basalt Jun 8, 2022, 5:32 AM

#

You know what else is not immediate? The reward at the end of the maze. So how does the agent know, when all the way at the start, where to go?

#

(Not trained vs trained)

plush jungle Jun 8, 2022, 5:33 AM

#

because each square in the maze receives a reward based on whether it hit a wall or how close it is to the goal, right?

iron basalt Jun 8, 2022, 5:33 AM

#

No reward is given except at the goal state.

plush jungle Jun 8, 2022, 5:34 AM

#

oh

#

so it works backwards then? the square before the goal gets a strong update to the weight for choosing the right action

#

and then the square behind that gets a stronger weight for the action that gets you to that state?

#

like because of exploration, eventually the agent will stumble its way to the end

#

the final square's action weight will be updated, but then what about final square - 1

#

if reward is only given at the end, how does final square -1 know to update the weight for the action that gets it to final square

#

since it won't receive a reward for doing so

iron basalt Jun 8, 2022, 5:37 AM

#

https://wikimedia.org/api/rest_v1/media/math/render/svg/678cb558a9d59c33ef4810c9618baf34a9577686

#

Imagine s_t is the tile before the last tile (the goal).

plush jungle Jun 8, 2022, 5:40 AM

#

the tile right before the goal makes sense to me, but "estimate of optimal future value" is the part that confuses me. for s_t-1 how does it calculate that future value?

iron basalt Jun 8, 2022, 5:40 AM

#

s_t becomes s_t-1 when it moves to the goal.

#

They are the same thing.

#

s_t, s_t+1 or s_t-1, s_t

lapis sequoia Jun 8, 2022, 5:41 AM

#

proven pier Yall have any good books you recommend for DSP/data science? @ me since I turn o...

What's DSP

iron basalt Jun 8, 2022, 5:42 AM

#

Why is that image so small? Click to enlarge.

plush jungle Jun 8, 2022, 5:42 AM

#

yeah it's the s+1 I'm struggling with

#

it just got reward at time t

iron basalt Jun 8, 2022, 5:42 AM

#

So you did the action that takes you to the goal state s_t+1

plush jungle Jun 8, 2022, 5:42 AM

#

how does it know reward at time t+1

iron basalt Jun 8, 2022, 5:43 AM

#

But what are you updating according to the equation now?

plush jungle Jun 8, 2022, 5:43 AM

#

we just got from final square to goal? I guess we'd update Q?

iron basalt Jun 8, 2022, 5:44 AM

#

Yes, but Q of what?

#

Imagine Q as the Q table.

plush jungle Jun 8, 2022, 5:44 AM

#

right

iron basalt Jun 8, 2022, 5:44 AM

#

You look things up in it.

plush jungle Jun 8, 2022, 5:44 AM

#

it tells you which actions yield what rewards in a given state

#

in this case that state being the final square