#data-science-and-ml | Python | Page 345

harsh bear Oct 3, 2021, 6:12 AM

#

This is a code running on vps

#

im having 2 problems

#

It isnt writing to the database

#

I want it to display a graph, but since it is a graph i was thinking if it cld be a webapp or smth it wld be cool

wicked grove Oct 3, 2021, 6:37 AM

#

i have a doubt in this code
data_pos = data_pos.iloc[:int(20000)]
why have they used int(20000), can i just write it as data_pos.iloc[0:20000,:]

quick kestrel Oct 3, 2021, 7:05 AM

#

guys someone tell me can i make a ai model with scikit-learn or do i need pytorch or tensorflow

#

?

fading burrow Oct 3, 2021, 7:09 AM

#

depends on what you mean by ai model there. sklearn is a machine learning module

desert oar Oct 3, 2021, 7:11 AM

#

wicked grove i have a doubt in this code data_pos = data_pos.iloc[:int(20000)] why have they ...

yes, and you can also write data_pos.iloc[0:20000] or data_pos.iloc[:20000]

#

there is no good reason to ever write int(20000)

wicked grove Oct 3, 2021, 7:11 AM

#

Okayy thanks a lot!!

#

And it means all rows from 0 to 20000 and all columns?

desert oar Oct 3, 2021, 7:12 AM

#

yes

#

you can also do data_pos.head(20000)

wicked grove Oct 3, 2021, 7:15 AM

#

Ohh alrightt,thankss

velvet thorn Oct 3, 2021, 7:59 AM

#

desert oar there is no good reason to ever write `int(20000)`

to create a copy?

#

in case you wanna do identity comparisons?

#

😔

wicked grove Oct 3, 2021, 8:12 AM

#

velvet thorn in case you wanna do identity comparisons?

What is identity comparison?

#

Wanted to ask you a doubt in this
Would it be possible to figure something out using the nltk library?

velvet thorn Oct 3, 2021, 8:35 AM

#

wicked grove What is identity comparison?

is

eager imp Oct 3, 2021, 8:43 AM

#

there are lots and lots of papers, books, youtube videos etc. about ML and NLTK

tough bolt Oct 3, 2021, 8:45 AM

#

Is anyone familiar with PYG (pytorch geometric)?

next lance Oct 3, 2021, 9:19 AM

#

eager imp there are lots and lots of papers, books, youtube videos etc. about ML and NLTK

My aim is to create a chat bot like Google assistant which can also detect images and also has voice features

#

I was browsing Tenserflow and Keras today

#

But it's hard to figure out what's going on in the code

eager imp Oct 3, 2021, 9:21 AM

#

next lance My aim is to create a chat bot like Google assistant which can also detect image...

you're aware that any one of these features alone is worth years of research and work?

next lance Oct 3, 2021, 9:22 AM

#

eager imp you're aware that any one of these features alone is worth years of research and...

Yes I am aware

#

I am just a student in class 9th for now

#

And it takes takes much of time making so many samples

#

Learning all this

#

I think I am going in the right direction

#

I have heard of raspberry is it similar to AI and Machine Learning

eager imp Oct 3, 2021, 9:24 AM

#

next lance I am just a student in class 9th for now

not sure what that means..

next lance Oct 3, 2021, 9:24 AM

#

I am a student

#

Studies in Class 9th

#

What about you

#

I think you have been learning from a very long time

eager imp Oct 3, 2021, 9:26 AM

#

that's right

next lance Oct 3, 2021, 9:26 AM

#

Oh

#

So Do you have any idea what should I do in first

#

Should I watch YouTube tutorials first ?

eager imp Oct 3, 2021, 9:27 AM

#

what you should do first? well..

#

watch some sci-fi

#

get inspired

next lance Oct 3, 2021, 9:27 AM

#

Huh?

eager imp Oct 3, 2021, 9:27 AM

#

pick a project

next lance Oct 3, 2021, 9:27 AM

#

Inspiration with Sci fi ?

eager imp Oct 3, 2021, 9:27 AM

#

yep

#

like iron man

next lance Oct 3, 2021, 9:27 AM

#

eager imp pick a project

Let's say a simple without graphic chat bot

next lance Oct 3, 2021, 9:28 AM

#

eager imp yep

But is there any realtion between Machine learning and AI with Iron Man

#

Machine learning is all about Data and kinda of Robotics ?

eager imp Oct 3, 2021, 9:29 AM

#

iron man is full of inspiration in areas of robotics, AI, material design etc

#

pick whatever that inspires you the most and stick with it

next lance Oct 3, 2021, 9:30 AM

#

Oh so Iron Man is a project I am gonna be working on

next lance Oct 3, 2021, 9:30 AM

#

eager imp pick whatever that inspires you the most and stick with it

Ya Iron Man is good 😄

#

Then what should I do after selecting a Project

eager imp Oct 3, 2021, 9:31 AM

#

read research papers, watch youtube videos on those topics you don't fully understand

#

and always keep visualizing the thing you want to achieve

#

the real effort is to discern irrelevant stuff from relevant information to achieve your goal

#

don't let shiny new research etc. distract you

next lance Oct 3, 2021, 9:34 AM

#

Yes I know Thanks a lot

#

So What should I read first

#

Like is there any good yt tutorial

#

I know about Sendex

eager imp Oct 3, 2021, 9:35 AM

#

well, you said you want to achieve something like google assist, can you visualize, imagine how you want to interact with it?

next lance Oct 3, 2021, 9:36 AM

#

Yes I can imagine a lot about it

eager imp Oct 3, 2021, 9:36 AM

#

what's the most important feature you want it to have?

next lance Oct 3, 2021, 9:36 AM

#

Like there can be a assistant which Can call a robot for you

next lance Oct 3, 2021, 9:36 AM

#

eager imp what's the most important feature you want it to have?

Hmm

#

Chating for now

eager imp Oct 3, 2021, 9:37 AM

#

chatting is boring

next lance Oct 3, 2021, 9:37 AM

#

I am a school student so I cannot work on big projects

eager imp Oct 3, 2021, 9:37 AM

#

what's useful about chatting?

next lance Oct 3, 2021, 9:37 AM

#

Oh yeh there's no.much use

#

A image detector

eager imp Oct 3, 2021, 9:37 AM

#

anyone can work on big projects, it's not a matter of what or who you are or how much time you have at hand

next lance Oct 3, 2021, 9:37 AM

#

A robot?

#

That can use GPS to detect where you are

eager imp Oct 3, 2021, 9:38 AM

#

if you invest 10 minutes every day in your vision, it's more than someone who doesn't invest those 10 minutes

next lance Oct 3, 2021, 9:38 AM

#

pithink yes I know about it

#

So what about a robot

#

Like I cannot even get parts of a robot

eager imp Oct 3, 2021, 9:38 AM

#

what's useful about a robot with GPS?

next lance Oct 3, 2021, 9:39 AM

#

Like if you need tea

#

What you have to do is just call the robot and the robot will use GPS to find the way in your home. not sure if I am right

eager imp Oct 3, 2021, 9:39 AM

#

next lance A image detector

image detection is interesting also, but how could it be useful to your grandma?

#

or neighbour?

next lance Oct 3, 2021, 9:40 AM

#

Like they can get information using a image that is in the mobile

#

Without searching keywords on Google

eager imp Oct 3, 2021, 9:40 AM

#

so basically a wikipedia extension?

next lance Oct 3, 2021, 9:40 AM

#

Yes maybe

eager imp Oct 3, 2021, 9:41 AM

#

or maybe a tool identifier

#

just imagine you get in a workshop for the first time and there are all the new and old tools and you don't know what to do with any of those

next lance Oct 3, 2021, 9:42 AM

#

What about a voice lock and voice search service
I can say Open phone and my phone will automatically by detecting my voice. Then I can say search the meaning of this word and it will search it for me
Or another example is like Open YouTube, search Python basics and play the first video

next lance Oct 3, 2021, 9:42 AM

#

eager imp just imagine you get in a workshop for the first time and there are all the new ...

Oh looks great

eager imp Oct 3, 2021, 9:43 AM

#

maybe it could help identify tools and provide instructions for use?

next lance Oct 3, 2021, 9:43 AM

#

We can extend it to like you are in a plane and you don't know anything about controls

next lance Oct 3, 2021, 9:43 AM

#

eager imp maybe it could help identify tools and provide instructions for use?

Exactly

eager imp Oct 3, 2021, 9:43 AM

#

or maybe a car?

next lance Oct 3, 2021, 9:43 AM

#

Yes

eager imp Oct 3, 2021, 9:43 AM

#

"what's that knob for?"

tough bolt Oct 3, 2021, 9:43 AM

#

tough bolt Is anyone familiar with PYG (pytorch geometric)?

bump

next lance Oct 3, 2021, 9:43 AM

#

Automatically understand what part does what by using it's image

eager imp Oct 3, 2021, 9:44 AM

#

that would certainly be useful

next lance Oct 3, 2021, 9:44 AM

#

Oh yeh

#

So should I lock this idea and work on it
It's pretty easy and interesting

eager imp Oct 3, 2021, 9:45 AM

#

i'm not sure about easy, but certainly interesting

next lance Oct 3, 2021, 9:45 AM

#

I can use Tenserflow for image detection

#

Then something else maybe for searching it

#

Can we just create a data group where I will add so many data sets

eager imp Oct 3, 2021, 9:45 AM

#

i think it's a goal worth pursuing, sure - and potentially quite profitable if you do it right 🙂

next lance Oct 3, 2021, 9:46 AM

#

😅

#

Not sure about profit But it's Pretty good

eager imp Oct 3, 2021, 9:46 AM

#

much better than a boring chatbot

next lance Oct 3, 2021, 9:46 AM

#

Ya true

#

So I will start learning Image detection now

#

Can I add you as a friend as to talk to you later maybe

#

Bye

eager imp Oct 3, 2021, 9:47 AM

#

cya

tough bolt Oct 3, 2021, 9:50 AM

#

tough bolt bump

.

pliant bone Oct 3, 2021, 10:36 AM

#

for image related stuff I would go with opencv https://github.com/opencv/opencv

GitHub

GitHub - opencv/opencv: Open Source Computer Vision Library

Open Source Computer Vision Library. Contribute to opencv/opencv development by creating an account on GitHub.

next lance Oct 3, 2021, 11:05 AM

#

https://youtu.be/COlbP62-B-U

YouTube

sentdex

Intro - TensorFlow Object Detection API Tutorial p.1

Hello and welcome to a miniseries and introduction to the TensorFlow Object Detection API. This API can be used to detect, with bounding boxes, objects in images and/or video using either some of the pre-trained models made available or through models you can train on your own (which the API also makes easier).

Text tutorials and sample code: h...

▶ Play video

desert oar Oct 3, 2021, 12:30 PM

#

wicked grove Wanted to ask you a doubt in this Would it be possible to figure something out ...

nltk has some primitive tools that could help with text processing in general, but i'm not sure about this particular task

by the way, in american and british english we say "ask a question". i see a lot of people on this server in particular treat "doubt" as synonymous with "question" and it seems very foreign to me. a "doubt" is more about being pessimistic or skeptical. you might "doubt your understanding" of a topic in that you are feeling unconfident in your understanding, but you wouldn't say that "i have a doubt" as in "there is something specific i don't understand"

#

it's such a common issue that i never say anything about it, but i felt compelled in this case because my original response was "i doubt it", and that gave me pause

frank roost Oct 3, 2021, 12:33 PM

#

Hi, I am currently doing a project in predicting Collective Variables for studying Molecular Dynamics using Deep Learning. If possible, I would like to check models using some datasets already available online. I need multiple trajectories of a single system(like a simple protein) with the same conditions. If anybody could provide me some resources, it would be really helpful.

serene scaffold Oct 3, 2021, 12:34 PM

#

Look into time series forecasting

desert oar Oct 3, 2021, 12:34 PM

#

https://otexts.com/fpp3/

Forecasting: Principles and Practice (3rd ed)

3rd edition

smoky stone Oct 3, 2021, 12:39 PM

#

hey guys, im a beginner and i've put up a small question on #help-cupcake

#

this is the question:

Question: Please help me understand how to solve this. I've already tried installing openpyxl, with pip and pip3.
Error: ImportError: Missing optional dependency 'openpyxl'. Use pip or conda to install openpyxl.
Code: import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

df = pd.read_excel('superstore_sales.xlsx') #importing the data

Explain: I'm running this on VS Code with a Jupyter extension, but i'm just not able to import data from a file to start the analysis

next lance Oct 3, 2021, 12:40 PM

#

Hello I think that creating something virtual is better than creation something real like.
If I create a Real Robot, it's good but not better than other AI projects.

Am I Right

smoky stone Oct 3, 2021, 12:41 PM

#

smoky stone this is the question: **Question**: Please help me understand how to solve this...

Could anyone help me with this?

halcyon gust Oct 3, 2021, 12:49 PM

#

smoky stone this is the question: **Question**: Please help me understand how to solve this...

you need to install openpyxl

smoky stone Oct 3, 2021, 12:50 PM

#

halcyon gust you need to install openpyxl

I have it installed, checked with pip freeze too

halcyon gust Oct 3, 2021, 12:50 PM

#

can you import it?

#

maybe try after importing it

desert oar Oct 3, 2021, 12:52 PM

#

smoky stone I have it installed, checked with pip freeze too

You probably have multiple python installations, or are in the wrong virtual environment

smoky stone Oct 3, 2021, 12:53 PM

#

halcyon gust can you import it?

no it says there's no such module as openpyxl

smoky stone Oct 3, 2021, 12:54 PM

#

desert oar You probably have multiple python installations, or are in the wrong virtual env...

is there a way i can check and correct this?

pliant bone Oct 3, 2021, 1:03 PM

#

smoky stone is there a way i can check and correct this?

import openpyxl

halcyon gust Oct 3, 2021, 1:06 PM

#

smoky stone is there a way i can check and correct this?

this is a more general issue. in python it's important to understand what environment you are using and what packages are installed for this specific environment.

for the sake of discussion, every python.exe file is configured to use a different pip system that has its own packages installed on it.

serene scaffold Oct 3, 2021, 1:06 PM

#

smoky stone no it says there's no such module as openpyxl

you can do which python to see where the python you're using is located.

halcyon gust Oct 3, 2021, 1:07 PM

#

and it does indeed sound as though, for example, when you do 'pip freeze', you are not getting the packages data from the actual environment you are using when trying to do 'import openpyxl'

pliant bone Oct 3, 2021, 1:10 PM

#

guessing it's the wrong python virtual env or the virtual env is not activated

tough bolt Oct 3, 2021, 1:25 PM

#

tough bolt Is anyone familiar with PYG (pytorch geometric)?

bump

smoky stone Oct 3, 2021, 1:26 PM

#

serene scaffold you can do `which python` to see where the `python` you're using is located.

Okayy I'll do that, thanks

desert oar Oct 3, 2021, 1:56 PM

#

smoky stone Okayy I'll do that, thanks

Are you using conda/anaconda?

obtuse osprey Oct 3, 2021, 2:42 PM

#

hello

#

am trying to get into government bootcamp , and i got stuck in two question about machine learning models , any one can offer a help pls

#

its very basic 2 question's

#

prime hearth Oct 3, 2021, 2:55 PM

#

if trying to get in, its best that you find the answer yourself

#

giving answer is not helping in long run

#

however, you can find many examples for each of those from quick google search or examples from your own experience

obtuse osprey Oct 3, 2021, 3:06 PM

#

my problem is i really dont know anything about Sql , if i knew what he want from the Q i could search for it , but i got lost

prime hearth Oct 3, 2021, 3:11 PM

#

You can ask then for clarification and im sure they will help. Even in coding test like during job, if there is something dont understand it. always best to email the recruiter otherwise if just doing something dont understand it does not give a good impression, compared to asking for clarification it shows good communication

desert oar Oct 3, 2021, 3:55 PM

#

fortunately this has nothing to do with sql

wicked grove Oct 3, 2021, 4:35 PM

#

desert oar nltk has some primitive tools that could help with text processing in general, b...

Haha 😂🙈you are right,my bad
I do have another question tho
If have the coordinates of each word in the pdf ,would that be helpful for this tool?

#

^I

smoky stone Oct 3, 2021, 4:42 PM

#

desert oar Are you using conda/anaconda?

I use pip, I'm not very sure what I'm using☹️

arctic crown Oct 3, 2021, 4:56 PM

#

@serene scaffold you there?

serene scaffold Oct 3, 2021, 4:56 PM

#

arctic crown <@!253696366952316929> you there?

yes, but I don't know what you would like help with

arctic crown Oct 3, 2021, 4:56 PM

#

Can you please explain tensor

serene scaffold Oct 3, 2021, 4:57 PM

#

A tensor is the name for arrays used in PyTorch and Tensorflow. I believe the only difference is what methods they have, and that they can go on a GPU.

#

There's a distinction between "array" and "tensor" in pure mathematics, but I don't really understand it.

arctic crown Oct 3, 2021, 4:58 PM

#

got it so a tensor is basically an array or list

#

right?

serene scaffold Oct 3, 2021, 4:58 PM

#

it's an n-dimensional array, yes

arctic crown Oct 3, 2021, 4:59 PM

#

got it thx

azure marsh Oct 3, 2021, 5:22 PM

#

serene scaffold There's a distinction between "array" and "tensor" in pure mathematics, but I do...

Yeah, it's a bit obtuse:

https://math.stackexchange.com/questions/412423/what-are-the-differences-between-a-matrix-and-a-tensor

Mathematics Stack Exchange

What are the Differences Between a Matrix and a Tensor?

What is the difference between a matrix and a tensor? Or, what makes a tensor, a tensor? I know that a matrix is a table of values, right? But, a tensor?

serene scaffold Oct 3, 2021, 5:23 PM

#

azure marsh Yeah, it's a bit obtuse: https://math.stackexchange.com/questions/412423/what-a...

isn't a matrix an array that is strictly two-dimensional? so the question remains about arrays vs tensors specifically.

azure marsh Oct 3, 2021, 5:24 PM

#

No

#

A matrix is 2 dimensions or higher

#

a list is 1 dimensional, and array is N dimensional

#

list/matrix/array are more about the format

#

tensor is about interpretation

serene scaffold Oct 3, 2021, 5:25 PM

#

I thought a vector was one-dimensional and that "list" doesn't have a specific meaning here.

azure marsh Oct 3, 2021, 5:25 PM

#

but yes the more direct comparison is array and tensor, but the question deals with format vs meaning

#

vector and list can be thought of as similar in this scenario

#

vector is probably better suited like you mention

serene scaffold Oct 3, 2021, 5:26 PM

#

I'm adamant that we avoid the term "list" in these sorts of discussions as the python list data type does not support mathematical operations.

snow harbor Oct 3, 2021, 6:01 PM

#

Hi, whos meeting with development app for kinect 2.0 on python

#

I search on other forums, and i don't show more details

chilly geyser Oct 3, 2021, 6:40 PM

#

"vector" for 1D is just convention though, since matrices, tensors are all elements of some vector space (and so 'vectors' are elements of said spaces). There is no 'tensor space' in common language, tensors are multilinear maps

winter flume Oct 3, 2021, 6:44 PM

#

hello,

I have a video, and I want to identify the objects that move in it (movement in some direction, rotation, magnification, reduction ..), how can I calculate these changes between any two consecutive frames?
In addition, I also want a breakdown of all the details I can conclude are on the same object, for example for a person shirt color, pants color, hair, glasses ... or for example if it is a car, what is the license plate number, color, type. ..

azure marsh Oct 3, 2021, 7:00 PM

#

Object tracking and object descriptors

edgy hearth Oct 3, 2021, 7:26 PM

#

guy i have a doubt

#

it is a really simple one

#

really simple

#


data = pandas.read_csv("3.1 cost_revenue_clean.csv")```

#

you see i wanna run the csv file using pandas

#

but i cant

#

#

i get this error

desert oar Oct 3, 2021, 7:28 PM

#

...does the file actually exist?

#

look at the error message

edgy hearth Oct 3, 2021, 7:28 PM

#

desert oar ...does the file actually exist?

yes it does

drifting summit Oct 3, 2021, 7:28 PM

#

edgy hearth

put the file in the same directory as the code file

#

it will work

edgy hearth Oct 3, 2021, 7:29 PM

#

drifting summit put the file in the same directory as the code file

oke

#

lemme try that

#

i've heard that you should do that

#

but oke lemme try

drifting summit Oct 3, 2021, 7:30 PM

#

if that doesnt work

edgy hearth Oct 3, 2021, 7:30 PM

#

wait im confuesd ??

#

where should i put it

#

where all my vs codes are there ?

drifting summit Oct 3, 2021, 7:31 PM

#

directory is folder

desert oar Oct 3, 2021, 7:31 PM

#

you might need to figure out what VS Code's "current working directory" is

drifting summit Oct 3, 2021, 7:31 PM

#

lol u wud know that

desert oar Oct 3, 2021, 7:31 PM

#

it might not be where your code is

#

for now, try using the full path to the filename instead of just the filename

#

e.g. C:/Users/.../data.csv

#

(use forward slashes in python)

edgy hearth Oct 3, 2021, 7:31 PM

#

desert oar e.g. `C:/Users/.../data.csv`

yeah i did that

#

im not tha dumb lol

drifting summit Oct 3, 2021, 7:32 PM

#

edgy hearth where all my vs codes are there ?

send the file path

#

of the

#

code file

edgy hearth Oct 3, 2021, 7:32 PM

#

one sec

#

lemme try something

drifting summit Oct 3, 2021, 7:34 PM

#

edgy hearth lemme try something

that wont work

drifting summit Oct 3, 2021, 7:34 PM

#

drifting summit send the file path

do this

prime hearth Oct 3, 2021, 7:34 PM

#

it could also be that the file has a space

edgy hearth Oct 3, 2021, 7:34 PM

#

drifting summit that wont work

lol can you wait

prime hearth Oct 3, 2021, 7:34 PM

#

but i not sure about this

drifting summit Oct 3, 2021, 7:34 PM

#

edgy hearth lol can you wait

lmao

#

sure

desert oar Oct 3, 2021, 7:34 PM

#

prime hearth it could also be that the file has a space

python can handle files with spaces

edgy hearth Oct 3, 2021, 7:35 PM

#

have some patency my guy

drifting summit Oct 3, 2021, 7:35 PM

#

ok homie

prime hearth Oct 3, 2021, 7:35 PM

#

oh okay thanks , i dint know that

edgy hearth Oct 3, 2021, 7:35 PM

#

oke soo what do you need ?

#

the path of what the file which im trying to print

#

??

drifting summit Oct 3, 2021, 7:35 PM

#

noo

edgy hearth Oct 3, 2021, 7:35 PM

#

then ?

drifting summit Oct 3, 2021, 7:35 PM

#

the file in which u r writing the code

edgy hearth Oct 3, 2021, 7:36 PM

#

you mean the vs code right ?

drifting summit Oct 3, 2021, 7:36 PM

#

tkinter project.py

drifting summit Oct 3, 2021, 7:36 PM

#

drifting summit tkinter project.py

the file path of this file

#

HOMIE u there?

#

my guy

edgy hearth Oct 3, 2021, 7:37 PM

#

yeah yeah

drifting summit Oct 3, 2021, 7:37 PM

#

lol

edgy hearth Oct 3, 2021, 7:37 PM

#

C:\Users\aasim\OneDrive\Desktop\Visual Studio Codes\data science(2).py

#

will this work ?

drifting summit Oct 3, 2021, 7:38 PM

#

@desert oar salty boi help me out here

drifting summit Oct 3, 2021, 7:38 PM

#

edgy hearth ``C:\Users\aasim\OneDrive\Desktop\Visual Studio Codes\data science(2).py``

yeah

#

yeah

edgy hearth Oct 3, 2021, 7:39 PM

#

oke

drifting summit Oct 3, 2021, 7:39 PM

#

so

#

now

#

put the file u wanna print

#

in

#

"Visual Studio Codes"

edgy hearth Oct 3, 2021, 7:40 PM

#

when ever i try to run the code

#

i get this error

#

FileNotFoundError: [Errno 2] No such file or directory: "C:/AASIM'S STUFF/Python/videos/ml/Complete 2020 Data Science & Machine Learning Bootcamp/Complete 2020 Data Science & Machine Learning Bootcamp/2. Predict Movie Box Office Revenue with Linear Regression/3.1 cost_revenue_clean.csv"

drifting summit Oct 3, 2021, 7:40 PM

#

edgy hearth ``FileNotFoundError: [Errno 2] No such file or directory: "C:/AASIM'S STUFF/Pyth...

damn homie be flexing

edgy hearth Oct 3, 2021, 7:42 PM

#

drifting summit damn homie be flexing

yeah ikr

#

you see i have the file

#

drifting summit Oct 3, 2021, 7:44 PM

#

yeah ik u do

#

thats not the issue

edgy hearth Oct 3, 2021, 7:45 PM

#

drifting summit yeah ik u do

lol

#

atleast someone belives me

#

because the code definetly does not

drifting summit Oct 3, 2021, 7:46 PM

#

yeah

lapis sequoia Oct 3, 2021, 7:47 PM

#

@edgy hearth is the py file located in the same folder?

desert oar Oct 3, 2021, 7:48 PM

#

@edgy hearth what does import os; print(os.getcwd()) show?

lapis sequoia Oct 3, 2021, 7:48 PM

#

i was about to say that only

#

lol

drifting summit Oct 3, 2021, 7:50 PM

#

@edgy hearth he's fed up

#

😂

desert oar Oct 3, 2021, 7:57 PM

#

i noticed that they had "ENG" in their taskbar. i wonder if this could be a unicode issue, where they accidentally typed 2 characters that look identical but have different character codes

#

e.g. greek, turkish, and cyrillic all have various letters that look like latin letters, but are actually different code points

lapis sequoia Oct 3, 2021, 7:59 PM

#

desert oar i noticed that they had "ENG" in their taskbar. i wonder if this could be a unic...

i also have ENG in my task bar but i can access other files without anything extra

#

i dont think that would be the issue

desert oar Oct 3, 2021, 7:59 PM

#

right, i am just suspecting that maybe they had switched input modes and something got messed up

#

it's a totally wild guess at a weird issue

#

(which btw has nothing to do with pandas or data science)

lapis sequoia Oct 3, 2021, 8:00 PM

#

desert oar right, i am just suspecting that maybe they had switched input modes and somethi...

what input mode?

desert oar Oct 3, 2021, 8:00 PM

#

idk! i'm not them

#

i meant "the language that appears in your computer when you type on your keyboard"

lapis sequoia Oct 3, 2021, 8:01 PM

#

oh the "ENG" or whatever just means the keyboard layout

drifting summit Oct 3, 2021, 8:03 PM

#

desert oar i noticed that they had "ENG" in their taskbar. i wonder if this could be a unic...

SALTY BOI at it hard

desert oar Oct 3, 2021, 8:03 PM

#

yes, and if it's greek then you can have both T (LATIN CAPITAL LETTER T) and T (GREEK CAPITAL LETTER TAU) in the same document, which literally use the same glyph in pretty much every font

#

consider also that -–—− are 4 different code points, and some programs like MS Word might "helpfully" convert the first one into any of the other 3

drifting summit Oct 3, 2021, 8:06 PM

#

desert oar consider also that `-–—−` are 4 different code points, and some programs like MS...

SALTY BOI at it even harder

#

https://tenor.com/view/jeremy-clarkson-sometimes-my-genius-almost-frightening-driving-car-ride-gif-16463163

Tenor

azure marsh Oct 3, 2021, 8:16 PM

#

@edgy hearth How about opening an rich terminal like IPython and letting it tab completing the filename

#

You might also be bumping up against window's path length limit. There's a way you can change it, but try just moving it to C:\tmp\ and see if you can access it. It's supposed to be 260 characters (the above path is 229), but I've had issues with less than that for some reason, around the 200 mark, even with the supposed fix

arctic crown Oct 3, 2021, 8:37 PM

#

@serene scaffold you there?

#

can you please explain whats a dimention

serene scaffold Oct 3, 2021, 8:37 PM

#

arctic crown <@!253696366952316929> you there?

Yes, but it's best to just put your question out there rather than depend on the availability of one person.

#

Please paste that as text.

arctic crown Oct 3, 2021, 8:38 PM

#

"
Now that we've talked about the rank of tensors it's time to talk about the shape. The shape of a tensor is simply the number of elements that exist in each dimension.
TensorFlow will try to determine the shape of a tensor but sometimes it may be unknown. To get the shape of a tensor we use the shape attribute.

rank2_tensor.shape"

serene scaffold Oct 3, 2021, 8:39 PM

#

@arctic crown if the shape of a tensor is (3, 5, 2), how many dimensions do you think it has?

arctic crown Oct 3, 2021, 8:39 PM

#

3?\

serene scaffold Oct 3, 2021, 8:39 PM

#

Yes. It's like how a square is two dimensional, and a cube is three dimensional.

arctic crown Oct 3, 2021, 8:40 PM

#

so this is a cube

#

(3, 5, 2),

#

and this (3,5) is a square

serene scaffold Oct 3, 2021, 8:40 PM

#

It is the shape of a three dimensional array or tensor.

#

It's not a perfect analogy. "Square matrix" has a specific meaning

#

Namely a two dimensional array where the lengths of each dimension are the same.

arctic crown Oct 3, 2021, 8:41 PM

#

ok now can you please explain Rank/Degree of Tensors

desert oar Oct 3, 2021, 8:47 PM

#

"rank" is a linear algebra term

arctic crown Oct 3, 2021, 8:59 PM

#

@serene scaffold sorry for the pings but can you please explain

#

Changing Shapes of tensors

serene scaffold Oct 3, 2021, 9:00 PM

#

what is your question?

arctic crown Oct 3, 2021, 9:01 PM

#

how Changing Shapes of tensors works

iron basalt Oct 3, 2021, 9:03 PM

#

arctic crown ok now can you please explain Rank/Degree of Tensors

https://mathworld.wolfram.com/TensorRank.html

serene scaffold Oct 3, 2021, 9:05 PM

#

arctic crown how Changing Shapes of tensors works

why are you interested to know how to change the shape of a tensor?

arctic crown Oct 3, 2021, 9:06 PM

#

i am learning tesorflow

serene scaffold Oct 3, 2021, 9:08 PM

#

arctic crown i am learning tesorflow

In [8]: tensor
Out[8]: tensor([ 0.,  1.,  2.,  3.,  4.,  5.,  6.,  7.,  8.,  9., 10., 11.])

In [9]: tensor.shape
Out[9]: torch.Size([12])

In [10]: tensor.reshape(6, 2)
Out[10]:
tensor([[ 0.,  1.],
        [ 2.,  3.],
        [ 4.,  5.],
        [ 6.,  7.],
        [ 8.,  9.],
        [10., 11.]])

In [11]: tensor.reshape(4, 3)
Out[11]:
tensor([[ 0.,  1.,  2.],
        [ 3.,  4.,  5.],
        [ 6.,  7.,  8.],
        [ 9., 10., 11.]])

In [12]: tensor.reshape(3, 2, 2)
Out[12]:
tensor([[[ 0.,  1.],
         [ 2.,  3.]],

        [[ 4.,  5.],
         [ 6.,  7.]],

        [[ 8.,  9.],
         [10., 11.]]])

This is with pytorch rather than tensorflow

#

do you see what (6, 2), (4, 3), and (3, 2, 2) all share in relation to 12?

arctic crown Oct 3, 2021, 9:09 PM

#

serene scaffold Oct 3, 2021, 9:09 PM

#

*?

arctic crown Oct 3, 2021, 9:15 PM

#

the all times

#

= 12

serene scaffold Oct 3, 2021, 9:15 PM

#

yes, good job!

arctic crown Oct 3, 2021, 9:17 PM

#

ty

#

so (x,y) x is the number of columns you want and y is the amount of rows

serene scaffold Oct 3, 2021, 9:19 PM

#

arctic crown so (x,y) x is the number of columns you want and y is the amount of rows

backwards 😄

#

look at this one again

In [10]: tensor.reshape(6, 2)
Out[10]:
tensor([[ 0.,  1.],
        [ 2.,  3.],
        [ 4.,  5.],
        [ 6.,  7.],
        [ 8.,  9.],
        [10., 11.]])

arctic crown Oct 3, 2021, 9:20 PM

#

ah yea

#

(y,x)

#

i am confused with this tho (3, 2, 2) how does this work?

serene scaffold Oct 3, 2021, 9:22 PM

#

It's like this

arctic crown Oct 3, 2021, 9:22 PM

#

hm?

serene scaffold Oct 3, 2021, 9:23 PM

#

The original tensor is 0 to 11. So it divides that into three equal parts, and then divides each of those three into two equal parts (with two remaining in each)

#

You can see how each of the four values in the outermost dimension are consecutive numbers.

In [12]: tensor.reshape(3, 2, 2)
Out[12]:
tensor([[[ 0.,  1.],
         [ 2.,  3.]],

        [[ 4.,  5.],
         [ 6.,  7.]],

        [[ 8.,  9.],
         [10., 11.]]])

arctic crown Oct 3, 2021, 9:24 PM

#

got it thx

#

@serene scaffold whats tf.ones

serene scaffold Oct 3, 2021, 9:32 PM

#

arctic crown <@!253696366952316929> whats tf.ones

did you try reading the documentation? https://www.tensorflow.org/api_docs/python/tf/ones

TensorFlow

tf.ones | TensorFlow Core v2.6.0

Creates a tensor with all elements set to one (1).

#

I'm going to be gone for the next few hours btw

arctic crown Oct 3, 2021, 9:33 PM

#

got it

#

can someone please explain linear regression

stark zenith Oct 3, 2021, 9:47 PM

#

arctic crown can someone please explain linear regression

regression is used to predict continuous values

#

linear means along a line, but that might be a very simplistic explanation

#

linear regression "tries to find a line where the mean of the squared errors between estimated points on the line and the actual points is minimal"

worldly lake Oct 3, 2021, 10:00 PM

#

hello, do u know how i can with my code:

    for n in photos_data['data']['product_images']:
        print(n['url'])

Out: 
product_images/with_watermark/877/2225877.jpg
product_images/with_watermark/848/2225848.jpg
product_images/with_watermark/849/5973849.jpg
product_images/with_watermark/851/5973851.jpg
product_images/with_watermark/852/5973852.jpg
product_images/with_watermark/850/5973850.jpg
product_images/with_watermark/507/7246507.jpg

select only .jpg name?

desert oar Oct 3, 2021, 10:01 PM

#

worldly lake hello, do u know how i can with my code: ``` for n in photos_data['data']['p...

what is photos_data? a dict? a dataframe? something else?

#

but in general you can just use if inside the loop

#

and either use .endswith() or regex

#

!d str.endswith

arctic wedgeBOT Oct 3, 2021, 10:02 PM

#

str.endswith


str.endswith(suffix[, start[, end]])```
Return `True` if the string ends with the specified *suffix*, otherwise return `False`. *suffix* can also be a tuple of suffixes to look for. With optional *start*, test beginning at that position. With optional *end*, stop comparing at that position.

worldly lake Oct 3, 2021, 10:03 PM

#

@desert oara dict

desert oar Oct 3, 2021, 10:03 PM

#

then do what i said. and this isn't a data science question. see #❓｜how-to-get-help for general python questions

worldly lake Oct 3, 2021, 10:03 PM

#

oh, im sorry, and thank u 🙂

desert oar Oct 3, 2021, 10:08 PM

#

that's ok. "data science" has to do with statistics, machine learning, etc

eager night Oct 3, 2021, 10:18 PM

#

serene scaffold Look into time series forecasting

I am having troubles with classifying the different groups. For example, how would I differentiate between Store A, Store B, Store C when I am processing the data?

serene scaffold Oct 3, 2021, 10:19 PM

#

eager night I am having troubles with classifying the different groups. For example, how wou...

What format is the data in again? Doesn't one of the columns tell you this?

eager night Oct 3, 2021, 10:21 PM

#

Yes it does, but is there a specific way I need to process the data? Like how would I specify the class

stark zenith Oct 3, 2021, 10:22 PM

#

Would a Pandas question be appropriate here @desert oar ? I've been working on a work thing but can't seem to figure it out.

#

I have to be really vague too since it's a work thing. 😛

serene scaffold Oct 3, 2021, 10:27 PM

#

stark zenith Would a Pandas question be appropriate here <@!389497659087650836> ? I've been w...

You can ask pandas questions, yes. The best way to get help with that is to provide a sample of your data frame as text that can be copied and pasted, and the desired output.

#

Remember that just displaying the dataframe will usually clip columns, which might make the example useless.

slender wyvern Oct 3, 2021, 10:40 PM

#

I was tinkering around with numpy dtypes and noticed that when using structured data types, while all the relevant attributes for an iterable object are exposed, I can't iterate the data type. Any ideas, why this is not allowed?

#

!e example:

import numpy as np

# see https://docs.python.org/3/reference/datamodel.html#special-method-names
class A:
    a = [0, 1, 2]

    def __len__(self):
        return len(self.a)

    def __getitem__(self, key):
        return self.a[key]

a = A()

print([i for i in a])  # [0, 1, 2]
print(a[0], a[1], a[2], len(a), '\n')  # 0 1 2 3

xyz = np.dtype([("x", np.float_), ("y", np.int8), ("z", np.int8)])
print(xyz[0], xyz[1], xyz[2], len(xyz))  # float64 int8 int8 3
print(hasattr(xyz, "__getitem__"), hasattr(xyz, "__len__")) # True, True

# I have to use this, but...
print([x[0] for x in xyz.fields.values()])  # [dtype('float64'), dtype('int8'), dtype('int8')]
# ...why does that not work though:
[x for x in xyz]  # 'numpy.dtype[void]' object is not iterable

arctic wedgeBOT Oct 3, 2021, 10:40 PM

#

@slender wyvern :x: Your eval job has completed with return code 1.

001 | [0, 1, 2]
002 | 0 1 2 3 
003 | 
004 | float64 int8 int8 3
005 | True True
006 | [dtype('float64'), dtype('int8'), dtype('int8')]
007 | Traceback (most recent call last):
008 |   File "<string>", line 25, in <module>
009 | TypeError: 'numpy.dtype[void]' object is not iterable

stark zenith Oct 3, 2021, 10:41 PM

#

serene scaffold You can ask pandas questions, yes. The best way to get help with that is to prov...

Thanks. Not sure what I can share though, since it's company data, but I also don't know how much I can relay just by asking broad questions.

serene scaffold Oct 3, 2021, 10:44 PM

#

stark zenith Thanks. Not sure what I can share though, since it's company data, but I also do...

You can make random data of the same types with arbitrary column names

velvet thorn Oct 3, 2021, 10:49 PM

#

slender wyvern I was tinkering around with numpy dtypes and noticed that when using structured ...

you can't iterate through a class with __getitem__ if __iter__ is set to None, which I would guess is the case

#

!e

class X:
    __iter__ = None

    def __len__(self):
        return 1

    def __getitem__(self, key):
        return 1

print([x for x in X()])

arctic wedgeBOT Oct 3, 2021, 10:50 PM

#

@velvet thorn :x: Your eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 10, in <module>
003 | TypeError: 'X' object is not iterable

stark zenith Oct 3, 2021, 10:50 PM

#

serene scaffold You can make random data of the same types with arbitrary column names

That's a good idea, but I think I have half of it figure out so far. But I'm also halfway into 'best approach paralysis' too. :/

slender wyvern Oct 3, 2021, 10:51 PM

#

but then it should have the attribute __iter__ I'd assume

#

!e

import numpy as np
xyz = np.dtype([("x", np.float_), ("y", np.int8), ("z", np.int8)])
print(hasattr(xyz, "__iter__")) # False

arctic wedgeBOT Oct 3, 2021, 10:51 PM

#

@slender wyvern :white_check_mark: Your eval job has completed with return code 0.

False

velvet thorn Oct 3, 2021, 10:52 PM

#

slender wyvern but then it should have the attribute `__iter__` I'd assume

probably done on the C side

#

I would guess that the interpreter checks tp_iter first

serene scaffold Oct 3, 2021, 10:53 PM

#

stark zenith That's a good idea, but I think I have half of it figure out so far. But I'm als...

If you explain what you're trying to do generally, I may or may not be able to point you in the right direction

stark zenith Oct 3, 2021, 10:54 PM

#

serene scaffold If you explain what you're trying to do generally, I may or may not be able to p...

At this point, I have a bunch of one-hot encoded columns in Pandas - what is a good way of going through the columns and setting values to another column value if it's greater than 1?

#

For context, what I'm trying to do is get a picture of the same-ness between a bunch of different 3rd party hotel booking channels grabbed through metasearch.

serene scaffold Oct 3, 2021, 10:55 PM

#

I think I can make an example when I get back to my desktop in a few hours.

old thorn Oct 3, 2021, 10:55 PM

#

what would be the best way to even out sample size to reduce bias, I just trained a Logistic Regression and the bias is crazy

#

I actually am not sure what to do

velvet thorn Oct 3, 2021, 10:56 PM

#

stark zenith At this point, I have a bunch of one-hot encoded columns in Pandas - what is a g...

do you have a more concrete example

#

which source/destination columns?

stark zenith Oct 3, 2021, 11:00 PM

#

velvet thorn which source/destination columns?

I've already created 20 columns from the top 20 Channels as one-hot encoding. I want to change the 1's in those columns to match the values in another column, which is the Channel Rate column.

velvet thorn Oct 3, 2021, 11:01 PM

#

stark zenith I've already created 20 columns from the top 20 Channels as one-hot encoding. I ...

all of them?

stark zenith Oct 3, 2021, 11:02 PM

#

velvet thorn all of them?

All of the 1's, yes, and all of the new columns.

velvet thorn Oct 3, 2021, 11:02 PM

#

stark zenith I've already created 20 columns from the top 20 Channels as one-hot encoding. I ...

are there any columns other than those 21

stark zenith Oct 3, 2021, 11:04 PM

#

velvet thorn are there any columns other than those 21

Yes, but I've already sorted a lot of them - some are categorical that I've had to sort for only one type. One of those columns is the Hotel Name, which I'm hoping to Group By on - this way I have a single hotel for each line, with columns of the individual Channel Rate in each applicable column.

velvet thorn Oct 3, 2021, 11:06 PM

#

stark zenith Yes, but I've already sorted a lot of them - some are categorical that I've had ...

hm

#

I have this as a first approximation

#

!e

import pandas as pd

s = pd.DataFrame([[0, 1, 'one'], [1, 0, 'two'], [1, 1, 'three'], [0, 0, 'four']], columns=['a', 'b', 'r'])

print(s)

print(s[['a', 'b']].where(~s[['a', 'b']].astype(bool), s['r'], axis=0))

arctic wedgeBOT Oct 3, 2021, 11:06 PM

#

@velvet thorn :white_check_mark: Your eval job has completed with return code 0.

001 |    a  b      r
002 | 0  0  1    one
003 | 1  1  0    two
004 | 2  1  1  three
005 | 3  0  0   four
006 |        a      b
007 | 0      0    one
008 | 1    two      0
009 | 2  three  three
010 | 3      0      0

stark zenith Oct 3, 2021, 11:10 PM

#

velvet thorn !e ```py import pandas as pd s = pd.DataFrame([[0, 1, 'one'], [1, 0, 'two'], [1...

Thanks! I'm having a look. I appreciate the help.

stark zenith Oct 3, 2021, 11:15 PM

#

velvet thorn !e ```py import pandas as pd s = pd.DataFrame([[0, 1, 'one'], [1, 0, 'two'], [1...

Yes, this is really close to what I'm thinking. would anything change if the s['r'] column is an integer instead of string?

velvet thorn Oct 3, 2021, 11:15 PM

#

stark zenith Yes, this is *really* close to what I'm thinking. would anything change if the s...

no

stark zenith Oct 3, 2021, 11:20 PM

#

velvet thorn no

So to get an idea of what it is doing, it's replacing with values from 'r' if it isn't considered 'bool'? or if it isn't considered 'True'?

velvet thorn Oct 3, 2021, 11:20 PM

#

stark zenith So to get an idea of what it is doing, it's replacing with values from 'r' if it...

the latter

#

.astype converts values to boolean

#

then that's passed into .where

stark zenith Oct 3, 2021, 11:20 PM

#

Ah, and 1 is True and 0 is False?

velvet thorn Oct 3, 2021, 11:21 PM

#

left.where(condition, right) basically replaces values from left with those from right if the corrresponding value in condition is False

#

but we want to replace the True values

#

so we invert with ~

velvet thorn Oct 3, 2021, 11:21 PM

#

stark zenith Ah, and 1 is True and 0 is False?

yes

desert oar Oct 3, 2021, 11:40 PM

#

stark zenith Would a Pandas question be appropriate here <@!389497659087650836> ? I've been w...

Yes this is a good channel for pandas

quasi torrent Oct 4, 2021, 12:00 AM

#

can I ask a question about keras python here?

serene scaffold Oct 4, 2021, 12:02 AM

#

quasi torrent can I ask a question about keras python here?

Yes

quasi torrent Oct 4, 2021, 12:03 AM

#

ok so I am trying to build a regression model where the inputs are x,y, and z (which are floats) and the output is a mathematical function f(x,y,z)=0.1x*cos(2y+5z)

#

here is how I generated the data set

#

the inputFunc function is simply f(x,y,z)

#

this is the sequential model that I built. The input shape is a numpy array in the form [x y z] and the output is the corresponding f(x,y,z). I normalized the input data from 0 to 1

#

This is the compiling and I am using the Adam optimizer

#

this is the model.fit method line

#

the problem is my loss functions are not changing when I train the model

#

Any clue what could be the reasons?

desert oar Oct 4, 2021, 12:14 AM

#

I don't know the answer but it helps if you post text, not screenshots

#

!paste

arctic wedgeBOT Oct 4, 2021, 12:15 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

desert oar Oct 4, 2021, 12:15 AM

#

Maybe learning rate is way too low?

austere swift Oct 4, 2021, 12:15 AM

#

it looks like its going down, but very slowly

#

yeah try increasing your learning rate

desert oar Oct 4, 2021, 12:16 AM

#

That or the features are so totally unrelated to the labels that there's nothing to learn

austere swift Oct 4, 2021, 12:16 AM

#

it could also just be a case of underfitting, if increasing the learning rate doesnt help you can try increasing the complexity of your model

quasi torrent Oct 4, 2021, 12:20 AM

#

Epoch 89/100 45/45 - 0s - loss: 0.2182 - mse: 0.2182 - mae: 0.3756 - val_loss: 0.2556 - val_mse: 0.2556 - val_mae: 0.4205 Epoch 90/100 45/45 - 0s - loss: 0.2179 - mse: 0.2179 - mae: 0.3757 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4226 Epoch 91/100 45/45 - 0s - loss: 0.2186 - mse: 0.2186 - mae: 0.3748 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4214 Epoch 92/100 45/45 - 0s - loss: 0.2183 - mse: 0.2183 - mae: 0.3748 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4210 Epoch 93/100 45/45 - 0s - loss: 0.2183 - mse: 0.2183 - mae: 0.3753 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4210 Epoch 94/100 45/45 - 0s - loss: 0.2180 - mse: 0.2180 - mae: 0.3753 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4212 Epoch 95/100 45/45 - 0s - loss: 0.2189 - mse: 0.2189 - mae: 0.3753 - val_loss: 0.2555 - val_mse: 0.2555 - val_mae: 0.4209 Epoch 96/100 45/45 - 0s - loss: 0.2179 - mse: 0.2179 - mae: 0.3746 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4210 Epoch 97/100 45/45 - 0s - loss: 0.2180 - mse: 0.2180 - mae: 0.3749 - val_loss: 0.2557 - val_mse: 0.2557 - val_mae: 0.4203 Epoch 98/100 45/45 - 0s - loss: 0.2185 - mse: 0.2185 - mae: 0.3756 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4226 Epoch 99/100 45/45 - 0s - loss: 0.2182 - mse: 0.2182 - mae: 0.3747 - val_loss: 0.2560 - val_mse: 0.2560 - val_mae: 0.4200 Epoch 100/100 45/45 - 0s - loss: 0.2189 - mse: 0.2189 - mae: 0.3765 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4222

quasi torrent Oct 4, 2021, 12:20 AM

#

quasi torrent `Epoch 89/100 45/45 - 0s - loss: 0.2182 - mse: 0.2182 - mae: 0.3756 - val_loss: ...

This is changing the learning rate to 0.01

quasi torrent Oct 4, 2021, 12:21 AM

#

austere swift it could also just be a case of underfitting, if increasing the learning rate do...

by increasing complexity do you mean adding more layers?

austere swift Oct 4, 2021, 12:22 AM

#

yes or making the layers larger

quasi torrent Oct 4, 2021, 12:25 AM

#

https://colab.research.google.com/drive/1GuVbdlAbkcHec_N3-ljEKFq8e4g37dB4?usp=sharing

Google Colaboratory

#

here is also the full program

#

so I changed my model to this:model = Sequential([ Dense(units=64,input_shape=(3,),activation='relu'), Dense(units=120,activation='relu'), Dense(units=100,activation='relu'), Dense(units=100,activation='relu'), Dense(units=1) ])

#

Epoch 1/100 45/45 - 1s - loss: 0.2311 - mse: 0.2311 - mae: 0.3865 - val_loss: 0.2580 - val_mse: 0.2580 - val_mae: 0.4207 Epoch 2/100 45/45 - 0s - loss: 0.2209 - mse: 0.2209 - mae: 0.3771 - val_loss: 0.2636 - val_mse: 0.2636 - val_mae: 0.4218 Epoch 3/100 45/45 - 0s - loss: 0.2221 - mse: 0.2221 - mae: 0.3784 - val_loss: 0.2564 - val_mse: 0.2564 - val_mae: 0.4242 Epoch 4/100 45/45 - 0s - loss: 0.2200 - mse: 0.2200 - mae: 0.3768 - val_loss: 0.2627 - val_mse: 0.2627 - val_mae: 0.4225 Epoch 5/100 45/45 - 0s - loss: 0.2190 - mse: 0.2190 - mae: 0.3774 - val_loss: 0.2557 - val_mse: 0.2557 - val_mae: 0.4204 Epoch 6/100 45/45 - 0s - loss: 0.2185 - mse: 0.2185 - mae: 0.3753 - val_loss: 0.2554 - val_mse: 0.2554 - val_mae: 0.4212 Epoch 7/100 45/45 - 0s - loss: 0.2180 - mse: 0.2180 - mae: 0.3756 - val_loss: 0.2555 - val_mse: 0.2555 - val_mae: 0.4208

#

the losses are sort of still going down really slowly

#

if not the same

#

https://paste.pythondiscord.com/nepobusula.typescript

desert oar Oct 4, 2021, 12:41 AM

#

Honestly those labels look really random

#

Wait

#

Oh i see this is a simulated dataset

#

That min max scaler seems questionable

#

Well i guess you know the input data is bounded

#

Hm

#

Can you also print the gradients somehow

#

I'd be curious if this works with a smaller network

#

That's a lot if parameters for 500 data points, maybe it's ok but my instinct would be to generate a lot more simulated points or make the network a lot smaller

quasi torrent Oct 4, 2021, 12:50 AM

#

How do I print the gradients? I'm pretty new to machine learning

quasi torrent Oct 4, 2021, 12:54 AM

#

desert oar That's a lot if parameters for 500 data points, maybe it's ok but my instinct wo...

is 20,000 data points ok?

tender hearth Oct 4, 2021, 1:40 AM

#

How do I apply a function to the 2nd dimension of a Tensor?

#

For context, I'm writing a custom collate_fn to process a batch of audio waveforms

#

I'm padding them using torch.nn.utils.rnn.pad_sequence which returns a Tensor 1 rank higher

#

now I need to apply torchaudio.transforms.MelSpectrogram which produces a 2D Tensor to each Tensor in dim 0

rough fulcrum Oct 4, 2021, 2:22 AM

#

My code keeps on stopping after i use my if statement and i want to use my if statement multiple times `import speech_recognition as sr
import pyautogui
import time

r = sr.Recognizer()
mic = sr.Microphone()

with mic as source:
r.adjust_for_ambient_noise(source)
audio = r.listen(source)

recognition = {
"success": True,
"error": None,
"transcription": None
}

Number = 0

try:
recognition["transcription"] = r.recognize_google(audio)
except sr.RequestError:
recognition["success"] = False
recognition["error"] = "API unavailable"
except sr.UnknownValueError:
pass

if recognition["transcription"] == "left":
pyautogui.press('w')
`

next lance Oct 4, 2021, 3:08 AM

#

I see many people using Jupiter Notebook while Image detection using tenserflow
Can I not use Pycharm?

royal crest Oct 4, 2021, 3:18 AM

#

next lance I see many people using Jupiter Notebook while Image detection using tenserflow ...

you can use jupyter nb on pycharm

next lance Oct 4, 2021, 3:19 AM

#

Ohh

#

just do pip install Jupiter-notebook?

pastel valley Oct 4, 2021, 3:20 AM

#

is this what numpy.linalg.solve() do?

stark zenith Oct 4, 2021, 3:31 AM

#

pastel valley is this what numpy.linalg.solve() do?

I think so, I remember making a notebook to cheat in Precalc earlier this year using Numpy.

#

mostly to do the really tedious shit

#

actually I'd use sympy to do sympy.apart because that was tedious

#

np.cross is good for cross-product stuff

#

very useful

pastel valley Oct 4, 2021, 3:42 AM

#

sir can you help understand this very confusing math

#

i dont ask for code but what should i implement in this

#

#

is it a program to find a basis given a vectors and scalars?

quaint bloom Oct 4, 2021, 3:47 AM

#

How does Wasserstein loss affect CycleGANs? I've seen ppl using it for other GANs but not much for CycleGAN

worldly lake Oct 4, 2021, 4:50 AM

#

hello, does anyone know how to combine all the elements of the SKU column by listing the URL column separated by commas?
Like:

{"sku": 43956, "url": "7222021.jpg, 7222019.jpg, 7222017.jpg, 4176997.jpg, 2518544.jpg, 2518520.jpg, 2518488.jpg, ..."}

velvet thorn Oct 4, 2021, 9:05 AM

#

worldly lake hello, does anyone know how to combine all the elements of the SKU column by lis...

', '.join(row['url'] for row in data)

#

would be a good start

drifting mason Oct 4, 2021, 10:50 AM

#

Can someone guide me where am I going wrong?

#

heavy crow Oct 4, 2021, 10:51 AM

#

#

My loss resets after each epoch for some reason. Any ideas why?

#

I'm training a efficient net V2 backbone to recognize a bunch of classes

drifting mason Oct 4, 2021, 10:52 AM

#

drifting mason

any idea guys?

primal tulip Oct 4, 2021, 11:00 AM

#

drifting mason any idea guys?

try

plt.boxplot(cortisone.Cushings)

or

plt.boxplot(cortisone['Cushings'])

The issue being python thinks Cushings is a variable and not part of your pandas Dataframe. You need to specify correctly the Pandas syntax.

drifting mason Oct 4, 2021, 11:23 AM

#

@primal tulip Thanks a lot, how can I plot two box plots at the same time, please can you help me?

primal tulip Oct 4, 2021, 11:36 AM

#

drifting mason <@!234094728805613569> Thanks a lot, how can I plot two box plots at the same t...

You need to create multiple graphs in the same plot. For that you need to specify it when you create the fig and ax elements. Then you create your graphs with different labels and finally you plot.

You can read this stackoverflow question for reference.
https://stackoverflow.com/questions/42734109/two-or-more-graphs-in-one-plot-with-different-x-axis-and-y-axis-scales-in-pyth

upper granite Oct 4, 2021, 12:00 PM

#

Hi guys
have you ever seen this error in DBeaver
SQL Error [16777232]: Query failed (#20211004_115307_00151_s2r9w): Error reading tail from s3://some-bucket/folder/folder/part-00010-0287d64b-292f-428e-9da5-10e61bd353c1-c000.snappy.parquet with length 16384
I have delta table in S3

royal crest Oct 4, 2021, 12:16 PM

#

try #databases ?

lapis sequoia Oct 4, 2021, 12:29 PM

#

Good guys @primal tulip always there to help 😜

drifting mason Oct 4, 2021, 1:12 PM

#

@primal tulip My data looks like this

#

and the box-plot looks like this, with the second box-plot being null

#

like this

desert oar Oct 4, 2021, 1:12 PM

#

drifting mason Can someone guide me where am I going wrong?

to explain what's happening from a python perspective: Cushings is not a variable name, it's a column in the cortisone dataframe

drifting mason Oct 4, 2021, 1:13 PM

#

I got it, but now, am I going wrong somewhere, pls can u look into it

#

2 seems null

#

there is data tho

#

basically, the box-plot for healthy is not being generated

harsh bear Oct 4, 2021, 1:14 PM

#

Hello. I need help with making graph from csv.. Basically I know nothing abt CSV

drifting mason Oct 4, 2021, 1:15 PM

#

this may help

#

https://stackoverflow.com/questions/26903304/reading-data-from-a-csv-file-in-python

Stack Overflow

Reading data from a CSV file in Python

I am reading data from a CSV file (xyz.CSV) which contains below data:

col1,col2,col3,col4
name1,empId1,241682-27638-USD-CIGGNT ,1
name2,empId2,241682-27638-USD-OCGGINT ,1
name3,empId3,241942-3719...

graceful birch Oct 4, 2021, 1:15 PM

#

Do anyone have experience with the turtle module of python

harsh bear Oct 4, 2021, 1:15 PM

#

just one sec

graceful birch Oct 4, 2021, 1:15 PM

#

Because I need to import an image and it hates me

drifting mason Oct 4, 2021, 1:15 PM

#

drifting mason like this

can somebody please help

harsh bear Oct 4, 2021, 1:15 PM

#

harsh bear Hello. I need help with making graph from csv.. Basically I know nothing abt CSV

I have 2 csv files, with different data, and same date

#

I want it to plot a graph

drifting mason Oct 4, 2021, 1:15 PM

#

Read about Matplotlib

harsh bear Oct 4, 2021, 1:16 PM

#

i did

drifting mason Oct 4, 2021, 1:16 PM

#

Which part exactly do you need help at

drifting mason Oct 4, 2021, 1:17 PM

#

drifting mason can somebody please help

guys please help

arctic wedgeBOT Oct 4, 2021, 1:17 PM

#

Hey @harsh bear!

Uh-oh! It looks like your message got zapped by our spam filter. We currently don't allow .csv attachments, so here are some tips to help you travel safely:

• If you attempted to send a message longer than 2000 characters, try shortening your message to fit within the character limit or use a pasting service (see below)

• If you tried to show someone your code, you can use codeblocks
(run !code-blocks in #bot-commands for more information) or use a pasting service like:

https://paste.pythondiscord.com

#

Hey @harsh bear!

Uh-oh! It looks like your message got zapped by our spam filter. We currently don't allow .csv attachments, so here are some tips to help you travel safely:

• If you attempted to send a message longer than 2000 characters, try shortening your message to fit within the character limit or use a pasting service (see below)

• If you tried to show someone your code, you can use codeblocks
(run !code-blocks in #bot-commands for more information) or use a pasting service like:

https://paste.pythondiscord.com

desert oar Oct 4, 2021, 1:17 PM

#

graceful birch Do anyone have experience with the turtle module of python

try #game-development

harsh bear Oct 4, 2021, 1:18 PM

#

https://paste.pythondiscord.com/kozoreroja.lua

#

These r my 2 csv files which automatically get daily data from my bot

#

I want to be able to get the date as the horizontal column

#

And 2 bar graphs, with one being the server count other member count

#

Can any1 understand me?

desert oar Oct 4, 2021, 1:19 PM

#

sure, thanks for providing the csv data

dense lintel Oct 4, 2021, 1:20 PM

#

where can i learn some machine learning?

#

im a beginner in ai

#

i know python pretty well

desert oar Oct 4, 2021, 1:20 PM

#

we have some pinned resources

dense lintel Oct 4, 2021, 1:21 PM

#

right, thanks

#

which is "easier" to use, pytorch or tensorflow?

desert oar Oct 4, 2021, 1:25 PM

#

if you already know python, i recommend:

starting to learn probability and stats, maybe try "Bayesian Methods for Hackers" https://camdavidsonpilon.github.io/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers/
start messing around with pandas; get some csv datasets and practice making useful data visualizations. start reading Tufte "The Visual Display of Quantitative Information", then Wilke "Fundamentals of Data Visualization"(https://clauswilke.com/dataviz/), and at least skim Cleveland "The Elements of Graphing Data"
start messing around with scikit-learn, xgboost, and pytorch (i think it's easier to use than tensorflow). load up some easy datasets like kaggle Titanic, and/or simulate some data, and start fitting models, try stuff, see what works, make nice visualizations thereof, practice writing up / explaining your process and results

Bayesian Methods for Hackers

Bayesian Methods for Hackers : An intro to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view.

dense lintel Oct 4, 2021, 1:25 PM

#

alright thanks!

desert oar Oct 4, 2021, 1:25 PM

#

added a link to the first book @dense lintel

dense lintel Oct 4, 2021, 1:26 PM

#

yep ill read it

desert oar Oct 4, 2021, 1:26 PM

#

also great resources:
http://themlbook.com/
https://otexts.com/fpp3/

The Hundred-Page Machine Learning Book by Andriy Burkov

All you need to know about Machine Learning in a hundred pages. Supervised and unsupervised learning, support vector machines, neural networks, ensemble methods, gradient descent, cluster analysis and dimensionality reduction, autoencoders and transfer learning, feature engineering and hyperparameter tuning! Math, intuition, illustrations, all i...

Forecasting: Principles and Practice (3rd ed)

3rd edition

#

and definitely start learning linear algebra and calculus if you don't know them already

#

you will need them in order to understand how this stuff works

#

sometimes even in order to understand the software documentation

dense lintel Oct 4, 2021, 1:30 PM

#

i dunno calculus but i do know linear algebra

bold timber Oct 4, 2021, 1:54 PM

#

whether we should transform the target variable into normal distributions?

desert oar Oct 4, 2021, 1:58 PM

#

harsh bear https://paste.pythondiscord.com/kozoreroja.lua

maybe like this?

import matplotlib.pyplot as plt
import pandas as pd

# Read members from CSV
members = pd.read_csv('members.csv', header=None)
members.columns = ['date', 'count']

# Read servers from CSV
servers = pd.read_csv('servers.csv', header=None)
servers.columns = ['date', 'count']

# Make a new "figure" with two side-by-side plotting areas
# A plotting area is an "axes" in matplolib terms
fig, ax = plt.subplots((1, 2))

# Plot each dataset onto one of the plotting areas
members.plot.bar('date', 'count', ax=ax[0])
servers.plot.bar('date', 'count', ax=ax[1])

# Display the plot
plt.show()

desert oar Oct 4, 2021, 1:59 PM

#

bold timber whether we should transform the target variable into normal distributions?

doesn't have to be normal specifically, but if it has a "nicer" distribution it might be easier for the model to learn. normality assumptions are usually about conditional normality anyway, which isn't something you can easily transform to

bold timber Oct 4, 2021, 2:02 PM

#

desert oar doesn't have to be normal specifically, but if it has a "nicer" distribution it ...

I recently seeing a practical use transform to target variable by using logarithmic. And now, I'm confuse

desert oar Oct 4, 2021, 2:03 PM

#

bold timber I recently seeing a practical use transform to target variable by using logarith...

yes, that's a very typical thing to do. if the data is spread over several "orders of magnitude", the logarithmic transformation helps put all the data onto a similar scale

bold timber Oct 4, 2021, 2:14 PM

#

desert oar yes, that's a very typical thing to do. if the data is spread over several "orde...

how do we know when we should use transform for target variable?

desert oar Oct 4, 2021, 2:19 PM

#

bold timber how do we know when we should use transform for target variable?

there's no single rule. but generally if the data distribution crosses several orders of magnitude, you should do something to bring it all within a similar order of magnitude

#

similarly, if the numbers are very large or very small, you might need to center and scale, e.g. subtract the training set mean and divide by the training set standard deviation

#

you might want to look into the "box-cox" family of transformations, of which the logarithm is one special case. you can also look into the "inverse hyperbolic sine (IHS)" transformation if your data can be zero or negative

bold timber Oct 4, 2021, 2:21 PM

#

desert oar there's no single rule. but generally if the data distribution crosses several o...

if the target variable have a skewed we should be transform it?

desert oar Oct 4, 2021, 2:22 PM

#

yes, heavily skewed data or data with other "weird" statistical properties like "fat tails" can benefit from transformation

#

square roots are another valid transformation. anything differentiable and monotonic can work

#

https://ronakchhatbar.medium.com/box-cox-transformation-cba8263c5206

Medium

Box-Cox Transformation.

The real-world data is not always distributed the way we want, that is Normal-Distribution It is always distributed in some distribution…

bold timber Oct 4, 2021, 2:22 PM

#

desert oar you might want to look into the "box-cox" family of transformations, of which th...

yeah I use transform the data by using 'yeo-johnson' in Pipeline but not in target variable. is that true?

serene scaffold Oct 4, 2021, 2:22 PM

#

12-year-old me can be quoted saying "I will never understand why I need to know square roots."

desert oar Oct 4, 2021, 2:23 PM

#

yes, that is the right way to use it @bold timber . the scikit-learn pipeline will derive the parameters required to perform the transformation from the training set, and it will apply them to the test set

bold timber Oct 4, 2021, 2:34 PM

#

desert oar yes, heavily skewed data or data with other "weird" statistical properties like ...

despite a target variable? isn't target variable should be naturally exist?

desert oar Oct 4, 2021, 2:34 PM

#

bold timber despite a target variable? isn't target variable should be naturally exist?

i don't know what you mean by that

bold timber Oct 4, 2021, 2:36 PM

#

desert oar i don't know what you mean by that

I mean the target variable is should be naturally exist, isn't it?

velvet thorn Oct 4, 2021, 3:01 PM

#

bold timber I mean the target variable is should be naturally exist, isn't it?

ah

#

you mean

#

you should leave the target variable unmodified?

primal tulip Oct 4, 2021, 3:02 PM

#

drifting mason like this

There must be something wrong over the way you're trying to plot the second graph. Read a bit on duplicating the same axis in matplotlib. If I recall correctly it would be something like declaring the first ax, then call a special method for the second ax variable and assigning both with different labels.

import pandas as pd
import matplotlib.pyplot as plt

df = pd.DataFrame(<the data you are using>)

fig, ax = plt.subplots()

ax.barplot(df[col_1])
ax2 = ax.twinx()

ax2.barplot(df[col_2])

plt.plot()
plt.show()

Read on '''ax.twinx()'''. I hope it's enough to point you in the right direction, I can't help more than that right now.
@drifting mason

bold timber Oct 4, 2021, 3:02 PM

#

velvet thorn you should leave the target variable unmodified?

Yes, that's right

velvet thorn Oct 4, 2021, 3:05 PM

#

bold timber Yes, that's right

not necessarily

#

okay

#

think about this

#

say you have 3 predictors

#

x1, x2, x3

#

and a target, y

#

imagine if you just perform a logarithmic transformation

#

so

#

now your predictors are

#

ln x1, ln x2, ln x3

#

against y

#

that's the same as

#

x1, x2, x3 against e^y

#

right?

bold timber Oct 4, 2021, 3:07 PM

#

velvet thorn not necessarily

Despite of target variable have a skewed?

velvet thorn Oct 4, 2021, 3:07 PM

#

bold timber Despite of target variable have a skewed?

read my example

#

and tell me if it makes sense

warm valley Oct 4, 2021, 3:08 PM

#

Hello, I am having trouble filtering a data frame,
I am using it on text classification,

mask = np.logical_and(y_pred_class==5, y_test==1) #Both of these are np array cond= x_test[mask] #x_test in np array with text reviews[reviews['Text'].isin(cond)].
I am specifically telling, I want those values whose predicted is 5 and original is 1, even then the last code is returning those with original value 5

bold timber Oct 4, 2021, 3:11 PM

#

velvet thorn imagine if you just perform a logarithmic transformation

In this case only transformation for predictors?

bold timber Oct 4, 2021, 3:12 PM

#

velvet thorn right?

And then?

desert oar Oct 4, 2021, 3:24 PM

#

@bold timber you can transform the target variable, as long as you can also un-transform it to get predictions

#

so transformations should also be invertible as well as monotonic and differentiable

bold timber Oct 4, 2021, 3:31 PM

#

desert oar <@!786960616664727572> you can transform the target variable, as long as you can...

If I not use transform and only use cross validation that's okay?

velvet thorn Oct 4, 2021, 3:36 PM

#

desert oar so transformations should also be _invertible_ as well as monotonic and differen...

hold up

#

doesn't invertibility imply strict monotonicity?

velvet thorn Oct 4, 2021, 3:38 PM

#

desert oar <@!786960616664727572> you can transform the target variable, as long as you can...

actually is that true?

#

I mean like you can group outcomes, for example, right

worldly lake Oct 4, 2021, 3:46 PM

#

@velvet thorn, oh, thank u, i started learn groupby() from itertools, but gonna use ur method too. thanks 😉

lusty stag Oct 4, 2021, 4:16 PM

#

say I have data from 50 users and I want to normalize features before training
should I fit on 1 user and transform the rest? or fit on whole?

vale hedge Oct 4, 2021, 5:54 PM

#

Pretty new to unstructured learning. I want to cluster some text then I want to order clusters and samples in a 1D list. I'm wondering if there are any packages to help do something like this. (Mainly wondering about last part in ordering in 1D)

drifting mason Oct 4, 2021, 6:42 PM

#

primal tulip There must be something wrong over the way you're trying to plot the second grap...

Thanks, I will try that out

desert oar Oct 4, 2021, 7:15 PM

#

vale hedge Pretty new to unstructured learning. I want to cluster some text then I want to...

order them how?

#

that is: what would be a good ordering in your use case? there's no package for this because there's no general-purpose definition of an ordering on clusters

vale hedge Oct 4, 2021, 7:45 PM

#

desert oar order them how?

Order them by distance. But doesn't matter if they are first or last. Just need something that somewhat makes sense.

#

Maybe optimizing like traveling salesman would be nice but I don't need it

foggy blade Oct 4, 2021, 7:48 PM

#

Hey anybody able to recommend a more advanced natural language processing resource than what I could find? The YouTube videos are quite introductory. I wouldn't mind diving into some maths

vale hedge Oct 4, 2021, 7:49 PM

#

foggy blade Hey anybody able to recommend a more advanced natural language processing resour...

What did you find and what packages did you work with?

#

And what problem are you trying to solve

foggy blade Oct 4, 2021, 7:53 PM

#

@vale hedge first I started with NLTK, basic youtube guide for sentiment analysis and a few other topics. I'm really interested in generation, ie something in the lines of a generative neural network but for nlp. I'm not sure how it would work for NLP. I was inspired by all of these copywriter ai startups. I'd love to delve into that, even if just to understand it rather than have an effective model

#

but the guides on youtube are basically just do this and parrot learn. A book would be nice, but they all seem really outdated.

#

Maybe papers?

#

I have tons of machine learning with classic machine learning and neural networks. Not so much with NLP haha. It just seems fun

vale hedge Oct 4, 2021, 7:55 PM

#

You want to look into transformers: GPT-3 or perhaps bert

foggy blade Oct 4, 2021, 7:56 PM

#

But isn't GPT-3 like beta exclusive? I signed up on open.ai a while ago, but as far as I understand it, it's quite difficult to get in if you're not gonna provide them with an income.

#

I'll check out bert

vale hedge Oct 4, 2021, 7:56 PM

#

GPT latest version is not public so you might look into some variants

foggy blade Oct 4, 2021, 7:57 PM

#

i see. So is gpt 2 public?

#

fully

#

or also an API?

vale hedge Oct 4, 2021, 7:58 PM

#

I think gpt-2 is public and you can download a fully pretrained model to use it. Not sure about API.

foggy blade Oct 4, 2021, 7:59 PM

#

Nice I just found it. Epic. I thought it was all proprietary. Thanks man! Will definitely check our BERT too. Looks pretty cool.

#

Thanks!

vale hedge Oct 4, 2021, 7:59 PM

#

Np gl it's pretty interesting stuff

foggy blade Oct 4, 2021, 7:59 PM

#

I can imagine!

vale hedge Oct 4, 2021, 8:05 PM

#

For a lot of these models you usually start with pretrained models for general text. You can also train on specific types of text or corpora if you want it to specialize in different types of language.

desert oar Oct 4, 2021, 8:11 PM

#

vale hedge Order them by distance. But doesn't matter if they are first or last. Just nee...

Distance from what exactly?

slender sand Oct 4, 2021, 8:11 PM

#

Don't want to torpedo a conversation here but can ya'll stomach a noob question? I'm new to ML/DS and really don't want to put bogus findings in front of my boss.

desert oar Oct 4, 2021, 8:12 PM

#

slender sand Don't want to torpedo a conversation here but can ya'll stomach a noob question?...

That's what this channel is for, go ahead

slender sand Oct 4, 2021, 8:12 PM

#

much apprecciated

#

I've got your basic ecomm dataset and I'm trying to find out if there are one or more features that lead to a sale/no sale outcome. Been using Random Forest Classifier, which gives me a pretty wild (overfit?) accuracy of 99%, 80% on cross-val. The #1 feature (price) scores 23 in feature importance. So far so good, I feel.

#

But if I run a point biserial correlation on that feature and the Y/N purchase goal, it's totally untethered

#

does that just mean it's not a contributing factor? or have i botched something

vale hedge Oct 4, 2021, 8:14 PM

#

desert oar Distance from what exactly?

Er I think the basis for clustering is distance. So that distance. Like a cluster can have a centroid so then I want to use the distance to another centroid.

slender sand Oct 4, 2021, 8:16 PM

#

Like maybe price is the most important, but still not really that telling in the actual outcome?

desert oar Oct 4, 2021, 8:33 PM

#

vale hedge Er I think the basis for clustering is distance. So that distance. Like a clus...

the basis of (most) clustering algorithms is the distance between points, not the distance from some particular point to every other point

#

it would help if you explained why you want this and what you're trying to achieve

desert oar Oct 4, 2021, 8:44 PM

#

slender sand But if I run a point biserial correlation on that feature and the Y/N purchase g...

don't forget that correlation is a linear phenomenon, and that a pairwise correlation is a lot like an average over the entire dataset. what if there is a non-linear relationship that depends on other variables in the data? maybe price is very meaningful once you account for market segment, or maybe it's only meaningful in certain segments but not others

#

you might want to look at mutual information instead of correlation, for example

#

scikit-learn has a routine for it

slender sand Oct 4, 2021, 8:44 PM

#

oh?

desert oar Oct 4, 2021, 8:44 PM

#

https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.mutual_info_classif.html

scikit-learn

sklearn.feature_selection.mutual_info_classif

slender sand Oct 4, 2021, 8:46 PM

#

and this would still ultimately be measuring those two "columns"

desert oar Oct 4, 2021, 8:47 PM

#

yes, but mutual information is more general than correlation - it (attempts to) measure the statistical dependence between two variables, not necessarily linear dependence

#

but it won't help with the conditional dependence issue - that's essentially what all models do, learning conditional dependence structures

slender sand Oct 4, 2021, 8:50 PM

#

so then I can't defensibly say "changing price X% will have Y effect", because it seems to be a certain perfect storm of other features that lend it that oomph!

lapis sequoia Oct 4, 2021, 9:10 PM

#

anyone know how to get a deep learning ai with tensorflow and tkinter to respond to a message by the user like a chat bot

vale hedge Oct 4, 2021, 9:28 PM

#

desert oar it would help if you explained why you want this and what you're trying to achie...

I want to display all samples as 1D list similar to text segmentation (rather than visualize in higher dimensional space). So then I can look at text representation and as well as for evaluating performance.

#

If you don't know of anything then it's fine. I can just try to look or test some solutions out by myself.

desert oar Oct 4, 2021, 9:54 PM

#

slender sand so then I can't defensibly say "changing price X% will have Y effect", because i...

you can recover an "average effect" of sorts with a technique called partial dependence

#

look up "partial dependence plots" and also techniques for model explainability like LIME and SHAP

desert oar Oct 4, 2021, 9:55 PM

#

vale hedge I want to display all samples as 1D list similar to text segmentation (rather th...

one option is to use something like multidimensional scaling, principal components, or some other dimension reduction that can emit 1 dimension

#

"text segmentation" isn't something you can usually visualize as a 1-d list either, unless you mean visualizing the sequence of segmented sections of text? which would require you to define some notion of "sequence"

#

i'm asking these questions because the task is ill-defined, not because i'm trying to dodge giving an answer

vale hedge Oct 4, 2021, 11:15 PM

#

I am trying to do some kind of topic modeling. The primary feature from my understanding should be some description that describes the topic. I want it in 1D list so I can look at a line of descriptions.

desert oar Oct 4, 2021, 11:17 PM

#

ok, that helps clarify more

#

are you looking at topics for each document? if so, you can sort by relevance to the document

#

otherwise you can sort by frequency or total score across the dataset or something

#

if you want to try to put topics on some kind of 1-d spectrum, then i recommend what i recommended above: multidimensional scaling or pca

vale hedge Oct 4, 2021, 11:21 PM

#

From what I understand PCA is based on orthogonality.

#

So (3,0) and (0,3) might get reduced to 3. But that is not what I want to do.

#

I want to find closest distance so (3,1) should be closer to (3,0)

desert oar Oct 4, 2021, 11:29 PM

#

i still don't really know what you're asking or how that relates to "I want it in 1D list so I can look at a line of descriptions."

#

and (3, 0), (0, 3) might get reduced to 3 and 0 with 1-d PCA

stark zenith Oct 4, 2021, 11:32 PM

#

How do I return a dataframe, but only for rows that have a value in the last column?

desert oar Oct 4, 2021, 11:33 PM

#

stark zenith How do I return a dataframe, but only for rows that have a value in the last col...

by "have a value" do you mean "not null"? do you know the column name, or do you want to do it by the position of the column (not usually something you want to do)?

stark zenith Oct 4, 2021, 11:34 PM

#

Yes, not null, and by the position of the column as last column.

#

@desert oar

#

I'm hoping to use it to pull values from different pages of a report that is being made.

desert oar Oct 4, 2021, 11:38 PM

#

last_column = data.iloc[:, -1]
last_column_has_value = last_column.notnull()
data = data.loc[last_column_has_value].copy()

i broke it into 3 steps so you can see what the individual sections are. you can of course write this as a one-liner. note the use of .copy() after subsetting with .loc - it isn't strictly necessary, but if you are making any "in-place" changes to the dataframe later in the code, this will avoid warnings about "setting a copy on a slice"

stark zenith Oct 4, 2021, 11:40 PM

#

desert oar ```python last_column = data.iloc[:, -1] last_column_has_value = last_column.not...

Thanks! Yeah, I'm aware of the perils of slicing views vs copies. I'll give this a shot. I appreciate the help!

glad tundra Oct 4, 2021, 11:50 PM

#

Hey can anyone tell why why there is significant difference between the loss function of sklearn's linear regression and my own coded linear regression algorithm?

quiet vault Oct 5, 2021, 1:32 AM

#

So I have data normalized or scaled between 0 and 1

#

For some reason my model is predicting 2

#

and then i turn up the scaling range to like 50, it starts predicting in the 60s

#

does someone know why this happens

prime hearth Oct 5, 2021, 2:35 AM

#

depends, scaling shouldnt affect output in this sense

#

what haopens when you dont scale

#

it might be maybe using more output labels somewhere in code and depends the algo and how it is implemented

lapis sequoia Oct 5, 2021, 4:46 AM

#

can someone fill me in on what kernel density estimation plots are for.. when trying to understand distribution of data

feral lodge Oct 5, 2021, 6:32 AM

#

It gives a smoother plot of the data than a histogram would and tries to fill in the gaps

#

The density estimation itself can also be used for sampling new points, and compute likelihood values at arbitrary data points. Which can be used to build a bigger model, composed of several smaller ones

#

Imo the plots themselves are much less interesting than the sampling and likelihood aspects. It can be very useful to construct a probabilistic model of your data points

fading quarry Oct 5, 2021, 6:45 AM

#

how can i change the column name of my dataframe? I have old name and new name in a csv file?

vale hedge Oct 5, 2021, 7:02 AM

#

quiet vault and then i turn up the scaling range to like 50, it starts predicting in the 60s

Would you happen to be accidentally scaling the Y some how as well? It sounds like a very abnormal situation.

edgy brook Oct 5, 2021, 7:39 AM

#

Hey guys! I'm a bit new to data science but has anyone here got an example of a graph that shows 4 attributes? It could be about anything as I haven't really got a clue on how to visualise it. There's this one I had found but it's only for 3, i think

Screen_Shot_2021-10-05_at_6.38.31_pm.png

quasi parcel Oct 5, 2021, 8:08 AM

#

https://rapidapi.com/weatherapi/api/weatherapi-com/

RapidAPI

WeatherAPI.com API Documentation (weatherapi) | RapidAPI

WeatherAPI.com is a powerful fully managed free weather and geolocation API provider that provides extensive APIs that range from the weather forecast, historical weather, IP lookup, and astronomy through to sports, time zone, and geolocation.

#

you can use api @slate verge

#

Hi i have a trouble in this code, like i have to even consider customer_ids but the customer_ids are encrypted so if there is encrypted string this line of code is working or else its not working there is an error saying attribute error how to pass this
https://paste.pythondiscord.com/ixepotupox.pl in this snippet we have line 9 i need to consider even if the customer_id is empty

#

how can i do that

#

please do help

#

thank you

feral lodge Oct 5, 2021, 8:47 AM

#

@edgy brook your type of plot is called a scatter plot. For more than 2 variables, people usually use a scatter plot matrix. That's just a grid of all possible 2-variable plots. Like this

Iris-data-scatterplot-matrix-with-4-variables-There-are-3-classes-Setosa-red.png

#

The colors on those plots are class labels. So they have 3 classes of flowers, and 4 properties of the flowers

old grove Oct 5, 2021, 9:01 AM

#

Hi Guys Can anyone help me with this ?

What's The difference Between Data Scientist and Bussiness intelligence analyst or ba,data analyst.

I mean whatever you say the base goal work is on finding useful insights and relevant information,target audience to provide the insights found to benifit Bussines so What's different in each or they are just names given ?

tiny geyser Oct 5, 2021, 9:27 AM

#

old grove Hi Guys Can anyone help me with this ? What's The difference Between Data Scien...

I'm not sure there is a formal definition but I'd expect a data scientist to be expert in advanced statistics and artificial intelligence (ML, deep learning, algorithms, text mining etc) and I'd expect a BI analyst to be more geared towards data visualisations and domain knowledge (dashboards, tools like tablau, Qlik etc) with basic stats knowledge

old grove Oct 5, 2021, 9:32 AM

#

tiny geyser I'm not sure there is a formal definition but I'd expect a data scientist to be ...

so data science is Doing visualizations to determine past and present insights as well use ml model to predict future information and use that insight to benifit business.

Bi/da analyst will look only for visualization using Tableu,Power Bi and focus on visualization and get insights from the data present to benift the Bussiness and thats it

Am i correct ?

floral junco Oct 5, 2021, 9:34 AM

#

hey does anyone know how to make a sentence generator?

eager imp Oct 5, 2021, 9:35 AM

#

there are different approaches

#

what kind of sentences?

#

NLTK can help with meaning, structure etc

#

if you want to produce sentences that look legitimate without further concern of meaning, you'll probably want to go the GPT road

#

combining both approaches would be the king's discipline, but i'm not aware of any project that got there yet (successfully)

#

there's of course the simplest approach of all

#

templates

edgy brook Oct 5, 2021, 9:41 AM

#

feral lodge The colors on those plots are class labels. So they have 3 classes of flowers, a...

I see.. thank you! Just wondering that if it was in 1 class of flower, would it still be a scatter matrix or just just a normal scatter line graph?

eager imp Oct 5, 2021, 9:41 AM

#

floral junco hey does anyone know how to make a sentence generator?

does that answer your question?

floral junco Oct 5, 2021, 9:44 AM

#

well it uses noam chomsky's phrase structure

tiny geyser Oct 5, 2021, 9:44 AM

#

old grove so data science is Doing visualizations to determine past and present insights a...

I'd agree. BI analysts usually 'look back' where as DS's will be predicting the future brainmon

eager imp Oct 5, 2021, 9:44 AM

#

floral junco well it uses noam chomsky's phrase structure

what is "it"?

floral junco Oct 5, 2021, 9:45 AM

#

this is what the sentence generator will use as a base for the sentence structure

eager imp Oct 5, 2021, 9:45 AM

#

is that some kind of homework?

floral junco Oct 5, 2021, 9:46 AM

#

no im designing something

#

if you look at diagram 15 its pretty straightfoward

eager imp Oct 5, 2021, 9:46 AM

#

yeah, too straightforward

#

so straightforward in fact that you could use a template

floral junco Oct 5, 2021, 9:47 AM

#

thats the thing im a bit new to python and i dont know where to start with this sentence generator

eager imp Oct 5, 2021, 9:47 AM

#

ah

#

well, then you should read the NLTK book

#

https://www.nltk.org/book/

#

it also covers sentence generation afaik

floral junco Oct 5, 2021, 9:49 AM

#

ill check it out thanks

feral lodge Oct 5, 2021, 10:33 AM

#

@edgy brook it would still be a scatter matrix. If we have 4 variables, we always end up with a 4x4 matrix like in the previous picture. No relation to the number of classes at all 🙂 here's another example with 3 variables without the class coloring

#

The reason they added colors is just to make extra clear that certain flower types cluster in certain ways when we plot their properties against each other. A common use case of scatter plots is to see which features (variables) are the most useful for separating classes.

lapis sequoia Oct 5, 2021, 10:58 AM

#

Anyone familiar with azure formfields? any idea how to turn that into a dataframe? Im stuck

#

I can turn it into a dictionary, but then it puts all the values on one line

#

in the dataframe

#

Im so confused, cant find anything online.

next lance Oct 5, 2021, 11:39 AM

#

The more the smaples we give to train the AI the more better it becomes then why don't Google uses sooooo many images for training Google lenses

lapis sequoia Oct 5, 2021, 11:42 AM

#

what

#

thats not what I asked

next lance Oct 5, 2021, 11:43 AM

#

lapis sequoia thats not what I asked

It is my question!! 😄

lapis sequoia Oct 5, 2021, 11:43 AM

#

Answer mine

#

https://tenor.com/view/now-george-buckman-flora-and-ulysses-right-away-this-instant-gif-20518805

Tenor

humble spade Oct 5, 2021, 12:09 PM

#

i am doing a classification problem and when I visualize the features with the hue being the class i found 2 classes overlapped on each other HOW can i separate them??

#

or is it not possible ?

desert oar Oct 5, 2021, 12:20 PM

#

lapis sequoia I can turn it into a dictionary, but then it puts all the values on one line

what does the dict look like? it's usually possible to turn a dict into a dataframe

lapis sequoia Oct 5, 2021, 12:20 PM

#

It's so nested

#

It's impossibru

desert oar Oct 5, 2021, 12:21 PM

#

is there a schema for it?

lapis sequoia Oct 5, 2021, 12:25 PM

#

what do you mean?

serene scaffold Oct 5, 2021, 12:37 PM

#

lapis sequoia what do you mean?

If it's nested consistently

lapis sequoia Oct 5, 2021, 12:37 PM

#

No idea

#

the dataframe.from_dict just puts random column names, and all of the values and keys on on cell

#

one cell

desert oar Oct 5, 2021, 12:42 PM

#

lapis sequoia the dataframe.from_dict just puts random column names, and all of the values and...

well yeah that's because it's guessing

#

what is the format of the data?

lapis sequoia Oct 5, 2021, 12:43 PM

#

its a formfield, that i turn into a dict

#

How can i choose which column and data to put in the dataframe from dict?

desert oar Oct 5, 2021, 12:44 PM

#

i'm asking you to provide more details

#

what's the format of the data? what are the keys? how many are they? what is the nesting structure? are there lists of things anywhere in there? etc. etc.

#

if you give an illustrative example that would be even better

lapis sequoia Oct 5, 2021, 12:44 PM

#

ok hold on

#

its too big to paste here

#

Ill show a tiny part of it then

#

dont have nitro

bold timber Oct 5, 2021, 12:45 PM

#

how to switch the plot from bottom to top?

lapis sequoia Oct 5, 2021, 12:46 PM

#

Item table: {'1': FormField(value_type=dictionary, label_data=None, value_data=None, name=1, value={'Description': FormField(value_type=string, label_data=None, value_data=FieldData(page_number=1, text=mercedes, bounding_box=[Point(x=2.505, y=4.96), Point(x=3.77, y=4.96), Point(x=3.77, y=5.08), Point(x=2.505, y=5.08)], field_elements=None), name=Description, value='bmw', confidence=1.0), 'Quantity': FormField(value_type=float, label_data=None, value_data=FieldData(page_number=1, text=30,00, bounding_box=[Point(x=5.91, y=4.975), Point(x=6.255, y=4.975), Point(x=6.255, y=5.095), Point(x=5.91, y=5.095)], field_elements=None), name=Quantity, value=3000.0, confidence=1.0), 'Amount': FormField(value_type=float, l

#

@desert oar

#

where i want columns to be Description, quantity, amount etc, and data for them to be mercedes, bmw etc etc

desert oar Oct 5, 2021, 12:48 PM

#

ok, and this item table 1 is one whole dataframe, right?

lapis sequoia Oct 5, 2021, 12:48 PM

#

item table is the dictionary/formfield

desert oar Oct 5, 2021, 12:48 PM

#

bold timber how to switch the plot from bottom to top?

swap the order of the cells?

lapis sequoia Oct 5, 2021, 12:49 PM

#

its some kind of strange hybrid made by azure

desert oar Oct 5, 2021, 12:49 PM

#

that's not what i'm asking. i see {'1': ... indicating that there are more of these things

lapis sequoia Oct 5, 2021, 12:49 PM

#

Ah yes, that is row number 1

#

and it repeats

#

row 2, same columns etc etc

#

i couldnt paste the entire thing

desert oar Oct 5, 2021, 12:49 PM

#

that's fine, i don't need it. i just need a sense of the structure

#

!paste we do however have a "paste site" for bigger files 👇

arctic wedgeBOT Oct 5, 2021, 12:50 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

desert oar Oct 5, 2021, 12:50 PM

#

just fyi in the future

lapis sequoia Oct 5, 2021, 12:50 PM

#

oh ok thanks

#

but the structure is the same, if you stop at quantity as the last one

#

to keep it simple

desert oar Oct 5, 2021, 12:50 PM

#

first thing i do in this situation is re-format the data to make the nesting structure visually clear

lapis sequoia Oct 5, 2021, 12:50 PM

#

Exactly, and how in the world would I do that

desert oar Oct 5, 2021, 12:50 PM

#

with your text editor 🙂

#

i mean literally take your example

lapis sequoia Oct 5, 2021, 12:50 PM

#

I only want, Key and value not all of the rest

desert oar Oct 5, 2021, 12:50 PM

#

and format it with indentation

lapis sequoia Oct 5, 2021, 12:51 PM

#

Manually?

desert oar Oct 5, 2021, 12:51 PM

#

otherwise i don't have a damn clue what's in here if it's not visually formatted

#

if you have to, sure

bold timber Oct 5, 2021, 12:51 PM

#

desert oar swap the order of the cells?

it doesn't works

lapis sequoia Oct 5, 2021, 12:51 PM

#

You mean how I want it to be?

desert oar Oct 5, 2021, 12:51 PM

#

no i mean take the data that you have and give it indentation so you can see the nesting structure

lapis sequoia Oct 5, 2021, 12:51 PM

#

oh

#

ok hold on

desert oar Oct 5, 2021, 12:53 PM

#

Item table:
{
  '1': FormField(
    value_type=dictionary,
    label_data=None,
    value_data=None,
    name=1, value={
      'Description': FormField(
        value_type=string,
        label_data=None,
        value_data=FieldData(
          page_number=1,
          text=mercedes,
          bounding_box=[Point(x=2.505, y=4.96), Point(x=3.77, y=4.96), Point(x=3.77, y=5.08), Point(x=2.505, y=5.08)],
          field_elements=None),
          name=Description,
        value='bmw',
        confidence=1.0),
      'Quantity': FormField(
        value_type=float,
        label_data=None,
        value_data=FieldData(
          page_number=1,
          text=30,00,
          bounding_box=[Point(x=5.91, y=4.975), Point(x=6.255, y=4.975), Point(x=6.255, y=5.095), Point(x=5.91, y=5.095)],
          field_elements=None
        ),
        name=Quantity,
        value=3000.0,
        confidence=1.0),
      'Amount': ...

#

ok, progress

#

now the question is: how, conceptually, does this need to look when you flatten it?

#

what columns do you want?

bold timber Oct 5, 2021, 12:54 PM

#

bold timber how to switch the plot from bottom to top?

I mean switch the line plot

desert oar Oct 5, 2021, 12:54 PM

#

bold timber I mean switch the line plot

be more specific

swift mist Oct 5, 2021, 12:54 PM

#

does anyone here knows how to decrypt caesar cipher?

clever island Oct 5, 2021, 12:55 PM

#

Hi everybody ! I have a quick (I hope) question on how to store ML coeff in a database. I'll have to store an unknown number of point who are vectors of dimension ~300. It feel way too bruteforce to declare a table with 300 column but don't find much 'good practice' with some search.... Any Idea ? Is it legit to have so many columns ?

lapis sequoia Oct 5, 2021, 12:55 PM

#

@desert oar oh you fixed it for me, okay well what is the next step here?

desert oar Oct 5, 2021, 12:55 PM

#

lapis sequoia <@!389497659087650836> oh you fixed it for me, okay well what is the next step h...

now the question is: how, conceptually, does this need to look when you flatten it?

desert oar Oct 5, 2021, 12:55 PM

#

clever island Hi everybody ! I have a quick (I hope) question on how to store ML coeff in a da...

what database? postgresql for example supports array-valued columns

gray tartan Oct 5, 2021, 12:55 PM

#

Hey everyone ! I'm having an horribly weird bug with pandas
I'm basically trying to do a df.groupby(["A", "B"], sort=False, as_index=False).apply(some_lambda)
And sometimes the dataframe that the lambda gets has its columns shifted, like it doesn't have A and B columns anymore, but their value goes in later columns, which is ultra weird
I thought it was a bugged version issue at first but i can't even reproduce it in a python console using the same interpreter and the same packages versions, the same code applied on the same exact dataframe returns me the expected result
(sry if i'm interrupting stuff, i can always ask it in an help channel but it's quite urgent :/, thanks for the help !)

lapis sequoia Oct 5, 2021, 12:55 PM

#

I only need 'Description' and bmw

desert oar Oct 5, 2021, 12:56 PM

#

swift mist does anyone here knows how to decrypt caesar cipher?

see #algos-and-data-structs but this sounds like a homework question, so keep our rules in mind

lapis sequoia Oct 5, 2021, 12:56 PM

#

and like, quantity and 3000 and so on

bold timber Oct 5, 2021, 12:56 PM

#

desert oar be more specific

the line plot from higher value to lower value as a life expectancy, how to switch the line plot from lower value to upper value against life expectancy?

desert oar Oct 5, 2021, 12:57 PM

#

gray tartan Hey everyone ! I'm having an horribly weird bug with `pandas` I'm basically tryi...

can you post a reproducible example? what is "sometimes"? that would be a very weird bug indeed, but the standard of evidence for a bug like that should be high

gray tartan Oct 5, 2021, 12:57 PM

#

desert oar can you post a reproducible example? what is "sometimes"? that would be a very w...

well i can't reproduce it myself :/ but i can show you an input and output

#

(that's what i mean by "sometimes")

desert oar Oct 5, 2021, 12:58 PM

#

bold timber the line plot from higher value to lower value as a life expectancy, how to swit...

maybe reverse the order of the rows in the dataframe? sns.lineplot(..., data=a.iloc[::-1])

desert oar Oct 5, 2021, 12:58 PM

#

gray tartan well i can't reproduce it myself :/ but i can show you an input and output

that's a start

lapis sequoia Oct 5, 2021, 12:58 PM

#

@desert oar Basically, i want to make it readable by the dataframe.from_dict

desert oar Oct 5, 2021, 12:58 PM

#

lapis sequoia <@!389497659087650836> Basically, i want to make it readable by the dataframe.fr...

you're thinking about this backwards. don't try to hammer this into one particular function that you think you need. instead think of "what columns do i need in my dataframe?", then figure out how to make that happen

lapis sequoia Oct 5, 2021, 12:59 PM

#

Yeah, and Im stuck, ive tried it all

#

I tried looping through but that didnt work either

#

Im so confused

gray tartan Oct 5, 2021, 12:59 PM

#

desert oar that's a start

should i put it there or open an help channel ?

desert oar Oct 5, 2021, 1:00 PM

#

gray tartan should i put it there or open an help channel ?

here is fine, it's pandas

bold timber Oct 5, 2021, 1:00 PM

#

desert oar maybe reverse the order of the rows in the dataframe? `sns.lineplot(..., data=a....

still doesn't works

desert oar Oct 5, 2021, 1:00 PM

#

bold timber still doesn't works

that was a bad suggestion 😆 let me see if there's a way in seaborn

#

@lapis sequoia stop thinking about code! be specific about what you want in the dataframe. you said "description", but "description" is itself a big nested blob of stuff, so that doesn't help

arctic wedgeBOT Oct 5, 2021, 1:01 PM

#

:incoming_envelope: :ok_hand: applied mute to @gray tartan until <t:1633439500:f> (9 minutes and 59 seconds) (reason: newlines rule: sent 111 newlines in 10s).

desert oar Oct 5, 2021, 1:02 PM

#

!paste @gray tartan

arctic wedgeBOT Oct 5, 2021, 1:02 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

desert oar Oct 5, 2021, 1:02 PM

#

<@&831776746206265384> can we unmute our hapless friend? ☝️

lapis sequoia Oct 5, 2021, 1:02 PM

#

https://tenor.com/view/calculating-puzzled-math-confused-confused-look-gif-14677181

Tenor

desert oar Oct 5, 2021, 1:02 PM

#

lapis sequoia https://tenor.com/view/calculating-puzzled-math-confused-confused-look-gif-14677...

ideas first, code second

zenith nova Oct 5, 2021, 1:02 PM

#

!unmute 165918545975181312

arctic wedgeBOT Oct 5, 2021, 1:02 PM

#

:incoming_envelope: :ok_hand: pardoned infraction mute for @gray tartan.

lapis sequoia Oct 5, 2021, 1:02 PM

#

You are basically saying that there is no way to get that into a dataframe

desert oar Oct 5, 2021, 1:02 PM

#

i'm serious, this is how you have to solve these kinds of problems

gray tartan Oct 5, 2021, 1:02 PM

#

thanks, i thought it'd be ok since it wasn't so big :/

desert oar Oct 5, 2021, 1:03 PM

#

lapis sequoia You are basically saying that there is no way to get that into a dataframe

no, that is not at all what i am saying. i am saying that there is no generic all-purpose magic one-and-done way to get that into a dataframe. you need to use your brain to come up with a mapping between "things in that form field" and "columns in a dataframe", and then you can write code to implement that mapping.

lapis sequoia Oct 5, 2021, 1:04 PM

#

@desert oar I see. I came here when my brain ran out of ideas.

zenith nova Oct 5, 2021, 1:04 PM

#

gray tartan thanks, i thought it'd be ok since it wasn't so big :/

no worries, it was just a liiitle bit too big. Best to use paste for it anyway

gray tartan Oct 5, 2021, 1:04 PM

#

so here's the input dataframe (in records orient)
https://paste.pythondiscord.com/yowuhevavu.json
here's the code

new_dates_impact_df.groupby(
                ["deviceCategory", "segment"], sort=False, as_index=False
            ).apply(
                lambda dates_df: print(dates_df.to_dict(orient="records"))
            )

and here's the output of the first group

[{'date': '2021-10-03', 'users': 'desktop', 'transactions': 'A_buyers', 'transactionRevenue': 366}]

but when i try to reproduce it, i get, as expected :

[{'date': '2021-10-03', 'deviceCategory': 'desktop', 'segment': 'A_buyers', 'users': 1830, 'transactions': 1311, 'transactionRevenue': 32129.88}]

bold timber Oct 5, 2021, 1:05 PM

#

desert oar that was a bad suggestion 😆 let me see if there's a way in seaborn

how to solve?

desert oar Oct 5, 2021, 1:05 PM

#

bold timber how to solve?

try ax.invert_axis

desert oar Oct 5, 2021, 1:06 PM

#

lapis sequoia <@!389497659087650836> I see. I came here when my brain ran out of ideas.

so back up. what individual specific pieces of data do you want?

something like this:

for each car in this dataset, i need X Y and Z specific fields

lapis sequoia Oct 5, 2021, 1:06 PM

#

Yes but this nested mess

#

If description, amount and quantity were at the same location I would fix it

desert oar Oct 5, 2021, 1:06 PM

#

@bold timber
https://matplotlib.org/stable/api/_as_gen/matplotlib.axes.Axes.invert_xaxis.html
https://matplotlib.org/stable/api/_as_gen/matplotlib.axes.Axes.invert_yaxis.html
another way: https://matplotlib.org/stable/gallery/subplots_axes_and_figures/invert_axes.html

lapis sequoia Oct 5, 2021, 1:06 PM

#

but its so messed up

desert oar Oct 5, 2021, 1:07 PM

#

lapis sequoia If description, amount and quantity were at the same location I would fix it

do you see how much shit is in Description? you're not answering my question. what parts of that field do you need?

#

you just keep complaining how it's a mess

#

of course it's a mess, and it's your job to unfuck it

serene scaffold Oct 5, 2021, 1:08 PM

#

https://tenor.com/view/star-wars-no-no-no-no-palpatine-lightsaber-gif-7521430

Tenor

lapis sequoia Oct 5, 2021, 1:08 PM

#

https://tenor.com/view/no-more-go-t-sam-mercy-stop-gif-4954655

Tenor

No more

▶ Play video

serene scaffold Oct 5, 2021, 1:09 PM

#

data-science-and-ai-and-gifs

gray tartan Oct 5, 2021, 1:09 PM

#

gray tartan so here's the input dataframe (in records orient) https://paste.pythondiscord.co...

To add informations to it, as i think it is a pandas or an environment bug, idk how it could happen otherwise : i have version 1.3.3 of pandas in both contexts.
Both are ran on the same machine, the bugged one being ran with uvicorn and the ok one with a standard python console

desert oar Oct 5, 2021, 1:10 PM

#

gray tartan To add informations to it, as i think it is a pandas or an environment bug, idk ...

👀 if you can reproduce it with uvicorn, maybe file a bug report? that's super weird

bold timber Oct 5, 2021, 1:11 PM

#

desert oar <@!786960616664727572> https://matplotlib.org/stable/api/_as_gen/matplotlib.axe...

still doesn't work, any clue?

desert oar Oct 5, 2021, 1:11 PM

#

bold timber still doesn't work, any clue?

show your code

#

"doesn't work" is a phrase that "doesn't mean anything" to me

bold timber Oct 5, 2021, 1:11 PM

#

desert oar show your code

gray tartan Oct 5, 2021, 1:11 PM

#

I reproduce it with uvicorn each time, but idk to who file the bug exactly :/ pandas github ?

desert oar Oct 5, 2021, 1:11 PM

#

gray tartan I reproduce it with uvicorn each time, but idk to who file the bug exactly :/ pa...

yes

desert oar Oct 5, 2021, 1:12 PM

#

bold timber

...you didn't call the function, you just wrote the name of it with ; after it

bold timber Oct 5, 2021, 1:13 PM

#

desert oar ...you didn't call the function, you just wrote the name of it with `;` after it

I wrote the function

gray tartan Oct 5, 2021, 1:13 PM

#

ok i'm gonna make a standalone script runnable with uvicorn that reproduces it and send that...
That's so annoying since it's blocking me from deploying my stuff to production angerysad

eager imp Oct 5, 2021, 1:15 PM

#

desert oar of course it's a mess, and it's your job to unfuck it

data science: 90% data unfucking 10% science 😉

gray tartan Oct 5, 2021, 1:15 PM

#

more like 99% 0.5%
~~and 0.5% of package weirdness~~ 👀

eager imp Oct 5, 2021, 1:15 PM

#

lmao

#

right

gray tartan Oct 5, 2021, 1:16 PM

#

ok, i just ran it with pandas 1.2.5 and it works fine

#

so it affect one of the later versions

desert oar Oct 5, 2021, 1:16 PM

#

lapis sequoia If description, amount and quantity were at the same location I would fix it

ok here: if you just want the text value and you want to ignore all the other shit like the bounding box, you can do this (and i really hope i'm not doing your homework for you):

data_flat = {}
for record_id, formfields in table.items():
    record = {}
    record['Description'] = formfields.value['Quantity'].value
    record['Quantity'] = formfields.value['Quantity'].value
    record['Amount'] = formfields.value['Amount'].value
    data_flat[record_id] = record

data = pd.DataFrame.from_dict(data_flat, orient='index')

but this is also largely a guess, i have no idea what the actual API for this python code is

desert oar Oct 5, 2021, 1:17 PM

#

gray tartan so it affect one of the later versions

i wonder if this is some weird thread-safety/concurrency issue

eager imp Oct 5, 2021, 1:17 PM

#

oh man, how much time i wasted on convoluted, badly documented inconsistent xml metadata to extract useful data

lapis sequoia Oct 5, 2021, 1:17 PM

#

@desert oar im 30 years old I dont have any homework, thank you Ill give it a try.

desert oar Oct 5, 2021, 1:18 PM

#

lapis sequoia <@!389497659087650836> im 30 years old I dont have any homework, thank you Ill g...

we get lots of university students and teenagers here begging for homework help, it wasn't meant to be a dig at you

eager imp Oct 5, 2021, 1:21 PM

#

i wonder if there's an online tool that lets you pick WYSIWYG-style and generates that kind of extraction code

desert oar Oct 5, 2021, 1:21 PM

#

that'd be cool

#

i'd use that for sure

gray tartan Oct 5, 2021, 1:23 PM

#

desert oar i wonder if this is some weird thread-safety/concurrency issue

well there is already a known bug with groupby if you don't use as_index=False
the first few groups generated does have the group keys as column until it's cached and you get them only as index/name after then

#

i wonder if it's linked to that

#

actually it's not really a bug since having as_index=True means you expect to get the group keys as index/name anyway, but still, it's pretty weird behavior

clever island Oct 5, 2021, 1:50 PM

#

desert oar what database? postgresql for example supports array-valued columns

Sorry missed your reply, had a meeting.... we are on mariadb

prisma mulch Oct 5, 2021, 1:57 PM

#

https://deepmind.com/blog/article/wavenet-generative-model-raw-audio

Deepmind

WaveNet: A Generative Model for Raw Audio

This post presents WaveNet, a deep generative model of raw audio waveforms. We show that WaveNets are able to generate speech which mimics any human voice and which sounds more natural than the best existing Text-to-Speech systems, reducing the gap with human performance by over 50%. We also demonstrate that the same network can be used to syn...

#

this seems quite interesting

desert oar Oct 5, 2021, 2:04 PM

#

gray tartan well there is already a known bug with `groupby` if you don't use `as_index=Fals...

the first few groups generated does have the group keys as column until it's cached and you get them only as index/name after then
really? that's strange. this is only in 1.3.x? you're saying that if i do as_index=True it will omit the group columns from the individual groups in the first few groups? wtf.

desert oar Oct 5, 2021, 2:05 PM

#

clever island Sorry missed your reply, had a meeting.... we are on mariadb

you can use an array in a json column then, not sure about the performance implications though

clever island Oct 5, 2021, 3:00 PM

#

desert oar you can use an array in a json column then, not sure about the performance impli...

kind of what I was thinking... and the details I was hopping someone here know..

#

thanks for your response !

bold timber Oct 5, 2021, 3:12 PM

#

What is the best chart for discrete and continuous variables?

desert oar Oct 5, 2021, 3:20 PM

#

bold timber What is the best chart for discrete and continuous variables?

there's no single best chart, it depends on how many discrete levels you have, among other things. what are the variables?

bold timber Oct 5, 2021, 3:21 PM

#

desert oar there's no single best chart, it depends on how many discrete levels you have, a...

total adult mortality (discrete) vs life_expectancy (continuous)

gray tartan Oct 5, 2021, 4:14 PM

#

desert oar > > the first few groups generated does have the group keys as column until it's...

na the contrary, it will put them with as_index=True

#

https://github.com/pandas-dev/pandas/issues/12839
https://github.com/pandas-dev/pandas/issues/14013

GitHub

GroupBy.nth includes group key inconsistently · Issue #12839 · pand...

Code Sample, a copy-pastable example if possible nth doesn't inlcude group key as the same as first and last. df = pd.DataFrame({'A': [1, 2, 3, 4, 5], 'B': [...

GitHub

Rolling groupby should not maintain the by column in the resulting ...

I found another oddity while digging through #13966. Begin with the initial DataFrame in that issue: df = pd.DataFrame({'A': [1] * 20 + [2] * 12 + [3] * 8, 'B': np.a...

waxen girder Oct 5, 2021, 4:18 PM

#

I have a Boolean series and I'm looking to copy a value in the column if my condition is true, how can I do this?

#

#

My data isn't tidy and I'm trying to make it so.

#

I'm pattern matching the first name row and I want to copy first name, last name, and date into their own separate columns matching the row number they come from.

scarlet bronze Oct 5, 2021, 4:26 PM

#

desert oar Oct 5, 2021, 4:26 PM

#

@waxen girder can you give an example of the output you want from this? i don't understand the example

scarlet bronze Oct 5, 2021, 4:27 PM

#

I have data like this, i want to plot the contour line of temperature on the map do any one best libraries fot this?

waxen girder Oct 5, 2021, 4:29 PM

#

@desert oar So far I want something like this:

#

But I'm getting an issue:

#

So I probably should be making a copy not a view.

desert oar Oct 5, 2021, 4:31 PM

#

waxen girder But I'm getting an issue:

how did you create ex_df2? personally i wouldn't try to read this all into one big df, i would read each "sub-table" separately, using row offsets to only read the rows i needed (skip_rows and nrows)

waxen girder Oct 5, 2021, 4:32 PM

#

The issue is the tables are not uniform, each "sub-table" has it's own size.

desert oar Oct 5, 2021, 4:32 PM

#

how many of them are there?

waxen girder Oct 5, 2021, 4:36 PM

#

~6k

#

going off how many first, last and date categories there are.

#

each tuple of (first, last, date) corresponds to it's own sub-table.

#

it's possible to do it in one dataframe.

desert oar Oct 5, 2021, 4:57 PM

#

oh that's a fuckton

#

at that point i might consider loading this all into a list and pandas-ifying it at the end

waxen girder Oct 5, 2021, 4:58 PM

#

There's a solution posted, I was trying to go at it before watching. I feel like I need to concede.

desert oar Oct 5, 2021, 4:59 PM

#

waxen girder But I'm getting an issue:

well to answer this specific question: you can use .copy() after .loc[] defensively to avoid this issue

arctic crown Oct 5, 2021, 5:08 PM

#

@serene scaffold can i dm you?

serene scaffold Oct 5, 2021, 5:08 PM

#

arctic crown <@!253696366952316929> can i dm you?

technically yes, but I won't be able to respond for a while

#

it's best to put questions here so you're not dependent on one person

arctic crown Oct 5, 2021, 5:09 PM

#

ok

#

can someoe please explain

#

#

the for loops and why i need them

serene scaffold Oct 5, 2021, 5:10 PM

#

it's easier if you give the code as text

#

!code

arctic wedgeBOT Oct 5, 2021, 5:10 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

serene scaffold Oct 5, 2021, 5:11 PM

#

what are you trying to do, anyway?

arctic crown Oct 5, 2021, 5:11 PM

#

learning tf

#


# Load dataset.
dftrain = pd.read_csv('https://storage.googleapis.com/tf-datasets/titanic/train.csv') # training data
dfeval = pd.read_csv('https://storage.googleapis.com/tf-datasets/titanic/eval.csv') # testing data
# print(dftrain.head()) shows the first 5 lines/entries in the traning dataset
y_train = dftrain.pop('survived')
y_eval = dfeval.pop('survived')

CATEGORICAL_COLUMNS = ['sex', 'n_siblings_spouses', 'parch', 'class', 'deck', 'embark_town', 'alone']
NUMERIC_COLUMNS = ['age', 'fare']

feature_columns = []
for feature_name in CATEGORICAL_COLUMNS:
  vocabulary = dftrain[feature_name].unique()  # gets a list of all unique values from given feature column
  feature_columns.append(tf.feature_column.categorical_column_with_vocabulary_list(feature_name, vocabulary))

for feature_name in NUMERIC_COLUMNS:
  feature_columns.append(tf.feature_column.numeric_column(feature_name, dtype=tf.float32))

print(feature_columns)```

desert oar Oct 5, 2021, 5:14 PM

#

@arctic crown the first for loop builds up a list of all the text features, the second loop appends all the numerical features to that list

arctic crown Oct 5, 2021, 5:14 PM

#

whats does this do?

#

tf.feature_column.categorical_column_with_vocabulary_list(feature_name, vocabulary)```

#

and this

#

tf.feature_column.numeric_column(feature_name, dtype=tf.float32)```

solar oar Oct 5, 2021, 5:27 PM

#

@keen fable DM me to join a neural network discussion server !

keen fable Oct 5, 2021, 5:31 PM

#

solar oar <@!421362214558105611> DM me to join a neural network discussion server !

Oh you son of a gun! I’m in 😎

arctic crown Oct 5, 2021, 5:32 PM

#

please help

#

whats a feature_column in tensorflow?

quiet vault Oct 5, 2021, 10:11 PM

#

vale hedge Would you happen to be accidentally scaling the Y some how as well? It sounds l...

Yeah I am, even if I don't scale it, it predicts much higher than the output should be

serene scaffold Oct 5, 2021, 11:10 PM

#

Has anyone worked with PySpark? This isn't me trying to field a question. I'm just going to be learning how to use it for work and I've gathered that the API design is a bit controversial.

#

From what I understand, it's a platform for distributing AI stuff?

pseudo wren Oct 5, 2021, 11:16 PM

#

I am currently in a data science bootcamp and am relatively new at python

#

i feel like i have no issue reading code or understanding it when someone is explaining

#

but writing is my issue

#

i completely blank when i have to do it on my own

#

but i get it when it's in my face

#

my other classmates in my group have a bit more experience than i do

#

and i really wanna catch up

serene scaffold Oct 5, 2021, 11:19 PM

#

@pseudo wren can you give me an example of something you were asked to do where you were completely blank?

pseudo wren Oct 5, 2021, 11:19 PM

#

well for example today

#

they wanted us to create a lambda function

#

of a list of our classmates

#

and randomize them into groups

#

when asked to write it, i blanked a little and felt like i didn't know the first place to start

#

i usually understand it when i'm reading it

#

or someone is explaining

#

but i just

#

blank

serene scaffold Oct 5, 2021, 11:20 PM

#

I don't quite understand why a lambda would need to be part of that

pseudo wren Oct 5, 2021, 11:21 PM

#

they're just asking us to do it today as an assignment

#

not totally sure myself

serene scaffold Oct 5, 2021, 11:21 PM

#

suppose you had a list of 15 strings, where each string is a name of a classmate. How would you make that into three lists of five classmates each, where each list is random?

#

(so not the first five, then the next five, then the next five)

#

the answer involves import random

pseudo wren Oct 5, 2021, 11:22 PM

#

well yeah that part i get

#

first we import random

#

we create a list name

#

and then create a list with strings inside that have the class names in it

serene scaffold Oct 5, 2021, 11:22 PM

#

you can just type the solution into the chat, letting students be the list of strings.

serene scaffold Oct 5, 2021, 11:22 PM

#

pseudo wren and then create a list with strings inside that have the class names in it

the class names?

pseudo wren Oct 5, 2021, 11:22 PM

#

people in the class

serene scaffold Oct 5, 2021, 11:23 PM

#

ah yes.

pseudo wren Oct 5, 2021, 11:23 PM

#

for example

#

import random

#

class_names = ["Amy", "Adam","Alex",]

#

etc.

#

i can get that far

serene scaffold Oct 5, 2021, 11:24 PM

#

how would you randomly change the order of the elements?

#

!docs random

arctic wedgeBOT Oct 5, 2021, 11:24 PM

#

random

Source code: Lib/random.py

This module implements pseudo-random number generators for various distributions.

For integers, there is uniform selection from a range. For sequences, there is uniform selection of a random element, a function to generate a random permutation of a list in-place, and a function for random sampling without replacement.

On the real line, there are functions to compute uniform, normal (Gaussian), lognormal, negative exponential, gamma, and beta distributions. For generating distributions of angles, the von Mises distribution is available.

pseudo wren Oct 5, 2021, 11:24 PM

#

you can do random.choice

#

i think

serene scaffold Oct 5, 2021, 11:26 PM

#

hmm, what does random.choice do?

pseudo wren Oct 5, 2021, 11:26 PM

#

it randomizes the names in the list

#

and picks one at random

serene scaffold Oct 5, 2021, 11:27 PM

#

it just picks one at random, it doesn't "randomize the names"

pseudo wren Oct 5, 2021, 11:27 PM

#

it picks one at random

#

the issue is

#

i feel like i have trouble writing this

#

i know what i want to do

#

but i have a lot of trouble actually getting there

serene scaffold Oct 5, 2021, 11:32 PM

#

pseudo wren i know what i want to do

so, what do you want to do?

pseudo wren Oct 5, 2021, 11:33 PM

#

well i want to randomize the list and pull a random name out

#

i have provided the list

#

and imported the random module

#

but i get lost in the syntax ig

serene scaffold Oct 5, 2021, 11:35 PM

#

pseudo wren well i want to randomize the list and pull a random name out

if you pull a random name out with random.choice, does that remove it from the list?

#

keep in mind that "pull" doesn't have a formal meaning here. We would usually say "select" or "pop".

pseudo wren Oct 5, 2021, 11:39 PM

#

No it doesn’t remove it from the list

#

It just makes it the output

serene scaffold Oct 5, 2021, 11:46 PM

#

pseudo wren No it doesn’t remove it from the list

so, if you use random.choice, do you understand why this wouldn't solve what you're trying to do?

pseudo wren Oct 5, 2021, 11:50 PM

#

The random choice bit was part of a larger problem

#

Let me see if I can explain

#

We created a list that has the names of everyone in the class

#

And we wanted it to randomize it

#

After we randomized it

#

We wanted to remove a name from the list

#

After it had already been chosen

serene scaffold Oct 5, 2021, 11:53 PM

#

pseudo wren And we wanted it to randomize it

what do you mean by randomize it?

pseudo wren Oct 6, 2021, 12:04 AM

#

@serene scaffold basically we make a random choice

serene scaffold Oct 6, 2021, 12:04 AM

#

no

pseudo wren Oct 6, 2021, 12:04 AM

#

to assign someone as group leader

#

here i'll show the code

#

team_1 = []
team_2 = []
team_3 = []

while len(team_1) < 6:
team_1.append(random.choice(list_of_names))
for name in list_of_names:
if name in team_1:
list_of_names.remove(name)

while len(team_2) < 6:
team_2.append(random.choice(list_of_names))
for name in list_of_names:
if name in team_2:
list_of_names.remove(name)

while len(team_3) < 6:
team_3.append(random.choice(list_of_names))
for name in list_of_names:
if name in team_3:
list_of_names.remove(name)

team_4 = list_of_names
print(team_1)
print(team_2)
print(team_3)
print(team_4)

velvet thorn Oct 6, 2021, 12:17 AM

#

pseudo wren here i'll show the code

!code

arctic wedgeBOT Oct 6, 2021, 12:17 AM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

pseudo wren Oct 6, 2021, 12:18 AM

#

velvet thorn !code

thank u

velvet thorn Oct 6, 2021, 12:18 AM

#

👋

serene scaffold Oct 6, 2021, 12:44 AM

#

@pseudo wren

import random

random.shuffle(list_of_names)

team_1 = list_of_names[:5]
team_2 = list_of_names[5:10]
team_3 = list_of_names[10:]

pseudo wren Oct 6, 2021, 12:45 AM

#

i tried that but my group didn't think it was a good idea

serene scaffold Oct 6, 2021, 12:45 AM

#

why not

pseudo wren Oct 6, 2021, 12:55 AM

#

Don’t know!

#

Idk I have a lot more practice to do

#

But for rn I’m feeling pretty defeated

serene scaffold Oct 6, 2021, 12:56 AM

#

pseudo wren But for rn I’m feeling pretty defeated

I wouldn't worry about it. just keep coding and it will come together

pseudo wren Oct 6, 2021, 12:56 AM

#

Eh it’s week 4

serene scaffold Oct 6, 2021, 12:56 AM

#

the worst code you'll ever see will be your own

pseudo wren Oct 6, 2021, 12:56 AM

#

And I’m not feeling too much closer