distant needle Mar 11, 2021, 4:10 AM

#

Tkinter is what it is. I don't have a problem with Tkinter. More so, I have a problem with the complete and utter lack of clarity managing these figure objects with matplotlib. Quite infuriating there isn't a simple ".destroy()" method to delete the associated figure object immediately

misty flint Mar 11, 2021, 4:11 AM

#

as far as i remember, there isnt

#

pithink

distant needle Mar 11, 2021, 4:11 AM

#

yaeh big wtf pikachu face right now

misty flint Mar 11, 2021, 4:11 AM

#

DoggoKek
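On the figure-cleanup complaint above: pyplot does offer `plt.close(fig)`, which removes a figure from pyplot's registry so it can be garbage-collected. A minimal sketch, assuming the figure was created through `pyplot`; the `Agg` backend is chosen here only so the snippet runs without a display. A figure embedded in Tkinter would probably also need its canvas widget destroyed, which is not shown.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend so the sketch runs anywhere
import matplotlib.pyplot as plt

fig, ax = plt.subplots()
ax.plot([1, 2, 3], [1, 4, 9])
fig_number = fig.number

existed_before = plt.fignum_exists(fig_number)
plt.close(fig)  # unregister the figure from pyplot
exists_after = plt.fignum_exists(fig_number)

print(existed_before, exists_after)
```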

lapis sequoia Mar 11, 2021, 4:18 AM

#

does anyone know basic machine learning ?

wheat island Mar 11, 2021, 4:19 AM

#

i made pytorch

exotic maple Mar 11, 2021, 4:19 AM

#

lapis sequoia does anyone know basic machine learning ?

define basic

uncut orbit Mar 11, 2021, 4:19 AM

#

i do

wheat island Mar 11, 2021, 4:19 AM

#

uncut orbit i do

what lib and where u learn it

uncut orbit Mar 11, 2021, 4:19 AM

#

sklearn

wheat island Mar 11, 2021, 4:19 AM

#

what lib

uncut orbit Mar 11, 2021, 4:19 AM

#

bit of tensorflow

wheat island Mar 11, 2021, 4:19 AM

#

o

#

is tensorflow hard

uncut orbit Mar 11, 2021, 4:20 AM

#

not too much

wheat island Mar 11, 2021, 4:20 AM

#

o

#

on god?

#

i quit b4 i even started when i looked at the docs

uncut orbit Mar 11, 2021, 4:20 AM

#

lmao

#

it can be complicated

#

don't look at the docs

#

look online

wheat island Mar 11, 2021, 4:20 AM

#

m

uncut orbit Mar 11, 2021, 4:20 AM

#

for some code

wheat island Mar 11, 2021, 4:21 AM

#

ur online

#

teach me tensorlflow pl z

uncut orbit Mar 11, 2021, 4:21 AM

#

and its meaning

lapis sequoia Mar 11, 2021, 4:21 AM

#

FeelsThinkingMan

uncut orbit Mar 11, 2021, 4:21 AM

#

i hv this notebook i wrote

#

in colab

lapis sequoia Mar 11, 2021, 4:21 AM

#

I hate you whoever texted me

wheat island Mar 11, 2021, 4:21 AM

#

LMOOAO

#

noo:(

exotic maple Mar 11, 2021, 4:22 AM

#

lapis sequoia I hate you whoever texted me

bro your name is amazing lol

lapis sequoia Mar 11, 2021, 4:22 AM

#

We all can relate sir

exotic maple Mar 11, 2021, 4:22 AM

#

from parallel_universe_2 import wife_&_kids

Traceback: wife_&_kids not found

uncut orbit Mar 11, 2021, 4:22 AM

#

LMAO

lapis sequoia Mar 11, 2021, 4:23 AM

#

Sad

#

So sad

uncut orbit Mar 11, 2021, 4:23 AM

#

LOVE THE JOKES

exotic maple Mar 11, 2021, 4:23 AM

#

the big sad

lapis sequoia Mar 11, 2021, 4:23 AM

#

Might not be just jokes

uncut orbit Mar 11, 2021, 4:23 AM

#

huh

#

never thought that

lapis sequoia Mar 11, 2021, 4:23 AM

#

MegaKek

uncut orbit Mar 11, 2021, 4:23 AM

#

here

#

i wrote this reference notebook

#

https://colab.research.google.com/drive/13JgTPzQ7kWb8vCsJ2pOmDGJGpYHBWccS#scrollTo=DEhHHibnSbK2

lapis sequoia Mar 11, 2021, 4:24 AM

#

Wow

uncut orbit Mar 11, 2021, 4:28 AM

#

code is valuable

#

its on classification tho

misty flint Mar 11, 2021, 6:12 AM

#

lapis sequoia We all can relate sir

what if i want a bf

#

blobhyperthink

misty flint Mar 11, 2021, 6:12 AM

#

uncut orbit its on classification tho

gj

#

Blob_pat

uncut orbit Mar 11, 2021, 6:32 AM

#

Thx

#

Love my expression on the emoji lmao

harsh trellis Mar 11, 2021, 8:06 AM

#

umm

#

can someone tell me

#

what is the meaning of skewness?

#

how can i observe the skewness, i.e. whether the data is right-skewed or not

restive flare Mar 11, 2021, 9:19 AM

#

harsh trellis what is the meaning of skewness?

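Skewness measures how asymmetric a distribution is: in skewed data the mean is pulled away from the centre, toward the longer tail, instead of sitting where it would in a symmetric distribution such as the normal.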

grave frost Mar 11, 2021, 9:37 AM

#

uncut orbit it can be complicated

its not that bad. IMO its the easiest framework to learn (especially when comparing with PyTorch)

raven knoll Mar 11, 2021, 9:49 AM

#

Hey, I am new to datascience. I am currently following a minor about bigData and I have a project, but I cannot figure out how I need to solve this problem. I have currently only learned the basics like pandas, webscraping and a bit of classifiers like KNN.

My project is to create a program that can predict if a hotel review is positive or negative. Every row of data I have has a positive and negative review, but I don't know how to start. Can anyone help me out?

primal tulip Mar 11, 2021, 10:10 AM

#

Why not use pd.Series() instead?

harsh trellis Mar 11, 2021, 10:13 AM

#

restive flare Skew means the avg or mean of data is shifted from normal position in normal dis...

i see, and how can i identify whether the data is right-skewed or not?
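One way to check the direction of skew is to compute the sample skewness: a positive value suggests a longer right tail (right-skewed), a negative value a longer left tail. A minimal pure-Python sketch, assuming the Fisher-Pearson moment coefficient; the function name `skewness` and the sample data are made up for illustration.

```python
from statistics import fmean

def skewness(values):
    """Fisher-Pearson moment coefficient of skewness (population form)."""
    n = len(values)
    mean = fmean(values)
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

right_skewed = [1, 1, 2, 2, 3, 3, 4, 20]  # one large value stretches the right tail
symmetric = [1, 2, 3, 4, 5]

print(skewness(right_skewed) > 0)
print(abs(skewness(symmetric)) < 1e-12)
```

With pandas, `Series.skew()` reports a similar (bias-corrected) number, and a histogram usually makes the long tail visible at a glance.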

solid kindle Mar 11, 2021, 10:15 AM

#

How can I export a pandas dataframe to the clipboard as text that I could paste into a script as a string and re-import as a pandas dataframe? This is for an MVE.
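One possible approach for a minimal example: serialise the frame to CSV text, paste that text into the script as a string, and read it back through `io.StringIO`. A sketch, assuming simple column types that survive a CSV round trip; the sample frame is invented.

```python
import io
import pandas as pd

df = pd.DataFrame({"name": ["a", "b", "c"], "score": [1, 2, 3]})

# Text that can be copied into a script as a string literal.
csv_text = df.to_csv(index=False)

# Rebuild the frame from the pasted string.
restored = pd.read_csv(io.StringIO(csv_text))

print(csv_text)
```

Printing `df.to_dict("list")` and pasting the result into `pd.DataFrame(...)` is another option for small frames.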

serene dragon Mar 11, 2021, 10:37 AM

#

hi

#

Is there any way to leave a header empty in a Df when i use multiple headers?

#

level_0 will be always present

#

but for level_1 to level_4 i want to have an empty field when there is no name present

serene dragon Mar 11, 2021, 11:09 AM

#

for i, columns_old in enumerate(df_value.columns.levels):
    columns_new = np.where(columns_old.str.contains('Unnamed'), '', columns_old)
    df_value.rename(columns=dict(zip(columns_old, columns_new)), level=i, inplace=True)

#

got it

primal tulip Mar 11, 2021, 11:09 AM

#

serene dragon Is there any way to leave header empty in Df when i use mulitple headers?

Yes, but depends on what you actually want to do and what your data is like. For example, if you're passing only 1 argument at a time, I would pass it as a string then append it at the end of a list.

untold cove Mar 11, 2021, 12:31 PM

#

Can anyone tell me how to add labels/text or a second yaxis from another Col to this: Do you know how I can add this legend or text to my px.bar, need to display the text for each of my values:

def SetColor(y):
    # Map a score (1-9) to a bar color; returns None for scores above 9.
    if y <= 1:
        return "red"
    elif y <= 2:
        return "orange"
    elif y <= 3:
        return "yellow"
    elif y <= 4:
        return "lightgreen"
    elif y <= 6:  # covers scores 5 and 6
        return "green"
    elif y <= 7:
        return "darkgreen"
    elif y <= 8:
        return "silver"
    elif y <= 9:
        return "gold"
def Setlabel(y):
    # Map a score (1-9) to its text label; returns None for scores above 9.
    if y <= 1:
        return "Very low (1)"
    elif y <= 2:
        return "Low (2)"
    elif y <= 3:
        return "Below average (3)"
    elif y <= 4:
        return "Average (4)"
    elif y <= 6:  # covers scores 5 and 6
        return "Average (5, 6)"
    elif y <= 7:
        return "Above average (7)"
    elif y <= 8:
        return "High (8)"
    elif y <= 9:
        return "Very High (9)"


px.bar(filtered_df, x=filtered_df["ID"], y=filtered_df["Score"]).update_traces(marker = dict(color=list(map(SetColor, filtered_df['Score']))))


This doesn’t work:
##update_layout(legend = dict(list(map(Setlabel, SetColor, filtered_df['Score']))))

hoary wigeon Mar 11, 2021, 2:24 PM

#

Hello

#

I need help with SCRAPING,

I manually saved the webpage in html format from browser.
Im able to retrieve dataframe from MANUALLY SAVED method.

I also saved the webpage in html format using the requests module.
But I cannot retrieve a dataframe from that.

Both look the same, but I don't know why this happens.
Thanks if you can help me.

austere swift Mar 11, 2021, 2:34 PM

#

how do you want to get a dataframe from the html

#

is there a table in the webpage?

broken stratus Mar 11, 2021, 2:49 PM

#

depends on u scrape

#

*depends on how u scrape

broken stratus Mar 11, 2021, 2:50 PM

#

hoary wigeon I need help with SCRAPING, I manually saved the webpage in html format from bro...

it isnt that easy to make a dataframe from the table....u need to carefully examine the html corpus to make the dataframe

hoary wigeon Mar 11, 2021, 2:57 PM

#

hold on,

it was very easy with pandas to read_html(mannualysave.html)
but not with request method

grave frost Mar 11, 2021, 3:38 PM

#

untold cove Can anyone tell me how to add labels/text or a second yaxis from another Col to ...

that's probably the most un-pythonic and inefficient way to store something. A dict with keys being numbers and values strings would have served you better
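A rough sketch of the dict-based lookup being suggested, assuming scores are whole numbers from 1 to 9 as in the `SetColor`/`Setlabel` chains above; `SCORE_STYLE` and `style_for` are hypothetical names.

```python
# Map each integer score to its (color, label) pair, mirroring the if/elif chains.
SCORE_STYLE = {
    1: ("red", "Very low (1)"),
    2: ("orange", "Low (2)"),
    3: ("yellow", "Below average (3)"),
    4: ("lightgreen", "Average (4)"),
    5: ("green", "Average (5, 6)"),
    6: ("green", "Average (5, 6)"),
    7: ("darkgreen", "Above average (7)"),
    8: ("silver", "High (8)"),
    9: ("gold", "Very High (9)"),
}

def style_for(score):
    """Return (color, label) for a score, or None when the score is out of range."""
    return SCORE_STYLE.get(score)

scores = [1, 5, 9]
colors = [style_for(s)[0] for s in scores]
labels = [style_for(s)[1] for s in scores]
print(colors, labels)
```

Unlike the `<=` chains, a plain dict only matches exact keys, so fractional scores would need rounding first.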

untold cove Mar 11, 2021, 3:39 PM

#

@grave frost have you used PlotlyExpress?

grave frost Mar 11, 2021, 3:39 PM

#

untold cove @grave frost have you used PlotlyExpress?

no, I have no idea about that. I was just advising you to save time and effort by using data structures

untold cove Mar 11, 2021, 3:41 PM

#

It doesn’t behave the same, take this for example:


=dict(list(Setlabel(bro))))
ValueError: dictionary update sequence element #0 has length 1; 2 is required.

That was with a basic data structure.’bro’

grave frost Mar 11, 2021, 3:41 PM

#

you can try by using an example dict with sample data to better explain your issue here

#

or use a help-channel

untold cove Mar 11, 2021, 3:42 PM

#

I got it working with the function anyway, doing it under the color trace not by updating traces. Haven’t managed to work out the second y axis tho or verify the data but I’m assuming adding the second y as a trace will resolve it.

#

Trust me a basic dict and values was the first thing I tried with the df

misty flint Mar 11, 2021, 3:47 PM

#

i feel like every time i work with numpy i end up having to reshape something

#

MercedesGun

hollow sentinel Mar 11, 2021, 3:48 PM

#

that is numpy for you

misty flint Mar 11, 2021, 3:54 PM

#

blobtableflip

undone lotus Mar 11, 2021, 3:59 PM

#

Hello, wanted to know if I could get some assistance in using dynamic pivot in a query using TSQL. Please let me know if this is the right channel to ask this question or point me to the correct channel

misty flint Mar 11, 2021, 4:00 PM

#

#databases is your best bet

undone lotus Mar 11, 2021, 4:01 PM

#

misty flint #databases is your best bet

thank you

misty flint Mar 11, 2021, 4:11 PM

#

ah yes, high degree polynomial regression, the ability to make anything fit anything

#

DoggoKek

misty flint Mar 11, 2021, 4:27 PM

#

funfetti-flavored

#

partyparrot

keen gull Mar 11, 2021, 4:39 PM

#

Hi, Im programming a mini mathematics/scientific dice rolling game

#

its only a couple lines but I want it to be able to count the median of the sums

#

if someone dms i would be ecstatic

hollow sentinel Mar 11, 2021, 4:40 PM

#

just post the code here

#

we will all take a look

keen gull Mar 11, 2021, 4:52 PM

#

okay, code is in swedish though

#

import random
antalForsok = int(input("Hur många gånger vill du kasta?"))
tarningsSumma = 0
for n in range(0,antalForsok):
    tarning1 = random.randint(1,6)
    tarning2 = random.randint(1,6)
    forsokSumma = tarning1 + tarning2
    tarningsSumma += forsokSumma
    print(tarning1,tarning2,forsokSumma,"\t",tarningsSumma)
print("Summan är",tarningsSumma)

#

if you run this in python, it will ask you how many times do you want to roll

#

and you can choose a number and it will give you the total sum

#

what if I want it to give the sum divided by the number chosen

#

tarning means dice

#

summa is sum

grave frost Mar 11, 2021, 4:58 PM

#

!e

import random
antalForsok = int(input("Hur många gånger vill du kasta?"))
tarningsSumma = 0
for n in range(0,antalForsok):
 tarning1 = random.randint(1,6) 
 tarning2 = random.randint(1,6)
 forsokSumma = tarning1 + tarning2
 tarningsSumma += forsokSumma
 print(tarning1,tarning2,forsokSumma,"\t",tarningsSumma)
print("Summan är",tarningsSumma)

arctic wedgeBOT Mar 11, 2021, 4:58 PM

#

You are not allowed to use that command here. Please use the #bot-commands channel instead.

keen gull Mar 11, 2021, 5:10 PM

#

anything @hollow sentinel ?

arctic wedgeBOT Mar 11, 2021, 5:10 PM

#

Hey @thin remnant!

It looks like you tried to attach file type(s) that we do not allow (.csv). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a.

Feel free to ask in #community-meta if you think this is a mistake.

thin remnant Mar 11, 2021, 5:11 PM

#

I'm having a csv file that has 32 cols

#

but all data is in the first col

#

how to separate it over the cols
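A common cause is a delimiter other than a comma (for example `;`), so a parser expecting commas puts each whole row into one column. A small standard-library sketch, assuming that is what happened here; the sample text is invented.

```python
import csv
import io

raw = "id;name;score\n1;alice;7\n2;bob;9\n"  # invented sample, semicolon-delimited

# Parsed with the default comma delimiter, each row collapses into one field.
as_commas = list(csv.reader(io.StringIO(raw)))

# Naming the real delimiter splits each row into its columns.
as_semicolons = list(csv.reader(io.StringIO(raw), delimiter=";"))

print(as_commas[0])
print(as_semicolons[0])
```

With pandas the equivalent would be passing the delimiter through `sep`, e.g. `pd.read_csv(path, sep=";")`, where `path` stands for the file in question.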

misty flint Mar 11, 2021, 5:21 PM

#

open with excel

#

@keen gull

import random
antalForsok = int(input("Hur många gånger vill du kasta?"))
tarningsSumma = 0
for n in range(0,antalForsok):
 tarning1 = random.randint(1,6) 
 tarning2 = random.randint(1,6)
 medelSumma = (tarning1 + tarning2)/antalForsok
 tarningsSumma += medelSumma
 print(tarning1,tarning2,medelSumma,"\t",tarningsSumma)
print("Genomsnittet är",tarningsSumma)

#

idk swedish so i used google translate

#

sorry if the words are wrong

#

lol

keen gull Mar 11, 2021, 5:25 PM

#

ur a god bro wtf

hasty mountain Mar 11, 2021, 5:25 PM

#

Guys, I have an excel file that got many spread sheets. One shows info about "Win rate with X character", other has data about "Win rate when X character follows Y route", and so on. They have different lengths. Can I just make a DataFrame with them? Or would it be better to just separate them all in different files and use them as different datasets?

keen gull Mar 11, 2021, 5:25 PM

#

can I dm u if i need more help?

misty flint Mar 11, 2021, 5:25 PM

#

keen gull ur a god bro wtf

its just this line lol

medelSumma = (tarning1 + tarning2)/antalForsok

#

uhh ill probably be busy later so just post here

#

DoggoKek

hasty mountain Mar 11, 2021, 5:26 PM

#

hasty mountain Guys, I have an excel file that got many spread sheets. One shows info about "Wi...

(I want to use them to make predictions using machine learning)

keen gull Mar 11, 2021, 5:26 PM

#

okay thank you ❤️

misty flint Mar 11, 2021, 5:27 PM

#

np

#

ok_handbutflipped

tidal bough Mar 11, 2021, 5:32 PM

#

hasty mountain Guys, I have an excel file that got many spread sheets. One shows info about "Wi...

I'd make a dataframe per. It's weird to have different data in one DF.

hasty mountain Mar 11, 2021, 5:33 PM

#

tidal bough I'd make a dataframe per. It's weird to have different data in one DF.

I see. So do I just have to use each one as a different dataset? What should I do if I want to train my algorithm to predict a mix of possibilities?

#

For example: If I got a dataset that shows "Win rate when playing with X character"
and another that shows "Win rate when using Y item"
What if I want to predict "Win rate when playing with X character and using Y item"?

tidal bough Mar 11, 2021, 5:34 PM

#

You can make a single dataset out of them if you can figure out how to make them all, well, the same kind of data

hasty mountain Mar 11, 2021, 5:35 PM

#

I see

tidal bronze Mar 11, 2021, 5:37 PM

#

yo pandas is struggling to read a csv file with just under half a million rows, is that normal?

#

It did issue me a warning about mixed dtypes but I then specified them

keen gull Mar 11, 2021, 5:38 PM

#

@misty flint it didnt work, its still only showing the result

#

import random
antalForsok = int(input("Hur många gånger vill du kasta?"))
tarningsSumma = 0
for n in range(0,antalForsok):
    tarning1 = random.randint(1,6)
    tarning2 = random.randint(1,6)
    medelSumma = (tarning1 + tarning2)/antalForsok
    tarningsSumma += medelSumma
    print(tarning1,tarning2,medelSumma,"\t",tarningsSumma)
print("Genomsnittet är",tarningsSumma)

misty flint Mar 11, 2021, 5:40 PM

#

ah i forgot to change the last line

keen gull Mar 11, 2021, 5:40 PM

#

i changed it

#

to medelSumma

misty flint Mar 11, 2021, 5:40 PM

#

yeah

#

thats it

keen gull Mar 11, 2021, 5:40 PM

#

but its a bit off, by like 0.15

misty flint Mar 11, 2021, 5:41 PM

#

pithink

keen gull Mar 11, 2021, 5:41 PM

#

hmm

#

its off depending on the number, not constant

misty flint Mar 11, 2021, 5:42 PM

#

probably this line

tarningsSumma += medelSumma

#

not at my comp anymore so i cant check lol

keen gull Mar 11, 2021, 5:44 PM

#

hmm i tried changing to only +, only =, only - and *

#

none worked

#

wait i just realized that it now shows numbers with decimals? its a dice roll so it cant be anything other than the numbers 1-6

misty flint Mar 11, 2021, 5:47 PM

#

keen gull import random antalForsok = int(input("Hur många gånger vill du kasta?")) tarnin...

print(tarning1,tarning2,medelSumma,"\t",medelSumma)

misty flint Mar 11, 2021, 5:48 PM

#

keen gull wait i just realized that it now shows numbers with decimals? its a dice roll so...

but you wanted the average right? not really possible to get average without decimals

#

the "Genomsnittet"

keen gull Mar 11, 2021, 5:48 PM

#

yes the average can be in decimals but not the actual results

#

but when it shows u the sequence e.g 2 5 8, it shows now 2 5 1.3

misty flint Mar 11, 2021, 5:49 PM

#

i think i dont understand the question. sorry bud

keen gull Mar 11, 2021, 5:49 PM

#

[screenshot attachment]

#

for example, 1.6 is fine for it to have decimals

#

but here, you can see in line 3 that it says 5 6 2.2, that means the dice rolled 2.2, which isnt possible

misty flint Mar 11, 2021, 5:52 PM

#

your code is definitely different than mine i think lol

#

oh wait i see the problem

keen gull Mar 11, 2021, 5:52 PM

#

I just want it to show the median at the results

misty flint Mar 11, 2021, 5:52 PM

#

i misunderstood the initial problem

keen gull Mar 11, 2021, 5:52 PM

#

its the last line right_

#

?

#

that should be edited

#

thats my fault, im not that good at explaining

misty flint Mar 11, 2021, 5:54 PM

#

keen gull I just want it to show the median at the results

you mean average right?

keen gull Mar 11, 2021, 5:54 PM

#

I was thinking:
print("Genomsnittet är", (tarning1 + tarning2)/antalForsok)

#

yes average

misty flint Mar 11, 2021, 5:54 PM

#

import random
antalForsok = int(input("Hur många gånger vill du kasta?"))
tarningsSumma = 0
for n in range(0,antalForsok):
    tarning1 = random.randint(1,6)
    tarning2 = random.randint(1,6)
    medelSumma = (tarning1 + tarning2)/2
    tarningsSumma += medelSumma
    print(tarning1,tarning2,medelSumma,"\t",medelSumma)
print("Genomsnittet är",medelSumma)

keen gull Mar 11, 2021, 5:54 PM

#

the result divided by the number the person chose

misty flint Mar 11, 2021, 5:54 PM

#

oh man this is hard to do on mobile lol

keen gull Mar 11, 2021, 5:55 PM

#

im having trouble on a laptop xD

misty flint Mar 11, 2021, 5:55 PM

#

keen gull the result divided by the number the person chose

wait now i definitely dont understand. dont think that code will work for what you are looking for

#

pithink

keen gull Mar 11, 2021, 5:55 PM

#

let me try to formulate it

#

import random
antalForsok = int(input("Hur många gånger vill du kasta?"))
tarningsSumma = 0
for n in range(0,antalForsok):
    tarning1 = random.randint(1,6)
    tarning2 = random.randint(1,6)
    forsokSumma = tarning1 + tarning2
    tarningsSumma += forsokSumma
    print(tarning1,tarning2,forsokSumma,"\t",tarningsSumma)
print("Summan är",tarningsSumma)

#

the original code shows the sum of all the dices you rolled

#

so if u chose 3 and got 1, 2, and 3 you would get the sum 6

#

i want it to take 6 and divided it by 3, the number chosen

#

which should give 2
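What is being asked for can be sketched as: keep the running total in `tarningsSumma` exactly as in the original code, and divide by `antalForsok` once, after the loop. The sketch below wraps this in a function and seeds the generator so it runs without typing any input; `kasta` and `seed` are illustrative names, not part of the original script.

```python
import random

def kasta(antalForsok, seed=None):
    """Roll two dice antalForsok times; return (total, average per attempt)."""
    rng = random.Random(seed)
    tarningsSumma = 0
    for _ in range(antalForsok):
        tarning1 = rng.randint(1, 6)
        tarning2 = rng.randint(1, 6)
        tarningsSumma += tarning1 + tarning2
    # Divide once, after the loop, so the individual rolls stay whole numbers.
    return tarningsSumma, tarningsSumma / antalForsok

summa, genomsnitt = kasta(5, seed=1)
print("Summan är", summa)
print("Genomsnittet är", genomsnitt)
```

Dividing inside the loop is what makes the per-roll printout show decimals; dividing the final total keeps the rolls as integers and only the average fractional.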

grand monolith Mar 11, 2021, 5:57 PM

#

dudes im having a slight problem, I have a (10, 1561) feature matrix but when i do feature[i] its shape is (1561, )

misty flint Mar 11, 2021, 5:57 PM

#

keen gull i want it to take 6 and divided it by 3, the number chosen

for this part just write another line of

x = forsokSumma/antalForsok
print(×)

#

then place x wherever you want it

keen gull Mar 11, 2021, 5:58 PM

#

wdym place x wherever?, I just started coding 2 days ago sorry xD

misty flint Mar 11, 2021, 5:59 PM

#

uhh just place it at the end and you will see what im talking about

#

print(x)

grand monolith Mar 11, 2021, 5:59 PM

#

anyone here good at numpy?

misty flint Mar 11, 2021, 6:00 PM

#

that x is different than the x above it

#

ID_BoomKek

keen gull Mar 11, 2021, 6:01 PM

#

LMFAO HOW

#

omg i see it

#

[screenshot attachment]

#

36/5 isnt 1.8 unfortunately 😭

misty flint Mar 11, 2021, 6:02 PM

#

ID_BoomKek

keen gull Mar 11, 2021, 6:02 PM

#

PLSSS im a nooob

#

🤣

misty flint Mar 11, 2021, 6:02 PM

#

ah you want THAT number

keen gull Mar 11, 2021, 6:02 PM

#

yeah xD

#

what is THAT in english xD

misty flint Mar 11, 2021, 6:03 PM

#

change it to

x = tarningsSumma/antalForsok

keen gull Mar 11, 2021, 6:04 PM

#

now it only shows sum

misty flint Mar 11, 2021, 6:04 PM

#

grand monolith dudes im having a slight problem, I have a (10, 1561) feature matrix but when i ...

have you tried reshaping it
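Indexing a 2-D array with a single integer drops that axis, which is why `feature[i]` comes back with shape `(1561,)`. A sketch of a few ways to keep the row two-dimensional, assuming a NumPy array of the shape mentioned above (filled with zeros here):

```python
import numpy as np

feature = np.zeros((10, 1561))
i = 3

row_1d = feature[i]                        # shape (1561,): the first axis is dropped
row_slice = feature[i:i + 1]               # shape (1, 1561): slicing keeps the axis
row_reshaped = feature[i].reshape(1, -1)   # shape (1, 1561)
row_newaxis = feature[i][np.newaxis, :]    # shape (1, 1561)

print(row_1d.shape, row_slice.shape)
```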

keen gull Mar 11, 2021, 6:04 PM

#

[screenshot attachment]

misty flint Mar 11, 2021, 6:04 PM

#

you deleted

print(x)

#

ID_BoomKek

keen gull Mar 11, 2021, 6:05 PM

#

LMFAOO

misty flint Mar 11, 2021, 6:05 PM

#

ID_BoomKek

keen gull Mar 11, 2021, 6:05 PM

#

FINALLY

#

HOLY SHIT

misty flint Mar 11, 2021, 6:06 PM

#

Praise

#

ID_BoomKek

keen gull Mar 11, 2021, 6:06 PM

#

so all I had to do was add those two lines?

misty flint Mar 11, 2021, 6:07 PM

#

yes, you can even change the last one to

#

print("Genomsnittet är", x)

keen gull Mar 11, 2021, 6:09 PM

#

can i kiss u

misty flint Mar 11, 2021, 6:10 PM

#

DoggoKek

#

np dude

#

ok_handbutflipped

keen gull Mar 11, 2021, 6:12 PM

#

how can i make an empty space between the sum and the average thingy

#

between the last two

misty flint Mar 11, 2021, 6:12 PM

#

just do

print()
in between the two lines

keen gull Mar 11, 2021, 6:12 PM

#

oh makes sense

misty flint Mar 11, 2021, 6:13 PM

#

ok_handbutflipped

keen gull Mar 11, 2021, 6:14 PM

#

oh i love you bro

misty flint Mar 11, 2021, 6:16 PM

#

DoggoKek

astral path Mar 11, 2021, 6:23 PM

#

hey y'all im gonna be doing a soundcloud network analysis project to recommend new users to listeners and I have to combine it with another dataset for an assignment. Any ideas for what I could combine it with for some interesting insights?

uncut orbit Mar 11, 2021, 6:24 PM

#

whats in ur first dataset

#

the columns

keen gull Mar 11, 2021, 6:24 PM

#

@misty flint dont kill me but...

#

import random
antalForsok = int(input("Hur många gånger vill du kasta?"))
raknare = 0
for n in range(0, antalForsok):
    tarning1 = random.randint(1,6)
    tarning2 = random.randint(1,6)
    forsokSumma = tarning1 + tarning2
    if forsokSumma == 9:
        raknare += 1
    print(tarning1, tarning2, forsokSumma, "\t", raknare)
print("Antal gånger summan 9 slogs:", raknare)

#

this is a new code

#

and i wanna make it find the likelihood of getting the sum 15

astral path Mar 11, 2021, 6:25 PM

#

uncut orbit whats in ur first dataset

I don't know yet but it should be things like genre of songs, other artists an artist is following/who follows them, songs #, song length, external social media links, etc...

uncut orbit Mar 11, 2021, 6:26 PM

#

popularity?

#

of the song out of ten

astral path Mar 11, 2021, 6:26 PM

#

that will be one metric yea

#

ah

#

yeah i could do that

#

the assignment needs an external dataset though, nothing i could scrape from soundcloud

uncut orbit Mar 11, 2021, 6:26 PM

#

hmm

astral path Mar 11, 2021, 6:27 PM

#

im thinking i could combine it with data i scrape from their twitter links in their bios but not many artists do that

uncut orbit Mar 11, 2021, 6:27 PM

#

prolly not

#

how big is the data set

#

the rows

astral path Mar 11, 2021, 6:28 PM

#

i have no idea yet, im planning out the architecture of the project first

uncut orbit Mar 11, 2021, 6:28 PM

#

oh

#

i dont think i should be doing this much cuz of the rules

#

astral path Mar 11, 2021, 6:30 PM

#

Hmm im not sure which part of that applies?

uncut orbit Mar 11, 2021, 6:30 PM

#

rule 5 sorry

astral path Mar 11, 2021, 6:30 PM

#

Ah ok this isn't a solution, this part is ungraded

#

My prof only grades on our actual analysis

uncut orbit Mar 11, 2021, 6:31 PM

#

oh

#

then lets see

#

i cant think

astral path Mar 11, 2021, 6:31 PM

#

Same im having a hard time w this

uncut orbit Mar 11, 2021, 6:32 PM

#

maybe loudness of the song?

#

beats per measure?

#

whats your target column

keen gull Mar 11, 2021, 6:33 PM

#

guys, what is it called when you say "what two numbers added together ranging from 1-12 are equal to 9"

#

how do you turn that into a python line or what is it even called mathematically

#

so for example, 1 and 8, 2 and 7, 3 and 6, 4 and 5
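Mathematically this is counting the outcomes in a sample space that satisfy a condition; the probability is the number of favourable outcomes divided by the total number of outcomes. A brute-force sketch for two dice, with `pairs_with_sum` and `probability_of_sum` as made-up helper names:

```python
from fractions import Fraction
from itertools import product

def pairs_with_sum(target, sides=6):
    """All ordered (die1, die2) outcomes whose sum equals target."""
    faces = range(1, sides + 1)
    return [(a, b) for a, b in product(faces, repeat=2) if a + b == target]

def probability_of_sum(target, sides=6):
    """Exact probability of rolling the target sum with two dice."""
    return Fraction(len(pairs_with_sum(target, sides)), sides * sides)

print(pairs_with_sum(9))        # [(3, 6), (4, 5), (5, 4), (6, 3)]
print(probability_of_sum(9))    # 1/9
print(probability_of_sum(15))   # 0
```

With two ordinary six-sided dice the sum can never exceed 12, so the chance of 15 is zero. Pairs like 1 and 8 only appear with a wider range, e.g. `pairs_with_sum(9, sides=12)`.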

astral path Mar 11, 2021, 6:35 PM

#

uncut orbit whats your target column

what do you mean?

uncut orbit Mar 11, 2021, 6:47 PM

#

your target column is what you would define as y...its what you are trying to predict

astral path Mar 11, 2021, 6:50 PM

#

ah ok

#

it would be other similar users

uncut orbit Mar 11, 2021, 6:55 PM

#

so on that train of thought

#

what do other users do?

astral path Mar 11, 2021, 6:59 PM

#

they're all music artists with the same feature types and stuff

grave frost Mar 11, 2021, 7:58 PM

#

astral path hey y'all im gonna be doing a soundcloud network analysis project to recommend n...

you have to predict listener to artist or vice versa?

astral path Mar 11, 2021, 8:04 PM

#

grave frost you have to predict listener to artist or vice versa?

i'm just trying to cluster artists together

#

so like given an artist as input, it would produce a list of artists who are related

#

and i'm looking for ways to implement external datasets in here

grave frost Mar 11, 2021, 8:10 PM

#

astral path and i'm looking for ways to implement external datasets in here

genre would be the closest one then. do you anything about music?

astral path Mar 11, 2021, 8:11 PM

#

do i anything about music?

grave frost Mar 11, 2021, 8:15 PM

#

astral path do i anything about music?

I can't identify your tone 🤷 but If I were you, I would use the time key signature, bpm, and indices of rests in songs to create features and cluster artists via them. (you could take median of that most prob, average doesn't seem appropriate)

#

or you could construct an artificial feature concatenating indices of rest and their distance along with time key in a formula and use that

astral path Mar 11, 2021, 8:25 PM

#

hey sorry i'm in a zoom but i'll be back in a sec

fiery maple Mar 11, 2021, 8:47 PM

#

Anyone using Kedro here?

umbral raptor Mar 11, 2021, 8:49 PM

#

Hello all, nice to meet you.

fiery maple Mar 11, 2021, 8:49 PM

#

Hhahahha an impostor

umbral raptor Mar 11, 2021, 8:52 PM

#

fiery maple Hhahahha an impostor

Just a coincidence that happens easily here. I use the same username on gaming and social servers, where no one understands it.

fiery maple Mar 11, 2021, 8:53 PM

#

Yes, I think if we search BackPropa* on the usernames, thousands of them would appear on results

umbral raptor Mar 11, 2021, 8:59 PM

#

@fiery maple So Kedro? No I don't feel comfortable with this type of frameworks

lavish tundra Mar 11, 2021, 10:37 PM

#

someone know how to set gridline color on a seaborn lineplot graphic?

exotic maple Mar 11, 2021, 10:39 PM

#

you can always find the matplotlib object directly

#

i think for grid it was... plt.gca().grid(color=...)
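One way that should work: `sns.lineplot` returns a matplotlib `Axes`, so the grid can be styled through `Axes.grid`. A minimal sketch using plain matplotlib to stay self-contained; with seaborn, `ax` would instead be the return value of `sns.lineplot(...)`. The colour and line style are arbitrary choices.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend so the sketch runs without a display
import matplotlib.pyplot as plt

fig, ax = plt.subplots()
ax.plot([0, 1, 2, 3], [2, 1, 3, 0])

# Turn the grid on and set the gridline colour and style.
ax.grid(True, color="lightgray", linestyle="--")

grid_color = ax.get_ygridlines()[0].get_color()
plt.close(fig)
```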

lavish tundra Mar 11, 2021, 10:40 PM

#

i dont use matplotlib . _.

exotic maple Mar 11, 2021, 10:40 PM

#

havent used matplotlib in a while

lavish tundra Mar 11, 2021, 10:41 PM

#

i was thinking that if i dropped seaborn and only used matplotlib it could be better for performance, cause i only need graphics like this:

exotic maple Mar 11, 2021, 10:41 PM

#

you can use both

#

but for that id probably just use mpl

#

for something prettier definitely seaborn, much easier

lavish tundra Mar 11, 2021, 10:46 PM

#

idk, i'm trying to use less imports, cause i have a big list rn . _.

#

i'm afraid my bot could have problems with this

grave frost Mar 11, 2021, 11:21 PM

#

lavish tundra idk, i'm trying to use less imports, cause i have a big list rn . _.

a big list of imports doesn't matter - tip: you can also import multiple names from one module at the same time, for example:
from math import sqrt, ...<other_names_from_math>.

#

or you can do import math and use it like math.sqrt()

undone heron Mar 12, 2021, 12:19 AM

#

hey guys, I need some help with Tableau/Power BI stuff, is this the right place?

hollow sentinel Mar 12, 2021, 12:20 AM

#

I think we only help with Python libraries for data science/ML

grave frost Mar 12, 2021, 12:21 AM

#

For discussion of scientific python, matplotlib, statistics, machine learning and related topics.

iron basalt Mar 12, 2021, 12:23 AM

#

I'm guessing @undone heron is trying to run a python script in Power BI.

serene scaffold Mar 12, 2021, 12:25 AM

#

grave frost > For discussion of scientific python, matplotlib, statistics, machine learning ...

We can talk about data science topics in a general, language-agnostic way, though any discussion about implementations should be with respect to Python.

velvet thorn Mar 12, 2021, 12:38 AM

#

lavish tundra i was thinking if i left seaborn to only use matplotlib if could be better for p...

performance difference would be minimal

#

most of the overhead of MPL is from drawing

bitter harbor Mar 12, 2021, 12:41 AM

#

If I wanted to drop rows in a df where the column 'victory_status' is equal to 'outoftime' would I not just do: df.drop(index=np.where(df.data["victory_status"] == "outoftime"))?

serene scaffold Mar 12, 2021, 12:46 AM

#

bitter harbor If I wanted to drop rows in a df where the column 'victory_status' is equal to '...

can you give an example CSV that I can use to try it?

bitter harbor Mar 12, 2021, 12:47 AM

#

https://www.kaggle.com/datasnaek/chess

iron basalt Mar 12, 2021, 12:48 AM

#

You can flip it around: keep all of the rows where 'victory_status' is not 'outoftime'.

serene scaffold Mar 12, 2021, 12:48 AM

#

!paste If you copy and paste enough rows for me to try it, I will try it or look for other solutions.

arctic wedgeBOT Mar 12, 2021, 12:48 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pydis.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

bitter harbor Mar 12, 2021, 12:50 AM

#

https://paste.pythondiscord.com/umaxoloyul.apache

serene scaffold Mar 12, 2021, 1:00 AM

#

bitter harbor <https://paste.pythondiscord.com/umaxoloyul.apache>

remember how you can use masks?

#

I'm not sure why it didn't occur to me sooner

bitter harbor Mar 12, 2021, 1:00 AM

#

I'm ngl I have no clue what a mask is

serene scaffold Mar 12, 2021, 1:00 AM

#

df = df[df['victory_status'] != 'outoftime']

#

you use the comparison, df['victory_status'] != 'outoftime', to get a boolean series of what you want

#

and then you use that as a mask

#

which is what the outer df[...] does
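A tiny runnable illustration of the mask idea, assuming a frame with a `victory_status` column like the chess dataset linked above (the rows here are invented):

```python
import pandas as pd

df = pd.DataFrame({
    "winner": ["white", "black", "white", "draw"],
    "victory_status": ["mate", "outoftime", "resign", "outoftime"],
})

# Boolean Series: True where the row should be kept.
mask = df["victory_status"] != "outoftime"

# Indexing with the mask keeps only the True rows.
kept = df[mask]

print(kept)
```

The drop-based version would need index labels rather than the positions `np.where` returns, for example `df.drop(index=df.index[~mask])`, which is why the mask form tends to be simpler.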

iron basalt Mar 12, 2021, 1:01 AM

#

lavish tundra i was thinking if i left seaborn to only use matplotlib if could be better for p...

If you want some real-time plotting and maybe some other GUI stuff I recommend: https://github.com/hoffstadt/DearPyGui It's a very good option for all GUI things in python and comes with plotting features too (can even make games with it).

bitter harbor Mar 12, 2021, 1:01 AM

#

oh ok ya I've used that but only in R

#

is that just a pandas thing?

serene scaffold Mar 12, 2021, 1:02 AM

#

bitter harbor is that just a pandas thing?

numpy works similarly

iron basalt Mar 12, 2021, 1:03 AM

#

bitter harbor is that just a pandas thing?

You will have a generally easier time if you filter for what you want, rather than dropping what you don't want.

bitter harbor Mar 12, 2021, 1:03 AM

#

huh I'll definitely have to remember that that's hella useful

iron basalt Mar 12, 2021, 1:04 AM

#

Think of it like doing a search, where you keep narrowing it down with each filter.

bitter harbor Mar 12, 2021, 1:04 AM

#

iron basalt You will have a generally easier time if you filter for what you want, rather th...

that sounds extremely wasteful memory-wise tho

iron basalt Mar 12, 2021, 1:05 AM

#

You're using python and you are worried about memory (use c instead)? But ok, you can just apply multiple filters at once, and yes you have all those resulting rows duplicated, but it's much less error prone since you are not modifying state (original df unchanged). If you have a memory issue, then fix it at that time. Until then it's premature optimization.

#

Generally I doubt you will have a memory issue, but if you do, consider using a DB since it will store most of the data on disk for you (well all, but it will keep some in memory for faster access).
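
"Apply multiple filters at once" could look roughly like this. It is a sketch on invented data: the `turns` column and the threshold of 10 are assumptions, not part of the dataset discussed here.

```python
import pandas as pd

df = pd.DataFrame({
    "victory_status": ["mate", "outoftime", "resign", "mate"],
    "turns": [30, 12, 55, 8],  # hypothetical column
})

# Each condition is its own boolean Series; & combines them row by row.
# The parentheses are required because & binds tighter than != and >=.
filtered = df[(df["victory_status"] != "outoftime") & (df["turns"] >= 10)]

# The original frame is untouched; only one new, smaller frame is created.
print(len(df), len(filtered))
```

Combining the conditions in one expression builds a single result frame instead of one intermediate frame per filter, which addresses part of the memory concern.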

lean ledge Mar 12, 2021, 1:10 AM

#

iron basalt If you want some real-time plotting and maybe some other GUI stuff I recommend: ...

Oh my god I'm going to use this so much in the future

#

That's actually amazing for scientific and engineering applications

iron basalt Mar 12, 2021, 1:10 AM

#

lean ledge Oh my god I'm going to use this so much in the future

Yeah it's so nice, I already used DearIMGUI and when I found this I was so happy.

#

I use dearpygui for all kinds of things now, like telemetry.

lean ledge Mar 12, 2021, 1:12 AM

#

If I ever find some free time I might remake the features of MATLAB's Control systems toolbox or something using that

bitter harbor Mar 12, 2021, 1:12 AM

#

well honestly I think using a db would solve most of my issues at this point but my prof wants us to use pandas
I get what you're saying about not needing micro-optimisations but at the same time creating multiple dfs seems unnatural to me and I like micro-optimisations 🤷‍♂️

iron basalt Mar 12, 2021, 1:12 AM

#

Yeah it has all of dear imgui's stuff including the raw draw commands which you can use to make custom widgets.

iron basalt Mar 12, 2021, 1:13 AM

#

bitter harbor well honestly I think using a db would solve most of my issues at this point but...

It's not a micro-optimization, but at the same time, you have probably like 8GB of RAM.

#

8GB is a lot, unless you are dealing with like raw video.

bitter harbor Mar 12, 2021, 1:14 AM

#

4 but ya I see what you're saying

iron basalt Mar 12, 2021, 1:14 AM

#

Also you have virtual memory too, so it will use the disk also.

#

(All modern operating systems do, as long as you don't cause pages to fly in and out really fast you basically have unlimited memory)

warm seal Mar 12, 2021, 1:17 AM

#

Quick question: I'm trying to run a correlation test between an ordinal DV and a continuous IV. Stuck between choosing Spearman's or Kendall's. Any advice?
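
Both coefficients are available in SciPy, so one low-effort option is to compute them side by side. The data below is invented for illustration. `kendalltau` defaults to the tau-b variant, which adjusts for ties, and that is often the argument for preferring it when the ordinal variable has few levels.

```python
from scipy.stats import kendalltau, spearmanr

# Hypothetical data: an ordinal rating (1-5) and a continuous measurement.
rating = [1, 2, 2, 3, 3, 3, 4, 5]
hours = [0.5, 1.1, 0.9, 2.0, 2.4, 1.8, 3.1, 4.0]

rho, rho_p = spearmanr(hours, rating)
tau, tau_p = kendalltau(hours, rating)  # tau-b by default

print(f"Spearman rho = {rho:.3f} (p = {rho_p:.4f})")
print(f"Kendall tau  = {tau:.3f} (p = {tau_p:.4f})")
```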

lavish tundra Mar 12, 2021, 3:05 AM

#

i was thinking about switching from seaborn to plotpy, what do u guys think? do u guys have a favorite for performance and beautiful visualization?

misty flint Mar 12, 2021, 3:14 AM

#

pithink

#

anything but matplotlib

#

please

#

ID_BoomKek

lavish tundra Mar 12, 2021, 3:18 AM

#

why matplotlib? looks like i dont need it

#

i mean i could use plotpy only

exotic maple Mar 12, 2021, 3:24 AM

#

iron basalt If you want some real-time plotting and maybe some other GUI stuff I recommend: ...

Jesus Christ @iron basalt you're like the god of repos lol

misty flint Mar 12, 2021, 3:26 AM

#

lavish tundra why matplotlib? looks like i dont need it

i mean DONT use matplotlib if you dont have to

#

DoggoKek

exotic maple Mar 12, 2021, 3:32 AM

#

I love how I've changed from dumb social networks to github repos and from bullshit bookmarks to python or ds articles o.O

#

I think someone shared this here earlier, and it looks amazing:

#

http://introtodeeplearning.com/

MIT Deep Learning 6.S191

MIT's official introductory course on deep learning methods and applications.

misty flint Mar 12, 2021, 3:33 AM

#

Sip

uncut orbit Mar 12, 2021, 4:21 AM

#

i wish people talked here more

#

there can be so many great discussions on this topic

lean ledge Mar 12, 2021, 4:33 AM

#

There's like 3-5 people who actually know ML stuff here that are regulars and they generally have better things to do than talk all day

iron basalt Mar 12, 2021, 4:34 AM

#

I'm coding and reading, I have already been distracted too much here xd, but Raggy's stuff was p cool so kind of worth it.

uncut orbit Mar 12, 2021, 4:35 AM

#

huh thats true

misty flint Mar 12, 2021, 4:35 AM

#

uncut orbit i wish people talked here more

it has its moments

uncut orbit Mar 12, 2021, 4:36 AM

#

alr

misty flint Mar 12, 2021, 4:37 AM

#

rn im looking at starting an ocr project

#

so looking into that

iron basalt Mar 12, 2021, 4:38 AM

#

Right now i'm a bit more in a reading phase so I can just drop in, but when I go back to a coding / engineering phase I will be pretty narrowly focused on it.

misty flint Mar 12, 2021, 4:40 AM

#

im just here for the vibes

#

DoggoKek

#

and to learn by osmosis

#

🧠

uncut orbit Mar 12, 2021, 5:03 AM

#

im here for the data science and the ai

#

its fun

misty flint Mar 12, 2021, 5:04 AM

#

uncut orbit im here for the data science and the ai

which part do you like the best

#

Sip

#

ive just started grad school coming from a dif field so im mainly in the learning phase

uncut orbit Mar 12, 2021, 5:06 AM

#

all of it

#

neural nets mainly

#

GANS

#

CNNs

#

all that good stuff

#

opencv is fun to work with

#

its awesome what you can do with it

misty flint Mar 12, 2021, 5:09 AM

#

oh nice

#

one of my last projects was with opencv. it was cool

#

our prof wants us to train a CNN for this project

#

pithink

uncut orbit Mar 12, 2021, 5:10 AM

#

oh

#

u using keras?

misty flint Mar 12, 2021, 5:11 AM

#

yeah thats the plan. still reading more before actually training anything

#

probs will start tomorrow morning tho

#

we have a meeting with the TA in the afternoon

#

amegablobsweats

uncut orbit Mar 12, 2021, 5:13 AM

#

cool

#

i wish i had assignments like those

#

but im not in college yert

#

*yet

misty flint Mar 12, 2021, 5:15 AM

#

noice. bet you know more than me tho

#

DoggoKek

#

regarding this stuff. maybe not other things lol

misty flint Mar 12, 2021, 5:16 AM

#

uncut orbit i wish i had assignments like those

dw. you will have plenty of assignments like these if you choose to study this stuff

#

ID_BoomKek

uncut orbit Mar 12, 2021, 5:19 AM

#

i definitely will

#

this stuff makes dopamine in my brain

misty flint Mar 12, 2021, 5:21 AM

#

DoggoKek

#

haha nice expression

uncut orbit Mar 12, 2021, 5:22 AM

#

thx

misty flint Mar 12, 2021, 6:35 AM

#

60000 by 785

#

lets see if excel lags

#

kannaSus

#

hmmmmm

#

theres something weird

misty flint Mar 12, 2021, 7:04 AM

#

theres stuff up to rows 300k in excel but when i read the file with python, it says its shape is only 60k by 785

#

confuseddog

placid drum Mar 12, 2021, 8:58 AM

#

hello, XML question:

when do you use
<device name="SEP12345"></device>

and when do you use
<device>SEP12345</device>

tidal bronze Mar 12, 2021, 9:06 AM

#

I did a df.groupby("column").count()

#

how come a bunch of values are 0?

#

shouldn't every value be at least 1?

#

Otherwise they wouldn't exist no?
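
Two possible explanations, sketched on invented data: `count()` only counts non-null values, and a categorical grouping column can report categories that never occur in the rows. Whether either applies depends on the actual frame, so the column names and values here are assumptions.

```python
import pandas as pd

df = pd.DataFrame({
    # "c" is a declared category that no row uses.
    "column": pd.Categorical(["a", "a", "b"], categories=["a", "b", "c"]),
    "value": [1.0, 2.0, None],
})

# count() skips NaN, so group "b" gets 0; with observed=False the unused
# category "c" is reported too, also with 0.
counts = df.groupby("column", observed=False)["value"].count()

# size() counts rows regardless of NaN; observed=True drops unused categories.
sizes = df.groupby("column", observed=True).size()

print(counts)
print(sizes)
```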

summer fractal Mar 12, 2021, 10:36 AM

#

Hello, is anyone familiar with using Google Data Studio? I've got an issue with custom data not displaying correctly in charts?

obtuse sable Mar 12, 2021, 10:44 AM

#

What's a good resource for a beginner to learn about neural networks and implement one in PyTorch? I want to implement a binary classifier and compare it to a logistic regression model in sklearn

lapis sequoia Mar 12, 2021, 11:09 AM

#

whats data science?

lean ledge Mar 12, 2021, 11:19 AM

#

https://en.m.wikipedia.org/wiki/Data_science

#

TL;DR buzzword for statistics and machine learning with coding

grave frost Mar 12, 2021, 12:12 PM

#

!resources @obtuse sable

arctic wedgeBOT Mar 12, 2021, 12:12 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

serene scaffold Mar 12, 2021, 12:36 PM

#

lapis sequoia whats data science?

in short, it's when you use programming to make use of large amounts of data.

misty flint Mar 12, 2021, 12:45 PM

#

Praise

serene scaffold Mar 12, 2021, 12:59 PM

#

I have a program that calculates precision, recall, and f1 scores, and it breaks if there's a class with 0 tps. in that case should I set all three scores to zero, or would one usually use nan?
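
A small sketch of making the choice explicit instead of letting the division fail. The helper name `prf` and its `undefined` parameter are made up for illustration. scikit-learn's metric functions expose a comparable switch through their `zero_division` argument.

```python
import math


def prf(tp, fp, fn, undefined=0.0):
    """Precision, recall and F1 from raw counts; `undefined` replaces any 0/0."""
    def ratio(num, den):
        return num / den if den else undefined

    precision = ratio(tp, tp + fp)
    recall = ratio(tp, tp + fn)
    # F1 written in terms of counts, so it is only undefined when all are zero.
    f1 = ratio(2 * tp, 2 * tp + fp + fn)
    return precision, recall, f1


# A class that was never predicted and has four missed examples.
print(prf(tp=0, fp=0, fn=4))
print(prf(tp=0, fp=0, fn=4, undefined=math.nan))
```

Using 0 keeps macro averages computable; using nan makes it visible that a score was undefined rather than genuinely bad, at the cost of needing nan-aware averaging later.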

grave frost Mar 12, 2021, 1:33 PM

#

Anyone know what is the technical term for the multi-branch networks (the ones where we would concatenate layers)?

lean ledge Mar 12, 2021, 1:38 PM

#

grave frost Anyone know what is the technical term for the multi-branch networks (the ones w...

Multi branch network is the term lol

#

Multi output is sorta common too

tidal bronze Mar 12, 2021, 2:07 PM

#

so I have a subset of a df, columns(productID and orders), there are 5 unique product id, when I try to do a histplot with hue=productID, all productID from the original df show up

lapis sequoia Mar 12, 2021, 2:08 PM

#

Hey! So I am just starting with python

tidal bronze Mar 12, 2021, 2:08 PM

#

common = agg_df.sort_values(by=["sh_18_Ln_PalletsEquivalent"], ascending=False).index.tolist()[:5]
df_most_ordered = df[df["sh_ItemId"].isin(common)]
sns.histplot(data=df_most_ordered,
             x = "sh_18_Ln_PalletsEquivalent",
             hue="sh_ItemId")

lapis sequoia Mar 12, 2021, 2:09 PM

#

Question... modules are synonymous of libraries?

tidal bronze Mar 12, 2021, 2:09 PM

#

a module is another python script

#

your question would fit more in #python-discussion

lapis sequoia Mar 12, 2021, 2:10 PM

#

Excellent thanks!!

tidal bronze Mar 12, 2021, 2:10 PM

#

my question is why are all the original productID showing up

#

where they aren't even part of the dataframe I am basing the plot on
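
One possible cause, assuming `sh_ItemId` has a categorical dtype: filtering rows does not shrink the list of declared categories, and plotting libraries may build the legend from that list. The sketch below uses invented IDs and shows the check and the fix in pandas only.

```python
import pandas as pd

df = pd.DataFrame({
    "sh_ItemId": pd.Categorical(["A", "B", "C", "A"]),
    "sh_18_Ln_PalletsEquivalent": [1.0, 2.0, 3.0, 4.0],
})

subset = df[df["sh_ItemId"].isin(["A", "B"])].copy()

# The filtered frame still remembers every category from the original.
before = list(subset["sh_ItemId"].cat.categories)

# Dropping unused categories leaves only the IDs that actually occur.
subset["sh_ItemId"] = subset["sh_ItemId"].cat.remove_unused_categories()
after = list(subset["sh_ItemId"].cat.categories)

print(before, after)
```

If the column is a plain object or integer column instead, this is not the cause and the filter itself is worth checking.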

lapis sequoia Mar 12, 2021, 2:18 PM

#

yo

#

i am trying out a beginner project

#

as u can see, the data is the documents

#

and i really dont understand how it predicted both "that kud" and "HOHOHO" as belonging to the same cluster in this case

#

is there anything that i should fix there? i dont think that it's actually legit

#

Hey guys I'm working with matplotlib subplot animations and I'm running into some issues, could someone please just tell me what is wrong with this code? I don't get the logic behind it.

```py
import matplotlib.animation as animation
import matplotlib.pyplot as plt
import numpy as np

figure = plt.figure()
n = 1000

x1 = np.random.normal(-2.5, 1, 10000)
x2 = np.random.gamma(2, 1.5, 10000)
x3 = np.random.exponential(2, 10000) + 7
x4 = np.random.uniform(14, 20, 10000)
x = [x1, x2, x3, x4]


def plot_hist(curr):
    if curr == n:
        a.even_source.stop()
    for i in range(len(axs)):
        axs[i].cla()
        axs[i].hist(x[i], bins=100)


fig, ((top_left, top_right), (bottom_left, bottom_right)) = plt.subplots(2, 2, sharex=True)
axs = [top_left, top_right, bottom_left, bottom_right]

a = animation.FuncAnimation(figure, plot_hist, interval=100)
```
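
A hedged sketch of how the snippet above might be repaired. Three things stand out: the animation is attached to the empty `figure` rather than the `fig` that owns the subplots, `even_source` is a typo for `event_source`, and every frame draws all 10000 samples, so nothing changes between frames. Drawing only the first `curr` samples is an assumption about the intended effect.

```python
import matplotlib.animation as animation
import matplotlib.pyplot as plt
import numpy as np

n = 1000
rng = np.random.default_rng(0)
x = [
    rng.normal(-2.5, 1, 10000),
    rng.gamma(2, 1.5, 10000),
    rng.exponential(2, 10000) + 7,
    rng.uniform(14, 20, 10000),
]

# One figure only: the same object owns the axes and drives the animation.
fig, axes = plt.subplots(2, 2, sharex=True)
axs = axes.ravel()


def plot_hist(curr):
    # Stop the timer once n samples are shown.
    if curr >= n:
        anim.event_source.stop()
    for ax, data in zip(axs, x):
        ax.cla()
        # Draw only the first `curr` samples so each frame differs.
        ax.hist(data[:curr], bins=100)


# Keep a reference to the animation, otherwise it is garbage collected.
anim = animation.FuncAnimation(fig, plot_hist, frames=range(1, n + 1), interval=100)
# plt.show()  # uncomment when running interactively
```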

grave frost Mar 12, 2021, 2:37 PM

#

lean ledge Multi branch network is the term lol

yeet! 😅

#

Is there any specific advantage that architecture offers?

lean ledge Mar 12, 2021, 2:39 PM

#

Multi channel is multiple branches as inputs

lean ledge Mar 12, 2021, 2:40 PM

#

grave frost Is there any specific advantage that architecture offers?

It's less an advantage of an architecture more speeding up multiple networks on the same data.

#

Multiple output branches are basically multiple networks that share input parameters

grave frost Mar 12, 2021, 2:40 PM

#

lean ledge Multi channel is multiple branches as inputs

ahh, so multichannel has multiple input gates while branched ones just have multiple branches on one input gate?

lean ledge Mar 12, 2021, 2:41 PM

#

Means you can do less processing because you're likely to be learning the same lower level features in early layers anyway

#

See: Mask RCNN

#

It's basically Faster RCNN that shares its early parameters with a segmentation branch

grave frost Mar 12, 2021, 2:42 PM

#

tbh that's pretty creative

#

but it doesn't do much to push SOTA, does it?

lapis sequoia Mar 12, 2021, 2:43 PM

#

lapis sequoia

@grave frost what do u think of my case?

#

i am pretty unsure tbh

grave frost Mar 12, 2021, 2:44 PM

#

you can use cosine similarity or simple euclidean distance to check whether the vectors are indeed correct and debug that way

#

most prob there is little to no correlation

#

you can visualize them too so that its easy to understand and looks pretty too
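
The similarity check suggested above can be done with plain NumPy. The three vectors are invented stand-ins for document vectors, and the helper name is illustrative.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1 = same direction, 0 = orthogonal."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


doc_a = [1.0, 0.0, 2.0]
doc_b = [2.0, 0.0, 4.0]  # same direction as doc_a, twice the length
doc_c = [0.0, 3.0, 0.0]  # shares no terms with doc_a

sim_ab = cosine_similarity(doc_a, doc_b)
sim_ac = cosine_similarity(doc_a, doc_c)
dist_ab = float(np.linalg.norm(np.subtract(doc_a, doc_b)))

print(sim_ab, sim_ac, dist_ab)
```

Cosine similarity ignores vector length while Euclidean distance does not, which is why `doc_a` and `doc_b` are identical by the first measure and still some distance apart by the second.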

lapis sequoia Mar 12, 2021, 2:45 PM

#

i dont think i can visualize the datas tho

grave frost Mar 12, 2021, 2:45 PM

#

I mean the vectors

lapis sequoia Mar 12, 2021, 2:45 PM

#

they are in texts

#

oh?

#

how

grave frost Mar 12, 2021, 2:46 PM

#

there are plenty of tutorials online

lapis sequoia Mar 12, 2021, 2:46 PM

#

cool

#

thx

grave frost Mar 12, 2021, 2:47 PM

#

I think there was a guy recently who posted the same article here, but you would have to search for it

lapis sequoia Mar 12, 2021, 2:48 PM

#

the prediction makes more sense now

grave frost Mar 12, 2021, 2:49 PM

#

!code

arctic wedgeBOT Mar 12, 2021, 2:49 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

pseudo raptor Mar 12, 2021, 2:52 PM

#

Does anyone know a good tutorial to start learning tensorflow in python?

grave frost Mar 12, 2021, 2:55 PM

#

Linear Support Vector Machine is widely regarded as one of the best text classification algorithms.
google got this ^^^

lean ledge Mar 12, 2021, 2:59 PM

#

grave frost but it doesn't do much to push SOTA, does it?

Being useable is pretty useful in breaking SOTA. It's also sort of useful in terms of representation theory. NNs don't necessarily learn lower level features early on.

grave frost Mar 12, 2021, 3:10 PM

#

I had a question - can anyone suggest some method where I can incorporate document vectors with TF-Idf? it would be pretty easy with word vectors with simple scalar multiplication. but what approach could we use in documents??

austere swift Mar 12, 2021, 3:21 PM

#

https://www.reddit.com/r/MachineLearning/comments/m3boyo/d_why_is_tensorflow_so_hated_on_and_pytorch_is/ here's a pretty interesting conversation on tensorflow vs pytorch

r/MachineLearning - [D] Why is tensorflow so hated on and pytorch i...

264 votes and 125 comments so far on Reddit

austere swift Mar 12, 2021, 3:22 PM

#

pseudo raptor Does anyone knows a good tutorial to start python tensorflow learning ?

you can check out the tensorflow guide on tensorflow's website

#

or the tutorials

#

https://www.tensorflow.org/tutorials

TensorFlow

Tutorials | TensorFlow Core

Complete, end-to-end examples to learn how to use TensorFlow for ML beginners and experts. Try tutorials in Google Colab - no setup required.

pseudo raptor Mar 12, 2021, 3:23 PM

#

austere swift you can check out the tensorflow guide on tensorflow's website

Ok, thank you 🙂

lapis sequoia Mar 12, 2021, 3:35 PM

#

hi, i want some help. So i am unsure of the type of machine learning algorithm i should use, i hope yall know the type of learning that suits my project.

a) i want it to be able to categorize sentences, such as "chitchat" and "task" and even sub categories based on any pattern that it can find from the sentences. ( main objective )

b) i want to let it be able to be trained by this way. I first define the categories, then i put some sentences in the category. it tries to predict the category of test sentence and i can tell the correct category if it's wrong ( i think i can do the same with supervised learning for the "telling the correct category" part )
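
What is described in (a) and (b) reads like supervised text classification. A minimal sketch with scikit-learn follows, using TF-IDF features and a linear SVM, which was mentioned earlier in the channel. All sentences and labels are invented, and a real model would need far more examples per category.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Step 1: define the categories and give each a few example sentences.
sentences = [
    "how are you today",
    "what a nice day",
    "tell me a joke",
    "turn on my fan",
    "set an alarm for seven",
    "turn off the lights",
]
labels = ["chitchat", "chitchat", "chitchat", "task", "task", "task"]

model = make_pipeline(TfidfVectorizer(), LinearSVC())
model.fit(sentences, labels)

# Step 2: let the model guess the category of an unseen sentence.
guess = model.predict(["turn on the heater"])[0]
print(guess)

# Step 3: "telling it the correct category" means adding the labelled
# sentence to the training data and fitting again.
sentences.append("turn on the heater")
labels.append("task")
model.fit(sentences, labels)
```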

grave frost Mar 12, 2021, 3:39 PM

#

austere swift https://www.reddit.com/r/MachineLearning/comments/m3boyo/d_why_is_tensorflow_so_...

nice. personally, I am more of a TF Person, but I see that people are comparing TF 1.x more than TF2. The basic summary is: I started with TF1, it sucked. I switched to pytorch and would never switch again.

#

For me, TF just adds a lot of native support which makes it easier to do stuff that uses other Google products (like GCP, Tfrecords, TPU, etc.) having native support is a big thing that prevents TF people from going to PyTorch. I wanted to do PyT with TPU and that was a mess of errors. with TF - literally 5 lines. Same with model parallelism - its just easier.
Thats why a lot of people stick with TF because it just makes life easy 🤷 Tho my aim is to be familiar with pytorch too by the time I end undergrad or smthing

austere swift Mar 12, 2021, 3:43 PM

#

for me it depends on the complexity of the project

#

i like using keras with tensorflow for simple projects

#

since its a lot easier

#

but for more complex stuff i use pytorch

#

i've never really used raw tensorflow with no keras

lapis sequoia Mar 12, 2021, 3:44 PM

#

@grave frost @austere swift what do u think of my project?

grave frost Mar 12, 2021, 3:44 PM

#

there is no need to use raw TF when keras works

austere swift Mar 12, 2021, 3:44 PM

#

yeah

grave frost Mar 12, 2021, 3:44 PM

#

its only for researchers and power users I guess

austere swift Mar 12, 2021, 3:45 PM

#

as for TPU afaik thats like one of the biggest advantages that TF has

#

pytorch just doesn't really like TPUs

#

but I train on local servers rather than kaggle or colab

#

so I use GPUs anyways

grave frost Mar 12, 2021, 3:45 PM

#

ye, XLA f-ing sucks. that thing produces a shit-ton of errors that are basically the exact opposite of what the error says

#

never. again.

grave frost Mar 12, 2021, 3:46 PM

#

austere swift but I train on local servers rather than kaggle or colab

Cloud?

#

rich boi

#

I would never be able to take the brave step with local hardware (but then I don't really have $$)

austere swift Mar 12, 2021, 3:47 PM

#

my dad gets grants for research stuff, so thats how i get funded

#

I'm the one who mostly uses the hardware though

#

my dads a professor and physicist at a university

grave frost Mar 12, 2021, 3:47 PM

#

that is pretty cool

austere swift Mar 12, 2021, 3:47 PM

#

yeah

grave frost Mar 12, 2021, 3:47 PM

#

so whats the config?

austere swift Mar 12, 2021, 3:47 PM

#

this is the main server i use

#

4 rtx 6000s, dual xeon 6242s

#

384gb ram

#

i also have a second one which is similar except it has dual 4210s and a single rtx 6000

#

but i'm soon gonna be upgrading it to have an A40, then moving the rtx 6000 from it to the one with 4 rtx 6000s

grave frost Mar 12, 2021, 3:48 PM

#

dude, please stop. my potato computer would crash seeing such expensive hardware

austere swift Mar 12, 2021, 3:48 PM

#

so its gonna be one with an A40, and one with 5 rtx 6000s

grave frost Mar 12, 2021, 3:48 PM

#

do you do kaggle?

austere swift Mar 12, 2021, 3:49 PM

#

kaggle competitions?

#

yeah sometimes

grave frost Mar 12, 2021, 3:49 PM

#

ye

#

so do you use your rig to train massive models?

austere swift Mar 12, 2021, 3:49 PM

#

yeah

grave frost Mar 12, 2021, 3:49 PM

#

cuz I doubt then you wouldn't atleast come at top 5

austere swift Mar 12, 2021, 3:49 PM

#

sometimes i just use it to train multiple models at once too

#

one on each gpu, or 2 each using 2 GPUS

#

or any config like that

#

mix and match :)

grave frost Mar 12, 2021, 3:50 PM

#

lucky you

#

experimentation must be a breeze when you don't have to think about resources

#

its all just writing the code

austere swift Mar 12, 2021, 3:51 PM

#

yeah but the code is more complicated when you have to make it take advantage of the GPUs

#

coding for multi gpu is more complicated

grave frost Mar 12, 2021, 3:52 PM

#

still

austere swift Mar 12, 2021, 3:52 PM

#

yeah lol its nice

grave frost Mar 12, 2021, 3:52 PM

#

you can't experiment with that code unless you have multi-gpus

austere swift Mar 12, 2021, 3:52 PM

#

but the power bills are 📈 📈 📈

grave frost Mar 12, 2021, 3:52 PM

#

just train it at the night 🙂

austere swift Mar 12, 2021, 3:53 PM

#

it draws too much power to be plugged into one circuit

#

so i have an extension cord to have one of the psus connected to a different circuit

#

cus it has 2 psus

grave frost Mar 12, 2021, 3:53 PM

#

nice

austere swift Mar 12, 2021, 3:54 PM

#

yeah

#

it tripped the breaker 3 times before i figured that out

exotic maple Mar 12, 2021, 4:12 PM

#

lapis sequoia hi, i want some help. So i am unsure of the type of machine learning algorithm i...

Your question is a bit too general as it is.

On one side it looks like a classification problem. On the other hand it looks like an NPL problem "put sentences in the category"?

I'm not sure if i'm being lost in translation here but I can't conclusively determine what your goal is here. is it classification?

lapis sequoia Mar 12, 2021, 4:13 PM

#

it's to classify sentences into sub categories

#

but it's ok now

#

i figured that it wouldnt work

exotic maple Mar 12, 2021, 4:14 PM

#

its not that it wouldnt work. Its just you need to clearly define what you want lol

#

sounds more like an NPL kind of problem though.

#

NLP*

spare vine Mar 12, 2021, 4:19 PM

#

any ideas why this mnist dataset has more than the expected 70,000 rows? https://www.openml.org/d/554

OpenML: exploring machine learning better, together.

OpenML

OpenML: exploring machine learning better, together. An open science platform for machine learning.

misty flint Mar 12, 2021, 4:26 PM

#

ahhh i messed up preprocessing my data

#

just now caught it

#

Kermit_KMS

spare vine Mar 12, 2021, 4:42 PM

#

spare vine any ideas why this mnist dataset has more than the expected 70,000 rows? https:/...

nvm i'm dumb

uncut orbit Mar 12, 2021, 4:44 PM

#

nah you're not

#

there must have been more data

spare vine Mar 12, 2021, 4:45 PM

#

thanks for checking it. i just realised that it is 70,000 but i was also counting the metadata lines

uncut orbit Mar 12, 2021, 4:46 PM

#

oh

spare vine Mar 12, 2021, 4:46 PM

#

that's why i'm dumb 😛

uncut orbit Mar 12, 2021, 4:46 PM

#

oh

spare vine Mar 12, 2021, 4:47 PM

#

numberofinstances is 70,000 == number of rows. number of columns is 785

lapis sequoia Mar 12, 2021, 4:52 PM

#

exotic maple its not that it wouldnt work. Its just you need to clearly define what you want ...

well

#

this is why i just gonna stop trying it

#

i wanted it to work like a personal assistant

#

the problem is

#

it wouldnt understand the way to execute the commands

#

such as

#

turn on my fan

#

every command has to be coded

#

i can actually use ml for the part i mentioned

#

the problem is executing my commands

austere swift Mar 12, 2021, 5:04 PM

#

spare vine any ideas why this mnist dataset has more than the expected 70,000 rows? https:/...

#

oh nvm you already figured it out

grave frost Mar 12, 2021, 5:22 PM

#

@lapis sequoia it would be much more helpful if you can condense your issue down to a single paragraph in one message

hollow sentinel Mar 12, 2021, 6:59 PM

#

Is bruher oui oui baguette

boreal summit Mar 12, 2021, 7:01 PM

#

For those who are into web scraping and selenium.

#

I'm reading this book, and I need to have Gecko to run some code.

#

I've been searching how to download Gecko driver but I couldn't find much information like the .exe file itself.

#

I already have Firefox installed on my laptop, so do I just use the Firefox installation as the driver or is there a way to download Gecko driver's.exe file?

#

Thanks.

#

Also, I'm on windows.

spare vine Mar 12, 2021, 7:20 PM

#

boreal summit For those who are into web scraping and selenium.

I think this is what you want: https://github.com/mozilla/geckodriver/releases

GitHub

Releases · mozilla/geckodriver

WebDriver for Firefox. Contribute to mozilla/geckodriver development by creating an account on GitHub.

#

find the win zip (either 32 or 64 based on your PC, most likely 64)

boreal summit Mar 12, 2021, 7:24 PM

#

spare vine find the win zip (either 32 or 64 based on your PC, most likely 64)

Okay man, Thanks.

misty flint Mar 12, 2021, 7:32 PM

#

ive told the dataframe to add 10 to every number in the 0th index

#

why does it add it to the first index

#

Blobcat_tableflip

sudden ether Mar 12, 2021, 7:32 PM

#

~~how can i make sense of stanford's stanza's outputs?~~

misty flint Mar 12, 2021, 7:39 PM

#

aha

#

why is this not 0??

#

this is the culprit

#

CocoGun

exotic maple Mar 12, 2021, 8:12 PM

#

Impressive

That dataframe is the least explicative table i've ever seen in my life lol

misty flint Mar 12, 2021, 8:14 PM

#

its the worst

#

also interesting note

#

if you try to read a csv on google colab before its fully uploaded, it still reads it

#

it just reads the rows it has currently

#

so like, half the data DoggoKek

#

idk why colab doesnt return an error

#

was supposed to be ~400k rows but only 200k were in the dataframe

#

ID_BoomKek

keen pivot Mar 12, 2021, 8:28 PM

#

Which pdf library should one use to read PDFs?

#

is there a "best one" or standard one most folks use?

exotic maple Mar 12, 2021, 8:46 PM

#

PDF is a pain to work with

#

depending on what you want, either library works best

misty flint Mar 12, 2021, 8:49 PM

#

pypdf is nice

#

also you know what they should make in the future?

#

some kind of way to estimate how long a model will take to train

#

based on how big the dataset is, etc., etc.

#

im here staring at the rotating circle

#

like

#

K_KittyComfy

keen pivot Mar 12, 2021, 8:55 PM

#

exotic maple PDF is a pain to work with

I'm looking to extract text from it. scrape information from a PDF. like say if it were a form

misty flint Mar 12, 2021, 8:55 PM

#

pypdf

keen pivot Mar 12, 2021, 8:56 PM

#

I'm trying it right now and I'm not finding it working well

#

I'm about to check out PDFminer.six

misty flint Mar 12, 2021, 8:56 PM

#

rip

#

works for me

keen pivot Mar 12, 2021, 8:56 PM

#

what does your code look like?

#

maybe I'm doing something wrong

exotic maple Mar 12, 2021, 8:56 PM

#

keen pivot I'm looking to extract text from it. scrape information from a PDF. like say if ...

best wishes to you. I found working with PDF a pain in the ass

#

i literally converted it to a different file type because it was easier lol

#

I used this

#

https://smallpdf.com/pdf-converter

PDF Converter - Convert files to and from PDFs Free Online

Internet's #1 and 100% free online PDF converter to convert your files to and from PDFs. No registration or installation needed. Start converting today!

#

but if your data is sensitive...well, no idea tbh

misty flint Mar 12, 2021, 8:57 PM

#

um im training a model rn so...lol

#

also i have a meeting with the ta where im supposed to show this model

#

Clown2

keen pivot Mar 12, 2021, 8:58 PM

#

^^

#

Good luck!

misty flint Mar 12, 2021, 8:58 PM

#

thanks. the model wont train in time bc i waited last minute

#

ID_BoomKek

#

look into ocr maybe

#

tesseract was the library we used for our team project

keen pivot Mar 12, 2021, 9:00 PM

#

.extractText() just doesn't seem to work

keen pivot Mar 12, 2021, 9:00 PM

#

misty flint look into ocr maybe

OCR sounds like overkill if the text is in there somewhere

exotic maple Mar 12, 2021, 9:00 PM

#

maybe its one of those beautiful PDFs that store the data as an image

#

-pukes-

keen pivot Mar 12, 2021, 9:01 PM

#

might be

#

but I can definitely highlight the text

#

in the pdf... so the text must be in there right?

#

i guess I can copy over another pdf and test it out

exotic maple Mar 12, 2021, 9:02 PM

#

should be. Unfortunately i dont think i can help you anymore thou 😦

keen pivot Mar 12, 2021, 9:03 PM

#

Looks like It works with another, simpler PDF.

#

The text seems to be broken up in some really weird points... hmm

#

There's definitely bits missing...

#

okay so pyPDF doesn't work and when it does it misses out text

#

buuuut

#

when i use pdfminer the text comes out for both example pdfs

#

it works like a charm

#

the only issue i think is just it's a lot... harder to use

molten hamlet Mar 12, 2021, 9:24 PM

#

How can I use sklearn PCA with pandas table where I got string values?

#

car model ValueError: could not convert string to float: 'alfa-romero'

keen pivot Mar 12, 2021, 9:34 PM

#

keen pivot the only issue i think is just it's a lot... harder to use

actually it turned out to be quite simple

exotic maple Mar 12, 2021, 9:43 PM

#

molten hamlet How can I use sklearn `PCA` with pandas table where I got string values?

you need to convert the categorical column to numbers

#

for example you can use sklearn's OneHotEncoder
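
A rough sketch of that suggestion: one-hot encode the string column, pass the numeric columns through, then run PCA. Only the value `'alfa-romero'` comes from the error message. The column names and numbers are invented, and scaling is left out to keep the example short.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder

cars = pd.DataFrame({
    "make": ["alfa-romero", "audi", "bmw", "audi"],
    "horsepower": [111, 102, 101, 115],     # hypothetical values
    "price": [13495, 13950, 16430, 17450],  # hypothetical values
})

encode = ColumnTransformer(
    [("make", OneHotEncoder(handle_unknown="ignore"), ["make"])],
    remainder="passthrough",  # keep the numeric columns as they are
    sparse_threshold=0,       # always return a dense array for PCA
)

reduced = make_pipeline(encode, PCA(n_components=2)).fit_transform(cars)
print(reduced.shape)
```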

exotic maple Mar 12, 2021, 9:43 PM

#

keen pivot the only issue i think is just it's a lot... harder to use

I admire your resilience lol.

#

PDFs are unbearable

lean ledge Mar 12, 2021, 9:47 PM

#

The PCA of categorical data is weird, you might also just want to not include that in your PCA.

misty flint Mar 12, 2021, 9:50 PM

#

keen pivot actually it turned out to be quite simple

oh you got it to work? grats

#

Praise

keen pivot Mar 12, 2021, 9:51 PM

#

misty flint oh you got it to work? grats

Yeah pdfminer seems pretty good.

misty flint Mar 12, 2021, 9:51 PM

#

ValkNaruhodo

keen pivot Mar 12, 2021, 9:52 PM

#

Its high-level function makes it real easy

#

https://pdfminersix.readthedocs.io/en/latest/reference/highlevel.html

misty flint Mar 12, 2021, 9:53 PM

#

ill have to look into this

#

to see if we can also use this for our project

#

thanks for the link

#

hmmm

#

WARNING:tensorflow:Learning rate reduction is conditioned on metric val_acc which is not available. Available metrics are: loss,categorical_accuracy,auc,val_loss,val_categorical_accuracy,val_auc,lr

#

the model is still training so...

#

pithink

#

why doesnt the TF documentation have a list of the warnings

#

oh well ig the learning rate reduction just wont happen

keen pivot Mar 12, 2021, 10:03 PM

#

What are you building?

misty flint Mar 12, 2021, 10:28 PM

#

a basic CNN to read handwritten characters

exotic maple Mar 12, 2021, 11:04 PM

#

lean ledge The PCA of categorial data is weird, you might also just want to not include tha...

in that case what do you do? PCA or MDS your continuous variables only?

#

and then later encode your categorical?

austere swift Mar 12, 2021, 11:17 PM

#

misty flint > WARNING:tensorflow:Learning rate reduction is conditioned on metric `val_acc` ...

looks like you'd want to change val_acc to val_categorical_accuracy

atomic swan Mar 12, 2021, 11:21 PM

#

Does anyone know how to resolve matplotlib graphing frequency on the x axis? My chart is the wrong way round

lean ledge Mar 12, 2021, 11:23 PM

#

exotic maple and then later encode your categorical?

That's what I would do. You can run separate feature selection processes on categorical variables

exotic maple Mar 12, 2021, 11:23 PM

#

what is a good feature selection process for categoricals? PCA, MDS and t-SNE probably wouldnt work with those right?

atomic swan Mar 12, 2021, 11:25 PM

#

seriously... nobody knows?

#

in the data science channel

lean ledge Mar 12, 2021, 11:47 PM

#

exotic maple what is a good feature selection process for categoricals? PCA, MDS and t-SNE pr...

I'm weak on that side of statistics but generally hypothesis testing type methods which you'd use to figure out the difference between two scientific testing groups are what's relevant. Chi squared, rank correlation, etc.
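For a feel of what the chi-squared part computes, here is a small pure-Python sketch of the Pearson statistic on a contingency table of one categorical feature against the class label. The counts are made up, and `chi_squared_statistic` is just an illustrative helper, not a library function:

```python
def chi_squared_statistic(table):
    """Pearson chi-squared statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count we would expect in this cell if feature and label were independent
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat

# Made-up counts: rows are the two categories of a feature, columns are the two classes
observed = [[30, 10],
            [20, 40]]
stat = chi_squared_statistic(observed)
print(round(stat, 2))
```

The larger the statistic relative to the chi-squared distribution for the table's degrees of freedom, the less plausible it is that the feature and the label are independent, which is the argument for keeping the feature.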

exotic maple Mar 12, 2021, 11:48 PM

#

Chi squared? I have never used X2 like that. interesting. I'll read about it. thanks!

lean ledge Mar 12, 2021, 11:51 PM

#

It's just like usual hypothesis testing honestly

#

https://towardsdatascience.com/chi-square-test-for-feature-selection-in-machine-learning-206b1f0b8223

Medium

Chi-Square Test for Feature Selection in Machine learning

We always wonder where the Chi-Square test is useful in machine learning and how this test makes difference.Feature selection is an…

misty flint Mar 13, 2021, 12:08 AM

#

austere swift looks like you'd want to change `val_acc` to `val_categorical_accuracy`

ok let me try it. thanks

misty flint Mar 13, 2021, 12:24 AM

#

atomic swan in the data science channel

you didnt paste your code at all

#

memecringeharold

#

your problem could be different things

#

maybe the data isnt sorted right

#

maybe your variable names are wrong

#

etc.

misty flint Mar 13, 2021, 3:00 AM

#

what do people do when their model is still training

#

just chill

#

grab lunch?

#

etc.

#

DoggoKek

exotic maple Mar 13, 2021, 3:08 AM

#

misty flint grab lunch?

Anime or netflix

#

or troll in this discord :v

misty flint Mar 13, 2021, 3:10 AM

#

BongoCat

#

i see you speak from personal experience

#

how wise

exotic maple Mar 13, 2021, 3:12 AM

#

lean ledge It's just like usual hypothesis testing honestly

The last time i did hypothesis testing of categorical variables was in college. RIP

#

time to Khan Academy 🃏

misty flint Mar 13, 2021, 3:13 AM

#

theres a good open source textbook

#

if youre a textbook guy

#

@exotic maple chapter 6 https://www.openintro.org/book/os/

OpenIntro Statistics

OpenIntro's mission is to make educational products that are free, transparent, and lower barriers to education. We're a registered 501(c)(3) nonprofit.

#

i just read chapters 5 and 6 and they were pretty decently written

#

on the shorter side for textbooks too

#

or ig you could watch the videos too

#

if youre a video guy

#

DoggoKek

exotic maple Mar 13, 2021, 3:18 AM

#

I'd prefer classrooms for math classes but in these beautiful times

#

videos will do

misty flint Mar 13, 2021, 3:18 AM

#

feelsbongoman

exotic maple Mar 13, 2021, 3:18 AM

#

honestly khan academy is pretty good; i just need a refresher

misty flint Mar 13, 2021, 3:18 AM

#

ye

#

the book is just if you need more specifics ig

#

i make my students that i tutor watch khan academy

#

DoggoKek

exotic maple Mar 13, 2021, 3:20 AM

#

that's an improvement over college teachers that give you a book and say "let me know if you dont get something" hyperlemon

misty flint Mar 13, 2021, 3:21 AM

#

and then they never respond to their email if you actually need them

#

ID_BoomKek

hollow sentinel Mar 13, 2021, 3:24 AM

#

I emailed my writing professor like a week ago about an assignment bc I couldn’t find it

#

She never even responded

misty flint Mar 13, 2021, 3:25 AM

#

memecringeharold

#

when youre waiting for a model to train for your project, so you end up working on your other project

#

2_AppClown

#

moral of the story:

#

dont procrastinate kids

#

ID_BoomKek

iron basalt Mar 13, 2021, 3:38 AM

#

During training I think about the model and then realize that I have a (logic / not immediately noticeable) bug and all that time spent was for nothing.

#

Or the model is not doing well and I can't tell if it's a bug or if my experimental model is just bad.

exotic maple Mar 13, 2021, 3:40 AM

#

nah bro

#

the best is when you perform a gridsearch with 3 parameters, 5 variations for each parameter, but you forget to set the scoring to something other than accuracy (which is what i needed)

#

and you waste 1 hour of your life watching your PC burn

misty flint Mar 13, 2021, 3:47 AM

#

ID_BoomKek

#

could be worse tho

#

you couldve wasted even more time with a larger one

#

DoggoKek

exotic maple Mar 13, 2021, 3:53 AM

#

my friend just sent me the greatest piece of heresy ive ever seen

import pandas as np
import numpy as pd

hollow sentinel Mar 13, 2021, 4:06 AM

#

The real menaces don’t even import pandas as pd

#

they just import pandas

arctic wedgeBOT Mar 13, 2021, 4:28 AM

#

@exotic maple I'm going to secretly execute pd, np = np, pd under the hood to switch it back. I will not be mocked!

twilit pilot Mar 13, 2021, 4:38 AM

#

This is my pandas dataframe
```
                                                   image      label
0      [146, 151, 156, 158, 160, 154, 144, 131, 127, ...  butterfly
1      [156, 156, 156, 157, 158, 159, 160, 160, 160, ...  butterfly
2      [146, 198, 172, 200, 168, 226, 186, 183, 192, ...  butterfly
3      [66, 57, 53, 51, 42, 63, 95, 123, 139, 121, 77...  butterfly
4      [48, 110, 212, 226, 232, 248, 144, 186, 190, 1...  butterfly
...                                                  ...        ...
26174  [157, 150, 122, 131, 149, 151, 162, 190, 132, ...   squirrel
26175  [128, 119, 99, 73, 58, 132, 108, 110, 97, 88, ...   squirrel
26176  [172, 162, 151, 108, 109, 115, 132, 174, 183, ...   squirrel
26177  [16, 89, 112, 109, 65, 30, 46, 97, 98, 118, 8,...   squirrel
26178  [97, 106, 94, 101, 55, 121, 129, 77, 35, 18, 8...   squirrel

[26179 rows x 2 columns]
```
All the `images` are 1d arrays of length 2500 (all same length, type="uint8"). I made my
```py
X = df['image']
y = df['label']
```
and I am trying to use an `sklearn.svm.SVC()` model and this is the error i get
```py
model = SVC()
model.fit(X, y)
```

#

TypeError: only size-1 arrays can be converted to Python scalars

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "c:\Users\sohan\OneDrive\Documents\ProgrammingProjects\ImageClassification\main.py", line 22, in <module>
    model.fit(X_train, y_train)
  File "C:\Users\sohan\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.8_qbz5n2kfra8p0\LocalCache\local-packages\Python38\site-packages\sklearn\svm\_base.py", line 160, in fit
    X, y = self._validate_data(X, y, dtype=np.float64,
  File "C:\Users\sohan\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.8_qbz5n2kfra8p0\LocalCache\local-packages\Python38\site-packages\sklearn\base.py", line 432, in _validate_data
    X, y = check_X_y(X, y, **check_params)
  File "C:\Users\sohan\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.8_qbz5n2kfra8p0\LocalCache\local-packages\Python38\site-packages\pandas\core\arrays\numpy_.py", line 211, in __array__
    return np.asarray(self._ndarray, dtype=dtype)
  File "C:\Users\sohan\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.8_qbz5n2kfra8p0\LocalCache\local-packages\Python38\site-packages\numpy\core\_asarray.py", line 102, in asarray
    return array(a, dtype, copy=False, order=order)
ValueError: setting an array element with a sequence.

#

Can someone please help me with this error

serene scaffold Mar 13, 2021, 4:47 AM

#

@twilit pilot do you understand what is meant by TypeError: only size-1 arrays can be converted to Python scalars?

#

Refer to this:

>>> import numpy as np
>>> arr = np.array([1])
>>> arr
array([1])
>>> int(arr)
1
>>> arr = np.array([1, 2, 3])
>>> int(arr)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: only size-1 arrays can be converted to Python scalars
>>> arr = np.array([[1]])
>>> int(arr)
1

twilit pilot Mar 13, 2021, 5:09 AM

#

serene scaffold Refer to this: ```py >>> import numpy as np >>> arr = np.array([1]) >>> arr arra...

i get that only size-1 arrays can be converted to Python scalars, but to fit my data, obv my X won't be a size-1 array

misty flint Mar 13, 2021, 5:17 AM

#

exotic maple my friend just sent me the greatest piece of heresy ive ever seen ```py import p...

i love how you got all the lurkers to come out with that comment

#

ID_BoomKek

velvet thorn Mar 13, 2021, 5:27 AM

#

twilit pilot i get that only size-1 arrays can be converted to Python scalars, but to fit my ...

think about what shape it needs to be

#

vs what shape it is

twilit pilot Mar 13, 2021, 5:34 AM

#

velvet thorn think about what shape it needs to be

well i want it to be a 1d array, and it is a 1d array

#

@velvet thorn does it have to be a 2d array?

fair stream Mar 13, 2021, 5:43 AM

#

How I can add a link to a word?

velvet thorn Mar 13, 2021, 5:49 AM

#

twilit pilot well i want it to be a 1d array, and it is a 1d array

...why would you want X to be 1D

twilit pilot Mar 13, 2021, 6:03 AM

#

velvet thorn ...why would you want `X` to be 1D

its still the same issue if i convert to a 2d array (50x50)

velvet thorn Mar 13, 2021, 6:03 AM

#

twilit pilot its still the same issue if i convert to a 2d array (50x50)

show code

twilit pilot Mar 13, 2021, 6:03 AM

#

ok

#

wait

#

@velvet thorn ```py
import os
import cv2
import pickle
import numpy as np
import pandas as pd
from sklearn.svm import SVC
import matplotlib.pyplot as plt
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split

# Load the dataset
with open('data/dataframe.txt', 'rb') as infile:
    df = pickle.load(infile)

"""
image label
0 [[146, 151, 156, 158, 160, 154, 144, 131, 127,... butterfly
1 [[156, 156, 156, 157, 158, 159, 160, 160, 160,... butterfly
2 [[146, 198, 172, 200, 168, 226, 186, 183, 192,... butterfly
3 [[66, 57, 53, 51, 42, 63, 95, 123, 139, 121, 7... butterfly
4 [[48, 110, 212, 226, 232, 248, 144, 186, 190, ... butterfly
... ... ...
26174 [[157, 150, 122, 131, 149, 151, 162, 190, 132,... squirrel
26175 [[128, 119, 99, 73, 58, 132, 108, 110, 97, 88,... squirrel
26176 [[172, 162, 151, 108, 109, 115, 132, 174, 183,... squirrel
26177 [[16, 89, 112, 109, 65, 30, 46, 97, 98, 118, 8... squirrel
26178 [[97, 106, 94, 101, 55, 121, 129, 77, 35, 18, ... squirrel

[26179 rows x 2 columns]
"""

# X, y, X_train, X_test, y_train, y_test
X = df['image']
y = df['label']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Creating and training model
model = SVC()
model.fit(X_train, y_train) # <-- Error here
print(model.score(X_test, y_test))
```

brave owl Mar 13, 2021, 6:20 AM

#

Hello Everyone, I'm new to ML and NumPy, I've a basic doubt posted at #help-cupcake , It'll be nice if you can help me out, Thank You.

iron basalt Mar 13, 2021, 6:29 AM

#

twilit pilot ok

Flatten the input.

twilit pilot Mar 13, 2021, 6:29 AM

#

iron basalt Flatten the input.

thats what i tried earlier and that also had the same issue

#

i mentioned it above

iron basalt Mar 13, 2021, 6:30 AM

#

I don't see it in your code posted though.

twilit pilot Mar 13, 2021, 6:31 AM

#

twilit pilot This is my pandas dataframe ``` ...

@iron basalt I didn't post the code, but here is what the pandas dataframe looked like

iron basalt Mar 13, 2021, 6:34 AM

#

Which scikit version number?

twilit pilot Mar 13, 2021, 6:38 AM

#

iron basalt Which scikit version number?

'0.23.2'

iron basalt Mar 13, 2021, 6:42 AM

#

oh uh, I never gave a df directly to scikit-learn before, seems incorrect.

twilit pilot Mar 13, 2021, 6:43 AM

#

converting it to numpy gave the same error

#

this has become very frustrating for me now 😅

iron basalt Mar 13, 2021, 6:43 AM

#

Might as well upgrade to 0.24 first to make sure

twilit pilot Mar 13, 2021, 6:44 AM

#

ok ill try that although there shouldnt be that big of a difference

iron basalt Mar 13, 2021, 6:46 AM

#

How are you converting the columns to a numpy array

#

Your code posted is missing a bunch of things.

#

@twilit pilot

twilit pilot Mar 13, 2021, 6:52 AM

#

@iron basalt I didn't show it in the code, but the pandas.DataFrame has a .to_numpy() method

iron basalt Mar 13, 2021, 6:53 AM

#

Show the real code

fading burrow Mar 13, 2021, 6:54 AM

#

can you show which line is causing the error?

#

the first occurence of the exception i mean.

twilit pilot Mar 13, 2021, 6:54 AM

#

twilit pilot @velvet thorn ```py import os import cv2 import pickle import numpy as ...

@fading burrow its the second to last line

twilit pilot Mar 13, 2021, 6:55 AM

#

twilit pilot ```py TypeError: only size-1 arrays can be converted to Python scalars The abov...

this was the error

twilit pilot Mar 13, 2021, 6:55 AM

#

iron basalt Show the real code

```py
import os
import cv2
import pickle
import numpy as np
import pandas as pd
from sklearn.svm import SVC
import matplotlib.pyplot as plt
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split

# Load the dataset
with open('data/dataframe.txt', 'rb') as infile:
    df = pickle.load(infile)

"""
                                                   image      label
0      [146, 151, 156, 158, 160, 154, 144, 131, 127,...  butterfly
1      [156, 156, 156, 157, 158, 159, 160, 160, 160,...  butterfly
2      [146, 198, 172, 200, 168, 226, 186, 183, 192,...  butterfly
3      [66, 57, 53, 51, 42, 63, 95, 123, 139, 121, 7...  butterfly
4      [48, 110, 212, 226, 232, 248, 144, 186, 190, ...  butterfly
...                                                  ...        ...
26174  [157, 150, 122, 131, 149, 151, 162, 190, 132,...   squirrel
26175  [128, 119, 99, 73, 58, 132, 108, 110, 97, 88,...   squirrel
26176  [172, 162, 151, 108, 109, 115, 132, 174, 183,...   squirrel
26177  [16, 89, 112, 109, 65, 30, 46, 97, 98, 118, 8...   squirrel
26178  [97, 106, 94, 101, 55, 121, 129, 77, 35, 18, ...   squirrel

[26179 rows x 2 columns]
"""

# X, y, X_train, X_test, y_train, y_test
X = df['image'].to_numpy()
y = df['label'].to_numpy()
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Creating and training model
model = SVC()
model.fit(X_train, y_train) # <-- Error here
print(model.score(X_test, y_test))
```
It's essentially the same thing

iron basalt Mar 13, 2021, 6:56 AM

#

Where is the flatten?

twilit pilot Mar 13, 2021, 6:56 AM

#

iron basalt Where is the flatten?

its already flattened

iron basalt Mar 13, 2021, 6:56 AM

#

X is [[[]], [[]], ...]

#

Needs to be [[], [], [], ....]

twilit pilot Mar 13, 2021, 6:57 AM

#

iron basalt Needs to be [[], [], [], ....]

it is let me print it

iron basalt Mar 13, 2021, 6:57 AM

#

So you flattened it and loaded the flattened one

fading burrow Mar 13, 2021, 6:57 AM

#

it seems like your function is getting an array when it should be getting numbers

#

if your data is (1,2,2), then the function will receive arrays instead of numbers

iron basalt Mar 13, 2021, 6:58 AM

#

what does x and y print like after becoming a numpy array.

twilit pilot Mar 13, 2021, 6:58 AM

#

[array([146, 151, 156, ..., 149, 144, 142], dtype=uint8)
 array([156, 156, 156, ..., 141, 142, 141], dtype=uint8)
 array([146, 198, 172, ..., 204, 192, 190], dtype=uint8) ...
 array([172, 162, 151, ..., 199, 199, 198], dtype=uint8)
 array([ 16,  89, 112, ..., 240, 240, 240], dtype=uint8)
 array([ 97, 106,  94, ..., 253, 253, 253], dtype=uint8)]

#

that is X

fading burrow Mar 13, 2021, 6:58 AM

#

and y?

twilit pilot Mar 13, 2021, 6:58 AM

#

wait

#

['butterfly' 'butterfly' 'butterfly' ... 'squirrel' 'squirrel' 'squirrel']

#

@iron basalt @fading burrow

iron basalt Mar 13, 2021, 6:59 AM

#

dtype?

twilit pilot Mar 13, 2021, 6:59 AM

#

uint8

#

X dtype = uint8

iron basalt Mar 13, 2021, 6:59 AM

#

and y?

twilit pilot Mar 13, 2021, 6:59 AM

#

y is just a string

#

you can use label encoder, but will still get the same error

iron basalt Mar 13, 2021, 7:00 AM

#

numpy arrays have multiple ways to store strings, idr if scikit-learn accepts all

#

the error is probably x, should look more like this: [[...], [...], ...] but you have something strange going on with [ndarray(...), ...]

twilit pilot Mar 13, 2021, 7:02 AM

#

so you want me to convert to regular list?

#

ok ill try

iron basalt Mar 13, 2021, 7:02 AM

#

well one big 2d numpy array

#

not list of numpy arrays

fading burrow Mar 13, 2021, 7:03 AM

#

i don't think that's the issue but you can try

twilit pilot Mar 13, 2021, 7:03 AM

#

yea ill try rn

iron basalt Mar 13, 2021, 7:03 AM

#

I'm just trying to match everything as much as possible.

twilit pilot Mar 13, 2021, 7:05 AM

#

i can send you the whole thing if you want

#

so you can test it on your own computer

#

@iron basalt

iron basalt Mar 13, 2021, 7:05 AM

#

sure

twilit pilot Mar 13, 2021, 7:05 AM

#

ok gimme a little while

iron basalt Mar 13, 2021, 7:06 AM

#

I don't need the whole thing, just some of the data

#

well, you have it pickled.

twilit pilot Mar 13, 2021, 7:16 AM

#

@iron basalt here is the python code https://github.com/sohan-py/testhelp/blob/main/main.py and here is the dataset https://github.com/sohan-py/testhelp/blob/main/help_dataframe.txt

#

its late for me rn, so i will head for bed. you can try running the code on your computer or editing it to make it work. Good Night!

iron basalt Mar 13, 2021, 7:18 AM

#

I'm pretty sure the error is that X thing

misty flint Mar 13, 2021, 7:18 AM

#

ID_BoomKek

iron basalt Mar 13, 2021, 7:18 AM

#

X is a numpy array with dtype "object"

#

because it contains multiple numpy arrays

twilit pilot Mar 13, 2021, 7:19 AM

#

iron basalt I'm pretty sure the error is that X thing

yea im pretty sure its there too

misty flint Mar 13, 2021, 7:19 AM

#

all the examples have a different kind of input for X

#

at least for sklearn svm svc

#

Sip

iron basalt Mar 13, 2021, 7:19 AM

#

The lesson here is to not store images as arrays in pandas, usually people have the dataset in some other format

twilit pilot Mar 13, 2021, 7:19 AM

#

what format

misty flint Mar 13, 2021, 7:19 AM

#

ValkNaruhodo

#

i will note this so i do not make the same mistake

iron basalt Mar 13, 2021, 7:20 AM

#

like let's say you want to load mnist

#

you just use an mnist loader

#

If you have images and are making your own dataset

misty flint Mar 13, 2021, 7:20 AM

#

and yes i remember the image dataset i was working with was NOT stored with np arrays

#

pithink

iron basalt Mar 13, 2021, 7:20 AM

#

instead of storing the image data in the table, store paths to the image files, and use those to load all of them

#

combine into one thing

twilit pilot Mar 13, 2021, 7:21 AM

#

but when i put into model, i need numpy array

iron basalt Mar 13, 2021, 7:21 AM

#

Or do what many datasets like mnist do and store multiple images in one file

#

makes it easier

#

yea you do, and you still can

#

you just need some extra work now to make this X something like array([[...], [...], ...], dtype=np.uint8)

twilit pilot Mar 13, 2021, 7:22 AM

#

so an np array of regular arrays?

iron basalt Mar 13, 2021, 7:23 AM

#

yup got it running

#

import pickle
import numpy as np
import pandas as pd
from sklearn.svm import SVC
import matplotlib.pyplot as plt
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split

# Load the dataset
with open('help_dataframe.txt', 'rb') as infile:
    df = pickle.load(infile)

"""
                                                   image      label
0      [146, 151, 156, 158, 160, 154, 144, 131, 127,...  butterfly
1      [156, 156, 156, 157, 158, 159, 160, 160, 160,...  butterfly
2      [146, 198, 172, 200, 168, 226, 186, 183, 192,...  butterfly
3      [66, 57, 53, 51, 42, 63, 95, 123, 139, 121, 7...  butterfly
4      [48, 110, 212, 226, 232, 248, 144, 186, 190, ...  butterfly
...                                                  ...        ...
26174  [157, 150, 122, 131, 149, 151, 162, 190, 132,...   squirrel
26175  [128, 119, 99, 73, 58, 132, 108, 110, 97, 88,...   squirrel
26176  [172, 162, 151, 108, 109, 115, 132, 174, 183,...   squirrel
26177  [16, 89, 112, 109, 65, 30, 46, 97, 98, 118, 8...   squirrel
26178  [97, 106, 94, 101, 55, 121, 129, 77, 35, 18, ...   squirrel

[26179 rows x 2 columns]
"""

# X, y, X_train, X_test, y_train, y_test
X = df['image'].to_numpy()
X = np.stack(X) # <- this stacks all the arrays in the array creating a 2d array
y = df['label'].to_numpy()
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Creating and training model
model = SVC()
model.fit(X_train, y_train) # <-- no longer errors now that X is one 2d array
print(model.score(X_test, y_test))

#

Now if you print X you see something like:

#

[[123 125 123 ... 137 108 132]
 [ 96 104 101 ... 125 157  83]
 [147 143 147 ... 133 139 150]
 ...
 [231 232 227 ... 141 151 177]
 [225 222 217 ... 199 201 199]
 [115 116 116 ...  71  68 110]]

twilit pilot Mar 13, 2021, 7:24 AM

#

so a 2d array what shape?

iron basalt Mar 13, 2021, 7:24 AM

#

let me give you an example

twilit pilot Mar 13, 2021, 7:26 AM

#

ok

iron basalt Mar 13, 2021, 7:26 AM

#

>>> import numpy as np
>>> a = np.array([np.arange(3), np.arange(3), np.arange(3)])
>>> a
array([[0, 1, 2],
       [0, 1, 2],
       [0, 1, 2]])
>>> a = np.empty(3, object)
>>> a[:] = [np.arange(3), np.arange(3), np.arange(3)]
>>> a
array([array([0, 1, 2]), array([0, 1, 2]), array([0, 1, 2])], dtype=object)
>>> np.stack(a)
array([[0, 1, 2],
       [0, 1, 2],
       [0, 1, 2]])
>>>

#

got it now?

twilit pilot Mar 13, 2021, 7:27 AM

#

oooooohhhhhhh

#

i see

#

gosh man thanks!!

iron basalt Mar 13, 2021, 7:27 AM

#

pandas to_numpy keeps each cell's entry as its own numpy array, so the result is a 1d array with dtype object
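A small sketch of that behaviour on a made-up three-row dataframe (a stand-in for the real one, with 4 pixels per image instead of 2500):

```python
import numpy as np
import pandas as pd

# Tiny stand-in for the real dataframe: each cell holds its own 1D uint8 array
df = pd.DataFrame({
    "image": [np.arange(4, dtype=np.uint8) for _ in range(3)],
    "label": ["butterfly", "butterfly", "squirrel"],
})

X_obj = df["image"].to_numpy()
print(X_obj.dtype, X_obj.shape)  # 1D container of arrays, not a numeric matrix

X_2d = np.stack(X_obj)
print(X_2d.dtype, X_2d.shape)    # one numeric 2D array, which is what fit() expects
```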

twilit pilot Mar 13, 2021, 7:28 AM

#

i see

#

i didn't know this

iron basalt Mar 13, 2021, 7:28 AM

#

Always check the types

twilit pilot Mar 13, 2021, 7:28 AM

#

thanks to you i also learned something new thanks man!

iron basalt Mar 13, 2021, 7:28 AM

#

Type errors in python can trickle down to later parts of the code

#

It's one of the big issues with dynamically typed languages

twilit pilot Mar 13, 2021, 7:28 AM

#

i see

iron basalt Mar 13, 2021, 7:28 AM

#

They are convenient, but can cause more errors.

twilit pilot Mar 13, 2021, 7:29 AM

#

yes true

misty flint Mar 13, 2021, 7:29 AM

#

python

iron basalt Mar 13, 2021, 7:29 AM

#

Or more specifically, errors in other parts of the code, even though the error is elsewhere (error propagation)

#

So you as a python programmer need to error backprop 😉

misty flint Mar 13, 2021, 7:30 AM

#

then you have to search like a pirate

#

kannaSus

twilit pilot Mar 13, 2021, 7:30 AM

#

iron basalt So you as a python programmer need to error backprop 😉

got it man

#

@iron basalt Thanks for taking your time to fix my error man I really appreciate it! Have a great rest of your day!

iron basalt Mar 13, 2021, 7:32 AM

#

This is also a software engineering issue on scikit-learn's end, it should have assertions for the dtype of the inputs and shapes. As it can only really work with a certain subset of all dtypes possible (and shapes).

#

The assertion would have triggered and made the issue immediately obvious.

#

Assertions == good, use them everywhere to check things (check pre-conditions).
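As an illustration of that kind of pre-condition check, a hedged sketch: `check_features` is a made-up helper for your own code, not something scikit-learn provides.

```python
import numpy as np

def check_features(X):
    """Fail early, with a readable message, if X is not a numeric 2D array."""
    assert isinstance(X, np.ndarray), f"expected ndarray, got {type(X).__name__}"
    assert X.ndim == 2, f"expected 2D (n_samples, n_features), got shape {X.shape}"
    assert np.issubdtype(X.dtype, np.number), f"expected numeric dtype, got {X.dtype}"
    return X

good = np.zeros((3, 4), dtype=np.uint8)
check_features(good)  # passes silently

bad = np.empty(3, dtype=object)  # 1D object array, like the X that broke SVC.fit
try:
    check_features(bad)
except AssertionError as err:
    print("caught:", err)
```

Called right before `model.fit`, a check like this points at the shape problem directly instead of surfacing it deep inside numpy.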

#

Yea I don't see any of the code dealing with the input being "object", only when it's specifically a string object. Just "object" will pass all checks and so it will error later on in numpy (looking inside the fit function on github).

#

The better approach would be to flip it around. Only allow a specific set of types, instead of having a bunch of checks for different ones and do conversions and stuff just for those (less error prone).

#

(Overly generic code, while not actually being fully generic)

misty flint Mar 13, 2021, 7:46 AM

#

blobhyperthink

harsh trellis Mar 13, 2021, 8:40 AM

#

guys

#

i have a question: why do we use scaling on a data set? like if my data is not skewed but has outliers, it's not like scaling would get rid of them?

#

there are scalers like standard scaler and minmax scaler

#

can anyone help me?

ripe forge Mar 13, 2021, 9:02 AM

#

Scaling isn't used to deal with outliers per se, or to deal with skewness. So your question needs to simply boil down to, what does scaling do.

#

And the answer there is, it depends on your model architecture. So tree based models don't necessarily need scaling since they create decision points from the data itself

#

But for deep learning and linear regression scaling offers unique benefits. For deep learning it helps with model fitting because the activation functions operate best within a certain range, and the weights are also initialized with a certain expectation for input scales*

#

For linear regression, scaling allows you to meaningfully compare coefficients across two features, which you can't do without scaling.

#

There might be other reasons too, but those are the ones I can think of off the top of my head
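To make the "scaling doesn't deal with outliers" point concrete, here is a small numpy sketch on made-up numbers. The two formulas are the usual standardization and min-max rescaling, written by hand rather than through scikit-learn:

```python
import numpy as np

# One made-up feature with an obvious outlier at the end
x = np.array([1.0, 2.0, 3.0, 4.0, 100.0])

standardized = (x - x.mean()) / x.std()          # zero mean, unit variance
min_maxed = (x - x.min()) / (x.max() - x.min())  # squeezed into [0, 1]

print(standardized.round(2))
print(min_maxed.round(2))
# The outlier is still the largest value by far in both versions:
# scaling changes the units, not the shape of the distribution.
```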

tidal bronze Mar 13, 2021, 9:07 AM

#

If I were to use a box-cox transformation, is it possible to reconvert the results back to the original units? If so how could I do that?

#

@ripe forge somewhat related question

#

https://docs.scipy.org/doc/scipy/reference/generated/scipy.special.inv_boxcox.html

#

would this be it?

ripe forge Mar 13, 2021, 9:10 AM

#

https://stackoverflow.com/questions/26391454/reverse-box-cox-transformation

Stack Overflow

Reverse Box-Cox transformation

I am using SciPy's boxcox function to perform a Box-Cox transformation on a continuous variable.

from scipy.stats import boxcox
import numpy as np
y = np.random.random(100)
y_box, lambda_ = ss.box...

#

Yes.
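A short round-trip sketch using `scipy.stats.boxcox` and the `scipy.special.inv_boxcox` function linked above. The data is random and made up; the important part is keeping the fitted lambda so the transform can be undone:

```python
import numpy as np
from scipy.special import inv_boxcox
from scipy.stats import boxcox

rng = np.random.default_rng(0)
y = rng.uniform(1.0, 10.0, size=100)  # Box-Cox needs strictly positive data

y_box, lam = boxcox(y)           # lam is the fitted lambda; keep it around
y_back = inv_boxcox(y_box, lam)  # back to the original units

print(np.allclose(y, y_back))
```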

tidal bronze Mar 13, 2021, 9:11 AM

#

awesome sorry for bothering you for nothing 😘

harsh trellis Mar 13, 2021, 9:23 AM

#

@tidal bronze for that u need to keep a copy of the dataset so u dont lose it accidentally, or else u can restart the kernel and re-run the cell

#

@ripe forge i see, and does it work on adaboost and sgboost? or else ensemble models

tidal bronze Mar 13, 2021, 9:44 AM

#

thanks for the tip

scenic ravine Mar 13, 2021, 9:56 AM

#

df.loc[df_pl['placement_ts'] >= pd.to_datetime('2018')]

error Invalid comparison between dtype=datetime64[ns, pytz.FixedOffset(330)] and Timestamp

help please!

tidal bronze Mar 13, 2021, 10:12 AM

#

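it seems the two sides of the comparison don't match: the column is timezone-aware (fixed offset of 330 minutes, i.e. +05:30) but the Timestamp you compare it against has no timezone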
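A minimal sketch of one way around it, assuming it is fine to interpret the cutoff in the column's own timezone. The dataframe here is made up and only mimics a column with a fixed +05:30 offset:

```python
import pandas as pd

# Made-up timestamps with a fixed +05:30 offset, like FixedOffset(330)
df = pd.DataFrame({
    "placement_ts": pd.to_datetime([
        "2017-06-01 10:00:00+05:30",
        "2018-03-01 09:30:00+05:30",
    ]),
})

# Give the cutoff the same timezone as the column before comparing
cutoff = pd.Timestamp("2018-01-01", tz=df["placement_ts"].dt.tz)
recent = df.loc[df["placement_ts"] >= cutoff]
print(recent)
```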

lean ledge Mar 13, 2021, 10:48 AM

#

ripe forge But for deep learning and linear regression scaling offers unique benefits. For ...

The bigger benefit for DL is that gradient updates treat different dimensions more equally. Your loss landscape where one variable goes from 1e-4 to 2e-4 and one goes from 0 to 10000 is going to be weird. A step size on the scale of 1 isn't going to make much difference on one variable while being a huge leap on the other

#

Scaling makes it so that the size of gradient updates is more fair to all dimensions

#

Otherwise your large steps combined with some gradient noise would make optimisation incredibly hard for many smaller scaled dimensions, or smaller learning rates wouldn't make meaningful progress on other dimensions
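A toy numpy sketch of that effect, using a plain linear model and mean squared error rather than an actual deep network. The feature ranges are the ones mentioned above; everything else (target, sample size) is made up:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 1000
small = rng.uniform(1e-4, 2e-4, n)  # feature living around 1e-4
large = rng.uniform(0, 10_000, n)   # feature living around 1e4

def standardize(col):
    return (col - col.mean()) / col.std()

# Made-up target that depends on both features to a comparable degree
y = 3 * standardize(small) + 2 * standardize(large) + 5

def mse_gradient_at_zero(X, y):
    """Gradient of mean squared error w.r.t. the weights of y_hat = X @ w, at w = 0."""
    return -2 * X.T @ y / len(y)

raw = np.column_stack([small, large])
scaled = np.column_stack([standardize(small), standardize(large)])

g_raw = np.abs(mse_gradient_at_zero(raw, y))
g_scaled = np.abs(mse_gradient_at_zero(scaled, y))
print(g_raw)     # components differ by many orders of magnitude
print(g_scaled)  # components are of the same order of magnitude
```

With the raw features, no single learning rate suits both weights; after standardizing, one step size is reasonable for both.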

tidal bronze Mar 13, 2021, 11:09 AM

#

using seaborn how could I make the last bars darker?

harsh trellis Mar 13, 2021, 12:16 PM

#

lean ledge The bigger benefit for DL is gradient updates treat different dimensions more eq...

i see, thanks !!

hoary wigeon Mar 13, 2021, 2:17 PM

#

Hello

#

I want to retrieve tables from html file.

My code : https://paste.pythondiscord.com/moyiyowatu.py
Running this python script will copy the html code from the url mentioned in the code in current directory with name = f'{today_date}.html'
After saving the code, I want the table of that copied html file.

tidal bough Mar 13, 2021, 3:28 PM

#

Sounds like you want to change the aspect ratio.

misty flint Mar 13, 2021, 3:31 PM

#

Sip

lavish tundra Mar 13, 2021, 3:31 PM

#

tidal bough Sounds like you want to change the aspect ratio.

u know how i can do it on a lineplot graphic on seaborn?

mighty cobalt Mar 13, 2021, 3:32 PM

#

Hello everyone, need a little help here

#

am trying to manipulate discord profile pics ran into a problem

#

how do i read gif's from Url's using cv and urllib

tidal bough Mar 13, 2021, 3:37 PM

#

lavish tundra u know how i can do it on a lineplot graphic on seaborn?

You can do so on the Axes object lineplot returns:
https://matplotlib.org/stable/api/axes_api.html#aspect-ratio

#

.set_aspect(1/2) for 1:2 height:width
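A small sketch of the call, using plain matplotlib so it runs without seaborn; the `ax` here stands in for whatever `sns.lineplot(...)` returns, and the plotted numbers are made up:

```python
import matplotlib
matplotlib.use("Agg")  # off-screen backend so the sketch runs without a display
import matplotlib.pyplot as plt

fig, ax = plt.subplots()
ax.plot([0, 1, 2, 3], [0, 1, 4, 9])  # stand-in for the Axes that sns.lineplot(...) returns
ax.set_aspect(1 / 2)                 # one y data unit is drawn half as long as one x data unit
print(ax.get_aspect())
```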

lavish tundra Mar 13, 2021, 3:43 PM

#

how i can know the right proportion?

tidal bough Mar 13, 2021, 3:45 PM

#

WDYM right?

misty flint Mar 13, 2021, 3:45 PM

#

lavish tundra how i can know the right proportion?

test it out

mighty cobalt Mar 13, 2021, 3:55 PM

#

how do i combine all frames and save them as Gif?

tidal bough Mar 13, 2021, 3:55 PM

#

What library are you using? PIL has a guide on that, IIRC.

mighty cobalt Mar 13, 2021, 3:56 PM

#

if asking me cv2

bronze jacinth Mar 13, 2021, 4:11 PM

#

hello im new to machine learning and i just had a few doubts about a project that im doing

#

not really understanding the final output of the project (involves svm)

#

help?

lavish tundra Mar 13, 2021, 4:13 PM

#

looks like it draws the graphic first, and only after centering the graphic does it put the legend over it. i'll try to see if there's a way to put the legend on the side and whether that fixes it

hoary wigeon Mar 13, 2021, 4:19 PM

#

tidal bough What library are you using? PIL has a guide on that, IIRC.

i need help

tidal bough Mar 13, 2021, 4:20 PM

#

hoary wigeon i need help

...why are you pinging me here instead of, say, writing your question in a help channel, or anywhere?

hoary wigeon Mar 13, 2021, 4:20 PM

#

i tried that 3 time in help

#

and twice here

lapis sequoia Mar 13, 2021, 4:24 PM

#

i need help

#

help me

#

@hoary wigeon hi

#

help me

hoary wigeon Mar 13, 2021, 4:25 PM

#

How may i help you ?

viscid quest Mar 13, 2021, 4:26 PM

#

@hoary wigeon can you see the error in this code?

#

I mean there is an error in last command

raven halo Mar 13, 2021, 4:27 PM

#

hello

viscid quest Mar 13, 2021, 4:27 PM

#

Hey there.

misty flint Mar 13, 2021, 4:27 PM

#

memecringeharold

raven halo Mar 13, 2021, 4:27 PM

#

misty flint memecringeharold

:D

misty flint Mar 13, 2021, 4:28 PM

#

you should copy and paste instead of taking a picture like that

lapis sequoia Mar 13, 2021, 4:28 PM

#

import speech_recognition as sr
import pyttsx3
import pywhatkit
import datetime
import wikipedia

listener = sr.Recognizer()
engine = pyttsx3.init()
voices = engine.getProperty('voices')
engine.setProperty('voice', voices[1].id)

def talk(text):
    engine.say(text)
    engine.runAndWait()

def take_command():
    try:
        with sr.Microphone() as source:
            print('listening...')
            voice = listener.listen(source)
            command = listener.recognize_google(voice)
            command = command.lower()
            if 'alexa' in command:
                command = command.replace('alexa', '')
                print(command)
    except:
        pass
    return command

def run_alexa():
    command = take_command()
    print(command)
    if 'play' in command:
        song = command.replace('play', '')
        talk('playing ' + song)
        pywhatkit.playonyt(song)
    elif 'time' in command:
        time = datetime.datetime.now().strftime('%I:%M %p')
        talk('Current time is ' + time)
    elif 'who the heck is' in command:
        person = command.replace('who the heck is', '')
        info = wikipedia.summary(person, 1)
        print(info)
        talk(info)
    elif 'date me' in command:
        talk('fucker')
    elif 'are you single' in command:
        talk('I am in a relationship with wifi')
    elif 'joke' in command:
        talk(pyjokes.get_joke())
    else:
        talk('Please say the command again.')

while True:
    run_alexa()

raven halo Mar 13, 2021, 4:28 PM

#

:O

lapis sequoia Mar 13, 2021, 4:28 PM

#

misty flint you should copy and paste instead of taking a picture like that

elp

raven halo Mar 13, 2021, 4:28 PM

#

@pájthon

lapis sequoia Mar 13, 2021, 4:29 PM

#

D:\Users\Nadia\pyton\python.exe D:/Users/Nadia/PycharmProjects/alexa.py/mine_alexa.py
listening...
Traceback (most recent call last):
  File "D:\Users\Nadia\PycharmProjects\alexa.py\mine_alexa.py", line 59, in <module>
    run_alexa()
  File "D:\Users\Nadia\PycharmProjects\alexa.py\mine_alexa.py", line 36, in run_alexa
    if 'play' in command:
TypeError: argument of type 'NoneType' is not iterable
None

Process finished with exit code 1

#

errors

serene scaffold Mar 13, 2021, 4:29 PM

#

@raven halo what are you talking about? please keep this channel on-topic

serene scaffold Mar 13, 2021, 4:29 PM

#

lapis sequoia import speech_recognition as sr import pyttsx3 import pywhatkit import datetime ...

are you trying to make a program for Alexa?

raven halo Mar 13, 2021, 4:29 PM

#

sorry

#

pandaSad

tidal bough Mar 13, 2021, 4:29 PM

#

lapis sequoia D:\Users\Nadia\pyton\python.exe D:/Users/Nadia/PycharmProjects/alexa.py/mine_ale...

    if 'play' in command:
TypeError: argument of type 'NoneType' is not iterable

This implies immediately that command is None.
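
One way to guard against that, as a rough sketch rather than a drop-in fix: give command a default value so the function never returns None, and check for an empty command before testing it with `in`. The `recognize` argument and the `dispatch` helper are hypothetical stand-ins (not part of the posted script) for the microphone and recognize_google step and for the if/elif chain in run_alexa, so the logic can be tried without audio hardware.

```python
def take_command(recognize):
    """Return the recognized command in lowercase, or '' if recognition fails.

    `recognize` is a hypothetical callable standing in for the
    microphone + recognize_google step of the posted script.
    """
    command = ""  # default value, so the function never returns None
    try:
        command = recognize().lower()
        if 'alexa' in command:
            command = command.replace('alexa', '').strip()
    except Exception as exc:
        # Report the failure instead of silently passing.
        print('could not get a command:', exc)
    return command


def dispatch(command):
    """Hypothetical, trimmed-down version of the if/elif chain in run_alexa."""
    if not command:
        # Nothing was recognized, so skip the 'in' checks entirely.
        return 'Please say the command again.'
    if 'play' in command:
        return 'playing ' + command.replace('play', '').strip()
    return 'Please say the command again.'
```

With this shape, a failed recognition ends up in the "say the command again" branch instead of raising a TypeError.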

lapis sequoia Mar 13, 2021, 4:29 PM

#

serene scaffold are you trying to make a program for Alexa?

yes

serene scaffold Mar 13, 2021, 4:30 PM

#

lapis sequoia yes

have you made a web app of any kind before? if you want it to run on Alexa devices, the actual AI component is abstracted away by Amazon.

#

so this is a web development question, and you can do it using the flask framework for Alexa: https://flask-ask.readthedocs.io/en/latest/

bronze jacinth Mar 13, 2021, 4:42 PM

#

https://www.kaggle.com/fedesoriano/company-bankruptcy-prediction
im using this dataset and training a model using svm
since im new i dont exactly know what the output im getting is
can anyone help?

Company Bankruptcy Prediction

Bankruptcy data from the Taiwan Economic Journal for the years 1999–2009

spice shuttle Mar 13, 2021, 5:01 PM

#

Hello. I searched and think I've selected the right topic for this question. I'm new to Python and wrote a while loop that ultimately produces a number stored in a variable called 'cycles' after each round of cycles is completed. How can I store these 'cycles' values and then print them after all rounds are completed? Here's what I have so far, but I keep getting an array with the same values for each cycle, ex. [7, 7, 7]
results = []
for i in range(totalGames):
results.append(cycles)

tidal bough Mar 13, 2021, 5:05 PM

#

spice shuttle Hello. I searched and think I've selected the right topic for this question. I'm...

If cycles is a list, I presume you're changing the same object each round, so your results list ends up with many references to the same list you've been changing all along.

#

What you probably want instead is to store a copy of the current cycles:
results.append(cycles.copy())
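
A small illustration of the difference, assuming cycles is a list that gets modified in place each round (the names `shared` and `copies` are made up for this sketch):

```python
cycles = [0]   # one list object, mutated in place every round
shared = []    # will hold references to that same object
copies = []    # will hold independent snapshots

for _ in range(3):
    cycles[0] += 1
    shared.append(cycles)         # appends a reference, not the current value
    copies.append(cycles.copy())  # appends a snapshot of the current contents

print(shared)  # [[3], [3], [3]]
print(copies)  # [[1], [2], [3]]
```

If cycles is a plain integer rather than a list, no copy is needed; appending it inside the loop is enough.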

spice shuttle Mar 13, 2021, 5:07 PM

#

Add that copy outside of the while loop?

tidal bough Mar 13, 2021, 5:08 PM

#

Not sure what you mean.

spice shuttle Mar 13, 2021, 5:09 PM

#

https://replit.com/join/bqclltdr-brettgowder

#

I thought it'd make more sense to share it.

tidal bough Mar 13, 2021, 5:12 PM

#

Why, every game, throw away all results and replace them with tons of copies of the latest one?

#

For that matter, you seem to have been planning to add them to cycleslist, not results.

#

Just, each iteration of the while-loop, do cycleslist.append(cycles) at the end.
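
Roughly like this, as a sketch only: the game itself (`play_game`, counting rolls until a 6) is a made-up placeholder, since the real rules live in the linked repl, and `run_games`, `games_played`, and `seed` are likewise hypothetical names. Only cycleslist and cycles come from the discussion above.

```python
import random


def play_game(rng):
    """Hypothetical game: count cycles until a die roll comes up 6."""
    cycles = 0
    while True:
        cycles += 1
        if rng.randint(1, 6) == 6:
            return cycles


def run_games(total_games, seed=0):
    """Play total_games games and return one cycle count per game."""
    rng = random.Random(seed)
    cycleslist = []
    games_played = 0
    while games_played < total_games:
        cycles = play_game(rng)
        # Record this game's count now, before the next game overwrites cycles.
        cycleslist.append(cycles)
        games_played += 1
    return cycleslist


print(run_games(3))  # one entry per game
```

The append happens once per pass through the while loop, so the list grows by one entry per game instead of being rebuilt from the last value afterwards.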

spice shuttle Mar 13, 2021, 5:12 PM

#

The assignment is to create a list of how many cycles for each game.

#

Thanks!!

#data-science-and-ml

Load the dataset

X, y, X_train, X_test, y_train, y_test

Creating and training model