#data-science-and-ml | Python | Page 300

quiet dawn Mar 24, 2021, 1:09 PM

#

is there

#

?

grave frost Mar 24, 2021, 1:43 PM

#

And, well, linear function do not useful computation make
Too much CGP Grey?

tidal bough Mar 24, 2021, 1:43 PM

#

haven't actually watched that much of him, but maybe that's where I picked up that phrase, yes

hollow sentinel Mar 24, 2021, 1:57 PM

#

there is

lapis sequoia Mar 24, 2021, 2:14 PM

#

AI is purely applied maths. Usually it all boils down to linear algebra. You don't have to understand how many of these models work and what they do mathematically. It would help no doubt. But it's probably better to keep the implementation of them a black-box and focus on the when / what / why / pros / cons of models. You can always dive deeper into the maths later

short heart Mar 24, 2021, 2:22 PM

#

Is it possible to pass a list of X_trains to LSTM

#

or how do i fit several x trains into one lstm

grave frost Mar 24, 2021, 2:27 PM

#

Does anyone have any ideas of implementations of fully unsupervised local POS taggers?

tidal bough Mar 24, 2021, 2:37 PM

#

short heart Is it possible to pass a list of X_trains to LSTM

Do you mean, like, merging several datasets into one?

short heart Mar 24, 2021, 2:38 PM

#

oops

#

im stupid

#

thanks

tidal bough Mar 24, 2021, 2:59 PM

#

Okay, data science question: I have a pandas dataframe of UTC timestamps and values. I want to plot the average (maybe +-std) value by time of day (disregarding the date). How would I do that? My current solution is very hacky and possibly incorrect.

grave frost Mar 24, 2021, 3:00 PM

#

tidal bough Okay, data science question: I have a pandas dataframe of UTC timestamps and val...

binning by hours?

#

so 24 bins, count values and plot accordingly?

tidal bough Mar 24, 2021, 3:02 PM

#

pretty much

#

except, obviously, I want to do it "the right way"

#

since it seems to me like a common-ish operation

ripe forge Mar 24, 2021, 3:29 PM

#

Can you share your current way?

eternal narwhal Mar 24, 2021, 3:33 PM

#

grave frost Does anyone have any ideas of implementations of fully unsupervised local POS ta...

I think a transformer can cluster data pretty well. Bidirectional LSTM autoencoder could work as well. But is there a reason to do it unsupervised? There are a bunch of huge datasets out there.

misty flint Mar 24, 2021, 3:43 PM

#

tidal bough Okay, data science question: I have a pandas dataframe of UTC timestamps and val...

something like how this guy does it? is that a similar scenario https://youtu.be/jV24N7SPXEU?t=171

YouTube

Data School

pandas best practices (8/10): Plotting a time series

This is part 8 of my pandas tutorial from PyCon 2018. Watch all 10 videos: https://www.youtube.com/playlist?list=PL5-da3qGB5IBITZj_dYSFqnd_15JgqwA6
This video covers the following topics: math with booleans, groupby, datetime attributes, line plots.

NEW TO PANDAS? Watch my introductory series (30+ videos):
https://www.youtube.com/playlist?list=...

▶ Play video

#

he did pd.to_datetime() previously before working with that column by the way

#

so his approach was

df.groupby(df.column_datetime.dt.hour).column.mean()

#

column being the column of interest

tidal bough Mar 24, 2021, 4:08 PM

#

hmm

#

let me figure out how to apply this to mine

tidal bough Mar 24, 2021, 4:14 PM

#

misty flint so his approach was > df.groupby(df.column_datetime.dt.hour).column.mean()

This does indeed work, though I wonder if I can also apply seaborn to the task instead of manually calculating the standard deviations.

means = df.groupby(df.date.dt.hour).eu.mean()
stds = df.groupby(df.date.dt.hour).eu.std()
plt.plot(means,label="mean")
plt.plot(means+stds)
plt.plot(means-stds)

#

Oh, this works and does what I want:

hours = df[["date","eu"]].copy()
hours["hour"] = hours["date"].dt.hour
del hours["date"]
sns.relplot(data=hours,x="hour",y="eu",kind="line")

misty flint Mar 24, 2021, 4:22 PM

#

nice

#

didnt know seaborn could do that

tidal bough Mar 24, 2021, 4:23 PM

#

essentially creating a new column with the dates replaced by hours, then passing this dataframe (with a lot of y-values for each x) to relplot

#

#

result looks like this, which is what I wanted

misty flint Mar 24, 2021, 4:23 PM

#

oh and the shaded is the std?

#

MHXwoah

tidal bough Mar 24, 2021, 4:23 PM

#

yup

misty flint Mar 24, 2021, 4:23 PM

#

magic

#

does relplot automatically display std?

#

pithink

#

or was there a parameter you had to specify

#

i think i might try and use seaborn then for my project

tidal bough Mar 24, 2021, 4:30 PM

#

misty flint does relplot automatically display std?

It's on by default

misty flint Mar 24, 2021, 4:31 PM

#

nice

#

ValkNaruhodo

#

i need to remember this

tidal bough Mar 24, 2021, 4:31 PM

#

it's the ci parameter to lineplot

#

(which relplot calls when doing kind="line")

misty flint Mar 24, 2021, 4:32 PM

#

interesting

tidal bough Mar 24, 2021, 4:32 PM

#

it also calculates them using bootstrapping, which takes like a second for my barely-2k datapoints 😅

#

(but it scales well with big n or something, don't remember why bootstrapping is considered nice)

misty flint Mar 24, 2021, 4:32 PM

#

thats pretty nifty

grave frost Mar 24, 2021, 4:39 PM

#

eternal narwhal I think a transformer can cluster data pretty well. Bidirectional LSTM autoencod...

Low resource language 😦

#

the problem is not in the clustering - but that I don't have any ground labels

#

I did find a way using NLTK (after a lot of hours of searching) , but thought that maybe there is some resource I missed

little path Mar 24, 2021, 4:51 PM

#

Write a program to generate a series of marks of 10 students. Give grace marks up to 5 of those who are having <33 marks and print the new list of the marks.
I've tried this but not working:-
import pandas as pd def Ser_stumarks(): std_marks = [] for i in range(1,11): m = int(input("Enter the marks:")) std_marks.append(m) s = pd.Series(index=range(1201,1211),data=std_marks) s[s<33]=s+5 print("New List is:") print(s[s>=33]) Ser_stumarks()

#

Is there any easy and simple code to do that?

tidal bough Mar 24, 2021, 4:54 PM

#

what's "grace marks"?

#

oh, I see, increase their mark by 5

#

why are you filtering the printed result to only the >33 ones, though?

short heart Mar 24, 2021, 5:27 PM

#

i added quiet alot of data for lstm to learn with, but the results are only worse

#

any ideas why?

uncut barn Mar 24, 2021, 5:33 PM

#

twin mantle Mar 24, 2021, 6:00 PM

#

Hello

#

I have this

#


sns.histplot(datos, bins=9, binrange=(10,99), color='gray', kde=True,
             line_kws= {'color':'blue','linestyle': 'dashed'},
             fill=False)

#

But it's still entirely gray

#

How can I change the kde line color?

grave frost Mar 24, 2021, 6:12 PM

#

uncut barn

reduce learning rate, try different optimizers

quiet dawn Mar 24, 2021, 6:13 PM

#

lapis sequoia AI is purely applied maths. Usually it all boils down to linear algebra. You don...

thanks

twin mantle Mar 24, 2021, 6:21 PM

#

That sounds like a bag of words

grave frost Mar 24, 2021, 6:26 PM

#

twin mantle That sounds like a bag of words

VSM is basically an improved version of BOW that use some more complex information (like tf-idf score for a word)

#

https://stats.stackexchange.com/questions/31060/bag-of-words-vs-vector-space-model

Cross Validated

Bag of words vs vector space model?

What is/are the difference/s between these text representation models: Bag of words and vector space model?

rancid vine Mar 24, 2021, 6:33 PM

#

I mean, alot of it. Mostly linear algebra. A nightmarish amount. xD

twin mantle Mar 24, 2021, 6:39 PM

#

LOL, there's a lot of math inside AI

twin mantle Mar 24, 2021, 6:41 PM

#

twin mantle ```py sns.histplot(datos, bins=9, binrange=(10,99), color='gray', kde=True, ...

Any help?

twin mantle Mar 24, 2021, 6:44 PM

#

grave frost https://stats.stackexchange.com/questions/31060/bag-of-words-vs-vector-space-mod...

Ty for this, reading this

quiet dawn Mar 24, 2021, 6:51 PM

#

twin mantle LOL, there's a lot of math inside AI

actually i said that there isn't something about math in site

#

not in ai

little path Mar 24, 2021, 6:59 PM

#

Write a panda program to enter marks in main five subjects of a student and Calculate sum of all marks.

b) Also write a small python code to create a dataframe with headings(Name and Age) from the list given below :

[[‘’Alex”,26],[“Maddy”,44],[“Rolex”,26],[“Mona“,37]]

Now sort the data as per the name

SUM=DF[‘ENG’]+DF[‘MATH’]+DF[‘HINDI’]

DF[‘PER’]

#

I can't do the sorting part

#

Rest I've done plz tell me how to do that

twin mantle Mar 24, 2021, 7:01 PM

#

Did you create the dataframe?

little path Mar 24, 2021, 7:01 PM

#

Yes

twin mantle Mar 24, 2021, 7:01 PM

#

Sort it

grave frost Mar 24, 2021, 7:02 PM

#

quiet dawn actually i said that there isn't something about math in site

there is no side of AI that is free from math

twin mantle Mar 24, 2021, 7:02 PM

#

.sorted()

twin mantle Mar 24, 2021, 7:02 PM

#

little path I can't do the sorting part

https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.sort_values.html

little path Mar 24, 2021, 7:05 PM

#

What to do with this:- SUM=DF[‘ENG’]+DF[‘MATH’]+DF[‘HINDI’]
DF[‘PER’]

#

Do I have to use sum command?

twin mantle Mar 24, 2021, 7:08 PM

#

little path What to do with this:- SUM=DF[‘ENG’]+DF[‘MATH’]+DF[‘HINDI’] DF[‘PER’]

What's that?

little path Mar 24, 2021, 7:08 PM

#

Idk it's written in question

#

I am thinking what to do with that

twin mantle Mar 24, 2021, 7:09 PM

#

little path Idk it's written in question

What part of the question?

little path Mar 24, 2021, 7:09 PM

#

There's the question

#

Last part

twin mantle Mar 24, 2021, 7:11 PM

#

little path Do I have to use sum command?

https://stackoverflow.com/questions/25748683/pandas-sum-dataframe-rows-for-given-columns

Stack Overflow

Pandas: sum DataFrame rows for given columns

I have the following DataFrame:

In [1]:

import pandas as pd
df = pd.DataFrame({'a': [1,2,3], 'b': [2,3,4], 'c':['dd','ee','ff'], 'd':[5,9,1]})
df
Out [1]:
a b c d
0 1 2 dd 5
1 2 3 e...

exotic maple Mar 24, 2021, 7:12 PM

#

tidal bough

This plot is so sexy i want to marry it

little path Mar 24, 2021, 7:13 PM

#

exotic maple This plot is so sexy i want to marry it

🥴

#

I'm getting diverted nvm

exotic maple Mar 24, 2021, 7:13 PM

#

its like being in college all over again:

hollow sentinel Mar 24, 2021, 7:17 PM

#

are there sorting algorithms in pandas

#

or do you just do .sort

#

I actually don't remember much about pandas

little path Mar 24, 2021, 7:18 PM

#

.sort_values

twin mantle Mar 24, 2021, 7:20 PM

#

exotic maple its like being in college all over again:

To be quite honest, sorting is one of the basic things that he could look in google

#

Meanwhile, I am stuck with my seaborn question from 2 hours ago

exotic maple Mar 24, 2021, 7:21 PM

#

twin mantle To be quite honest, sorting is one of the basic things that he could look in goo...

oh ik now, im just laughing because it reminded of my interactions with ym college teachers

twin mantle Mar 24, 2021, 7:21 PM

#

exotic maple oh ik now, im just laughing because it reminded of my interactions with ym colle...

I know the pain too

#

Instructions so vague so that you cant go anywhere but give the impression you're still teaching something

exotic maple Mar 24, 2021, 7:21 PM

#

"prof how do i balance this load (Dynamics)
"do you have the equations?"
"yes"
"solve them"

#

and im like: This mfker getting paid so much to say that shit?

little path Mar 24, 2021, 7:22 PM

#

Lol

twin mantle Mar 24, 2021, 7:22 PM

#

exotic maple and im like: This mfker getting paid so much to say that shit?

Academy is like that

#

They are paid for research papers, not to actually give a fuck about the future professionals

exotic maple Mar 24, 2021, 7:22 PM

#

If i told my boss "solve it" id be out of the door before i said "im so-"

twin mantle Mar 24, 2021, 7:23 PM

#

Anyone here knows seaborn?

exotic maple Mar 24, 2021, 7:23 PM

#

what do you need?

twin mantle Mar 24, 2021, 7:23 PM

#

I want to do a histogram with a kde line overlay

#

The histogram bars should be gray and the kde line should be blue

#

This the code

#

sns.histplot(datos, bins=9, binrange=(10,99), color='gray', kde=True,
             line_kws= {'color':'blue','linestyle': 'dashed'},
             fill=False)

#

All the elements are gray

exotic maple Mar 24, 2021, 7:24 PM

#

image?

#

ah

#

I see yourmistake lmao,

#

straight from the doc @twin mantle

#

#

you're using line_kws

#

you need kdw_kws

#

kdw

#

kde keywords contain "color" as well

twin mantle Mar 24, 2021, 7:26 PM

#

Yeah but when I use kde_kws

#

With a dict with key color and value 'blue'

#

I get this

little path Mar 24, 2021, 7:27 PM

#

There's a question where I have to put 10 values in series but its saying that init() takes from 1 to 7 positional arguments

twin mantle Mar 24, 2021, 7:27 PM

#

#☕help-coffee message

exotic maple Mar 24, 2021, 7:28 PM

#

thats....weird

#

try again and show mt he full traceback?

#

bwecause that guy was using distplot, but im checking histplot

twin mantle Mar 24, 2021, 7:28 PM

#

That's the full traceback

little path Mar 24, 2021, 7:29 PM

#

little path There's a question where I have to put 10 values in series but its saying that _...

^^ pithink

twin mantle Mar 24, 2021, 7:29 PM

#

little path ^^ <:pithink:652247559909277706>

Pass the values inside a list

little path Mar 24, 2021, 7:29 PM

#

Ooo

twin mantle Mar 24, 2021, 7:29 PM

#

So a([1,2]) instead of a(1,2)

little path Mar 24, 2021, 7:29 PM

#

Ya got it

twin mantle Mar 24, 2021, 7:30 PM

#

LOL

#

It's a bug

#

A seaborn bug

exotic maple Mar 24, 2021, 7:30 PM

#

yes

#

thats defeinitely a bug

#

nice finding lmao

#

first i find a bug haha

#

that's a problem with the class instantiationg since it calls back to init

shadow frigate Mar 24, 2021, 7:31 PM

#

hello! is there a way of getting the same result as

import numpy as np

labels = np.random.randint(0, 4, size=10)

a = np.zeros(shape=(10,4))

for idx, _ in enumerate(labels):
    a[idx, _] = 1

print(labels)
print(a)

without that horrible loop?

exotic maple Mar 24, 2021, 7:32 PM

#

shadow frigate hello! is there a way of getting the same result as ```py import numpy as np l...

uh, are you trying to create an aray with just ones?

#

because...

twin mantle Mar 24, 2021, 7:32 PM

#

What are you trying to do?

exotic maple Mar 24, 2021, 7:32 PM

#

https://numpy.org/doc/stable/reference/generated/numpy.ones.html

shadow frigate Mar 24, 2021, 7:33 PM

#

on each row in a, I want to set a single value to 1, the position is given by labels

#

a = [[0. 0. 1. 0.]
 [0. 1. 0. 0.]
 [0. 1. 0. 0.]
 [0. 1. 0. 0.]
 [0. 0. 0. 1.]
 [1. 0. 0. 0.]
 [0. 0. 0. 1.]
 [0. 0. 0. 1.]
 [0. 1. 0. 0.]
 [0. 0. 1. 0.]]```

#

so label[0]=2 means a[0,2]=1, label[1]=1 means a[1,1]=1

#

etc

worldly sigil Mar 24, 2021, 7:34 PM

#

hey everyone, if anyone's looking to throw down in "Sliced: a Data Science Competition", reach out to nickwan on Twitter and tell him that Nschamps sent you. The competition heats back up in June with 16 competitors as the goal. I had a lot of fun this season look forward to seeing some of you there.

https://www.twitch.tv/videos/960771956

Twitch

SLICED CHAMPIONSHIP! COMPETITIVE DATA SCI !sliced - nickwan_datasci...

nickwan_datasci went live on Twitch. Catch up on their Science & Technology VOD now.

▶ Play video

exotic maple Mar 24, 2021, 7:36 PM

#

shadow frigate ```labels= [2 1 1 1 3 0 3 3 1 2] a = [[0. 0. 1. 0.] [0. 1. 0. 0.] [0. 1. 0. 0....

that's...strange

#

does that have some programatic logic?

#

because i cant see any other way, as-is

twin mantle Mar 24, 2021, 7:37 PM

#

shadow frigate on each row in `a`, I want to set a single value to 1, the position is given by ...

I still don't understand this

#

Do the positions change?

twin mantle Mar 24, 2021, 7:37 PM

#

exotic maple does that have some programatic logic?

Same here, unless is an exercise

exotic maple Mar 24, 2021, 7:37 PM

#

Like, if he wanted all first indeces he could something like array[:,0] = 1

#

or so

shadow frigate Mar 24, 2021, 7:38 PM

#

I need to prepare a label matrix to pass to a loss function

exotic maple Mar 24, 2021, 7:38 PM

#

but i dont see any logic there

#

I dont see a way outside the loop

#

because you have your indeces as a list too

#

or maybe...

#

mmm

twin mantle Mar 24, 2021, 7:39 PM

#

twin mantle Do the positions change?

???

#

Frey

exotic maple Mar 24, 2021, 7:39 PM

#

shadow frigate I need to prepare a label matrix to pass to a loss function

are those indeces related to the row?

shadow frigate Mar 24, 2021, 7:40 PM

#

twin mantle ???

not sure what you mean here, labels are generated randomly at each iteration so yeah?

exotic maple Mar 24, 2021, 7:40 PM

#

sorry man i cant help you tbh im not seeing the order in what you're trying to do

shadow frigate Mar 24, 2021, 7:40 PM

#

hm so

#

I'm trying to compute the cross entropy on the output of a nn

#

which has size 5000 total samples x 3000 possible outcomes

#

in my code, labels is the position of the correct outcome out of the 3000

#

on each of the 5000 rows

twin mantle Mar 24, 2021, 7:42 PM

#

Mate, with all respect

#

You're using big words for this problem. The question is simple:

#

What is your criterion to change values?

#

How do you decide the indexes of the values, row-wise, column-wise?

grave frost Mar 24, 2021, 7:43 PM

#

shadow frigate ```labels= [2 1 1 1 3 0 3 3 1 2] a = [[0. 0. 1. 0.] [0. 1. 0. 0.] [0. 1. 0. 0....

size is constant right?

little path Mar 24, 2021, 7:44 PM

#

Write a program to generate a series of marks of 10 students. Give grace marks up to 5 of those who are having <33 marks and print the new list of the marks.

shadow frigate Mar 24, 2021, 7:44 PM

#

the size of the matrices is constant, the values change with each iteration

little path Mar 24, 2021, 7:44 PM

#

How to give that grace marks

twin mantle Mar 24, 2021, 7:45 PM

#

little path How to give that grace marks

Mate, try to do some effort, pls

little path Mar 24, 2021, 7:45 PM

#

twin mantle Mate, try to do some effort, pls

🥲

#

Ok as you say

grave frost Mar 24, 2021, 7:45 PM

#

@shadow frigate well, what you are doing is called one-hot encoding - your label range is determined by the length of the row so that may confuse someone, but it is (in essence) one-hot encoding on a fixed array

shadow frigate Mar 24, 2021, 7:45 PM

#

yeah now that you put it that way, it is OuroborosSlain

grave frost Mar 24, 2021, 7:45 PM

#

uh-huh

shadow frigate Mar 24, 2021, 7:46 PM

#

holy moly I'm tired OuroborosSlain

grave frost Mar 24, 2021, 7:46 PM

#

so you can use the pre-built modules in sklearn, or generate an array each time and append in on the appropriate axis

exotic maple Mar 24, 2021, 7:46 PM

#

grave frost <@!235843435184128000> well, what you are doing is called one-hot encoding - you...

oh shit you're right

#

he's manually one hot encoding????

grave frost Mar 24, 2021, 7:46 PM

#

yeah, something like that

shadow frigate Mar 24, 2021, 7:46 PM

#

might be

exotic maple Mar 24, 2021, 7:46 PM

#

holy spirit of God

#

this madman

grave frost Mar 24, 2021, 7:46 PM

#

but its easy to confuse (AFA I have understood)

shadow frigate Mar 24, 2021, 7:46 PM

#

'twas a long day ok OuroborosSlain

grave frost Mar 24, 2021, 7:47 PM

#

better take a break 😁 I find it helps a lot when doing long stuff

exotic maple Mar 24, 2021, 7:47 PM

#

shadow frigate 'twas a long day ok <:OuroborosSlain:779601509196496926>

BEGONE

#

exotic maple Mar 24, 2021, 7:48 PM

#

grave frost better take a break 😁 I find it helps a lot when doing long stuff

its pretty well studied that most realizations come in a break AFTER working

#

so i'm trying to make a habit of work, then rest 10-15 mins

#

repeat

grave frost Mar 24, 2021, 7:48 PM

#

yeah, shower for me 😄

hollow sentinel Mar 24, 2021, 7:48 PM

#

yeah I'm gonna take a fat nap

exotic maple Mar 24, 2021, 7:48 PM

#

the brain needs some time for the information to settle

grave frost Mar 24, 2021, 7:48 PM

#

hollow sentinel yeah I'm gonna take a fat nap

lol whats that?

hollow sentinel Mar 24, 2021, 7:48 PM

#

grave frost lol whats that?

a long nap

exotic maple Mar 24, 2021, 7:48 PM

#

grave frost lol whats that?

when he naps on his fat a##?

#

xd

shadow frigate Mar 24, 2021, 7:49 PM

#

yep time to stop, thanks for pointing that out, I'm definitely done for the day

#

cheers

hollow sentinel Mar 24, 2021, 7:49 PM

#

it means a long nap people

grave frost Mar 24, 2021, 7:49 PM

#

ahhh...I am too young for naps

exotic maple Mar 24, 2021, 7:49 PM

#

grave frost ahhh...I am too young for naps

ive had naps since i was 10 yrs old lmao

#

-tropics life-

#

its heaven dude

#

a noon nap and you're refreshed all afternoon

grave frost Mar 24, 2021, 7:50 PM

#

sad. I just can't sleep any time 😦

exotic maple Mar 24, 2021, 7:50 PM

#

I call it nap, but i dont relaly sleep

#

just close my eyes

#

and calm my mind

#

that works too

little path Mar 24, 2021, 7:53 PM

#

why its showing nan ive passed values na 😐

dreamy jewel Mar 24, 2021, 7:53 PM

#

I am making a dungeon crawler and I have done all the basic stuff (the player,tile,collision etc.) and now I wanna make a test level for which I have to make a BOSS I have created the sprite of the boss but IDK how to implement the AI for boss as I never did these kinda stuff ( I am using pygame) so please help me.

exotic maple Mar 24, 2021, 7:54 PM

#

little path why its showing nan ive passed values na 😐

you want a dataframe with a single column?

#

its better to create the series with index right away...

#

and use this method

#

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.to_frame.html#pandas.Series.to_frame

little path Mar 24, 2021, 7:55 PM

#

Hmm I see

exotic maple Mar 24, 2021, 7:56 PM

#

try reading documentation my man. saves a lot of time :p

#

but for the sake of it, paste your lists here i'll to reproduce

#

#

it worked fine for me?

#

oooh

spiral peak Mar 24, 2021, 8:00 PM

#

exotic maple when he naps on his fat a##?

This isn't really appropriate for this server and can be considered a bit rude. Please be mindful of your wording in the future.

exotic maple Mar 24, 2021, 8:00 PM

#

if i change it to strings it doesnt work, weird

#

exotic maple Mar 24, 2021, 8:00 PM

#

spiral peak This isn't really appropriate for this server and can be considered a bit rude. ...

whoops, didnt know. thanks for the heads up

little path Mar 24, 2021, 8:01 PM

#

Ya bro done @exotic maple

exotic maple Mar 24, 2021, 8:02 PM

#

little path Ya bro done <@263491859173736449>

now i'm curious about why it doesnt work though

#

it seems you cant convert like that

#

it inherits the indeces

little path Mar 24, 2021, 8:02 PM

#

Lol I have to write all my codes in my file , will see you next time

exotic maple Mar 24, 2021, 8:06 PM

#

dang i cant answer why we get those nans vals

twin mantle Mar 24, 2021, 8:09 PM

#

exotic maple now i'm curious about why it doesnt work though

Why do you use the list(range())?

exotic maple Mar 24, 2021, 8:13 PM

#

twin mantle Why do you use the list(range())?

placeholder index

#

you cant pass range alone

#

its a generator object, not an iterable

#

I THINK -hesitant-

#

#

that the problem there in the 2nd case is that tries to look for the values of index in the parent series

#

yes, that's exactly what it does. Its not setting the argument of index as the index, its searching for it

#

#

I have my answer now 😄

#

how can I tell the people in pandas to modify that documentation section?

misty flint Mar 24, 2021, 8:37 PM

#

github?

#

CCL_Kek

grave frost Mar 24, 2021, 8:57 PM

#

This is the docs for the NLTK HMM - I want to do unsupervised tagging on my dataset

 class nltk.tag.hmm.HiddenMarkovModelTrainer(states=None, symbols=None)[source]

    Bases: object

    Algorithms for learning HMM parameters from training data. These include both supervised learning (MLE) and unsupervised learning (Baum-Welch).

    Creates an HMM trainer to induce an HMM with the given states and output symbol alphabet. A supervised and unsupervised training method may be used. If either of the states or symbols are not given, these may be derived from supervised training.

    Parameters

            states (sequence of any) – the set of state labels

            symbols (sequence of any) – the set of observation symbols

Does anyone know about states and symbols? I can't find much from googling

abstract zealot Mar 25, 2021, 12:26 AM

#

Any improvements as to how I can speed up the following:

Df.groupby([a,b,c]).agg({col1: [funca, funcb, funcc], col2: [funca, funcb, funcc]})

??

velvet thorn Mar 25, 2021, 12:34 AM

#

abstract zealot Any improvements as to how I can speed up the following: Df.groupby([a,b,c]).a...

doubt you can

#

what are the functions?

abstract zealot Mar 25, 2021, 12:36 AM

#

velvet thorn what are the functions?

Statistical functions like kstest(), some of the use standardscaler() etc

velvet thorn Mar 25, 2021, 12:36 AM

#

abstract zealot Statistical functions like kstest(), some of the use standardscaler() etc

that’s about as efficient as you can get, assuming everything is vectorised

abstract zealot Mar 25, 2021, 12:36 AM

#

What you mean vectorised ?

velvet thorn Mar 25, 2021, 12:36 AM

#

and memoisation won’t help

velvet thorn Mar 25, 2021, 12:37 AM

#

abstract zealot What you mean vectorised ?

parallel application of certain basic operations

#

it’s a numpy thing

abstract zealot Mar 25, 2021, 12:37 AM

#

Nice I’ll look into this

#

Running this on my data frame is taking >5 hours

velvet thorn Mar 25, 2021, 12:37 AM

#

as in, if none of those are your own functions

#

they are likely already vectorised.

velvet thorn Mar 25, 2021, 12:38 AM

#

abstract zealot Running this on my data frame is taking >5 hours

how big is it?

abstract zealot Mar 25, 2021, 12:38 AM

#

25 mill

velvet thorn Mar 25, 2021, 12:38 AM

#

5 hours seems a bit long

#

what functions specifically?

abstract zealot Mar 25, 2021, 12:38 AM

#

Yes that’s what I thought

velvet thorn Mar 25, 2021, 12:38 AM

#

show code

abstract zealot Mar 25, 2021, 12:38 AM

#

Out of interest does reshaping take a lot of time ?

velvet thorn Mar 25, 2021, 12:39 AM

#

generally, no

#

well, more accurately, it depends.

#

on whether a copy is made

abstract zealot Mar 25, 2021, 12:43 AM

#

velvet thorn what functions specifically?

sure can i dm you them?

abstract zealot Mar 25, 2021, 12:51 AM

#

velvet thorn what functions specifically?

an example would be

def func1(x):
    x = pd.Series([e*100 for e in x.values])
    _scaled =     
StandardScaler(with_std=False).fit_transform(x.values.reshape(-1,1))
    return kstest(rvs=_scaled, cdf='t', N=len(_scaled), args=(1, ))[1]

velvet thorn Mar 25, 2021, 12:53 AM

#

abstract zealot sure can i dm you them?

no thank you

velvet thorn Mar 25, 2021, 12:54 AM

#

abstract zealot an example would be ```py def func1(x): x = pd.Series([e*100 for e in x.val...

...yeah

#

that's probably not a good idea

abstract zealot Mar 25, 2021, 12:55 AM

#

oh no hahaha

velvet thorn Mar 25, 2021, 12:55 AM

#

you're new to pandas and numpy, right

#

so

#

you should just do this

#

(for example)

#

!e

import numpy as np

a = np.array([1, 2, 3])
print(a)

b = a * 100
print(b)

arctic wedgeBOT Mar 25, 2021, 12:55 AM

#

@velvet thorn :white_check_mark: Your eval job has completed with return code 0.

001 | [1 2 3]
002 | [100 200 300]

velvet thorn Mar 25, 2021, 12:55 AM

#

you have a list comprehension there

#

which of course will be slow

#

and then you go through the overhead of converting it back into a Series

#

I would suggest

#

reading up on the basics of numpy

#

it would help you write better code

abstract zealot Mar 25, 2021, 12:56 AM

#

this makes complete sense thank you very much for the example

velvet thorn Mar 25, 2021, 12:56 AM

#

also I question the wisdom of using StandardScaler there?

#

!e

import numpy as np

a = np.random.rand(5)
print(a)

zero_mean = a - a.mean()
print(zero_mean)
print(zero_mean.mean().round(5))

arctic wedgeBOT Mar 25, 2021, 12:57 AM

#

@velvet thorn :white_check_mark: Your eval job has completed with return code 0.

001 | [0.2841178  0.3210537  0.3932124  0.62640306 0.96628655]
002 | [-0.2340969  -0.197161   -0.1250023   0.10818836  0.44807184]
003 | 0.0

velvet thorn Mar 25, 2021, 12:57 AM

#

@abstract zealot this is basically what you're doing, right

#

centering around 0

abstract zealot Mar 25, 2021, 12:57 AM

#

yes

velvet thorn Mar 25, 2021, 12:58 AM

#

again, I would suggest a bit of research on the purpose of sklearn's transformers

#

they are helpful for building a pipeline

#

but in this case what you are doing is just a single operation of centering

hollow sentinel Mar 25, 2021, 12:58 AM

#

nooo gm what happened to your username

velvet thorn Mar 25, 2021, 12:58 AM

#

it would make more sense to use a plain numpy operation

velvet thorn Mar 25, 2021, 12:58 AM

#

hollow sentinel nooo gm what happened to your username

blame Google for making GMail

#

I kept getting pinged

hollow sentinel Mar 25, 2021, 12:58 AM

#

velvet thorn blame Google for making GMail

oh damn you're right I didn't consider that

tardy crest Mar 25, 2021, 12:58 AM

#

Hey guys what all comes under data science engineering? Do you guys think it's gonna be worth it ? I'm kinda confused whether I should be taking cs/data science/AI...do you guys think the placements different in them?

velvet thorn Mar 25, 2021, 12:59 AM

#

tardy crest Hey guys what all comes under data science engineering? Do you guys think it's g...

what is data science engineering

abstract zealot Mar 25, 2021, 12:59 AM

#

velvet thorn it would make more sense to use a plain `numpy` operation

Thank you so much for all this it has definitely given me some valuable pointers

velvet thorn Mar 25, 2021, 12:59 AM

#

there is data science, and there is data engineering, but I have not heard of "data science engineering"

velvet thorn Mar 25, 2021, 12:59 AM

#

abstract zealot Thank you so much for all this it has definitely given me some valuable pointers

yw! atb

hollow sentinel Mar 25, 2021, 1:00 AM

#

does he mean data engineering/ data science

#

the world may never know

void shale Mar 25, 2021, 1:13 AM

#

does anyone know how to get matplotlib on python? I am taking a class on udemy and its a little outdated so there is no proper instruction. I couldn't find anything online. Does anyone know?

hollow sentinel Mar 25, 2021, 1:14 AM

#

https://matplotlib.org/stable/users/installing.html

#

try that

#

if you're on mac OS it should be pip install matplotlib

void shale Mar 25, 2021, 1:15 AM

#

THANK YOU SO MUCH!!!

hollow sentinel Mar 25, 2021, 1:15 AM

#

yeah no problem

serene scaffold Mar 25, 2021, 1:31 AM

#

velvet thorn there is data science, and there is data engineering, but I have not heard of "d...

!otn a data science engineering

arctic wedgeBOT Mar 25, 2021, 1:31 AM

#

:ok_hand: Added data-science-engineering to the names list.

serene scaffold Mar 25, 2021, 1:31 AM

#

now everyone will hear about it

modest void Mar 25, 2021, 1:51 AM

#

anyone know how to use between_time or something equivalent to select rows in a pandas dataframe that are between a given start time and end time, but for a dataframe that has multiple days in it, so like all the rows that are between say 7am and 9am for a dataframe that has a datetime index with rows going across multiple days

velvet thorn Mar 25, 2021, 1:53 AM

#

modest void anyone know how to use between_time or something equivalent to select rows in a ...

filter using datetime accessor

#

Google “datetime accessor”, I can’t type code right now

modest void Mar 25, 2021, 2:08 AM

#

velvet thorn filter using datetime accessor

sorry, I should be more specific, I want to be able to filter between arbitrary hour and minute combinations, just like with .between_time, but across multiple days

misty flint Mar 25, 2021, 2:25 AM

#

but I have not heard of "data science engineering"
me neither

#

ive heard of "full stack data science"

#

which is like front + backend skills + DS + (some Ops skills maybe)

#

🦄

velvet thorn Mar 25, 2021, 2:29 AM

#

modest void sorry, I should be more specific, I want to be able to filter between arbitrary ...

yeah, why wouldn’t that work

#

or could you provide an example please

serene scaffold Mar 25, 2021, 2:44 AM

#

misty flint which is like front + backend skills + DS + (some Ops skills maybe)

I can conceive of "full stack data scientist" referring to "understanding data science having general programming skills", but idk why having web dev skills would matter

misty flint Mar 25, 2021, 2:55 AM

#

i dont either. if i find the listing again, ill show you

misty flint Mar 25, 2021, 2:57 AM

#

serene scaffold I can conceive of "full stack data scientist" referring to "understanding data s...

#

they are listed as two separate skill set categories

#

as you can see

#

DoggoKek

serene scaffold Mar 25, 2021, 2:58 AM

#

Is nosql like guis with flowchart blocks for code?

misty flint Mar 25, 2021, 3:00 AM

#

ID_BoomKek

#

i think MongoDB and Cassandra are nosql

exotic maple Mar 25, 2021, 3:01 AM

#

i still dont know what exactly is so attractive about mongodb

exotic maple Mar 25, 2021, 3:02 AM

#

misty flint

bro that profile would need to pay upwards of 150k in the US lmao

#

i know some remote workers in my country working as FS devs for us companies and they make 100k

#

REMOTE

misty flint Mar 25, 2021, 3:03 AM

#

exotic maple i know some remote workers in my country working as FS devs for us companies and...

~~which country~~

#

ID_blurryeyes

exotic maple Mar 25, 2021, 3:03 AM

#

latin america :p

#

no more details ay

misty flint Mar 25, 2021, 3:05 AM

#

oh hey i know someone in the same situation

#

DoggoKek

#

they have an advantage bc same time zone

#

unlike EU or Aus

exotic maple Mar 25, 2021, 3:06 AM

#

I'd like to get a remote junior data scientist / analyst from my country, but that's too much dreaming i guess lol

misty flint Mar 25, 2021, 3:06 AM

#

one of the companies i think im going to work for has a croatian branch

#

and im like

#

how does that work

#

with time zones and such

#

memecringeharold

exotic maple Mar 25, 2021, 3:06 AM

#

lmao i work with my colleagues in China, India, Russia, Bulgaria, etc

misty flint Mar 25, 2021, 3:06 AM

#

ig its fine if its morning here + later afternoon there

exotic maple Mar 25, 2021, 3:07 AM

#

trust me, you get used to it

misty flint Mar 25, 2021, 3:07 AM

#

i see

#

ValkNaruhodo

exotic maple Mar 25, 2021, 3:07 AM

#

a good scheduler will amke sure there's at least a bit of overlap

#

depending on business needs

misty flint Mar 25, 2021, 3:07 AM

#

im sure they will more than likely put me in project teams that the members are more local

#

so we can sync better

#

anyway

exotic maple Mar 25, 2021, 3:08 AM

#

depending on the nature of the job, haivng sometime dif can be good

#

I can at least restrain myself from shouthing at my india/china colleagues because they arent live :v

#

so i'lljust send an email and vent more "professionally" ay

misty flint Mar 25, 2021, 3:21 AM

#

ID_BoomKek

#

💀

#

but yeah remote jobs seem more popular moving forward

#

post-covid

#

Praise

exotic maple Mar 25, 2021, 3:32 AM

#

unpopular opinion

#

mixed is much btter than just remote

marble dune Mar 25, 2021, 3:56 AM

#

hi, i got a 'long' pandas dataframe, that has 3 columns: property, value and playlist, basically is a dataframe converted from wide to long format using pd.melt(), the problem comes when i try to plot a bar catplot with seaborn, and i pass a column name as the x values and when i show the plot the x values don't show up

#

this is the dataframe

#

and here are my code and how the plot currently looks

#

screenshot-127.0.0.1_8000-2021.03.25-00_50_23.png

#

        #bar catplot
        bar_catplot = sns.catplot(
            kind="bar", x="property", y="value", hue="playlist", legend=True, data=long_frame2, dodge=True
        )
        bar_catplot_figure = bar_catplot.fig
        catplot_render = mpld3.fig_to_html(bar_catplot_figure)

misty flint Mar 25, 2021, 4:34 AM

#

exotic maple mixed is much btter than just remote

~~i agree. im more of an in-person person anyway~~

#

DoggoKek

#

which is why im glad this company usually requires in-person

#

work

#

in the office

rough otter Mar 25, 2021, 5:58 AM

#

can anyone help explain what p-value is

lapis sequoia Mar 25, 2021, 6:59 AM

#

`#HOMEWORK
#Q.1.Write a function to find the factors of a number.

number = int(input("Enter a number:"))
factors=[]
for i in range(1,number+1):
if number%i == 0:
factors.append(i) #Append: It adds a single item to the list. It modifies the list by adding an item to -->
#------> the end of the list.
print("Factors of the {} = {}".format(number,factors))`

#

#Q.2.Write a function to identify whether a number is palindrome or not.

num = int(input("Enter a number:"))
temp = num
rev = 0
while (num>0):
dig = num%10
rev = rev*10+dig
num = num//10
if (temp == rev):
print("The number is a palindrome.")
else:
print("The number is not a palindrome.")

#

#Q.3.Write a function to identify whether a string is palindrome or not. string = input("Enter a string:") if (string == string [::-1]): print("The string is a palindrome.") else: print("The string is not a palindrome.")

gray arch Mar 25, 2021, 7:26 AM

#

Does anyone have issue with running Tensorflow in Python 3.9?

grave frost Mar 25, 2021, 8:48 AM

#

misty flint ~~i agree. im more of an in-person person anyway~~

if you do remote work, then isn't there a chance of you working more to get a project done as opposed to set hours in the office? I read in an article that a lot of tech people are being exploited this way

rancid gazelle Mar 25, 2021, 9:07 AM

#

Hey, did you know any active discord about tensorflow?

primal tulip Mar 25, 2021, 9:19 AM

#

grave frost if you do remote work, then isn't there a chance of you working more to get a pr...

You should always negotiate according to your needs and preferences. I don't see why that is bad for speedy programmers, in which they'll make more money than others locked at the same wage. I'm a slow worker so that model is not feasible for me for big projects, but I've done it as a way to get extra money other than my full-time job. Also, it's a great way to improve your efficiency.

glacial sparrow Mar 25, 2021, 9:59 AM

#

is dash plotly used with mongodb?

primal tulip Mar 25, 2021, 10:13 AM

#

glacial sparrow is dash plotly used with mongodb?

You could, yes. What do you what to do in dash? If I may ask.

glacial sparrow Mar 25, 2021, 10:14 AM

#

making a dashboard

primal tulip Mar 25, 2021, 10:15 AM

#

Disregarding the type of data manager you use, you could always graph data with dash. Even more if it's tabular data.

glacial sparrow Mar 25, 2021, 10:20 AM

#

in short, I can connect with pymongo, do manipulations (most important is json_normalize) and create some graphs I want
but I wanted to make an 'interactive' dashboard that updates via mongodb
not sure if it makes sense, but I guess there are 3 options?
mongodb charts - but I think I cannot manipulate the data as with pymongo
powerbi - which seems to be able to connect with mongodb and allows the needed manipulations but with M which I'm not really familiar with
dash plotly - which I guess I can re-use my previous code, but I can't find many results online how to keep getting data

#

do the above make sense?

primal tulip Mar 25, 2021, 10:31 AM

#

Yeah, makes sense. I've never done anything with mongo other than simple queries and not particulary good with dash either, but you could either do it in PBI or Dash. If your data is a behemot sized monster, I'd suggest using Python with Dash, since you can setup a buffer. I've seen a lot of resources in Dash so it's doable, but you'll have to do some trial and error. If the data is medium (Less than 8gb for example) you could do it in PBI which is higher level and overall easier, also with tons of resources.

glacial sparrow Mar 25, 2021, 10:37 AM

#

I guess then my real question is how the dashboard can be 'live'. But actually even 'updating' daily would be fine for me.

primal tulip Mar 25, 2021, 10:38 AM

#

Oh, then in that case I would transform everything to a Pandas Dataframe and update it

#

PBI also has (something like a checkbox option) alternative were you can toggle updating the datasets and their relations.

#

I forgot the name, but it should be under 'ñManage Relationships'

glacial sparrow Mar 25, 2021, 10:42 AM

#

ok i will check

primal tulip Mar 25, 2021, 10:42 AM

#

https://dash.plotly.com/live-updates

Live Updates | Dash for Python Documentation | Plotly

Update your apps on page load or
on a predefined interval (e.g. every 5 seconds)

#

You have to update both the data and the dashboard if you'd like to go with the PBI route.

https://docs.microsoft.com/en-us/power-bi/connect-data/refresh-scheduled-refresh

https://docs.microsoft.com/en-us/power-bi/connect-data/refresh-data#types-of-refresh

Configure scheduled refresh - Power BI

This covers the steps to select a gateway and configure scheduled refresh.

Data refresh in Power BI - Power BI

This article describes the data refresh features of Power BI and their dependencies at a conceptual level.

tidal bronze Mar 25, 2021, 10:51 AM

#

how can I visulize clusters if they are based on a single feature?

lapis sequoia Mar 25, 2021, 10:55 AM

#

So i am trying to build a speech recognition model. I am not that skilled so i am using sk learn. Lets say i have some recordings of my voices in .wav format. What do i need to do to make them trainable data?

primal tulip Mar 25, 2021, 11:13 AM

#

tidal bronze how can I visulize clusters if they are based on a single feature?

Is the second dimension a scale? You could do something like a % distribution on the other axis.

primal tulip Mar 25, 2021, 11:24 AM

#

lapis sequoia So i am trying to build a speech recognition model. I am not that skilled so i a...

What are you using to read the wav files and what is your data like?

sharp prairie Mar 25, 2021, 11:33 AM

#

Hi guys. I have a CSV file with lot's of empty strings. How do I drop or delete them with pandas?

So far, this is my code.

df = df.dropna(how='any', axis=0, thresh=2, inplace=True)

Running it gives me none. When I remove the inplace I also don't get the dropped rows.

lapis sequoia Mar 25, 2021, 11:41 AM

#

primal tulip What are you using to read the wav files and what is your data like?

I am not sure of the way i am gonna read my wav files. The datas are like folders of different words such as a folder of hello and a folder of bye

#

What way would u suggest me to read the files?

primal tulip Mar 25, 2021, 11:42 AM

#

https://stackoverflow.com/questions/2060628/reading-wav-files-in-python

Stack Overflow

Reading *.wav files in Python

I need to analyze sound written in a .wav file. For that I need to transform this file into set of numbers (arrays, for example). I think I need to use the wave package. However, I do not know how

lapis sequoia Mar 25, 2021, 11:42 AM

#

Could i use that for sk learn as well?

primal tulip Mar 25, 2021, 11:43 AM

#

You need your data in text format first so it could be fed to the sklearn library.

lavish tundra Mar 25, 2021, 11:48 AM

#

can someone help me to think about one thing?

#

its about data visualization

primal tulip Mar 25, 2021, 11:48 AM

#

Arroje su pregunta Señor Diego

lavish tundra Mar 25, 2021, 11:49 AM

#

i dont speak spanish . _.

primal tulip Mar 25, 2021, 11:54 AM

#

I said "Ask away, mister". Your name is pretty common in Latin América lol.

Seems like a correct assumption about the xticks, but not sure why it's happening. Give me a minute pls.

tidal bough Mar 25, 2021, 12:11 PM

#

linspace(start,stop,number) always gives out start as the first point and stop as the last point.

#

(there's a parameter to change this behaviour)

frigid forum Mar 25, 2021, 12:17 PM

#

str onject has no attribute decode

#

i keep getting this error

#

anyone knows wha to do

tidal bough Mar 25, 2021, 12:18 PM

#

decode is a method of bytes that converts them into strs, the opposite is str.encode.

frigid forum Mar 25, 2021, 12:24 PM

#

tidal bough `decode` is a method of `bytes` that converts them into `str`s, the opposite is ...

ok.. so bytes has the decode attribute? not str?

tidal bough Mar 25, 2021, 12:24 PM

#

the decode method, yes

balmy junco Mar 25, 2021, 12:42 PM

#

I want to use python to calculate the antiderivative of a function and store it as a function that i can use

#

how might i do that?

tidal bough Mar 25, 2021, 12:47 PM

#

balmy junco I want to use python to calculate the antiderivative of a function and store it...

check out scipy.integrate

#

!docs scipy.integrate

arctic wedgeBOT Mar 25, 2021, 12:47 PM

#

`scipy.integrate`

This appears to be a generic page not tied to a specific symbol.

tidal bough Mar 25, 2021, 12:47 PM

#

if you mean numerical integration. If you mean analytical, sympy.

balmy junco Mar 25, 2021, 1:23 PM

#

tidal bough check out `scipy.integrate`

i see functions for integation, but would you know how i could basically pass in some function F(X) to integrate

#

and then it could return me the integral as a function

#

so i could just pass variables into it

tidal bough Mar 25, 2021, 1:26 PM

#

you can just make the function call scipy.integrate.quad each time, from (say) 0 to the argument

#

that'd be time-inefficient, but will require no extra memory

#

alternatively, precalculate the integral's values for the entire interval you'll be working on and use values from it

balmy junco Mar 25, 2021, 1:27 PM

#

sure

#

but there is no explicit way to sav eit as a function right

#

?

#

if so, i can just create my own function

#

and assign attributes i guess

tidal bough Mar 25, 2021, 1:28 PM

#

yeah, something like that

balmy junco Mar 25, 2021, 1:34 PM

#

thanks

#

so then

#

if i want to pass in a function f that takes in a value of kx instead of x, do i just multiply the integral range?
quad(f, 0, math.pi)

#

like quad(f, 0, k*math.pi)

#

i feel like there needs to be another way

tidal bough Mar 25, 2021, 1:51 PM

#

not sure what you mean by this

tidal bronze Mar 25, 2021, 2:07 PM

#

primal tulip Is the second dimension a scale? You could do something like a % distribution on...

no it is not it's a continuous variable

carmine iron Mar 25, 2021, 2:27 PM

#

Does anyone know how to find the nth largest drawdown of a portfolio

lapis sequoia Mar 25, 2021, 2:32 PM

#

anyone know a good api for facial landmarks?

#

or anything

#

to get coordinates of them

misty flint Mar 25, 2021, 2:35 PM

#

the dlib library is a popular one

#

we used that in our face recognition project

#

gives you 68 x,y coordinate points

misty flint Mar 25, 2021, 2:36 PM

#

lapis sequoia to get coordinates of them

try that one

lapis sequoia Mar 25, 2021, 2:36 PM

#

well I have dlib

#

with

#

face recognition library

misty flint Mar 25, 2021, 2:36 PM

#

there you go

lapis sequoia Mar 25, 2021, 2:36 PM

#

but in some cases

#

like in my pfp

#

it doesnt detect

#

an eye

misty flint Mar 25, 2021, 2:36 PM

#

rip

#

yeah its not trained to do it on those types of images

#

we actually proved that in our project

#

lol

#

best one i know, so gl bud

lapis sequoia Mar 25, 2021, 2:51 PM

#

whats ur project

carmine iron Mar 25, 2021, 3:00 PM

#

How can i return the nth largest drawdown for example
r = [.01,-.01,.004, -.02,.01] n = 2

grave frost Mar 25, 2021, 3:04 PM

#

lapis sequoia but in some cases

for that, you would have to train your own model on your own custom data

alpine fern Mar 25, 2021, 3:26 PM

#

I'm not sure whether this should be in this channel, but if I'm looking at historical data in the form of candlesticks, how would I be able to find local mins/maxes ,using say a dataframe format, for my data?

lapis sequoia Mar 25, 2021, 3:27 PM

#

grave frost for that, you would have to train your own model on your own custom data

basically it checks for face landmarks like eyes for example. then I get those coordinates and i paste something else on top of it

grave frost Mar 25, 2021, 3:28 PM

#

lapis sequoia basically it checks for face landmarks like eyes for example. then I get those c...

what's the end goal?

lapis sequoia Mar 25, 2021, 3:28 PM

#

fun

#

#

went from a normla picture

#

pasted flares on it

#

or paste whatever you want to on the eyes

grave frost Mar 25, 2021, 3:29 PM

#

so you want just eyes or all facial features

lapis sequoia Mar 25, 2021, 3:29 PM

#

yeah

#

I mean in the future maybe could do smth with the rest but eyes are like main thing

grave frost Mar 25, 2021, 3:30 PM

#

just get their coordinates then 🤷 train a model for that - data wouldn't be too hard

#

or just google "get location of eyes from face in python" and youd probably get some indian tutorial using OpenCv

lapis sequoia Mar 25, 2021, 3:31 PM

#

probably

#

tried looking into that already

#

or well

#

thats what im doing rn

#

just have to figure out how to get the end picture of opencv into a pil image or bytesIO

grave frost Mar 25, 2021, 3:32 PM

#

researching things is a pretty important skill

lapis sequoia Mar 25, 2021, 3:33 PM

#

I figured

grave frost Mar 25, 2021, 3:33 PM

#

and with google scholar, its not as hard as it was before

dawn cargo Mar 25, 2021, 4:39 PM

#

Hey guys, I've been trying to do a Gaussian blur of a RGB image.
I know how to blur a grayscale image (with 2d convolution kernel), but I'm having a problem with implementing the process for RGB image and 3d kernel. Should all layers of the kernel be the same or not?

tidal bough Mar 25, 2021, 4:40 PM

#

Yup, all the same, unless you want the kernel to also mess with colors.

#

so it'd just be 3 gaussian kernels stacked on top of each other

dawn cargo Mar 25, 2021, 4:41 PM

#

Thank you very much

#

Going to look for a bug in another place then

tidal bronze Mar 25, 2021, 5:12 PM

#

how can I visulize clusters if they are based on a single feature?

lapis sequoia Mar 25, 2021, 5:33 PM

#

Hey anyone knows how to apply groupby().agg() on index instead of columns?

ripe forge Mar 25, 2021, 5:57 PM

#

You can always reset index to turn index into column

jade tinsel Mar 25, 2021, 6:20 PM

#

Hi all! I've recently gotten into data science and I'm currently trying to do some research into Linear Regression. I'm able to train and make one prediction (the basics), but I'm not sure what keyword(s) I should be looking for when I want to use the trained model in order to predict with a given variable.

E.x. I have a dataset with country, text (nl: hallo wereld for example), I'd like to pass a variable to the model to predict what the given text is. What keywords would I have to look for and is linear regression even the way to go for such a thing? Sorry for the confusing question, I tried my best but still trying to get the hang of this thing 😄

tranquil loom Mar 25, 2021, 6:31 PM

#

Hi, I want work on recommedation system ,but i can't find a source ,course exc. Can you recommend a source 😄

gray arch Mar 25, 2021, 6:50 PM

#

tranquil loom Hi, I want work on recommedation system ,but i can't find a source ,course exc....

I am self-learning via a book called Intelligent Projects using Python (Packt Publishing), they have a project called "Intelligent Recommender System". For the source code you can find it here: https://github.com/PacktPublishing/Intelligent-Projects-Using-Python/tree/master/Chapter06

But without the book it might be hard to understand how the source code is implemented so I still suggest you look more online

tranquil loom Mar 25, 2021, 6:51 PM

#

thank you 👍 👍 👍

polar dock Mar 25, 2021, 7:16 PM

#

Hi data scientists, which disk based storage formats do y'all use most often for dataframes?
Use cases are for long term storage, as well fast read/ write capabilities.

I'm a dev on an analytics team. Currently, we are using pickle almost exclusively.
Been exploring parquet, and it's different engines but was hoping someone had some experience 🙂

serene scaffold Mar 25, 2021, 7:42 PM

#

polar dock Hi data scientists, which disk based storage formats do y'all use most often for...

pandas supports sql, if that matters

#

https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.to_sql.html

misty flint Mar 25, 2021, 7:44 PM

#

spark dataframes is good if youre looking for more production stuff

polar dock Mar 25, 2021, 7:44 PM

#

yeah, I know. We're just a small analytics team in a big company. Our oracle servers are all hosted internally.

Though, I guess the question I should bring up with the analysts is why do they want disk based storage

#

I wasn't really told a specific "prove parquet > pickle" or something, mostly just to explore the options

misty flint Mar 25, 2021, 7:45 PM

#

oh wait spark doesnt used disk-based storage unless it has to

#

mapreduce does

#

this makes it literally 100x faster (spark)

#

that would be a good question to ask

sterile kernel Mar 25, 2021, 9:39 PM

#

xd

serene scaffold Mar 25, 2021, 9:40 PM

#

sterile kernel xd

This is the channel for talking about data science.

grave frost Mar 25, 2021, 10:10 PM

#

Making code reproducible sucks AF

thorn bobcat Mar 25, 2021, 10:14 PM

#

yo

misty flint Mar 25, 2021, 10:40 PM

#

docker

#

logo_docker

grave frost Mar 25, 2021, 10:40 PM

#

I can't use docker with colab

#

I don't know how to

thorn bobcat Mar 25, 2021, 10:41 PM

#

is there a repo that can change my voice to another persons voice?

#

i just recently ran into https://github.com/NVIDIA/tacotron2

GitHub

NVIDIA/tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference - NVIDIA/tacotron2

#

still wondering if I should work on it

grave frost Mar 25, 2021, 10:41 PM

#

Tacotron is for TTS

grave frost Mar 25, 2021, 10:42 PM

#

thorn bobcat is there a repo that can change my voice to another persons voice?

there's a lot of research done on that, its pretty easy

#

but you may not be necessarily be able to use tacotron

thorn bobcat Mar 25, 2021, 10:42 PM

#

grave frost there's a lot of research done on that, its pretty easy

got links to any papers?

grave frost Mar 25, 2021, 10:43 PM

#

its basically TTS - but the problem is the data.

thorn bobcat Mar 25, 2021, 10:43 PM

#

I thought of using speech recognition to convert my voice into text and then converting that text via tacotron 2

grave frost Mar 25, 2021, 10:43 PM

#

Two Minute papers had a method that can replicate exact voice using 2 minutes of train data

#

but I would have to hunt for it tho

thorn bobcat Mar 25, 2021, 10:43 PM

#

grave frost Two Minute papers had a method that can replicate exact voice using 2 minutes of...

5 seconds?

grave frost Mar 25, 2021, 10:43 PM

#

wdym?

thorn bobcat Mar 25, 2021, 10:44 PM

#

there was one that did it in 5

#

just watched it

grave frost Mar 25, 2021, 10:44 PM

#

must be on the cutting-edge - no way you are deploying that unless yove done your masters

#

or the contributors are active

misty flint Mar 25, 2021, 10:45 PM

#

grave frost I can't use docker with colab

if you have colab, you dont need docker

grave frost Mar 25, 2021, 10:45 PM

#

misty flint if you have colab, you dont need docker

yeah, it helps but not always

thorn bobcat Mar 25, 2021, 10:45 PM

#

grave frost must be on the cutting-edge - no way you are deploying that unless yove done you...

https://github.com/CorentinJ/Real-Time-Voice-Cloning

GitHub

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time - CorentinJ/Real-Time-Voice-Cloning

misty flint Mar 25, 2021, 10:45 PM

#

only issue with colab is the other person has to have access to the dataset too

thorn bobcat Mar 25, 2021, 10:46 PM

#

this was the one featured in the 2 minutes paper, python code mostly but not tensor flow now.

grave frost Mar 25, 2021, 10:46 PM

#

misty flint only issue with colab is the other person has to have access to the dataset too

for me, sometimes I get problems with different GPU's on Apex

misty flint Mar 25, 2021, 10:46 PM

#

lol youre probs training too much for colab

grave frost Mar 25, 2021, 10:46 PM

#

thorn bobcat this was the one featured in the 2 minutes paper, python code mostly but not te...

so?

thorn bobcat Mar 25, 2021, 10:47 PM

#

I was wondering if there was something newer in the field of speech synthesis

misty flint Mar 25, 2021, 10:48 PM

#

this is a good method that also allows you to share with others https://cloud.google.com/dataproc?hl=en_US

Google Cloud

Dataproc | Google Cloud

Dataproc is a fast, easy-to-use, fully managed cloud service for running Apache Spark and Apache Hadoop clusters in a simpler, more cost-efficient way

#

you can even run a jupyter notebook on it

#

for bigger models

grave frost Mar 25, 2021, 10:50 PM

#

misty flint this is a good method that also allows you to share with others https://cloud.go...

I don't have the $$ I only use colab

misty flint Mar 25, 2021, 10:53 PM

#

you get free credits if its your first time

#

more than enough to play around with

grave frost Mar 25, 2021, 10:53 PM

#

done it; blew it

misty flint Mar 25, 2021, 10:54 PM

#

you wasted it already?

#

on what?

grave frost Mar 25, 2021, 10:54 PM

#

yea

misty flint Mar 25, 2021, 10:54 PM

#

memecringeharold

grave frost Mar 25, 2021, 10:54 PM

#

t R a I N i N g

#

wanted to do some some classification

misty flint Mar 25, 2021, 10:54 PM

#

you blew $300 only on training?

#

i-

#

Pika

grave frost Mar 25, 2021, 10:55 PM

#

yeet

#

im a broke boi

#

tho there is a workaround to use unlimited GCP 😏

#

~~Which I am currently using~~

fickle surge Mar 25, 2021, 11:47 PM

#

Hey! im 13 and I am currently learning python via codecademy because I want to get into machine learning... any tips?
im also thinking about ordering these books. https://www.amazon.com/dp/1119245516/ref=cm_sw_r_sms_api_glt_fabc_WMFMSRNVV4PQBHX2B3TH and https://nnfs.io/

#

if anyone responds please dont hesitate to ping or dm me!

grave frost Mar 25, 2021, 11:50 PM

#

fickle surge Hey! im 13 and I am currently learning python via codecademy because I want to g...

Those seem great starters. ML/AI is pretty complicated - especially the mathematics involved. if you do not understand something, you can get an intuitive knowledge of it from youtube. it would still take you some years to understand, but I promise it would be a pretty fun journey 🙂

As you learn more maths in school, things would make more sense. but don't stress if you don't understand anything. we are always here to help!! 🤗

fickle surge Mar 25, 2021, 11:51 PM

#

alright! thanks.

grave frost Mar 25, 2021, 11:51 PM

#

I would recommend you make Youtube your primary source of knowledge. visualizing things is very easy to understand

fickle surge Mar 25, 2021, 11:51 PM

#

ok

median agate Mar 25, 2021, 11:58 PM

#

@fickle surge I'm not a pro but I would start with codecademy (SoloLearn has a course on Machine Learning which is preety good, they're both preety similar though). Go from there to codecademy/udemy, there are a bunch of more advanced courses. Also on YouTube there's a lecture series on ML by Steven Brunton (sp) but it's quite theoretical/mathsy. I wouldn't worry about the maths/theoretical aspect until you've built some practical/fun projects and you still like it Also Unity ML Agents is a great practical intro, no need to know any of the inner workings.

grave frost Mar 25, 2021, 11:59 PM

#

https://www.reddit.com/r/learnmachinelearning/wiki/index is a great index for resources

fickle surge Mar 26, 2021, 12:00 AM

#

alright, im already in a python course on codecademy so i think im going to finish that to learn the basics

stiff barn Mar 26, 2021, 12:23 AM

#

polar dock I wasn't really told a specific "prove parquet > pickle" or something, mostly ju...

Pickle is generally a bad choice since it is only Python readable essentially. Parquet can be read by other things

#

Parquet works really well with services like Azure Data Bricks

#

We generally just use csv or json though or read directly from a DB

gray arch Mar 26, 2021, 1:07 AM

#

What book/course/websites that you all recommend to build a first project on Google Cloud? I literally never use it before and I just mess around with it today I don't have a clue where I should start...

misty flint Mar 26, 2021, 1:08 AM

#

they have what are called Quests on gcp

#

try to complete those. i think thats a good place to start

stiff barn Mar 26, 2021, 1:08 AM

#

There are a lot of services on GCP. It’ll be easier if you have a project in mind so you can narrow it down to a few core services

gray arch Mar 26, 2021, 1:11 AM

#

@misty flint @stiff barn thanks, just signed up for the quests, wish to learn more haha
Feel lagging so behind in the industry 😦

stiff barn Mar 26, 2021, 1:11 AM

#

gray arch <@!446424248479645706> <@!247847269267800074> thanks, just signed up for the que...

No better time to start catching up than today haha

#

If it helps, I’d say the core services to start with would be cloud storage and cloud functions. From there pub/sub and a database like Firestore or BigQuery

#

You can do quite a lot with that combination

gray arch Mar 26, 2021, 1:18 AM

#

@stiff barn thank you so much! I will do my best to be better in that
Hope you don't mind if I pm you in the future if I have any question

stiff barn Mar 26, 2021, 1:28 AM

#

Go for it @gray arch

fickle surge Mar 26, 2021, 1:33 AM

#

@grave frost my end goal is to make an assistant that i plan on modeling off of jarvis from iron man. I want to hook it up to smart home stuff and have it write emails to name a few things. how hard is it to do something like that?

stiff barn Mar 26, 2021, 1:37 AM

#

fickle surge <@738058085083381760> my end goal is to make an assistant that i plan on modelin...

That’s a big project. Google, Apple, Amazon, ect... have a bunch of engineers designated to solving just that.

fickle surge Mar 26, 2021, 1:38 AM

#

What are some things to do with machine learning?

#

Or I least I want to make something that I can Interact with that. I could probably have it comunicate with Phillips hue api easily so when

stiff barn Mar 26, 2021, 1:40 AM

#

Check out https://ifttt.com/

IFTTT

Get started with IFTTT, the easiest way to do more with your favorite apps and devices for free. Make your home more relaxing. Make your work more productive. Keep your data private and secure. We believe every thing works better together.

#

You can probably use that to build a mvp more simply

fickle surge Mar 26, 2021, 1:46 AM

#

Alright

#

Thanks

#

@stiff barn just want to point out, I’m trying to learn machine learning not just make something that serves that purpose

#

Trying to think of a project to do

bronze wolf Mar 26, 2021, 1:52 AM

#

Large data sorting?

#

Have any large datasets you want to teach a computer to organize for you?

stiff barn Mar 26, 2021, 1:52 AM

#

fickle surge <@247847269267800074> just want to point out, I’m trying to learn machine learni...

I’d probably pick a more approachable project

fickle surge Mar 26, 2021, 1:53 AM

#

Like....

fickle surge Mar 26, 2021, 1:53 AM

#

bronze wolf Have any large datasets you want to teach a computer to organize for you?

Not necessarily

stiff barn Mar 26, 2021, 1:53 AM

#

Go on kaggle.com and try the beginner projects like the titanic one

fickle surge Mar 26, 2021, 1:53 AM

#

Ok

stiff barn Mar 26, 2021, 1:54 AM

#

I’d also pick up a book on the subject or sign up for an online course

hollow sentinel Mar 26, 2021, 1:58 AM

#

Uh

#

Jarvis is a pretty lofty goal

#

I don’t really understand why that’s so many people’s goal when they first learn DS/ML/AI

#

it took quite a while to build Siri and Alexa

#

I’m not saying it’s impossible for one person to build something similar on their own but it’s definitely very difficult

stiff barn Mar 26, 2021, 2:02 AM

#

It’s probably just the first use case that comes to people’s mind

misty flint Mar 26, 2021, 2:04 AM

#

#

Data engineering specific interviews increased by 40% in the past year. The second fastest position growth within data science roles went to business and data analysts which increased by 20%.

#

Data engineering is the new data science

#

blobhyperthink

stiff barn Mar 26, 2021, 2:08 AM

#

I was looking at that this morning @misty flint haha

stiff barn Mar 26, 2021, 2:09 AM

#

misty flint > Data engineering specific interviews increased by 40% in the past year. The se...

Guess I should just stay a data engineer haha

misty flint Mar 26, 2021, 2:11 AM

#

for now yeah

#

DoggoKek

#

maybe just keep an eye on the waters for now

#

they say once companies establish their data infrastructure, there will still be some data eng jobs just less afterwards

misty flint Mar 26, 2021, 2:12 AM

#

stiff barn I was looking at that this morning <@!446424248479645706> haha

i saw it from a YTuber i follow

#

https://www.youtube.com/watch?v=UjYc8uH6lHw&t=698s&ab_channel=KarolinaSowinska

YouTube

Karolina Sowinska

Why NOT to become a Data Engineer

I have created multiple videos about data engineering, including a data engineering course for beginners. Why would I advise anyone against pursuing a career in data engineering? I like being as transparent as possible - while this job will be great for many people, it might be disappointing for the others. In this video I'm outlining three reas...

▶ Play video

#

DoggoKek

fickle surge Mar 26, 2021, 2:14 AM

#

hollow sentinel Jarvis is a pretty lofty goal

I mean like a barebones version of Jarvis

stiff barn Mar 26, 2021, 2:14 AM

#

There will always be more data engineers that scientists.

misty flint Mar 26, 2021, 2:14 AM

#

im just glad i signed up for this graduate level databases class next semester

#

DoggoKek

#

its like super full and the waitlist is super long

hollow sentinel Mar 26, 2021, 2:14 AM

#

even a “barebones version” of Jarvis is going to take a while

misty flint Mar 26, 2021, 2:14 AM

#

~~its not even part of my degree plan~~

#

RunFail

#

~~but i thought it would be interesting~~

stiff barn Mar 26, 2021, 2:15 AM

#

misty flint they say once companies establish their data infrastructure, there will still be...

This won’t be for a very long time if ever. It takes years to bring an established company into the modern data infrastructure. There will always be more companies and new and improved infrastructure to bring them to.

misty flint Mar 26, 2021, 2:15 AM

#

pithink

#

this is true

stiff barn Mar 26, 2021, 2:16 AM

#

The correct project in working on to bring a company into the cloud won’t be done until 2022 at the earliest

fickle surge Mar 26, 2021, 2:16 AM

#

This kinda inspired me... seems like a good starting point

stiff barn Mar 26, 2021, 2:16 AM

#

But yeah, getting data engineering skills even if the goal is to be a data scientist will only help

misty flint Mar 26, 2021, 2:16 AM

#

the YTber was just saying as stuff like DataBricks, Azure Data Factory, and Denodo standardizes and virtualizes data, there will be less tasks

hollow sentinel Mar 26, 2021, 2:16 AM

#

Yeah i got that

#

but idk how much experience you have

misty flint Mar 26, 2021, 2:17 AM

#

idk if thats true, thats just what she said

hollow sentinel Mar 26, 2021, 2:17 AM

#

V what that guy does

stiff barn Mar 26, 2021, 2:17 AM

#

It takes away the annoying stuff. Like setting up and maintaining a Hadoop cluster

fickle surge Mar 26, 2021, 2:17 AM

#

Are you talking to me?@hollow sentinel

stiff barn Mar 26, 2021, 2:17 AM

#

Who wants to do that really

hollow sentinel Mar 26, 2021, 2:18 AM

#

@fickle surge yeah

fickle surge Mar 26, 2021, 2:18 AM

#

If so I’m learning and that’s kinda my goal once I get everything down

hollow sentinel Mar 26, 2021, 2:18 AM

#

idk how fast you learn but it’s quite a bit of stuff

#

but you’ll make progress

#

If you do it consistently

stiff barn Mar 26, 2021, 2:19 AM

#

You’ll need to have a solid understanding of software engineering as well to build something of that scope @fickle surge. That won’t just be an ML model, it’ll be a system of things that all need to be developed and interact with each other

fickle surge Mar 26, 2021, 2:20 AM

#

Alright

stiff barn Mar 26, 2021, 2:20 AM

#

I’d save that as an aspirational goal

#

Work your way up to that

hollow sentinel Mar 26, 2021, 2:23 AM

#

yeah that’s what I was trying to say

exotic maple Mar 26, 2021, 2:41 AM

#

This notion that because something "big" is coming from "big tech" means you shouldnt learn a skill is crap that should disspear

lost ridge Mar 26, 2021, 2:42 AM

#

Hi all anyone here ever made a trading algorithm ?

exotic maple Mar 26, 2021, 2:42 AM

#

we have steel mills and automated carpentry nowadays, but carpenters and blacksmiths still exist (as niche, true) careers, and are also well paid

serene scaffold Mar 26, 2021, 2:42 AM

#

lost ridge Hi all anyone here ever made a trading algorithm ?

A lot of people have. That would actually be a question for #data-science-and-ml

#

oh fuck that's where we are

exotic maple Mar 26, 2021, 2:42 AM

#

serene scaffold oh fuck that's where we are

LMAO

serene scaffold Mar 26, 2021, 2:42 AM

#

thought we were in algos and data structs

lost ridge Mar 26, 2021, 2:42 AM

#

Lol

exotic maple Mar 26, 2021, 2:43 AM

#

Stelercus.exe has stopped working

serene scaffold Mar 26, 2021, 2:43 AM

#

pkill -u stelercus

exotic maple Mar 26, 2021, 2:44 AM

#

-> googles: "HOW TO KILL A CHILD"

serene scaffold Mar 26, 2021, 2:44 AM

#

what
no

exotic maple Mar 26, 2021, 2:44 AM

#

-> corrects: "HOW TO KILL A CHILD PROCESS I'M SORRY#

pearl vault Mar 26, 2021, 2:44 AM

#

Sry for disturbing
amd still have driver and software issues?? should i buy an amd or intel laptop?

exotic maple Mar 26, 2021, 2:45 AM

#

I have to say, since i've gotten used to pandas...I kind of dread touching excel lol

serene scaffold Mar 26, 2021, 2:49 AM

#

exotic maple I have to say, since i've gotten used to pandas...I kind of dread touching excel...

You have ascended 👼

misty flint Mar 26, 2021, 2:56 AM

#

exotic maple This notion that because something "big" is coming from "big tech" means you sho...

is that gil

#

gilgaLUL

exotic maple Mar 26, 2021, 2:56 AM

#

ofc it's gil

#

mongrel

misty flint Mar 26, 2021, 2:56 AM

#

DoggoKek

#

my 2nd favorite archer

exotic maple Mar 26, 2021, 2:56 AM

#

2nd?

misty flint Mar 26, 2021, 2:56 AM

#

RunFail

exotic maple Mar 26, 2021, 2:56 AM

#

pathetic mongrel I AM THE KING

#

Enuma Elish

misty flint Mar 26, 2021, 2:58 AM

#

serene scaffold thought we were in algos and data structs

💀

#

anyway

#

@exotic maple idk if you saw the charts earlier but maybe data eng remote job?

#

growing more popular

exotic maple Mar 26, 2021, 2:59 AM

#

I only know python, not enough backend to do engineering

#

well, python and MYSQL

#

if sql is considered a programming lnaguage

misty flint Mar 26, 2021, 2:59 AM

#

exotic maple if sql is considered a programming lnaguage

#

Currently most data engineering roles require only three main types of skillsets: SQL, Python, and algorithms.

#

oh theres also this but less common

#

We're seeing a rise though in data engineers needing to understand system design and architecture problems as well.

#

https://www.interviewquery.com/blog-data-science-interview-report

Interview Query Blog

The 2021 Data Science Interview Report

We analyzed over 10,000 data science interview experiences. Here are our findings.

exotic maple Mar 26, 2021, 3:00 AM

#

omg id love statistics and A/B testing

#

shit's easy AF once you get the hang of it

#

and its easy to show face with it lmao

exotic maple Mar 26, 2021, 3:01 AM

#

misty flint > Currently most data engineering roles require only three main types of skillse...

you srs?

#

SQL I kiiind of know, python id say "intermediate" and algos...i've never taken a formal class but feedback from CS friends tell me i have the logic down

#

eh, who knows, i might just try it out

misty flint Mar 26, 2021, 3:03 AM

#

Praise

#

do it dude

stiff barn Mar 26, 2021, 3:06 AM

#

exotic maple SQL I kiiind of know, python id say "intermediate" and algos...i've never taken ...

I work as a data engineer now and almost everything boils down to SQL or Python. I do a lot of cloud work and ML which will set yourself apart but I wouldn’t say it’s required. Cloud is becoming more and more but that’s easier to pick up

exotic maple Mar 26, 2021, 3:07 AM

#

screw me. all this time i have been shy to apply when I have at least the basic skills for it?!

stiff barn Mar 26, 2021, 3:09 AM

#

Lol yeah I’d give it a shot

misty flint Mar 26, 2021, 3:18 AM

#

DoggoKek

#

tbf you didnt even know

fickle sinew Mar 26, 2021, 3:39 AM

#

you might want to learn Scala too

hollow sentinel Mar 26, 2021, 3:42 AM

#

Yes

waxen girder Mar 26, 2021, 3:43 AM

#

What are good resources to learn SQL but dive deep in things like efficient querying and such.

#

Beyond the basic here's how you do X.

fickle sinew Mar 26, 2021, 3:44 AM

#

spark is written in scala, and it's going to gain traction as data pipelines start leveraging spark more and SQL less...

fickle sinew Mar 26, 2021, 3:45 AM

#

waxen girder Beyond the basic here's how you do X.

get some o'reilly books and learn how to read query plans for the most common database engines

#

tuning SQL is such a weird art though. It's a declarative language, so its not like you can easily tell the query optimizer "do it like this"

exotic maple Mar 26, 2021, 3:49 AM

#

isnt SQL pretty much super optimized by definitiion? I mean, the DB structure and optimization has to be done by a full DBA, not an user of the DBSs

fickle sinew Mar 26, 2021, 3:53 AM

#

I like to say "SQL gives you the benefit of a bunch of really smart people that already figured out how to do most of the simple things"

#

like simple joins, you dont have to decide what the best way to join tables is. as long as you have good choices of indexes and keys, the database will usually do the joins in a very efficient way

waxen girder Mar 26, 2021, 3:55 AM

#

As of right now, I created my own db with postgresql in ubuntu running on wsl2.

#

As of right now I haven't figured out how to connect a SQL gui instance to the DB but honestly I kinda want to just use something like psycopg2 then move on to SQLAlchmey.

fickle sinew Mar 26, 2021, 3:56 AM

#

use psql if you want a handy terminal client for postgres

waxen girder Mar 26, 2021, 3:56 AM

#

Yeah I do use that.

fickle sinew Mar 26, 2021, 3:57 AM

#

python libraries are good but they don't do the admin stuff too well. they kind of assume the database is already built.

waxen girder Mar 26, 2021, 3:57 AM

#

I haven't set up my DB for production so to speak. But as an aspiring analyst I hope I won't have to.

#

Some of the user accounts have their passwords stored as plain text. I don't think I intalled the DB the most secure way according to the docs but I'm just using it to learn.

fickle sinew Mar 26, 2021, 3:59 AM

#

you can run postgres in docker too, that might make some aspects easier (or it might make it worse)

waxen girder Mar 26, 2021, 4:00 AM

#

Apparently you're supposed to install it in its own user w/ the least possible privileges of any user and not have any other software installed on that user.

fickle sinew Mar 26, 2021, 4:00 AM

#

are you running in linux?

waxen girder Mar 26, 2021, 4:01 AM

#

Yeah

exotic maple Mar 26, 2021, 4:02 AM

#

waxen girder As of right now I haven't figured out how to connect a SQL gui instance to the D...

this is literally what i did lmao

exotic maple Mar 26, 2021, 4:02 AM

#

waxen girder As of right now I haven't figured out how to connect a SQL gui instance to the D...

but for postgre you can use pgadmin

fickle sinew Mar 26, 2021, 4:03 AM

#

ive had good luck just using the vanilla packages installed using apt or whatever package manager, very little manual setup

waxen girder Mar 26, 2021, 4:04 AM

#

Yeah but for production you want to be careful.

fickle sinew Mar 26, 2021, 4:05 AM

#

waxen girder Yeah but for production you want to be careful.

sounds like a good reason to run it in docker logo_docker

misty flint Mar 26, 2021, 4:22 AM

#

i banged my head for a day trying to dockerize our team project

#

but eventually i got there

#

logo_docker

#

Praise

#

~~all your dependencies are belong to me~~ blobhyperthink

fickle sinew Mar 26, 2021, 4:24 AM

#

misty flint ~~all your dependencies are belong to me~~ <:blobhyperthink:683298669872545921>

did you do it using an Alpine image?

misty flint Mar 26, 2021, 4:25 AM

#

i think i used buster

#

pithink

#

our project was just really finicky

#

had a flask component to it too

#

also i never had used docker beforehand

#

so there was that

#

DoggoKek

fickle sinew Mar 26, 2021, 4:28 AM

#

buster was a wise choice... but if it was built on flask, i have to ask... what web server did you use

silver widget Mar 26, 2021, 7:41 AM

#

Hi guys.
I need some help about a data analysis project i am working on. I'm working on a bank customer data with transactions and salary info. These information is available for 3 months, and I need to calculate the annual salary of the each customer.
new_df = df.groupby(['account','month'])[['amount']].sum()
I grouped each customer's salaries. however I cannot use each months data as columns. Is there a way to create columns such as 'august', 'september', and 'october' and append the new_df['amount'] values to these columns?
Thanks in advance

pure quiver Mar 26, 2021, 7:49 AM

#

Shouldn't you just groupby month and sum without the account names?

#

Is it because you need to append the average salary as a new column in your original data frame?

silver widget Mar 26, 2021, 7:53 AM

#

The data based on the transaction movements of the customers. for instance, one customer has more than one row in the data.

pure quiver Mar 26, 2021, 7:54 AM

#

So you need each customer's average salary by month, I see

silver widget Mar 26, 2021, 7:55 AM

#

#

This is what i get from the code above

pure quiver Mar 26, 2021, 7:59 AM

#

Okay so you have a multilevel index because of this. I'd approach it differently, create a new data frame with all unique account numbers only as a column, then write a series of groupbys on account where month =8,9, 10 etc and append each series to the new data frame

#

You can write a simple function to speed this up and pass a list of months to it

silver widget Mar 26, 2021, 8:00 AM

#

Oh, that's great. Thank you very much. I'll try that immediately.

pure quiver Mar 26, 2021, 8:01 AM

#

Or rather, when you groupby, you get accounts and their sum salaries, then join on the account numbers

#

Something like this (sorry I'm on phone)

silver widget Mar 26, 2021, 8:03 AM

#

Thanks Dyllyn. I appreciate it.

pure quiver Mar 26, 2021, 8:09 AM

#

Ugh I swear code is impossible to write on phone

#

Well lemme get back to my com, but lmk if you get it

silver widget Mar 26, 2021, 8:10 AM

#

No no pls don't write it 🙂 I'm trying to learn it.I appreciated your help, that I was trying to say

pure quiver Mar 26, 2021, 8:28 AM

#

#

Sorry for the picture of the screen, Im on a closed system

silver widget Mar 26, 2021, 8:29 AM

#

Thank you very much.

pure quiver Mar 26, 2021, 8:33 AM

#

Ah, I made a mistake, at salary the account and month will still be on the index I think. That's for you to fix :)

#

I don't use groupby that much

past arch Mar 26, 2021, 9:16 AM

#

Hello, In NLP text summarisation, is there any way to programmatically differentiate between extractive and abstractive summarisation?

thorn bobcat Mar 26, 2021, 10:23 AM

#

anyone worked with end to end speech synthesis before?

lavish tundra Mar 26, 2021, 12:01 PM

#

Someone who really understand very well about Data-visualization can help me on the #🤡help-banana pls? i'm stuck on this problem for a while...

covert seal Mar 26, 2021, 12:24 PM

#

silver widget

I believe unstack() is what you are looking for

thorn bobcat Mar 26, 2021, 1:30 PM

#

anyone used Flowtron, FastSpeech2, WaveRNN, Tacotron2 or Real-Time Voice Cloning before? I'm thinking of starting out on one of them and was hoping to find something simple to begin working on.

grave frost Mar 26, 2021, 2:18 PM

#

thorn bobcat anyone used Flowtron, FastSpeech2, WaveRNN, Tacotron2 or Real-Time Voice Cloning...

what's wrong with the ones you mentioned?

thorn bobcat Mar 26, 2021, 2:18 PM

#

grave frost what's wrong with the ones you mentioned?

I am just starting out so I want some resources and something simple to start with.

grave frost Mar 26, 2021, 2:19 PM

#

thorn bobcat I am just starting out so I want some resources and something simple to start wi...

They are pre-trained models lol. what else do you want?

#

That's about as simple as your task gets.

thorn bobcat Mar 26, 2021, 2:20 PM

#

grave frost They are pre-trained models lol. what else do you want?

Yea but I'm not just gonna use it as it is.

grave frost Mar 26, 2021, 2:20 PM

#

voice cloning isn't something easy like visualization or regression

thorn bobcat Mar 26, 2021, 2:20 PM

#

Don't I have to tweak it and stuff?

grave frost Mar 26, 2021, 2:20 PM

#

thorn bobcat Don't I have to tweak it and stuff?

not much, compared to training your own model from scratch

thorn bobcat Mar 26, 2021, 2:21 PM

#

grave frost not much, compared to training your own model from scratch

I wanna train my model but not from scratch

grave frost Mar 26, 2021, 2:21 PM

#

thorn bobcat I wanna train my model but not from scratch

you mean you want to fine-tune a model?

thorn bobcat Mar 26, 2021, 2:21 PM

#

grave frost you mean you want to fine-tune a model?

yep.

grave frost Mar 26, 2021, 2:22 PM

#

well they provide pre-trained models in those repos

thorn bobcat Mar 26, 2021, 2:22 PM

#

also for some reason there's no tutorials in youtube regarding this

#

in python atleast

#

most of it is in jupyter notebooks.

grave frost Mar 26, 2021, 2:23 PM

#

what? no one is going to spoon feed something so complex. you would have to research and understand things on your own

#

There is no shortcut that would work well for you. they might give decent results, but not very convincing/realistic

thorn bobcat Mar 26, 2021, 2:26 PM

#

can't you understand deep learning practically?

#

through working on projects.

grave frost Mar 26, 2021, 2:27 PM

#

ofc you can - but it would take a lot more projects. learning with projects is great and I consider it the best way to learn; but to learn something, you have to understand some theory too, not just copy the code by some guy on youtube

thorn bobcat Mar 26, 2021, 2:28 PM

#

grave frost ofc you can - but it would take a lot more projects. learning with projects is g...

copy the code, learn about it, mess around with it and build on top of it is what I was actually thinking.

hollow sentinel Mar 26, 2021, 2:28 PM

#

hol up

#

how are your machine learning basics

grave frost Mar 26, 2021, 2:28 PM

#

thorn bobcat copy the code, learn about it, mess around with it and build on top of it is wha...

that's pretty shallow learning

thorn bobcat Mar 26, 2021, 2:28 PM

#

for example I want to be able to clone Morgan freeman's voice to generate speech from text, that sounds like him, it's been done before but it's just an example.

hollow sentinel Mar 26, 2021, 2:28 PM

#

before you do all this voice cloning shit

#

how are your basics

grave frost Mar 26, 2021, 2:28 PM

#

hollow sentinel before you do all this voice cloning shit

that's what I am trying to explain to him. but he seems open and receptive (unlike some of the others)

hollow sentinel Mar 26, 2021, 2:29 PM

#

bc voice cloning is pretty ambitious if you're just a beginner

thorn bobcat Mar 26, 2021, 2:29 PM

#

hollow sentinel how are your basics

well I'm sorta informed but nothing indepth.

hollow sentinel Mar 26, 2021, 2:29 PM

#

define sorta informed

#

like

#

do you know the math behind the field?

#

the math is what you're going to need if you want to finetune parameters

thorn bobcat Mar 26, 2021, 2:30 PM

#

by informed I mean I know about LSTM, RNN, CNN, GAN, stylegan

#

and some of the underlying logic..

grave frost Mar 26, 2021, 2:30 PM

#

thats a start

#

but you have to do some research, learn a few more things etc.

thorn bobcat Mar 26, 2021, 2:30 PM

#

hollow sentinel do you know the math behind the field?

supervised / unsupervised learning.

thorn bobcat Mar 26, 2021, 2:30 PM

#

hollow sentinel do you know the math behind the field?

not quiet actually.

thorn bobcat Mar 26, 2021, 2:31 PM

#

grave frost but you have to do some research, learn a few more things etc.

so where do I start?

grave frost Mar 26, 2021, 2:31 PM

#

depends on how old you are

thorn bobcat Mar 26, 2021, 2:31 PM

#

grave frost depends on how old you are

grave frost Mar 26, 2021, 2:31 PM

#

thorn bobcat 25.

so are you in college?

thorn bobcat Mar 26, 2021, 2:31 PM

#

grave frost so are you in college?

yup computer science major.

grave frost Mar 26, 2021, 2:31 PM

#

well, then just learn it the proper way! take the AI course

#

attend the stats and math lectures

#

college is the easiest time to learn IMO

hollow sentinel Mar 26, 2021, 2:32 PM

#

https://mml-book.com/

Mathematics for Machine Learning

#

there is this book here

#

this will show you what you need to know for the math in ML

grave frost Mar 26, 2021, 2:32 PM

#

A simple start off point https://www.reddit.com/r/learnmachinelearning/wiki/index

index - learnmachinelearning

r/learnmachinelearning: A subreddit dedicated to learning machine learning

thorn bobcat Mar 26, 2021, 2:32 PM

#

hollow sentinel this will show you what you need to know for the math in ML

alright perfect.

#

what would be a challenging yet rewarding task to undertake as a start in ML journey?

#

thing is I wanna also develop my python skills which is why i wanna do something..

grave frost Mar 26, 2021, 2:34 PM

#

thorn bobcat alright perfect.

do you have Ai/stats course in your college?

thorn bobcat Mar 26, 2021, 2:34 PM

#

grave frost do you have Ai/stats course in your college?

next semester.

#

we do have simulation and modeling and statistics this sem

grave frost Mar 26, 2021, 2:34 PM

#

thorn bobcat next semester.

is that AI or stats?

thorn bobcat Mar 26, 2021, 2:34 PM

#

grave frost is that AI or stats?

AI

grave frost Mar 26, 2021, 2:35 PM

#

ehh, you are in college. just see what books there are and read em up. ask what you don't understand - there are many highly expereinced people here that can answer almost all of your queries

thorn bobcat Mar 26, 2021, 2:36 PM

#

grave frost ehh, you are in college. just see what books there are and read em up. ask what ...

so your recommendation is for now hold off on the voice cloning ambition and work on the core concepts?

grave frost Mar 26, 2021, 2:36 PM

#

thorn bobcat so your recommendation is for now hold off on the voice cloning ambition and wor...

exactly

hollow sentinel Mar 26, 2021, 2:37 PM

#

voice cloning will only get easier once you know the core concepts

#

otherwise you're just grasping at straws

grave frost Mar 26, 2021, 2:37 PM

#

if you aren't enjoying that, you can see some of the technical articles for cloning and they would teach you maths too (albeit with less explanations since there is a lot to cover)

hollow sentinel Mar 26, 2021, 2:37 PM

#

which makes knowing the math even more important

grave frost Mar 26, 2021, 2:37 PM

#

but you would find yourself frequently finding topics to learn and making a list of it

#

and of course, we are always here

thorn bobcat Mar 26, 2021, 2:38 PM

#

thanks for all the advice, guess I'll start up with some of the math, core concepts and theory before applying things practically

#

I'll try to break it up into bite sized chunks tho so I don't get bored, cause I actually like working on projects to solidify what I learnt.

grave frost Mar 26, 2021, 2:40 PM

#

thorn bobcat I'll try to break it up into bite sized chunks tho so I don't get bored, cause I...

me too buddy

lapis sequoia Mar 26, 2021, 3:20 PM

#

@grave frost what was the jedi alternative you were recommending for jupyter?

obtuse marlin Mar 26, 2021, 3:31 PM

#

Pylint (Microsoft Server)

#

I doubt

lapis sequoia Mar 26, 2021, 4:06 PM

#

Is it possible to draw a 3d shape on an image using matplotlib3d?
I know we can do it for 2d, but I am trying to do it for 3d but can't figure it out
I am looking for something like this: https://stackoverflow.com/a/15592168

Stack Overflow

Image overlay in 3d plot using python

I have a 3d plot of lines generated by matplotlib. I want to overlay an image at a specific xy (or yz, xz) slice. How do I do that using python? Thanks.

I have a simple 3d plot code as:

fig = plt.

plucky harness Mar 26, 2021, 4:07 PM

#

Is there machine learning app possible with python?

odd lion Mar 26, 2021, 4:19 PM

#

plucky harness Is there machine learning app possible with python?

Yes.... probably most of ML is done in python

shut valve Mar 26, 2021, 4:42 PM

#

thorn bobcat so your recommendation is for now hold off on the voice cloning ambition and wor...

I’m doing voice cloning to my problems has just been data cleaning for the last two weeks and prob the next two too

#

Honestly I wouldn’t bother trying to learn the pure math on your own bc like unless you enjoy that you prob won’t finish it you can do a lot of machine learning and deep learning withOUT in depth math

thorn bobcat Mar 26, 2021, 4:44 PM

#

shut valve I’m doing voice cloning to my problems has just been data cleaning for the last ...

why don't you use some of the pre insisting datasets or do they not fit your purpose?

shut valve Mar 26, 2021, 4:44 PM

#

Bc that’s not what I want I want a certain voice

#

Like there a lot of real time voice cloning libs on git that are not bad (not great but amazing for the small amount of data given)

thorn bobcat Mar 26, 2021, 4:49 PM

#

how much experience would i need working with them?

#

@shut valve https://github.com/CorentinJ/Real-Time-Voice-Cloning this one is interesting

GitHub

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time - CorentinJ/Real-Time-Voice-Cloning

#

also tactotron 2

#

WaveRNN and Glow,

shut valve Mar 26, 2021, 4:49 PM

#

Yeah some of them are real click record and run it depends what your trying to do with it

thorn bobcat Mar 26, 2021, 4:50 PM

#

TTS tranformers too

#

I want things I can integrate into bigger projects tbh.

#

I don't want to re invent the wheel but I'd like to use it build a car, if you catch my drift..

shut valve Mar 26, 2021, 4:51 PM

#

Well yeah then that’s just like taking what you want from it how modular it is to pick up and move differs from project to project

#

Like most probably allow you to re train a pre trained model and then use that model in your project now that’s an awesome skill to have

#

But that’s a little more advanced but you don’t need to understand linear to do that

thorn bobcat Mar 26, 2021, 4:53 PM

#

shut valve Like most probably allow you to re train a pre trained model and then use that m...

that's what I actually want to do.

#

retrain, tweak and perhaps understand what's going on.

dusk kite Mar 26, 2021, 4:56 PM

#

Hey guys, I am a Data Scientist looking to grow my skills as a Machine Learning Engineer. Does anyone have recommendations for learning resources?

shut valve Mar 26, 2021, 4:56 PM

#

well depending on how much you know Im gonna assume you know python but not much or no ml. https://www.kaggle.com/learn/overview
intro to ml
intermediate to ml
intro to deep
computer vision

Learn Python, Data Viz, Pandas & More | Tutorials | Kaggle

Practical data skills you can apply immediately: that's what you'll learn in these free micro-courses. They're the fastest (and most fun) way to become a data scientist or improve your current skills.

thorn bobcat Mar 26, 2021, 4:56 PM

#

shut valve well depending on how much you know Im gonna assume you know python but not much...

I know python but no ml

#

I do know some of the math involved and I actually love math sometimes.

#

I like working on projects more than reading tho idk why.

dusk kite Mar 26, 2021, 4:57 PM

#

Looking more into MLops and architecture information / resources

shut valve Mar 26, 2021, 4:58 PM

#

i say computer vision because it takes about data augmentation and training pre trained models. now thats a kinda big stack might take you a couple weeks but it would be really good and quick to getting started with ml. Then you can see if you really wanna stay in AI/ML

thorn bobcat Mar 26, 2021, 4:58 PM

#

@shut valve Thanks alot, I've added it to the resource list. I'll be sure to check it out.

#

Goodluck with your work

#

and thanks for all the helpful answers.

shut valve Mar 26, 2021, 4:59 PM

#

enjoy

#

if you struggle it means its working

thorn bobcat Mar 26, 2021, 5:00 PM

#

shut valve if you struggle it means its working

actually after checking it out I'm adding this to the top of my resource list.

#

seems like it will help alot.

lapis sequoia Mar 26, 2021, 5:00 PM

#

@dusk kite read pattern classification by stork, duda and hart

grave frost Mar 26, 2021, 5:13 PM

#

lapis sequoia <@!738058085083381760> what was the jedi alternative you were recommending for j...

kite and TabNine

lapis sequoia Mar 26, 2021, 5:18 PM

#

grave frost kite and TabNine

yeah tabnine

#

let me check it out

#

thanks man

gray arch Mar 26, 2021, 7:26 PM

#

This topic seems to be less heated today than usual lmao
When I check every hour it always has 50+ messages

grave frost Mar 26, 2021, 7:30 PM

#

everyone's lazy and bored

#

they need a controversial topic to be stimulated

humble widget Mar 26, 2021, 7:35 PM

#

Today I came across a job description for a junior data science position that requires "Hands-on experience with ML tools such as TensorFlow, Keras, PyTorch; Experience with Data Mining and Data Analysis technologies and language, including Python, pandas, Jupyter Notebook, Matplotlib, NumPy." Would you say that this is kind of a standard requirement overall?

rancid orbit Mar 26, 2021, 7:36 PM

#

hi

grave frost Mar 26, 2021, 7:37 PM

#

humble widget Today I came across a job description for a junior data science position that re...

wait, they want you to use Jupyter notebookss?

humble widget Mar 26, 2021, 7:38 PM

#

grave frost wait, they want you to use Jupyter notebookss?

I don't know, it seems so, there are no other details

grave frost Mar 26, 2021, 7:53 PM

#

I don't see how jupyter notebook is a skill

#

its just good for rapid experimentation and visualization

gray arch Mar 26, 2021, 7:54 PM

#

humble widget Today I came across a job description for a junior data science position that re...

I am not sure about what exactly the job position is but from my AI/ML internship I would say experience in TensorFlow, Keras, pandas, numpy are the top requirements. Jupyter Notebook is useful only when you wanna display graphs and visualizations and such but it shouldn't be a required skill if you are making applications

#

About PyTorch, it seems legit, but I have only used it in school research not in real industry so far

humble widget Mar 26, 2021, 7:56 PM

#

It is a position related to climate change modelling at a financial advisor, so I guess that the visualization might be related to GHG emissions.

humble widget Mar 26, 2021, 7:56 PM

#

gray arch About PyTorch, it seems legit, but I have only used it in school research not in...

Got it

gray arch Mar 26, 2021, 7:57 PM

#

I see, but it's not too difficult to practice Jupyter Notebook anyway, Google Colab is one of the ways to go haha

short heart Mar 26, 2021, 8:26 PM

#

Is it ok if my training dataset has some generic floats and some of them are numpy.float s

grave frost Mar 26, 2021, 8:33 PM

#

gray arch About PyTorch, it seems legit, but I have only used it in school research not in...

You would need it when implementing research level findings in your projects - some are for TF, but most use PyTorch

misty flint Mar 26, 2021, 9:07 PM

#

gray arch This topic seems to be less heated today than usual lmao When I check every hour...

its the weekend here so

#

im chillaxing today

#

Oopsies

west bolt Mar 26, 2021, 10:18 PM

#

How do I get Jupyter to display a sympy Matrix?

#

Currently I have

init_printing()```
In cell 1 and
```A = Matrix([[1, 2, 3], [4, 5, 6], [7,8,9]])
A```
In cell 2

balmy junco Mar 26, 2021, 10:40 PM

#

Is there a function in Python to check whether or not a set is a basis?

#

Or do I just need to do it myself lol?

tidal bough Mar 26, 2021, 10:49 PM

#

oh, that's a nice question

tidal bough Mar 26, 2021, 10:50 PM

#

balmy junco Is there a function in Python to check whether or not a set is a basis?

hmm, pretty simple actually if you have n n-dimensional vectors - just write them into an n x n matrix as rows and check that the determinant is nonzero.

#

(but no, don't think there's a function for that in numpy/scipy)

#

if you have more than n vectors and you want to check whether that set contains a basis, then I actually don't know, hmm. Is there a simple way to check for that?..

exotic maple Mar 26, 2021, 11:09 PM

#

Idk how but I got lip-won the HR VP and now I have to develop "an ML model to improve our recruiting"

hollow sentinel Mar 26, 2021, 11:10 PM

#

So you got the job?

#

Or internship

#

idk what lip won means I’m inferring

exotic maple Mar 26, 2021, 11:11 PM

#

Its not a job

hollow sentinel Mar 26, 2021, 11:11 PM

#

oh

exotic maple Mar 26, 2021, 11:11 PM

#

Im trying to transition away fron my mid position into slmethint about data or abalytics. There is no such thing where I work

#

So i thought about convincing the CEO, but thars too far up for me

#

So... HR VP lol.

hollow sentinel Mar 26, 2021, 11:12 PM

#

Oh

exotic maple Mar 26, 2021, 11:12 PM

#

Basically, sold her the idea or ML / Data department

hollow sentinel Mar 26, 2021, 11:12 PM

#

sorry I completely misunderstood

exotic maple Mar 26, 2021, 11:12 PM

#

So if i get it right

#

I can get her to tell the CEO and crearw the department

#

Thats my plan at least

hollow sentinel Mar 26, 2021, 11:12 PM

#

that sounds very good

exotic maple Mar 26, 2021, 11:13 PM

#

Yeah but i need to get the ball rolling on my own now lol

hollow sentinel Mar 26, 2021, 11:14 PM

#

Yeah idk about ML helping recruitment

exotic maple Mar 26, 2021, 11:14 PM

#

I have a clue of something that can help. Not recruitment itself but after it. Reducing attrition, churn, and other negative metrics

#

A mix of classification and regression might help there.
Predicting churn. Probabilities or attritition based on profiles, etc etc

hollow sentinel Mar 26, 2021, 11:16 PM

#

it isn’t impossible 🙂

grave frost Mar 26, 2021, 11:23 PM

#

exotic maple Yeah but i need to get the ball rolling on my own now lol

NER on CV to quickly locate high achievers and multiple points of interest. fine-tune the model

exotic maple Mar 26, 2021, 11:23 PM

#

NER?

grave frost Mar 26, 2021, 11:24 PM

#

Named Entity Recognition

#

Named-entity recognition is a subtask of information extraction that seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc. Wikipedia

exotic maple Mar 26, 2021, 11:24 PM

#

-shivers- omg not NLP pl00x

#

I suck at NLP atm haha
Didnt learn properly.

#

Besides, is there good support for NLP.in Spanish?

grave frost Mar 26, 2021, 11:25 PM

#

still, you can get by with some basics

exotic maple Mar 26, 2021, 11:25 PM

#

Everythign ive seen is in Spanish

grave frost Mar 26, 2021, 11:25 PM

#

exotic maple Everythign ive seen is in Spanish

the CV's?

exotic maple Mar 26, 2021, 11:25 PM

#

grave frost still, you can get by with some basics

Ill look into it

exotic maple Mar 26, 2021, 11:25 PM

#

grave frost the CV's?

Yes. Im from latin america

#

English wordnets and stuff are useless to me

#

For that

grave frost Mar 26, 2021, 11:26 PM

#

there are plenty of spanish pre-trained models

#

spanbert

#

gpt2 spanish

#

https://huggingface.co/models?search=spanish

Hugging Face – The AI community building the future.

#

I myself am currently working with Low resource languages, so your task seems a piece of cake

exotic maple Mar 26, 2021, 11:27 PM

#

I wonder if its ok to use pretrained models... legally speaking and all that

grave frost Mar 26, 2021, 11:27 PM

#

exotic maple I wonder if its ok to use pretrained models... legally speaking and all that

yes

#

AFAIK

exotic maple Mar 26, 2021, 11:27 PM

#

I suppose open sourve it shoulsnt matter

grave frost Mar 26, 2021, 11:28 PM

#

exotic maple I suppose open sourve it shoulsnt matter

yea, both weights and code