#data-science-and-ml

1 messages · Page 300 of 1

quiet dawn
#

is there

#

?

grave frost
#

And, well, linear function do not useful computation make
Too much CGP Grey?

tidal bough
#

haven't actually watched that much of him, but maybe that's where I picked up that phrase, yes

hollow sentinel
#

there is

lapis sequoia
#

AI is purely applied maths. Usually it all boils down to linear algebra. You don't have to understand how many of these models work and what they do mathematically. It would help no doubt. But it's probably better to keep the implementation of them a black-box and focus on the when / what / why / pros / cons of models. You can always dive deeper into the maths later

short heart
#

Is it possible to pass a list of X_trains to LSTM

#

or how do i fit several x trains into one lstm

grave frost
#

Does anyone have any ideas of implementations of fully unsupervised local POS taggers?

tidal bough
short heart
#

oops

#

im stupid

#

thanks

tidal bough
#

Okay, data science question: I have a pandas dataframe of UTC timestamps and values. I want to plot the average (maybe +-std) value by time of day (disregarding the date). How would I do that? My current solution is very hacky and possibly incorrect.

grave frost
#

so 24 bins, count values and plot accordingly?

tidal bough
#

pretty much

#

except, obviously, I want to do it "the right way"

#

since it seems to me like a common-ish operation

ripe forge
#

Can you share your current way?

eternal narwhal
misty flint
# tidal bough Okay, data science question: I have a pandas dataframe of UTC timestamps and val...

something like how this guy does it? is that a similar scenario https://youtu.be/jV24N7SPXEU?t=171

This is part 8 of my pandas tutorial from PyCon 2018. Watch all 10 videos: https://www.youtube.com/playlist?list=PL5-da3qGB5IBITZj_dYSFqnd_15JgqwA6
This video covers the following topics: math with booleans, groupby, datetime attributes, line plots.

NEW TO PANDAS? Watch my introductory series (30+ videos):
https://www.youtube.com/playlist?list=...

▶ Play video
#

he did pd.to_datetime() previously before working with that column by the way

#

so his approach was

df.groupby(df.column_datetime.dt.hour).column.mean()

#

column being the column of interest

tidal bough
#

hmm

#

let me figure out how to apply this to mine

tidal bough
#

Oh, this works and does what I want:

hours = df[["date","eu"]].copy()
hours["hour"] = hours["date"].dt.hour
del hours["date"]
sns.relplot(data=hours,x="hour",y="eu",kind="line")
misty flint
#

nice

#

didnt know seaborn could do that

tidal bough
#

essentially creating a new column with the dates replaced by hours, then passing this dataframe (with a lot of y-values for each x) to relplot

#

result looks like this, which is what I wanted

misty flint
#

oh and the shaded is the std?

tidal bough
#

yup

misty flint
#

magic

#

does relplot automatically display std?

#

or was there a parameter you had to specify

#

i think i might try and use seaborn then for my project

tidal bough
misty flint
#

nice

#

i need to remember this

tidal bough
#

it's the ci parameter to lineplot

#

(which relplot calls when doing kind="line")

misty flint
#

interesting

tidal bough
#

it also calculates them using bootstrapping, which takes like a second for my barely-2k datapoints 😅

#

(but it scales well with big n or something, don't remember why bootstrapping is considered nice)

misty flint
#

thats pretty nifty

grave frost
#

the problem is not in the clustering - but that I don't have any ground labels

#

I did find a way using NLTK (after a lot of hours of searching) , but thought that maybe there is some resource I missed

little path
#

Write a program to generate a series of marks of 10 students. Give grace marks up to 5 of those who are having <33 marks and print the new list of the marks.
I've tried this but not working:-
import pandas as pd def Ser_stumarks(): std_marks = [] for i in range(1,11): m = int(input("Enter the marks:")) std_marks.append(m) s = pd.Series(index=range(1201,1211),data=std_marks) s[s<33]=s+5 print("New List is:") print(s[s>=33]) Ser_stumarks()

#

Is there any easy and simple code to do that?

tidal bough
#

what's "grace marks"?

#

oh, I see, increase their mark by 5

#

why are you filtering the printed result to only the >33 ones, though?

short heart
#

i added quiet alot of data for lstm to learn with, but the results are only worse

#

any ideas why?

uncut barn
twin mantle
#

Hello

#

I have this

#

sns.histplot(datos, bins=9, binrange=(10,99), color='gray', kde=True,
             line_kws= {'color':'blue','linestyle': 'dashed'},
             fill=False)
#

But it's still entirely gray

#

How can I change the kde line color?

grave frost
twin mantle
#

That sounds like a bag of words

grave frost
rancid vine
#

I mean, alot of it. Mostly linear algebra. A nightmarish amount. xD

twin mantle
#

LOL, there's a lot of math inside AI

quiet dawn
#

not in ai

little path
#

Write a panda program to enter  marks in main five subjects  of a student and Calculate sum of all marks.

b) Also write a small python code to create a dataframe with headings(Name and Age) from the list given below :

[[‘’Alex”,26],[“Maddy”,44],[“Rolex”,26],[“Mona“,37]]

Now sort the data as per the name

SUM=DF[‘ENG’]+DF[‘MATH’]+DF[‘HINDI’]

DF[‘PER’]

#

I can't do the sorting part

#

Rest I've done plz tell me how to do that

twin mantle
#

Did you create the dataframe?

little path
#

Yes

twin mantle
#

Sort it

grave frost
twin mantle
#

.sorted()

little path
#

What to do with this:- SUM=DF[‘ENG’]+DF[‘MATH’]+DF[‘HINDI’]
DF[‘PER’]

#

Do I have to use sum command?

little path
#

Idk it's written in question

#

I am thinking what to do with that

twin mantle
little path
#

There's the question

#

Last part

twin mantle
exotic maple
little path
#

I'm getting diverted nvm

exotic maple
#

its like being in college all over again:

hollow sentinel
#

are there sorting algorithms in pandas

#

or do you just do .sort

#

I actually don't remember much about pandas

little path
#

.sort_values

twin mantle
#

Meanwhile, I am stuck with my seaborn question from 2 hours ago

exotic maple
twin mantle
#

Instructions so vague so that you cant go anywhere but give the impression you're still teaching something

exotic maple
#

"prof how do i balance this load (Dynamics)
"do you have the equations?"
"yes"
"solve them"

#

and im like: This mfker getting paid so much to say that shit?

little path
#

Lol

twin mantle
#

They are paid for research papers, not to actually give a fuck about the future professionals

exotic maple
#

If i told my boss "solve it" id be out of the door before i said "im so-"

twin mantle
#

Anyone here knows seaborn?

exotic maple
#

what do you need?

twin mantle
#

I want to do a histogram with a kde line overlay

#

The histogram bars should be gray and the kde line should be blue

#

This the code

#
sns.histplot(datos, bins=9, binrange=(10,99), color='gray', kde=True,
             line_kws= {'color':'blue','linestyle': 'dashed'},
             fill=False)
#

All the elements are gray

exotic maple
#

image?

#

ah

#

I see yourmistake lmao,

#

straight from the doc @twin mantle

#

you're using line_kws

#

you need kdw_kws

#

kdw

#

kde keywords contain "color" as well

twin mantle
#

Yeah but when I use kde_kws

#

With a dict with key color and value 'blue'

#

I get this

little path
#

There's a question where I have to put 10 values in series but its saying that init() takes from 1 to 7 positional arguments

twin mantle
exotic maple
#

thats....weird

#

try again and show mt he full traceback?

#

bwecause that guy was using distplot, but im checking histplot

twin mantle
#

That's the full traceback

twin mantle
little path
#

Ooo

twin mantle
#

So a([1,2]) instead of a(1,2)

little path
#

Ya got it

twin mantle
#

LOL

#

It's a bug

#

A seaborn bug

exotic maple
#

yes

#

thats defeinitely a bug

#

nice finding lmao

#

first i find a bug haha

#

that's a problem with the class instantiationg since it calls back to init

shadow frigate
#

hello! is there a way of getting the same result as

import numpy as np

labels = np.random.randint(0, 4, size=10)

a = np.zeros(shape=(10,4))

for idx, _ in enumerate(labels):
    a[idx, _] = 1

print(labels)
print(a)

without that horrible loop?

exotic maple
#

because...

twin mantle
#

What are you trying to do?

shadow frigate
#

on each row in a, I want to set a single value to 1, the position is given by labels

#
a = [[0. 0. 1. 0.]
 [0. 1. 0. 0.]
 [0. 1. 0. 0.]
 [0. 1. 0. 0.]
 [0. 0. 0. 1.]
 [1. 0. 0. 0.]
 [0. 0. 0. 1.]
 [0. 0. 0. 1.]
 [0. 1. 0. 0.]
 [0. 0. 1. 0.]]```
#

so label[0]=2 means a[0,2]=1, label[1]=1 means a[1,1]=1

#

etc

worldly sigil
#

hey everyone, if anyone's looking to throw down in "Sliced: a Data Science Competition", reach out to nickwan on Twitter and tell him that Nschamps sent you. The competition heats back up in June with 16 competitors as the goal. I had a lot of fun this season look forward to seeing some of you there.

https://www.twitch.tv/videos/960771956

Twitch

nickwan_datasci went live on Twitch. Catch up on their Science & Technology VOD now.

▶ Play video
exotic maple
#

does that have some programatic logic?

#

because i cant see any other way, as-is

twin mantle
#

Do the positions change?

twin mantle
exotic maple
#

Like, if he wanted all first indeces he could something like array[:,0] = 1

#

or so

shadow frigate
#

I need to prepare a label matrix to pass to a loss function

exotic maple
#

but i dont see any logic there

#

I dont see a way outside the loop

#

because you have your indeces as a list too

#

or maybe...

#

mmm

twin mantle
#

Frey

exotic maple
shadow frigate
# twin mantle ???

not sure what you mean here, labels are generated randomly at each iteration so yeah?

exotic maple
#

sorry man i cant help you tbh im not seeing the order in what you're trying to do

shadow frigate
#

hm so

#

I'm trying to compute the cross entropy on the output of a nn

#

which has size 5000 total samples x 3000 possible outcomes

#

in my code, labels is the position of the correct outcome out of the 3000

#

on each of the 5000 rows

twin mantle
#

Mate, with all respect

#

You're using big words for this problem. The question is simple:

#

What is your criterion to change values?

#

How do you decide the indexes of the values, row-wise, column-wise?

little path
#

Write a program to generate a series of marks of 10 students. Give grace marks up to 5 of those who are having <33 marks and print the new list of the marks.

shadow frigate
#

the size of the matrices is constant, the values change with each iteration

little path
#

How to give that grace marks

twin mantle
little path
#

Ok as you say

grave frost
#

@shadow frigate well, what you are doing is called one-hot encoding - your label range is determined by the length of the row so that may confuse someone, but it is (in essence) one-hot encoding on a fixed array

shadow frigate
#

yeah now that you put it that way, it is OuroborosSlain

grave frost
#

uh-huh

shadow frigate
#

holy moly I'm tired OuroborosSlain

grave frost
#

so you can use the pre-built modules in sklearn, or generate an array each time and append in on the appropriate axis

exotic maple
#

he's manually one hot encoding????

grave frost
#

yeah, something like that

shadow frigate
#

might be

exotic maple
#

holy spirit of God

#

this madman

grave frost
#

but its easy to confuse (AFA I have understood)

shadow frigate
#

'twas a long day ok OuroborosSlain

grave frost
#

better take a break 😁 I find it helps a lot when doing long stuff

exotic maple
#

so i'm trying to make a habit of work, then rest 10-15 mins

#

repeat

grave frost
#

yeah, shower for me 😄

hollow sentinel
#

yeah I'm gonna take a fat nap

exotic maple
#

the brain needs some time for the information to settle

grave frost
hollow sentinel
exotic maple
#

xd

shadow frigate
#

yep time to stop, thanks for pointing that out, I'm definitely done for the day

#

cheers

hollow sentinel
#

it means a long nap people

grave frost
#

ahhh...I am too young for naps

exotic maple
#

-tropics life-

#

its heaven dude

#

a noon nap and you're refreshed all afternoon

grave frost
#

sad. I just can't sleep any time 😦

exotic maple
#

I call it nap, but i dont relaly sleep

#

just close my eyes

#

and calm my mind

#

that works too

little path
#

why its showing nan ive passed values na 😐

dreamy jewel
#

I am making a dungeon crawler and I have done all the basic stuff (the player,tile,collision etc.) and now I wanna make a test level for which I have to make a BOSS I have created the sprite of the boss but IDK how to implement the AI for boss as I never did these kinda stuff ( I am using pygame) so please help me.

exotic maple
#

its better to create the series with index right away...

#

and use this method

little path
#

Hmm I see

exotic maple
#

try reading documentation my man. saves a lot of time :p

#

but for the sake of it, paste your lists here i'll to reproduce

#

it worked fine for me?

#

oooh

spiral peak
exotic maple
#

if i change it to strings it doesnt work, weird

exotic maple
little path
#

Ya bro done @exotic maple

exotic maple
#

it seems you cant convert like that

#

it inherits the indeces

little path
#

Lol I have to write all my codes in my file , will see you next time

exotic maple
#

dang i cant answer why we get those nans vals

twin mantle
exotic maple
#

you cant pass range alone

#

its a generator object, not an iterable

#

I THINK -hesitant-

#

that the problem there in the 2nd case is that tries to look for the values of index in the parent series

#

yes, that's exactly what it does. Its not setting the argument of index as the index, its searching for it

#

I have my answer now 😄

#

how can I tell the people in pandas to modify that documentation section?

misty flint
#

github?

grave frost
#

This is the docs for the NLTK HMM - I want to do unsupervised tagging on my dataset

 class nltk.tag.hmm.HiddenMarkovModelTrainer(states=None, symbols=None)[source]

    Bases: object

    Algorithms for learning HMM parameters from training data. These include both supervised learning (MLE) and unsupervised learning (Baum-Welch).

    Creates an HMM trainer to induce an HMM with the given states and output symbol alphabet. A supervised and unsupervised training method may be used. If either of the states or symbols are not given, these may be derived from supervised training.

    Parameters

            states (sequence of any) – the set of state labels

            symbols (sequence of any) – the set of observation symbols


Does anyone know about states and symbols? I can't find much from googling

abstract zealot
#

Any improvements as to how I can speed up the following:

Df.groupby([a,b,c]).agg({col1: [funca, funcb, funcc], col2: [funca, funcb, funcc]})

??

velvet thorn
#

what are the functions?

abstract zealot
velvet thorn
abstract zealot
#

What you mean vectorised ?

velvet thorn
#

and memoisation won’t help

velvet thorn
#

it’s a numpy thing

abstract zealot
#

Nice I’ll look into this

#

Running this on my data frame is taking >5 hours

velvet thorn
#

as in, if none of those are your own functions

#

they are likely already vectorised.

velvet thorn
abstract zealot
#

25 mill

velvet thorn
#

5 hours seems a bit long

#

what functions specifically?

abstract zealot
#

Yes that’s what I thought

velvet thorn
#

show code

abstract zealot
#

Out of interest does reshaping take a lot of time ?

velvet thorn
#

generally, no

#

well, more accurately, it depends.

#

on whether a copy is made

abstract zealot
abstract zealot
# velvet thorn what functions specifically?

an example would be

def func1(x):
    x = pd.Series([e*100 for e in x.values])
    _scaled =     
StandardScaler(with_std=False).fit_transform(x.values.reshape(-1,1))
    return kstest(rvs=_scaled, cdf='t', N=len(_scaled), args=(1, ))[1]
velvet thorn
velvet thorn
#

that's probably not a good idea

abstract zealot
#

oh no hahaha

velvet thorn
#

you're new to pandas and numpy, right

#

so

#

you should just do this

#

(for example)

#

!e

import numpy as np

a = np.array([1, 2, 3])
print(a)

b = a * 100
print(b)
arctic wedgeBOT
#

@velvet thorn :white_check_mark: Your eval job has completed with return code 0.

001 | [1 2 3]
002 | [100 200 300]
velvet thorn
#

you have a list comprehension there

#

which of course will be slow

#

and then you go through the overhead of converting it back into a Series

#

I would suggest

#

reading up on the basics of numpy

#

it would help you write better code

abstract zealot
#

this makes complete sense thank you very much for the example

velvet thorn
#

also I question the wisdom of using StandardScaler there?

#

!e

import numpy as np

a = np.random.rand(5)
print(a)

zero_mean = a - a.mean()
print(zero_mean)
print(zero_mean.mean().round(5))
arctic wedgeBOT
#

@velvet thorn :white_check_mark: Your eval job has completed with return code 0.

001 | [0.2841178  0.3210537  0.3932124  0.62640306 0.96628655]
002 | [-0.2340969  -0.197161   -0.1250023   0.10818836  0.44807184]
003 | 0.0
velvet thorn
#

@abstract zealot this is basically what you're doing, right

#

centering around 0

abstract zealot
#

yes

velvet thorn
#

again, I would suggest a bit of research on the purpose of sklearn's transformers

#

they are helpful for building a pipeline

#

but in this case what you are doing is just a single operation of centering

hollow sentinel
#

nooo gm what happened to your username

velvet thorn
#

it would make more sense to use a plain numpy operation

velvet thorn
#

I kept getting pinged

hollow sentinel
tardy crest
#

Hey guys what all comes under data science engineering? Do you guys think it's gonna be worth it ? I'm kinda confused whether I should be taking cs/data science/AI...do you guys think the placements different in them?

velvet thorn
abstract zealot
velvet thorn
#

there is data science, and there is data engineering, but I have not heard of "data science engineering"

hollow sentinel
#

does he mean data engineering/ data science

#

the world may never know

void shale
#

does anyone know how to get matplotlib on python? I am taking a class on udemy and its a little outdated so there is no proper instruction. I couldn't find anything online. Does anyone know?

hollow sentinel
#

try that

#

if you're on mac OS it should be pip install matplotlib

void shale
#

THANK YOU SO MUCH!!!

hollow sentinel
#

yeah no problem

serene scaffold
arctic wedgeBOT
#

:ok_hand: Added data-science-engineering to the names list.

serene scaffold
#

now everyone will hear about it

modest void
#

anyone know how to use between_time or something equivalent to select rows in a pandas dataframe that are between a given start time and end time, but for a dataframe that has multiple days in it, so like all the rows that are between say 7am and 9am for a dataframe that has a datetime index with rows going across multiple days

velvet thorn
#

Google “datetime accessor”, I can’t type code right now

modest void
misty flint
#

but I have not heard of "data science engineering"
me neither

#

ive heard of "full stack data science"

#

which is like front + backend skills + DS + (some Ops skills maybe)

#

🦄

velvet thorn
#

or could you provide an example please

serene scaffold
misty flint
#

i dont either. if i find the listing again, ill show you

misty flint
#

they are listed as two separate skill set categories

#

as you can see

serene scaffold
#

Is nosql like guis with flowchart blocks for code?

misty flint
#

i think MongoDB and Cassandra are nosql

exotic maple
#

i still dont know what exactly is so attractive about mongodb

exotic maple
# misty flint

bro that profile would need to pay upwards of 150k in the US lmao

#

i know some remote workers in my country working as FS devs for us companies and they make 100k

#

REMOTE

exotic maple
#

latin america :p

#

no more details ay

misty flint
#

oh hey i know someone in the same situation

#

they have an advantage bc same time zone

#

unlike EU or Aus

exotic maple
#

I'd like to get a remote junior data scientist / analyst from my country, but that's too much dreaming i guess lol

misty flint
#

one of the companies i think im going to work for has a croatian branch

#

and im like

#

how does that work

#

with time zones and such

exotic maple
#

lmao i work with my colleagues in China, India, Russia, Bulgaria, etc

misty flint
#

ig its fine if its morning here + later afternoon there

exotic maple
#

trust me, you get used to it

misty flint
#

i see

exotic maple
#

a good scheduler will amke sure there's at least a bit of overlap

#

depending on business needs

misty flint
#

im sure they will more than likely put me in project teams that the members are more local

#

so we can sync better

#

anyway

exotic maple
#

depending on the nature of the job, haivng sometime dif can be good

#

I can at least restrain myself from shouthing at my india/china colleagues because they arent live :v

#

so i'lljust send an email and vent more "professionally" ay

misty flint
#

💀

#

but yeah remote jobs seem more popular moving forward

#

post-covid

exotic maple
#

unpopular opinion

#

mixed is much btter than just remote

marble dune
#

hi, i got a 'long' pandas dataframe, that has 3 columns: property, value and playlist, basically is a dataframe converted from wide to long format using pd.melt(), the problem comes when i try to plot a bar catplot with seaborn, and i pass a column name as the x values and when i show the plot the x values don't show up

#

this is the dataframe

#

and here are my code and how the plot currently looks

#
        #bar catplot
        bar_catplot = sns.catplot(
            kind="bar", x="property", y="value", hue="playlist", legend=True, data=long_frame2, dodge=True
        )
        bar_catplot_figure = bar_catplot.fig
        catplot_render = mpld3.fig_to_html(bar_catplot_figure)
misty flint
#

which is why im glad this company usually requires in-person

#

work

#

in the office

rough otter
#

can anyone help explain what p-value is

lapis sequoia
#

`#HOMEWORK
#Q.1.Write a function to find the factors of a number.

number = int(input("Enter a number:"))
factors=[]
for i in range(1,number+1):
if number%i == 0:
factors.append(i) #Append: It adds a single item to the list. It modifies the list by adding an item to -->
#------> the end of the list.
print("Factors of the {} = {}".format(number,factors))`

#

#Q.2.Write a function to identify whether a number is palindrome or not.

num = int(input("Enter a number:"))
temp = num
rev = 0
while (num>0):
dig = num%10
rev = rev*10+dig
num = num//10
if (temp == rev):
print("The number is a palindrome.")
else:
print("The number is not a palindrome.")

#

#Q.3.Write a function to identify whether a string is palindrome or not. ​ string = input("Enter a string:") if (string == string [::-1]): print("The string is a palindrome.") else: print("The string is not a palindrome.")

gray arch
#

Does anyone have issue with running Tensorflow in Python 3.9?

grave frost
rancid gazelle
#

Hey, did you know any active discord about tensorflow?

primal tulip
glacial sparrow
#

is dash plotly used with mongodb?

primal tulip
glacial sparrow
#

making a dashboard

primal tulip
#

Disregarding the type of data manager you use, you could always graph data with dash. Even more if it's tabular data.

glacial sparrow
#

in short, I can connect with pymongo, do manipulations (most important is json_normalize) and create some graphs I want
but I wanted to make an 'interactive' dashboard that updates via mongodb
not sure if it makes sense, but I guess there are 3 options?
mongodb charts - but I think I cannot manipulate the data as with pymongo
powerbi - which seems to be able to connect with mongodb and allows the needed manipulations but with M which I'm not really familiar with
dash plotly - which I guess I can re-use my previous code, but I can't find many results online how to keep getting data

#

do the above make sense?

primal tulip
#

Yeah, makes sense. I've never done anything with mongo other than simple queries and not particulary good with dash either, but you could either do it in PBI or Dash. If your data is a behemot sized monster, I'd suggest using Python with Dash, since you can setup a buffer. I've seen a lot of resources in Dash so it's doable, but you'll have to do some trial and error. If the data is medium (Less than 8gb for example) you could do it in PBI which is higher level and overall easier, also with tons of resources.

glacial sparrow
#

I guess then my real question is how the dashboard can be 'live'. But actually even 'updating' daily would be fine for me.

primal tulip
#

Oh, then in that case I would transform everything to a Pandas Dataframe and update it

#

PBI also has (something like a checkbox option) alternative were you can toggle updating the datasets and their relations.

#

I forgot the name, but it should be under 'ñManage Relationships'

glacial sparrow
#

ok i will check

primal tulip
#
tidal bronze
#

how can I visulize clusters if they are based on a single feature?

lapis sequoia
#

So i am trying to build a speech recognition model. I am not that skilled so i am using sk learn. Lets say i have some recordings of my voices in .wav format. What do i need to do to make them trainable data?

primal tulip
primal tulip
sharp prairie
#

Hi guys. I have a CSV file with lot's of empty strings. How do I drop or delete them with pandas?

So far, this is my code.

df = df.dropna(how='any', axis=0, thresh=2, inplace=True)

Running it gives me none. When I remove the inplace I also don't get the dropped rows.

lapis sequoia
#

What way would u suggest me to read the files?

primal tulip
lapis sequoia
#

Could i use that for sk learn as well?

primal tulip
#

You need your data in text format first so it could be fed to the sklearn library.

lavish tundra
#

can someone help me to think about one thing?

#

its about data visualization

primal tulip
#

Arroje su pregunta Señor Diego

lavish tundra
#

i dont speak spanish . _.

primal tulip
#

I said "Ask away, mister". Your name is pretty common in Latin América lol.

Seems like a correct assumption about the xticks, but not sure why it's happening. Give me a minute pls.

tidal bough
#

linspace(start,stop,number) always gives out start as the first point and stop as the last point.

#

(there's a parameter to change this behaviour)

frigid forum
#

str onject has no attribute decode

#

i keep getting this error

#

anyone knows wha to do

tidal bough
#

decode is a method of bytes that converts them into strs, the opposite is str.encode.

frigid forum
tidal bough
#

the decode method, yes

balmy junco
#

I want to use python to calculate the antiderivative of a function and store it as a function that i can use

#

how might i do that?

tidal bough
#

!docs scipy.integrate

arctic wedgeBOT
tidal bough
#

if you mean numerical integration. If you mean analytical, sympy.

balmy junco
#

and then it could return me the integral as a function

#

so i could just pass variables into it

tidal bough
#

you can just make the function call scipy.integrate.quad each time, from (say) 0 to the argument

#

that'd be time-inefficient, but will require no extra memory

#

alternatively, precalculate the integral's values for the entire interval you'll be working on and use values from it

balmy junco
#

sure

#

but there is no explicit way to sav eit as a function right

#

?

#

if so, i can just create my own function

#

and assign attributes i guess

tidal bough
#

yeah, something like that

balmy junco
#

thanks

#

so then

#

if i want to pass in a function f that takes in a value of kx instead of x, do i just multiply the integral range?
quad(f, 0, math.pi)

#

like quad(f, 0, k*math.pi)

#

i feel like there needs to be another way

tidal bough
#

not sure what you mean by this

tidal bronze
carmine iron
#

Does anyone know how to find the nth largest drawdown of a portfolio

lapis sequoia
#

anyone know a good api for facial landmarks?

#

or anything

#

to get coordinates of them

misty flint
#

the dlib library is a popular one

#

we used that in our face recognition project

#

gives you 68 x,y coordinate points

misty flint
lapis sequoia
#

well I have dlib

#

with

#

face recognition library

misty flint
#

there you go

lapis sequoia
#

but in some cases

#

like in my pfp

#

it doesnt detect

#

an eye

misty flint
#

rip

#

yeah its not trained to do it on those types of images

#

we actually proved that in our project

#

lol

#

best one i know, so gl bud

lapis sequoia
#

whats ur project

carmine iron
#

How can i return the nth largest drawdown for example
r = [.01,-.01,.004, -.02,.01] n = 2

grave frost
alpine fern
#

I'm not sure whether this should be in this channel, but if I'm looking at historical data in the form of candlesticks, how would I be able to find local mins/maxes ,using say a dataframe format, for my data?

lapis sequoia
lapis sequoia
#

fun

#

went from a normla picture

#

pasted flares on it

#

or paste whatever you want to on the eyes

grave frost
#

so you want just eyes or all facial features

lapis sequoia
#

yeah

#

I mean in the future maybe could do smth with the rest but eyes are like main thing

grave frost
#

just get their coordinates then 🤷 train a model for that - data wouldn't be too hard

#

or just google "get location of eyes from face in python" and youd probably get some indian tutorial using OpenCv

lapis sequoia
#

probably

#

tried looking into that already

#

or well

#

thats what im doing rn

#

just have to figure out how to get the end picture of opencv into a pil image or bytesIO

grave frost
#

researching things is a pretty important skill

lapis sequoia
#

I figured

grave frost
#

and with google scholar, its not as hard as it was before

dawn cargo
#

Hey guys, I've been trying to do a Gaussian blur of a RGB image.
I know how to blur a grayscale image (with 2d convolution kernel), but I'm having a problem with implementing the process for RGB image and 3d kernel. Should all layers of the kernel be the same or not?

tidal bough
#

Yup, all the same, unless you want the kernel to also mess with colors.

#

so it'd just be 3 gaussian kernels stacked on top of each other

dawn cargo
#

Thank you very much

#

Going to look for a bug in another place then

tidal bronze
#

how can I visulize clusters if they are based on a single feature?

lapis sequoia
#

Hey anyone knows how to apply groupby().agg() on index instead of columns?

ripe forge
#

You can always reset index to turn index into column

jade tinsel
#

Hi all! I've recently gotten into data science and I'm currently trying to do some research into Linear Regression. I'm able to train and make one prediction (the basics), but I'm not sure what keyword(s) I should be looking for when I want to use the trained model in order to predict with a given variable.

E.x. I have a dataset with country, text (nl: hallo wereld for example), I'd like to pass a variable to the model to predict what the given text is. What keywords would I have to look for and is linear regression even the way to go for such a thing? Sorry for the confusing question, I tried my best but still trying to get the hang of this thing 😄

tranquil loom
#

Hi, I want work on recommedation system ,but i can't find a source ,course exc. Can you recommend a source 😄

gray arch
# tranquil loom Hi, I want work on recommedation system ,but i can't find a source ,course exc....

I am self-learning via a book called Intelligent Projects using Python (Packt Publishing), they have a project called "Intelligent Recommender System". For the source code you can find it here: https://github.com/PacktPublishing/Intelligent-Projects-Using-Python/tree/master/Chapter06

But without the book it might be hard to understand how the source code is implemented so I still suggest you look more online

tranquil loom
#

thank you 👍 👍 👍

polar dock
#

Hi data scientists, which disk based storage formats do y'all use most often for dataframes?
Use cases are for long term storage, as well fast read/ write capabilities.

I'm a dev on an analytics team. Currently, we are using pickle almost exclusively.
Been exploring parquet, and it's different engines but was hoping someone had some experience 🙂

misty flint
#

spark dataframes is good if youre looking for more production stuff

polar dock
#

yeah, I know. We're just a small analytics team in a big company. Our oracle servers are all hosted internally.

Though, I guess the question I should bring up with the analysts is why do they want disk based storage

#

I wasn't really told a specific "prove parquet > pickle" or something, mostly just to explore the options

misty flint
#

oh wait spark doesnt used disk-based storage unless it has to

#

mapreduce does

#

this makes it literally 100x faster (spark)

#

that would be a good question to ask

sterile kernel
#

xd

serene scaffold
grave frost
#

Making code reproducible sucks AF

thorn bobcat
#

yo

misty flint
#

docker

grave frost
#

I can't use docker with colab

#
  • I don't know how to
thorn bobcat
#

is there a repo that can change my voice to another persons voice?

#

still wondering if I should work on it

grave frost
#

Tacotron is for TTS

grave frost
#

but you may not be necessarily be able to use tacotron

thorn bobcat
grave frost
#

its basically TTS - but the problem is the data.

thorn bobcat
#

I thought of using speech recognition to convert my voice into text and then converting that text via tacotron 2

grave frost
#

Two Minute papers had a method that can replicate exact voice using 2 minutes of train data

#

but I would have to hunt for it tho

grave frost
#

wdym?

thorn bobcat
#

there was one that did it in 5

#

just watched it

grave frost
#

must be on the cutting-edge - no way you are deploying that unless yove done your masters

#

or the contributors are active

misty flint
grave frost
thorn bobcat
misty flint
#

only issue with colab is the other person has to have access to the dataset too

thorn bobcat
#

this was the one featured in the 2 minutes paper, python code mostly but not tensor flow now.

grave frost
misty flint
#

lol youre probs training too much for colab

thorn bobcat
#

I was wondering if there was something newer in the field of speech synthesis

misty flint
#

you can even run a jupyter notebook on it

#

for bigger models

grave frost
misty flint
#

you get free credits if its your first time

#

more than enough to play around with

grave frost
#

done it; blew it

misty flint
#

you wasted it already?

#

on what?

grave frost
#

yea

misty flint
grave frost
#

t R a I N i N g

#

wanted to do some some classification

misty flint
#

you blew $300 only on training?

#

i-

grave frost
#

yeet

#

im a broke boi

#

tho there is a workaround to use unlimited GCP 😏

#

Which I am currently using

fickle surge
#

if anyone responds please dont hesitate to ping or dm me!

grave frost
# fickle surge Hey! im 13 and I am currently learning python via codecademy because I want to g...

Those seem great starters. ML/AI is pretty complicated - especially the mathematics involved. if you do not understand something, you can get an intuitive knowledge of it from youtube. it would still take you some years to understand, but I promise it would be a pretty fun journey 🙂

As you learn more maths in school, things would make more sense. but don't stress if you don't understand anything. we are always here to help!! 🤗

fickle surge
#

alright! thanks.

grave frost
#

I would recommend you make Youtube your primary source of knowledge. visualizing things is very easy to understand

fickle surge
#

ok

median agate
#

@fickle surge I'm not a pro but I would start with codecademy (SoloLearn has a course on Machine Learning which is preety good, they're both preety similar though). Go from there to codecademy/udemy, there are a bunch of more advanced courses. Also on YouTube there's a lecture series on ML by Steven Brunton (sp) but it's quite theoretical/mathsy. I wouldn't worry about the maths/theoretical aspect until you've built some practical/fun projects and you still like it Also Unity ML Agents is a great practical intro, no need to know any of the inner workings.

grave frost
fickle surge
#

alright, im already in a python course on codecademy so i think im going to finish that to learn the basics

stiff barn
#

Parquet works really well with services like Azure Data Bricks

#

We generally just use csv or json though or read directly from a DB

gray arch
#

What book/course/websites that you all recommend to build a first project on Google Cloud? I literally never use it before and I just mess around with it today I don't have a clue where I should start...

misty flint
#

they have what are called Quests on gcp

#

try to complete those. i think thats a good place to start

stiff barn
#

There are a lot of services on GCP. It’ll be easier if you have a project in mind so you can narrow it down to a few core services

gray arch
#

@misty flint @stiff barn thanks, just signed up for the quests, wish to learn more haha
Feel lagging so behind in the industry 😦

stiff barn
#

If it helps, I’d say the core services to start with would be cloud storage and cloud functions. From there pub/sub and a database like Firestore or BigQuery

#

You can do quite a lot with that combination

gray arch
#

@stiff barn thank you so much! I will do my best to be better in that
Hope you don't mind if I pm you in the future if I have any question

stiff barn
#

Go for it @gray arch

fickle surge
#

@grave frost my end goal is to make an assistant that i plan on modeling off of jarvis from iron man. I want to hook it up to smart home stuff and have it write emails to name a few things. how hard is it to do something like that?

stiff barn
fickle surge
#

What are some things to do with machine learning?

#

Or I least I want to make something that I can Interact with that. I could probably have it comunicate with Phillips hue api easily so when

stiff barn
#
#

You can probably use that to build a mvp more simply

fickle surge
#

Alright

#

Thanks

#

@stiff barn just want to point out, I’m trying to learn machine learning not just make something that serves that purpose

#

Trying to think of a project to do

bronze wolf
#

Large data sorting?

#

Have any large datasets you want to teach a computer to organize for you?

stiff barn
fickle surge
#

Like....

stiff barn
#

Go on kaggle.com and try the beginner projects like the titanic one

fickle surge
#

Ok

stiff barn
#

I’d also pick up a book on the subject or sign up for an online course

hollow sentinel
#

Uh

#

Jarvis is a pretty lofty goal

#

I don’t really understand why that’s so many people’s goal when they first learn DS/ML/AI

#

it took quite a while to build Siri and Alexa

#

I’m not saying it’s impossible for one person to build something similar on their own but it’s definitely very difficult

stiff barn
#

It’s probably just the first use case that comes to people’s mind

misty flint
#

Data engineering specific interviews increased by 40% in the past year. The second fastest position growth within data science roles went to business and data analysts which increased by 20%.

#

Data engineering is the new data science

stiff barn
#

I was looking at that this morning @misty flint haha

stiff barn
misty flint
#

for now yeah

#

maybe just keep an eye on the waters for now

#

they say once companies establish their data infrastructure, there will still be some data eng jobs just less afterwards

misty flint
#

I have created multiple videos about data engineering, including a data engineering course for beginners. Why would I advise anyone against pursuing a career in data engineering? I like being as transparent as possible - while this job will be great for many people, it might be disappointing for the others. In this video I'm outlining three reas...

▶ Play video
fickle surge
stiff barn
#

There will always be more data engineers that scientists.

misty flint
#

im just glad i signed up for this graduate level databases class next semester

#

its like super full and the waitlist is super long

hollow sentinel
#

even a “barebones version” of Jarvis is going to take a while

misty flint
#

its not even part of my degree plan

#

but i thought it would be interesting

stiff barn
misty flint
#

this is true

stiff barn
#

The correct project in working on to bring a company into the cloud won’t be done until 2022 at the earliest

fickle surge
#

This kinda inspired me... seems like a good starting point

stiff barn
#

But yeah, getting data engineering skills even if the goal is to be a data scientist will only help

misty flint
#

the YTber was just saying as stuff like DataBricks, Azure Data Factory, and Denodo standardizes and virtualizes data, there will be less tasks

hollow sentinel
#

Yeah i got that

#

but idk how much experience you have

misty flint
#

idk if thats true, thats just what she said

hollow sentinel
#

V what that guy does

stiff barn
#

It takes away the annoying stuff. Like setting up and maintaining a Hadoop cluster

fickle surge
#

Are you talking to me?@hollow sentinel

stiff barn
#

Who wants to do that really

hollow sentinel
#

@fickle surge yeah

fickle surge
#

If so I’m learning and that’s kinda my goal once I get everything down

hollow sentinel
#

idk how fast you learn but it’s quite a bit of stuff

#

but you’ll make progress

#

If you do it consistently

stiff barn
#

You’ll need to have a solid understanding of software engineering as well to build something of that scope @fickle surge. That won’t just be an ML model, it’ll be a system of things that all need to be developed and interact with each other

fickle surge
#

Alright

stiff barn
#

I’d save that as an aspirational goal

#

Work your way up to that

hollow sentinel
#

yeah that’s what I was trying to say

exotic maple
#

This notion that because something "big" is coming from "big tech" means you shouldnt learn a skill is crap that should disspear

lost ridge
#

Hi all anyone here ever made a trading algorithm ?

exotic maple
#

we have steel mills and automated carpentry nowadays, but carpenters and blacksmiths still exist (as niche, true) careers, and are also well paid

serene scaffold
#

oh fuck that's where we are

exotic maple
serene scaffold
#

thought we were in algos and data structs

lost ridge
#

Lol

exotic maple
#

Stelercus.exe has stopped working

serene scaffold
#

pkill -u stelercus

exotic maple
#

-> googles: "HOW TO KILL A CHILD"

serene scaffold
#

what
no

exotic maple
#

-> corrects: "HOW TO KILL A CHILD PROCESS I'M SORRY#

pearl vault
#

Sry for disturbing
amd still have driver and software issues?? should i buy an amd or intel laptop?

exotic maple
#

I have to say, since i've gotten used to pandas...I kind of dread touching excel lol

exotic maple
#

ofc it's gil

#

mongrel

misty flint
#

my 2nd favorite archer

exotic maple
#

2nd?

misty flint
exotic maple
#

pathetic mongrel I AM THE KING

#

Enuma Elish

misty flint
#

anyway

#

@exotic maple idk if you saw the charts earlier but maybe data eng remote job?

#

growing more popular

exotic maple
#

I only know python, not enough backend to do engineering

#

well, python and MYSQL

#

if sql is considered a programming lnaguage

misty flint
#

Currently most data engineering roles require only three main types of skillsets: SQL, Python, and algorithms.

#

oh theres also this but less common

#

We're seeing a rise though in data engineers needing to understand system design and architecture problems as well.

exotic maple
#

omg id love statistics and A/B testing

#

shit's easy AF once you get the hang of it

#

and its easy to show face with it lmao

exotic maple
#

SQL I kiiind of know, python id say "intermediate" and algos...i've never taken a formal class but feedback from CS friends tell me i have the logic down

#

eh, who knows, i might just try it out

misty flint
#

do it dude

stiff barn
exotic maple
#

screw me. all this time i have been shy to apply when I have at least the basic skills for it?!

stiff barn
#

Lol yeah I’d give it a shot

misty flint
#

tbf you didnt even know

fickle sinew
#

you might want to learn Scala too

hollow sentinel
#

Yes

waxen girder
#

What are good resources to learn SQL but dive deep in things like efficient querying and such.

#

Beyond the basic here's how you do X.

fickle sinew
#

spark is written in scala, and it's going to gain traction as data pipelines start leveraging spark more and SQL less...

fickle sinew
#

tuning SQL is such a weird art though. It's a declarative language, so its not like you can easily tell the query optimizer "do it like this"

exotic maple
#

isnt SQL pretty much super optimized by definitiion? I mean, the DB structure and optimization has to be done by a full DBA, not an user of the DBSs

fickle sinew
#

I like to say "SQL gives you the benefit of a bunch of really smart people that already figured out how to do most of the simple things"

#

like simple joins, you dont have to decide what the best way to join tables is. as long as you have good choices of indexes and keys, the database will usually do the joins in a very efficient way

waxen girder
#

As of right now, I created my own db with postgresql in ubuntu running on wsl2.

#

As of right now I haven't figured out how to connect a SQL gui instance to the DB but honestly I kinda want to just use something like psycopg2 then move on to SQLAlchmey.

fickle sinew
#

use psql if you want a handy terminal client for postgres

waxen girder
#

Yeah I do use that.

fickle sinew
#

python libraries are good but they don't do the admin stuff too well. they kind of assume the database is already built.

waxen girder
#

I haven't set up my DB for production so to speak. But as an aspiring analyst I hope I won't have to.

#

Some of the user accounts have their passwords stored as plain text. I don't think I intalled the DB the most secure way according to the docs but I'm just using it to learn.

fickle sinew
#

you can run postgres in docker too, that might make some aspects easier (or it might make it worse)

waxen girder
#

Apparently you're supposed to install it in its own user w/ the least possible privileges of any user and not have any other software installed on that user.

fickle sinew
#

are you running in linux?

waxen girder
#

Yeah

exotic maple
exotic maple
fickle sinew
#

ive had good luck just using the vanilla packages installed using apt or whatever package manager, very little manual setup

waxen girder
#

Yeah but for production you want to be careful.

fickle sinew
misty flint
#

i banged my head for a day trying to dockerize our team project

#

but eventually i got there

#

all your dependencies are belong to me blobhyperthink

fickle sinew
misty flint
#

i think i used buster

#

our project was just really finicky

#

had a flask component to it too

#

also i never had used docker beforehand

#

so there was that

fickle sinew
#

buster was a wise choice... but if it was built on flask, i have to ask... what web server did you use

silver widget
#

Hi guys.
I need some help about a data analysis project i am working on. I'm working on a bank customer data with transactions and salary info. These information is available for 3 months, and I need to calculate the annual salary of the each customer.
new_df = df.groupby(['account','month'])[['amount']].sum()
I grouped each customer's salaries. however I cannot use each months data as columns. Is there a way to create columns such as 'august', 'september', and 'october' and append the new_df['amount'] values to these columns?
Thanks in advance

pure quiver
#

Shouldn't you just groupby month and sum without the account names?

#

Is it because you need to append the average salary as a new column in your original data frame?

silver widget
#

The data based on the transaction movements of the customers. for instance, one customer has more than one row in the data.

pure quiver
#

So you need each customer's average salary by month, I see

silver widget
#

This is what i get from the code above

pure quiver
#

Okay so you have a multilevel index because of this. I'd approach it differently, create a new data frame with all unique account numbers only as a column, then write a series of groupbys on account where month =8,9, 10 etc and append each series to the new data frame

#

You can write a simple function to speed this up and pass a list of months to it

silver widget
#

Oh, that's great. Thank you very much. I'll try that immediately.

pure quiver
#

Or rather, when you groupby, you get accounts and their sum salaries, then join on the account numbers

#

Something like this (sorry I'm on phone)

silver widget
#

Thanks Dyllyn. I appreciate it.

pure quiver
#

Ugh I swear code is impossible to write on phone

#

Well lemme get back to my com, but lmk if you get it

silver widget
#

No no pls don't write it 🙂 I'm trying to learn it.I appreciated your help, that I was trying to say

pure quiver
#

Sorry for the picture of the screen, Im on a closed system

silver widget
#

Thank you very much.

pure quiver
#

Ah, I made a mistake, at salary the account and month will still be on the index I think. That's for you to fix :)

#

I don't use groupby that much

past arch
#

Hello, In NLP text summarisation, is there any way to programmatically differentiate between extractive and abstractive summarisation?

thorn bobcat
#

anyone worked with end to end speech synthesis before?

lavish tundra
#

Someone who really understand very well about Data-visualization can help me on the #🤡help-banana pls? i'm stuck on this problem for a while...

covert seal
thorn bobcat
#

anyone used Flowtron, FastSpeech2, WaveRNN, Tacotron2 or Real-Time Voice Cloning before? I'm thinking of starting out on one of them and was hoping to find something simple to begin working on.

grave frost
thorn bobcat
grave frost
#

That's about as simple as your task gets.

thorn bobcat
grave frost
#

voice cloning isn't something easy like visualization or regression

thorn bobcat
#

Don't I have to tweak it and stuff?

grave frost
thorn bobcat
grave frost
thorn bobcat
grave frost
#

well they provide pre-trained models in those repos

thorn bobcat
#

also for some reason there's no tutorials in youtube regarding this

#

in python atleast

#

most of it is in jupyter notebooks.

grave frost
#

what? no one is going to spoon feed something so complex. you would have to research and understand things on your own

#

There is no shortcut that would work well for you. they might give decent results, but not very convincing/realistic

thorn bobcat
#

can't you understand deep learning practically?

#

through working on projects.

grave frost
#

ofc you can - but it would take a lot more projects. learning with projects is great and I consider it the best way to learn; but to learn something, you have to understand some theory too, not just copy the code by some guy on youtube

thorn bobcat
hollow sentinel
#

hol up

#

how are your machine learning basics

grave frost
thorn bobcat
#

for example I want to be able to clone Morgan freeman's voice to generate speech from text, that sounds like him, it's been done before but it's just an example.

hollow sentinel
#

before you do all this voice cloning shit

#

how are your basics

grave frost
hollow sentinel
#

bc voice cloning is pretty ambitious if you're just a beginner

thorn bobcat
hollow sentinel
#

define sorta informed

#

like

#

do you know the math behind the field?

#

the math is what you're going to need if you want to finetune parameters

thorn bobcat
#

by informed I mean I know about LSTM, RNN, CNN, GAN, stylegan

#

and some of the underlying logic..

grave frost
#

thats a start

#

but you have to do some research, learn a few more things etc.

thorn bobcat
thorn bobcat
thorn bobcat
grave frost
#

depends on how old you are

thorn bobcat
grave frost
thorn bobcat
grave frost
#

well, then just learn it the proper way! take the AI course

#

attend the stats and math lectures

#

college is the easiest time to learn IMO

hollow sentinel
#
#

there is this book here

#

this will show you what you need to know for the math in ML

grave frost
thorn bobcat
#

what would be a challenging yet rewarding task to undertake as a start in ML journey?

#

thing is I wanna also develop my python skills which is why i wanna do something..

grave frost
thorn bobcat
#

we do have simulation and modeling and statistics this sem

grave frost
thorn bobcat
grave frost
#

ehh, you are in college. just see what books there are and read em up. ask what you don't understand - there are many highly expereinced people here that can answer almost all of your queries

thorn bobcat
hollow sentinel
#

voice cloning will only get easier once you know the core concepts

#

otherwise you're just grasping at straws

grave frost
#

if you aren't enjoying that, you can see some of the technical articles for cloning and they would teach you maths too (albeit with less explanations since there is a lot to cover)

hollow sentinel
#

which makes knowing the math even more important

grave frost
#

but you would find yourself frequently finding topics to learn and making a list of it

#

and of course, we are always here

thorn bobcat
#

thanks for all the advice, guess I'll start up with some of the math, core concepts and theory before applying things practically

#

I'll try to break it up into bite sized chunks tho so I don't get bored, cause I actually like working on projects to solidify what I learnt.

lapis sequoia
#

@grave frost what was the jedi alternative you were recommending for jupyter?

obtuse marlin
#

Pylint (Microsoft Server)

#

I doubt

lapis sequoia
#

Is it possible to draw a 3d shape on an image using matplotlib3d?
I know we can do it for 2d, but I am trying to do it for 3d but can't figure it out
I am looking for something like this: https://stackoverflow.com/a/15592168

plucky harness
#

Is there machine learning app possible with python?

odd lion
shut valve
#

Honestly I wouldn’t bother trying to learn the pure math on your own bc like unless you enjoy that you prob won’t finish it you can do a lot of machine learning and deep learning withOUT in depth math

thorn bobcat
shut valve
#

Bc that’s not what I want I want a certain voice

#

Like there a lot of real time voice cloning libs on git that are not bad (not great but amazing for the small amount of data given)

thorn bobcat
#

how much experience would i need working with them?

#

also tactotron 2

#

WaveRNN and Glow,

shut valve
#

Yeah some of them are real click record and run it depends what your trying to do with it

thorn bobcat
#

TTS tranformers too

#

I want things I can integrate into bigger projects tbh.

#

I don't want to re invent the wheel but I'd like to use it build a car, if you catch my drift..

shut valve
#

Well yeah then that’s just like taking what you want from it how modular it is to pick up and move differs from project to project

#

Like most probably allow you to re train a pre trained model and then use that model in your project now that’s an awesome skill to have

#

But that’s a little more advanced but you don’t need to understand linear to do that

thorn bobcat
#

retrain, tweak and perhaps understand what's going on.

dusk kite
#

Hey guys, I am a Data Scientist looking to grow my skills as a Machine Learning Engineer. Does anyone have recommendations for learning resources?

shut valve
thorn bobcat
#

I do know some of the math involved and I actually love math sometimes.

#

I like working on projects more than reading tho idk why.

dusk kite
#

Looking more into MLops and architecture information / resources

shut valve
#

i say computer vision because it takes about data augmentation and training pre trained models. now thats a kinda big stack might take you a couple weeks but it would be really good and quick to getting started with ml. Then you can see if you really wanna stay in AI/ML

thorn bobcat
#

@shut valve Thanks alot, I've added it to the resource list. I'll be sure to check it out.

#

Goodluck with your work

#

and thanks for all the helpful answers.

shut valve
#

enjoy

#

if you struggle it means its working

thorn bobcat
#

seems like it will help alot.

lapis sequoia
#

@dusk kite read pattern classification by stork, duda and hart

lapis sequoia
#

let me check it out

#

thanks man

gray arch
#

This topic seems to be less heated today than usual lmao
When I check every hour it always has 50+ messages

grave frost
#

everyone's lazy and bored

#

they need a controversial topic to be stimulated

humble widget
#

Today I came across a job description for a junior data science position that requires "Hands-on experience with ML tools such as TensorFlow, Keras, PyTorch; Experience with Data Mining and Data Analysis technologies and language, including Python, pandas, Jupyter Notebook, Matplotlib, NumPy." Would you say that this is kind of a standard requirement overall?

rancid orbit
#

hi

grave frost
humble widget
grave frost
#

I don't see how jupyter notebook is a skill

#

its just good for rapid experimentation and visualization

gray arch
#

About PyTorch, it seems legit, but I have only used it in school research not in real industry so far

humble widget
#

It is a position related to climate change modelling at a financial advisor, so I guess that the visualization might be related to GHG emissions.

gray arch
#

I see, but it's not too difficult to practice Jupyter Notebook anyway, Google Colab is one of the ways to go haha

short heart
#

Is it ok if my training dataset has some generic floats and some of them are numpy.float s

grave frost
misty flint
#

im chillaxing today

west bolt
#

How do I get Jupyter to display a sympy Matrix?

#

Currently I have

init_printing()```
In cell 1 and
```A = Matrix([[1, 2, 3], [4, 5, 6], [7,8,9]])
A```
In cell 2
balmy junco
#

Is there a function in Python to check whether or not a set is a basis?

#

Or do I just need to do it myself lol?

tidal bough
#

oh, that's a nice question

tidal bough
#

(but no, don't think there's a function for that in numpy/scipy)

#

if you have more than n vectors and you want to check whether that set contains a basis, then I actually don't know, hmm. Is there a simple way to check for that?..

exotic maple
#

Idk how but I got lip-won the HR VP and now I have to develop "an ML model to improve our recruiting"

hollow sentinel
#

So you got the job?

#

Or internship

#

idk what lip won means I’m inferring

exotic maple
#

Its not a job

hollow sentinel
#

oh

exotic maple
#

Im trying to transition away fron my mid position into slmethint about data or abalytics. There is no such thing where I work

#

So i thought about convincing the CEO, but thars too far up for me

#

So... HR VP lol.

hollow sentinel
#

Oh

exotic maple
#

Basically, sold her the idea or ML / Data department

hollow sentinel
#

sorry I completely misunderstood

exotic maple
#

So if i get it right

#

I can get her to tell the CEO and crearw the department

#

Thats my plan at least

hollow sentinel
#

that sounds very good

exotic maple
#

Yeah but i need to get the ball rolling on my own now lol

hollow sentinel
#

Yeah idk about ML helping recruitment

exotic maple
#

I have a clue of something that can help. Not recruitment itself but after it. Reducing attrition, churn, and other negative metrics

#

A mix of classification and regression might help there.
Predicting churn. Probabilities or attritition based on profiles, etc etc

hollow sentinel
#

it isn’t impossible 🙂

grave frost
exotic maple
#

NER?

grave frost
#

Named Entity Recognition

#

Named-entity recognition is a subtask of information extraction that seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc. Wikipedia

exotic maple
#

-shivers- omg not NLP pl00x

#

I suck at NLP atm haha
Didnt learn properly.

#

Besides, is there good support for NLP.in Spanish?

grave frost
#

still, you can get by with some basics

exotic maple
#

Everythign ive seen is in Spanish

grave frost
exotic maple
exotic maple
#

English wordnets and stuff are useless to me

#

For that

grave frost
#

there are plenty of spanish pre-trained models

#

spanbert

#

gpt2 spanish

#

I myself am currently working with Low resource languages, so your task seems a piece of cake

exotic maple
#

I wonder if its ok to use pretrained models... legally speaking and all that

exotic maple
#

I suppose open sourve it shoulsnt matter

grave frost