solemn hull Aug 20, 2020, 10:51 AM

#

player_df.loc[player_df.SEASON_ID == '2019-20',:]```

#

@dire pollen l

muted oyster Aug 20, 2020, 10:51 AM

#

yes this should work

dire pollen Aug 20, 2020, 10:52 AM

#

`from nba_api.stats.endpoints import playercareerstats

Anthony Davis

career = playercareerstats.PlayerCareerStats(player_id='203076')
player_df = career.get_data_fames()[0]
player_df.loc[player_df.SEASON_ID == '2019-20',:]`

#

like this?

solemn hull Aug 20, 2020, 10:52 AM

#

looks right, oh wait i typoed

#

get_data_fames, should be get_data_frames

dire pollen Aug 20, 2020, 10:52 AM

#

wow brilliant, it work!!

muted oyster Aug 20, 2020, 10:52 AM

#

nice

dire pollen Aug 20, 2020, 10:53 AM

#

yeah i saw the error haha

solemn hull Aug 20, 2020, 10:53 AM

#

hurray PartyGlasses

muted oyster Aug 20, 2020, 10:53 AM

#

i was working on similar thing when i saw your question

dire pollen Aug 20, 2020, 10:53 AM

#

I think this was the easy part how can I get this row and other rows and join them?

solemn hull Aug 20, 2020, 10:54 AM

#

so try to take that last line

this_year = player_df.loc[player_df.SEASON_ID == '2019-20',:]
type(this_year)

#

im guessing its also dataframe but what does that say

tidal bough Aug 20, 2020, 10:55 AM

#

ML questions get posted in this channel once in a while, so here's mine:
When creating AIs for playing board games, it's common to take advantage of symmetry to reduce the number of possible states. How is that actually done in practice? I had to take advantage of symmetry in one case before(not ML, just a metaheuristic optimization task), but I achieved rather meager results. Is there some sort of hashing algorithm for a 2d array of values that is invariant under rotations/reflections?

muted oyster Aug 20, 2020, 10:56 AM

#

@solemn hull what if i want to get 3 months of 3rd quarter using similar code ?

dire pollen Aug 20, 2020, 10:56 AM

#

so try to take that last line

this_year = player_df.loc[player_df.SEASON_ID == '2019-20',:]
type(this_year)

@solemn hull Im not sure I quite get it what you are trying to say

solemn hull Aug 20, 2020, 10:57 AM

#

so you can build a function to parse each specific player/dataframe, then iterate through the players or days etc

dire pollen Aug 20, 2020, 10:58 AM

#

🤔 I think I need to learn more python, I try to understand

muted oyster Aug 20, 2020, 10:58 AM

#

DF1 = DF.loc[DF['Month'] == '07',:]
DF1

i also want 08 and 09

solemn hull Aug 20, 2020, 11:00 AM

#

from nba_api.stats.endpoints import playercareerstats
# Anthony Davis
def get_player_current_year(player_id):
  career = playercareerstats.PlayerCareerStats(player_id=player_id)
  player_df = career.get_data_fames()[0]
  return player_df.loc[player_df.SEASON_ID == '2019-20',:]

player_results = []
for player_id in ['203076', ...]:
  player_results.append(get_player_current_year(player_id)]
print(player_results )```

dire pollen Aug 20, 2020, 11:01 AM

#

So I can put different IDs at the same time and I would get the row I want?

solemn hull Aug 20, 2020, 11:01 AM

#

so i think pandas has specific syntax for multiple conditionals.. no idea if this will work but

DF1 = DF.loc[DF['Month'] in ['07', '08', '09'],:]```

#

it will call the api for each player, get the year 2019-20 then build up a list

#

and at the end print the list.. there is probably a better way to do it though, im a pandas newb

#

and yeah carly, it will get only that row for each player

tidal bough Aug 20, 2020, 11:03 AM

#

Hmm. Maybe DF.loc["07"<=DF['Month']<="09",:]? Not quite the same, mind.

muted oyster Aug 20, 2020, 11:04 AM

#

@solemn hull oh sorry, my doubt was a separate thing

solemn hull Aug 20, 2020, 11:04 AM

#

i think they are strings so comparison wont compare the digits

muted oyster Aug 20, 2020, 11:04 AM

#

not related to Carly's

solemn hull Aug 20, 2020, 11:04 AM

#

no worries

muted oyster Aug 20, 2020, 11:05 AM

#

im asking in general.. if i want to get 3 values out of rows 07, 08, 09 are for 3 months of 3rd quarter

#

DF1 = DF.loc[DF['Month'] == '07',:]
DF1

if i give this it will onl return for month of july

solemn hull Aug 20, 2020, 11:06 AM

#

did you try the above DF['Month'] in ['07', '08', '09']

muted oyster Aug 20, 2020, 11:07 AM

#

yes is an error

solemn hull Aug 20, 2020, 11:07 AM

#

ah xD

tidal bough Aug 20, 2020, 11:07 AM

#

what you definitely can do is

DF1 = DF.loc[(DF['Month'] == '07') | (DF['Month'] == '08') | (DF['Month'] == '09'),:]

#

shame the other way doesn't work, though

#

there's probably a way to make it work.

solemn hull Aug 20, 2020, 11:08 AM

#

freakin pandas, eating up all the bamboo and making strange syntaxes 🐼

muted oyster Aug 20, 2020, 11:08 AM

#

oh yes it worked. what does | did here ? @tidal bough

solemn hull Aug 20, 2020, 11:09 AM

#

| is or

muted oyster Aug 20, 2020, 11:09 AM

#

so we cant simply pass or ?

solemn hull Aug 20, 2020, 11:09 AM

#

try and see

muted oyster Aug 20, 2020, 11:09 AM

#

ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().

solemn hull Aug 20, 2020, 11:10 AM

#

its some pandas specific syntax

#

i guess

#

different rules than normal python

muted oyster Aug 20, 2020, 11:10 AM

#

I see, thanks 🙂 I saw that guy's question and was similar to what I was doing otherwise im a noob myself lol

solemn hull Aug 20, 2020, 11:11 AM

#

heh, i think im gonna go learn the basics

dire pollen Aug 20, 2020, 11:12 AM

#

from nba_api.stats.endpoints import playercareerstats
# Anthony Davis
def get_player_current_year(player_id):
  career = playercareerstats.PlayerCareerStats(player_id=player_id)
  player_df = career.get_data_fames()[0]
  return player_df.loc[player_df.SEASON_ID == '2019-20',:]

player_results = []
for player_id in ['203076', ...]:
  player_results.append(get_player_current_year(player_id)]
print(player_results )```

@solemn hull It worked but the data got a weird look, is there a way to get the data and make it look 'pretty'?

#

📎 stats.png

solemn hull Aug 20, 2020, 11:13 AM

#

for that, i think you need to use the pandas method of joining data rows.. someone said merge before.

dire pollen Aug 20, 2020, 11:14 AM

#

Yeah, I will take a look but you definitely helped a lot!

solemn hull Aug 20, 2020, 11:14 AM

#

or instead of print, you could do

for result in player_results:
    print(result)

#

that way its not printing a list but each specific item individually...

#

awsum glad u got it working

muted oyster Aug 20, 2020, 11:15 AM

#

u can convert it to dataframe if u want tabular form

#

pd.DataFrame

#

Another question, can we plot trend graph for 2 separate values in same graph ?

dire pollen Aug 20, 2020, 11:17 AM

#

I want to give format to the data to display it nicer ultimately

tidal bough Aug 20, 2020, 11:17 AM

#

@muted oyster | is bitwise OR, which for Series is overloaded to act elementwise.

#

or isn't really the same thing

dire pollen Aug 20, 2020, 11:18 AM

#

Something like this but in a noob way, since thats from the official page

📎 nbaaa.png

muted oyster Aug 20, 2020, 11:19 AM

#

@tidal bough is it only for pandas or used in other libraries too ?

solemn hull Aug 20, 2020, 11:20 AM

#

try

import pandas
from nba_api.stats.endpoints import playercareerstats
# Anthony Davis
def get_player_current_year(player_id):
  career = playercareerstats.PlayerCareerStats(player_id=player_id)
  player_df = career.get_data_fames()[0]
  return player_df.loc[player_df.SEASON_ID == '2019-20',:]

player_results = pandas.DataFrame()
for player_id in ['203076', ...]:
  player_results.append(get_player_current_year(player_id))
#if youre using jupyter you can call display()
display(player_results)```

tidal bough Aug 20, 2020, 11:21 AM

#

@muted oyster Well, having | work elementwise on Series is just a Pandas thing.

#

the operator itself is of course used often when working with bits.

muted oyster Aug 20, 2020, 11:22 AM

#

ok I understood. thx : -)

dire pollen Aug 20, 2020, 11:22 AM

#

@solemn hull I got no result from that

solemn hull Aug 20, 2020, 11:22 AM

#

😱

dire pollen Aug 20, 2020, 11:22 AM

#

Im not sure which kind of result I would get

#

Not even an error

solemn hull Aug 20, 2020, 11:28 AM

#

lol, dang.. ok, i guess go back to list []

import pandas
pandas.set_option('display.max_rows', None)
pandas.set_option('display.max_columns', None)
pandas.set_option('display.width', None)
pandas.set_option('display.max_colwidth', -1)

from nba_api.stats.endpoints import playercareerstats
# Anthony Davis
def get_player_current_year(player_id):
  career = playercareerstats.PlayerCareerStats(player_id=player_id)
  player_df = career.get_data_fames()[0]
  return player_df.loc[player_df.SEASON_ID == '2019-20',:]

player_results = []
for player_id in ['203076', ...]:
  player_results.append(get_player_current_year(player_id))
for result in player_results:
    print(result)```
@dire pollen

#

thats supposed to remove the abbreviating '...' stuff

#

https://thispointer.com/python-pandas-how-to-display-full-dataframe-i-e-print-all-rows-columns-without-truncation/ anywho, going to sleep gn

thispointer.com

Varun

Python Pandas : How to display full Dataframe i.e. print all rows &...

dire pollen Aug 20, 2020, 11:30 AM

#

Oh I see, well anyways thank you for your help I will try to take a look about the other stuff!

solemn hull Aug 20, 2020, 11:32 AM

#

np dogeblanky2

ripe forge Aug 20, 2020, 11:39 AM

#

Terminology question : I came across the term "interval" for a column data type. (for context, this terminology is used in sas documentation). Does interval data refer to continuous data?

lapis sequoia Aug 20, 2020, 11:41 AM

#

Could be referring to timestamp data

#

usually interval refers to the interval between two given dates

#

or whatever time periods are required

desert oar Aug 20, 2020, 12:21 PM

#

pandas has a "time period" data type

muted oyster Aug 20, 2020, 12:25 PM

#

I have a dataframe like this which i want to convert into:

📎 unknown.png

#

this

📎 unknown.png

#

like a state wise counts of closed and open and its total at the end

tidal bough Aug 20, 2020, 12:28 PM

#

ML questions get posted in this channel once in a while, so here's mine:
When creating AIs for playing board games, it's common to take advantage of symmetry to reduce the number of possible states. How is that actually done in practice? I had to take advantage of symmetry in one case before(not ML, just a metaheuristic optimization task), but I achieved rather meager results. Is there some sort of hashing algorithm for a 2d array of values that is invariant under rotations/reflections?

velvet thorn Aug 20, 2020, 1:44 PM

#

I have a dataframe like this which i want to convert into:
@muted oyster groupby count unstack

muted oyster Aug 20, 2020, 1:45 PM

#

@velvet thorn can u look over #help-apple

#

DF2.groupby(['State', 'Final_Status' == 'Open' | 'Final_Status' == 'Closed']).size().unstack(fill_value=0)

#

do u mean like this ? but its giving error

#

I tried something like this:

📎 unknown.png

#

but is giving total closed and open values for all states and not individually

#

ok I figured out to get closed and open in rows and sort of this code worked:

DF3 = DF2.groupby('State')['Final_Status'].value_counts()
DF3 = pd.DataFrame(DF3)
DF3

#

can i get the values in columns ?

📎 unknown.png

#

like closed and open in columns instead of rows

#

ok figured it out lol

#

thanks buddy @velvet thorn

velvet thorn Aug 20, 2020, 2:11 PM

#

groupby just 'State' actually, but yeah

muted oyster Aug 20, 2020, 2:12 PM

#

ok I figured out to get closed and open in rows and sort of this code worked:
DF3 = DF2.groupby('State')['Final_Status'].value_counts()
DF3 = pd.DataFrame(DF3)
DF3

I added .unstack().fillna(0) so it worked

plucky cairn Aug 20, 2020, 2:12 PM

#

i'm pretty inexperienced in ds/ml coming from an econ background. i want to fit a supervised learning model to associate bodies of text with items from a list of shorter texts. in the training set i know which large texts should be associated with the short texts and in the out-of-sample dataset i have groupings in each list

#

does that make sense

#

my data look like this https://gist.github.com/weverett96/31b30a1cb201bf9fe357d0ed5c3ec860

Gist

Fund names and strategies from Eaton Vance filing.

Fund names and strategies from Eaton Vance filing. - dataexample.py

#

where the matched pairs are 'name' and 'strategy'

#

then in the unmatched set i want to associate strategies with the 'name' fields

calm wagon Aug 20, 2020, 2:34 PM

#

is this turtorial outdated?
https://www.youtube.com/watch?v=wypVcNIH6D4

YouTube

Tech With Tim

Python Chat Bot Tutorial - Chatbot with Deep Learning (Part 1)

Ever wanted to create an AI Chat bot? This python chatbot tutorial will show you how to create a chatbot with python using deep learning .

Playlist: https://www.youtube.com/watch?v=wypVcNIH6D4&list=PLzMcBGfZo4-ndH9FoC4YWHGXG5RZekt-Q

Download JSON File: https://techwithtim.n...

▶ Play video

#

is it?

tidal bough Aug 20, 2020, 2:51 PM

#

it's from 2019, so hardly 🤔

bold olive Aug 20, 2020, 3:15 PM

#

df.sparse.to_dense() is returning sparse not found? Am I missing something?

tidal bough Aug 20, 2020, 3:18 PM

#

hmm

#

https://pandas.pydata.org/pandas-docs/stable/user_guide/sparse.html

Sparse accessor

New in version 0.24.0.

Pandas provides a .sparse accessor, similar to .str for string data, .cat for categorical data, and .dt for datetime-like data. This namespace provides attributes and methods that are specific to sparse data.

#

Maybe you have an old version?

#

check pandas.__version__

bold olive Aug 20, 2020, 3:21 PM

#

1.1.0

#

Shouldn't be a problem I guess

#

Really strange

drowsy kite Aug 20, 2020, 3:24 PM

#

thats happened to me but it was because i renamed my dataframe

#

@bold olive

bold olive Aug 20, 2020, 3:25 PM

#

Nope, not renaming my dataset anywhere

#

It is actually the output value of a multilabel classification which is in sparse format and I need to convert it into a dense matrix for the metrics

#

Weird thing is that it worked before but when I got back to it and tried running it again, it's returning this error

drowsy kite Aug 20, 2020, 3:34 PM

#

story of my life

#

you could try restart the runtime and clear any outputs

bold olive Aug 20, 2020, 3:41 PM

#

Tried it, no luck! The function works alone in a separate instance though

#

This is so weird

keen root Aug 20, 2020, 3:52 PM

#

Hi, this is probably an annoying question, but I have to ask: Does anyone recommend any book to learn Machine Learning? I've looked online but there's just so much stuff!! It's hard to distinguish from hyped stuff, oversimplified things and the actual useful things. So I was looking for something to hook into that would get me through things. I come from a physics background and I'm confortable with python (if it helps in some way)

tidal bough Aug 20, 2020, 4:01 PM

#

@keen root I personally:

Did this amazing coursera course:https://www.coursera.org/learn/machine-learning as an overview of the field.
Am now doing the Practical Reinforcement Learning course from this specialization (just because that's what I'm interested in): https://www.coursera.org/specializations/aml
For reading material, I found useful the materials the AI discord suggests:

MACHINE LEARNING
Before you start specialising in any particular field, it's important to learn the core theory of Machine Learning for a broad exposure to ideas and techniques that you can likely apply to any field.

Core
• Bishop - Pattern Recognition and Machine Learning

Also check out Model-Based Machine Learning by the same author
• Tibshirani, Friedman, Hastie - The Elements of Statistical Learning
• ColumbiaX on edX - Machine Learning

#

The first course is free. The ones from the Advanced specialization aren't, but coursera's audit mode allows free access to basically everything from the course except quizzes for some reason (programming assignments are available).

keen root Aug 20, 2020, 4:04 PM

#

@tidal bough Thank you, that's amazing, I'll follow the first course, seems to be quite complete, however it is based on matlab/octava, will it be crucial to understand the contents if I've never worked with them?

trail walrus Aug 20, 2020, 4:05 PM

#

ooh, I finished that specialization on coursera, not all courses are equally good, but overall I learned a lot

keen root Aug 20, 2020, 4:05 PM

#

Also, did you find it important to follow some book at the same time?

trail walrus Aug 20, 2020, 4:06 PM

#

nah, if you want to know something google is your friend.

#

but I do recommend to supplement the material in courses by looking things up whenever you're curious or confused about something

tidal bough Aug 20, 2020, 4:06 PM

#

@keen root I've never worked with Octave before that course. I didn't find it hard to learn - it's very nice in its native support of matrix and vector calculations.

Also, did you find it important to follow some book at the same time?
I didn't read any ML books until my Practical RL course. The first course provides its own materials, which are quite enough.

keen root Aug 20, 2020, 4:07 PM

#

Got it, thank you

odd yoke Aug 20, 2020, 4:10 PM

#

The Elements of Statistical Learning is absolutely fantastic

muted oyster Aug 20, 2020, 4:29 PM

#

@keen root there are lot of books from OReilly

lapis sequoia Aug 20, 2020, 4:29 PM

#

@muted oyster O'Reilly is pretty good

#

I learnt the basics from those books

muted oyster Aug 20, 2020, 4:30 PM

#

yes i started with Head First Python brain friendly

lapis sequoia Aug 20, 2020, 4:31 PM

#

Also, I recommend that you guys check out StatQuest with Josh Starmer on youtube

#

He covers basic statistics and ML. The channel is amazing for beginners and experts alike.

muted oyster Aug 20, 2020, 4:33 PM

#

sure, anytime! I'm new to everything and evrything helps : )

#

and also most of the O'Reilly books are available in pdfs just a google search would do

#

@keen root

keen root Aug 20, 2020, 4:41 PM

#

That's great, thank you :)

pearl crystal Aug 20, 2020, 4:52 PM

#

Can I say join distribution instead of mutivariate disctribution or it is better to say multivariate distribution for multi dimensional distributions and joint distribution for jointly distribution between different random variables?

muted oyster Aug 20, 2020, 5:19 PM

#

if i pass this

DF3.set_index('Date').plot();

I get a plot of very small size, how can i enlarge it ?

#

i guess i should ask in help section 😅

muted oyster Aug 20, 2020, 6:18 PM

#

got it, but had to change it to something much messy

safe sparrow Aug 20, 2020, 6:45 PM

#

Anyone here with experience with LSTM layers in keras?

#

Im not sure how to interpret the shapes, input and output

plucky cairn Aug 20, 2020, 7:09 PM

#

@pearl crystal either is fine, multivariate distribution implies that the random variables covary - so it's really the same thing as explicitly saying that the distributions are joint. a multivariate distribution with zero covariance wouldn't really be multivariate, it would just be a collection of univariate distributions

#

@muted oyster see https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.plot.html

#

you can use .plot(figsize(width,height)) where width and height are in inches

#

or you can just use matplot lib and build the graph yourself

#

which will probably end up looking better

ancient lichen Aug 20, 2020, 8:08 PM

#

hey I'm trying to make a classifier to identify if a burger is burger king or mcdonalds. Can some people help me build a dataset? 1 is the worst, and 100 is the best

Can you give me a rating on a scale of 1-100 on how good a burger king burger tastes?
Can you give me a rating on a scale of 1-100 on how healthy a burger king burger is?
Can you give me a rating on a scale of 1-100 on how good a mcdonalds burger tastes?
Can you give me a rating on a scale of 1-100 on how healthy a mcdonalds burger is?
Anyone willing to take a few seconds and think back to when they've had a burger would be great!

muted oyster Aug 20, 2020, 8:13 PM

#

which will probably end up looking better
@plucky cairn yes I plot it using matplotlib. It's messy bcoz i wanted 12 lines on graph and for every line i had to copy that code 12 times and passing every column in it.

tidal bough Aug 20, 2020, 8:15 PM

#

Interesting. Of all the scipy solvers, only DOP583, supposedly a very precise RK solver, has any problems with this equation.

📎 unknown.png

#

this is dy/dt = 1/y - 1/(1-y) + 10*abs(y-0.5) + np.cos(t/10), from y(0)=0.6

ancient lichen Aug 20, 2020, 8:16 PM

#

anyone want to rate burger king and mcdonalds burgers?

muted oyster Aug 20, 2020, 8:19 PM

#

Interesting. Of all the scipy solvers, only DOP583, supposedly a very precise RK solver, has any problems with this equation.
@tidal bough this looks interesting. What are the legends about ? As u mentioned dop853 is the only one ? Or alsoLSODA

#

And what are these things actually ?

tidal bough Aug 20, 2020, 8:31 PM

#

https://docs.scipy.org/doc/scipy/reference/generated/scipy.integrate.solve_ivp.html#scipy.integrate.solve_ivp

#

I only really know how the RK ones work

#

Close-up of the diverging interval

📎 unknown.png

#

RK23 is actually oscillating a bit too

#

RK45 oscillates less

#

the rest are nigh-perfect

polar berry Aug 20, 2020, 9:47 PM

#

hey what is a good way to learn python for machine learning if I have absolutely no experience with coding at all
pls ping

rapid ridge Aug 20, 2020, 9:47 PM

#

someone uses nginx here?

tidal bough Aug 20, 2020, 9:48 PM

#

@polar berry https://www.coursera.org/learn/machine-learning-with-python, or the Advanced ML specialization on coursera.
But first, you'd have to learn Python in general. See !resources.

ancient lichen Aug 20, 2020, 9:56 PM

#

anyone know any good datasets for training very basic classifiers?

tidal bough Aug 20, 2020, 10:06 PM

#

What kind of classifiers?

#

Like, any ones? Check out the Titanic dataset, it's a classic.

#

oh god, I found a really unforgiving equation

#

📎 unknown.png

#

dy/dt= y**2 - 50/(1-y)**2 + 50*np.cos(t/5) - 10*np.sin(t/10)

polar berry Aug 20, 2020, 10:42 PM

#

@tidal bough where is resources?

tidal bough Aug 20, 2020, 10:43 PM

#

!resources

arctic wedgeBOT Aug 20, 2020, 10:43 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

polar berry Aug 20, 2020, 11:46 PM

#

@tidal bough which one is the best?

#

https://pythondiscord.com/pages/resources/courses/

Python Discord | Courses

We're a large, friendly community focused around the Python programming language. Our community is open to those who wish to learn the language, as well as those looking to help others.

#

📎 unknown.png

velvet thorn Aug 21, 2020, 12:53 AM

#

@tidal bough which one is the best?
@polar berry do you really expect them to be able to tell

polar berry Aug 21, 2020, 12:55 AM

#

@velvet thorn idk bro

#

gonna use codecademy

#

https://www.coursera.org/learn/python-for-applied-data-science-ai
https://www.coursera.org/learn/machine-learning-with-python
which of these courses should i do after

Coursera

Machine Learning with Python | Coursera

Learn Machine Learning with Python from IBM. This course dives into the basics of machine learning using an approachable, and well-known programming language, Python. In this course, we will be reviewing two main components: First, you will be ...

bitter harbor Aug 21, 2020, 12:58 AM

#

IBM’s always a good choice

thin solstice Aug 21, 2020, 1:11 AM

#

Thought this is kinda relevant, since it's using numpy with large amounts of data;

Just wondering, I've got two arrays, both of the same shape. They look like this;

a = [[1,2],[6,4,2]]
b = [[3,4],[5,3,4]]```

Both a and b are numpy arrays, and I was wondering how I'd go about adding them together, to get a result like so:
```python
[[4,6],[11,7,6]]```

Would this be possible?

fervent bridge Aug 21, 2020, 1:39 AM

#

    model = tf.keras.Sequential()
    model.add(tf.keras.layers.Conv2D(32, (3, 3), activation='relu', input_shape=(58, 78, 3)))
    model.add(tf.keras.layers.Conv2D(64, (3, 3), activation='relu'))
    model.add(tf.keras.layers.Conv2D(128, (3, 3), activation='relu'))
    model.add(tf.keras.layers.Flatten())
    model.add(tf.keras.layers.Dropout(0.5))
    model.add(tf.keras.layers.Dense(1024, activation='relu'))
    model.add(tf.keras.layers.Dropout(0.2))
    model.add(tf.keras.layers.Dense(196, activation='softmax'))
    model.compile(optimizer=tf.keras.optimizers.Adam(), loss='sparse_categorical_crossentropy', metrics=['accuracy'])```Isn't my shape supposed to get smaller per layer in a CNN? if so then why do I get this error ```python
 OOM when allocating tensor with shape[479232,1024] and type float on ``` whats with the input shape of `479232`

desert parcel Aug 21, 2020, 1:58 AM

#

@desert parcel did you read what I said above
@velvet thorn I just did

#

📎 unknown.png

#

It does return those

#

when you said you should return loss.item() and the other stuff

polar berry Aug 21, 2020, 2:02 AM

#

@bitter harbor they're both IBM?

bitter harbor Aug 21, 2020, 2:03 AM

#

You’re asking about courses that I bet not many people here have looked at, look at the reviews as it’ll probably mostly be up to your/the general opinion

velvet thorn Aug 21, 2020, 2:48 AM

#

when you said you should return loss.item() and the other stuff
@desert parcel huh

#

no, it's literally returning a string

#

don't you see the .format call?

#

    model = tf.keras.Sequential()
    model.add(tf.keras.layers.Conv2D(32, (3, 3), activation='relu', input_shape=(58, 78, 3)))
    model.add(tf.keras.layers.Conv2D(64, (3, 3), activation='relu'))
    model.add(tf.keras.layers.Conv2D(128, (3, 3), activation='relu'))
    model.add(tf.keras.layers.Flatten())
    model.add(tf.keras.layers.Dropout(0.5))
    model.add(tf.keras.layers.Dense(1024, activation='relu'))
    model.add(tf.keras.layers.Dropout(0.2))
    model.add(tf.keras.layers.Dense(196, activation='softmax'))
    model.compile(optimizer=tf.keras.optimizers.Adam(), loss='sparse_categorical_crossentropy', metrics=['accuracy'])```Isn't my shape supposed to get smaller per layer in a CNN? if so then why do I get this error ```python
 OOM when allocating tensor with shape[479232,1024] and type float on ``` whats with the input shape of `479232`

@fervent bridge you cut out half the error...

#

anyway, a CNN does decrease the size of the input (channels last) along the 2nd and 3rd dimensions (assuming you don't have padding), while increasing the size of the 4th dimension (assuming the number of filters increases)

#

but then you flatten.

#

(which is a prerequisite for passing to a dense layer if you want to work with all the dimensions, but)

#

say you have an input image of size 640x480x3

#

after going through the first three convolutional layers it would end up being 636x476x128 = 38750208.

#

in a vanilla CNN the operation that decreases the size of the image is not really convolution, but pooling.

#

look up MaxPooling2D

#

Thought this is kinda relevant, since it's using numpy with large amounts of data;

Just wondering, I've got two arrays, both of the same shape. They look like this;
a = [[1,2],[6,4,2]]
b = [[3,4],[5,3,4]]```

Both a and b are numpy arrays, and I was wondering how I'd go about adding them together, to get a result like so:
```python
[[4,6],[11,7,6]]```

Would this be possible?

@thin solstice how can those be arrays?

#

they're not the right shape

#

unless you're saying they're object arrays (print(a.dtype)), which is not what numpy should be used for, in general

desert parcel Aug 21, 2020, 3:19 AM

#

don't you see the .format call?
@velvet thorn oh.. Now I See

velvet thorn Aug 21, 2020, 3:20 AM

#

yes.

#

which is why I said you really would benefit from some more work on your fundamentals

#

don't try to dive into DS (and DL) so quickly...

desert parcel Aug 21, 2020, 3:20 AM

#

my fundamentals are good

#

... despite

#

everything...

#

lol

velvet thorn Aug 21, 2020, 3:20 AM

#

incidentally, if you had had type hints there

#

your IDE would have made that clear

desert parcel Aug 21, 2020, 3:20 AM

#

I'm using google colab

#

and I use a text editor

velvet thorn Aug 21, 2020, 3:20 AM

#

not really sure if it has support for type hints

desert parcel Aug 21, 2020, 3:20 AM

#

I don't have an IDE installed on my machine

velvet thorn Aug 21, 2020, 3:20 AM

#

but you can try mypy

desert parcel Aug 21, 2020, 3:20 AM

#

what's mypy

#

lemme search it up

velvet thorn Aug 21, 2020, 3:21 AM

#

but anyway, I'm not going to argue with you about your learning path...?

desert parcel Aug 21, 2020, 3:21 AM

#

yeah sounds fair lol

velvet thorn Aug 21, 2020, 3:21 AM

#

the last thing I'll say is that that would have been a trivial error to debug

desert parcel Aug 21, 2020, 3:21 AM

#

I'm good in a sense that

velvet thorn Aug 21, 2020, 3:21 AM

#

IMO

#

but well, to each their own

desert parcel Aug 21, 2020, 3:21 AM

#

I agree with that

#

like... um

velvet thorn Aug 21, 2020, 3:21 AM

#

not that I mind solving this kind of problem since I love procrastinating 🙂

desert parcel Aug 21, 2020, 3:21 AM

#

I know what to do ut sometimes

#

but*

#

but sometimes I just don't look too into detail like return and print

#

I just assumed they were interchangeable well... until now of course

velvet thorn Aug 21, 2020, 3:22 AM

#

that...is a very scary statement

desert parcel Aug 21, 2020, 3:22 AM

#

well I mean

#

Don't let it be..?

#

xd

velvet thorn Aug 21, 2020, 3:22 AM

#

how long have you spent using Python

desert parcel Aug 21, 2020, 3:23 AM

#

about 4 months

velvet thorn Aug 21, 2020, 3:23 AM

#

hm.

desert parcel Aug 21, 2020, 3:23 AM

#

but I don't really know anymore lol

velvet thorn Aug 21, 2020, 3:23 AM

#

well, I hope it works out for you

desert parcel Aug 21, 2020, 3:23 AM

#

haven't been keeping track

#

It worked

#

so all my issues with my code

#

was because I was returning a string

#

ohhhh no wonder I couldn't get any where

velvet thorn Aug 21, 2020, 3:24 AM

#

yes

#

I would suggest you look up type hinting

desert parcel Aug 21, 2020, 3:24 AM

#

it's like

velvet thorn Aug 21, 2020, 3:24 AM

#

trivial way to prevent such errors

desert parcel Aug 21, 2020, 3:25 AM

#

def (x: str, y: int)

#

something like that right?

velvet thorn Aug 21, 2020, 3:25 AM

#

you're missing a function name

desert parcel Aug 21, 2020, 3:25 AM

#

yeah I know lol

velvet thorn Aug 21, 2020, 3:25 AM

#

and you can annotate the return type too

#

but yes

#

that's the basic idea

desert parcel Aug 21, 2020, 3:25 AM

#

I use this in my functions

#

or whatever they are used in

#

I don't need to keep track of it too much

graceful ice Aug 21, 2020, 3:51 AM

#

How to join 2 df's using pandas on the basis of a column but the column data matches partially is that possible

#

hello work (df a) ------- hello world data.

velvet thorn Aug 21, 2020, 3:52 AM

#

not really.

#

not without a fair bit of processing

#

that's quite a high-level problem

thin solstice Aug 21, 2020, 3:55 AM

#

unless you're saying they're object arrays (print(a.dtype)), which is not what numpy should be used for, in general
@velvet thorn in that case, oops

graceful ice Aug 21, 2020, 3:55 AM

#

can anybody help

soft dock Aug 21, 2020, 3:55 AM

#

You may be able to incorporate something like Levenshtein distance as a conditional check whether to join a specific column, but I think it would be kinda awkward depending on the structure of the two dataframes.

graceful ice Aug 21, 2020, 3:56 AM

#

let me give you an example to be more specific

bitter harbor Aug 21, 2020, 4:00 AM

#

about 4 months
@desert parcel I'm honestly impressed that you got away with that for so long

velvet thorn Aug 21, 2020, 4:00 AM

#

you cannot just use Levenshtein distance or some other difference metric

#

or rather, not alone

#

I would suggest some form of clustering

#

then join on the cluster IDs

#

which is why I said "not without a fair bit of processing"

graceful ice Aug 21, 2020, 4:02 AM

#

In one df the there is a column named as model i,.e equal to Galaxy s2
another df there is a colum named model i.e equal to Galaxy s2 a

#

I want to match these 2

velvet thorn Aug 21, 2020, 4:02 AM

#

yes, we understand the problem.

#

it's not a simple problem

#

it is not difficult to find the distance between two rows given a specific column.

graceful ice Aug 21, 2020, 4:03 AM

#

@velvet thorn you are taking it ina complex manner

#

wait let me thik a bit out of it

velvet thorn Aug 21, 2020, 4:03 AM

#

just use some form of string metric

#

@velvet thorn you are taking it ina complex manner
@graceful ice do you understand why I say it is not simple...?

desert parcel Aug 21, 2020, 4:05 AM

#

@desert parcel I'm honestly impressed that you got away with that for so long
@bitter harbor lol so am I

#

I learnt some selenium and other stuff

bitter harbor Aug 21, 2020, 4:05 AM

#

hello work (df a)
would these be three different columns

velvet thorn Aug 21, 2020, 4:06 AM

#

no, "hello work" is the value in a column in one dataframe, and "hello world" is the value in an identically named column in the other dataframe

#

it was a pretty poorly formatted example TBH

bitter harbor Aug 21, 2020, 4:07 AM

#

yikes ya I thought it was comparing 'words' not the phrase

#

would it work if you checked if the indices of the characters in the column were equal on both objects+/had the same spacing (ei (data) > data - different indices but the letters are the same spacing)

velvet thorn Aug 21, 2020, 4:08 AM

#

and the thing is

#

(I'm not so sure about this)

#

because their use of terminology isn't very clear

#

but they might want to join the rows

#

as opposed to match them

graceful ice Aug 21, 2020, 4:09 AM

#

I did it

velvet thorn Aug 21, 2020, 4:09 AM

#

and the thing is...where do you stop?

#

because with a high enough threshold any number of strings can be matched

bitter harbor Aug 21, 2020, 4:09 AM

#

ah ya I just read it

#

I'm confused now

graceful ice Aug 21, 2020, 4:10 AM

#

df['PartialModel'] =df['Model'].apply(lambda x: difflib.get_close_matches(x, invoiceDf['Model'])[0] if len(difflib.get_close_matches(x, invoiceDf['Model']))>0 else "Unknown")

#

read this

velvet thorn Aug 21, 2020, 4:11 AM

#

that's literally not what you originally said though

#

you said "join"

graceful ice Aug 21, 2020, 4:11 AM

#

yes

#

I took this apprach

velvet thorn Aug 21, 2020, 4:11 AM

#

good for you then

graceful ice Aug 21, 2020, 4:11 AM

#

@velvet thorn I will join with this

#

@velvet thorn why are you getting agry

velvet thorn Aug 21, 2020, 4:12 AM

#

huh?

#

what do you mean

graceful ice Aug 21, 2020, 4:12 AM

#

angry

#

never mind

#

@velvet thorn thanks for your help

#

and time

velvet thorn Aug 21, 2020, 5:06 AM

#

@velvet thorn thanks for your help
@graceful ice yw

lofty scarab Aug 21, 2020, 5:38 AM

#

Hello all! I had a question involving route optimization and distance matrices.

#

It's similar to the traveling salesman problem but if you had multiple salesmen

bitter harbor Aug 21, 2020, 5:53 AM

#

@lofty scarab So no overlapping of the salesmen?

thin solstice Aug 21, 2020, 8:04 AM

#

http://matrixmultiplication.xyz/
what's numpy's name for this function?

Matrix Multiplication

An interactive matrix multiplication calculator for educational purposes

pale thunder Aug 21, 2020, 8:05 AM

#

you can use the @ operator

thin solstice Aug 21, 2020, 8:05 AM

#

thanks! :)

pale thunder Aug 21, 2020, 8:05 AM

#

or np.matmul

thin solstice Aug 21, 2020, 8:06 AM

#

wait... it seems to only return an array with one value?..

#

lemme show you what I've got...

#

>>> a = np.array([0.0019, -0.01])
>>> ht = np.array([[-0.09],[0.04]])
>>> a@ht
array([-0.000571])
# shouldn't this array be shaped as (2,2)?

#

@pale thunder ^

#

since here on this website, matrix multiplication of two arrays returns an array shaped as 2,2

📎 unknown.png

#

& this is what I get when I try that same thing in python:

>>> A = np.array([1,2])
>>> B = np.array([3,4])
>>> A@B
11
>>>

red pike Aug 21, 2020, 8:09 AM

#

try np.matmul

bitter harbor Aug 21, 2020, 8:10 AM

#

*!!!

thin solstice Aug 21, 2020, 8:10 AM

#

same thing

>>> np.matmul(A,B)
11

#

>>> A*B
array([3, 8])

#

still not a 2,2 array

pale thunder Aug 21, 2020, 8:11 AM

#

In [17]: A = np.array([[1,2]])

In [18]: B = np.array([[3],[4]])

In [19]: A @ B
Out[19]: array([[11]])

In [20]: B @ A
Out[20]:
array([[3, 6],
       [4, 8]])

thin solstice Aug 21, 2020, 8:11 AM

#

ohh

#

>>> B @ A
11 ```
@pale thunder same issue

#

wait no

#

I see

#

thank you

bitter harbor Aug 21, 2020, 8:12 AM

#

this is why linear algebra is hard 😛

thin solstice Aug 21, 2020, 8:12 AM

#

yeah haha

pale thunder Aug 21, 2020, 8:12 AM

#

you need to have have a row and a column, so they need to be 2D.

thin solstice Aug 21, 2020, 8:13 AM

#

I'm only in year 10 and I'm struggling to wrap my head around matrix multiplication :P

pale thunder Aug 21, 2020, 8:13 AM

#

good luck!

thin solstice Aug 21, 2020, 8:13 AM

#

haven't done anything like this in school lol, thanks! :)

bitter harbor Aug 21, 2020, 8:13 AM

#

id suggest watching 3b1b's series on it

pale thunder Aug 21, 2020, 8:13 AM

#

another useful thing is transpose, which you do as A.T

lost yoke Aug 21, 2020, 8:46 AM

#

& this is what I get when I try that same thing in python:
>>> A = np.array([1,2])
>>> B = np.array([3,4])
>>> A@B
11
>>> 

@thin solstice try with B = np.array([[3],[4]])

#

oh already said. sorry.

#

also, np.array([3,4]).T

#

that transposes ("turn the other way") the vector

thin solstice Aug 21, 2020, 8:56 AM

#

yup, I used that :)

#

thanks

molten hamlet Aug 21, 2020, 9:59 AM

#

Someone is using plotly? I wonder if there is a way to not use browser to plot

lapis sequoia Aug 21, 2020, 12:26 PM

#

Matplotlib has always been the go-to plotting library for all my use cases.

#

Heck, even pandas has a plotting method.
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.plot.html

covert rover Aug 21, 2020, 12:36 PM

#

Hey ppl can anyone tell me why everyone uses Jupyter Notebook for data science? I use Spyder but if that many people uses Jupyter it has to be a reason right?

tidal bough Aug 21, 2020, 12:38 PM

#

@covert rover Main advantage for me is the cell structure.

#

It really is convenient. It's a nice balance between running entire programs, and running single lines of code (REPL).

#

So you can have a cell with imports, one that calculates stuff, one that plots stuff...

#

And if you want to plot with different settings, you just change the plotting cell and rerun that cell without recalculating the data.

covert rover Aug 21, 2020, 12:40 PM

#

@tidal bough that's convincing
thanks!

solemn topaz Aug 21, 2020, 12:58 PM

#

How would I go about testing image recognition code that I wrote using OpenCV?

#

Are there any libraries/tools that can help with this?

#

I haven't found any good resources online

#

The only thing I can think of is just to have a folder with a bunch of test images and some json file with text or numbers or whatever that I expect to find in each of them. Is there a better way?

deft harbor Aug 21, 2020, 2:09 PM

#

Data pipeline

dire pollen Aug 21, 2020, 2:17 PM

#

anyone knows how to export a pandas dataframe to csv? Im reading the doc but I got this 'list' object has no attribute 'to_csv'

solemn topaz Aug 21, 2020, 2:35 PM

#

@deft harbor could you elaborate?

desert oar Aug 21, 2020, 2:41 PM

#

@dire pollen then it's not a dataframe, it's a list

#

read the error message

molten hamlet Aug 21, 2020, 2:51 PM

#

📎 Screenshot_from_2020-08-21_16-51-05.png

#

Decision tree prototype, I mean, its functioning and predicting, the plotting is prototype 😄

tidal bough Aug 21, 2020, 2:52 PM

#

~~that's not a tree, it has cycles~~ 😛

molten hamlet Aug 21, 2020, 2:52 PM

#

nooo

#

I cant draw arows , im learning plotly :d

#

its all one direction starting from want

raven mulch Aug 21, 2020, 2:53 PM

#

In this video we continue on the topic of Lipschitz continuity by presenting a paper which proposes a projection method to enforce it! If you enjoy this video consider watching others which I have on the topic! 🙂 I would love to have discussion here or on the comment section, the goal of this youtube channel is to create knowledge and interesting discussions in this area of ML.

Video: https://www.youtube.com/watch?v=9kxhEdiTwek

Paper: https://arxiv.org/abs/1804.04368

Abstract: We investigate the effect of explicitly enforcing the Lipschitz continuity of neural networks with respect to their inputs. To this end, we provide a simple technique for computing an upper bound to the Lipschitz constant---for multiple p-norms---of a feed forward neural network composed of commonly used layer types. Our technique is then used to formulate training a neural network with a bounded Lipschitz constant as a constrained optimisation problem that can be solved using projected stochastic gradient methods. Our evaluation study shows that the performance of the resulting models exceeds that of models trained with other common regularisers. We also provide evidence that the hyperparameters are intuitive to tune, demonstrate how the choice of norm for computing the Lipschitz constant impacts the resulting model, and show that the performance gains provided by our method are particularly noticeable when only a small amount of training data is available.

YouTube

Federico Barbero

Regularisation of Neural Networks by Enforcing Lipschitz Continuity

In this video we continue on the topic of Lipschitz continuity by presenting a paper which proposes a projection method to enforce it!

Paper: https://arxiv.org/abs/1804.04368

Abstract: We investigate the effect of explicitly enforcing the Lipschitz continuity of neural net...

▶ Play video

arXiv.org

Regularisation of Neural Networks by Enforcing Lipschitz Continuity

We investigate the effect of explicitly enforcing the Lipschitz continuity of
neural networks with respect to their inputs. To this end, we provide a simple
technique for computing an upper bound...

desert oar Aug 21, 2020, 3:19 PM

#

@raven mulch has this technique been adpoted at all? its interesting but i havent heard of it before

lapis sequoia Aug 21, 2020, 3:19 PM

#

if i have a dataset with a lot of missing values and i want to calculate (cramers) correlation, is it important to impute the missing values first?

desert oar Aug 21, 2020, 3:20 PM

#

you need to do something with them. either drop them or impute them @lapis sequoia

raven mulch Aug 21, 2020, 3:20 PM

#

Similar techniques have had great success with GANs

desert oar Aug 21, 2020, 3:20 PM

#

imputation is kind of a can of worms but maybe you can get away with mean/median/mode imputation

lapis sequoia Aug 21, 2020, 3:20 PM

#

ty

raven mulch Aug 21, 2020, 3:21 PM

#

And experimental section shows very promising results with feed forward nets and conv nets

desert oar Aug 21, 2020, 3:21 PM

#

looks like you're interested in regularization, i see you have a video on another "obscure" technique

raven mulch Aug 21, 2020, 3:21 PM

#

My main area of interest is ML security

#

That’s what I do research in at my uni

#

But I’m interested in this stuff too yeah

#

Which is quite related

desert oar Aug 21, 2020, 3:22 PM

#

i suspect regularization would be an important topic in that area

raven mulch Aug 21, 2020, 3:22 PM

#

Yep

desert oar Aug 21, 2020, 3:22 PM

#

very interesting

lapis sequoia Aug 21, 2020, 3:36 PM

#

Hi.
Recently I've started exploring graph-like data (complete beginner). Does anyone have a resource recommendation for modelling 'labeled property graph' data? I want to learn how to properly represent such data in python.

pearl crystal Aug 21, 2020, 4:17 PM

#

Nowadays, should we use criterion like AIC to compare models?
AIC= 2k-2ln(L)
We can compare models based on the accuracy in test data and utilize cross validation techniques. So, why do we need these absurd criterion?

desert oar Aug 21, 2020, 4:52 PM

#

@pearl crystal the criteria aren't absurd. they are meant for cases where you don't necessarily have enough data, or good enough data, to use cross validation or a train/test split

#

also they use different goodness of fit criteria, in this case the likelihood of the model

#

that said, there are some nice asymptotic results relating model fit criteria like AIC DIC and WAIC with LOOCV

#

https://www.youtube.com/watch?v=xS4jDHQfP2o

YouTube

Ben Lambert

Evaluating model fit through AIC, DIC, WAIC and LOO-CV

This video is part of a lecture course which closely follows the material covered in the book, "A Student's Guide to Bayesian Statistics", published by Sage, which is available to order on Amazon here: https://www.amazon.co.uk/Students-Guide-Bayesian-Statistics/dp/1473916364

...

▶ Play video

#

in a lot of todays' machine learning problems, you dont usually need these criteria. and not all of them are actually good criteria. but to call them "absurd" is imo ignorant of their intended purpose

pearl crystal Aug 21, 2020, 5:15 PM

#

@desert oar
Ben Lambert is a great and expert data scientist. I have seen some of his videos. They were perfect. thanks

desert oar Aug 21, 2020, 5:16 PM

#

👍 indeed

#

one place you still see AIC used is in time series modeling

#

although its not necessarily ideal there either

#

but in time series work it's often much harder to cross-validate or otherwise hold out test data

pearl crystal Aug 21, 2020, 5:16 PM

#

I do not know why his videos in youtube do not have enough views

desert oar Aug 21, 2020, 5:17 PM

#

he's a pretty well respected researcher, so he probably just doesn't spend effort promoting his work

#

i agree i really like his content

lapis sequoia Aug 21, 2020, 7:25 PM

#

anyone here use eta-squared before?

solid aurora Aug 21, 2020, 9:12 PM

#

matplotlib's imshow() on a 3-d array treats the third axis as RGB, right?

tidal bough Aug 21, 2020, 9:13 PM

#

As 10 seconds of googling show, yes:
https://matplotlib.org/api/_as_gen/matplotlib.pyplot.imshow.html

The image data. Supported array shapes are:

(M, N): an image with scalar data. The values are mapped to colors using normalization and a colormap. See parameters norm, cmap, vmin, vmax.
(M, N, 3): an image with RGB values (0-1 float or 0-255 int).
(M, N, 4): an image with RGBA values (0-1 float or 0-255 int), i.e. including transparency.

solid aurora Aug 21, 2020, 9:13 PM

#

right, oops

#

sorry

tidal bough Aug 21, 2020, 9:13 PM

#

if shape[2] is more than 4, higher indexes by that dim are ignored.

molten hamlet Aug 21, 2020, 10:08 PM

#

@tidal bough

📎 tree.jpg

modest rune Aug 22, 2020, 1:16 AM

#

I am having trouble understanding this: https://www.statsmodels.org/stable/generated/statsmodels.robust.robust_linear_model.RLM.html#statsmodels.robust.robust_linear_model.RLM

#

exog : array_like
A nobs x k array where nobs is the number of observations and k is the number of regressors. An intercept is not included by default and should be added by the user. See statsmodels.tools.add_constant.

#

I googled nobs array and got nothing.

#

Here is an example of this array being constructed...

nsample = 50
sig = 0.5
x = np.linspace(0, 20, nsample)
X = np.column_stack((x, np.sin(x), (x-5)**2, np.ones(nsample)))
beta = [0.5, 0.5, -0.02, 5.]

y_true = np.dot(X, beta)
y = y_true + sig * np.random.normal(size=nsample)

X is exog the nobs array

#

that code comes from this example:
https://www.statsmodels.org/stable/examples/notebooks/generated/ols.html

#

To restate my request for help: What is nobs? how are the 4 elements in each array element for nobs used? Any suggestions on something I could read to inform myself?

odd yoke Aug 22, 2020, 1:25 AM

#

nobs is the number of observations as per the text you posted
and I'm unsure what beta is in the snippet as it is unused

rapid ridge Aug 22, 2020, 1:27 AM

#

#web-development

odd yoke Aug 22, 2020, 1:28 AM

#

in the code you posted, nobs = 50, k = 4

#

the k comes from the 4 elements in the argument to np.column_stack

modest rune Aug 22, 2020, 1:30 AM

#

oh, ignore beta. it is used to calculate y_true and I accidently left out the line of code that explains how it was used

#

added that line back

#

what is k?

#

number of regressors? What do I need to read to better understand that?

odd yoke Aug 22, 2020, 1:32 AM

#

also looking at this https://www.statsmodels.org/stable/endog_exog.html it appears exog/endog are the generic terms it uses for x/y or input/output

modest rune Aug 22, 2020, 1:33 AM

#

yes, I sorted that much out

#

which was an initial source of confusion

odd yoke Aug 22, 2020, 1:34 AM

#

regressors are basically how you would call features

modest rune Aug 22, 2020, 1:34 AM

#

my remaining confusion is about the four k elements... 1: x, 2: sin(x), 3: (x-5)^2, 4: 1

#

hey hey hey... I wouldn't call them anything 😉 I don't even understand what they are

#

what do you mean by "you would call features"

odd yoke Aug 22, 2020, 1:35 AM

#

the independent variables

#

the "things" that make up the observations

#

like you had a pandas dataset, you'd have some column "output", and 4 other columns that would correspond to these

#

these would be used to predict the output

#

i'm not exactly a stats major so pardon my lack of proper terminology

modest rune Aug 22, 2020, 1:37 AM

#

Ok... so how do those 4 features apply to the plot that was generated?

📎 examples_notebooks_generated_ols_18_0.png

#

x is raw data. that part is easy.

odd yoke Aug 22, 2020, 1:38 AM

#

i'm guessing the OLS curve is the one generated from the model ?

modest rune Aug 22, 2020, 1:38 AM

#

yes

odd yoke Aug 22, 2020, 1:40 AM

#

if so, then i'm guessing it'd be something like OLS would try to fit y = ax + b sin(x) + c (x-5)^2 + d

modest rune Aug 22, 2020, 1:41 AM

#

That is what I was worried about, because that means that OLS was given a ton of data about what the plot should look like. So, what work is OLS actually doing if most of the curve is already defined?

odd yoke Aug 22, 2020, 1:42 AM

#

the model predicted the a, b, c, d

modest rune Aug 22, 2020, 1:42 AM

#

Interesting. OK! That helps a ton!

odd yoke Aug 22, 2020, 1:43 AM

#

do you have the code that gets the result of the model, and plot it ?

modest rune Aug 22, 2020, 1:43 AM

#

Often times, a person's confusion is more about their lack of ability to view the problem from the right perspective.

#

It is completely copy and paste from the example I linked above.

odd yoke Aug 22, 2020, 1:44 AM

#

ah yeah i see it now

modest rune Aug 22, 2020, 1:45 AM

#

Except I naively swapped the data out with a Google option chain volatility smile expecting to get a nice curve fit without changing much in the code. Finally figured out that it wasn't working because I wasn't defining the k parameters.

#

But, I think you gave me enough of a hint... I have an idea of what I need to do now.

odd yoke Aug 22, 2020, 1:45 AM

#

this part shows the a/b/c/d

📎 unknown.png

#

and if we do graph it, we can see it matches the graph from above

#

📎 unknown.png

modest rune Aug 22, 2020, 1:49 AM

#

does the number of k parameters define the number of orders of a polynomial equation that is used?

odd yoke Aug 22, 2020, 1:49 AM

#

yes

modest rune Aug 22, 2020, 1:49 AM

#

cool

odd yoke Aug 22, 2020, 1:49 AM

#

if you wanted to use a polynomial that is

modest rune Aug 22, 2020, 1:50 AM

#

so, it wouldn't be y = ax + b sin(x) + c (x-5)^2 + d it would be y = ax^3 + b sin(x)^2 + c (x-5)^2 + d?

#

or not.

#

it is not how many orders then, it just defines an equation.

#

you could make it a polynomial or not, your choice

odd yoke Aug 22, 2020, 1:51 AM

#

no this model accepts anything apparently, you'd have to replace sin(x) , (x - 5)**2 etc with actual x ** 2, x ** 3 etc

#

you could make it a polynomial or not, your choice
@modest rune exactly

modest rune Aug 22, 2020, 1:52 AM

#

cool. I think I get it. Thanks!

#

@odd yoke Thanks! I was able to make progress! Have more to learn now, but I was able to get a decent curve fit.

#

It was a straight line before, now it is curvy and fitty! all in one 🙂

📎 unknown.png

odd yoke Aug 22, 2020, 2:06 AM

#

nice

thin solstice Aug 22, 2020, 2:26 AM

#

okey, I've got a question about something...
I'm trying to write a neural network library, and I've got something so far. it works at predicting, it's got weights & biases, an mutate functions, etc. it's fully functional if you use a genetic algorithm to train it, but personally I'd like to incorporate backpropagation, however I'm having some trouble

arctic wedgeBOT Aug 22, 2020, 2:26 AM

#

Hey @thin solstice!

It looks like you tried to attach a Python file - please use a code-pasting service such as https://paste.pythondiscord.com

thin solstice Aug 22, 2020, 2:26 AM

#

https://paste.pythondiscord.com/ekejokewuk.rb

#

there's my code, and it seems to be very strange during the training process;

#


if __name__ == '__main__':

    n = Network( [2,3,1] )

    tests = [
        [[1,0],[1]],
        [[0,1],[1]],
        [[1,1],[0]],
        [[0,0],[0]]
    ]

    for i in range(2500):
        test = random.choice(tests)
        print('\n')
        print(test)
        print(n.feedforward(test[0]))
        n.train( test[0], test[1] )
        print(n.feedforward(test[0]))
    
    for test in tests:
        print(test, n.feedforward(test[0]))```
I'm attempting to teach it the XOR problem, but I'll send a sample of what happens when it is run..

#

this is during training;

[[1, 0], [1]]
[0.43385151]
[0.44138086]


[[1, 1], [0]]
[0.44142151]
[0.43554815]


[[0, 0], [0]]
[0.43553338]
[0.42969438]```

#

the first line is the test, second line is the network's guess, and the third line is (hopefully) the improved network's guess

#

and as you can tell, it is improving, but only per question

#

by the end of the 2,500 training examples, it seems like it hasn't learnt a thing, apart from making all the answers equal for some odd reason

#

[[1, 0], [1]] [0.45336777]
[[0, 1], [1]] [0.45343231]
[[1, 1], [0]] [0.45340774]
[[0, 0], [0]] [0.45339234]```

#

the first list is the test, and the second list is the network's guesses

#

as you can see, they seem to converge to 0.45

#

I've been trying to follow along with this, but clearly I messed something up somewhere along the way, but I can't figure out what I've done wrong..
https://www.youtube.com/watch?v=tlqinMNM4xs&list=PLRqwX-V7Uu6aCibgK1PTWWu9by6XFdCfh&index=18

#

any help is appreciated greatly, and please @ me in replies, thanks :)

#

( I moved my question to #help-peanut )

muted oyster Aug 22, 2020, 5:17 AM

#

a quick question, how do i get max value of a column along with corresponding row element?

#

max value I know, DF['column'].max()

#

DF['column'].idxmax()
this gives index of the max value but i want value from another column which falls in same row as max value

#

🥴 idk if someone will understand what im trying to say

#

Ok got it, sometimes just need to revisit the basics:
DF[DF.Column == DF.Column.max()]

lapis sequoia Aug 22, 2020, 5:48 AM

#

Hello, how are you guys? I want to learn data science and artificial intelligence, and I know that I have to start learning linear algebra, differentiation and integration, statistics, probabilities, and data analysis. Is there anything more I should learn?

static aurora Aug 22, 2020, 6:02 AM

#

I want to make every cell that's >0.4 yellow and I'm trying to do it like this --> ```python
df.style.applymap( lambda x: 'background-color : yellow' if x > 0.4 else '')

#

@lapis sequoia no, that's everything

bitter harbor Aug 22, 2020, 7:11 AM

#

Hello, how are you guys? I want to learn data science and artificial intelligence, and I know that I have to start learning linear algebra, differentiation and integration, statistics, probabilities, and data analysis. Is there anything more I should learn?
@lapis sequoia I'd suggest basic neural network architecture

viral scroll Aug 22, 2020, 8:44 AM

#

Hi Guys,

I have a pandas dataset with a datetime field and a value field.

I would like to get the sum of the records sorted week wise in such a way so that the all the records before that week should be included in the sum.

Week 1 should have sum of values for week 1 dates
Week 2 should have sum of values for week 1 dates+week 2 dates
Week 3 should have sum of values for week 1 + week 2 + week 3 dates
and so on

#

Your help will be very much appreciated

Thanks in advance 🙂

velvet thorn Aug 22, 2020, 8:46 AM

#

Your help will be very much appreciated

Thanks in advance 🙂
@viral scroll sort, groupby, sum, cumsum

viral scroll Aug 22, 2020, 8:46 AM

#

Ohh...that was fast...Thanks a lot 🙂

#

let me give it a try

solar jungle Aug 22, 2020, 9:18 AM

#

Hello, so I had this question about neural networks,
when we merge outputs from 2 different layers, we usually use 'add' layer

#

In keras there are many such layers like 'add', 'multiply', 'average', etc.

#

https://keras.io/api/layers/merging_layers/

Keras documentation: Merging layers

#

does anybody have a practical explanation of which one to use to merge when ?

jovial oriole Aug 22, 2020, 2:03 PM

#

Im working with a dataframe in pandas,
I dont know how to search by a specific year

Basically my question is In 2016, which person sold the most in each category?

#group = df.groupby(["Category", "person"]).sum()
#group."Ship Date"].to_datetime()
#total_sales = group["Units Sold"].groupby(level=0, group_keys=False)
#total_sales.nlargest(1)

But how do I group by the specific year aswell

#

the data type of the date column is datetime64[ns]

velvet thorn Aug 22, 2020, 2:25 PM

#

df.groupby(['Ship Date'].dt.year)

jovial oriole Aug 22, 2020, 2:38 PM

#

not working

desert oar Aug 22, 2020, 3:52 PM

#

@velvet thorn use resample instead

#

df.resample('1Y', on='date')

#

I think

#

Off the top of my head

lapis sequoia Aug 22, 2020, 4:00 PM

#

hey there

#

I'm having a hard time understanding sync and async..

#

I looked up simple explanations, it says : sync is when request 1 -> response, before you run request2..

#

async is request1 and request2 get executed at the same time without waiting for either to complete

#

I don't have anything I personally do to correlate this with, so this explanation isn't useful..

#

anyway, I'm ultimately trying to understand this in relation to model training:

#

Synchronous training has all worker training on different subsets of input data and incrementally combines results. In asynchronous training, workers operate independently and update variables asynchronously

#

@desert oar

desert oar Aug 22, 2020, 4:40 PM

#

@lapis sequoia this belongs in #async-and-concurrency, also id appreciate it if i wasnt randomly pinged

jolly hinge Aug 22, 2020, 6:35 PM

#

Hello fam,

sacred sierra Aug 22, 2020, 8:18 PM

#

Hey guys, if anyone is good with pandas, I am having some issues with duplicate values that I've tried to describe out in #help-popcorn channel, not sure if there is a more appropriate channel to post this too so apologies if this isn't the place

modest rune Aug 22, 2020, 10:25 PM

#

I am trying to wrap my head around something with regards to surface fitting. Libraries like scikit-learn and statsmodels provide the tools to fit a curve, but not the tools to fit a surface (3D surface). I get the feeling, that given 3 axis, X, Y, and Z, there is a way to curve fit Z with respect to X but do it for every value of Y, then seperately curve fit Z with respect to Y and do it for every value of X, and then somehow combine those curve fits to form a surface fit.

Like i mentioned above, scikit-learn and statsmodels libraries have lots of curve fitting algoritms but no surface fitting algorithms.

#

I think scikit has a few surface fit funtions, but not for the vast majority of their curve fit algoritms (ex. OLS, RLM, LOESS, LOWESS, etc.)

tidal bough Aug 22, 2020, 10:49 PM

#

Many curve fitting methods work on any number of dimensions. It's weird if scikit can't do it, lemme check...

modest rune Aug 22, 2020, 10:50 PM

#

They might and maybe they simply lack examples showing how to do it with an extra dimension.

#

Being a noob in this area, I am certain I don't understand much of the documentation.

tidal bough Aug 22, 2020, 10:51 PM

#

https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.Ridge.html#sklearn.linear_model.Ridge
Can't this do multidimensional?

This model solves a regression model where the loss function is the linear least squares function and regularization is given by the l2-norm. Also known as Ridge Regression or Tikhonov regularization. This estimator has built-in support for multi-variate regression (i.e., when y is a 2d-array of shape (n_samples, n_targets)).

#

In fact, it looks like the first example is that case:

>>> from sklearn.linear_model import Ridge
>>> import numpy as np
>>> n_samples, n_features = 10, 5
>>> rng = np.random.RandomState(0)
>>> y = rng.randn(n_samples)
>>> X = rng.randn(n_samples, n_features)
>>> clf = Ridge(alpha=1.0)
>>> clf.fit(X, y)
Ridge()

#

and this constructs polynomial features, even for multidimensional data: https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.PolynomialFeatures.html#sklearn.preprocessing.PolynomialFeatures

And this tutorial shows using them together, though only on 1d data: https://scikit-learn.org/stable/auto_examples/linear_model/plot_polynomial_interpolation.html

modest rune Aug 22, 2020, 10:56 PM

#

I looked, didnt see an example of them using that regression for surface fitting. That doesn't mean it can't though. I think there is a high probability what I want to do is supported and easy, I just am missong a piece of the mental puzzle

#

I think you are right. I bet I can pass a properly dimensioned array to pull this off. But, I guess I don't quite get how to do that... An example would be sweet. I am a bit suprised I can' find one if it is supposidly easy.

bitter fiber Aug 22, 2020, 11:03 PM

#

What is special about the ridge model though?

velvet thorn Aug 22, 2020, 11:03 PM

#

@velvet thorn use resample instead
@desert oar depends on what you wanna do I guess?

#

like if you wanted to transform instead of aggregate you couldn’t resample

bitter fiber Aug 22, 2020, 11:04 PM

#

ridge sounds like an esoteric statistical model.

odd yoke Aug 22, 2020, 11:06 PM

#

I see it very often at work, alongside lasso and elasticnet

tidal bough Aug 22, 2020, 11:06 PM

#

lemme try LinearRegression

odd yoke Aug 22, 2020, 11:07 PM

#

it's a scarily simple way to regularize linear models, and it generally doesn't cost anything other than adding a few characters to your code to specify you want to use ridge

velvet thorn Aug 22, 2020, 11:08 PM

#

ridge sounds like an esoteric statistical model.
@bitter fiber it is mega common

#

at least IME

odd yoke Aug 22, 2020, 11:08 PM

#

i have the same experience

modest rune Aug 22, 2020, 11:08 PM

#

My data has lots of outliers, so i was hoping to use something stable like lowess or robust linear models

velvet thorn Aug 22, 2020, 11:08 PM

#

not working
@jovial oriole what do you mean not working

#

actually .groupby(pd.Grouper(‘Ship Date’, axis=‘year’)) would have been more appropriate

bitter fiber Aug 22, 2020, 11:14 PM

#

I've used the facebook ai model prophet since 2016 > for time series specifically

jovial oriole Aug 22, 2020, 11:14 PM

#

@velvet thorn I got it in the end , I did
dfbyyear2014= df[df['Order Date'].dt.strftime('%Y') == '2014']
So I reworked the dataframe, basicly a pre applied filter

bitter fiber Aug 22, 2020, 11:14 PM

#

way better than any other model i've tried

tidal bough Aug 22, 2020, 11:15 PM

#

@modest rune
Yup, it works.
https://repl.it/repls/FlawlessBlueModules#main.py

#

As you can see, it correctly finds out the coefficients:

[ 0.00000000e+00  1.00000000e+00  1.11022302e-15  3.33066907e-16
  1.00000000e+00 -1.00000000e+00]

#

wait, or does it

#

~~that's not correct at all, lol~~

velvet thorn Aug 22, 2020, 11:17 PM

#

@velvet thorn I got it in the end , I did
dfbyyear2014= df[df['Order Date'].dt.strftime('%Y') == '2014']
So I reworked the dataframe, basicly a pre applied filter
@jovial oriole that's not grouping by though

tidal bough Aug 22, 2020, 11:17 PM

#

well, actually it's not completely wrong

#

it estimated 0 + X + XY-Y^2

#

real answer is 2 + X + XY - Y^2

#

and I'm not sure how it missed the bias

#

...or maybe it's the model as a whole that does it?

jovial oriole Aug 22, 2020, 11:21 PM

#

group = dfbyyear2014.groupby(["Category", "person"]).sum()

odd yoke Aug 22, 2020, 11:22 PM

#

the intercept is in lin.intercept_ @tidal bough

tidal bough Aug 22, 2020, 11:23 PM

#

ah, nice

#

still kinda misleading, since the coef_ also has it, but it's 0 always 😅

odd yoke Aug 22, 2020, 11:23 PM

#

uh, does it ?

velvet thorn Aug 22, 2020, 11:24 PM

#

group = dfbyyear2014.groupby(["Category", "person"]).sum()
@jovial oriole okay, so you actually wanted to filter and then groupby I guess

odd yoke Aug 22, 2020, 11:24 PM

#

coef_ has 6 elems, it fits a x2y2 + b x2y + c xy2 + d xy + e x + f y

#

looks fine to me

#

then we have the + g with the intercept

tidal bough Aug 22, 2020, 11:24 PM

#

so yeah, successful fitting.

#

max absolute error of 3.552713678800501e-15

velvet thorn Aug 22, 2020, 11:25 PM

#

still kinda misleading, since the coef_ also has it, but it's 0 always 😅
@tidal bough no, the bias is not in the coefficient vector

#

I would guess they just both happen to be 0

tidal bough Aug 22, 2020, 11:25 PM

#

@odd yoke

coef_ has 6 elems
that's the problem, they correspond to 1, x, y, x^2, xy and y^2

#

and yet the first of these is actually always 0, which the actual bias is in intercept_

odd yoke Aug 22, 2020, 11:26 PM

#

multi variate polynomials are 1, x, y, x^2y, xy^2, x^2y^2

#

and xy

tidal bough Aug 22, 2020, 11:27 PM

#

Generate a new feature matrix consisting of all polynomial combinations of the features with degree less than or equal to the specified degree. For example, if an input sample is two dimensional and of the form [a, b], the degree-2 polynomial features are [1, a, b, a^2, ab, b^2].
That's the output of polynomialFeatures

#

(that's exactly my case)

velvet thorn Aug 22, 2020, 11:28 PM

#

ah, okay, because you have an explicit constant feature

odd yoke Aug 22, 2020, 11:30 PM

#

ah I see, PolynomialFeatures has a include_bias parameter, and so does LinearRegression (fit_intercept)

#

lin = LinearRegression(fit_intercept=False) fixes it

tidal bough Aug 22, 2020, 11:33 PM

#

ah, that makes sense

#

the slighly more efficient way is probably include_bias=False in the features.

#

(I'm assuming that to add the constant to the result array is cheaper than having the input have one more column).

modest rune Aug 22, 2020, 11:38 PM

#

Thanks yall. I'm afraid I need some time to process what you all are saying. It seems you have confirmed it is easy and possible, I just need to figure out the cobfusion in my head. If I can't make sense of things, I might come back with sample inout data and sone code tomorrow or Monday.

tidal bough Aug 22, 2020, 11:46 PM

#

@modest rune Basically, the general idea is that you can do polynomial curve fitting by only linear regression by generating tons of new features (like, if you have arrays of X and Y coordinates, you also generate arrays of multiples X*Y, X**2, Y**2 (for order=2)) and then fitting a line to this 5-dimensional data. PolynomialFeatures for the former, LinearRegression for the latter (or something with normalization like Ridge)

So in general, you just do

lin = LinearRegression()
model = make_pipeline(PolynomialFeatures(degree), lin)

And then pass it the input and output: the input an array of shape (m,n), the output of shape (m,k), where:
n is the number of points - in my case, a total of 10000 points.
m is the number of dimensions of each input point - in my case, 2.
k is the number of dimensions of each output point. In my case, it's 1. Having it >1 is the same as having several models with the same inputs, but predicting different parameters of the output.

modest rune Aug 23, 2020, 12:09 AM

#

@tidal bough thankyou so much!

lapis sequoia Aug 23, 2020, 12:36 AM

#

@desert oar I dont think it particularly belongs in async, because it's about model training strategies.. and ok

winter citrus Aug 23, 2020, 12:52 AM

#

i'm trying to translate natural language text into text for a program..anyone know where I can get started?

soft dock Aug 23, 2020, 12:57 AM

#

Just check the documentation on spacy.io

prime elm Aug 23, 2020, 1:48 AM

#

is it possible to make a dictionary where the keys increase by an increment

say

a = 5

...
Results in dict = {1: None, 2: None, 3: None. 4: None, 5: None}```

lapis sequoia Aug 23, 2020, 2:43 AM

#

@winter citrus you can use google translate api

desert oar Aug 23, 2020, 3:07 AM

#

@bitter fiber ridge regression is L2 regularization, if that's something you are familiar with

velvet thorn Aug 23, 2020, 3:40 AM

#

is it possible to make a dictionary where the keys increase by an increment

say
a = 5

...
Results in dict = {1: None, 2: None, 3: None. 4: None, 5: None}```

@prime elm ...you want all the keys to be None?

#

>>> dict.fromkeys(range(5))
{0: None, 1: None, 2: None, 3: None, 4: None}

adapt as necessary

desert parcel Aug 23, 2020, 4:10 AM

#

inputs = np.array([
    [1, 2, 3, 4, 5, 6], 
    [7, 8, 9, 10, 11, 12],
    [11, 22, 33, 44, 55, 66],
    [100, 200, 330, 400, 500, 123],
    [99, 123, 33, 32, 12, 44],
    [9999, 123123, 123123, 444343, 5555, 66699]
    ], dtype='float32')

targets = np.array([
    [4], [6], [8], [10], [12], [14]
    ], dtype='float32')

inputs = torch.from_numpy(inputs)
targets = torch.from_numpy(targets)

print(inputs.shape)
print(targets.shape)

train_ds = TensorDataset(inputs, targets)
train_dl = DataLoader(train_ds, shuffle=True)

#

There is a problem with converting the inputs into tensors from numpy arrays

#

Because it says it expected a numpy array but a tensor was given instead

#

But having only one row works just fine

desert parcel Aug 23, 2020, 4:38 AM

#

tensor([[  -2.3799],
        [  -6.7324],
        [ -23.5662],
        [-145.1938],
        [ -52.2061],
        [1322.5454]]

Some of my predictions still have negative values even though I used mse_loss

desert parcel Aug 23, 2020, 4:58 AM

#

never mind I figured it out

bitter harbor Aug 23, 2020, 5:07 AM

#

when computing limits, is it possible to do it all with factoring/one function, or do other methods have to be implemented?

desert parcel Aug 23, 2020, 8:06 AM

#

I have an error with the final line in this file

#

https://hastebin.com/inezunuboz.py

#

Error output:```
untimeError Traceback (most recent call last)

<ipython-input-55-90c5585d3b40> in <module>()
1 opt = torch.optim.Adam(model.parameters(), lr=7)
----> 2 fit(5, model, loss_fn, opt, train_dl, eval_dl, accuracy)

3 frames

<ipython-input-49-afd130f584e4> in forward(self, xb)
18
19 def forward(self, xb):
---> 20 xb = xb.reshape(-1, 784)
21 outputs = self.linear(xb)
22 return outputs

RuntimeError: shape '[-1, 784]' is invalid for input of size 200

#

I have tried stack overflow but the solutions that are covered are part of a more advanced model and I am unable to follow along with it.

#

And there may be a few lines of code that are not needed so don't mind those too much

#

ping me btw

velvet thorn Aug 23, 2020, 8:56 AM

#

ping me btw
@desert parcel uh

#

do you understand

#

what reshaping does?

desert parcel Aug 23, 2020, 9:23 AM

#

yeah

#

at least I think so

#

doesn't it just well

#

reshape a tensor

#

into a difference shape

pearl crystal Aug 23, 2020, 9:25 AM

#

Hi. I have already watched "Udemy_The_Data_Science_Course_2020_Complete_Data_Science_Bootcamp_2020". It was simple and I think it was for beginners and at a shallow level. Could you suggest me better online course (deep knowledge) to become a data scientist? I have M.S. degree in artificial intelligence, thanks.
I do not know where I can ask similar questions about it, here or another channel

desert parcel Aug 23, 2020, 9:25 AM

#

do you understand
@velvet thorn yeah I think so

velvet thorn Aug 23, 2020, 9:25 AM

#

yeah so

#

how can you reshape a tensor of shape (200) into one of shape (1, 784)?

#

it doesn't make sense

#

they have different numbers of elements

desert parcel Aug 23, 2020, 9:26 AM

#

I was following the tutorial

#

And the tutorial didn't have a problem

velvet thorn Aug 23, 2020, 9:27 AM

#

presumably it has different data

#

that's the only explanation

desert parcel Aug 23, 2020, 9:28 AM

#

so then

#

it's the MNIST dataset though

#

So I don't think that's the case

#

unless there are multiple instances

#

or I made an error

#

so what can I do then

#

instead of making it into 784

#

I just change 784 to 200?

#

But the MNIST dataset is a 1x28x28

#

changing it from (-1, 784) to (-1, 200) just gave a matrix multiplication error

#

@velvet thorn

velvet thorn Aug 23, 2020, 9:38 AM

#

it's 784

#

because

#

784 is 1 * 28 * 28

desert parcel Aug 23, 2020, 9:38 AM

#

I understand that

#

But I'm not sure what to do

gentle tide Aug 23, 2020, 9:39 AM

#

I have this endpoint code to get the average stock closing price given a stock name, month, and year

@app.route('/stock=<stock>/date=<date>/average', methods = ['GET'])
def average(stock, date):
    if request.method == 'GET':
        dict = {'FB': 0, 'AAPL': 0, 'NFLX': 0, 'GOOG': 0}
        if stock not in dict:
            return "This stock does not exist. List of stocks (case sensitive): \nFB \nAAPL \nNFLX \nGOOG \n"
        try:
            dt = datetime.datetime.strptime(date, "%Y-%m")
        except:
            return "Please enter a valid month and year \nExample: 12-2020 \n"
        df['date'] = pd.to_datetime(df['date'])
        by_stock_month_year = df[(df["company_ticker"] == stock) & (df['date'].dt.month == dt.month) & (df['date'].dt.year == dt.year)]
        if by_stock_month_year.empty:
            return "There is no available price for that date \n"
        prices = by_stock_month_year["closing_price"]
        data = {}
        data['price'] = round(prices.mean(), 2)
        return json.dumps(data, indent = 2)
    else:
        return "Only GET methods are supported \n"

For this csv file

company_ticker,date,closing_price
AAPL,1989-09-19,1.54
AAPL,1989-09-20,1.59
AAPL,1994-12-08,1.28
AAPL,2019-11-15,265.76
GOOG,2004-08-19,49.98
GOOG,2004-08-20,53.95
GOOG,2019-11-15,1334.87

Is there a way to make this cleaner

desert parcel Aug 23, 2020, 9:40 AM

#

784 is 1 * 28 * 28
@velvet thorn I understand why it's it's 784 but I have no idea what to do next

#

Someone in SO helped me out

#

the solution worked

#

the problem was

#

I left out an argument in a function loss_batch

sweet ember Aug 23, 2020, 12:04 PM

#

Hi, I am trying to scrape emails from yelp by crawling into individual listing. Using bs4 and selenium for it but not able to scrape them. Where do I ask this?

desert oar Aug 23, 2020, 2:15 PM

#

that might be against yelp terms of service, in which case we can't help with that on this server @sweet ember

#

!rules 5

arctic wedgeBOT Aug 23, 2020, 2:15 PM

#

Rules

5. Do not provide or request help on projects that may break laws, breach terms of services, be considered malicious/inappropriate or be for graded coursework/exams.

calm wagon Aug 23, 2020, 4:46 PM

#

how much should i have this?

net = tflearn.fully_connected(net, 12)

and of what size?

tidal bough Aug 23, 2020, 4:49 PM

#

depends on the task

#

hmm, I wonder what are the advantages of using tflearn, really

calm wagon Aug 23, 2020, 4:49 PM

#

;)

#

im making a chat bot in python

tidal bough Aug 23, 2020, 4:50 PM

#

it claims to be higher-level than TF itself, but doesn't TF has its own Sequential class that allows building models the same way?

calm wagon Aug 23, 2020, 4:50 PM

#

so how many should i have?

#

@tidal bough

bitter harbor Aug 23, 2020, 4:51 PM

#

is tflearn separate from tf?

calm wagon Aug 23, 2020, 4:51 PM

#

wdym

tidal bough Aug 23, 2020, 4:52 PM

#

it's a wrapper over TF, basically

#

TFlearn is a modular and transparent deep learning library built on top of Tensorflow. It was designed to provide a higher-level API to TensorFlow in order to facilitate and speed-up experimentations, while remaining fully transparent and compatible with it.

calm wagon Aug 23, 2020, 4:52 PM

#

how much should i have this?
net = tflearn.fully_connected(net, 12)
and of what size?
@calm wagon

tidal bough Aug 23, 2020, 4:52 PM

#

@calm wagon you should probably find some existing simple implemetation/guide and see how they do it

bitter harbor Aug 23, 2020, 4:52 PM

#

speed-up experimentations ok

lapis sequoia Aug 23, 2020, 4:52 PM

#

is it possible to build a face-recognition using Tensorflow

tidal bough Aug 23, 2020, 4:52 PM

#

that, or just guess. 3 layers of 100 neurons or something.

#

is it possible to build a face-recognition using Tensorflow
well, yes, this is one of the things ML tends to be used for 😛

bitter harbor Aug 23, 2020, 4:53 PM

#

that does seem redundant tho

lyric canopy Aug 23, 2020, 4:54 PM

#

!tempmute 739406136981192784 1d Be silent.

arctic wedgeBOT Aug 23, 2020, 4:54 PM

#

:incoming_envelope: :ok_hand: applied mute to @lapis sequoia until 2020-08-24 16:54 (23 hours and 59 minutes).

lapis sequoia Aug 23, 2020, 4:55 PM

#

Dose Tensorflow specifically made for face-recognition stuff like that?

bitter harbor Aug 23, 2020, 4:55 PM

#

tf is specifically made for machine learning yes

tidal bough Aug 23, 2020, 4:56 PM

#

Tensorflow is a pretty low-level framework for machine learning and neural networks.

#

It's not, like

from tensorflow import FaceRecognition
FaceRecognition().recognize(faces)

😛

lapis sequoia Aug 23, 2020, 4:56 PM

#

TensorFlow or Opencv which one is great for face-recognition?

tidal bough Aug 23, 2020, 4:57 PM

#

for a simple solution, google gives me https://pypi.org/project/face-recognition/

bitter harbor Aug 23, 2020, 4:57 PM

#

opencv can't do recognition on it's own

lapis sequoia Aug 23, 2020, 4:57 PM

#

ik

bitter harbor Aug 23, 2020, 4:57 PM

#

whereas tf can train on images

lapis sequoia Aug 23, 2020, 4:57 PM

#

numpy support

#

opencv and numpy

bitter harbor Aug 23, 2020, 4:58 PM

#

not necessarily live ones

#

idk what you're on about

sweet ember Aug 23, 2020, 5:20 PM

#

Thanks @desert oar I was just trying projects on webscraping/crawling for my github. I ll try with someother site

molten hamlet Aug 23, 2020, 5:25 PM

#

cvlib can detect faces

#

https://towardsdatascience.com/object-detection-with-less-than-10-lines-of-code-using-python-2d28eebc5b11

Medium

Object Detection with Less Than 10 Lines of Code Using Python

Find out what objects are in the image

#

and then max pooling extracts numbers from each frame separately 😛

#

for example, image (32x32x1)
image -> convolution of 10 filters -> result is 10 x (30, 30, 1)

desert oar Aug 23, 2020, 5:46 PM

#

@sweet ember wikipedia.org is a good place to start. you also dont need selenium for that which makes it a lot simpler

modest rune Aug 23, 2020, 7:09 PM

#

Recommendations for best way to write dataframes and numpy arrays to a file? I assume numpy and pandas has builtin functionality to do this. Should I use those or is there something better?

#

Whatever direction I go, I'd like something that can handle large datasets and I can be reasonably confident won't break as I upgrade my library versions over the years.

#

Human readable files would be a plus, but not at the cost of huge file sizes.

#

Looking at pandas file IO documentation, it seems they give lots of options. Which one is the best fit?
https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html

#

I assume pickling is a bad idea because pickle files are likely to fail to load if you try to load a pickle file saved in a previous library version?

tidal bough Aug 23, 2020, 7:17 PM

#

Recommendations for best way to write dataframes and numpy arrays to a file?
numpy has save, which saves to a numpy's own npz binary format

modest rune Aug 23, 2020, 7:20 PM

#

I saw that. I am leaning that direction. But, I think I would prefer something that works with numpy AND pandas.

#

specifically, I think I would have to use a different function to save a dataframe to file. Which isn't the end of the world, but not ideal.

bitter harbor Aug 23, 2020, 7:22 PM

#

you can specify the file type with save

#

I usually send it to a .dat

modest rune Aug 23, 2020, 7:23 PM

#

The idea that json is human readable is very attractive... maybe the files wouldn't be too big in json. If I had to guess, the largest files I would save would be maybe 1 billion doubles.

tidal bough Aug 23, 2020, 7:23 PM

#

honestly, a dataframe/array in json will probably not be very readable 😛

bitter harbor Aug 23, 2020, 7:23 PM

#

^^

tidal bough Aug 23, 2020, 7:23 PM

#

by the way, if they are 2d, you can use just csv

bitter harbor Aug 23, 2020, 7:24 PM

#

or excel's version

modest rune Aug 23, 2020, 7:25 PM

#

honestly, a dataframe/array in json will probably not be very readable 😛
@tidal bough

Good point, I hadn't even contemplated what it would look like.

#

Interest point from this stack exchange discussion: "Another useful point is that although ASCII CSV encoding isn't very efficient, using a file compression utility (like zip, gzip, etc.) on your ascii file will typically bring the file size down to something similar to the size of a binary file."
https://scicomp.stackexchange.com/questions/8404/binary-vs-ascii-file-size

Computational Science Stack Exchange

Binary vs. ASCII file size

I need to write some data from a computation, that will be read later by Paraview (.vtu or vtk file).

When it comes to file size , should I go for the ASCII format or the Binary format ?

bitter harbor Aug 23, 2020, 7:31 PM

#

.npy is binary, .npz is compressed @tidal bough btw

#

so german if you care about efficiency/compression i'd suggest looking at numpy's file types

desert oar Aug 23, 2020, 7:39 PM

#

csv is fine depending on what you have

#

npy is good if you have only numerical data

molten hamlet Aug 23, 2020, 7:40 PM

#

cvlib is awesome

#

📎 last_frame.jpg

#

lemon_thinking 😆

#

I saw scissors

#

toothbrush pretty close

#

;d

modest rune Aug 23, 2020, 7:42 PM

#

while(True)
  if (cvlib.object == 'person') AND (cvlib.wields == 'baseball bat'):
    police_state.send_swat_team()

molten hamlet Aug 23, 2020, 7:43 PM

#

omg dont do this @modest rune

#

ah I see

#

xD

#

I always miss jokes

#

📎 last_frame.jpg

modest rune Aug 23, 2020, 7:45 PM

#

hahaha! I just received an email from the Central Intelligence Agency asking me to code their new Auto-Policing robots

#

hahaha

#

what did it label the books?

molten hamlet Aug 23, 2020, 7:47 PM

#

yes

#

📎 bottles.jpg

#

nice ai

#

I got two bottles

#

😄

modest rune Aug 23, 2020, 7:50 PM

#

while(True)
  if (cvlib.object == 'person') AND (cvlib.bottles >= 2) AND (cvlib.ethnicity != 'russian'):
    bar.refuse_service()

tidal bough Aug 23, 2020, 7:50 PM

#

@molten hamlet here, I fixed it for you

📎 cvlib.png

modest rune Aug 23, 2020, 7:54 PM

#

And these examples are why I tell my brother that AI is going to cause a disaster at some point.

molten hamlet Aug 23, 2020, 7:54 PM

#

xD

#

you can use slavic instead, im not russian 😄

#

russian can understand polish but we can't understand theirs :d

modest rune Aug 23, 2020, 7:57 PM

#

Petty Officer Dirk: "General Dukes, the AI has detected the launch of 56 thermonuclear warheads. But, I am pretty sure it is just a bunch of pencils that fell out of a teacher's satchel. The AI is recommending a counter-offensive."

General Dukes: "Son, trust the AI, launch the counter-offensive."

#

@molten hamlet you could be American for all I know 🙂 I didn't mean to insinuate you were Russian. Was only making a joke that Russian's can hold their liquor.

molten hamlet Aug 23, 2020, 8:01 PM

#

nah chill 😄

#

im fine

#

I love that korean soju

#

its cheap in korea

#

but not cheap here due import 😄

#

should I know something specific in detecing road signs or keeping car on road between line and edge 😄

#

got interview tommorow

modest rune Aug 23, 2020, 8:03 PM

#

Can't help you with that. Maybe someone else can chime in.

molten hamlet Aug 23, 2020, 8:05 PM

#

you know any models?

#

I just read that yolo is fast, but is it popular? 😄

modest rune Aug 23, 2020, 8:06 PM

#

Nope, zero experience with machine learning. Other than I have starting trying to get better at curve fitting.

gentle tide Aug 23, 2020, 8:06 PM

#

"dates": "[[\"2004-08-20\", 53.95], [\"2019-11-15\", 1600.63]]"

Does anyone knowhow to get rid of that weird formatting on the dates

untold hare Aug 23, 2020, 8:06 PM

#

@modest rune you have no idea how close that was to actually happen. Soviet early warning syatem got confused by sun reflecring off clouds and assumed nato had launched a first strike.

modest rune Aug 23, 2020, 8:07 PM

#

@untold hare wow.

#

I think a similar thing happened with America's early warning system.

untold hare Aug 23, 2020, 8:08 PM

#

Lots of incidents yeah. There is a good book about this lemme see if I can find it. Basically a must read if you do data science and ML for defense companiea

#

https://www.amazon.com/Army-None-Autonomous-Weapons-Future/dp/0393608980#:~:text=Army of None%3A Autonomous Weapons,9780393608984%3A Amazon.com%3A Books

modest rune Aug 23, 2020, 8:11 PM

#

Cool! Thanks, I might kindle taht.

untold hare Aug 23, 2020, 8:11 PM

#

Do it

languid warren Aug 23, 2020, 8:26 PM

#

Hey someone can help me to fix:

#

print("Train data:")
for i in tqdm(range(0, X_train_windowed.shape[0] - seq_len+1)):
        X_train_Conv_LSTM[i] = current_seq_X
        y_train_Conv_LSTM[i] = y_train[i + seq_len - 1]

(262, 3, 50, 50, 3) X_train_Conv_LSTM.shape = (1, 3, 50, 50, 3) current_seq_X.shape
(262, 1) y_train_Conv_LSTM.shape            = (264,) y_train.shape

cupy\core\core.pyx in cupy.core.core.ndarray.__setitem__()

cupy\core\_routines_indexing.pyx in cupy.core._routines_indexing._ndarray_setitem()

cupy\core\_routines_indexing.pyx in cupy.core._routines_indexing._scatter_op()

cupy\core\_kernel.pyx in cupy.core._kernel.ufunc.__call__()

cupy\core\_kernel.pyx in cupy.core._kernel._get_out_args()

ValueError: Out shape is mismatched```

desert oar Aug 23, 2020, 10:11 PM

#

is there some library that lets you create an index on a column in a pandas dataframe that isnt the index of the dataframe?

#

e.g. some data structure that keeps a sorted collection of rows, or a hash table, and does binary search or a hash lookup to find the dataframe rows that you want (or whatever other index implementation is out there, trees etc)

#

df = pd.DataFrame(...)
product_category_index = ColumnIndex(df['product_category'], algorithm='b-tree')
df_pants = df.iloc[product_category_index('pants')]

something like that

#

would be a fun project if nobody has done this already

peak bolt Aug 23, 2020, 10:22 PM

#

Could someone help me in the help voice channel?

velvet thorn Aug 23, 2020, 11:33 PM

#

is there some library that lets you create an index on a column in a pandas dataframe that isnt the index of the dataframe?
@desert oar hm.

#

not possible in general, because the index would need to update with the DataFrame

desert oar Aug 23, 2020, 11:34 PM

#

good point

#

or you could just, not bother

velvet thorn Aug 23, 2020, 11:34 PM

#

like you could hack it, but it'd be prone to breaking with pandas updates

desert oar Aug 23, 2020, 11:34 PM

#

and the caller would re-index as desired

velvet thorn Aug 23, 2020, 11:35 PM

#

and the caller would re-index as desired
@desert oar then what would the benefit over normal pandas indexing be

#

since filtering is at worst linear, and index-building is at best linear

desert oar Aug 23, 2020, 11:35 PM

#

for big datasets where you already have an index but need to do repeated lookups on non-index fields, or a variety of fields

#

not that uncommon in my work

velvet thorn Aug 23, 2020, 11:35 PM

#

I see

#

fair enough

#

okay I'm going to need you to stop talking about use cases

#

because I don't think I need another side project

desert oar Aug 23, 2020, 11:36 PM

#

lol

velvet thorn Aug 23, 2020, 11:36 PM

#

this is a pretty cool idea

#

I'll see what I can do in an hour

desert oar Aug 23, 2020, 11:36 PM

#


class BaseIndex(metaclass=ABCMeta):
    def __init__(self, data: Sequence[_T]):
        self.data = data

    @abstractmethod
    def lookup(self, val: _T) -> Optional[int]:
        pass


class BinsearchIndex:
    data: Sequence[_T]
    data_sorted: Sequence[_T]
    sort_key: Optional[Callable[[_T], Any]]

    def __init__(self, data: Sequence[_T], sort_key: Optional[Callable[[_T], Any]] = None):
        super().__init__(data)
        self.sort_key = sort_key
        self.data_sorted = sorted(data, key=sort_key)

    def lookup(self, val: _T) -> Optional[int]:
        # https://docs.python.org/3/library/bisect.html#searching-sorted-lists
        i = bisect_left(self.data_sorted, val)
        if i >= len(self.data) or self.data_sorted[i] != val:
            return None
        return i

i slapped this together, not sure if it actually works

velvet thorn Aug 23, 2020, 11:36 PM

#

what do you see this line doing though

#

df_pants = df.iloc[product_category_index('pants')]

desert oar Aug 23, 2020, 11:36 PM

#

yeah

#

looking it up in the index in < O(n) time

#

then looking it up in the dataframe in O(1) time

velvet thorn Aug 23, 2020, 11:36 PM

#

I mean

desert oar Aug 23, 2020, 11:37 PM

#

idk if it actually works that way

velvet thorn Aug 23, 2020, 11:37 PM

#

what's the expected output

#

the column pants sorted by the value of product_category?

#

i.e. df.sort_values(by='product_category')['pants']?

desert oar Aug 23, 2020, 11:37 PM

#

it would be equivalent to df.loc[df['product_category'] == 'pants']

velvet thorn Aug 23, 2020, 11:39 PM

#

wouldn't that just be df[df['product_category'] == 'pants']

#

but okay I get it

#

if you say that the index doesn't need to change with the DataFrame

#

then it seems to me that you could just use a dict

#

where the keys are unique values of the given category and the values are row numbers

#

which would reduce lookups to constant time

desert oar Aug 23, 2020, 11:43 PM

#

thats what i was thinking too

#

that was on my TODO list

#

you could use a B-tree or whatever

#

but yeah a dict is easy

#

also this doesnt support range index lookups (yet)

#

eg if there is more than 1 row with that value

velvet thorn Aug 23, 2020, 11:44 PM

#

time to poke around pandas source code

#

and see what they do with __setattribute__

desert oar Aug 23, 2020, 11:44 PM

#

and obviously something like this is kinda useless except on pretty large dataframes

#

heh have fun 😄

velvet thorn Aug 23, 2020, 11:46 PM

#

indeed

#

how big are your dataframes?

#

honestly I don't think I've ever been at the point that this would be a necessary optimisation

desert oar Aug 24, 2020, 12:04 AM

#

not that big anymore

#

but ive worked on problems with > 1bn rows in memory

#

or where the lookups just needed to be faster than they were

velvet thorn Aug 24, 2020, 12:13 AM

#

and you needed to index on arbitrary columns

#

such that a multi-level index wouldn't have worked?

desert oar Aug 24, 2020, 2:12 AM

#

@velvet thorn thats an interesting option, i still like this separate index idea though 😛

#

im curious if it can actually produce any speed improvements on bigger datasets

flat quest Aug 24, 2020, 2:28 AM

#

and a new project is born

soft dock Aug 24, 2020, 2:59 AM

#

I'm working on a project generating guitar hero charts based on tablature but honestly I don't think I'll ever finish

flat quest Aug 24, 2020, 4:02 AM

#

using ML?

Does it even need ML for that?

desert parcel Aug 24, 2020, 7:38 AM

#

does anyone know what nan means when you're calculating your loss?

molten hamlet Aug 24, 2020, 8:10 AM

#

not a number

#

probably too small

#

or some other numerical error

desert parcel Aug 24, 2020, 9:14 AM

#

Hmm alright

velvet thorn Aug 24, 2020, 9:17 AM

#

also possibly too big

#

or divide by 0

molten hamlet Aug 24, 2020, 9:19 AM

#

ah right

#

divided by zero most possible

velvet thorn Aug 24, 2020, 9:20 AM

#

yeah

molten hamlet Aug 24, 2020, 9:20 AM

#

due to numerical error, some number just get smaller than epsilon

velvet thorn Aug 24, 2020, 9:20 AM

#

too big distinct from that usually comes when your learning rate is too high

#

so gradient descent becomes gradient ascent 🎢

desert parcel Aug 24, 2020, 9:27 AM

#

hmm well I have that right now

#

I messed around with different lrs

#

but it didn't work after changing it differently

velvet thorn Aug 24, 2020, 9:28 AM

#

do you get nan loss immediately?

#

or after a while

desert parcel Aug 24, 2020, 9:28 AM

#

Immediately

velvet thorn Aug 24, 2020, 9:28 AM

#

then it's not that

desert parcel Aug 24, 2020, 9:28 AM

#

Then what could it be

velvet thorn Aug 24, 2020, 9:28 AM

#

too big distinct from that usually comes when your learning rate is too high
@velvet thorn not this

#

the other stuff we said

desert parcel Aug 24, 2020, 9:28 AM

#

huh

velvet thorn Aug 24, 2020, 9:28 AM

#

okay

#

if the loss starts out finite

desert parcel Aug 24, 2020, 9:28 AM

#

so my loss could be too large?

velvet thorn Aug 24, 2020, 9:29 AM

#

but becomes nan after a while

#

(and you see it going up real quick)

#

that suggests that your learning rate is too high

#

because your model's parameters bounce out of the valley of low loss into the skies of float overflow

#

but if your loss starts out as nan

#

that implies that the problem is something else

#

e.g. division by 0 somewhere

desert parcel Aug 24, 2020, 9:29 AM

#

because your model's parameters bounce out of the valley of low loss into the skies of float overflow
@velvet thorn what does that mean

velvet thorn Aug 24, 2020, 9:29 AM

#

do you know how gradient descent works?

desert parcel Aug 24, 2020, 9:29 AM

#

yeah

velvet thorn Aug 24, 2020, 9:29 AM

#

then you should understand that...?

desert parcel Aug 24, 2020, 9:30 AM

#

My english isn't the best lol

velvet thorn Aug 24, 2020, 9:30 AM

#

if your learning rate is too high

#

okay never mind

#

let me draw this

desert parcel Aug 24, 2020, 9:30 AM

#

An increasing gradient requires a low learning rate right?

#

and a decreasing gradient is the opposite of that

velvet thorn Aug 24, 2020, 9:31 AM

#

📎 unknown.png

#

basically

#

hm. I should take drawing classes.

desert parcel Aug 24, 2020, 9:31 AM

#

Naw it's alright lol

#

It's good enough

velvet thorn Aug 24, 2020, 9:31 AM

#

basically if you adjust your weights by too much each iteration it is possible that you will "bounce" to the other side of the loss landscape

desert parcel Aug 24, 2020, 9:31 AM

#

So you wanna check my code? Could it be because of the way I added in my input data?

velvet thorn Aug 24, 2020, 9:31 AM

#

increasing loss in the process

#

So you wanna check my code? Could it be because of the way I added in my input data?
@desert parcel no thank you

desert parcel Aug 24, 2020, 9:31 AM

#

Because I've never done it this way before

velvet thorn Aug 24, 2020, 9:32 AM

#

hard to say, could be a few things

desert parcel Aug 24, 2020, 9:32 AM

#

lol

#

is my code that bad

velvet thorn Aug 24, 2020, 9:32 AM

#

I don't like debugging DL code

#

it's very time-consuming

desert parcel Aug 24, 2020, 9:32 AM

#

📎 unknown.png

velvet thorn Aug 24, 2020, 9:32 AM

#

because of the level of abstraction

#

nothing about you personally

desert parcel Aug 24, 2020, 9:32 AM

#

well I did that there are no issues but I'm just wondering

velvet thorn Aug 24, 2020, 9:32 AM

#

why is your target 2D

#

any reason?

desert parcel Aug 24, 2020, 9:32 AM

#

Because I'm only trying to predict one thing

velvet thorn Aug 24, 2020, 9:33 AM

#

yes, so it should be 1D ,right

desert parcel Aug 24, 2020, 9:33 AM

#

📎 unknown.png

velvet thorn Aug 24, 2020, 9:33 AM

#

yes, do you not see it is 2D

#

9 is the first dimension

#

1 is the second dimension

desert parcel Aug 24, 2020, 9:33 AM

#

Oh yeah

velvet thorn Aug 24, 2020, 9:33 AM

#

(9, 1) is different from (9,)

#

TBH I don't have much experience with Torch so I don't know how it would handle such things

#

but it's at least a little strange IMO

desert parcel Aug 24, 2020, 9:34 AM

#

Well but I have 2, 2D tensors should be fine right?

velvet thorn Aug 24, 2020, 9:35 AM

#

what?

#

didn't get that, sorry

#

are you Malaysian btw

desert parcel Aug 24, 2020, 9:35 AM

#

Oh nice

#

yeah you're right

random perch Aug 24, 2020, 9:35 AM

#

I'm trying to set up an upstream for tensorflow by doing
git remote add upstream git@github.com:tensorflow/tensorflow.git
however when i try to run
git pull upstream master
I get the error seen in the screen shot. If anyone knows what im doing wrong please lmk. Sorry if im intruding in a conversation

📎 Screen_Shot_2020-08-24_at_3.04.55_PM.png

desert parcel Aug 24, 2020, 9:36 AM

#

well I meant that I have two 2D tensors multiplying them together should be fine

velvet thorn Aug 24, 2020, 9:36 AM

#

well I meant that I have two 2D tensors multiplying them together should be fine
@desert parcel okay, you have kind of lost me

#

which two tensors are you multiplying together

desert parcel Aug 24, 2020, 9:36 AM

#

preds and targets

velvet thorn Aug 24, 2020, 9:36 AM

#

I'm trying to set up an upstream for tensorflow by doing
git remote add upstream git@github.com:tensorflow/tensorflow.git
however when i try to run
git pull upstream master
I get the error seen in the screen shot. If anyone knows what im doing wrong please lmk. Sorry if im intruding in a conversation
@random perch you can't pull directly from the TF repo

desert parcel Aug 24, 2020, 9:36 AM

#

preds = model(inputs)

#

model = (11, 1)

velvet thorn Aug 24, 2020, 9:37 AM

#

preds = model(inputs)
@desert parcel ah, okay

#

that seems reasonable

#

could be something else in the data

#

hard to say from here

#

just experiment a little

random perch Aug 24, 2020, 9:37 AM

#

@random perch you can't pull directly from the TF repo
@velvet thorn How do I update my forked repo to match the TF repo

velvet thorn Aug 24, 2020, 9:37 AM

#

okay it's been a while since I actually forked a repo

#

so I don't wanna tell you the wrong thing that I'm not sure about

desert parcel Aug 24, 2020, 9:38 AM

#

Lol i'm not even familiar with git

velvet thorn Aug 24, 2020, 9:38 AM

#

but you might wanna try in #tools-and-devops?

#

think that's more appropriate

random perch Aug 24, 2020, 9:38 AM

#

ite bet

velvet thorn Aug 24, 2020, 9:38 AM

#

actually

#

I feel like what I said might be wrong

#

about not being able to pull directly

#

hm

#

let me try

desert parcel Aug 24, 2020, 9:38 AM

#

so if it's not the tensor issue

#

then it's the data?

velvet thorn Aug 24, 2020, 9:39 AM

#

could be your model too...?

#

@random perchnever mind

#

I'm p sure I'm wrong

desert parcel Aug 24, 2020, 9:40 AM

#

Hmm

velvet thorn Aug 24, 2020, 9:40 AM

#

it's a different issue

desert parcel Aug 24, 2020, 9:40 AM

#

well then I'm not sure how to proceed then

velvet thorn Aug 24, 2020, 9:40 AM

#

Git can't access your credentials

#

are you using Windows or *nix

#

oh okay I think I get it

desert parcel Aug 24, 2020, 9:40 AM

#

do you mean unix?

velvet thorn Aug 24, 2020, 9:40 AM

#

*nix = Unix, Linux, etc.

#

it's because

random perch Aug 24, 2020, 9:41 AM

#

Im using mac

velvet thorn Aug 24, 2020, 9:41 AM

#

you're trying to connect using SSL

random perch Aug 24, 2020, 9:41 AM

#

so unix

velvet thorn Aug 24, 2020, 9:41 AM

#

the SIMPLEST way to fix this

#

is

#

do this instead

#

git remote add upstream https://github.com/tensorflow/tensorflow.git

random perch Aug 24, 2020, 9:41 AM

#

oh mm

velvet thorn Aug 24, 2020, 9:41 AM

#

although I would suggest you look into setting up SSH keys

random perch Aug 24, 2020, 9:41 AM

#

yeah that actually might work lol

velvet thorn Aug 24, 2020, 9:41 AM

#

like you notice

#

the URL is different

random perch Aug 24, 2020, 9:41 AM

#

i did set up my SSH key

#

but idk why its being wack

velvet thorn Aug 24, 2020, 9:42 AM

#

no Mac experience, sorry

random perch Aug 24, 2020, 9:42 AM

#

what do u use

velvet thorn Aug 24, 2020, 9:42 AM

#

Ubuntu

desert parcel Aug 24, 2020, 9:42 AM

#

well with my code the tensors seem to be working fine

📎 unknown.png

velvet thorn Aug 24, 2020, 9:42 AM

#

why is there

#

a nan

#

in the input?

#

if you have nan inputs of course the output will be nan too

desert parcel Aug 24, 2020, 9:43 AM

#

oh yeah

velvet thorn Aug 24, 2020, 9:43 AM

#

well

desert parcel Aug 24, 2020, 9:43 AM

#

I just saw that

#

huh

#

ohh

#

I didn't see that

velvet thorn Aug 24, 2020, 9:43 AM

#

I mean

random perch Aug 24, 2020, 9:43 AM

#

git remote add upstream https://github.com/tensorflow/tensorflow.git
@velvet thorn 10/10 it worked ty!

velvet thorn Aug 24, 2020, 9:43 AM

#

yw

#

I'm not sure how to put this in a non-condescending/offensive way

#

but this is really basic debugging

#

so...yeah...

desert parcel Aug 24, 2020, 9:44 AM

#

Lol I don't get offended easily

#

so no worries

velvet thorn Aug 24, 2020, 9:44 AM

#

like to reiterate

#

you really should take a step back on work on more basic things (like coding and mathematics)

#

this isn't even an architecture problem

desert parcel Aug 24, 2020, 9:46 AM

#

My maths is alright

#

Well the tutorial didn't really cover too much of the math side other than when it is talking about calculating the loss

#data-science-and-ml

Anthony Davis