#data-science-and-ml | Python | Page 380

neat anvil Feb 25, 2022, 2:55 PM

#

overfitting also is usually determined by comparing performance on the validation set versus the performance on the test set, and is really quite impossible to determine w/o at the very least that comparison

pastel valley Feb 25, 2022, 2:55 PM

#

if i used this i should also indicate the labels for my validation set?

somber prism Feb 25, 2022, 2:56 PM

#

wym indicate the labels ?

somber prism Feb 25, 2022, 2:56 PM

#

pastel valley if i used this i should also indicate the labels for my validation set?

https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html

scikit-learn

sklearn.model_selection.train_test_split

Examples using sklearn.model_selection.train_test_split: Release Highlights for scikit-learn 0.23 Release Highlights for scikit-learn 0.23, Release Highlights for scikit-learn 0.24 Release Highligh...

pastel valley Feb 25, 2022, 2:58 PM

#

somber prism https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_...

this means the x ,y right? image and class?

#

somber prism Feb 25, 2022, 2:59 PM

#

pastel valley

yes this is how you do it ```

train_test_split( x, y, test_size = some float, stratify = y
)```

#

only if you specify stratify it will even split the distributions for train and validation

pastel valley Feb 25, 2022, 3:01 PM

#

this is sample of the split i did top is train bottom is validation but its possible that the validation set is unbalanced ? like maybe its composed of 50% class_a right?

#

based on imagedatagenerator its what it does

#

it did not specify if its balance or not

#

is it to necessary to have balanced validation set?

#

oh i see what you mean here is the validation data of fit()

#

but that part about regularization layer drop out meaning during prediction and validation my dropout layer is ignored?

#

drop out layers are just used during training?

neat anvil Feb 25, 2022, 3:16 PM

#

typically dropout layers are only used during training, yes. That doesn't necessarily have to be true but that is the convention.

prime hearth Feb 25, 2022, 3:21 PM

#

hello, i would appreciate any feedback on this article:
https://medium.com/@alexm5492/linear-regression-from-scratch-3-methods-2e803d82137c

pastel valley Feb 25, 2022, 3:22 PM

#

neat anvil typically dropout layers are only used during training, yes. That doesn't necess...

so in my validation if i understand it correctly then dropout are ignored? is it only on fit or also on predict?

neat anvil Feb 25, 2022, 3:24 PM

#

I cannot speak to your particular codebase. You'll have to refer to the documentation and/or explore the source code.

#

The typical situation would be that dropout layers are essentially only actually "in" the model during back-propogation in the training process

#

and are not used at any other time

pastel valley Feb 25, 2022, 3:27 PM

#

its ignored based on this if i understand it correctly

pastel valley Feb 25, 2022, 3:27 PM

#

neat anvil The typical situation would be that dropout layers are essentially only actually...

back prop is the time where the weights are being updated righ?

#

so only half of the units on a layer will be updated?

#

at first i thought the dropout happens during forward prop?

neat anvil Feb 25, 2022, 3:30 PM

#

pastel valley back prop is the time where the weights are being updated righ?

yes. I'd highly consider taking this course if you have any confusion on any of these details we've been discussing. https://www.coursera.org/learn/machine-learning

Coursera

Machine Learning

Learn Machine Learning from Stanford University. Machine learning is the science of getting computers to act without being explicitly programmed. In the past decade, machine learning has given us self-driving cars, practical speech recognition, ...

#

consider taking some time to write a basic neural network "from scratch", meaning only using numpy functions and data structures. It'll really build up a lot of these complicated concepts of the internals of deep learning in your mind

#

why things are done a certain way

#

instead of just having to memorize "people only apply dropout during backwards propagation not during forward propagation b/c that's the way it's done"

pastel valley Feb 25, 2022, 3:33 PM

#

i only read the blog version of it hahaha this series i think https://www.analyticsvidhya.com/blog/2018/12/guide-convolutional-neural-network-cnn/

Analytics Vidhya

Pulkit Sharma

CNN Tutorial | Tutorial On Convolutional Neural Networks

A complete CNN tutorial to learn about what they are and how they work. In this tutorial on convolutional neural networks learn the fundamentals of it.

#

it mentioned about normalization and drop outs also but i forgot and its probably not to detailed compared to the course

#

thank you for the reference 😅 👍

mighty agate Feb 25, 2022, 3:50 PM

#

hello, i'm trying to display my data frame and i don't know why i'm trying to show the elements from my json array it doesn't...

#

import json
import pandas as pd
import matplotlib.pyplot as plt
from pandas.io.json import json_normalize```

#

with open('C:/Users/PC/Desktop/desktop/git/projets/python/Data-Analysis-Velib/station_status.json', 'r') as f:
    velos = pd.DataFrame(json.loads(f.read()))
#df_velos = pd.json_normalize(velos['data'], record_path=['stations'])
df_velos = pd.json_normalize(velos['data'])```

serene scaffold Feb 25, 2022, 3:52 PM

#

@mighty agate use this instead of json.loads or any other explicit file IO: https://pandas.pydata.org/docs/reference/api/pandas.read_json.html

mighty agate Feb 25, 2022, 3:53 PM

#

all right, like that? py with open('C:/Users/PC/Desktop/desktop/git/projets/python/Data-Analysis-Velib/station_status.json', 'r') as f: velos = json.loads(f.read()) #df_velos = pd.json_normalize(velos['data'], record_path=['stations'])

lapis sequoia Feb 25, 2022, 3:59 PM

#

yo

#

a little help please?

#

i am not able to install ecapture module

#

pip install ecapture

#

it says scikit-image wheels cannot build

#

using python 3.10

serene scaffold Feb 25, 2022, 4:04 PM

#

lapis sequoia it says scikit-image wheels cannot build

please do pip install -U pip setuptools wheel and then try the other install again. If that doesn't work, copy and paste the whole console output starting from the command in the paste bin

#

!paste

arctic wedgeBOT Feb 25, 2022, 4:04 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

lapis sequoia Feb 25, 2022, 4:04 PM

#

serene scaffold please do `pip install -U pip setuptools wheel` and then try the other install a...

thanks...

serene scaffold Feb 25, 2022, 6:48 PM

#

It doesn't look like this question is on topic for this channel.

lapis sequoia Feb 25, 2022, 6:49 PM

#

serene scaffold It doesn't look like this question is on topic for this channel.

whoops

strange zealot Feb 25, 2022, 7:05 PM

#

!code

arctic wedgeBOT Feb 25, 2022, 7:05 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

graceful glacier Feb 25, 2022, 7:34 PM

#

i have the following table

#

#

and i want to add a 'Games out of Position' column.

it will detailing how many games he has played in a position that is
NOT the position listed in the 'position' column

#

for example
adrian it would be 2 and then 9,
and for gomez it would be 17, 5, 14(read from top to bottom)

#

how can i accomplish this?

desert oar Feb 25, 2022, 7:58 PM

#

graceful glacier and i want to add a 'Games out of Position' column. it will detailing how many ...

it seems like your data is organized as "triples" of (player, position_preferred, position_played) or something like that. is that right?

graceful glacier Feb 25, 2022, 7:59 PM

#

yes, thats the index

desert oar Feb 25, 2022, 7:59 PM

#

it sounds like you are seeking to aggregate the data to just 1 data point per player

#

so you are looking to create a new table with just player as the index

#

and your dataframe is using a multiindex?

graceful glacier Feb 25, 2022, 8:00 PM

#

no i dont think so. just looking to add a column to the existing table. aggregation might be involved in acchieving this however im not sure

#

yes multi index

desert oar Feb 25, 2022, 8:00 PM

#

well that number seems like a number per player only

#

so you could then re-join that back into the original table, but the number will be repeated for each row for each player

graceful glacier Feb 25, 2022, 8:01 PM

#

no let me just send a picture of what the solution column will look like

desert oar Feb 25, 2022, 8:03 PM

#

ok. i think i am missing some context

#

what do you want to do with this quantity that you calculate?

graceful glacier Feb 25, 2022, 8:06 PM

#

graceful glacier Feb 25, 2022, 8:07 PM

#

desert oar what do you want to _do_ with this quantity that you calculate?

essentialy add up the 'appeared' column for for each player(the first index level) for rows that are not the current row

desert oar Feb 25, 2022, 8:07 PM

#

graceful glacier

hm... so you want the sum of all the other rows for that given player?

graceful glacier Feb 25, 2022, 8:08 PM

#

so for gomez, he played 17 games outside the RB position, 5 games outside the RBP position and 14 games outside the SUB position

graceful glacier Feb 25, 2022, 8:08 PM

#

desert oar hm... so you want the sum of all the _other_ rows for that given player?

yes

desert oar Feb 25, 2022, 8:08 PM

#

i think i understand that. what i recommend is writing a function that takes a dataframe as an input and returns this new column, that works on one single player. then you can do a .groupby(level='player name').apply(your_new_function) to get your desired column

graceful glacier Feb 25, 2022, 8:10 PM

#

ok i think i got it

#

a window function totaling the total appearences for that player

#

and subtract that number from the appeared column

desert oar Feb 25, 2022, 8:11 PM

#

im not sure you need a window function, but that is one way to do it

#

you would probably need to define a custom "window" which i think is pretty complicated

#

i tried to do it once and gave up, the docs weren't clear and the examples that ship with pandas are kind of convoluted internally

#

that's why i suggested groupby

#

honestly i'd just loop over rows inside the function

#

these individual per-player dataframes are so small that the performance isn't important

graceful glacier Feb 25, 2022, 8:12 PM

#

desert oar you would probably need to define a custom "window" which i think is pretty comp...

hmm im not sure how to do this, but i can bypass this by resetting index right

desert oar Feb 25, 2022, 8:12 PM

#

i wouldn't even bother

graceful glacier Feb 25, 2022, 8:13 PM

#

ok, ill try both way none the less and learn something new.

graceful glacier Feb 25, 2022, 8:14 PM

#

desert oar i tried to do it once and gave up, the docs weren't clear and the examples that ...

i personally need to get better at debugging with pandas bc what i do understand abt how pandas works is pretty loosy goosey in my head

desert oar Feb 25, 2022, 8:14 PM

#

the super-naive way is something like this:

def compute_player_oop(player_df):
    result = []
    for row in player_df.itertuples():
        other_rows = player_df.loc[player_df.index != row.index]
        other_total = other_rows['appeared'].sum()
        result.append(other_total)
    return pd.Series(result, index=player_df.index)

data['games_out_of_position'] = data.groupby(level='player name').apply(computer_player_oop)

#

something like that anyway

#

untested code written by volunteer strangers etc.

graceful glacier Feb 25, 2022, 8:15 PM

#

ahhh nice. thanks by the way

desert oar Feb 25, 2022, 8:15 PM

#

this is pretty inefficient but easy

graceful glacier Feb 25, 2022, 8:15 PM

#

desert oar untested code written by volunteer strangers etc.

the best type of code

desert oar Feb 25, 2022, 8:15 PM

#

there are probably faster ways using set operations on indexes or something with window functions

graceful glacier Feb 25, 2022, 8:15 PM

#

desert oar this is pretty inefficient but easy

i always hesitate before using loops

desert oar Feb 25, 2022, 8:15 PM

#

yeah for small dataframes it's fine

#

if you are looping over 5 rows who cares

#

if you are looping over 5000 rows even who cares

graceful glacier Feb 25, 2022, 8:16 PM

#

true

#

ok this was constructive, thanks

desert oar Feb 25, 2022, 8:17 PM

#

the reason to avoid loops on small datasets is more for readability + concision than for performance

#

you're welcome

serene scaffold Feb 25, 2022, 8:18 PM

#

you're welcome to who? I was going to say that sometimes the only reason I encourage people to avoid loops is to force them to learn the API lemon_sweat

desert oar Feb 25, 2022, 8:18 PM

#

serene scaffold you're welcome to who? I was going to say that sometimes the only reason I encou...

for sure, that's a good reason too

#

idk if there's a tidy pandas solution for this problem though

#

"un-selecting" one row at a time

serene scaffold Feb 25, 2022, 8:19 PM

#

also I see that you're speaking to Ahmad. I was momentarily confused because you wrote that message quite quickly while I was typing, so I thought that was related.

desert oar Feb 25, 2022, 8:19 PM

#

basically a "leave-one-out" operation

desert oar Feb 25, 2022, 8:19 PM

#

serene scaffold also I see that you're speaking to Ahmad. I was momentarily confused because you...

haha no but i was curious if you were going to chime in with a nicer approach

#

it'd be nice to have a general efficient idiom for "leave-one-out" operations with pandas

#

or even "leave-n-out", like an inverse window function

#

could be a good SO question (if it isn't already)

serene scaffold Feb 25, 2022, 8:20 PM

#

if you ever figure it out, you can ask it and answer it

#

if there's a way to make a boolean series where n are True, you can use that to get the retained rows and invert it with ~ to get those that aren't.

desert oar Feb 25, 2022, 8:24 PM

#

that's pretty much what i did, other_rows = data.loc[this_row.index != data.index]

#

(assuming that the indexes are unique, which they really should be)

#

non-unique indexes in pandas are just a Bad Idea, like non-string column names

minor elbow Feb 25, 2022, 8:51 PM

#

u could sum all of them and then subtract the row

marsh juniper Feb 25, 2022, 8:52 PM

#

why there is no data in due_date and a few other columns when the dataset originally has it

#

please if anyone can help me asap, I have to turn in my project

desert oar Feb 25, 2022, 9:00 PM

#

minor elbow u could sum all of them and then subtract the row

that's a good solution for this particular case

frozen hedge Feb 25, 2022, 9:15 PM

#

does anyone know how to use kronecker delta or levi civita symbols with autograd / jax?

wide rose Feb 25, 2022, 10:38 PM

#

Anyone have idea how to stack 3d matricies in numpy?

#

corgi_1 = np.asarray(cv2.imread(corgi_jpgs[0]))
corgi_2 = np.asarray(cv2.imread(corgi_jpgs[1]))
corgi_3 = np.asarray(cv2.imread(corgi_jpgs[1]))

corgis = np.stack((corgi_1, corgi_2))
corgis.shape```

#

(2,100,100,3)

#

when I try to add the third one to "master" array I get a value error

#

corgi_3.shape
(100,100,3)

#

do I just need to add an extra dimension along the fourth axisa

serene scaffold Feb 25, 2022, 10:41 PM

#

@wide rose see if this or one of the other functions listed has the desired effect: https://numpy.org/doc/stable/reference/generated/numpy.vstack.html

wide rose Feb 25, 2022, 10:41 PM

#

tried it same error

#

tried literally all of them hahaha

#

before coming here

#

corgi_3 = np.expand_dims(corgi_3, axis=0)
corgis = np.stack((corgis, corgi_3))```

#

doesnt work

#

vstack gets it with the extra dim added

#

cool thanks

iron basalt Feb 25, 2022, 10:44 PM

#

wide rose Anyone have idea how to stack 3d matricies in numpy?

https://numpy.org/doc/stable/reference/generated/numpy.concatenate.html

wide rose Feb 25, 2022, 10:44 PM

#

yea I just had to add the extra dim for vstack to work

#

wasnt expecting that behavior but in retrospect makes perfect sense

#

realized as soon as I typed here :/

desert oar Feb 25, 2022, 11:03 PM

#

wide rose do I just need to add an extra dimension along the fourth axisa

yes

#

vstack and hstack don't add dimensions for you

#

you can use .reshape to "wrap" with an extra dimension

wide rose Feb 25, 2022, 11:04 PM

#

yea I used expand dim

desert oar Feb 25, 2022, 11:04 PM

#

ah i see, yep

wide rose Feb 25, 2022, 11:04 PM

#

I get so use to numpy being so smart sometimes i do dumb shit like that lmao

desert oar Feb 25, 2022, 11:05 PM

#

it'd be nice to have a function like "concat with extra dimension" function

#

there are stack_columns and stack_rows but i think those are 2d-only

wide rose Feb 25, 2022, 11:20 PM

#

desert oar it'd be nice to have a function like "concat with extra dimension" function

yea it would

#

wonder how hard that would be

iron basalt Feb 25, 2022, 11:27 PM

#

>>> def combine(x, y):
...     return np.concatenate((np.expand_dims(x, axis=0), np.expand_dims(y, axis=0)))

#

Make it asserts that x and y have the same dims though*

#

Or use stack*

desert oar Feb 25, 2022, 11:36 PM

#

yeah it's not that bad

#

just nobody did it

misty flint Feb 25, 2022, 11:50 PM

#

has anyone been able to play around with gpt-3 before? how was your experience

#

blobhyperthink

graceful glacier Feb 26, 2022, 12:35 AM

#

i have the following table

#

#

and i want to apply somthing like this

#

np.where(dfx['space'], dfx['full_name'].str.extract('.*\s(.*)'), dfx['full_name'])

#

its raises an error and i understand why

#

any ideas on how to execute this?

#

basically i want to get the last name, which is anything after the first space, if a space exists

#

i found a round about way of doing this(i created multiple columns and then removed them) but i would still like to know if its possible with one line of code

wide rose Feb 26, 2022, 12:59 AM

#

iron basalt ```py >>> def combine(x, y): ... return np.concatenate((np.expand_dims(x, ax...

I meant like actually add it to the numpy library

wide rose Feb 26, 2022, 12:59 AM

#

misty flint has anyone been able to play around with gpt-3 before? how was your experience

I think you can with hugging face iirc

misty flint Feb 26, 2022, 1:03 AM

#

gpt-2 yes, openai-gpt (gpt1) yes, not gpt-3

#

ive used the first two in my NLP class

minor elbow Feb 26, 2022, 1:06 AM

#

u can use np.newaxis/None as a new dim eg np.ones((3,4))[:,:,np.newaxis].shape

pastel valley Feb 26, 2022, 1:46 AM

#

if i dont plan to validate or change my model do i just use train and test set? and use the test set in validation_data ?

#

instead of using validation set in validation_data for fit() method then i can just use the test set to evaluate it? or it is necessary to just evaluate the model on test set after the training?

prime hearth Feb 26, 2022, 2:18 AM

#

hello, i would liek to please ask, im currently self teaching machine learning. The algorithms I learned mostly are about linear regression and neural networks and logitic regression. I learned a bit about featring engineerin like how to handle misisng values or imbalance dataset and do hyper paramter tuning a bit. I am focusing deeper now in NLP

#

am i on the right path to landing first internship?

#

im also making a machine learning project with NLP that uses full stack and kubernet

frosty flower Feb 26, 2022, 2:23 AM

#

I have an array with shape (2, 3) and an array with shape (2, )

#

I want to do an element-wise division

#

a = [[1,2,3], [4,5,6]]
b = [3, 3]
# Want: 
c = [[0.333, 0.666, 1], [1.333, 1.666, 2]]

prime hearth Feb 26, 2022, 2:26 AM

#

you can just do

#

a/b

#

assuming b is 1 element

frosty flower Feb 26, 2022, 2:28 AM

#

It says "operands could not be broadcast together with shapes (2,3) (2,)"

prime hearth Feb 26, 2022, 2:30 AM

#

yeah because must be same shape

#

since b has same element

#

can just use first index

minor elbow Feb 26, 2022, 2:31 AM

#

frosty flower I want to do an element-wise division

a / b[:,np.newaxis]

minor elbow Feb 26, 2022, 2:34 AM

#

pastel valley instead of using validation set in validation_data for fit() method then i can j...

you can use the validation data as a test set yes. the idea of the validation set is to keep some of your data from any train/test hyperparameter optimization process for the final model evaluation. if ur not doing any paramer or model changes, you can use the test set for more training data points and the validation set as the test set

frosty flower Feb 26, 2022, 2:35 AM

#

minor elbow ```a / b[:,np.newaxis]```

Thankssss

#

It works. Do you mind explain what exactly the code does?

minor elbow Feb 26, 2022, 2:36 AM

#

it is as the other commenter mentioned they need to be broadcastable. there is a good explanation here https://numpy.org/doc/stable/user/basics.broadcasting.html

pastel valley Feb 26, 2022, 2:37 AM

#

minor elbow you can use the validation data as a test set yes. the idea of the validation se...

will it be the same as this?

#

or maybe at first i split the trainin data to tain and validation set and after some changes i use all training data for training set and test set for validation?

minor elbow Feb 26, 2022, 2:39 AM

#

ok theres 2 things here

#

lets say u just have a train and test set

#

you build your model on the train set, and it seems to go ok, then you run it on your test set but it does not do as well.

#

so you change something in your model, do the training again on the train set, test it again with the test set and it does better

#

at this point u have no idea how your model will perform on unseen/new data

#

like it should be ok, but you dont actually know, because you have reused ur test set and have potentially implicitly introduced overfitting

#

so how do u know? the validation set

#

if u go and make changes after evaluating performance on some new data set, that new data set cease to be a useful indicator of how the model will perform on unseen data

#

you explictly said in your question you would not be making any changes to the model

#

so u need to be clear if you are going to tweak the model, or accept its performance as is

pastel valley Feb 26, 2022, 2:44 AM

#

i am trying to compare image augmentation techniques on which method will give the best raw performance in order to do so without bias i created a model from scratch and compare their performance without optimizing the model because if i do change the models it should always be identical and if i do that optimizing it will just make biases for the models

#

if i explained it right so i dont need to use validation i just create 2 identical models train with different augmented techniques and test which one gives the best raw performance
by raw performance i mean no optimizations or tweak to be applied to the models so they will stay the same

minor elbow Feb 26, 2022, 2:47 AM

#

yeah i think if i understand u, just a train and test set is needed and you can report the results for each augmenation type

pastel valley Feb 26, 2022, 2:48 AM

#

i even go to the lengths of copying the initial weights of models so they really have the same starting point

minor elbow Feb 26, 2022, 2:48 AM

#

yeah you should be able to set the random seed which should give the same initialization

pastel valley Feb 26, 2022, 2:49 AM

#

minor elbow yeah i think if i understand u, just a train and test set is needed and you can ...

and if i implement it using keras do i use the test set in validation_data during training or on evaluate() after training? is the the same?
btw am using tensorflow

minor elbow Feb 26, 2022, 2:49 AM

#

i am not familiar enough with those libraries to be able to say

pastel valley Feb 26, 2022, 2:49 AM

#

minor elbow yeah you should be able to set the random seed which should give the same initia...

this is what i did to copy the weights is there a correct procedure?
cst_model.set_weights(gt_model.get_weights())

minor elbow Feb 26, 2022, 2:51 AM

#

i prefer setting the seed unless you are really sure there are no other random things happening in other layers

#

but im not really a dl person

#

i dont know enough about keras/tf to be able to say either way

pastel valley Feb 26, 2022, 2:52 AM

#

this is what is says on validation data of sequential.fit()
if i understand it correctly the models is seeing the validation every end of epoch but its not being trained for it right? so its the same as testing the models on new data did i understand it correct?

pastel valley Feb 26, 2022, 2:53 AM

#

minor elbow but im not really a dl person

what dl?

minor elbow Feb 26, 2022, 2:53 AM

#

deep learning

#

yeah the validation_data wont be used to train, so u could use your test set there and the results of your experiment would be whatever the final metrics are from the final epoch

pastel valley Feb 26, 2022, 2:55 AM

#

nice nice

#

btw this is the example i trained without validation data

#

the metrics showed their are the blind guess of the model on training right?

minor elbow Feb 26, 2022, 2:57 AM

#

im not 100% but if you didnt provide any validation data then yeah they would be the training data metrics

#

its good to track the metrics with a validation set

#

im always really suspicious of a model that learns really well such as that one seems to be doing

#

its often a sign of the model overfitting the training data

#

but it depends on the type of data you are working with, sometimes things do work very well

pastel valley Feb 26, 2022, 3:01 AM

#

overfitting is when the model is too good on training but garbage on new data right?

minor elbow Feb 26, 2022, 3:01 AM

#

yeah

pastel valley Feb 26, 2022, 3:01 AM

#

underfitting is what?

#

not learning enough?

minor elbow Feb 26, 2022, 3:02 AM

#

more or less

#

its the bias/variance tradeoff

pastel valley Feb 26, 2022, 3:03 AM

#

if the model predict something like this what does it mean?

minor elbow Feb 26, 2022, 3:04 AM

#

thats just the class

pastel valley Feb 26, 2022, 3:04 AM

#

also softmax is probability distribution right?
so if am getting 1 is the model too confident of the answer?

minor elbow Feb 26, 2022, 3:05 AM

#

softmax is a probability-like distribution

pastel valley Feb 26, 2022, 3:05 AM

#

minor elbow thats just the class

so the middle one is the predicted class?

minor elbow Feb 26, 2022, 3:05 AM

#

yeah

pastel valley Feb 26, 2022, 3:05 AM

#

minor elbow softmax is a probability-like distribution

what does it mean?

minor elbow Feb 26, 2022, 3:05 AM

#

the element with the highest number is the most likely class

#

relative to the others

pastel valley Feb 26, 2022, 3:06 AM

#

the overall values of the classes should be 1 right?

minor elbow Feb 26, 2022, 3:07 AM

#

the values should all sum to 1 yes, thats what makes it probability-like

#

and they are all in the range [0,1]

pastel valley Feb 26, 2022, 3:08 AM

#

oh isee thank you

#

btw how is overfitting underfitting solved?

minor elbow Feb 26, 2022, 3:09 AM

#

hyper parameter optimization, regularisation, different types of models, stuff like that

#

its kinda a how long is a piece of string type thing

#

depends on ur data, model, approach, desired outcome etc

pastel valley Feb 26, 2022, 3:39 AM

#

in my case if i am comparing if the other model is overfitting while the other one is not then i can say the dataset is the one responsible ?

minor elbow Feb 26, 2022, 5:25 AM

#

overfitting is fundamentally a model parameter problem

#

but it may also be due to not having enough data

#

in which case, you can still change model paramters to prevent overfitting, however it likely wont be a very good model

lone drum Feb 26, 2022, 5:28 AM

#

Hello
I am trying df.to_csv()
But it is not creating CSV file
It has created folder name as file name but not file
Ping me when replying

#

I am not getting output as CSV file in specified path

mint palm Feb 26, 2022, 6:10 AM

#

is state space search heuristic method??

lapis sequoia Feb 26, 2022, 6:53 AM

#

mint palm is state space search heuristic method??

can you say which state space search method?

#

like best first search uses heuristic, while bfs/dfs/uniform search doesnt.

serene scaffold Feb 26, 2022, 6:54 AM

#

lone drum Hello I am trying df.to_csv() But it is not creating CSV file It has created fo...

Please show the exact code

mint palm Feb 26, 2022, 7:05 AM

#

lapis sequoia can you say which state space search method?

simple state space that acts like BFS....and its one variant that tracks visited nodes that prevent cycles

#

both are not heuristic, right??

lapis sequoia Feb 26, 2022, 7:07 AM

#

mint palm both are not heuristic, right??

yes. bfs does not use heuristic, and even while we track nodes, its not heuristic.

mint palm Feb 26, 2022, 7:08 AM

#

#

look at this

lapis sequoia Feb 26, 2022, 7:13 AM

#

mint palm look at this

I am not aware of the source of given slide, but as much I've studied in this field, I never used heuristics in bfs and dfs while making programms.

#

hill climber needs heuristic but not bfs or dfs.

mint palm Feb 26, 2022, 7:13 AM

#

yeah i agree its wrong...it even specifies blind bfs...which has no heuristic

lapis sequoia Feb 26, 2022, 7:14 AM

#

may be the person put it to explain that we dont do that in bfs and dfs by putting blind because well, it does not have any info about environment, except what environment serves as next state.

mint palm Feb 26, 2022, 7:14 AM

#

hmm may be

pastel valley Feb 26, 2022, 8:17 AM

#

how can i get the the true positives etc from keras after training so that i can create confusion matrix? multiclass

lapis sequoia Feb 26, 2022, 8:20 AM

#

pastel valley how can i get the the true positives etc from keras after training so that i can...

you have y_test and y_pred both are enough.

pastel valley Feb 26, 2022, 8:37 AM

#

y pred is i get from this?

#

what is y test?

odd meteor Feb 26, 2022, 8:54 AM

#

pastel valley what is y test?

Y_test = your target variable's true value (the portion which has not been seen by your model.) Y_train on the other hand is also the true value of your target variable but the portion used to train your model.

Y_pred = The prediction made by your model

teal mortar Feb 26, 2022, 8:56 AM

#

anyone saw any difference between normalising whole data vs using only batch normalisation, or using both at the same time, what is a more optimal solution?

pastel valley Feb 26, 2022, 8:59 AM

#

odd meteor Y_test = your target variable's true value (the portion which has not been seen...

oh so the y test is the one i will provide manually?

#

during prediction the row here are the test images and the column are the classes right?

#

#

but the order is not the same from the generator to me trying to manually input a single image

#

i tried to predict using the very first test image and the output is different from the 1st row of the predictions using test_generator

#

i did not use the shuffle parameter so my test set probably reading the test images in order?

#

oh it default shuffles hahaha my bad

odd meteor Feb 26, 2022, 9:27 AM

#

teal mortar anyone saw any difference between normalising whole data vs using only batch nor...

Normalising your whole neural network inputs improves your model no doubt. But remember that deeper layers are trained based on the output of the previous layer. And since the weight gets updated via gradient descent, the consecutive layers unfortunately will no longer benefit from the earlier normalisation since they need to adapt to the previous layer's weight changes; hence, finding it much troublesome to learn their own weight!

With Batch Normalisation, we can evade such incidence with finesse! This is because Batch Normalization makes sure that, independently of the changes, the input to the next layer is normalized. And above all, it does this inna smart way with trainable parameters that also learn how much of this Normalization kept scaling or shifting it.

I hope you understand it better now. ✌️

teal mortar Feb 26, 2022, 9:32 AM

#

odd meteor Normalising your whole neural network inputs improves your model no doubt. But r...

thanks, is there there any sense at all to normalise whole dataset and use it with batchnormalization compared to using only batchnormalization?

odd meteor Feb 26, 2022, 9:34 AM

#

pastel valley oh so the y test is the one i will provide manually?

I'm not sure I understand what you mean mean by 'providing Y_test manually' but both Y_test and Y_train are gotten from the original Y when you split your whole data into train set and holdout set.

Y_test is very important because that's what you'll use to evaluate the accuracy/lapses/difference in the prediction made by your model (Y_pred).

pastel valley Feb 26, 2022, 9:37 AM

#

odd meteor I'm not sure I understand what you mean mean by 'providing Y_test manually' but ...

i used flow from directory to get my test images and this is what it returns

#

odd meteor Feb 26, 2022, 9:37 AM

#

teal mortar thanks, is there there any sense at all to normalise whole dataset and use it wi...

I guess maybe if you aren't using NN. Again, it also might depend on the individual, the task, and how they wanna approach the problem. But as always, Batch Normalization >>>>>

pastel valley Feb 26, 2022, 9:39 AM

#

if i try to get the y
its said too much to unpack

#

how to i get my y_test from the test_generator?

odd meteor Feb 26, 2022, 9:43 AM

#

pastel valley

I don't see any error here though. I'm more of an NLP guy (because that's what I'm learning at the moment) So I don't have enough experience in Computer Vision yet.

So if the problem is actually beyond what's in the pics then, other people here can help out

odd meteor Feb 26, 2022, 9:44 AM

#

pastel valley if i try to get the y its said too much to unpack

Consider increasing your number of batches, perhaps it'll help.

pastel valley Feb 26, 2022, 9:44 AM

#

i got it now this helped me https://stackoverflow.com/questions/45413712/keras-get-true-labels-y-test-from-imagedatagenerator-or-predict-generator

Stack Overflow

Keras: Get True labels (y_test) from ImageDataGenerator or predict_...

I am using ImageDataGenerator().flow_from_directory(...) to generate batches of data from directories.

After the model builds successfully I'd like to get a two column array of True and Predicted ...

#

i now get the true values of my test set

#

btw is this normal predictions for the model? .

#

alot of negatives some doesnt even have a positive prediction that means the model cant predict that input?

acoustic halo Feb 26, 2022, 11:03 AM

#

pastel valley alot of negatives some doesnt even have a positive prediction that means the mod...

it's e powers again, they are just small fractions

lapis sequoia Feb 26, 2022, 11:18 AM

#

its this simple lol

#

1eN = 1 x 10^N

pastel valley Feb 26, 2022, 11:48 AM

#

acoustic halo it's e powers again, they are just small fractions

its negative values right?

acoustic halo Feb 26, 2022, 11:48 AM

#

no

pastel valley Feb 26, 2022, 11:49 AM

#

aww

pastel valley Feb 26, 2022, 11:49 AM

#

lapis sequoia its this simple lol

here is the example haaha my bad

#

oh you guys back at it again haha

merry wadi Feb 26, 2022, 1:23 PM

#

I'm working with some data and its ballooned into a massive amount of if statements that need to include or exclude certain key words. Looking like this
if ((routes[0] == 'IN' or routes[0] == 'BANG') and (routes[1] == 'IN' or routes[1] == 'BANG') ): return 'DBL DIG' if (((routes[0] == 'IN' or routes[0] == 'BANG') and routes[1] == 'UNDER') or (routes[0] == 'UNDER' and (routes[1] == 'IN' or routes[1] == 'BANG')) ): return 'DRIVE 6' if ((routes[0] == 'UNDER' and (routes[1] == 'SHORT OUT') and (routes[2] !='SHORT OUT' and routes[2] != 'RETURN' and routes[2] != 'RETURN')) ): return 'DRIVE 7'

I think to simplify this I'd be able to use a dictionary structure but I'm unsure how to proceed/ how it would work to exclude certain values as well. Any help would be appreciated!

urban prism Feb 26, 2022, 1:49 PM

#

Would it overfit if I were to run a model, save it and then run it again with the same data?

midnight crater Feb 26, 2022, 2:05 PM

#

Hello, I want to create a dataset regarding heart rate and oxygen saturation that will determine whether a person has fainted 0 or 1. My question is how will I get the fainted value for training?

Im creating my own dataset because I couldn't find any dataset that have these features.

serene scaffold Feb 26, 2022, 3:34 PM

#

midnight crater Hello, I want to create a dataset regarding `heart rate` and `oxygen saturation`...

you have to already know if the person fainted or not. if you can figure out if the person fainted using some function of their heart rate and oxygen saturation, then you don't need ML.

arctic venture Feb 26, 2022, 3:45 PM

#

hello

serene scaffold Feb 26, 2022, 3:48 PM

#

arctic venture hello

hello, do you have anything to say?

midnight crater Feb 26, 2022, 3:49 PM

#

serene scaffold you have to already know if the person fainted or not. if you can figure out if ...

I've used the wrong word, It should predict whether the person will faint or have a chance of fainting.

serene scaffold Feb 26, 2022, 3:49 PM

#

midnight crater I've used the wrong word, It should predict whether the person will faint or hav...

so you already have whether or not they fainted in the training data?

#

it might be easiest to just show the data that you have. like copy/pasting the CSV into the chat, if you have that.

midnight crater Feb 26, 2022, 3:54 PM

#

serene scaffold so you already have whether or not they fainted in the training data?

not yet, I don't have any data yet because I'm having trouble with the value for the faint column. In short, I want to create my own dataset (because I can't find anything on kaggle nor google) but I don't know how to fill the column for faint.

Here's an example of the dataset that I would want.

midnight crater Feb 26, 2022, 3:55 PM

#

serene scaffold it might be easiest to just show the data that you have. like copy/pasting the C...

sadly, I don't have any dataset (I couldn't find any on google nor kaggle). But I am willing to create one. I have bought sensors MAX30102) to be used in data gathering (HR and SPO2)

serene scaffold Feb 26, 2022, 3:56 PM

#

For the rest of this conversation, I will only look at text (no screenshots).

So, like I said, you have to already know if they fainted or not. The point of machine learning is that it learns from real examples of the inputs (gender, age, HR, SPO2) and outputs (FAINT). So, you don't have access to a dataset with this information, you will have to conduct a study.

#

The alternative is to make up fake answers and see what happens, if this is just for educational purposes.

midnight crater Feb 26, 2022, 3:57 PM

#

serene scaffold For the rest of this conversation, I will only look at text (no screenshots). S...

my bad. won't happen again

midnight crater Feb 26, 2022, 3:58 PM

#

serene scaffold The alternative is to make up fake answers and see what happens, if this is just...

this is my last resort of doing it. But is it possible to use ML without dataset and only parameters?

serene scaffold Feb 26, 2022, 4:00 PM

#

midnight crater this is my last resort of doing it. But is it possible to use ML without dataset...

So, there's X data and y data. X are the pieces of information that supposedly cause y. Your Xs are (gender, age, HR, SPO2), and your y is (FAINT). You're asking if you can predict y given only X, but you don't actually know what y is.

midnight crater Feb 26, 2022, 4:01 PM

#

serene scaffold So, there's X data and y data. X are the pieces of information that supposedly c...

yes

serene scaffold Feb 26, 2022, 4:02 PM

#

if you make a model that returns a y value for a given X, but you don't know what y is, you'll never be able to confirm that the model is correct.

midnight crater Feb 26, 2022, 4:03 PM

#

well, that makes sense. it's really hard to find data that have similar attributes as mine

#

that's my only problem

serene scaffold Feb 26, 2022, 4:03 PM

#

you can use unsupervised learning, which could tell you which X instances are more similar, and you might discover that there's two discernable subsets. But you'd have no way of knowing which is "faint" and which is "did not faint".

serene scaffold Feb 26, 2022, 4:04 PM

#

midnight crater well, that makes sense. it's really hard to find data that have similar attribut...

you might look for datasets with similar features (gender, age, HR, and SPO2 are all features) and see if you can derive these features from that.

#

for example, if you had a dataset of people that gave their birthday (as a timestamp), but you want their age (as a number of years), you could calculate that based on what you know about how age works.

midnight crater Feb 26, 2022, 4:08 PM

#

serene scaffold for example, if you had a dataset of people that gave their birthday (as a times...

I get this one and it's kinda making sense to me now

#

thanks!!

steep cypress Feb 26, 2022, 5:21 PM

#

Hello everyone, I've been learning ML, in neural networks now [Andrew Ng course]...was wondering if I should go ahead and try out neural network implementation in pytorch tutorial docs or try and implement it from scratch first, from course material and other resources. I do have a basic knowledge of the working, feed forwards, backprop and stuff [sentdex, 3b1b] BUT I wanted to try it out on a dataset and pytorch docs seems fun. What should I go for first?

frosty flower Feb 26, 2022, 5:22 PM

#

I'm coding (manually) a MNIST categroizer using no hidden layer. So it's just input -> output -> softmax -> cross entropy loss. I was trying to calculate the output's derivative wrt the loss, and stumbled upon this formula:

#

So... I don't even need to calculate the loss to do back prop??

#

thoonk

graceful glacier Feb 26, 2022, 5:40 PM

#

hello

#

i was wondering if you could call a function from within .assign() that returns a dataframe instead of a series

#

so somthing like this

#

...asign(new_col = lambda dfx : get_df(dfx)[0])

#

would the '[0]' part be sufficient to turn it back into a series?

cosmic lynx Feb 26, 2022, 5:52 PM

#

how large is difficulty spike for starting to learn how to make machine learning? ngl, I still feel like I'm shaking off rust, and I have no experience with interacting with large datasets...
Doing a little digging, it sounds kind of necessary to learn SQL to a certain degree at very least.

balmy willow Feb 26, 2022, 5:52 PM

#

i made ai tictactoe guys

#

chk out this link

#

https://youtu.be/EujJPqNbeBo

YouTube

Sri Kumar

Knowledge Based AI TIC TAC TOE (Declarative Approach)

First move is played by computer & for now it plays it's first move as the middle box only
You can see how the AI wins /makes game tie in the vedio
Thanks for watching
Consider sharing&subscribing

▶ Play video

serene scaffold Feb 26, 2022, 5:59 PM

#

cosmic lynx how large is difficulty spike for starting to learn how to make machine learning...

ML is a large space. you can do some stuff by taking datasets, cleaning the data, and plugging it to a model with minimal configuration. if you're just doing it for interest's sake, that might be enough. but a career in ML would require SQL, as well as linear algebra, probability/statistics, and sometimes calculus. among other things.

cosmic lynx Feb 26, 2022, 6:01 PM

#

okay thanks

vague kindle Feb 26, 2022, 6:14 PM

#

Would it be possible (or a good idea) for someone with geometry level knowledge to learn the algebra and calculus that is needed for ml?

serene scaffold Feb 26, 2022, 6:28 PM

#

vague kindle Would it be possible (or a good idea) for someone with geometry level knowledge ...

do you know any amount of trigonometry? derivative calculus isn't that big of a jump from algebra. integral calculus is more difficult.

vague kindle Feb 26, 2022, 6:30 PM

#

serene scaffold do you know any amount of trigonometry? derivative calculus isn't that big of a ...

Yeah I know a good amount of trigonometry, and I've looked at some of the math behind it and it doesn't seem that hard, I just don't know if it's that big of a deal that I don't know much algebra 2

serene scaffold Feb 26, 2022, 6:31 PM

#

vague kindle Yeah I know a good amount of trigonometry, and I've looked at some of the math b...

algebra 1 and algebra 2 are just two courses for one subject. they aren't universally recognized ways of splitting up algebra.

safe elk Feb 26, 2022, 6:32 PM

#

We didnt split algebra like that in my uni lol

vague kindle Feb 26, 2022, 6:33 PM

#

serene scaffold algebra 1 and algebra 2 are just two courses for one subject. they aren't univer...

Right

#

Do you think it would be a good idea or should I wait a few years until I take all the math courses?

serene scaffold Feb 26, 2022, 6:35 PM

#

vague kindle Do you think it would be a good idea or should I wait a few years until I take a...

that's really a matter of how much free time you have and how you want to spend it.

#

though if you're learning algebra, it might not be a bad time to learn array/matrix arithmetic.

vague kindle Feb 26, 2022, 6:39 PM

#

serene scaffold though if you're learning algebra, it might not be a bad time to learn array/mat...

Alright

serene scaffold Feb 26, 2022, 6:40 PM

#

other branches of math you could look into are set theory and graph theory. they differ from the kind of math you learn in high school in that they're a lot more conceptual. there isn't a whole lot of calculating.

#

set theory is just about having things in groups, and graph theory is just about things and relationships between things. they're used to model real-world phenomena in precise terms.

vague kindle Feb 26, 2022, 6:45 PM

#

serene scaffold set theory is just about having things in groups, and graph theory is just about...

Interesting. Would a lot of ml algorithms like knn or logistic regression require less math than neural networks?

serene scaffold Feb 26, 2022, 6:46 PM

#

vague kindle Interesting. Would a lot of ml algorithms like knn or logistic regression requir...

not really. they all involve a lot of math, if you want to understand how they work. though you can understand what they do and when you might use them without knowing their exact formulae.

vague kindle Feb 26, 2022, 6:48 PM

#

serene scaffold not really. they all involve a lot of math, if you want to understand how they w...

Ok, thank you for the advice

urban prism Feb 26, 2022, 7:20 PM

#

My kaggle notebook gives memory error seemingly at random (can run smoothly one time and return an error another) what should I do?

frosty flower Feb 26, 2022, 7:22 PM

#

Training on MNIST using a 1 layer NN (left, no hidden layer) and a 2 layer NN (right, 1 hidden layer).
So, in the 1 layer case, the accuracy reaches maximum after just 1 epoch and basically just fluctuates around there. Is it normal?

junior lintel Feb 26, 2022, 7:32 PM

#

Hello, I am totally lost.
I am trying to learn machine learning ( but I am 14 which means I don’t have a lot of experience)
So I bought the famous and recommended book “Hands-On Machine learning with Scikit-Learn, Keras & Tensorflow”.
I then saw in the Prerequisites that this book assumes that I am familiar with Python’s main scientific libraries in particular Numpy, Pandas and Matplotlib.
I then learned these libraries but I don’t really understand the code part of the book still because it uses scikit learn, keras and Tensorflow without explaining what the syntax means so I told myself that i should start learning ML on youtube first but the explanations are too simplified so the reason I wrote this big message is to ask you please tell me where to start

#

Or for anyone who has already read this book does it explain later on the syntax of Tensorflow, Keras and Scikit-Learn?

pastel valley Feb 26, 2022, 7:35 PM

#

junior lintel Hello, I am totally lost. I am trying to learn machine learning ( but I am 14 wh...

probably python? my guess is the book uses python so if you dont understand the syntax maybe its python?
or you already knew python?

junior lintel Feb 26, 2022, 7:35 PM

#

I already know python

#

I am talking about the syntax of the machine learning libraries like tensor flow and Keras

pastel valley Feb 26, 2022, 7:37 PM

#

it is python objects and such i think

#

maybe you want to find is the documentations of those library to know what those methods objects do?

junior lintel Feb 26, 2022, 7:39 PM

#

Yeah what I am saying is I don’t understand the methods of these ML libraries but I am asking if (for anyone who read the book) The book will explain the methods later on and I am also asking where should I start to learn ML

pastel valley Feb 26, 2022, 7:41 PM

#

there is a course recommended here its machine learning by andrew ng iirc

junior lintel Feb 26, 2022, 7:41 PM

#

Ok

#

You just answered my question / helped me so thank you a lot

#

I quickly watched it and although math is extremely important for ML it only talks about math and not the libraries

pastel valley Feb 26, 2022, 7:45 PM

#

that course is for theories i think and for libraries documentations is the way i think

junior lintel Feb 26, 2022, 7:46 PM

#

Sorry?

pastel valley Feb 26, 2022, 7:48 PM

#

i mean the course is for ml fundamentals and if you want to learn about the libraries its the documentations you should read i think

junior lintel Feb 26, 2022, 7:49 PM

#

Ok

#

Do you know where I can find these documentations?

junior lintel Feb 26, 2022, 7:50 PM

#

pastel valley i mean the course is for ml fundamentals and if you want to learn about the libr...

.

iron basalt Feb 26, 2022, 7:50 PM

#

If you Google them they will come up, e.g. "pytorch docs": https://pytorch.org/docs/stable/index.html

junior lintel Feb 26, 2022, 7:51 PM

#

Ok thanks

iron basalt Feb 26, 2022, 7:52 PM

#

Also these libraries have their entire own web sites which may have tutorials on them: https://pytorch.org/tutorials/

junior lintel Feb 26, 2022, 7:52 PM

#

Yes this is where I just went

#

Thank you a lot then @pastel valley and thank you @iron basalt have a great day

#

Both of you

grave frost Feb 26, 2022, 8:06 PM

#

junior lintel Yeah what I am saying is I don’t understand the methods of these ML libraries bu...

its a bad idea to learn that way

#

my advice: get up to scratch with your math background first, then do ML side-by-side

minor elbow Feb 26, 2022, 9:04 PM

#

frosty flower So... I don't even need to calculate the loss to do back prop??

you get an error at the output layer and back prop is a way of propagating that error to earlier layers, so if u have no earlier layer theres nothing to propagate the error to

minor elbow Feb 26, 2022, 9:06 PM

#

frosty flower Training on MNIST using a 1 layer NN (left, no hidden layer) and a 2 layer NN (r...

yes its normal because only using the output is literally a logistic regression, ie deterministic and optimal wrt mse

#

you can think of a neural network as a weighted set of linear/logistics regression, the "learning" part is finding the weights

#

if you have no weights (no hidden layer) theres nothing to learn

tidal bough Feb 26, 2022, 9:08 PM

#

if there's no activation functions, your network is just a linear operator (regardless of the number of layers it has, actually), and so equivalent to logistic regression. Since without a hidden layer there's no activations either, the same happens here.

iron basalt Feb 26, 2022, 9:09 PM

#

(if we are being pedantic, it's affine (from linear algebra POV, from calculus POV it's "linear" (deg. 0 or 1)), the activation function is linear)

#

("linear layer" then refers to the activation function)

junior lintel Feb 26, 2022, 9:23 PM

#

grave frost my advice: get up to scratch with your math background first, then do ML side-by...

Ok now at least it’s an option but the reason I am not directly doing it this way is because I am only 14 so I tell my self I have time but If I see I seriously need the math I’ll learn it

grave frost Feb 26, 2022, 9:27 PM

#

junior lintel Ok now at least it’s an option but the reason I am not directly doing it this wa...

I do agree - I'm 17 myself, and my approach was to do it both sides - learn things bottom up as well as top down; I really don't think that's the best or the most efficient approach, but use what feels best for you

urban prism Feb 26, 2022, 10:12 PM

#

Don't let your age bring you down. I'm 18 and got my first ML Emgineer job recently -which tbh I'm doing terrible at-

urban prism Feb 26, 2022, 11:42 PM

#

Is there an explanation for my kaggle notebook to terminate itself with a memory allocation error every now and then?
It works fine one time and then gives me the error on another run

serene scaffold Feb 26, 2022, 11:43 PM

#

urban prism Is there an explanation for my kaggle notebook to terminate itself with a memory...

can you show one such error as an example? the whole thing please, as text.

urban prism Feb 26, 2022, 11:46 PM

#

It normally just this:
Your notebook tried to allocate more memory than is available. It has restarted. but the notebook isn't consistent with the error.
Also I can send the part(s) where I think may be causing the errors but it will take time to reproduce

urban prism Feb 27, 2022, 12:02 AM

#

After I run

img_height =256
img_width = 256
num_channels = 3
unet = unet_model((img_height, img_width, num_channels))

#

!paste

arctic wedgeBOT Feb 27, 2022, 12:02 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

urban prism Feb 27, 2022, 12:03 AM

#

which isn't an issue, I don't think

#

https://paste.pythondiscord.com/aqeyimohis

#

@serene scaffold Tried the give all the information needed, you can ask anything

#

Kind of desperate, been on this for hours now

serene scaffold Feb 27, 2022, 12:18 AM

#

@urban prism sorry I've been afk. Though generally speaking, I don't know what guarantees Colab makes about how much compute power they'll give you

urban prism Feb 27, 2022, 12:20 AM

#

It's alright. Thanks

misty flint Feb 27, 2022, 2:46 AM

#

urban prism It normally just this: `Your notebook tried to allocate more memory than is avai...

this happens to me sometimes with colab with bigger models/datasets

#

the recommendation is to reduce one or the other

#

or get colab pro

#

even then sometimes you run out of memory

#

ID_BoomKek

serene scaffold Feb 27, 2022, 2:53 AM

#

misty flint this happens to me sometimes with colab with bigger models/datasets

is there any way to predict when it's going to happen? idk if the memory allocation per user is dynamic

misty flint Feb 27, 2022, 3:09 AM

#

it really seems random at times tbh. the info at the top right is sometimes useful, but i think sometimes youre sharing with others

#

its like some cloud providers

urban prism Feb 27, 2022, 9:04 AM

#

I see. Thanks for the info! @misty flint

river maple Feb 27, 2022, 9:47 AM

#

got this while training a custom model

#

does changing random in cfg file to 0 solve this?

river maple Feb 27, 2022, 10:59 AM

#

im using google colab

zenith oar Feb 27, 2022, 12:18 PM

#

Hi

#

how can i tackle this problem (which data structure i should use)

#

Basicaly i have 2 columns that represent each the white and black player in chess

#

I need to find out which player has played the most with another player

#

I was suggested to use this view

#

I am just not sure how is it called

safe elk Feb 27, 2022, 12:37 PM

#

Looks like a crosstab

#

https://pandas.pydata.org/docs/reference/api/pandas.crosstab.html

junior lintel Feb 27, 2022, 2:22 PM

#

Hello, I am trying to learn the main ML libraries like Tensorflow and scikit learn but when I watch courses on YouTube like the ones made by “FreeCodeCamp.com” people (in the comments) say it’s a great tutorial but personally I don’t understand what’s going on as they just write the scikit learn methods without explaining what they do I am lost.

neat anvil Feb 27, 2022, 2:48 PM

#

The documentation and tutorials on the tensorflow and sklearn websites is a great resource

#

As well, if you don’t understand the fundamental mathematics/logic of the ML algorithms, you may need to study up on that background knowledge before things start to make sense

weary flint Feb 27, 2022, 2:49 PM

#

anyone want to start a data science podcast with me? i literally just started learning and have tons of questions so the podcast would just be me asking questions and my partner answering them

#

i think it could potentially be entertaining

#

or it could be some else that is also new and we report on our progress together

serene scaffold Feb 27, 2022, 3:18 PM

#

junior lintel Hello, I am trying to learn the main ML libraries like Tensorflow and scikit lea...

well, you shouldn't try to "learn libraries". you should learn to solve problems. come up with an ML project that may involve sklearn or tensorflow and do it.

pastel valley Feb 27, 2022, 3:43 PM

#

yo this is overfitting right?

#

btw in that metrices i can also see that around 30 epochs the model started to learn almost nothing?

serene scaffold Feb 27, 2022, 3:47 PM

#

pastel valley yo this is overfitting right?

could be. how did the model perform on the test data?

#

or did you predict on the test data after every epoch...?

pastel valley Feb 27, 2022, 3:52 PM

#

serene scaffold or did you predict on the test data after every epoch...?

yeah its test data already

#

but the accuracy is 80% above mostly is it still good?

#

or do models should naturally get higher?

serene scaffold Feb 27, 2022, 4:04 PM

#

pastel valley but the accuracy is 80% above mostly is it still good?

depends on your use case

#

if 80% is better than human performance for that task, then yes. if it's for something that's not very important, maybe

pastel valley Feb 27, 2022, 4:10 PM

#

base on the test-train metrices their margins is it somewhat accceptable? or also depending on the task?

serene scaffold Feb 27, 2022, 4:14 PM

#

it always depends. for all I know, you want a model that's always wrong.

pastel valley Feb 27, 2022, 4:18 PM

#

there is not right or wrong models its just about tweaking it to how you want it to perform? its bugging me if i created a right model or a wrong one but if i understand you correctly then if this performance is acceptable to me then the model is right ?

neat anvil Feb 27, 2022, 4:20 PM

#

So there’s a quote often used when making new theories in science in general- “there’s no such thing as a model that is right, only models that are useful”

#

This quote is doubly true for machine learning where you’re essentially trying to skip the “actually understand what is happening” part of science right to the “have a model that can predict future events” part of science

#

If you think 90% test accuracy on your data is good enough to be useful, congrats! you’re done fine-tuning that model

pastel valley Feb 27, 2022, 4:22 PM

#

i still dont get it

serene scaffold Feb 27, 2022, 4:23 PM

#

pastel valley there is not right or wrong models its just about tweaking it to how you want it...

the individual predictions that a model make can be right or wrong. beyond that, whether or not the model itself is good enough for a certain situation depends.

neat anvil Feb 27, 2022, 4:23 PM

#

We can’t make that decision for you, there’s no arbitrary “model is good enough now” cutoff that applies to all models

pastel valley Feb 27, 2022, 4:25 PM

#

neat anvil If you think 90% test accuracy on your data is good enough to be useful, congrat...

i did not do any fine tuning i just want to compare identical models(models with same initialized weights and layers ) performance based in the data they are trained with so maybe my questions are out of place hahaha

#

but is there ever a model that has 98%+ accuracy? like is there ever someone capable of creating a very good model?

neat anvil Feb 27, 2022, 4:26 PM

#

And yea, like you observed it seemed your model may have stopped learning anything new around 20-30 epochs of training. At least in terms of achieving better statistical metrics. That’s quite normal, google “neural network training early stopping” and you’ll find resources in that

serene scaffold Feb 27, 2022, 4:27 PM

#

there are models that are basically 100% for problems that don't have ambiguous cases.

neat anvil Feb 27, 2022, 4:27 PM

#

pastel valley but is there ever a model that has 98%+ accuracy? like is there ever someone cap...

What a good accuracy is completely depends on the prior distribution of targets and features in the dataset. There’s no generic answer

pastel valley Feb 27, 2022, 4:32 PM

#

neural network training early stopping there are cases called over training where the model will decrease performance? its like after the ath its will just curve down? is this what it means?

neat anvil Feb 27, 2022, 4:34 PM

#

One model being more “overtrained” than another means the difference between test metrics and train metrics is higher

#

This may or may not have anything to do with how many epochs of training the model undergoes

pastel valley Feb 27, 2022, 4:40 PM

#

so if i plan to compare performances of models ill just use the same epochs for all models?

neat anvil Feb 27, 2022, 4:43 PM

#

Unfortunately it’s not that simple. Models with different architecture may take different numbers of epochs to train to equivalent performance. Usually this type of thing is addressed by using a consistent validation set and consistent early stopping criteria between the models you are trying to compare.

pastel valley Feb 27, 2022, 4:44 PM

#

neat anvil Unfortunately it’s not that simple. Models with different architecture may take ...

in my case i am comparing identical models same architecture

#

i just create 3 models and train them to classify same classes but the data they will be trained is different per model

neat anvil Feb 27, 2022, 4:46 PM

#

pastel valley i just create 3 models and train them to classify same classes but the data they...

IMO a good way to conceptualize this is that a model is not separate from the data is trained on. Identical architecture + identical training procedure + different data -> different models , not identical models

pastel valley Feb 27, 2022, 4:52 PM

#

they are originally have the same data but applied with different augmentations techniques, for example the data set of cats, dogs, birds they will give their own training to the model
now if i apply to augmentationA to those original dataset then how much does the same model improved or if it performed worst then the same model trained on augmentationB with the original dataset then is A better or B or the base model without augmentation applied to the cats, dogs, birds dataset

#

does it make sense? 😅

#

its like experiment, does applying this kind of augmentation to this type of classes will be better or not or how about this type of augmentation or this one

#

like that

#

sorry my English is not that good 😅

neat anvil Feb 27, 2022, 4:55 PM

#

Yes, it’s a sensible question, but since you’re introducing different augmentation data into the models you cannot just give them the same architecture and same training procedure and compare their performance and use this as a way to understand which augmentation is always “best”

#

How would you know if perhaps a slightly different model architecture trained on augmentation B data wouldn’t outperform the initial model architecture on un-augmented data?

pastel valley Feb 27, 2022, 5:05 PM

#

neat anvil Yes, it’s a sensible question, but since you’re introducing different augmentati...

yes this is also the one thing that will negate this experiment because there are cases where what if its a different architecture used then maybe the output will not be the same
but if i say like the class features
for example if i apply this experiment on classifying cars and say like augA is rotating etc, and augB is color casting etc, then it wouldnt makes sense because there seems to be nothing wrong with those augmentations
but if for example my model should classify something like color of a ball then there is a chance that augB will perform worst because applying colorcasting on the sample that color is a special feature would be a problem, example classes will be blue ball and red ball and i have a original sample of blue ball and when i applied augB it produced a red ball like augmented image then the model will learn that its red ball but infact its just augmented image from a blue ball
of course it is all only a "what if"

pastel valley Feb 27, 2022, 5:08 PM

#

neat anvil How would you know if perhaps a slightly different model architecture trained on...

by not focusing on the model instead of the data. models learn what we give to them but if we un intentionally gave them a wrong data like the ball augmentation sample maybe just maybe i can conclude that augB is bad for classses that has color as a important feature

#

if i said it correct
this is the biggest question does it make sense? or i am wasting my time?

scarlet light Feb 27, 2022, 5:11 PM

#

Hi can someone help me

#

https://stackoverflow.com/q/71285886/17115121

Stack Overflow

How to use different colour to plot in folium map?

So i have many csv files each one of them has three columns.

latitude
longitude
distance

for example:
car1.csv
lat long total_dist
23.33 73.32. 0
23.45. 73.34. 10
23.64. ...

neat anvil Feb 27, 2022, 5:23 PM

#

pastel valley if i said it correct this is the biggest question does it make sense? or i am w...

So your experiment captures “given a fixed model architecture, which data augmentation performs best”. That’s fine. The disconnect that confuses me is that there is nothing stopping anyone from changing your model architecture, thus invalidating the test results. You can define a more generic test that includes more model architectures

pastel valley Feb 27, 2022, 5:26 PM

#

neat anvil So your experiment captures “given a fixed model architecture, which data augmen...

oh right right that one can be said i havent figure out the possible reason for that so its a big flaw
btw in terms of cnn models what differences are there on their architectures the amount of layers and types of layers?

#

or there are more?

neat anvil Feb 27, 2022, 5:27 PM

#

Many, many more

pastel valley Feb 27, 2022, 5:31 PM

#

if i say that given this architecture and this classes which type of augmentation is the best and worst then its back to why not use different model? hahaha wew

neat anvil Feb 27, 2022, 5:32 PM

#

Indeed

broken shell Feb 27, 2022, 6:28 PM

#

Hey, I want to learn more about AI and implement it into python scripts, but I don't know where to start. Any suggestions ? (I'm a total beginner when it comes to AI)

serene scaffold Feb 27, 2022, 6:33 PM

#

broken shell Hey, I want to learn more about AI and implement it into python scripts, but I d...

including something in a program that already exists is not "implementing it". for some reason the word "implement" gets thrown around a lot.

anyway, what is your understanding of what AI is, at a high level?

urban prism Feb 27, 2022, 6:41 PM

#

I'm trying to implement this

from tensorflow.python.ops.numpy_ops import np_config
np_config.enable_numpy_behavior()
def dice_metric(inputs, target):
    intersection = 2.0 * (target * inputs).sum()
    union = target.sum() + inputs.sum()
    if target.sum() == 0 and inputs.sum() == 0:
        return 1.0
    return intersection / union
def dice_loss(inputs, target):
    num = target.size(0)
    inputs = inputs.reshape(num, -1)
    target = target.reshape(num, -1)
    smooth = 1.0
    intersection = (inputs * target)
    dice = (2. * intersection.sum(1) + smooth) / (inputs.sum(1) + target.sum(1) + smooth)
    dice = 1 - dice.sum() / num
    return dice
def bce_dice_loss(inputs, target):
    dicescore = dice_loss(inputs, target)
    bcescore = tf.keras.losses.BinaryCrossentropy()
    bceloss = bcescore(inputs, target)

    return bceloss + dicescore

Though it returns:

    /opt/conda/lib/python3.7/site-packages/keras/engine/training.py:853 train_function  *
        return step_function(self, iterator)
    /tmp/ipykernel_35/144329947.py:17 bce_dice_loss  *
        dicescore = dice_loss(inputs, target)
    /tmp/ipykernel_35/144329947.py:8 dice_loss  *
        num = target.size(0)

    TypeError: 'NoneType' object is not callable

I'm a bit lost. Any ideas?
unet.compile(optimizer=Adam(learning_rate=1e-4), loss=[bce_dice_loss], metrics=[dice_metric])

broken shell Feb 27, 2022, 6:44 PM

#

serene scaffold including something in a program that already exists is not "implementing it". f...

All I know is that AI's are a bunch of algorithms that do specefic tasks depending on factors you have set. so yeah, nothing above average knowledge x) I might even be wrong here.

serene scaffold Feb 27, 2022, 6:46 PM

#

broken shell All I know is that AI's are a bunch of algorithms that do specefic tasks dependi...

That's a lot better than the general public's understanding. Yay!

#

@urban prism if you get an error about NoneType, it usually means something returned none when you thought it returned something

urban prism Feb 27, 2022, 6:48 PM

#

Yay, debugging time

#

time to use bunches of prints :P

broken shell Feb 27, 2022, 6:50 PM

#

serene scaffold That's a lot better than the general public's understanding. Yay!

I have no idea how to use one in python and get it to do what I want it to tho, and where can I learn more about it. is there some kind of full course on the internet that you "validate" ?

serene scaffold Feb 27, 2022, 6:58 PM

#

broken shell I have no idea how to use one in python and get it to do what I want it to tho, ...

https://www.pythondiscord.com/resources/?topics=data-science

Python Discord | Resources

We're a large, friendly community focused around the Python programming language. Our community is open to those who wish to learn the language, as well as those looking to help others.

#

there's also an online course by Andrew Ng, but it hasn't been reviewed by our staff yet. I also recommend 3blue1brown on YouTube for the math stuff.

broken shell Feb 27, 2022, 7:02 PM

#

Thanks !

merry wadi Feb 27, 2022, 7:03 PM

#

whats the best way to code a lot of if statements?

serene scaffold Feb 27, 2022, 7:04 PM

#

merry wadi whats the best way to code a lot of if statements?

not sure I'm following. lots of if statements are usually regarded as poor design. is this a data science question or a #software-architecture question?

merry wadi Feb 27, 2022, 7:05 PM

#

serene scaffold not sure I'm following. lots of if statements are usually regarded as poor desig...

might be a software design. But i have code that needs to check if a list contains specific words but excludes specific ones as well

serene scaffold Feb 27, 2022, 7:05 PM

#

merry wadi might be a software design. But i have code that needs to check if a list contai...

if you want refactoring advice, try there instead.

#

if you need general help, see #❓｜how-to-get-help

misty flint Feb 27, 2022, 8:50 PM

#

sigh

#

i hate minitorch

#

if anyone asks you to do it for fun to "learn ML from scratch", you should heavily reconsider

#

unless thats something youre passionate about, then feel free

grave frost Feb 27, 2022, 8:52 PM

#

doesn't make sense re-implementing literally everything fom scratch

#

I'd say just learn how to implement stuff, implement papers then. Writing your own autograd engine is pretty useless IMO since its just heuristic/rule-based anyways

misty flint Feb 27, 2022, 8:53 PM

#

yeah too bad its assignment for deep learning class

#

so no choice but to drag my feet

#

even tho i will never use this again

grave frost Feb 27, 2022, 8:54 PM

#

atleast you'll get a better understanding of tensor manipulation

#

use einops if you aren't already - might save you a ton of time

iron basalt Feb 27, 2022, 9:43 PM

#

grave frost I'd say just learn how to implement stuff, implement papers then. Writing your o...

Autodiff systems are actually pretty straight forward, the real difficulty by these kinds of things (libraries) is having ALL the features. It's having to implement N different things. And if they can interact with each other it starts to become an N^2 problem. For example when implementing a database and wanting all the different selections, HTTPS interface, maybe a GUI, etc vs just having the base systems (the record storage system / file format, some indexing).

#

Even if the different features are not hard to implement (especially if someone already did it before), it just takes a lot of time, especially for debugging it all together.

#

On the other hand, if you know you only need a few of the features, then it wont take too long and you can probably optimize it further because it's more specific and more specific / less general tends to be faster on computers (and less code, so easier to debug and maintain and browse, etc).

grave frost Feb 27, 2022, 9:50 PM

#

agreed. in the end, its more of a programming exercise than conceptual one

iron basalt Feb 27, 2022, 9:50 PM

#

Yeah, since it's been done before (it's called minitorch after all, so it probably has nothing new in it).

grave frost Feb 27, 2022, 9:51 PM

#

oh well. I suppose I'd have to do it too one day

iron basalt Feb 27, 2022, 9:51 PM

#

Ofc, being able to do that grind is super important if you want to make something new, it's a huge initial hill to climb before you see any results.

#

Or in RL terms, a very very delayed reward.

grave frost Feb 27, 2022, 9:52 PM

#

doesn't seem like a particularly useful grind - but I suppose my programming skills needs a lot of work

#

yea. still...

iron basalt Feb 27, 2022, 9:55 PM

#

If you do want to make something new that requires implementing a new library / system, then I would recommend making it very specific, don't let the feature count get too high because the time needed is not a linear function of the number of features, so adding even one more might make it way more work than expected.

iron basalt Feb 27, 2022, 10:01 PM

#

misty flint if anyone asks you to do it for fun to "learn ML from scratch", you should *heav...

It does seem a bit much to learn ML from scratch from browsing it a bit. This is more of what you might do later to package it into a nice generic library.

misty flint Feb 27, 2022, 10:04 PM

#

yes im glad the guy who does this for work agrees with me

#

i now feel 200% validated

#

CLe_FeelsEvilLurk

upper spindle Feb 27, 2022, 11:27 PM

#

When I execute this code, trainx = input_df.loc[train_index], I get this error KeyError: "None of [DatetimeIndex(['2020-01-01', '2020-01-02', '2020-01-03', '2020-01-04',\n '2020-01-05', '2020-01-06', '2020-01-07', '2020-01-08',\n '2020-01-09', '2020-01-10',\n ...\n '2021-05-17', '2021-05-18', '2021-05-19', '2021-05-20',\n '2021-05-21', '2021-05-22', '2021-05-23', '2021-05-24',\n '2021-05-25', '2021-05-26'],\n dtype='datetime64[ns]', name='Date', length=512, freq=None)] are in the [index]"

#

Im not sure why

frosty flower Feb 27, 2022, 11:58 PM

#

How should I adjust the learning rate wrt the batch size?

vague kindle Feb 28, 2022, 12:01 AM

#

Sorry if this is a stupid question but what are some actual uses of svm, knn, logistic or linear regression, and other similar algorithms? Wouldn't a human be able to figure out patterns like this and accurately predict outcomes?

frosty flower Feb 28, 2022, 12:07 AM

#

vague kindle Sorry if this is a stupid question but what are some actual uses of svm, knn, lo...

Some of them are taught because they were relevant historically and so the school thinks you should know about them

#

Some of them are used as components of larger and more complex systems

carmine gulch Feb 28, 2022, 12:11 AM

#

Hello, does anyone know how to deploy and easily share notebooks from Jupyter without having to spin up a separate streamlit server? In other words, I want a way to share notebooks like they were separate dashboard pages. I don’t want to pay for plotly enterprise.

minor elbow Feb 28, 2022, 12:13 AM

#

vague kindle Sorry if this is a stupid question but what are some actual uses of svm, knn, lo...

i would argue that those models are more widespread than deep learning

carmine gulch Feb 28, 2022, 12:13 AM

#

carmine gulch Hello, does anyone know how to deploy and easily share notebooks from Jupyter wi...

I basically want to create an on-prem instance with some way to publish the out of Jupyter notbook plots in a way that any non-software engineer can go to and view. I don’t want to get in the business of trying to design separate pages using html. There has to be a better way.

minor elbow Feb 28, 2022, 12:14 AM

#

linear and logistic regression is also valuable as a tool for interpretation, they are widely used in medical research and other sciences

#

also its not a question of if humans can do it, humans can drive cars great too but ppl are spending a lot of time on models for that

#

ML is best suited when u have relatively simple tasks you need to do a gagillion times very quickly

#

also the different model types do better with different types of data/problems, its not necessarily possible to tell in advance what is the best type of model to use for a given problem

#

its a rookie mistake i see over and over to pick deep learning models for everything

frosty flower Feb 28, 2022, 12:22 AM

#

minor elbow its a rookie mistake i see over and over to pick deep learning models for everyt...

Actually I think a more common mistake is thinking there's a model for everything

#

pepeshrug

minor elbow Feb 28, 2022, 12:23 AM

#

good point

neat anvil Feb 28, 2022, 1:49 AM

#

carmine gulch I basically want to create an on-prem instance with some way to publish the out ...

You could host a Jupyter lab instance with read-only notebooks? https://stackoverflow.com/questions/58944458/read-only-python-notebook-in-jupyter-lab

Stack Overflow

Read Only Python Notebook in Jupyter Lab

How can we edit an existing .ipynb in Jupyter Lab to be read-only?
More specifically, how can we make specific cells within an existing .ipynb become read-only?

Basically, other users would be abl...

carmine gulch Feb 28, 2022, 1:51 AM

#

neat anvil You could host a Jupyter lab instance with read-only notebooks? https://stackove...

Interesting…. I’m just curious why there is not a mainstream approach. I feel like there are so many disparate systems.

#

Thanks for the link though

plucky willow Feb 28, 2022, 5:03 AM

#

does anyone have a really simple example of multivariate regression with tensor flow?

#

I have done regression with a csv with just an x and a y, but if i had two independent variables what would i do

#

the data that i am using is 100 random floats from -10 to 10 for all 3 columns

#

but idk how to find any relationship btw them

#

i understand 2 variable linear regression, but i am completly lost with 3 variable regression

somber prism Feb 28, 2022, 5:29 AM

#

guys is there any AI assisted image labelling tool thats actually free to use? like all we need to do is draw a bounding box for some images and rest will be taken care by the ai

#

i have like 278k images but i want to label that 😭

river maple Feb 28, 2022, 6:06 AM

#

somber prism guys is there any AI assisted image labelling tool thats actually free to use? l...

https://github.com/theAIGuysCode/OIDv4_ToolKit

GitHub

GitHub - theAIGuysCode/OIDv4_ToolKit: Download and visualize single...

Download and visualize single or multiple classes from the huge Open Images v4 dataset - GitHub - theAIGuysCode/OIDv4_ToolKit: Download and visualize single or multiple classes from the huge Open I...

#

this might help?

somber prism Feb 28, 2022, 6:07 AM

#

river maple https://github.com/theAIGuysCode/OIDv4_ToolKit

thanks ill look into it

steep cypress Feb 28, 2022, 6:14 AM

#

plucky willow i understand 2 variable linear regression, but i am completly lost with 3 variab...

with tensorflow, I found this: https://www.tensorflow.org/tutorials/keras/regression

TensorFlow

Basic regression: Predict fuel efficiency | TensorFlow Core

#

---
hello, I was learning pytorch from the docs and following some sentdex neural networks examples. Was wondering if it is necessary to transform the target labels into one-hot encoded vectors. If I transform I'm having problem with nll_loss()

remote ridge Feb 28, 2022, 6:21 AM

#

hey guys! i am new here, i am having trouble choosing a project for my final sem in uni, i would like to make a project on machine learning, any idea where can i get help from and get the project done? are there any good courses in udemy which can help?

barren jungle Feb 28, 2022, 6:22 AM

#

Hey i need to know how to use stegnatography to hide a code inside an image and when omage opened by someone it pastes a code inside the browser console , pls this is the imformation i required for my project if u know pls help

#

Nobody help regarding ai and datascience in this server

shell drift Feb 28, 2022, 7:03 AM

#

Hello, I want to start my AI/Ml journey can anyone please share some roadmap or structure which I can follow for learning

urban ore Feb 28, 2022, 7:37 AM

#

can i train a neural network with untrainable params in them?

urban prism Feb 28, 2022, 7:53 AM

#

How do I make it save according to the increase of metric?

312/312 [==============================] - 74s 197ms/step - loss: 0.4691 - dice_metric: 0.7883 - val_loss: 0.5346 - val_dice_metric: 0.7840

Epoch 00001: dice_metric improved from inf to 0.78828, saving model to model_unet.h5
Epoch 2/32
312/312 [==============================] - 67s 207ms/step - loss: 0.2512 - dice_metric: 0.8916 - val_loss: 0.2132 - val_dice_metric: 0.9031

Epoch 00002: dice_metric did not improve from 0.78828
Epoch 3/32
312/312 [==============================] - 71s 217ms/step - loss: 0.1809 - dice_metric: 0.9258 - val_loss: 0.1983 - val_dice_metric: 0.9246

tacit basin Feb 28, 2022, 8:05 AM

#

shell drift Hello, I want to start my AI/Ml journey can anyone please share some roadmap or...

Start with the area that interests you. Like medicine, agriculture, art etc.

tacit basin Feb 28, 2022, 8:05 AM

#

urban ore can i train a neural network with untrainable params in them?

Could you explain the untrainable parameters?

tacit basin Feb 28, 2022, 8:07 AM

#

barren jungle Nobody help regarding ai and datascience in this server

What's ai or ds in this task? Seems potentially harmful

tacit basin Feb 28, 2022, 8:08 AM

#

remote ridge hey guys! i am new here, i am having trouble choosing a project for my final sem...

What the project is about?

remote ridge Feb 28, 2022, 8:09 AM

#

CNN for traffic sign classification using LE-Net

tacit basin Feb 28, 2022, 8:09 AM

#

urban prism How do I make it save according to the increase of metric? ``` 312/312 [========...

As i can see it saves model on metric increase

odd meteor Feb 28, 2022, 8:10 AM

#

urban ore can i train a neural network with untrainable params in them?

Yes it's possible. It depends on your NN architecture.

tacit basin Feb 28, 2022, 8:11 AM

#

remote ridge CNN for traffic sign classification using LE-Net

Do you have data to train the model. Will be first task.

urban prism Feb 28, 2022, 8:12 AM

#

tacit basin As i can see it saves model on metric increase

312/312 [==============================] - 74s 197ms/step - loss: 0.4691 - dice_metric: 0.7883 - val_loss: 0.5346 - val_dice_metric: 0.7840

Epoch 00001: dice_metric improved from inf to 0.78828, saving model to model_unet.h5
Epoch 2/32
312/312 [==============================] - 67s 207ms/step - loss: 0.2512 - dice_metric: 0.8916 - val_loss: 0.2132 - val_dice_metric: 0.9031

Epoch 00002: dice_metric did not improve from 0.78828
But didn't it from 0.78828 to 0.8916 ?

remote ridge Feb 28, 2022, 8:13 AM

#

tacit basin Do you have data to train the model. Will be first task.

taking help from udemy for my project the course should have a data model

#

#

i just dont know if its viable for resume and project

tacit basin Feb 28, 2022, 8:14 AM

#

urban prism 312/312 [==============================] - 74s 197ms/step - loss: 0.4691 - dice_...

Can you set a difference which triggers model save?

urban prism Feb 28, 2022, 8:14 AM

#

What do you mean?

urban ore Feb 28, 2022, 8:20 AM

#

tacit basin Could you explain the untrainable parameters?

freezing layers?

urban ore Feb 28, 2022, 8:20 AM

#

odd meteor Yes it's possible. It depends on your NN architecture.

can u help me with it

#

i have an autoencoder

#

with 5 layers initially trained and now i have frozen the first 3 for transfer learning

#

and have created new layers as well

desert bear Feb 28, 2022, 8:57 AM

#

hi i am working on a fun project. it is a sort of alexa and it is traint with tensorflow. the text is classifier with a nlu. now i was wondering if there is a character that i can put in my training model that can mean any word? or is this not necessary and do i need to train it with for instance city names.

barren jungle Feb 28, 2022, 9:23 AM

#

@tacit basin let it be btw knowledge not a bad thing

tacit basin Feb 28, 2022, 9:27 AM

#

remote ridge i just dont know if its viable for resume and project

In practice Le net is not used currently. Resents more like it for image classification.

tacit basin Feb 28, 2022, 9:28 AM

#

urban prism What do you mean?

Like if you set difference level at 1, a difference of 0.5 will not be registered as improvement

urban prism Feb 28, 2022, 9:30 AM

#

I solved it. Apperantly I wrote mode="min" to the wrong kernel (two of the same kaggle notebooks were open)

arctic wedgeBOT Feb 28, 2022, 10:54 AM

#

:incoming_envelope: :ok_hand: applied mute to @stray crystal until <t:1646046249:f> (9 minutes and 59 seconds) (reason: duplicates rule: sent 4 duplicated messages in 10s).

midnight crater Feb 28, 2022, 11:09 AM

#

hello, why is this returning an error?

for x in range(len(y_pred)):
    print(X_test[x], y_test[x], y_pred[x])```
I've checked the length of the 3 variables and they are all the same, but somehow the `X_test` and `y_test` throws an error

pine wolf Feb 28, 2022, 11:10 AM

#

are they all lists?

#

if one or more are dicts, say, then there might be an error

midnight crater Feb 28, 2022, 11:11 AM

#

ops, no, I already assumed they are all in array

#

only the y_pred is in array

pine wolf Feb 28, 2022, 11:12 AM

#

i would try using zip instead

#

it would look like this:

for x, y, z in zip(x_test, y_test, y_pred):
    print(x, y, z)

midnight crater Feb 28, 2022, 11:13 AM

#

I just used .to_numpy() to convert them to list

#

Thanks!!

pine wolf Feb 28, 2022, 11:14 AM

#

np

wicked grove Feb 28, 2022, 1:03 PM

#

Hello,how can i plot the training and validation curve after kfold cross validation??

hard basalt Feb 28, 2022, 1:35 PM

#

carmine gulch I basically want to create an on-prem instance with some way to publish the out ...

I am also searching for something similar but haven't found something that works so far. Tell us if you find something that works.

tacit basin Feb 28, 2022, 1:52 PM

#

wicked grove Hello,how can i plot the training and validation curve after kfold cross validat...

Sci-kit learn example

from sklearn.model_selection import validation_curve

max_depth = [1, 5, 10, 15, 20, 25]
train_scores, test_scores = validation_curve(
    regressor, data, target, param_name="max_depth", param_range=max_depth,
    cv=cv, scoring="neg_mean_absolute_error", n_jobs=2)
train_errors, test_errors = -train_scores, -test_scores
plt.plot(max_depth, train_errors.mean(axis=1), label="Training error")
plt.plot(max_depth, test_errors.mean(axis=1), label="Testing error")
plt.legend()

plt.xlabel("Maximum depth of decision tree")
plt.ylabel("Mean absolute error (k$)")
_ = plt.title("Validation curve for decision tree")

https://inria.github.io/scikit-learn-mooc/python_scripts/cross_validation_validation_curve.html

wicked grove Feb 28, 2022, 1:54 PM

#

tacit basin Sci-kit learn example ```py from sklearn.model_selection import validation_curve...

Thank you so much

tacit basin Feb 28, 2022, 2:03 PM

#

hard basalt I am also searching for something similar but haven't found something that works...

You could use binder and voila. Or you could share your notebook with --collaborative flag https://jupyterlab.readthedocs.io/en/stable/user/rtc.html

tacit basin Feb 28, 2022, 2:04 PM

#

hard basalt I am also searching for something similar but haven't found something that works...

Jupyterlite is also an option https://jupyterlite.readthedocs.io/en/latest/

desert oar Feb 28, 2022, 3:37 PM

#

carmine gulch I basically want to create an on-prem instance with some way to publish the out ...

do you just want to publish the plots somehow? it's pretty easy to save a plot to PNG and then you can host it on a fileserver somewhere

#

otherwise there is nbviewer which hosts notebooks in read-only "view" mode https://nbviewer.org/

nbviewer

#

if you just want to share notebooks as finished products in some on-prem fashion, your options are:

use nbconvert to convert a notebook to plain html, which can then be shared however you want to share plain html
nbviewer, which just does (1) automatically a server application

#

@tacit basin fyi since you were interested too ☝️

tacit basin Feb 28, 2022, 4:05 PM

#

Google colab is an option too. Or deep note...

urban ore Feb 28, 2022, 4:54 PM

#

any idea on how to add new layers to a model in torch?

neat anvil Feb 28, 2022, 5:02 PM

#

https://discuss.pytorch.org/t/how-to-add-a-layer-to-an-existing-neural-network/30129/2

PyTorch Forums

How to add a layer to an existing Neural Network?

It should generally work. Here is a small example: class MyModel(nn.Module): def init(self): super(MyModel, self).init() self.fc = nn.Linear(10, 2) def forward(self, x): x = self.fc(x) return x model = MyModel() x = torch.randn(1, 10) print(model(x)) > tensor([[-0.2403, 0.8158]], grad...

#

this delightfully simple example from there

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.fc = nn.Linear(10, 2)
        
    def forward(self, x):
        x = self.fc(x)
        return x


model = MyModel()
x = torch.randn(1, 10)
print(model(x))
> tensor([[-0.2403,  0.8158]], grad_fn=<ThAddmmBackward>)

model = nn.Sequential(
    model,
    nn.Softmax(1)
)
print(model(x))
> tensor([[0.2581, 0.7419]], grad_fn=<SoftmaxBackward>)

urban ore Feb 28, 2022, 5:49 PM

#

neat anvil this delightfully simple example from there ```python class MyModel(nn.Module): ...

i tried this actually with this because i my neural network is nothing but an autoencoder

#

and i need to add my encoding function in it and idk how

thin palm Feb 28, 2022, 6:37 PM

#

What's up Python gang, when building a project is it bad to have all our data cleaning, encoding, and data engineering all in one file?

desert oar Feb 28, 2022, 6:57 PM

#

thin palm What's up Python gang, when building a project is it bad to have all our data cl...

yes, seems like it would be hard to maintain that

#

well actually.. not if it's small

#

it's not bad to start that way

#

actually i take back what i said because i misread

#

yes, all of your data processing stuff can be in one file

#

however consider that if the file is really big you might benefit from writing some separate modules

arctic wedgeBOT Feb 28, 2022, 7:00 PM

#

:incoming_envelope: :ok_hand: applied mute to @topaz gale until <t:1646075445:f> (9 minutes and 59 seconds) (reason: duplicates rule: sent 4 duplicated messages in 10s).

neat anvil Feb 28, 2022, 7:39 PM

#

thin palm What's up Python gang, when building a project is it bad to have all our data cl...

I'd highly recommend taking a look at DAG execution tools like https://www.nextflow.io/blog.html, https://airflow.apache.org/, https://www.prefect.io/, or https://dvc.org/. Which one works best for you will depend on your specific needs. They have many benefits beyond encouraging you to adopt the better practice of separating your python code into separate modules

Data Version Control · DVC

Open-source version control system for Data Science and Machine Learning projects. Git-like experience to organize your data, models, and experiments.

tacit basin Feb 28, 2022, 7:58 PM

#

Or ploomber https://ploomber.io/
This is example on how to rewrite one huge notebook into a pipeline
https://ploomber.io/blog/refactor-nb-i/
There are many tools to chose from as usual
https://ploomber.io/blog/survey/

Ploomber - Build data pipelines. FAST.⚡️

Refactoring a Jupyter notebook into a maintainable pipeline: A step...

A detailed guide to convert a Jupyter notebook into a modular and maintainable project - part 1.

Open-source Workflow Management Tools: A Survey

An Open-source Workflow Management Tools Survey.

neat anvil Feb 28, 2022, 7:59 PM

#

so many DAG engines

#

it's frankly silly

tacit basin Feb 28, 2022, 7:59 PM

#

Yep

iron basalt Feb 28, 2022, 8:08 PM

#

If you want nice visual DAGs (which can also run Python, but it's its own language too (visual scripting)), then check out Enso: https://enso.org/

Enso | Get insights you can rely on. In real time.

Enso is an award-winning interactive programming language with dual visual and textual representations. It is a tool that spans the entire stack, going from high-level visualisation and communication to the nitty-gritty of backend services, all in a single language.

desert oar Feb 28, 2022, 8:59 PM

#

it's silly but i think the big problem is that they all do slightly different things and have slightly different benefits

#

this ploomber article is useful but it's frustrating that they are also clearly (somewhat) biased

desert bear Feb 28, 2022, 9:01 PM

#

what is the best way to train a ai to recconice citys and countys

serene scaffold Feb 28, 2022, 9:05 PM

#

desert bear what is the best way to train a ai to recconice citys and countys

can you be more specific? are you asking about a model that can identify which city a given image is of?

desert bear Feb 28, 2022, 9:07 PM

#

not realy i am at a problem that i wanne train a model like a alexa or siri and i dont realy know how to begin with this i use a nlu model and tensorflow

serene scaffold Feb 28, 2022, 9:09 PM

#

can you give a specific use case? "recognizing cities and counties" could mean a lot of things

desert bear Feb 28, 2022, 9:09 PM

#

i have a start but i wanne use it that when i ask where new amsetdam lays that it will awnser in the usa so i need to train it to knwo the sentens and to inow that new amsetdam is a city

tacit basin Feb 28, 2022, 9:09 PM

#

desert oar this ploomber article is useful but it's frustrating that they are also clearly ...

Yes. Is there a less biased comparison?

desert oar Feb 28, 2022, 9:09 PM

#

not that i know of

#

i wish there was

desert oar Feb 28, 2022, 9:10 PM

#

desert bear i have a start but i wanne use it that when i ask where new amsetdam lays that i...

you probably don't need to train a model for this. you can start by just getting a list of cities and looking for them!

#

otherwise you are looking for a general category called "named entity recognition"

serene scaffold Feb 28, 2022, 9:10 PM

#

desert bear i have a start but i wanne use it that when i ask where new amsetdam lays that i...

so what you're trying to do is called named entity recognition. you want something that can identify geopolitical locations.

#

and since that's a popular problem, there are lots of off-the-shelf solutions. you can use spaCy.

desert bear Feb 28, 2022, 9:11 PM

#

alright i think i get it thanks.

neat anvil Feb 28, 2022, 9:12 PM

#

spaCy is great. 100% recommend

desert bear Feb 28, 2022, 9:14 PM

#

neat anvil spaCy is great. 100% recommend

how fast is it

neat anvil Feb 28, 2022, 9:17 PM

#

very

tacit basin Feb 28, 2022, 9:17 PM

#

desert bear alright i think i get it thanks.

Some more model ideas for question answering task https://paperswithcode.com/task/question-answering

Papers with Code - Question Answering

Question Answering is the task of answering questions (typically reading comprehension questions), but abstaining when presented with a question that cannot be answered based on the provided context.

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular b...

desert bear Feb 28, 2022, 9:19 PM

#

tacit basin Some more model ideas for question answering task https://paperswithcode.com/tas...

yes i did look into this and i have some ideas for how to get some contect that is needed for the awnser

thin palm Feb 28, 2022, 9:21 PM

#

Python gang I have a question for making a Class for a "Trainer" to train my machine learning model. If I want to add a Scaler, where would I add this within my code? I'm trying to understand how I'd incorporate a scaler into this:

    def __init__(self, X, y):
        #  X: pandas DataFrame
        #  y: pandas Series
        self.X = X
        self.y = y
        self.knn_model = None

    def set_model(self):
        """ defines our model as a class asttribute"""
        self.knn_model =  KNeighborsClassifier(n_neighbors=10)

    def run(self):
        self.set_model()
        self.knn_model.fit(self.X,self.y)

    def evaluate(self, X_test, y_test):
        r2_test = self.knn_model.score(X_test, y_test)
        y_pred = self.knn_model.predict(X_test)

        # Confusion Matrix
        print(confusion_matrix(y_test, y_pred))
        # Accuracy
        print(accuracy_score(y_test, y_pred))
        # Recall
        print(recall_score(y_test, y_pred, average=None))
        # Precision
        print(precision_score(y_test, y_pred, average=None))

    def save_model(self):
        joblib.dump(self.knn_model, 'model.joblib')
        print(colored("model.joblib saved locally", "green"))

desert oar Feb 28, 2022, 9:24 PM

#

(note: self.knn_model is an instance attribute, not a class attribute)

#

you would add any transformers as additional instance attributes

#

alternatively, use a Pipeline and set that as self.knn_model

#

i think scikit-learn pipelines are very useful

thin palm Feb 28, 2022, 9:25 PM

#

desert oar alternatively, use a `Pipeline` and set _that_ as `self.knn_model`

they are useful but I'm being lazy by not incorporating a pipeline haha

desert oar Feb 28, 2022, 9:25 PM

#

it's easier to use them

thin palm Feb 28, 2022, 9:25 PM

#

desert oar it's easier to use them

people say that, but since I've already begun transferring notebooks to pacakging I don't want to go back into notebooks

desert oar Feb 28, 2022, 9:25 PM

#

what does a pipeline have to do with a notebook?

thin palm Feb 28, 2022, 9:26 PM

#

I'd have to go back and test my pipeline in notebooks before I move into production

#

was told to always experiement on notebooks then go to Visual Studio

thin palm Feb 28, 2022, 9:27 PM

#

desert oar you would add any transformers as additional instance attributes

seems as if I need to review my classes

desert oar Feb 28, 2022, 9:27 PM

#

you don't have to do that

#

test it however you need to test it

thin palm Feb 28, 2022, 9:27 PM

#

desert oar test it however you need to test it

true I do find it easier testing on notebooks

desert oar Feb 28, 2022, 9:27 PM

#

i also question the value of this set_model method

thin palm Feb 28, 2022, 9:27 PM

#

maybe you're right with the Pipelines being easier

thin palm Feb 28, 2022, 9:27 PM

#

desert oar i also question the value of this `set_model` method

yes

desert oar Feb 28, 2022, 9:28 PM

#

class Trainer:
    def __init__(self, X, y):
        self.X = X
        self.y = y
        self.model = None

    def build_model(self):
        return make_pipeline(
            StandardScaler(),
            KNeighborsClassifier(n_neighbors=10),
        )

    def run(self):
        self.model = self.build_model()
        self.model.fit(self.X,self.y)

#

https://scikit-learn.org/stable/modules/generated/sklearn.pipeline.make_pipeline.html
https://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.html

scikit-learn

sklearn.pipeline.make_pipeline

Examples using sklearn.pipeline.make_pipeline: Release Highlights for scikit-learn 1.0 Release Highlights for scikit-learn 1.0, Release Highlights for scikit-learn 0.23 Release Highlights for sciki...

scikit-learn

sklearn.pipeline.Pipeline

Examples using sklearn.pipeline.Pipeline: Feature agglomeration vs. univariate selection Feature agglomeration vs. univariate selection, Pipeline ANOVA SVM Pipeline ANOVA SVM, Poisson regression an...

#

https://scikit-learn.org/stable/modules/compose.html#pipeline

scikit-learn

6.1. Pipelines and composite estimators

Transformers are usually combined with classifiers, regressors or other estimators to build a composite estimator. The most common tool is a Pipeline. Pipeline is often used in combination with Fea...

thin palm Feb 28, 2022, 9:30 PM

#

desert oar ```python class Trainer: def __init__(self, X, y): self.X = X ...

love it.

thin palm Feb 28, 2022, 9:30 PM

#

desert oar ```python class Trainer: def __init__(self, X, y): self.X = X ...

thank you, although I feel like my code does the same thing yours is easier to understand

#

Sorry some times I find my self asking questions that only confuse my self... in my notebooks I decide to scale once after we have our X_train, X_test, y_train, y_test = train_test_split(X,y, test_size = .3, random_state=0)
how do I incorporate this in my class or do I need to do it in my class?

urban prism Feb 28, 2022, 9:38 PM

#

Is there a way I can calculate my model's gpu memory and ram usage? The methods I've found seem to focus on the usage while training. I want to see how it is while using it

neat anvil Feb 28, 2022, 9:47 PM

#

I just use htop for stuff like that, but I'm sure there's a better way

urban prism Feb 28, 2022, 10:04 PM

#

Yeah

desert oar Feb 28, 2022, 10:53 PM

#

thin palm Sorry some times I find my self asking questions that only confuse my self... in...

you could put this in your run method

proven ermine Feb 28, 2022, 11:05 PM

#

Anybody have a rec for a course or website to learn python data science best practices? I'm an experienced data scientist in R and I have familiarity with python but I'm interested in learning the Pydata stack (particularly Dask)

neat anvil Feb 28, 2022, 11:20 PM

#

If you're already experienced in data science generally, the tutorials & user guides for the big libraries would probably be the best place to turn IMO. Tensorflow, pytorch, sklearn, pandas, all have extensive and well-written guides/tutorials.

ember minnow Mar 1, 2022, 1:09 AM

#

Hi, so I was recently trying to predict stock prices using an LSTM (seems really overdone i know) but when I was predicting on data that is outside the dataset, I get a really weird graph that I do not think is correct, am I doing something wrong?
Prediction: https://github.com/Alpheron/StockPred/blob/master/predictions/MSFT-5-Year-LSTM.ipynb
Training:
https://github.com/Alpheron/StockPred/blob/master/MSFT-5-Year-LSTM.ipynb

#

In order to process the data, I used the lookback index of 60 points, so when trying to predict on data that is outside of the dataset, I would need the last 60 points as well, but I am I doing something wrong with the way I am predicting?

azure lagoon Mar 1, 2022, 1:42 AM

#

hello! i got redirected from #discord-bots ... was wondering if anyone had a plt.style built for outputting to discord?

serene scaffold Mar 1, 2022, 3:00 AM

#

azure lagoon hello! i got redirected from <#343944376055103488> ... was wondering if anyone h...

I can't help, but you should probably clarify what you mean by "outputting to discord". like in an embed?

pallid bramble Mar 1, 2022, 3:52 AM

#

I have two lists of States. is it possible to use Pandas to compare the two lists, and create a new series?

#

I'm going to try set intersection

#

I'm not being very clear. I want a list of the items from list A that are NOT in list B. Forming list C. or subtracting list B from list A.

serene scaffold Mar 1, 2022, 3:57 AM

#

yes, sounds like you should use sets and then convert the result back to a Series @pallid bramble

#

yes but I don't know what you're going to ask. you should always just ask your actual question, not if someone knows about something you haven't asked.

#

Nope

#

@stuck storm all you have to do is ask a question that someone can look at and see what the question is. then whoever's available can try to answer it.

pallid bramble Mar 1, 2022, 4:02 AM

#

thank you. that works great. its the difference method, not intersection

serene scaffold Mar 1, 2022, 4:05 AM

#

pallid bramble thank you. that works great. its the difference method, not intersection

!e you can also use the minus operator.

result = {1, 2, 3, 4} - {1, 4}
print(result)

I find it easier to read.

arctic wedgeBOT Mar 1, 2022, 4:05 AM

#

@serene scaffold :white_check_mark: Your eval job has completed with return code 0.

{2, 3}

serene scaffold Mar 1, 2022, 4:07 AM

#

why do you keep deleting stuff that you say

pallid bramble Mar 1, 2022, 4:08 AM

#

thank you. i like that better too.

serene scaffold Mar 1, 2022, 4:09 AM

#

it was better before when you had the function header. but thanks for saying what the variables are. what are you confused by? (by the way, your question is really about neural network theory, not so much numpy. that's another reason why it's better to ask your actual question, not say what you think the topic is.)

#

def loss_and_gradient(X, A, y):
    """Compute the loss"""
    loss = np.sum((X @ A - y) ** 2) / 2
    #compute the gradient
    grad = X.T @ (X @ A - y)  # this is an array
    # return the loss and gradient
    return loss, grad

#

is this a tuple of (X, A, y)? also, what does it do that is different than what you expected?

short delta Mar 1, 2022, 4:14 AM

#

Hi guys, I am trying to debug a hand me down code base.

The script ingest an input excel form to generate an output excel form to add in the query column but it seems like its not working it kept complain about [Column A, Column B, Column C] is not in index.

I added Column A and B to the input excel form and it cleared the error for their respective not in index error.

However, Column C that is not supposed to be in the input excel form is being flagged not in index but is expected to be in the output excel form and it is not there after it was generated.

Looking for anyone that can assist with this debugging

serene scaffold Mar 1, 2022, 4:14 AM

#

not entirely sure. I rewrote it at as this:

def loss_and_grad(X, A, y):
    foo = (X @ A) - y
    return (
        np.sum(foo ** 2) / 2,
        X.T @ foo
    )

serene scaffold Mar 1, 2022, 4:15 AM

#

short delta Hi guys, I am trying to debug a hand me down code base. The script ingest an in...

please show the code and an example of what data is in the excel. if you can't show the data, make an example that captures what the data is like.

short delta Mar 1, 2022, 4:16 AM

#

erm i cannot show the code..

#

that is the issue lol

serene scaffold Mar 1, 2022, 4:16 AM

#

@stuck storm do you not want me to help? I was looking at what you had written right as you deleted it

short delta Mar 1, 2022, 4:16 AM

#

I need some time to prep those then

tacit basin Mar 1, 2022, 4:59 AM

#

urban prism Is there a way I can calculate my model's gpu memory and ram usage? The methods ...

for gpu ram and other stats nvidia-smi tool would help. it will show the gpu stats

tacit basin Mar 1, 2022, 5:02 AM

#

proven ermine Anybody have a rec for a course or website to learn python data science best pra...

i didn't read the book or took the courese, but i've seen it the other day: https://www.udemy.com/course/clean-machine-learning-code/

Udemy

Clean Machine Learning Code

Even Bad ML Code Can Function. This course provides practical solutions to prevent disasters in ML software product.

tacit basin Mar 1, 2022, 5:08 AM

#

short delta Hi guys, I am trying to debug a hand me down code base. The script ingest an in...

Add column C as well to input to debug?

short delta Mar 1, 2022, 5:09 AM

#

tacit basin Add column C as well to input to debug?

it will flag sql query error and says 'Unknown column' in the list of index (which is not supposed to be there)

#

I will update with details later on, gotta get some work done in office now

urban knoll Mar 1, 2022, 6:42 AM

#

I'm not too sure if this is the thread I should use to ask questions about clustering , but since its about AI, here I go,. I'm interested in applying agglomoretive clustering to something I'm working on, specifically the Ward method, but I'm unable to find usefule information on how to fill out the parameter connectivity, I know what it is but I spnt know what exactly I should place in there, like whats the foramt basically?Is it some np.shape() tyoe thing I'm su[poosed to place in there? I've already looked here: https://scikit-learn.org/0.15/modules/generated/sklearn.cluster.Ward.htmlAgglomerativeClustering(n_clusters=7,. Doesnt say what the format is.

desert bear Mar 1, 2022, 8:28 AM

#

neat anvil very

oke it is not just very fast it is insane

idle tartan Mar 1, 2022, 9:02 AM

#

Holaa guys I am working on deepfake detection system I have to submit it within one week can anyone help me with it?

lone drum Mar 1, 2022, 9:07 AM

#

Traceback (most recent call last):

  File "D:\college_project\modules\model_train.py", line 21, in <module>
    model.add(MaxPooling2D(pool_size = (2,2)))

  File "C:\Users\shubh\anaconda3\lib\site-packages\tensorflow\python\training\tracking\base.py", line 629, in _method_wrapper
    result = method(self, *args, **kwargs)

  File "C:\Users\shubh\anaconda3\lib\site-packages\keras\utils\traceback_utils.py", line 67, in error_handler
    raise e.with_traceback(filtered_tb) from None

  File "C:\Users\shubh\anaconda3\lib\site-packages\tensorflow\python\framework\ops.py", line 2013, in _create_c_op
    raise ValueError(e.message)

ValueError: Exception encountered when calling layer "max_pooling2d_7" (type MaxPooling2D).

Negative dimension size caused by subtracting 2 from 1 for '{{node max_pooling2d_7/MaxPool}} = MaxPool[T=DT_FLOAT, data_format="NHWC", explicit_paddings=[], ksize=[1, 2, 2, 1], padding="VALID", strides=[1, 2, 2, 1]](Placeholder)' with input shapes: [?,1,1,16].

Call arguments received:
  • inputs=tf.Tensor(shape=(None, 1, 1, 16), dtype=float32)``` how to fix this error ping me when replying

#

my complete code here https://paste.pythondiscord.com/roqehapeze

tidal bough Mar 1, 2022, 9:09 AM

#

lone drum ```python Traceback (most recent call last): File "D:\college_project\modules...

you're trying to max-pool an image of size 1x1, it seems, which is naturally impossible.

#

probably something went wrong at an earlier stage, since 1x1 is a weird size to have.

lone drum Mar 1, 2022, 9:10 AM

#

tidal bough probably something went wrong at an earlier stage, since 1x1 is a weird size to ...

earlier code ```python
model = Sequential()

model.add(Convolution2D(16, 3, 3, input_shape = (32, 32, 3), activation = 'relu'))

model.add(MaxPooling2D(pool_size = (2,2)))

model.add(Convolution2D(16, 3, 3, activation = 'relu'))

model.add(MaxPooling2D(pool_size = (2,2)))```

abstract falcon Mar 1, 2022, 9:14 AM

#

Can we plot accuracy/loss curve of each iteration in ML algorithms? If yes, Can somebody suggest me any resource to refer?

tidal bough Mar 1, 2022, 9:21 AM

#

abstract falcon Can we plot accuracy/loss curve of each iteration in ML algorithms? If yes, Can ...

What ML library are you using? Many of them have a builtin tool for that.

abstract falcon Mar 1, 2022, 9:21 AM

#

sklearn

tidal bough Mar 1, 2022, 9:24 AM

#

Oh, I see. And what kind of model are you training? Is it a neural network or something more classical?

abstract falcon Mar 1, 2022, 9:26 AM

#

I will be training four models Logistic Regression, SVM, DecisionTrees and Naive Bayes. And I want to see how does each model perform in that dataset.

tidal bough Mar 1, 2022, 9:26 AM

#

It looks like it has this tutorial:
https://scikit-learn.org/stable/auto_examples/model_selection/plot_learning_curve.html
which uses this function, which is supposed to work for every fit-predict capable model :
https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.learning_curve.html#sklearn.model_selection.learning_curve

lone drum Mar 1, 2022, 9:26 AM

#

tidal bough probably something went wrong at an earlier stage, since 1x1 is a weird size to ...

hii, can u please help me to find the issue in my code ?

tidal bough Mar 1, 2022, 9:27 AM

#

lone drum hii, can u please help me to find the issue in my code ?

What's the size of your images? 32x32, right?

lone drum Mar 1, 2022, 9:27 AM

#

tidal bough What's the size of your images? 32x32, right?

yes

gloomy prawn Mar 1, 2022, 9:34 AM

#

Hi, I have table with properties in first row and object in first column and I must put an X in each intersection of object with its property like this: https://github.com/xflr6/concepts/blob/master/examples/relations.csv, after that I need to do some manipulations like deleting one column , deleting one row etc... my question is what's the best way to do that ? Pandas ?
Thanks

tidal bough Mar 1, 2022, 9:34 AM

#

lone drum earlier code ```python model = Sequential() model.add(Convolution2D(16, 3, 3, i...

It looks like you convolve with a 3x3 kernel, which reduces the size to 30x30. Then you do a 2x2 maxpool2d, which reduces it to 15x15. Then another conv2d, for 13x13 and another maxpool2d, for 6x6 or so. Hmm. strange, it really should work

#

oh hold on

#

@lone drumah, your convolution layers aren't just 3x3, they have a kernel_size of 3 (so 3x3) and a strides of 3 (so 3x3).

lone drum Mar 1, 2022, 9:38 AM

#

tidal bough <@!680099760836968475>ah, your convolution layers aren't just 3x3, they have a `...

yes

tidal bough Mar 1, 2022, 9:39 AM

#

so then:
0) input: 32x32

after first convolution: something like 10x10
after max pooling: 5x5
after second convolution: 1x1
and the second max pooling fails, since the input size is too small for a 2x2 pooling

lone drum Mar 1, 2022, 9:39 AM

#

ah okay

#

but when i use this code previously it worked

lone drum Mar 1, 2022, 9:43 AM

#

tidal bough so then: 0) input: 32x32 1) after first convolution: something like 10x10 2) aft...

can u guide me here how i can fix this ?

tidal bough Mar 1, 2022, 9:43 AM

#

Alter your layers in some way so that they don't squeeze the image too much. Can't really recommend how exactly, I haven't worked with CNNs.

maiden shore Mar 1, 2022, 11:01 AM

#

I'm not sure how much of a mouthful this is but could someone please explain linear regression and cost functions to me like I'm 5

tacit basin Mar 1, 2022, 11:26 AM

#

maiden shore I'm not sure how much of a mouthful this is but could someone please explain lin...

Linear regression algorithm approximate observations with a straight line. The line is chosen that minimizes cost function. Cost function could be an average value of absolute distances from actual observations to the fitted line.

idle tartan Mar 1, 2022, 11:45 AM

#

I am working on deepfake detection system I have to submit it within one week can anyone help me with it?

onyx island Mar 1, 2022, 11:51 AM

#

Hello, I'm using K-NN (SVM may be) classifier to detect defective part on image.
After training the classifier, I have the confusion matrix, etc.
Now I want to put a label on the original image to show which part is defective or not.
How can I retrieve data after training the data ?

maiden shore Mar 1, 2022, 11:51 AM

#

tacit basin Linear regression algorithm approximate observations with a straight line. The l...

So is linear regression drawing a line of best fit through a set of data? And cost function is drawing lines from your data points and recording the distance from those points to the line of best fit?
I'm struggling to understand how you would calculate all of that, and apply that to an IRL example

tranquil oak Mar 1, 2022, 12:36 PM

#

whats the best begginer course for ML?

tacit basin Mar 1, 2022, 1:03 PM

#

maiden shore So is linear regression drawing a line of best fit through a set of data? And co...

That's exactly what it is.

tacit basin Mar 1, 2022, 1:06 PM

#

maiden shore So is linear regression drawing a line of best fit through a set of data? And co...

Just draw a set of points. Then draw a straight line. Then measure distance from each point to the line. Add absolute values of all distances together and divide by number of points. You will get one value. Which you want to minimize. So you draw a new line and calculate again.

maiden shore Mar 1, 2022, 1:20 PM

#

tacit basin Just draw a set of points. Then draw a straight line. Then measure distance from...

Where would I draw the line?

tender talon Mar 1, 2022, 1:31 PM

#

def hello():
return 'Goodbye, Mars!'

#

need helps please

#

help how to print "hello world!"

prime hearth Mar 1, 2022, 1:37 PM

#

@tranquil oak can try this article https://medium.com/@alexm5492/linear-regression-from-scratch-3-methods-2e803d82137c

Medium

Linear Regression From Scratch- 3 Methods

Introduction:

tranquil oak Mar 1, 2022, 1:41 PM

#

I heard the basics in ML are very large, so a course that would cover them would be wonderful, thanks for the article, very useful aswell!

prime hearth Mar 1, 2022, 1:46 PM

#

Yeah i agree, this article I found to be helpful as it really is targeted for beginners and explains all the math concepts easily with examples and data science terms

#

But there are many other articles out there that are also great for lineae regression

junior fractal Mar 1, 2022, 1:48 PM

#

Hi

maiden shore Mar 1, 2022, 1:53 PM

#

prime hearth <@!481070779203584000> can try this article https://medium.com/@alexm5492/linea...

Actually this seems like a good starting point for me as well

final kiln Mar 1, 2022, 2:15 PM

#

can anyone recommend a good, math-heavy book on AI and data science?

serene scaffold Mar 1, 2022, 2:39 PM

#

final kiln can anyone recommend a good, math-heavy book on AI and data science?

there's "the deep learning book" https://www.deeplearningbook.org/

hard basalt Mar 1, 2022, 2:40 PM

#

desert oar if you just want to share notebooks as finished products in some on-prem fashion...

Thank you for your solutions. A slightly different question: I am not familiar with interactive notebooks, eg. using ipywidgets, but do we know if a such notebook could be somehow packaged into an application? Does pyinstaller have any result on doing that?

#

I am just thinking of giving my users a set-solution for them to browse and tinker with their data

prime hearth Mar 1, 2022, 2:56 PM

#

@maiden shore yeah so drawingg any random line at first

#

It same when implementing linear regression , we initialize the paramterers for our weights and bias as random values or 0

#

So initially line will be flat if everything initialized to 0

thorn venture Mar 1, 2022, 3:24 PM

#

Hi I need to add some data from 3csv files to a Excel file as it is ; but the catch is there are dates available all files have series of dates like 5th feb to 1st mar. all dates need to be in a same section in master file. e.g. all 3rd feb will be consequent then all 4th will be added.
Should I copy paste and then sort only the date? will that work or any other ways there ? should I use pandas or openpyxl ? Pls help me guys I`m noob in this field.

serene scaffold Mar 1, 2022, 3:25 PM

#

thorn venture Hi I need to add some data from 3csv files to a Excel file as it is ; but the c...

We can figure this out; I just need to understand what the data looks like. can you show examples of the three CSVs as text? use our pastebin: paste.pythondiscord.com

#

!paste

arctic wedgeBOT Mar 1, 2022, 3:25 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

serene scaffold Mar 1, 2022, 3:27 PM

#

the steps will go something like this:

open all three CSVs as DataFrames (with pandas)
Normalize the date representation for each DataFrame
Concatenate all the DataFrames into one
Sort the rows by date

final kiln Mar 1, 2022, 3:31 PM

#

serene scaffold there's "the deep learning book" https://www.deeplearningbook.org/

I'm looking for general relativity levels of math heavy

serene scaffold Mar 1, 2022, 3:31 PM

#

final kiln I'm looking for general relativity levels of math heavy

the book is only about the math. there is no code.

thorn venture Mar 1, 2022, 3:34 PM

#

serene scaffold We can figure this out; I just need to understand what the data looks like. can ...

Wait let me share

#

This is how csv file data are there. I have to paste it like the way where all same dates should be together. then apply formulla to the new column.

serene scaffold Mar 1, 2022, 3:38 PM

#

thorn venture This is how csv file data are there. I have to paste it like the way where all s...

Please re-read my instructions for how to share the data. I will not look at screenshots.

desert oar Mar 1, 2022, 3:38 PM

#

hard basalt Thank you for your solutions. A slightly different question: I am not familiar w...

part of the problem w/ interactive notebooks is that both the notebook frontend rendering engine (jupyter notebook, jupyterlab, nbconvert, etc.) and the kernel running the notebook both need to have the widget-related stuff installed. so you will need to include both the notebook frontend as well as the notebook kernel in your pyinstaller distribution

desert oar Mar 1, 2022, 3:40 PM

#

final kiln can anyone recommend a good, math-heavy book on AI and data science?

in addition to The Deep Learning Book, you can look into The Hundred Page Machine Learning Book by Burkov and Probabilistic Machine Learning by Murphy

#

the latter has a new 2022 edition which is free to read (as a draft) online

#

the 2012 version is also free to read online, and is more polished, but might seem a bit out of date

#

Burkov: http://themlbook.com/
Murphy: https://probml.github.io/pml-book/book1.html

The Hundred-Page Machine Learning Book by Andriy Burkov

All you need to know about Machine Learning in a hundred pages. Supervised and unsupervised learning, support vector machines, neural networks, ensemble methods, gradient descent, cluster analysis and dimensionality reduction, autoencoders and transfer learning, feature engineering and hyperparameter tuning! Math, intuition, illustrations, all i...

upper spindle Mar 1, 2022, 3:44 PM

#

what version of numpy runs with tf

solar phoenix Mar 1, 2022, 3:44 PM

#

hi all, i have a bunch of numbers that i want to average, atm i do it like this G = countsdataG1.at['G']
RG = (countsdataG1.at['RG']+countsdataR1.at['RG'])/2
RGFr = (countsdataG1.at['RGFr']+countsdataR1.at['RGFr']+countsdataFr1.at['RGFr'])/3
GFr = (countsdataG1.at['GFr']+countsdataFr1.at['GFr'])/2
R = countsdataR1.at['R']
Fr = countsdataFr1.at['Fr']
RFr = (countsdataR1.at['RFr']+countsdataFr1.at['RFr'])/2

#

sometimes one of the values will = 0

#

in that case i want to take one of the numbers only and ignore the 0

desert oar Mar 1, 2022, 3:45 PM

#

upper spindle what version of numpy runs with tf

there isn't a specific version equivalence, any recent version of numpy should be okay

solar phoenix Mar 1, 2022, 3:45 PM

#

if both are 0 i want to return 0

desert oar Mar 1, 2022, 3:46 PM

#

solar phoenix in that case i want to take one of the numbers only and ignore the 0

don't overthink it. these are scalar values, i.e. just plain numbers, because you are using at. so just use if and all the other usual python stuff

#

it's easy to get so lost in all the pandas stuff that you forget to use the basic tools that are part of python!

solar phoenix Mar 1, 2022, 3:47 PM

#

Thanks for the response, sorry, can you extrapolate on that?

#

oh so it is like

#

if x > 0: type thing?

desert oar Mar 1, 2022, 3:47 PM

#

yeah, exactly

#

pandas isn't a separate programming language, it's still python

solar phoenix Mar 1, 2022, 3:47 PM

#

sure sure, i get it, thanks

#

appreciate it

upper spindle Mar 1, 2022, 3:50 PM

#

what does it mean if TypeError: 'NoneType' object is not callable

desert oar Mar 1, 2022, 3:51 PM

#

upper spindle what does it mean if `TypeError: 'NoneType' object is not callable`

"callable" means "use it like a function", e.g. input is the name of a function and input() is calling the input function. so this error means you tried to use something like a function, but the thing is None (which has type NoneType), and of course None cannot be "called" like a function

#

usually this is a mistake in your program logic somewhere

#

perhaps you overwrote the name of a function (e.g. sum = None; sum([1,2,3]))

#

you should show your code and the full error traceback

#

it's unlikely that this is a datascience-specific problem

upper spindle Mar 1, 2022, 3:54 PM

#

validation_split=0.2, shuffle=True, verbose=0, batch_size=batch_size, epochs=200)```

#

i executed this code, but says the error came from the mat_X_train mat_y_train line

real flame Mar 1, 2022, 3:57 PM

#

Anyone here knows how to use R script?

serene scaffold Mar 1, 2022, 4:03 PM

#

instead of being sorry for the ping, just don't ping. I am busy.

desert oar Mar 1, 2022, 4:12 PM

#

real flame Anyone here knows how to use R script?

in general it's a good idea not to "ask to ask" like this. you can ask if something is on topic or not, but "does anyone know how to X" is not a good way to get help, because it forces people to "interview" you before getting to any useful work

lapis sequoia Mar 1, 2022, 4:12 PM

#

Fricking

real flame Mar 1, 2022, 4:14 PM

#

desert oar in general it's a good idea not to "ask to ask" like this. you can ask if someth...

never forced anyone but ok lol i just had a question its not that deep

desert oar Mar 1, 2022, 4:16 PM

#

real flame never forced anyone but ok lol i just had a question its not that deep

it's just a general principle about asking things online

real flame Mar 1, 2022, 4:16 PM

#

ok?

serene scaffold Mar 1, 2022, 4:17 PM

#

@real flame salt rock lamp is giving you advice to help you get help in any online context. he is helping you, not criticizing you.

real flame Mar 1, 2022, 4:18 PM

#

serene scaffold <@!755891573123711086> salt rock lamp is giving you advice to help you get help ...

you do realize I asked a question on a python discord server to see if anyone could help, again it’s not that deep and you don’t have to be here defending them

#

jeez stop attacking me I asked one question and I don’t need anyone’s advice I never asked for it nor do I care

serene scaffold Mar 1, 2022, 4:19 PM

#

No one is attacking you. We answer a lot of questions in this server, so we're giving you suggestions that will increase the likelihood that you will receive help in the future.

real flame Mar 1, 2022, 4:20 PM

#

‘It forces people to interview you’ where is the correlation between the question I asked and this response

#

I appreciate your help, but it was never my intention make it seem that way

serene scaffold Mar 1, 2022, 4:20 PM

#

because someone would have to interview you about what your R question is before they could attempt to answer it. people usually don't want to do that--they want to see a question and start answering it.

real flame Mar 1, 2022, 4:23 PM

#

I wanted to see if anyone knew R in the first place since this isn’t a channel exactly for it, nothing wrong with seeing if someone understood the subject

serene scaffold Mar 1, 2022, 4:24 PM

#

nothing wrong with seeing if someone understood the subject
the best way for someone to know if they understand the subject matter of your question is to see the actual question.

#

how much R do they have to know to be able to help? they'll never know until the question is there.

real flame Mar 1, 2022, 4:25 PM

#

desert oar in general it's a good idea not to "ask to ask" like this. you can ask if someth...

okay then whatever this was should’ve been worded properly

serene scaffold Mar 1, 2022, 4:25 PM

#

Even though R is out-of-scope, if you decide to ask your question, I'll allow it as a gesture of goodwill.

real flame Mar 1, 2022, 4:25 PM

#

because I had no idea what they meant and it sounded like I meant something way different than what it actually was, don’t assume I was forcing anyone to interview me

#

#

they could have said this

#

anyways I understand what you meant, next time I’ll ask the question instead of asking who knows how to help

scenic ferry Mar 1, 2022, 4:27 PM

#

I was told that I should join this community if I want to get into actual pyhton

serene scaffold Mar 1, 2022, 4:28 PM

#

scenic ferry I was told that I should join this community if I want to get into actual pyhton

that was probably a decent suggestion, yes. are you trying to get into data science specifically?

scenic ferry Mar 1, 2022, 4:30 PM

#

I accidentally posted this here sorry

#

I meant to send this in general

scenic ferry Mar 1, 2022, 4:30 PM

#

serene scaffold that was probably a decent suggestion, yes. are you trying to get into data scie...

Is there a channel for creating ui's?

serene scaffold Mar 1, 2022, 4:32 PM

#

scenic ferry Is there a channel for creating ui's?

you can make UIs with python, yes

scenic ferry Mar 1, 2022, 4:33 PM

#

Nvmd found it

serene scaffold Mar 1, 2022, 4:33 PM

#

where was it?

scenic ferry Mar 1, 2022, 4:33 PM

#

#software-architecture

serene scaffold Mar 1, 2022, 4:34 PM

#

that's not the UI channel; #user-interfaces is

scenic ferry Mar 1, 2022, 4:34 PM

#

Oh okay

serene scaffold Mar 1, 2022, 4:34 PM

#

be sure to read the channel description before using a channel for the first time

timid forge Mar 1, 2022, 4:34 PM

#

Am I allowed to post a reddit post here that I made instead of rewriting my question out?

serene scaffold Mar 1, 2022, 4:35 PM

#

timid forge Am I allowed to post a reddit post here that I made instead of rewriting my ques...

yes, but you should probably restate enough of the question in the chat to attract those who know about the question.

timid forge Mar 1, 2022, 4:37 PM

#

I'm having a problem running code from the machine learning course on freecodecamp. I am getting this when I run it: Process finished with exit code -1073740791 (0xC0000409). I'm just wondering if this is because my computer can't handle the bigger data or if I am doing something wrong. I explained it a little better here https://www.reddit.com/r/MLQuestions/comments/t3vsj7/code_never_runs_and_getting_similar_error_message/

r/MLQuestions - Code never runs and getting similar error message e...

1 vote and 0 comments so far on Reddit

hard basalt Mar 1, 2022, 5:09 PM

#

desert oar part of the problem w/ interactive notebooks is that both the notebook frontend ...

I see thanks. I am already looking for non-jupyter solutions lemon_pensive

desert oar Mar 1, 2022, 5:11 PM

#

hard basalt I see thanks. I am already looking for non-jupyter solutions <:lemon_pensive:754...

i think the additional problem is that these widgets only work with jupyter

#

maybe you can use nbconvert to convert the notebook to plain html with the widgets intact

arctic crown Mar 1, 2022, 5:22 PM

#

please help what is a tensor?

hard basalt Mar 1, 2022, 5:26 PM

#

desert oar maybe you can use nbconvert to convert the notebook to plain html with the widge...

noted thanks

serene scaffold Mar 1, 2022, 5:47 PM

#

arctic crown please help what is a tensor?

a generalization of an array. An individual number, like 5 is a 0-dimensional tensor. [7, 9, 10] is a one-dimensional tensor of shape (3,). [[4, 9, 7], [1, 0, 6]] is a two-dimensional tensor of shape (2, 3). and so on.

arctic crown Mar 1, 2022, 6:16 PM

#

serene scaffold a generalization of an array. An individual number, like `5` is a 0-dimensional ...

What do the numbers represent?

serene scaffold Mar 1, 2022, 6:16 PM

#

arctic crown What do the numbers represent?

the numbers in my example are just random, for the example

shut trail Mar 1, 2022, 6:24 PM

#

timid forge I'm having a problem running code from the machine learning course on freecodeca...

it ran for me without error

arctic wedgeBOT Mar 1, 2022, 6:25 PM

#

Hey @shut trail!

You either uploaded a .txt file or entered a message that was too long. Please use our paste bin instead.

timid forge Mar 1, 2022, 6:26 PM

#

shut trail it ran for me without error

Interesting. So it is my computer that just can't handle it? Is there some way I can improve performance?

#

Feels like a big task tbh I don't know this stuff 😛 I just want to write code Dx

shut trail Mar 1, 2022, 6:29 PM

#

you could be missing pyqt

tidal bough Mar 1, 2022, 6:30 PM

#

timid forge Feels like a big task tbh I don't know this stuff 😛 I just want to write code D...

https://github.com/tensorflow/tensorflow/issues/47934
might be your issue (found it by your exit code)

GitHub

Process finished with exit code -1073740791 (0xC0000409) when train...

System information Have I written custom code (as opposed to using a stock example script provided in TensorFlow): copied code from tensorflow transfer learning documentation OS Platform and Distri...

graceful glacier Mar 1, 2022, 6:36 PM

#

i have the following table

#

#

its per company per month

#

how can i create a 'last month sales' column?

neat anvil Mar 1, 2022, 6:43 PM

#

@graceful glacier try using a HAVING block in your query

graceful glacier Mar 1, 2022, 6:44 PM

#

HAVING?

#

isnt that SQL specific only?

serene scaffold Mar 1, 2022, 6:46 PM

#

graceful glacier i have the following table

what are you using to manipulate the table? SQL, Pandas, or something else?

#

also, what is "last month sales"?

#

is it the sum of each sales value grouped by company and calendar month?

#

are March 2022 and March 2021 the same month?

graceful glacier Mar 1, 2022, 6:49 PM

#

yea sure, im using pandas and last month sales would indicate the sales from the month previous in the current row. the sales are summed already. and this particular dataset has only two months in it for the same year(2020)

#

so the last month sales would look somthing like this

serene scaffold Mar 1, 2022, 6:50 PM

#

graceful glacier yea sure, im using pandas and last month sales would indicate the sales from the...

please do print(df.sample(10).to_dict('list')) and copy and paste the text (no screenshots) into this chat as text.

graceful glacier Mar 1, 2022, 6:51 PM

#

i can print the whole table, its not big

serene scaffold Mar 1, 2022, 6:52 PM

#

you can do print(df.to_dict('list')) if you want.

graceful glacier Mar 1, 2022, 6:52 PM

#

{'Company': ['British Soaps', 'British Soaps', 'Chin & Beard Suds Co', 'Chin & Beard Suds Co', 'Soap and Splendour', 'Soap and Splendour', 'Squeaky Cleanies', 'Squeaky Cleanies', 'Sudsie Malone', 'Sudsie Malone'], 'Date': [Timestamp('2020-03-01 00:00:00'), Timestamp('2020-04-01 00:00:00'), Timestamp('2020-03-01 00:00:00'), Timestamp('2020-04-01 00:00:00'), Timestamp('2020-03-01 00:00:00'), Timestamp('2020-04-01 00:00:00'), Timestamp('2020-03-01 00:00:00'), Timestamp('2020-04-01 00:00:00'), Timestamp('2020-03-01 00:00:00'), Timestamp('2020-04-01 00:00:00')], 'Sales': [671772175.7995872, 687222935.8429785, 483505038.56760347, 508107776.28321755, 1896382984.9812155, 1933258732.721503, 1790308563.639309, 1818003538.774453, 1747533597.012284, 1794467930.657847]}

serene scaffold Mar 1, 2022, 6:53 PM

#

thanks, let me see

#

df.groupby('Company')['Sales'].diff()
Out[8]:
0             NaN
1    1.545076e+07
2             NaN
3    2.460274e+07
4             NaN
5    3.687575e+07
6             NaN
7    2.769498e+07
8             NaN
9    4.693433e+07
Name: Sales, dtype: float64

Here's one way to do it

#

this works because there's only one row per (company, month) and they're already in chronological order.

#

if one company had more than one row for a given month, you'd have to aggregate them to get unique (company, month) rows.

#

also this is the change in sales. I think that's what you meant

graceful glacier Mar 1, 2022, 6:58 PM

#

yes this would be change in sales

#

and thanks for the solution

#

i think another solution might also be a rolling sum of 2 rows subtracted by the value in the sales column

serene scaffold Mar 1, 2022, 7:00 PM

#