crude needle Nov 4, 2021, 3:27 AM

#

#

I used this but I don't know how to transpose

night dragon Nov 4, 2021, 3:28 AM

#

same column letter each time?

crude needle Nov 4, 2021, 3:28 AM

#

Can you elaborate?

night dragon Nov 4, 2021, 3:28 AM

#

also looks like p haha

crude needle Nov 4, 2021, 3:30 AM

#

I have these multiple csv file and I want to extract only "p" data then I want to transpose and concatenate all the p datas stacked

#

I just want to automate the process as I have hundred of the csv files.

night dragon Nov 4, 2021, 3:39 AM

#

are all the files in the same folder?

crude needle Nov 4, 2021, 3:39 AM

#

yes

night dragon Nov 4, 2021, 3:46 AM

#

give me a sec

crude needle Nov 4, 2021, 3:47 AM

#

sure

night dragon Nov 4, 2021, 4:18 AM

#

You still there @crude needle ?

crude needle Nov 4, 2021, 4:19 AM

#

yes

night dragon Nov 4, 2021, 4:19 AM

#

import csv
import pathlib


def main():
    p = pathlib.Path(r'path_to_csv_folder')
    csv_files = [f for f in p.iterdir() if f.suffix == '.csv']

    with open('new_file.csv', 'w', newline='') as new_file:
        writer = csv.writer(new_file)
        for file in csv_files:
            with open(file, 'r') as csv_file:
                reader = csv.reader(csv_file)
                row_data = []
                for row in reader:
                    if row[-1] != 'p':
                        row_data.append(row[-1])
                writer.writerow(row_data)


if __name__ == '__main__':
    main()

#

should do the trick

crude needle Nov 4, 2021, 4:22 AM

#

It did work

#

Thanks

night dragon Nov 4, 2021, 4:23 AM

#

must be an end column that is different

crude needle Nov 4, 2021, 4:24 AM

#

yeah I need to figure this out

#

How can this happen as we have mentioned 'p'

night dragon Nov 4, 2021, 4:25 AM

#

if row[-1].isdigit() and row[-1] != 0:

I didn't specify the column, just took the last element

crude needle Nov 4, 2021, 4:25 AM

#

nvm the last csv file didn't have p file

#

You did great. Awesome. Thanks 🙂

frozen marten Nov 4, 2021, 7:15 AM

#

if we want to begin a career as a data scientist what are the skills required? and what role should we take up as a fresher?

lapis sequoia Nov 4, 2021, 11:12 AM

#

where do i learn tenser flow lemon_eyes

shy nimbus Nov 4, 2021, 11:26 AM

#

Hello, how i can fit SVC ( sklearn ) by a few text data vectorized by Tfidf?

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import SVC

vectorizer = TfidfVectorizer()

keywords = vectorizer.fit_transform(train_df['keyword'].astype('str'))
locations = vectorizer.fit_transform(train_df['location'].astype('str'))
texts = vectorizer.fit_transform(train_df['text'].astype('str'))

features = np.hstack((keywords, locations, texts))

classifier = SVC()
classifier.fit(features, labels)

This raise

TypeError                                 Traceback (most recent call last)
TypeError: float() argument must be a string or a number, not 'csr_matrix'

The above exception was the direct cause of the following exception:
ValueError: setting an array element with a sequence.

How i can fit SVC by multi-feature data?

rough mountain Nov 4, 2021, 11:27 AM

#

Always a good sign in binary classification: val_accuracy: 0.5000

lapis sequoia Nov 4, 2021, 11:32 AM

#

Hey all, I am a post graduate data science researcher trying to develop an AI tool to help league of legends players improve their gameplay. Before I start I need to do some preliminary data collection on the sentiment towards existing improvement tools. I would greatly appreciate if LoL players take a look at this survey, its aprox. 3 mins
https://unibocconi.qualtrics.com/jfe/form/SV_06PRQq80VZE3lEa

Online Survey Software | Qualtrics Survey Solutions

Qualtrics sophisticated online survey software solutions make creating online surveys easy. Learn more about Research Suite and get a free account today.

lapis sequoia Nov 4, 2021, 11:48 AM

#

lapis sequoia where do i learn tenser flow <:lemon_eyes:754441879885447296>

bump

lapis sequoia Nov 4, 2021, 12:27 PM

#

for this question i would have thought that the bigo notation would have been O(1) due to the list being sorted and that unsorted lists have to have space made for additional objects where a sorted list does not? or am i miss understanding something here?
if someone can tag me too with a reply that would be wonderful!

serene scaffold Nov 4, 2021, 1:40 PM

#

lapis sequoia for this question i would have thought that the bigo notation would have been O(...

try #algos-and-data-structs

#

also we can't really read that screenshot. Try regular text.

odd meteor Nov 4, 2021, 1:42 PM

#

shy nimbus Hello, how i can fit SVC ( sklearn ) by a few text data vectorized by Tfidf? ```...

Hi rokiboxoffical, after vectorizing your text, your output is gon be a sparse matrix. So I think you should convert it to a 'dense' numpy array which you can ultimately pass into a pandas DataFrame.

Example

vect = TfidfVectorizer()
vect.fit(df[['keyword', 'location', 'text']] )
X = vect.transform(df[['keyword', 'location', 'text']] )

X_df = pd.DataFrame(X.toarray(), columns = vect.get_feature_names())

X_df.head()

Then you can proceed further to train your model with SVC

odd meteor Nov 4, 2021, 1:56 PM

#

shy nimbus Hello, how i can fit SVC ( sklearn ) by a few text data vectorized by Tfidf? ```...

I do have a question... How are you able to post your python code here? Is there a special command to use to fo this on Discord?

serene scaffold Nov 4, 2021, 1:56 PM

#

odd meteor Hi rokiboxoffical, after vectorizing your text, your output is gon be a sparse m...

converting it to dense might use up all their memory

serene scaffold Nov 4, 2021, 1:56 PM

#

odd meteor I do have a question... How are you able to post your python code here? Is there...

!code

arctic wedgeBOT Nov 4, 2021, 1:56 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

serene scaffold Nov 4, 2021, 1:56 PM

#

shy nimbus Hello, how i can fit SVC ( sklearn ) by a few text data vectorized by Tfidf? ```...

you don't want to call fit_transform more than once with the same vectorizer. This will reset how the vectorizer works and make your results meaningless.

odd meteor Nov 4, 2021, 2:02 PM

#

serene scaffold converting it to dense might use up all their memory

What do you suggest is the best way to mitigate the memory problem? I usually convert mine back to a numpy array though without having any memory issues.

Right now, the only thing I can think of that can help a bit as regards the memory concern is perhaps specifying the max_features argument inside the vectorizer.

I'd like to know how you'd personally approach such if you were to be having such challenge.

serene scaffold Nov 4, 2021, 2:03 PM

#

odd meteor What do you suggest is the best way to mitigate the memory problem? I usually co...

I'm pretty sure you can use sparse arrays with sklearn

hollow ember Nov 4, 2021, 3:00 PM

#

https://paste.pythondiscord.com/raw/rabonatemi

hollow ember Nov 4, 2021, 3:01 PM

#

hollow ember https://paste.pythondiscord.com/raw/rabonatemi

error in the link

#

@serene scaffold senpai

#

serene scaffold Nov 4, 2021, 3:09 PM

#

hollow ember

please do car_missing.dtypes

hollow ember Nov 4, 2021, 3:09 PM

#

serene scaffold Nov 4, 2021, 3:10 PM

#

hollow ember

going forward, please copy and paste text directly. I do not like to read screenshots of text.

it appears that the Odometer (KM) column is either all strings, or contains a mix of floats and strings.

#

try car_missing['Odometer (KM)'].astype(float) and see if that causes an error.

hollow ember Nov 4, 2021, 3:11 PM

#

1 sec

#

https://paste.pythondiscord.com/raw/ujunifeyed

serene scaffold Nov 4, 2021, 3:12 PM

#

hollow ember https://paste.pythondiscord.com/raw/ujunifeyed

so you have the string 'missing' in the column, and this cannot be converted to a number.

#

Try car_missing.loc[car_missing['Odometer (KM)'] == 'missing', 'Odometer (KM)'] = pd.NA

limber depot Nov 4, 2021, 3:14 PM

#

hi, does anyone know how to use pandas to drop duplicate rows, where values in specific columns match? for example

df = pd.DataFrame({
    'a':[1,1,3,4],
    'b':[1,1,3,4],
    'c':[1,1,3,4],
    'd':[1,2,3,4],
})

So for the above dataframe, I want to remove index 0 and 1, where the values in a, b and c match, ignoring the value in column d

hollow ember Nov 4, 2021, 3:14 PM

#

serene scaffold Try `car_missing.loc[car_missing['Odometer (KM)'] == 'missing', 'Odometer (KM)']...

@serene scaffold no output

serene scaffold Nov 4, 2021, 3:15 PM

#

hollow ember <@!253696366952316929> no output

that statement was not intended to have an output. Try doing fillna again.

hollow ember Nov 4, 2021, 3:15 PM

#

it worked

serene scaffold Nov 4, 2021, 3:16 PM

#

limber depot hi, does anyone know how to use pandas to drop duplicate rows, where values in s...

!docs pandas.DataFrame.drop

arctic wedgeBOT Nov 4, 2021, 3:16 PM

#

pandas.DataFrame.drop


DataFrame.drop(labels=None, axis=0, index=None, columns=None, level=None, inplace=False, errors='raise')```
Drop specified labels from rows or columns.

Remove rows or columns by specifying label names and corresponding axis, or by specifying directly index or column names. When using a multi-index, labels on different levels can be removed by specifying the level. See the user guide <advanced.shown\_levels> for more information about the now unused levels.

limber depot Nov 4, 2021, 3:16 PM

#

Aah columns edit: nope, couldn't work it out. lol

hollow ember Nov 4, 2021, 3:16 PM

#

can u explain the whole thing in simple language pls? @serene scaffold

serene scaffold Nov 4, 2021, 3:18 PM

#

hollow ember can u explain the whole thing in simple language pls? <@!253696366952316929>

the string "missing" is not the same as a NaN, so fillna won't know what to do with it. So car_missing.loc[car_missing['Odometer (KM)'] == 'missing', 'Odometer (KM)'] = pd.NA replaces all the "missing" strings with a NaN (that's what pd.NA is), and then you can do fillna as normal.

hollow ember Nov 4, 2021, 3:18 PM

#

got it thank you

serene scaffold Nov 4, 2021, 3:18 PM

#

limber depot Aah columns edit: nope, couldn't work it out. lol

do you know how to get a boolean series of which rows you want?

limber depot Nov 4, 2021, 3:20 PM

#

I don't. I'm not sure of the best approach for it. I have tried drop_duplicates, but i don't know how to do it across multiple columns

serene scaffold Nov 4, 2021, 3:21 PM

#

limber depot I don't. I'm not sure of the best approach for it. I have tried drop_duplicates,...

!e

import pandas as pd
df = pd.DataFrame({
    'a':[1,1,3,4],
    'b':[1,1,3,4],
    'c':[1,1,3,4],
    'd':[1,2,3,4],
})

result = df.query("(a == b) & (b == c)")
print(result)

junior matrix Nov 4, 2021, 3:21 PM

#

can any0ne help me with pyspark

arctic wedgeBOT Nov 4, 2021, 3:21 PM

#

@serene scaffold :white_check_mark: Your eval job has completed with return code 0.

001 |    a  b  c  d
002 | 0  1  1  1  1
003 | 1  1  1  1  2
004 | 2  3  3  3  3
005 | 3  4  4  4  4

serene scaffold Nov 4, 2021, 3:21 PM

#

hmm that's not what I wanted

serene scaffold Nov 4, 2021, 3:22 PM

#

junior matrix can any0ne help me with pyspark

you should say what help you need with pyspark. "asking to ask" actually slows down the process.

junior matrix Nov 4, 2021, 3:22 PM

#

jus a sec

limber depot Nov 4, 2021, 3:23 PM

#

serene scaffold hmm that's not what I wanted

It worked. Just changed it to

#

result = df.query("~(a == b) & (b == c)")
print(result)

#

it removed the matching rows. Thanks a lot!

#

I'll look further into .query

junior matrix Nov 4, 2021, 3:24 PM

#

i need to explode dwells into two colums named march and april and get the values of mapType in the columns...

serene scaffold Nov 4, 2021, 3:24 PM

#

limber depot ``` result = df.query("~(a == b) & (b == c)") print(result) ```

might want to do ~((a == b) & (b == c)) so you get the right order of operations

#

~(a == b) & (b == c) is the same as (a != b) & (b == c)

limber depot Nov 4, 2021, 3:24 PM

#

lovely. Thnks a lot

junior matrix Nov 4, 2021, 3:29 PM

#

I tried the hardcoded method but i need something more general..

#

dfkom=(dfL
.withColumn('April', dfL.dwells['2020-03-30'])
.withColumn('March', dfL.dwells['2020-03-02'])
)
.drop('safegraph_place_id', 'dwells')
.fillna('N/A', ['March', 'April'])
dfkom.show()

odd meteor Nov 4, 2021, 3:48 PM

#

limber depot hi, does anyone know how to use pandas to drop duplicate rows, where values in s...

!e

import pandas as pd
df = pd.DataFrame({
    'a':[1,1,3,4],
    'b':[1,1,3,4],
    'c':[1,1,3,4],
    'd':[1,2,3,4],
}) 

df.drop_duplicates(keep='first', inplace=True)

print(df)

limber depot Nov 4, 2021, 3:49 PM

#

odd meteor !e ```py import pandas as pd df = pd.DataFrame({ 'a':[1,1,3,4], 'b':[1,1...

Thanks! It feels like there are a million ways to do the same thing in python 🙂

wicked grove Nov 4, 2021, 4:05 PM

#

hello,i have an image dataset and the labels for these images are in a csv

#

i used the below code to load the images

#

to preprocess these images, should i put it in a list and do the same?

#


from os import walk
for dirname,_,filenames in os.walk(r'D:\dataset'):
    for filename in filenames:
        image = Image.open(os.path.join(dirname, filename))

#

since the labels are in a csv im unable tp understand how i should train the model such that it know this image is for this label

robust night Nov 4, 2021, 4:14 PM

#

Hi, I have two different files of data with different samples that correspond to a type of cancer, file 1 specifies which sample corresponds to which cancer, while file 2 just has a whole list of these kinds of samples. I have categorise and calculate how many samples corresponds to what type of cancer in file 2. Any idea how to do this using file 1 data?

junior matrix Nov 4, 2021, 5:08 PM

#

I am getting index 0 out of bounds for axis 0 with size 0 when using join in pyspark…any idea why that might be?

mighty spoke Nov 4, 2021, 5:28 PM

#

Hi, I'm trying to make a loop, where I have two columns x1 and x2 I want to take the first element in x1 and subtract it from the first element from x2 then I want to move on to the next element in x2 and take away the first element in x1 repeat this until i have subtracted the first element in x1 from every element in x2, once I have done this I move on the second element of x1 and repeat the above steps. This is my code so far but i'm not sure its right; the columns are from csv files, appreciate any help!

smoky birch Nov 4, 2021, 7:08 PM

#

How do you fine tune a bert model?

desert bear Nov 4, 2021, 7:10 PM

#

Hello, can anyone help me with some definitions related to genetic algorithms?

Basically I don't know if I understand this correctly.
Population is for example list of sequences: ["12345", "54321", "25431"]
An individual is a sequence from the population: e.g. "12345"
Gene (in that context) is a random integer from that individual: e.g. 5

Is it called a gene or a genotype?

uncut barn Nov 4, 2021, 7:45 PM

#

hi is there a way to convert this binary image into a network (i.e. with nodes and edges)?

cobalt jetty Nov 4, 2021, 7:52 PM

#

You might want to look into something called Topological Data Analysis, where the idea is that you can create graphs/topological spaces out of such pictures. For something more in-depth, you might want to read up on the concept of simplicial complexes (and how to grow them via the Vietoris-Rips method).

There is a library that allows you to do such a thing called GUDHI, a free-to-use library created at the French INRIA: https://gudhi.inria.fr/.

Beyond the theoretical, you have a big set of available tutorials on that topic here: https://github.com/GUDHI/TDA-tutorial

GitHub

GitHub - GUDHI/TDA-tutorial: A set of jupyter notebooks for the pra...

A set of jupyter notebooks for the practice of TDA with the python Gudhi library together with popular machine learning and data sciences libraries. - GitHub - GUDHI/TDA-tutorial: A set of jupyter ...

#

However, that might be using a nuke to kill a bird kind of use for this library. I'm pretty sure that you can find something easier to implement to relies on similar ideas.

sick wedge Nov 4, 2021, 8:01 PM

#

hey guys, this is the silhouette score I got trying to calculate optimal number of K for KMeans clustering algorithm, is there any hope for me in clustering this data?

#

I'm new to clustering so idk if I should give up

shy nimbus Nov 4, 2021, 8:01 PM

#

Hello, is exists ways how to stop sklearn fit method if it freeze PC. When I choose algorithm for my model, I test a many algorithms, and any of this classifiers make my PC freeze. How I can avoid this, or check complexity of classifier before use it?

tight walrus Nov 4, 2021, 8:30 PM

#

if p_value adf test = 0
does that mean the serie is stationary?

wise pelican Nov 4, 2021, 8:55 PM

#

Small question
I'm creating multiple plots with matplotlib simultaneously with multiprocessing
But matplotlib using Qt doesn't like being used in a multiprocessed way and keeps throwing this warning:
WARNING: QApplication was not created in the main() thread.
There are no actual issues with the program, so I want to just to ignore these warnings - how would I go about doing that?
I've tried this:

import warnings
warnings.filterwarnings("ignore", module = "matplotlib\..*" )

rough mountain Nov 4, 2021, 9:16 PM

#

Could someone explain to me that one really large number?

_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
conv2d (Conv2D)              (None, 298, 298, 64)      1792
_________________________________________________________________
max_pooling2d (MaxPooling2D) (None, 149, 149, 64)      0
_________________________________________________________________
activation (Activation)      (None, 149, 149, 64)      0
_________________________________________________________________
conv2d_1 (Conv2D)            (None, 147, 147, 32)      18464
_________________________________________________________________
max_pooling2d_1 (MaxPooling2 (None, 73, 73, 32)        0
_________________________________________________________________
activation_1 (Activation)    (None, 73, 73, 32)        0
_________________________________________________________________
flatten (Flatten)            (None, 170528)            0
_________________________________________________________________
dense (Dense)                (None, 64)                10913856
_________________________________________________________________
dense_1 (Dense)              (None, 1)                 65
=================================================================
Total params: 10,934,177
Trainable params: 10,934,177
Non-trainable params: 0
_________________________________________________________________```

#

"dense (Dense) (None, 64) 10913856"

limber depot Nov 4, 2021, 9:26 PM

#

Hi! I have 2 dataframes, I want to use 'Reference_Code' in df1 to join with 'Smyths_Code' from df2, and then transfer the matches from column 'asin' from df2 to df1. What should I use to achieve that?

serene scaffold Nov 4, 2021, 9:56 PM

#

limber depot Hi! I have 2 dataframes, I want to use 'Reference_Code' in df1 to join with 'Smy...

!docs pandas.DataFrame.merge

arctic wedgeBOT Nov 4, 2021, 9:56 PM

#

pandas.DataFrame.merge


DataFrame.merge(right, how='inner', on=None, left_on=None, right_on=None, left_index=False, right_index=False, sort=False, suffixes=('_x', '_y'), copy=True, indicator=False, validate=None)```
Merge DataFrame or named Series objects with a database-style join.

A named Series object is treated as a DataFrame with a single named column.

The join is done on columns or indexes. If joining columns on columns, the DataFrame indexes *will be ignored*. Otherwise if joining indexes on indexes or indexes on a column or columns, the index will be passed on. When performing a cross merge, no column specifications to merge on are allowed.

jaunty pine Nov 4, 2021, 10:00 PM

#

Can someone explain to me what reshaping data is? Why do we do that?

velvet thorn Nov 4, 2021, 10:09 PM

#

jaunty pine Can someone explain to me what reshaping data is? Why do we do that?

are you familiar with features vs samples?

shut trail Nov 4, 2021, 10:20 PM

#

jaunty pine Can someone explain to me what reshaping data is? Why do we do that?

do you just mean like wide and long tables ??

nova cliff Nov 4, 2021, 10:36 PM

#

anyone know why the fbprophet time series model doesnt work on python 3.9

#

but works on 3.8.11?

jaunty pine Nov 4, 2021, 10:39 PM

#

velvet thorn are you familiar with features vs samples?

no, im new to ML

shut trail Nov 4, 2021, 10:40 PM

#

nova cliff but works on 3.8.11?

isnt it up to 3.7

#

https://pypi.org/project/fbprophet/

PyPI

fbprophet

Automatic Forecasting Procedure

#

wait you were probably asking 'why'

nova cliff Nov 4, 2021, 10:41 PM

#

just tried 3.8.11 it worked

#

Im just that OCD guy who likes to use everything thats updated

#

which might be a bad thing bc I should probably understand the discrepancy between 3.9 and 3.8.11

#

it just feels weird to bump the kernel down bc I want to use time series

shut trail Nov 4, 2021, 10:42 PM

#

environments are your friend haha

nova cliff Nov 4, 2021, 10:42 PM

#

and another weird thing

#

is prophet and fbprophet both run

#

so its like....which one do I do ppl say do one or the other like

#

got me in a tight squeeze im being way to analytical for no reason

shut trail Nov 4, 2021, 10:43 PM

#

its great! get involved in the project

#

you can help update

nova cliff Nov 4, 2021, 10:44 PM

#

Lol, Im trying to build up my portfolio rn

#

im getting rejected for every DS intern application bc I spent my time in a non open source friendly internship prior

#

so I have like nothing on github and no projects

#

only a house price prediction with ann and xgboost XD

#

also in final year of undergrad @shut trail

shut trail Nov 4, 2021, 10:46 PM

#

k, well break those anxieties apart. start contributing because it makes you a part of thing. but the getting work side, not so huge. do a couple case studies or example projects, put up a portfolio, and get some work somewhere. your internship will make fine experience if you write it up well

#

work no stress

nova cliff Nov 4, 2021, 10:46 PM

#

thanks for the insights bro!

#

yea i been mad stressed recently, im actually in marketing and chose to take the DS route so

shut trail Nov 4, 2021, 10:47 PM

#

marketing is good

nova cliff Nov 4, 2021, 10:47 PM

#

i need to stall so i actually am doing grad school for DS

#

applied to some schools etc

#

startup life is not for me

shut trail Nov 4, 2021, 10:48 PM

#

#career-advice

#

why do you need to stall ? stress only ?

nova cliff Nov 4, 2021, 10:49 PM

#

Got you ill take this there

shut trail Nov 4, 2021, 10:49 PM

#

over pay? then get something random. marketing analysts are in lots of demand, remote jobs.. ya should be okay. stress is the real killer

odd meteor Nov 4, 2021, 11:38 PM

#

shy nimbus Hello, is exists ways how to stop sklearn fit method if it freeze PC. When I cho...

If your pc is freezing while training your model, it could be because:

The ram of your Pc is quite low for the task you're trying to use it to do
You're working with a very large dataset.

Solution

if you're working with a very large dataset, try to use a sample of the data for your training
if it's a pc ram problem, try upgrading the ram of your pc or you can use Google Colab, Kaggle Kernel, or Binder to train your model.

robust night Nov 5, 2021, 5:06 AM

#

Hello ! Does anymore know how to code these counts to print from max to min number?

#

prisma mist Nov 5, 2021, 5:21 AM

#

robust night

maybe numpy? np.sort(array)[::-1]

mighty spoke Nov 5, 2021, 6:48 AM

#

Hi anyone know why my function is not printing anything, `import pandas as pd
import matplotlib.pyplot as plt#imported pyplot to plot graphs
import datetime as dt#date time to read first column of csv file
import numpy as np
from datetime import datetime

df = pd.read_csv('FREY.csv')
df2 = pd.read_csv('SLI.csv')

pd.to_datetime(df["Date"]).dt.strftime("%d%m%Y")#reads the date time objects
pd.to_datetime(df2["Date"]).dt.strftime("%d%m%Y")

df['Date'] = pd.to_datetime(df['Date'])
df2['Date'] = pd.to_datetime(df2['Date'])
x1=(df['Date'] - dt.datetime(1970,1,1)).dt.total_seconds()/86400
x2=(df2['Date'] - dt.datetime(1970,1,1)).dt.total_seconds()/86400 def dcf(x1, x2):
x1_n = len(x1)
x2_n = len(x2)
x1_mean = np.mean(x1)
x2_mean = np.mean(x2)
x1_stdv = np.std(x1)
x2_stdv = np.std(x2)
dcf = np.zeros((x1_n, x2_n))
for i, x1 in enumerate(x1):
for j, x2 in enumerate(x2):
#x2_val=[t+x2[j]]
d = (x1 - x1_mean) * (x2 - x2_mean) / (x1_stdv*x2_stdv)
dcf[i, j] = d
return dcf
print(dcf)`

boreal summit Nov 5, 2021, 12:09 PM

#

I am trying to plot a Histogram from the PySpark Data that I am working on. From the book I am studying & all the online resource I've searched. The way to plot is through "histogram_data = df.select('column_name').rdd.flatMap(lambda x: x).histogram()" . Trying this in practice throws an error.

#

so I am wondering if the PySpark API has been updated or something. Thanks.

#

ALll the examples I searched online showed the same procedure. Yet, the code kept throwing an error.

last abyss Nov 5, 2021, 12:22 PM

#

Does someone know if there is a way to scikit-surprise with python 3.9?

serene scaffold Nov 5, 2021, 12:50 PM

#

mighty spoke Hi anyone know why my function is not printing anything, ...

Are you familiar with variable scoping? print(dcf) is just printing the function object itself. you did not call it.

#

inside the dcf function, you use the name dcf to refer to something else, but this only applies within the funciton.

civic ivy Nov 5, 2021, 2:51 PM

#

hey all i want to make a Self Learning Optical Auditory Machine which is gonna be a visual and auditory self learning artificial intelligence which will be connected some sort of robotics. i was wondering where i could start. Also i have adhd so im also how i would keeps my engaged with this.

lapis sequoia Nov 5, 2021, 3:34 PM

#

Heyyyy peeps!! I want to save different .png for every index value, any idea how i Could do that?

#

#

I also another for loop that runs this whole process for every year's data.

#

I want to save this loop's word cloud for every year in separate folders. Is that too complicated to do? any suggestions?

robust jungle Nov 5, 2021, 3:58 PM

#

What should I use instead of train.py for efficientnet?

lapis sequoia Nov 5, 2021, 4:04 PM

#

robust jungle What should I use instead of train.py for efficientnet?

model.py thats where the model gets trained and you can put he architecture into another file called RNN.py

#

if you were looking to orgranise your code. If you meant in other means, do give more information

robust jungle Nov 5, 2021, 4:08 PM

#

lapis sequoia if you were looking to orgranise your code. If you meant in other means, do give...

Sorta new, just downloaded efficientnet to follow an outdated tutorial, trying to find out how to run it (already did the configs)

lapis sequoia Nov 5, 2021, 4:11 PM

#

run the file that contains the loop for epoch!!

pure wagon Nov 5, 2021, 4:54 PM

#

Is there someone that's able to tell me what's wrong with my code?

pale elbow Nov 5, 2021, 5:08 PM

#

got error Attribute Error: 'SeriesGroupBy' object has no attribute 'ewm'
solving kaggle problem Google brain ventilator pressure prediction problem

Notebook url
https://www.kaggle.com/dmitryuarov/ventilator-pressure-eda-Istm-0-189/notebook?script Versionld=75835345& cellld=13

In cell 13 where df["'ewn_u_in_mean"] is assign It it to fix ewm error but can't find it's fix Even in stackoverflow

mighty spoke Nov 5, 2021, 6:27 PM

#

serene scaffold Are you familiar with variable scoping? `print(dcf)` is just printing the functi...

I see so how would i print the values of the dcf, i tried to append it to a list but this also doesn’t work?

serene scaffold Nov 6, 2021, 12:18 AM

#

@mighty spoke have you changed the code from what you showed earlier?

balmy junco Nov 6, 2021, 12:28 AM

#

Does anybody know of any interesting problems in terms of gene sequencing?

#

preferably based on open source data lol

urban knoll Nov 6, 2021, 6:29 AM

#

Hi guys, I'm trying to create a CNN algorithm to classify images with dust and without dust. I am using this tutorial to create my own algorithm. and I'm having a hard time with this step: labels = ['Dusty', 'clear'] img_size = 224 def get_data(data_dir): data = [] for label in labels: path = os.path.join(data_dir, label) class_num = labels.index(label) for img in os.listdir(path): try: img_arr = cv2.imread(os.path.join(path, img))[...,::-1] #convert BGR to RGB format resized_arr = cv2.resize(img_arr, (img_size, img_size)) # Reshaping images to preferred size data.append([resized_arr, class_num]) except Exception as e: print(e) return np.array(data) train =get_data('\\home\\Users\\Philip\\Desktop\\miscellenous\\Fall 2021\\Camera-Lens-Smear-Detection-OpenCV-and-Python-master\\Sample1\\Training') val = get_data('\\home\\Users\\Philip\\Desktop\\miscellenous\\Fall 2021\\Camera-Lens-Smear-Detection-OpenCV-and-Python-master\\Sample1\\Test')

#

I get the error : ```\AppData\Local\Temp/ipykernel_17848/790153361.py in get_data(data_dir)
6 path = os.path.join(data_dir, label)
7 class_num = labels.index(label)
----> 8 for img in os.listdir(path):
9 try:
10 img_arr = cv2.imread(os.path.join(path, img))[...,::-1] #convert BGR to RGB format

FileNotFoundError: [WinError 3] The system cannot find the path specified: '\home\Users\Philip\Desktop\miscellenous\Fall 2021\Camera-Lens-Smear-Detection-OpenCV-and-Python-master\Sample1\Training\Dusty ' No clue how to fix this path error. Article does not really do well to explain in my opinion, perhaps just my lack of experience. Am I to have folder inTrainingcalledDusty? should my pictures have the word Dusty``` in them? I dont know, Here is the link to the tutorial, Just look at step 2: https://www.analyticsvidhya.com/blog/2020/10/create-image-classification-model-python-keras/#wait_approval

Analytics Vidhya

Tanishq Gautam

Image Classification in Python with Keras | Image Classification

Image Classification means assigning an input image, one label from a fixed set of categories. Learn to Build an image classification model.

void pagoda Nov 6, 2021, 7:20 AM

#

I'm a beginner and am starting to learn, I'm having trouble indicating overfitting and underfitting, if anyone could give me some guidance I appreciate it!

lapis sequoia Nov 6, 2021, 8:16 AM

#


#Using the replace function, replace NaN in the contact attribute with unknown.
# continue working on this at home
# NEED TO UNDERSTAND WHY THIS PIECE OF CODE IS NOT WORKING
bank['contact'] = bank['contact'].replace(to_replace=['NaN'],value='unknown', inplace=True)
bank.head(5)

``` would somone be able to explain why my piece of code is not filtering my 'contact' category please i feel my code is correct the closest i have got is for all of them to `none` which is not what i am looking for? many thanks!

foggy moon Nov 6, 2021, 8:48 AM

#

how to understand if the data is stationary or not
I have a dataset of frequency Hourly. If i pass this dataset to adfuller test, the result is stationary. But if i average the result of the day, the result becomes non stationary

bold timber Nov 6, 2021, 9:06 AM

#

whether the type of target in classification shouldn't have boolean?

clear glacier Nov 6, 2021, 9:49 AM

#

er, I have this multiindex

MultiIndex([(                               'No',                     '-'),
            (    'Wilayah Kerja Statistik - BPS', 'Nama Desa / Kelurahan'),
            (    'Wilayah Kerja Statistik - BPS', 'Kode Desa / Kelurahan'),
            ('Wilayah Administrasi - Kemendagri', 'Nama Desa / Kelurahan'),
            ('Wilayah Administrasi - Kemendagri', 'Kode Desa / Kelurahan')],
           )

how can i easily ungroup this? reset_index doesnt seem to work :<
reset index just makes this worse 😭

MultiIndex([(                            'index',                      ''),
            (                               'No',                     '-'),
            (    'Wilayah Kerja Statistik - BPS', 'Nama Desa / Kelurahan'),
            (    'Wilayah Kerja Statistik - BPS', 'Kode Desa / Kelurahan'),
            ('Wilayah Administrasi - Kemendagri', 'Nama Desa / Kelurahan'),
            ('Wilayah Administrasi - Kemendagri', 'Kode Desa / Kelurahan')],
           )

shy nimbus Nov 6, 2021, 11:40 AM

#

Hello, soon I'll be participate in machine learning contest,and I have one question. When I fit my model, I split train and test data, but is I should train my model on full data before send my model? I think it may be cause overfit?

serene oasis Nov 6, 2021, 12:03 PM

#

Hello, I am trying to use ocr to read an Arabic text from different images but I can't find any resources related to that if anyone tried it before and has an idea I will appreciate it.

odd meteor Nov 6, 2021, 1:20 PM

#

shy nimbus Hello, soon I'll be participate in machine learning contest,and I have one quest...

You have to split your data before training your model.

Split your data into train and validation set.
train your model on X_train and y_train
predict on X_test

velvet thorn Nov 6, 2021, 1:41 PM

#

shy nimbus Hello, soon I'll be participate in machine learning contest,and I have one quest...

if they have a holdout set, then yes, you should train on the full dataset.

#

and they probably will, it being a competition

twilit forum Nov 6, 2021, 2:16 PM

#

Anyone knows about any library for Segmentation models such as Unet or Segnet?

mighty spoke Nov 6, 2021, 2:39 PM

#

serene scaffold <@761588935616102422> have you changed the code from what you showed earlier?

Hi @serene scaffold I tried letting d be nonlocal and many other things but nothing seems to be working

serene scaffold Nov 6, 2021, 2:40 PM

#

mighty spoke Hi <@!253696366952316929> I tried letting d be nonlocal and many other things b...

I'm not at my desktop right now, but I can't comment without seeing the code.

mighty spoke Nov 6, 2021, 2:47 PM

#

serene scaffold I'm not at my desktop right now, but I can't comment without seeing the code.

Nws i'll try work on it more and get back to you

simple ivy Nov 6, 2021, 4:50 PM

#

hey y'all, im working on an object detection and am sort of confused. am i supposed to manually add a label-map.pbtxt file in my project (and if so, where?) or is that generated when generating a TF record?

wicked grove Nov 6, 2021, 5:22 PM

#

hello

#

i am trying to load an image dataset whose labels are in a csv

#

i am using the following code but i think it is incorrect

#

can someone please guide me

serene scaffold Nov 6, 2021, 5:24 PM

#

wicked grove i am using the following code but i think it is incorrect

are you still looking for the code?

wicked grove Nov 6, 2021, 5:24 PM

#

yupp

#

from os import walk
image=[]
for dirname,_,filenames in os.walk(r'D:\dataset'):
    for filename in filenames:
        if(filename==merged_labels['image']):
            img = cv2.imread(os.path.join(dirname, filename,merged_labels['level']))
            image.append(img)

serene scaffold Nov 6, 2021, 5:26 PM

#

wicked grove ```py from os import walk image=[] for dirname,_,filenames in os.walk(r'D:\datas...

For one thing, remember not to wrap entire if conditions in parentheses.

# oman
if(filename==merged_labels['image']):
    ...
# yemen
if filename == merged_labels['image']:
    ...

#

also, isn't the third element given by os.walk always a string, not a list of strings?

#

!docs os.walk

arctic wedgeBOT Nov 6, 2021, 5:26 PM

#

os.walk


os.walk(top, topdown=True, onerror=None, followlinks=False)```
Generate the file names in a directory tree by walking the tree either top-down or bottom-up. For each directory in the tree rooted at directory *top* (including *top* itself), it yields a 3-tuple `(dirpath, dirnames, filenames)`.

serene scaffold Nov 6, 2021, 5:27 PM

#

no, you were right

wicked grove Nov 6, 2021, 5:27 PM

#

serene scaffold For one thing, remember not to wrap entire if conditions in parentheses. ```py #...

okayy thank you! i will keep that in mind

serene scaffold Nov 6, 2021, 5:27 PM

#

so is image not what you expected at the end?

#

also you have from os import walk but then you have os.walk. did you do import os somewhere else?

wicked grove Nov 6, 2021, 5:28 PM

#

yupp, it throws an error and i think what i am doing is wrong

serene scaffold Nov 6, 2021, 5:28 PM

#

wicked grove yupp, it throws an error and i think what i am doing is wrong

if you get an error message, always show the whole error

wicked grove Nov 6, 2021, 5:28 PM

#

serene scaffold also you have `from os import walk` but then you have `os.walk`. did you do `imp...

yeah i imported it in the beginning

serene scaffold Nov 6, 2021, 5:28 PM

#

the error message is almost always the most valuable of all possible information you could share

wicked grove Nov 6, 2021, 5:28 PM

#

  Traceback (most recent call last)
<ipython-input-32-1b61408943dc> in <module>
      3 for dirname,_,filenames in os.walk(r'D:\dataset'):
      4     for filename in filenames:
----> 5         if(filename==merged_labels['image']):
      6             img = cv2.imread(os.path.join(dirname, filename,merged_labels['level']))
      7             image.append(img)

~\anaconda3\lib\site-packages\pandas\core\generic.py in __nonzero__(self)
   1440     @final
   1441     def __nonzero__(self):
-> 1442         raise ValueError(
   1443             f"The truth value of a {type(self).__name__} is ambiguous. "
   1444             "Use a.empty, a.bool(), a.item(), a.any() or a.all()."

ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().

serene scaffold Nov 6, 2021, 5:29 PM

#

yes, if you had shown this originally, we would have gone in an entirely different direction

#

the problem is that merged_labels is a series, so filename==merged_labels['image'] is also a series

#

and as stated, The truth value of a Series is ambiguous.

wicked grove Nov 6, 2021, 5:29 PM

#

#

This is my csv

serene scaffold Nov 6, 2021, 5:30 PM

#

wicked grove

please always share csv data by copying the actual csv text into the chat. You can get a sample with print(df.head().to_csv())

wicked grove Nov 6, 2021, 5:32 PM

#

image    level
10_left    0
10_right    0
13_left    0
13_right    0
15_left    1
15_right    2
16_left    4
16_right    4
17_left    0
17_right    1
19_left    0
19_right    0
20_left    0
20_right    0
21_left    0
21_right    0
22_left    0
22_right    0
23_left    0
23_right    0
25_left    0
25_right    0
30_left    1
30_right    2
31_left    0
31_right    0
33_left    0
33_right    0
36_left    1
36_right    0
40_left    2
40_right    0
41_left    0
41_right    0
42_left    0
42_right    0
46_left    0
46_right    0
47_left    0
47_right    0
49_left    0
49_right    0
51_left    2
51_right    0
52_left    0
52_right    0
54_left    2
54_right    2
56_left    0
56_right    0
57_left    0
57_right    0
58_left    0
58_right    0
59_left    0
59_right    0
60_left    0
60_right    0
62_left    0
62_right    0
64_left    0
64_right    0
65_left    0
65_right    0
66_left    0
66_right    0
67_left    0
67_right    0
70_left    0
70_right    0
72_left    0
72_right    0
73_left    0
73_right    0
74_left    0
74_right    0
75_left    0
75_right    0
78_left    2
78_right    2
79_left    2
79_right    2
81_left    0
81_right    0
82_left    2

serene scaffold Nov 6, 2021, 5:33 PM

#

wicked grove ``` image level 10_left 0 10_right 0 13_left 0 13_right 0 15_left...

looks like this is tab separated?

wicked grove Nov 6, 2021, 5:34 PM

#

it is in an excel file

serene scaffold Nov 6, 2021, 5:34 PM

#

anyway, what is filename==merged_labels['image'] intended to do?

wicked grove Nov 6, 2021, 5:34 PM

#

so these are my training labels

serene scaffold Nov 6, 2021, 5:34 PM

#

because keep in mind, merged_labels['image'] is an entire series of strings. so what are you trying to say when a given file name "equals" the entire series?

wicked grove Nov 6, 2021, 5:35 PM

#

wicked grove Nov 6, 2021, 5:35 PM

#

serene scaffold because keep in mind, `merged_labels['image']` is an entire series of strings. s...

And this is my image dataset
I want to map the labels of that image to the filename

#

And i have removed some image names from merged_labels and i don't want to map the ones which aren't in the merged_labels

#

i am using an image dataset whose training labels have 5 classes(no dr,mild,severe and proliferate), and i converted this to binary . i also removed the mild class from the df. the problem is these mild images are in the image dataset

serene scaffold Nov 6, 2021, 5:42 PM

#

wicked grove And this is my image dataset I want to map the labels of that image to the file...

what you want is a dataframe with the full path for each file in one column, and the part of the name found in your image/level dataframe in the other

#

and then you can merge on that name.

wicked grove Nov 6, 2021, 5:44 PM

#

ohh but how will that reflect on the images' filenames? or can i do that after it is merged?

serene scaffold Nov 6, 2021, 5:45 PM

#

wicked grove ohh but how will that reflect on the images' filenames? or can i do that after i...

I don't know what you mean by "reflect on the images' filenames" What you'll end up with is something like this:

image         level          path
10_left       0              D:/path/to/10_left.png
10_right      0              D:/path/to/10_right.png

#

because you'll have two dataframes, one that only has (image, level), and another that only has (image, path), and then you merge them on image.

wicked grove Nov 6, 2021, 5:46 PM

#

yeah exactly but i want make the image's filepath D:/path/to/10_left_0.png

serene scaffold Nov 6, 2021, 5:47 PM

#

wicked grove yeah exactly but i want make the image's filepath D:/path/to/10_left_0.png

so you want to move existing files?

wicked grove Nov 6, 2021, 5:47 PM

#

yeah rename the existing files to match its respective labels

serene scaffold Nov 6, 2021, 5:48 PM

#

wicked grove yeah rename the existing files to match its respective labels

does every single path end in .png?

wicked grove Nov 6, 2021, 5:48 PM

#

wicked grove yeah rename the existing files to match its respective labels

because idk how i can train the model when the labels are in a csv

wicked grove Nov 6, 2021, 5:48 PM

#

serene scaffold does every single path end in `.png`?

.jpeg

#

every single path ends in .jpeg

serene scaffold Nov 6, 2021, 5:50 PM

#

wicked grove .jpeg

then you can do df['new_path'] = df['path'].str[:-5] + '_' + df['level'].astype(str) + '.jpeg'

#

that might actually be inefficient shrug2

#

!docs shutil

arctic wedgeBOT Nov 6, 2021, 5:50 PM

#

shutil

Source code: Lib/shutil.py

The shutil module offers a number of high-level operations on files and collections of files. In particular, functions are provided which support file copying and removal. For operations on individual files, see also the os module.

Warning

Even the higher-level file copying functions (shutil.copy(), shutil.copy2()) cannot copy all file metadata.

On POSIX platforms, this means that file owner and group are lost as well as ACLs. On Mac OS, the resource fork and other metadata are not used. This means that resources will be lost and file type and creator codes will not be correct. On Windows, file owners, ACLs and alternate data streams are not copied.

serene scaffold Nov 6, 2021, 5:50 PM

#

you can use this to move files though.

wicked grove Nov 6, 2021, 5:56 PM

#

serene scaffold then you can do `df['new_path'] = df['path'].str[:-5] + '_' + df['level'].astype...

ohh alrightt!thank youu😄 and once i do that i can use os.walk and os.rename?

wicked grove Nov 6, 2021, 6:02 PM

#

serene scaffold so you want to move existing files?

I don't really want to move them ,i just want to rename them so that they can get mapped to their labels

serene scaffold Nov 6, 2021, 6:07 PM

#

wicked grove I don't really want to move them ,i just want to rename them so that they can ge...

why will changing their names help with that

wicked grove Nov 6, 2021, 6:14 PM

#

Hmm for that particular image path i want to fetch its label from the csv for example while doing os.walk if get image14.jpeg, can check "image " "image14" in the df and then get the level

wicked grove Nov 6, 2021, 6:15 PM

#

serene scaffold why will changing their names help with that

I thought when the images are in a training folder and the labels are in a csv this is how it should be done but idk if i am correct

serene scaffold Nov 6, 2021, 6:21 PM

#

wicked grove I thought when the images are in a training folder and the labels are in a csv t...

the name of the file has no role in how the machine learning happens.

torpid cape Nov 6, 2021, 6:32 PM

#

Sorry to interupt, but does anyone have any Ai Dev Ideas?

wicked grove Nov 6, 2021, 6:32 PM

#

serene scaffold the name of the file has no role in how the machine learning happens.

But wont the name help me track whether the prediction is correct or not

serene scaffold Nov 6, 2021, 6:32 PM

#

wicked grove But wont the name help me track whether the prediction is correct or not

not really

wicked grove Nov 6, 2021, 6:33 PM

#

Then what do you suggest i do if i have my labels in a csv and images in a training folder? The method you mentioned? To put it all in a df?

serene scaffold Nov 6, 2021, 6:34 PM

#

wicked grove Then what do you suggest i do if i have my labels in a csv and images in a train...

yes, you usually has one column that has the expected label.

wicked grove Nov 6, 2021, 6:36 PM

#

serene scaffold yes, you usually has one column that has the expected label.

Ohh alright!!
I also want to remove the mild images(label is 1) so i thought if i change the names i can do os.remove( 01.jpeg)

serene scaffold Nov 6, 2021, 6:37 PM

#

wicked grove Ohh alright!! I also want to remove the mild images(label is 1) so i thought if...

remove them in what sense? delete them from your computer?

wicked grove Nov 6, 2021, 6:37 PM

#

Yupp delete it from the training folder

serene scaffold Nov 6, 2021, 6:38 PM

#

wicked grove Yupp delete it from the training folder

do you have a copy somewhere, just in case?

wicked grove Nov 6, 2021, 6:38 PM

#

Yeah i created a backup of the folder

serene scaffold Nov 6, 2021, 6:40 PM

#

wicked grove Yeah i created a backup of the folder

I usually use pathlib for this kind of thing instead of os

#

!docs pathlib

arctic wedgeBOT Nov 6, 2021, 6:40 PM

#

pathlib

New in version 3.4.

Source code: Lib/pathlib.py

This module offers classes representing filesystem paths with semantics appropriate for different operating systems. Path classes are divided between pure paths, which provide purely computational operations without I/O, and concrete paths, which inherit from pure paths but also provide I/O operations.

../_images/pathlib-inheritance.png If you’ve never used this module before or just aren’t sure which class is right for your task, Path is most likely what you need. It instantiates a concrete path for the platform the code is running on.

Pure paths are useful in some special cases; for example:

woeful cape Nov 6, 2021, 8:25 PM

#

Hello,

I have a question about porting TensorFlow models between Python and Java.

I currently have a model written in Python, and would like to make a GUI in Java. I know that there is a TensorFlow package in Java, however I have not found any sample code or description of how to load an h5 model anywhere.

Has anyone solved any such problem and could direct me if this is possible?

It is a binary image classifier.

Thanks

iron basalt Nov 6, 2021, 8:50 PM

#

woeful cape Hello, I have a question about porting TensorFlow models between Python and Ja...

I have never had this exact problem, but I would consider making the GUI in Python.

ripe forge Nov 6, 2021, 9:13 PM

#

there might be other formats for saving the model that make this easier too, explore things that let you make an api from your model as well

#

tensorflow itself had something, though i havent explored this much, via their protobuf format i think. or theres a couple names like bentoml. (but all uncharted territory for me)

quiet vault Nov 6, 2021, 10:33 PM

#

iron basalt I have never had this exact problem, but I would consider making the GUI in Pyth...

Making a GUI with Java is much better

austere swift Nov 6, 2021, 11:54 PM

#

woeful cape Hello, I have a question about porting TensorFlow models between Python and Ja...

https://www.tensorflow.org/api_docs/java/org/tensorflow/SavedModelBundle here's the tensorflow java docs for model loading

TensorFlow

SavedModelBundle | Java | TensorFlow

#

actually that's the old api, heres the link for the new one https://www.tensorflow.org/jvm/api_docs/java/org/tensorflow/proto/framework/SavedModel

TensorFlow

SavedModel | JVM | TensorFlow

rough mountain Nov 7, 2021, 2:00 AM

#

Adding the early stopping callback gives me: tensorflow/core/kernels/data/generator_dataset_op.cc:107] Error occurred when finalizing GeneratorDataset iterator: Failed precondition: Python interpreter state is not initialized. The process may be terminated. [[{{node PyFunc}}]] after about 4 epochs.

lapis sequoia Nov 7, 2021, 3:12 AM

#

I am an AI

solid urchin Nov 7, 2021, 3:16 AM

#

not sure if this is the right channel and sorry for the repost, just hard to not have my posts immediately get bumped off screen in #python-discussion

what library/api/whatever would you all recommend for plotting data on a map? i want to:

just be able to plot a list of coordinates as markers on an interactive map
be able to plot a choropleth map
be able to interact with a map to place a marker, and store the coordinates from where it was placed into a tuple or some variables or whatever

#

i also will probably need geocoding functionality

spark dirge Nov 7, 2021, 3:18 AM

#

fluf what have you tried? interactive is hard

solid urchin Nov 7, 2021, 3:18 AM

#

tried the google maps library, but i couldn't find a way to display a map. just get coordinates and geocode and stuff

#

i'd be fine without #3 (interacting with/placing markers)

#

in #1 by interactive what i mean is like, having the ability to zoom and drag around

#

if that's not possible i suppose a static image could still be fine, just wouldn't be as nice

solid urchin Nov 7, 2021, 4:14 AM

#

if anyone responds pls @ me

main fox Nov 7, 2021, 4:17 AM

#

solid urchin if anyone responds pls @ me

Checkout mapbox

lapis sequoia Nov 7, 2021, 11:36 AM

#

Hi, I just discovered Cleverbot (Conversational Agent) which was created using Deep Learning, however, I don't see how the bot can predict words using numbers and theorem. Any explanation?

cloud widget Nov 7, 2021, 11:53 AM

#

hi, I want to create a tts model for sinhala language, can someone guide me how to go with it

#

I used wavent on google colab to train some pre-recorded wav files on sinhala language

#

but I think I don't understand what I am really doing, please someone tell me how to achieve this task

lapis sequoia Nov 7, 2021, 11:58 AM

#

umm

#

is sinhala on google translator ?

cloud widget Nov 7, 2021, 11:58 AM

#

yeah

#

i know, but it's not good enough to put on a virtual assistant

lapis sequoia Nov 7, 2021, 11:59 AM

#

then u can use library such as gtts to make one

lapis sequoia Nov 7, 2021, 11:59 AM

#

cloud widget i know, but it's not good enough to put on a virtual assistant

so are u trying to make a virtual assiatant ?

cloud widget Nov 7, 2021, 12:00 PM

#

lapis sequoia so are u trying to make a virtual assiatant ?

yes

lapis sequoia Nov 7, 2021, 12:00 PM

#

umm

#

watching it on yt would be better

#

i have some rash ideas only

cloud widget Nov 7, 2021, 12:01 PM

#

but nothing about other languages other than english and even if there, they are uysing google cloud which they don't have a sinhala tts

#

the only sinhala tts available right now is the one in google translator

lapis sequoia Nov 7, 2021, 12:02 PM

#

cannot help it then man

cloud widget Nov 7, 2021, 12:02 PM

#

hmm

lapis sequoia Nov 7, 2021, 12:03 PM

#

i dont know much about that either

cloud widget Nov 7, 2021, 12:03 PM

#

ok thanks

mighty spoke Nov 7, 2021, 12:36 PM

#

Hi @serene scaffold I managed to print the vales of the dcf function however I'm trying to now print the lag values and the corresponding dcf values in two columns would you know how to do that?

stable kestrel Nov 7, 2021, 12:42 PM

#

hey, does someone have a bit of experience in statistics and coding?

#

im fully stuck right now

serene scaffold Nov 7, 2021, 1:30 PM

#

mighty spoke Hi <@!253696366952316929> I managed to print the vales of the dcf function howev...

I can't comment without seeing the exact code as currently written.

serene scaffold Nov 7, 2021, 1:31 PM

#

stable kestrel hey, does someone have a bit of experience in statistics and coding?

Try asking a specific question. Then people can see if they know how to help.

stable kestrel Nov 7, 2021, 1:32 PM

#

and understand a bt what a likelihood function is

serene scaffold Nov 7, 2021, 1:32 PM

#

Please always post code as text

stable kestrel Nov 7, 2021, 1:32 PM

#

buttt

serene scaffold Nov 7, 2021, 1:32 PM

#

!code

arctic wedgeBOT Nov 7, 2021, 1:32 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

stable kestrel Nov 7, 2021, 1:32 PM

#

serene scaffold Please always post code as text

this in in jupyter as text ehh

serene scaffold Nov 7, 2021, 1:32 PM

#

stable kestrel this in in jupyter as text ehh

You can copy and paste the relevant parts out.

#

helping people with programming questions often involves replicating their code locally, and with screenshots, that would require typing all of it out by hand.

stable kestrel Nov 7, 2021, 1:33 PM

#

its just a lot of text and not a lot of code

#

lemme try

#

yeah no

#

there is not really any code to write

#

or give

#

just a bunch of text edited in latex

#

serene scaffold Nov 7, 2021, 1:35 PM

#

so you're just being asked to implement this likelihood function, yes?

stable kestrel Nov 7, 2021, 1:36 PM

#

i am yeah for Pr(𝑥⃗ ∣𝜇′,𝜎′)

#

so i first need to know if the function i have is correct

#

and maby then try to implement it

#

because i have no idea if im doing things correctly

serene scaffold Nov 7, 2021, 1:37 PM

#

stable kestrel i am yeah for Pr(𝑥⃗ ∣𝜇′,𝜎′)

can you show your current solution for Pr(𝑥⃗ ∣𝜇′,𝜎′)?

#

Remember to use code formatting
```py
code
```

stable kestrel Nov 7, 2021, 1:37 PM

#

stable kestrel

basically this

#

the big function

serene scaffold Nov 7, 2021, 1:38 PM

#

stable kestrel the big function

so the latex part is the solution? you're not implementing that function in python?

stable kestrel Nov 7, 2021, 1:38 PM

#

i want to , that is question 7 but i dont really know if question 6 is even correct so i can implement the right code

#

# samples are fixed, modify shape and location on the left side, so mu and sigma
#mu = 78.3
#sigma = 3
#samples is every sample

mu = mean_xx_c #(fill in mean of samples)
sigma = 3
samples = x_values


def likelihood(mu, sigma, samples):
    (1/)
    
    
    
    pass```

#

this is my question 7 right now

#

just almost non existent

serene scaffold Nov 7, 2021, 1:39 PM

#

stable kestrel i want to , that is question 7 but i dont really know if question 6 is even corr...

I can help you implement a given formula in python, but I can't help you confirm that the formula itself is correct.

stable kestrel Nov 7, 2021, 1:40 PM

#

ahh okay hmm

#

i think i can implement it

#

not sure about the summation part

#

maby just use sum()

#

?

serene scaffold Nov 7, 2021, 1:41 PM

#

stable kestrel maby just use `sum()`

if you're working with (numpy) arrays, the array will have a sum method

#

also this: https://discord.gg/G7nB7UF9

#

for example, if x is a one-dimensional array, then the summation part of your likelihood formula is ((x - mu) ** 2).sum()

#

(x - mu) subtracts mu from every element of x, returning a new array.
((x - mu) ** 2) raises each element to the second power

stable kestrel Nov 7, 2021, 1:43 PM

#

okay okay

#

thnxx ❤️

#

i implemented it as this

#

# samples are fixed, modify shape and location on the left side, so mu and sigma
#mu = 78.3
#sigma = 3
#samples is every sample

mu = mean_xx_c #(fill in mean of samples)
sigma = 3
samples = x_values


def likelihood(mu, sigma, samples):
    likelihood = ((1/2*np.pi*sigma**2)** len(samples)/2)* np.exp(-1/(2*sigma**2))* sum(samples-mu)**2

    
    
    
    return likelihood

likelihood(mu, sigma, samples)```

serene scaffold Nov 7, 2021, 2:05 PM

#

stable kestrel ```py # samples are fixed, modify shape and location on the left side, so mu and...

can you space things out so it isn't so hard to read?

#

sum(samples-mu)**2 this means that the sum is taken, and then that one number is raised to two. is that what you want?

#

def likelihood(mu, sigma, samples):
    a = (1/2 * np.pi * sigma ** 2) ** len(samples) / 2
    b = np.exp(-1 / (2 * sigma ** 2)) * sum(samples - mu) ** 2
    return a * b

stable kestrel Nov 7, 2021, 2:08 PM

#

hmm

#

yeah it probably works

#

but the result is too large apparently

#

so that is annooying

serene scaffold Nov 7, 2021, 2:09 PM

#

"too large"?

stable kestrel Nov 7, 2021, 2:14 PM

#

yep

#

# samples are fixed, modify shape and location on the left side, so mu and sigma
#mu = 78.3
#sigma = 3
#samples is every sample

mu = mean_xx_c #(fill in mean of samples)
sigma = 3
samples = x_values


def likelihood(mu, sigma, samples):
    a = (1/2 * np.pi * sigma ** 2) ** len(samples) / 2
    b = np.exp(-1 / (2 * sigma ** 2)) * sum(samples - mu) ** 2
    return a * b

likelihood(mu, sigma, samples)```

lapis sequoia Nov 7, 2021, 2:24 PM

#

Hey, anyone has a clue why linear reg model is structured like this: y = - W[0] / W[1] * x + (0.5 - b) / W[1]

Full code, just in case:

input_dim = 2
output_dim = 1
W = tf.Variable(initial_value=tf.random.uniform(shape=(input_dim, output_dim)))
b = tf.Variable(initial_value=tf.zeros(shape=(output_dim,)))

def model(inputs):
    return tf.matmul(inputs, W) + b

def square_loss(targets, predictions):
    per_sample_losses = tf.square(targets - predictions)
    return tf.reduce_mean(per_sample_losses)

learning_rate = 0.1

def training_step(inputs, targets):
    with tf.GradientTape() as tape:
        predictions = model(inputs)
        loss = square_loss(predictions, targets)
    grad_loss_wrt_W, grad_loss_wrt_b = tape.gradient(loss, [W, b])
    W.assign_sub(grad_loss_wrt_W * learning_rate)
    b.assign_sub(grad_loss_wrt_b * learning_rate)
    return loss

for step in range(40):
    loss = training_step(inputs, targets)
    print(f"Loss at step {step}: {loss:.4f}")

x = np.linspace(-1, 4, 100)
y = - W[0] /  W[1] * x + (0.5 - b) / W[1]

frozen pasture Nov 7, 2021, 3:12 PM

#

Hey everyone , I have 2 input independent variables and i have to predict 3 output dependent variables . i have a dataset of 9000 rows . Which technique should i use to predict the data in multioutput regression ?

mighty spoke Nov 7, 2021, 3:37 PM

#

serene scaffold I can't comment without seeing the exact code as currently written.

Hi @serene scaffold here's my code: `
import pandas as pd#import pandas package to read data more easily
import matplotlib.pyplot as plt#imported pyplot to plot graphs
import datetime as dt#date time to read first column of csv file
import numpy as np
from datetime import datetime

df = pd.read_csv('FREY.csv')
df2 = pd.read_csv('SLI.csv')

pd.to_datetime(df["Date"]).dt.strftime("%d%m%Y")#reads the date time objects
pd.to_datetime(df2["Date"]).dt.strftime("%d%m%Y")

df['Date'] = pd.to_datetime(df['Date'])
df2['Date'] = pd.to_datetime(df2['Date'])
x1=(df['Date'] - dt.datetime(1970,1,1)).dt.total_seconds()/86400
x2=(df2['Date'] - dt.datetime(1970,1,1)).dt.total_seconds()/86400

t0=[]
for i in range(len(x1)):
for j in range(len(x2)):
t=x2[j]-x1[i]
if x2[j]==x2[0]:
x2[j]+=1
if x2[j]==len(x2):
x1[i]+=1
t0.append(t)

def dcf(x1, x2):
x_n = len(x1)
y_n = len(x2)
x_mean = np.mean(x1)
y_mean = np.mean(x2)
x_stdv = np.std(x1)
y_stdv = np.std(x2)
dcf = np.zeros((x_n, y_n))
for i, x_val in enumerate(x1):
for j, y_val in enumerate(x2):
d = (x_val - x_mean) * (y_val - y_mean) / (x_stdv*y_stdv)
dcf[i, j] = d
return dcf

y=np.array(dcf(x1,x2))
x=np.array(t0)

"""
plt.plot(x,y, ls='-', lw='1', color='red', marker='.', label=r"$x$")
plt.title('DCF vs Lag')
plt.xlabel('time lag')
plt.ylabel('DCF')
plt.show()
""" `

lapis sequoia Nov 7, 2021, 3:51 PM

#

Hello peeps!! how do i create dict from two lists of list.???

#

list = [[1, 30, 5], [2, 64, 4]]
list2 =[ [2, 64, 4], [1, 30, 5]]

calm thicket Nov 7, 2021, 3:53 PM

#

what should the resulting dict look like

lapis sequoia Nov 7, 2021, 3:55 PM

#

dict = {1:2,30:64, 5:4, 2:1,64:30, 4:5}

#

each element of list1 as key and each element of list2 as the value

#

possible?? i found it really hard to do it

calm thicket Nov 7, 2021, 4:01 PM

#

it is certainly possible

lapis sequoia Nov 7, 2021, 4:01 PM

#

could you please help me out on this?

calm thicket Nov 7, 2021, 4:01 PM

#

dict(zip(itertools.chain.from_iterable(list1), itertools.chain.from_iterable(list2)))

#

you have to import itertools of course

lapis sequoia Nov 7, 2021, 4:06 PM

#

calm thicket Nov 7, 2021, 4:08 PM

#

that's why you don't name things the same name as builtin things

lapis sequoia Nov 7, 2021, 6:08 PM

#

any way to flatten a pandas dataframe into a list

serene scaffold Nov 7, 2021, 7:33 PM

#

lapis sequoia any way to flatten a pandas dataframe into a list

Is it all numbers?

#

Flattening a dataframe seems like a weird thing to want to do, since a list can't express relations like a dataframe does

#

But if it's all numbers, you can use to_numpy and reshape that.

austere swift Nov 7, 2021, 8:58 PM

#

github copilot literally just wrote an entire rnn based encoder for me

#

(i wasn't gonna use an rnn but that's interesting though)

lyric ermine Nov 7, 2021, 9:31 PM

#

so i got a column in my df and i wanna drop all rows which are duplicates but keep all the NaN values, what is the best way to do so? anybody got some tip?

serene scaffold Nov 7, 2021, 9:50 PM

#

!docs pandas.DataFrame.drop_duplicates

arctic wedgeBOT Nov 7, 2021, 9:50 PM

#

pandas.DataFrame.drop\_duplicates


DataFrame.drop_duplicates(subset=None, keep='first', inplace=False, ignore_index=False)```
Return DataFrame with duplicate rows removed.

Considering certain columns is optional. Indexes, including time indexes are ignored.

serene scaffold Nov 7, 2021, 9:51 PM

#

Hmmmmm

#

@lyric ermine are you referring to nan values in the column of interest?

main fox Nov 7, 2021, 10:06 PM

#

lyric ermine so i got a column in my df and i wanna drop all rows which are duplicates but k...

df[(~df.duplicated()) | (df['col'].isnull())]

lyric ermine Nov 7, 2021, 10:07 PM

#

i think i found a solution

df[(~df['website'].duplicated()) | (df['website'].isnull())]

#

sorry was busy fixing it @serene scaffold @main fox

sour abyss Nov 7, 2021, 11:29 PM

#

anyone know good courses for learning data sci/ml?

deft harbor Nov 8, 2021, 12:19 AM

#

Is there anything specific that interests you?

unkempt jay Nov 8, 2021, 12:41 AM

#

I have a question to do with saving data in python

var=(x,y,pygame.image)

when i try to use pickle it says i cant save a pygame.surface so it there anyway to convert it to something that can be saved

sour abyss Nov 8, 2021, 12:53 AM

#

deft harbor Is there anything specific that interests you?

things like web scraping, data collecting, and tensorflow

random nest Nov 8, 2021, 1:07 AM

#

Kaggle.com/courses looks good

small bison Nov 8, 2021, 1:28 AM

#

does anyone know how to work with datetime in pandas dataframe? I'm trying to use this more giving error.

#

-> ValueError: unconverted data remains: e

shut trail Nov 8, 2021, 2:41 AM

#

small bison does anyone know how to work with datetime in pandas dataframe? I'm trying to us...

does it run without the .dt.month ?

shut trail Nov 8, 2021, 2:45 AM

#

unkempt jay I have a question to do with saving data in python var=(x,y,pygame.image) when...

ive used buffer objects to save matplotlib figures for later editing. would that work ?

small bison Nov 8, 2021, 3:16 AM

#

shut trail ive used buffer objects to save matplotlib figures for later editing. would that...

I didn't get it right. but with .dt.mont or without it, it's giving the same error

shut trail Nov 8, 2021, 3:20 AM

#

small bison I didn't get it right. but with .dt.mont or without it, it's giving the same err...

even without formatting ?

#

id try without formatting then using .month

#

gotta see the data to understand the problem

small bison Nov 8, 2021, 3:46 AM

#

shut trail gotta see the data to understand the problem

can i send you the csv file

arctic wedgeBOT Nov 8, 2021, 3:46 AM

#

Hey @small bison!

Uh-oh! It looks like your message got zapped by our spam filter. We currently don't allow .csv attachments, so here are some tips to help you travel safely:

• If you attempted to send a message longer than 2000 characters, try shortening your message to fit within the character limit or use a pasting service (see below)

• If you tried to show someone your code, you can use codeblocks
(run !code-blocks in #bot-commands for more information) or use a pasting service like:

https://paste.pythondiscord.com

small bison Nov 8, 2021, 3:48 AM

#

shut trail gotta see the data to understand the problem

https://archive.ics.uci.edu/ml/datasets/Online+Shoppers+Purchasing+Intention+Dataset

next lance Nov 8, 2021, 7:44 AM

#

What is the first step to make a object detection model

fickle hill Nov 8, 2021, 11:36 AM

#

hi anyone done any gtp-2 modeling ?

serene scaffold Nov 8, 2021, 12:30 PM

#

@next lance you need a dataset

fickle hill Nov 8, 2021, 12:31 PM

#

serene scaffold <@810795950246854664> you need a dataset

I'm, gathering the dataset

serene scaffold Nov 8, 2021, 12:31 PM

#

fickle hill hi anyone done any gtp-2 modeling ?

Even if someone has, you'll get help faster if you put your real question out there

fickle hill Nov 8, 2021, 12:31 PM

#

the question is the type of data

serene scaffold Nov 8, 2021, 12:31 PM

#

fickle hill I'm, gathering the dataset

I was talking to giant.

fickle hill Nov 8, 2021, 12:31 PM

#

soz

lapis sequoia Nov 8, 2021, 1:25 PM

#

!e

arctic wedgeBOT Nov 8, 2021, 1:25 PM

#

Command Help

!eval [code]
Can also use: e

*Run Python code and get the results.

This command supports multiple lines of code, including code wrapped inside a formatted code block. Code can be re-evaluated by editing the original message within 10 seconds and clicking the reaction that subsequently appears.

We've done our best to make this sandboxed, but do let us know if you manage to find an issue with it!*

lapis sequoia Nov 8, 2021, 1:26 PM

#

!e

print("black" * 999)````

arctic wedgeBOT Nov 8, 2021, 1:26 PM

#

@lapis sequoia :white_check_mark: Your eval job has completed with return code 0.

blackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblackblack
... (truncated - too long)

Full output: https://paste.pythondiscord.com/sogejuhete.txt?noredirect

lapis sequoia Nov 8, 2021, 1:26 PM

#

sience

#

!e black = "black" * 999
print(black.lenght)

arctic wedgeBOT Nov 8, 2021, 1:27 PM

#

@lapis sequoia :x: Your eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 2, in <module>
003 | AttributeError: 'str' object has no attribute 'lenght'

prisma jay Nov 8, 2021, 2:07 PM

#

i have a big functions that process pandas dataframes. it run incredibly slow. Can i speed up the process by replacing pandas dataframes with pyspark dataframes? will it give significant impact to the performance?

misty flint Nov 8, 2021, 3:04 PM

#

if theres anyway you can improve the functions, thatd be ideal. if not, test using pyspark. in theory, it should be much faster but it can depend on the function itself

wide meadow Nov 8, 2021, 3:36 PM

#

Is L2 norm preferred over L1 norm for regularising the weights in neural networks?

bold dove Nov 8, 2021, 3:41 PM

#

Hi everyone.
I'm trying to build a module that for a given text, it returns a "tag" that defines what does the text talks about.

Not sure if this is data science, ML, or any other topic, I'm new but would like to learn...
If someone has done something similar, I'd appreciate if you tell me how to get started (docs/topics/etc).

My current approach is using regex to identify words related to the tag, but it looks to me that this isn't the most accurate way to do this task..

odd meteor Nov 8, 2021, 4:29 PM

#

bold dove Hi everyone. I'm trying to build a module that for a given text, it returns a "t...

I'm currently working on something that's kinda related to this in Natural Language Understanding (I'm learning NLP).

This could be related to NER, more specifically; extraction of intent and entity from a message.

You might wanna check Rasa NLU model and other related model for entity extraction from a text.

bold dove Nov 8, 2021, 4:45 PM

#

odd meteor I'm currently working on something that's kinda related to this in Natural Langu...

sounds good. Looking at it. Thanks

past plover Nov 8, 2021, 4:47 PM

#

Hey, so I'm really interested in ML and wanted to run this basic NN and do some fidgeting around on what effect different hyper parameters have using the MISNT dataset but I just cant get to run the neural network, it was build for python 2.7 and I'm also using that as my interpreter in pycharm, but it still spits out an error I don't understand (basically have no experience whatsoever, other than the raw theory)

#

I think I got the wrong/no numpy version or sth

empty flame Nov 8, 2021, 5:12 PM

#

I am trying to do basic image processing. I have a 8k image when I try to open tha image using tkinter canvas I can only see a part of the image( image is adjusted to my 2k screen) Is there a way to see the full 8k image without resizing it. If I resize the image there is huge drop in quality. Is there any other way?

shut trail Nov 8, 2021, 5:21 PM

#

Fitting an 8k down to two is resizing it... Are you just trying to look at another part of the image?

#

Trying to pan and zoom out I guess?

green wasp Nov 8, 2021, 5:44 PM

#

aquaSalute hello hello! Can someone guide me in the right direction for making an OCR?

hexed wind Nov 8, 2021, 6:06 PM

#

Hello, I have few python scripts which contain some etl functions. I want to schedule them to run one by one in an ec2 once in a week. What's the best way to do it to have logs of all scripts ?

shut trail Nov 8, 2021, 6:21 PM

#

what part of your question is about python data science and ai ?

#

sorry a little unclear maybe 🙂

#

you want to use python to schedule running python scripts. if so thats easy enough

hexed wind Nov 8, 2021, 6:26 PM

#

sorry if I have posted in wrong channel, I want to schedule several scripts and get a log stored in one file

shut trail Nov 8, 2021, 6:28 PM

#

ya, wrong channel for sure lol but ummm where are you stuck ? is the instance always running? are you using an api?

serene scaffold Nov 8, 2021, 7:04 PM

#

shut trail ya, wrong channel for sure lol but ummm where are you stuck ? is the instance al...

Please don't answer questions that don't relate to the topic of the channel.

#

Instead, please direct the question to the appropriate channel.

shut trail Nov 8, 2021, 7:11 PM

#

which is?

serene scaffold Nov 8, 2021, 7:22 PM

#

shut trail which is?

I don't understand the question, so I can't say, I'm just going by your assessment that this is the "wrong channel for sure". They can also claim a channel per the instructions in #❓｜how-to-get-help

shut trail Nov 8, 2021, 7:34 PM

#

i like how organized this server is. breath of fresh air

gritty obsidian Nov 8, 2021, 7:40 PM

#

what do people think of anaconda over vs?

serene scaffold Nov 8, 2021, 7:40 PM

#

shut trail i like how organized this server is. breath of fresh air

I'm glad that you see this as a positive. Thanks!

serene scaffold Nov 8, 2021, 7:40 PM

#

gritty obsidian what do people think of anaconda over vs?

vs, like venv?

gritty obsidian Nov 8, 2021, 7:40 PM

#

yea

serene scaffold Nov 8, 2021, 7:41 PM

#

gritty obsidian yea

I find venvs more straightforward, but anaconda supports non-python dependencies, or something

gritty obsidian Nov 8, 2021, 7:41 PM

#

tensorflow recommend creating venv

serene scaffold Nov 8, 2021, 7:41 PM

#

I would use venv unless you're in a situation where you're sure you need to use anaconda

gritty obsidian Nov 8, 2021, 7:41 PM

#

can i do venv in vsc

serene scaffold Nov 8, 2021, 7:42 PM

#

gritty obsidian can i do venv in vsc

vsc is just the code editor. the venv is where the python executable is.

gritty obsidian Nov 8, 2021, 7:42 PM

#

can you help me understand pls

#

so

#

its like a seperate python installation away from your main one

misty flint Nov 8, 2021, 7:45 PM

#

wide meadow Is L2 norm preferred over L1 norm for regularising the weights in neural network...

usually but i believe scikit-learn also has a 'mixed' option where you can adjust how much you want to apply of either L1 and L2

tranquil folio Nov 8, 2021, 7:46 PM

#

gritty obsidian its like a seperate python installation away from your main one

Kinda yeah. Just run python -m venv venv in your project folder, then reopen the folder in VSCode and it should automatically detect venv as your interpreter. You can start using pip to install packages and it will use that venv folder instead of you main Python installation

gritty obsidian Nov 8, 2021, 7:47 PM

#

ah

deft harbor Nov 8, 2021, 8:37 PM

#

gritty obsidian what do people think of anaconda over vs?

Deploying models to production based off conda and jupyter notebooks is a nightmare

#

We insist on pipenv or poetry at this point

serene scaffold Nov 8, 2021, 9:39 PM

#

deft harbor Deploying models to production based off conda and jupyter notebooks is a nightm...

imo, jupyter notebooks should always be treated as disposable.

deft harbor Nov 8, 2021, 9:41 PM

#

There seems to be a gap between people new to DS or ML and the rest of the environment. It would be nice if they explored in a notebook, but then where still capable of coding up the rest.

#

Not everyone struggles with that, but a surprisingly large number of people do

serene scaffold Nov 8, 2021, 9:50 PM

#

deft harbor Not everyone struggles with that, but a surprisingly large number of people do

It seems that data science courses that don't assume prior programming experience start people out in Jupyter, and I feel like if you start there, you don't learn modularity. But then again, people who write in languages that are lower-level than Python say that Python natives don't learn certain fundamentals. Maybe I'm like them.

heady tide Nov 8, 2021, 9:51 PM

#

I beleive that maths is much more important than programming for DS and ML

#

programming is just a tool

iron basalt Nov 8, 2021, 9:57 PM

#

heady tide I beleive that maths is much more important than programming for DS and ML

It depends what you are doing in ML. If you are the one writing something like Pytorch, you will need to not only know low level programming, but the very hard class of programming that is parallel programming.

#

(And GPU programming ofc)

deft harbor Nov 8, 2021, 10:38 PM

#

heady tide programming is just a tool

ds is just a tool too. 😉

heady tide Nov 8, 2021, 10:39 PM

#

but math is higher in the higherarchy

#

we use math as a tool to explain the world

#

we use python for DS and ML as a tool to explain math

desert oar Nov 8, 2021, 10:42 PM

#

i agree with squiggle, it depends a lot on the kind of work that you do

misty flint Nov 8, 2021, 11:30 PM

#

omg speaking of gpus, i was retraining a model and it ran out of memory yesterday feelsbongoman

velvet thorn Nov 8, 2021, 11:41 PM

#

heady tide I beleive that maths is much more important than programming for DS and ML

and if you're a machine learning engineer?

#

there is no getting around the need for software engineering ability.

heady tide Nov 8, 2021, 11:44 PM

#

Yeah but what I am saying is that maths is much more essential for DS and ML than coding

#

coding is a tool used to describe the maths

desert oar Nov 8, 2021, 11:52 PM

#

but practically you don't need to know much math, certainly not any theory as such, in order to be "dangerous" with regression modeling and other entry-level machine learning

#

i think it's perfectly reasonable to approach math on a need-to-know basis

#

learn some techniques on an intuitive level, learn how to do it in code, then learn the math in order to deepen your undersatnding

#

that's generally the "learning loop" that i recommend for beginners: learn a technique (intuition + math at your current level of understanding), learn to use the relevant library, deepen/solidify understanding with underlying theory

#

as you get more advanced, you can take a more "math-first" approach, especially as your use cases diverge from what is readily available in libraries

#

so "more essential" at a beginner level? no, i disagree. in principle, yes, people have been doing data analysis and statistics long before the word "computer" even referred to a type of person, let alone to a type of machine

#

it's like learning music; you don't need to learn the full theory, just start learning to use your main tool/instrument and learn theory as you go

#

think of programming as equivalent to practicing technique and precision, versus learning about data analysis as equivalent to learning about interpretation and performance, versus learning calculus and linear algebra as equivalent to learning about music theory

#

you need all 3, but you need a balance of all 3 as you learn

misty flint Nov 9, 2021, 12:08 AM

#

thats a good analogy

#

pithink

desert oar Nov 9, 2021, 12:53 AM

#

yeah honestly it worked better than i thought it would 😆

main fox Nov 9, 2021, 1:53 AM

#

I had fun with the math behind k-means clustering. The math itself was easy and the intuition was clear.

#

The math behind logistic and polynomial regression was more involved.

dapper hatch Nov 9, 2021, 1:58 AM

#

Good night community !. I have this code, I need to go through the list of each user, and then do some mathematical operations


cnn = mysql.connector.connect(
    host = 'localhost',
    user= 'root',
    passwd = '123456',
    database = ''
)

cur = cnn.cursor()
cur.execute('SELECT * FROM usuarios')
datos = cur.fetchall()

for row in datos:

    lista = row[:]
    print(lista)

desert oar Nov 9, 2021, 1:58 AM

#

logistic regression makes a lot more sense when you learn generalized linear models

#

polynomial regression is just linear regression 😉

desert oar Nov 9, 2021, 1:59 AM

#

dapper hatch Good night community !. I have this code, I need to go through the list of each ...

did you have a question about this code?

serene scaffold Nov 9, 2021, 2:01 AM

#

@desert oar your analysis was excellent. I wish it was in one message so I could pin it.

dapper hatch Nov 9, 2021, 2:02 AM

#

desert oar did you have a question about this code?

Yes, how do I have to go through the list to see each user?

main fox Nov 9, 2021, 2:04 AM

#

I'm comfortable with the intuition behind logistic regression now. Initially seeing the equation 1/(1+e^-(coeffs)) and knowing it resulted in values between 0 and 1 was impressive. Like it's pretty crafty 😄

And yes, you end up doing linear regression with non-linear relationships once you transform them. But interpreting the equation wasn't too straightforward with some transformations. Sometimes you'd trade accuracy for explainability in some transformations.

dapper hatch Nov 9, 2021, 2:05 AM

#

the users change every hour and that is why I need to consult the database, to know when there is a new user

desert oar Nov 9, 2021, 2:08 AM

#

serene scaffold <@!389497659087650836> your analysis was excellent. I wish it was in one message...

maybe pin the first message? or i can copy and paste it into one messages

#

ty for the endorsement

desert oar Nov 9, 2021, 2:09 AM

#

main fox I'm comfortable with the intuition behind logistic regression now. Initially see...

i guess that's my point, once you know what a GLM is and how it works, it's no longer a crafty trick but a natural and intuitive model for binary data

desert oar Nov 9, 2021, 2:10 AM

#

dapper hatch the users change every hour and that is why I need to consult the database, to k...

so every hour you want to check the set of users in the database, and compare with the previous hour to see if any have been added? does the program keep running in that hour, or does the program run once every hour? in the 2nd case, you will have to save the list of users somewhere, maybe to a file on the computer.

#

also i am happy to help with this here, but in the future this might be a better question for #databases or even a help channel (see #❓｜how-to-get-help)

dapper hatch Nov 9, 2021, 2:17 AM

#

desert oar so every hour you want to check the set of users in the database, and compare wi...

users table 10, after an hour users table 15, I have to save that data somewhere, and then perform various operations. For example: the user with id 4220, I need to know how many accounts he has, now I know the position, but I would have to handle it dynamically, since it changes. Is it better understood?

arctic wedgeBOT Nov 9, 2021, 2:17 AM

#

Rules

4. Use English to the best of your ability. Be polite if someone speaks English imperfectly.

median idol Nov 9, 2021, 2:24 AM

#

Guys, could someone help me why seaborn throws me that error

#

ValueError: Could not interpret value `Open` for parameter `y

#

plt.show()```

#

Here is my code

#

#

First I had problem with x-axis, but I fixed using .index , but right now I dont really know what to do

median idol Nov 9, 2021, 2:28 AM

#

median idol ```sns.lineplot(x= df.index ,y='Open',date=df) plt.show()```

And it is the same for every column if i try to put 'High' or 'Low into y, the same will occur :c

desert oar Nov 9, 2021, 2:34 AM

#

dapper hatch users table 10, after an hour users table 15, I have to save that data somewhere...

not entirely, sorry. maybe a more complete example might help

raven jackal Nov 9, 2021, 3:27 AM

#

im not sure why my code isnt working

#

#

i dont really understand the help it is given me. Any help please😅

desert oar Nov 9, 2021, 3:40 AM

#

raven jackal

!code can you please post your code as text, not a screenshot? read carefully below 👇

arctic wedgeBOT Nov 9, 2021, 3:40 AM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

desert oar Nov 9, 2021, 3:41 AM

#

!paste also include the complete error output. if the output is very long, use our "paste site". again, read carefully below for instructions:

arctic wedgeBOT Nov 9, 2021, 3:41 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the floppy disk icon in the top right, or by typing ctrl + S. After doing that, the URL should change. Copy the URL and post it here so others can see it.

raven jackal Nov 9, 2021, 3:42 AM

#

from ij import IJ

def GetPixelValue(process, width, height):
GetPix = []
for x in range (0, width):
for y in range(0, width):
pixel = process.getPixel (x, y)
GetPix.append(pixel)
return GetPix

myimage = IJ.getImage()
processor = myimage.getProcessor()
w = processor = myimage.getWidth()
h = processor.getHeight()
ValueList = GetPixelValue (processor, w, h)
bins = [0] * 26

for pixel in ValuesList:
bin_index = int(pixel/10)
bins[bin_index] += 1

print(bins)

#

Traceback (most recent call last):
File "New_.py", line 14, in <module>
AttributeError: 'int' object has no attribute 'getHeight'

lapis sequoia Nov 9, 2021, 3:43 AM

#

does anyone know how to create RNNs?

#

I'm having a lot of trouble wiht mine

austere swift Nov 9, 2021, 3:44 AM

#

What issues are you having?

lapis sequoia Nov 9, 2021, 3:44 AM

#

I have two numpy arrays each in the shape of (5, 200)

#

the inputs and outpuits

#

and i want to train an RNN to predict the outputs from the inputs

#

when i watch vids on people making it, they change the shapes of the inputs and outputs

#

do you know why they do that?

austere swift Nov 9, 2021, 3:46 AM

#

change the shape to what?

lapis sequoia Nov 9, 2021, 3:46 AM

#

some 3d shape

#

different from the original shape of the dataset

austere swift Nov 9, 2021, 3:46 AM

#

It could just be because the model expects batches (this is the same with most models anyways)

#

the first axis is interpreted as the number of samples

raven jackal Nov 9, 2021, 3:46 AM

#

is anyone here familiar with ImageJ?

lapis sequoia Nov 9, 2021, 3:46 AM

#

data = final_array_final
target = different_arrays

# normalize data by dividing every value by 100
data /= 100
target /= 100

# Make training and testing data
x_train, x_test, y_train, y_test = train_test_split(data, target, test_size=0.2, random_state=4)

# RNN Model
model = Sequential()

# add LSTM layers
# Before adding multiple layers, return_sequences was False
model.add(LSTM((1), batch_input_shape=(None, 5, 1), return_sequences=True))
model.add(LSTM((1), return_sequences=False))

model.compile(loss='mean_absolute_error', optimizer='adam', metrics=['accuracy'])
print(model.summary())

history = model.fit(x_train, y_train, epochs=100, validation_data=(x_test, y_test))

# see predictions
results = model.predict(x_test)
plt.scatter(range(20), results[:20], c='r')
plt.scatter(range(20), y_test[:20], c='g')
plt.show()

desert oar Nov 9, 2021, 3:47 AM

#

raven jackal from ij import IJ def GetPixelValue(process, width, height): GetPix = [] ...

can you please use the code block instructions i posted?

lapis sequoia Nov 9, 2021, 3:47 AM

#

this is my code

austere swift Nov 9, 2021, 3:47 AM

#

so if you have one sample, you'd reshape it to (1, 5, 200)

lapis sequoia Nov 9, 2021, 3:47 AM

#

data and target are both in the shape of (5, 200)

#

im getting an error when I run that because of the shape

#

it's in

model.add(LSTM((1), batch_input_shape=(None, 5, 1), return_sequences=True))
model.add(LSTM((1), return_sequences=False))

#

how do I fix it

austere swift Nov 9, 2021, 3:48 AM

#

lapis sequoia im getting an error when I run that because of the shape

what error specifically

lapis sequoia Nov 9, 2021, 3:48 AM

#

ValueError: Input 0 of layer sequential is incompatible with the layer: expected ndim=3, found ndim=2. Full shape received: (None, 200)

#

I thought None meant any shape

austere swift Nov 9, 2021, 3:49 AM

#

also I don't think batch_input_shape is an argument for that, you're using tf.keras.layers.LSTM right?

austere swift Nov 9, 2021, 3:50 AM

#

lapis sequoia I thought None meant any shape

-1 means any shape (in terms of reshape functions)

lapis sequoia Nov 9, 2021, 3:50 AM

#

the Sequential() im using is keras.engine.sequential

austere swift Nov 9, 2021, 3:50 AM

#

lapis sequoia the Sequential() im using is keras.engine.sequential

i mean the LSTM

lapis sequoia Nov 9, 2021, 3:50 AM

#

raven jackal Nov 9, 2021, 3:51 AM

#

@desert oar im not sure how to do the end back ticks without pressing enter😅

lapis sequoia Nov 9, 2021, 3:51 AM

#

when i do

model.add(LSTM((1), batch_input_shape=(5, 200), return_sequences=True))
model.add(LSTM((1), return_sequences=False))

I get the error
ValueError: Input 0 of layer lstm_2 is incompatible with the layer: expected ndim=3, found ndim=2. Full shape received: (None, 1)

#

the first line is the problem

desert oar Nov 9, 2021, 3:51 AM

#

raven jackal <@!389497659087650836> im not sure how to do the end back ticks without pressing...

what keyboard layout do you use?

#

you can press enter after you type the 3 ```

raven jackal Nov 9, 2021, 3:52 AM

#

https://paste.pythondiscord.com/ibokoqoquq.properties

desert oar Nov 9, 2021, 3:52 AM

#

alternatively you can press shift-enter to put a new line into your message without sending it

#

thanks for the code

raven jackal Nov 9, 2021, 3:52 AM

#

is the link alright?

desert oar Nov 9, 2021, 3:52 AM

#

yes

austere swift Nov 9, 2021, 3:52 AM

#

lapis sequoia when i do ```py model.add(LSTM((1), batch_input_shape=(5, 200), return_sequence...

use (1, 5, 200) as your shape, and reshape the inputs accordingly

desert oar Nov 9, 2021, 3:52 AM

#

it looks like you accidentally overwrite the processor variable

#

processor = myimage.getProcessor()
w = processor = myimage.getWidth()
h = processor.getHeight()

look at the 2nd line here @raven jackal

#

you replace processor with a number

#

and of course the number has no attribute getWidth(), which is what the error says

raven jackal Nov 9, 2021, 3:54 AM

#

right the second line. I am not sure what you mean by replacing it with a number though

lapis sequoia Nov 9, 2021, 3:54 AM

#

thanks

#

ok so when i run history = model.fit(x_train, y_train, epochs=100, validation_data=(x_test, y_test)) i get an error

#

ValueError: Input 0 of layer sequential_1 is incompatible with the layer: expected ndim=3, found ndim=2. Full shape received: (None, 200)

#

full code

data = final_array_final
target = different_arrays

np.reshape(data, (1, 5, 200))
np.reshape(target, (1, 5, 200))

# normalize data by dividing every value by 100
data /= 100
target /= 100

# Make training and testing data
x_train, x_test, y_train, y_test = train_test_split(data, target, test_size=0.2, random_state=4)

# RNN Model
model = Sequential()

# add LSTM layers
# Before adding multiple layers, return_sequences was False
model.add(LSTM((1), batch_input_shape=(1, 5, 200), return_sequences=True))
model.add(LSTM((1), return_sequences=False))

model.compile(loss='mean_absolute_error', optimizer='adam', metrics=['accuracy'])
print(model.summary())

history = model.fit(x_train, y_train, epochs=100, validation_data=(x_test, y_test))

# see predictions
results = model.predict(x_test)
plt.scatter(range(20), results[:20], c='r')
plt.scatter(range(20), y_test[:20], c='g')
plt.show()

raven jackal Nov 9, 2021, 3:56 AM

#

should i remove the processor part of that line?

lapis sequoia Nov 9, 2021, 3:56 AM

#

also i get an error when I try and run
results = model.predict(x_test)

#

ValueError: Input 0 of layer sequential_1 is incompatible with the layer: expected ndim=3, found ndim=2. Full shape received: (None, 200)

austere swift Nov 9, 2021, 3:57 AM

#

what is this data anyways?

#

as in, what do the different axes represent

raven jackal Nov 9, 2021, 3:57 AM

#

also is anyone familiar with fiji?

#

/imageJ

lapis sequoia Nov 9, 2021, 3:58 AM

#

austere swift as in, what do the different axes represent

the inputs are 200 numbers from -5 to 5. I normalized them so they are very small numbers

#

for the outputs, I basically generate a random number from 0-360 (for degrees). Then I add all the inputs together on top of that initial random number

#

whenever it goes above 360, it goes back to 0. Whenever it goes below 0, it goes back up to 360

austere swift Nov 9, 2021, 4:00 AM

#

lapis sequoia the inputs are 200 numbers from -5 to 5. I normalized them so they are very smal...

so theres 5 sets of these 200 numbers, and those correspond to timesteps correct?

lapis sequoia Nov 9, 2021, 4:00 AM

#

yeah

austere swift Nov 9, 2021, 4:00 AM

#

so you only have one sample?

lapis sequoia Nov 9, 2021, 4:00 AM

#

yeah i have 1 sample of 5 time steps

#

wait there are 200 time steps

#

5 are the repeats

#

repeats = 5
timesteps = 200

random.seed(1)
different_arrays = None
final_array = None
final_array_final = None

for f in range(repeats):
    starting_value = random.randint(0, 360)
    final_array = np.array([])

    print("Starting value is " + str(starting_value))

    i = 0
    while i < timesteps:
        number = random.randint(-5, 5)
        final_array = np.append(final_array, number)
        i += 1

    values = np.array([starting_value])

    for x in final_array:
        if x + starting_value > 360:
            starting_value = 0
        elif x + starting_value < 0:
            starting_value = 360
        starting_value += x
        values = np.append(values, starting_value)

    values = values[:-1]

    if different_arrays is None:
        different_arrays = np.zeros([repeats, values.__len__()])

    if final_array_final is None:
        final_array_final = np.zeros([repeats, final_array.__len__()])

    # OUTPUTS
    different_arrays[f] = values
    # INPUTS
    final_array_final[f] = final_array

#

that's the code

austere swift Nov 9, 2021, 4:02 AM

#

well the input should be in the shape (batch size, timesteps, features)

#

so you'd need to reshape that

lapis sequoia Nov 9, 2021, 4:02 AM

#

i have it in (1, 5, 200)

austere swift Nov 9, 2021, 4:03 AM

#

so if you have 200 timesteps then you should have it in (1, 200, 5)

lapis sequoia Nov 9, 2021, 4:03 AM

#

how do i know how many timesteps i have

#

are timesteps the number of rows or columns

austere swift Nov 9, 2021, 4:04 AM

#

do you know the basics of RNNs?

lapis sequoia Nov 9, 2021, 4:04 AM

#

kinda

austere swift Nov 9, 2021, 4:04 AM

#

so you know that they recursively loop on themselves, in timesteps

#

(hence the name, recurrent neural network)

lapis sequoia Nov 9, 2021, 4:05 AM

#

yeah then 200 time steps

#

5 samples

austere swift Nov 9, 2021, 4:06 AM

#

well you'd call the 5 the values, the samples is the number of total training samples you have (which is 1)

lapis sequoia Nov 9, 2021, 4:06 AM

#

oh

austere swift Nov 9, 2021, 4:06 AM

#

so your shape should be (1, 200, 5)

lapis sequoia Nov 9, 2021, 4:06 AM

#

do I need to make more training samples?

#

to make it more accurate

austere swift Nov 9, 2021, 4:06 AM

#

yes

lapis sequoia Nov 9, 2021, 4:07 AM

#

ill stick with 1 for now I'll add more later

#

so I changed the shape and still get the same error

#

ValueError: Input 0 of layer sequential_2 is incompatible with the layer: expected ndim=3, found ndim=2. Full shape received: (None, 200)

#

(1, 200, 5) is my shape

austere swift Nov 9, 2021, 4:08 AM

#

what version of tf/keras do you have btw?

lapis sequoia Nov 9, 2021, 4:09 AM

#

2.5.0 for tf

austere swift Nov 9, 2021, 4:10 AM

#

what about keras

#

or are you using tf's integrated keras?

#

(if so that's good, keep it that way)

lapis sequoia Nov 9, 2021, 4:13 AM

#

no im importing keras

#

also I restarted my progrma

#

and now when I make the training and testing data I get an error

#

ValueError: With n_samples=1, test_size=0.2 and train_size=None, the resulting train set will be empty. Adjust any of the aforementioned parameters.

#

data = final_array_final
target = different_arrays

data = np.reshape(data, (1, 200, 5))
target = np.reshape(target, (1, 200, 5))

# normalize data by dividing every value by 100
data /= 100
target /= 100

# Make training and testing data
x_train, x_test, y_train, y_test = train_test_split(data, target, test_size=0.2, random_state=4)

austere swift Nov 9, 2021, 4:22 AM

#

lapis sequoia no im importing keras

use tf.keras instead

#

your keras version is probably very old, because keras.engine.Sequential and keras.layers.recurrent dont even exist in the current version

lapis sequoia Nov 9, 2021, 4:24 AM

#

the train_test_split is still giving error

austere swift Nov 9, 2021, 4:25 AM

#

you have one sample, you can't expect to get a train and test set with 1 sample

lapis sequoia Nov 9, 2021, 4:25 AM

#

oh

#

wait

#

i ran the loop 5 times

#

which meant 5 differnet columns

#

i wanna make that into a sample

#

(200, 5)

#

i'll change the shape

austere swift Nov 9, 2021, 4:26 AM

#

then there are no timestamps

lapis sequoia Nov 9, 2021, 4:26 AM

#

(5, 200, 1)

#

aren't the timesteps 200?

#

or are they inputs and outputs

austere swift Nov 9, 2021, 4:26 AM

#

okay so you want 5 samples, 200 timestamps, 1 value per timestamp

lapis sequoia Nov 9, 2021, 4:26 AM

#

yeah

austere swift Nov 9, 2021, 4:27 AM

#

ok

#

so yeah

#

(5, 200, 1)

lapis sequoia Nov 9, 2021, 4:27 AM

#

ok the train_test_split works now

#

but im stuck at the history one now

#

history = model.fit(x_train, y_train, epochs=100, validation_data=(x_test, y_test))

#

as well as predict

austere swift Nov 9, 2021, 4:27 AM

#

did you change the lstm shape

lapis sequoia Nov 9, 2021, 4:27 AM

#

data = final_array_final
target = different_arrays

data = np.reshape(data, (5, 200, 1))
target = np.reshape(target, (5, 200, 1))

# normalize data by dividing every value by 100
data /= 100
target /= 100

# Make training and testing data
# need more samples for train test split
x_train, x_test, y_train, y_test = train_test_split(data, target, test_size=0.2, random_state=4)

# RNN Model
model = tf.keras.Sequential()

# add LSTM layers
# Before adding multiple layers, return_sequences was False
model.add(tf.keras.layers.LSTM((1), batch_input_shape=(1, 5, 200), return_sequences=True))
model.add(tf.keras.layers.LSTM((1), return_sequences=False))

model.compile(loss='mean_absolute_error', optimizer='adam', metrics=['accuracy'])
print(model.summary())

history = model.fit(x_train, y_train, epochs=100, validation_data=(x_test, y_test))
results = model.predict(x_test)

#

oh

#

ok so i changed it

#

it went Epoch 1/100

#

then it gave error
Function call stack: train_function -> train_function -> train_function

#

then when i run model.predict, it gives error predict_function -> predict_function -> predict_function

unique pulsar Nov 9, 2021, 6:18 AM

#

hello

#

Can anyone help me in #help-apple , please?

fresh kraken Nov 9, 2021, 6:50 AM

#

hi i think data scientists do not make any impactful thing and just crunch the data to get some new trick to sell people , and on the other hand machine learning engineers are more genius they actually make something that is revolutionary , and that is why i do not like data science , please tell me is this thinking correct ?

desert oar Nov 9, 2021, 7:02 AM

#

fresh kraken hi i think data scientists do not make any impactful thing and just crunch the d...

no

#

either you're trolling or you don't really understand what either field does

fresh kraken Nov 9, 2021, 7:05 AM

#

desert oar no

i ain't trolling , i am schoked myself , that why do i think that , but from my general understanding of the surroundings of the field, i get a perspective that machine learning guys work on things like alpha go protein folding self driving cars, which are so much cooler than ,people who work as data scientists, just optimising and manipulating data to find better solution for the sales and marketting team

#

pleas tell me this is not truth and why it is not, i do not want to be that idiot hater kind of guy

unique pulsar Nov 9, 2021, 7:09 AM

#

:(( excuse me, I am struggling with label image process, can someone help me? Please

main fox Nov 9, 2021, 7:18 AM

#

fresh kraken i ain't trolling , i am schoked myself , that why do i think that , but from my ...

https://www.springboard.com/blog/ai-machine-learning/machine-learning-engineer-vs-data-scientist/

TLDR; a data scientist would help find which models are appropriate and an engineer would help scale and deploy

Springboard Blog

Andrew Zola

Machine Learning Engineer vs. Data Scientist

There’s some confusion surrounding the roles machine learning engineer vs. data scientist. However, if you parse things out, the distinctions become clear.

#

Also, it would be odd to have a data engineer without them performing data science, or without a data science team. They usually work together.

rotund trellis Nov 9, 2021, 7:22 AM

#

Hi, I have to code the GLCM for feature extraction in python, it is a school project using the description of formulas given in this document.

rotund trellis Nov 9, 2021, 7:23 AM

#

rotund trellis Hi, I have to code the GLCM for feature extraction in python, it is a school pro...

This is what i have done thus far but am still not able to get it right, any assistance would be appreciated, thanks.

iron basalt Nov 9, 2021, 7:58 AM

#

fresh kraken i ain't trolling , i am schoked myself , that why do i think that , but from my ...

They are just job titles. People do amazing things, not titles.

#

*Titles also change over time. Back in the day people were employed as "Scientist".

fresh kraken Nov 9, 2021, 8:02 AM

#

iron basalt *Titles also change over time. Back in the day people were employed as "Scientis...

i see

analog lantern Nov 9, 2021, 8:23 AM

#

hello

thin cargo Nov 9, 2021, 8:33 AM

#

https://colab.research.google.com/drive/1busVNyUwUR85WSMsoIf5qZtw21Uzh5ii?usp=sharing
Help

Google Colaboratory

woeful falcon Nov 9, 2021, 10:19 AM

#

is nlp 'easy' ?

#

\👀

warped helm Nov 9, 2021, 12:05 PM

#

woeful falcon is nlp 'easy' ?

depends what you mean by easy

#

what parts of NLP are you trying to learn?

woeful falcon Nov 9, 2021, 12:07 PM

#

i was wondering if it's as easy as training a tiny model on a laptop,
with a small database of a non-english language : )

warped helm Nov 9, 2021, 12:08 PM

#

https://spacy.io/usage/training

Training Pipelines & Models

Training Pipelines & Models · spaCy Usage Documentation

Train and update components on your own data and integrate custom models

#

go for it

brazen spire Nov 9, 2021, 12:35 PM

#

RuntimeError: Given input size: (192x2x2). Calculated output size: (192x0x0). Output size is too small

#

why do i have this error?

#

Can't post the whole code

#

https://pastebin.com/WfRrxvpR

Pastebin

import torchfrom torchvision import modelsimport numpy as npimport ...

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

#

weird it works on google colab but not localy on jupyter notebook..

shut tapir Nov 9, 2021, 2:41 PM

#

Hello guys!
Has anyone tried using HuggingFace transformers model without an internet connection by using pretrained models to be loaded from the disk?

desert oar Nov 9, 2021, 2:42 PM

#

fresh kraken i ain't trolling , i am schoked myself , that why do i think that , but from my ...

ok, that's kind of funny. your perception of machine learning is more glamorous than in real life. your perception of data science is uncharitable, but more accurate than most data scientists care to admit.

yes, there are some data science teams that are glorified business analytics units attached to a marketing department. that is a result of some complicated forces in the industry, related to a lack of good-quality data in general, the general unwillingness of business owners to spend money on "research", and the enormous amount of money still to be made on digital advertising. but there is plenty of constructive data science work and plenty of very smart people doing it.

consider that data science is not any one thing, but is more a type of knowledge worker, who uses tools from statistics and machine learning to answer business problems. the distinction between "data science" and "machine learning" tends to be blurry. it's better to think of "data science" as a broad category, encompassing the traditional fields of "applied statistics" and "applied machine learning". that said, the "science" part of the name is generally a misnomer.

note also that many (most) people who work on machine learning do not build self-driving cars, and a lot of machine learning happens in support of what you might call "business data science". for example, the insurance industry is investing in machine learning for natural language processing, in order to process decades of old claim documents and enormous quantities of unstructured data like accident reports. is that "machine learning" or "data science"? arguably it's both.

if you personally find it more interesting to work on self-driving cars than processing insurance claims, i don't blame you (note: i prefer insurance claims). but don't assume that many or even most machine learning researchers are doing "genius" or "revolutionary" work.

i also generally dislike marketing people and i try to avoid working with/for them.

brazen spire Nov 9, 2021, 2:46 PM

#

Anyone ever got this problem while using cuda?

#

RuntimeError: CUDA out of memory. Tried to allocate 440.00 MiB (GPU 0; 8.00 GiB total capacity; 2.03 GiB already allocated; 4.17 GiB free; 2.24 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

#

with pytorch

shy nimbus Nov 9, 2021, 2:55 PM

#

Hello everyone, soon I will be participatung in a machine learning competition and my question is there somewhere something like a table of training algorithms for data. For example: use Decision Tree Classifier for balanced dataset multi class classification tasks? I need get the highest accuracy among all participants, I will be use GridSearchCV for tune hyperparametrs, but I want use the best learning algorithm

rigid zodiac Nov 9, 2021, 3:01 PM

#

Hi everyone, Quick question, can we use groupby and feed it to ML ?

#

like i'm trying to do df = data.groupby() and consider it as a predictors

fresh kraken Nov 9, 2021, 3:13 PM

#

desert oar ok, that's kind of funny. your perception of machine learning is more glamorous ...

the answer you gave cleared so many doubts and misconceptions that I would not have been able to clear without spending much more time and effort, I read the answer more than 2 times understanding the depth of what you are trying to say, and I realise how wrong I was towards data science and my general respect towards it is has been increased a lot now. I knew in my mind that i was wrong , but i really needed to talk to someone regarding this,Thank you very much, for taking your time and answering my question , it really means a lot

desert oar Nov 9, 2021, 3:19 PM

#

good luck out there, stay away from the marketing department 😉

devout sail Nov 9, 2021, 3:52 PM

#

Did anyone have the pleasure of trying to set up a pytorch environment using poetry?
The issue is that I want both torch and torchvision, and I need their cuda versions, so I specified the URLs in pyproject.toml, but I get the following error:

Because torchvision (0.11.1+cu113) depends on torch (1.10.0)
   and project depends on torch (1.10.0+cu113), torchvision is forbidden.
  So, because project depends on torchvision (0.11.1+cu113), version solving failed.

So it can't tell that the +cu113 version is acceptable.

desert oar Nov 9, 2021, 3:53 PM

#

devout sail Did anyone have the pleasure of trying to set up a pytorch environment using poe...

how did you specify the torch dependency?

devout sail Nov 9, 2021, 3:54 PM

#

with a url to download the wheel

desert oar Nov 9, 2021, 3:55 PM

#

i didn't realize +foo was a valid suffix, but it is in fact valid according to pep 440. maybe this is a bug in poetry?

desert oar Nov 9, 2021, 3:55 PM

#

devout sail with a url to download the wheel

show the pyproject file?

#

is the wheel hosted somewhere special?

devout sail Nov 9, 2021, 3:56 PM

#

Currently reading https://github.com/python-poetry/poetry/issues/2613, which doesn't bring me confidence 😅

#

torch = {url = "https://download.pytorch.org/whl/cu113/torch-1.10.0%2Bcu113-cp39-cp39-linux_x86_64.whl"}
torchvision = {url = "https://download.pytorch.org/whl/cu113/torchvision-0.11.1%2Bcu113-cp39-cp39-linux_x86_64.whl"}

It's able to resolve the versions just fine, but it seems it's unable to properly compare versions with labels

desert oar Nov 9, 2021, 3:57 PM

#

hmmm

#

https://github.com/python-poetry/poetry/issues/2613#issuecomment-853227939 this is the exact issue

#

alas

#

use conda 😛

devout sail Nov 9, 2021, 3:58 PM

#

Guess I will

#

Actually conda is not great for me because I want a docker image that's as small as I can (reasonably) make it. requirements.txt then? dogekek

#

I might use poetry and then have a separate script just for these two dependencies

lapis sequoia Nov 9, 2021, 4:13 PM

#

Hi yall! Can somebody point me to a simple python template for classifying pictures? It's just a very basic model that i need. I have a folder1 with a lot of 128x128 pixel pictures and a folder2 with the same pictures but they have been manipulated on pixel-level. I need to train a simple model to be able to classify if the picture has been manipulated or not.

#

is there maybe some public simple jupyter notebook or keras model that comes to your mind when reading my problem? I am not that deep into machine learning so if anybody could point me to some easy and quick to implement resource that would be great!

wheat ice Nov 9, 2021, 4:18 PM

#

interesting (i think) problem if someone wants to take a peek 👀 #help-pancakes 🥞

formal lava Nov 9, 2021, 4:37 PM

#

What guide is decent and is .net decent or should I setup smth else

rigid zodiac Nov 9, 2021, 5:19 PM

#

Hi everyone , I have a few question:

what is this type of dataframe
how can I turn my normal data into that

lapis sequoia Nov 9, 2021, 5:47 PM

#

@rigid zodiac it's a list containing lists which contain lists ...

#

you can take your data and do something like this:

list1=[]
for data_value in dataset_row:
  list1.append(data_value)```

#

and then append the list1 together with more lists to a gigalist 🙂

rigid zodiac Nov 9, 2021, 5:53 PM

#

lapis sequoia <@!380930360407621646> it's a list containing lists which contain lists ...

Thank you, can i use it for ML and DL

idle root Nov 9, 2021, 6:15 PM

#

hey guys

#

how do i make the line continous? the dots and line are random from the normal distribution

#

and i want the line to be continous through the entire plot given 2 dots

fierce patio Nov 9, 2021, 7:51 PM

#

HI

#

in ML if i have R-squar close to 1 it s that mean that i have a problm in my model ?plz it s my first ML project

rigid zodiac Nov 9, 2021, 7:54 PM

#

fierce patio in ML if i have R-squar close to 1 it s that mean that i have a problm in my mo...

it really doesnt mean anything... for short, you will need to validation it with the data to determine which model will be better

#

R-squared determine the fit, sometime in stats if you have a near perfect fit (i.e. 99%) but when you check with validation data, it look like crap..... i.e. you just committed over fitting

brazen spire Nov 9, 2021, 7:56 PM

#

Why does pytorch use only 2gb of my GPU?

hearty briar Nov 9, 2021, 7:58 PM

#

brazen spire Why does pytorch use only 2gb of my GPU?

that's a hard question to answer without more information

fierce patio Nov 9, 2021, 8:02 PM

#

rigid zodiac it really doesnt mean anything... for short, you will need to validation it with...

then what s the solution for avoid the over fitting ,thanks u for ur help

rigid zodiac Nov 9, 2021, 8:08 PM

#

fierce patio then what s the solution for avoid the over fitting ,thanks u for ur help

When you split the data set into train test right? make an extra one for validation. This could be in the following ratio (0.8 - 0.1 - 0.1)

#

once you complete your ML or DL, run it one more time with the validation set

#

Question: Anyone know how to combine npy data file? I have like about 300k npy file and want to combine it into 1 big NPY

fierce patio Nov 9, 2021, 8:28 PM

#

rigid zodiac When you split the data set into train test right? make an extra one for validat...

thanks u

ashen umbra Nov 9, 2021, 8:28 PM

#

Hi guys how do I learn python fir data science?

rigid zodiac Nov 9, 2021, 8:29 PM

#

ashen umbra Hi guys how do I learn python fir data science?

coursera

ashen umbra Nov 9, 2021, 8:30 PM

#

rigid zodiac coursera

Oh any course recommendations?

rigid zodiac Nov 9, 2021, 8:30 PM

#

Machine learning python or IBM machine learning

ashen umbra Nov 9, 2021, 8:31 PM

#

Thanks!

austere swift Nov 9, 2021, 9:01 PM

#

brazen spire RuntimeError: CUDA out of memory. Tried to allocate 440.00 MiB (GPU 0; 8.00 GiB ...

this means your gpu ran out of memory, you can fix it by decreasing the amount of data you're putting on the gpu, such as lowering your batch size, lowering the size of your model, or lowering the size of your samples (i.e in image classification you can decrease the resolution)

#

the only other way to do it would be to get a new gpu with more memory

#

also check if it runs out of memory during the phase where it sends the model to gpu or when it sends the samples to gpu, that'll tell you whether to shrink your model or your data

dawn sapphire Nov 9, 2021, 9:07 PM

#

hi

#

Where do I go to get help?

rigid zodiac Nov 9, 2021, 9:07 PM

#

dawn sapphire Where do I go to get help?

here just ask question.

brazen spire Nov 9, 2021, 9:08 PM

#

austere swift this means your gpu ran out of memory, you can fix it by decreasing the amount o...

fixed thanks

rigid zodiac Nov 9, 2021, 9:08 PM

#

https://tenor.com/view/sometimes-maybe-good-sometimes-maybe-bad-sometimes-maybe-shit-not-good-feeling-shit-gif-16938525

Tenor

brazen spire Nov 9, 2021, 9:08 PM

#

but i can't get the program to recognise handwriting numbers

#

does the preprocessing affect that?

austere swift Nov 9, 2021, 9:08 PM

#

you're doing mnist?

brazen spire Nov 9, 2021, 9:09 PM

#

Yes

#

input_size = 150
preprocess_train = transforms.Compose([
transforms.Grayscale(num_output_channels=3),
transforms.Resize(input_size),

transforms.RandomResizedCrop(input_size),

transforms.RandomHorizontalFlip(),

transforms.ToTensor(),
transforms.Normalize(
mean=[0.485, 0.456, 0.406],
std=[0.229, 0.224, 0.225]
)])

preprocess_test = transforms.Compose([
transforms.Grayscale(num_output_channels=3),
transforms.Resize(input_size),
transforms.ToTensor(),
transforms.Normalize(
mean=[0.485, 0.456, 0.406],
std=[0.229, 0.224, 0.225]
)])

from torchvision import datasets
mnist_train = datasets.MNIST('./MNIST/', train=True, download=True,
transform=preprocess_train)
mnist_test = datasets.MNIST('./MNIST/', train=False, download=True,
transform=preprocess_test)

#

i'm using a google model

austere swift Nov 9, 2021, 9:09 PM

#

mnist is only 64x64 btw

#

so you'd just be upscaling it and increasing your data size, it wont do anything to help it

#

what model are you using?

brazen spire Nov 9, 2021, 9:12 PM

#

Alexnet

dawn sapphire Nov 9, 2021, 9:12 PM

#

I am falling behind in class. Write a program that implements and tests an AVL tree delete(key) method. This method accepts a key to remove. If the key is in the tree, the method finds the node with the key, removes the node, re-balances the tree, and returns True. If the key is not in the tree, False is returned.

Similar to the Binary Search Tree implementation, the delete(key) method would need a remove(node) method to remove the node containing the key. This method would be similar in implementation to BST's remove(node) method except that it would also need to ensure the tree is balanced (similar to _put() method).

So, to support the re-balancing operation, implement update_balance_delete(node) method that would recursively update the balance factor of the parent node and call rebalance(node) method if needed.

Finally, make sure update_balance_delete(node) method is called from within remove(node) method during node removal operation. I just tneed to do this.

brazen spire Nov 9, 2021, 9:14 PM

#

I thought AlexNet was expecting 256x256

austere swift Nov 9, 2021, 9:14 PM

#

alexnet is convolutional, it wont matter what input size you give it

brazen spire Nov 9, 2021, 9:14 PM

#

maybe i shouldn't be using mnist at all

#

ah

austere swift Nov 9, 2021, 9:14 PM

#

and anyways, alexnet is very overcomplicated for mnist

#

you can train on mnist with a purely dense nn

dawn sapphire Nov 9, 2021, 9:16 PM

#

It is a really long code and I need some voice help if possible

brazen spire Nov 9, 2021, 9:16 PM

#

i'll try that.

rigid zodiac Nov 9, 2021, 9:16 PM

#

dawn sapphire It is a really long code and I need some voice help if possible

you gotta ask a specific question about your code... We here to help you to figure out not to do your homework lol

dawn sapphire Nov 9, 2021, 9:17 PM

#

I know. I just need to do some one on one with questions

austere swift Nov 9, 2021, 9:18 PM

#

you can go ahead and ask your questions here

dawn sapphire Nov 9, 2021, 9:20 PM

#

I trying to figure out where and I implementing my update_balance_delete(node) for AVL tree

#

#

I have a update_balance coded in my AVL Tree Class. I am trying to makes sense for each item that they ask for? So don't a dick and say stupid crap when I am trying to figure stuff out rather look online for a answer.

hearty briar Nov 9, 2021, 9:34 PM

#

dawn sapphire I have a update_balance coded in my AVL Tree Class. I am trying to makes sense ...

???

desert oar Nov 9, 2021, 9:51 PM

#

dawn sapphire I have a update_balance coded in my AVL Tree Class. I am trying to makes sense ...

this belongs in #algos-and-data-structs. "data science" is applied statistics and machine learning

#

!code also please use code formatting and not screenshots when posting code. read carefully below 👇

arctic wedgeBOT Nov 9, 2021, 9:51 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

dawn sapphire Nov 9, 2021, 10:01 PM

#

@desert oarthx

brazen spire Nov 9, 2021, 11:30 PM

#

MNIST is onyl for numbers?

#

or it can recognise images as well

#

like fish or something else

#

there is only 10 classes in it, right? 0 to 9

desert oar Nov 9, 2021, 11:44 PM

#

brazen spire MNIST is onyl for numbers?

It's specifically a data set of hand written digits

#

0-9

quiet vault Nov 9, 2021, 11:52 PM

#

brazen spire like fish or something else

That would be a different dataset

median idol Nov 10, 2021, 12:10 AM

#

Guys, could anyone please advise how to deal with this type of column, as final result I want to strip url links and get the location(last word) ik how to do this, but how do i leave another rows untouched?

#

Any tips and advices are appreciated

desert oar Nov 10, 2021, 12:43 AM

#

median idol Guys, could anyone please advise how to deal with this type of column, as final ...

That's easier than your other problem

#

In general you can easily modify individual columns while leaving the others untouched

#

That's part of the point of pandas

#

I recommend using a URL parsing library like yarl to extract the url parts

#

Write a function that accepts 1 string as input. If it contains :// then parse it like a url. Otherwise handle it like a full city name. Then you can use apply in pandas to apply the function to every element of that column

#

See https://pandas.pydata.org/pandas-docs/stable/user_guide/basics.html#function-application

#

And https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#basics

median idol Nov 10, 2021, 12:51 AM

#

Oh, thanks 🙏

median idol Nov 10, 2021, 12:51 AM

#

desert oar That's easier than your other problem

Yeah, I just skipped those part with currencies and just wanted to finish this dataset with clearing this column

desert oar Nov 10, 2021, 1:55 AM

#

median idol Oh, thanks 🙏

!pypi yarl

arctic wedgeBOT Nov 10, 2021, 1:55 AM

#

yarl v1.7.2

Yet another URL library

desert oar Nov 10, 2021, 1:55 AM

#

!pypi hyperlink

arctic wedgeBOT Nov 10, 2021, 1:55 AM

#

hyperlink v21.0.0

A featureful, immutable, and correct URL for Python.

median idol Nov 10, 2021, 2:05 AM

#

@desert oar you mean def function something like that with the help of yarl? def clear_url(dataframe): for x in dataframe: if url.scheme == 'http': dataframe = dataframe.apply(lambda i: i.split("/")[-1]) else: pass

desert oar Nov 10, 2021, 2:07 AM

#

median idol <@!389497659087650836> you mean def function something like that with the help ...

def process_location(text):
    if '://' in text:
        url = yarl.URL(text)
        path_tail = url.path.split('/')[-1]
        return path_tail
    else:
        return text

df['location_clean'] = df['location'].apply(process_location)

#

process_location is a function of one element. .apply applies the function to every element individually

wise cipher Nov 10, 2021, 2:08 AM

#

When I use pandas to load an excel table, it takes the first entry as the colums of the DataFrame ... does anyone know how to fix it smartly? 🙂

desert oar Nov 10, 2021, 2:09 AM

#

wise cipher When I use pandas to load an excel table, it takes the first entry as the colums...

use the header=None option if you are using read_csv

arctic wedgeBOT Nov 10, 2021, 2:09 AM

#

Fat chance.

No documentation found for the requested symbol.

desert oar Nov 10, 2021, 2:10 AM

#

it's the same in read_excel

#

!d pandas.read_excel

arctic wedgeBOT Nov 10, 2021, 2:10 AM

#

pandas.read\_excel

pandas.read_excel(io, sheet_name=0, header=0, names=None, index_col=None, usecols=None, squeeze=False, dtype=None, engine=None, converters=None, true_values=None, ...)```
Read an Excel file into a pandas DataFrame.

Supports xls, xlsx, xlsm, xlsb, odf, ods and odt file extensions read from a local filesystem or URL. Supports an option to read a single sheet or a list of sheets.

wise cipher Nov 10, 2021, 2:12 AM

#

desert oar use the `header=None` option if you are using `read_csv`

Thank you so much, I didn't know that ❤️

median idol Nov 10, 2021, 2:15 AM

#

desert oar ```python def process_location(text): if '://' in text: url = yarl.U...

Oh, got you. Thank a lot 🙏

fresh kraken Nov 10, 2021, 4:58 AM

#

Hi there, I have prepared a survey for students, on how different students are approaching their studies.
Any participation in the survey will be much appreciated.
You can dm me your answers. Kindly mention your current education level.

1.)) Do you feel more comfortable studying from
|A)computer books,
|B) video lectures,
|C) going hands-on, practical implementation
|D) Different ways for different things?

2.)) You often google your doubts, but what do you do if you do not find the answer on google do you
|A) ask teachers,
|B) seek community help,
|C) solve yourself,
|D) Contact your friends or what else you do?

3.)) On a scale of 0 - 10, How much are you worried about the job(money) in your field? (0 = not worried at all)

4.)) Do you
|A) get inspired by startup success stories and want to set up your startup
|B) you are not interested in business and would just focus on your expertise
|C) you have not currently thought about it and may think in future
|D) you would go with wherever life takes you.

I will provide a report after sufficient data collection.
Thank you very much for answering,
and sharing your perspective

misty flint Nov 10, 2021, 5:32 AM

#

interesting

#

drop-in replacement library for NumPy, bringing distributed and accelerated computing on the NVIDIA platform to the Python community

#

ZoomEyes

lapis sequoia Nov 10, 2021, 5:37 AM

#

holy crap

desert oar Nov 10, 2021, 5:41 AM

#

misty flint > drop-in replacement library for NumPy, bringing distributed and accelerated co...

everyone is trying to plant their flag and hopefully get a slice of that sweet machine learning cash

#

see also: all these alternative python implementations coming out of the woodwork (after years of stagnation and lack of interest) at major tech companies

#

i'm still baffled at how late to the game amd has been with all this, but now that things actually run on rocm there is a glimmer of hope

#

hope that one day i might be able to run linux, play dota, and do machine learning all on the same desktop machine

fresh kraken Nov 10, 2021, 5:52 AM

#

Hey i have created this survey for students, would you like to share your opinions https://forms.gle/nHYVEikL9v7Rsn9M7

Google Docs

Student Survey

Hi there, This is a survey how different students are approaching their studies.
Any participation in the survey will be much appreciated.

magic dune Nov 10, 2021, 6:29 AM

#

misty flint interesting

magic dune Nov 10, 2021, 6:29 AM

#

misty flint interesting

send link

storm river Nov 10, 2021, 8:13 AM

#

I have a scatter plot for price vs bhk. I want to add the mean price dots in it. Please give a solution.

#

X axis = BHK
Y axis = Price

terse epoch Nov 10, 2021, 8:15 AM

#

I have a script with pytrends matplotlib and pandas for trends everything works except in the image the name of countries in x axis doesn't show up properly any help? https://media.discordapp.net/attachments/884012495658967041/907904589469212702/unknown.png

lapis sequoia Nov 10, 2021, 9:08 AM

#

Hello Party People! Quick and easy question: Is more data always the answer when trying to give a model more accuracy?

#

I posted my problem to reddit: https://www.reddit.com/r/learnmachinelearning/comments/qqpp3c/binary_image_classification_for_detecting_face/

r/learnmachinelearning - Binary Image Classification for detecting ...

0 votes and 0 comments so far on Reddit

#

maybe someone of you has an idea what the best approach is to solve this and achieve more accuracy. I guess I need to generate more data but maybe someone of you has an idea

teal mortar Nov 10, 2021, 10:32 AM

#

lapis sequoia maybe someone of you has an idea what the best approach is to solve this and ach...

Hi, have you tried using transfer learning, like use a pretrained model trained on ImageNet, get rid of the top layers and add your layers there, fine tune and maybe get a better result, try reading this: https://www.tensorflow.org/tutorials/images/transfer_learning

TensorFlow

Transfer learning and fine-tuning | TensorFlow Core

lapis sequoia Nov 10, 2021, 10:32 AM

#

thanks! I will look into it 😄

teal mortar Nov 10, 2021, 10:33 AM

#

lapis sequoia thanks! I will look into it 😄

with pleasure, hope it helps

lapis sequoia Nov 10, 2021, 10:33 AM

#

but just for the sake of understanding:

#

do i need a pretrained model that has been trained on the exact same labels that i need to train on?

#

or would it work to use a model that has been trained on faces and i train it on the original face vs. manipulated image labels?

formal lava Nov 10, 2021, 10:47 AM

#

What guide is good for ml?

teal mortar Nov 10, 2021, 11:06 AM

#

lapis sequoia or would it work to use a model that has been trained on faces and i train it on...

find a model that was pretrained on human faces

teal mortar Nov 10, 2021, 11:07 AM

#

lapis sequoia but just for the sake of understanding:

tensorflow hub has lots of pretrained models that you can use https://tfhub.dev/s?module-type=image-augmentation,image-classification,image-classification-logits,image-classifier,image-feature-vector,image-generator,image-object-detection,image-others,image-pose-detection,image-segmentation,image-style-transfer,image-super-resolution,image-rnn-agent

teal mortar Nov 10, 2021, 11:09 AM

#

formal lava What guide is good for ml?

you can start with excellent course from Andrew Ng on Machine Learning https://www.coursera.org/learn/machine-learning

Coursera

Machine Learning

Learn Machine Learning from Stanford University. Machine learning is the science of getting computers to act without being explicitly programmed. In the past decade, machine learning has given us self-driving cars, practical speech recognition, ...

#

or deep learning https://www.coursera.org/specializations/deep-learning?

Coursera

Deep Learning

Learn Deep Learning from deeplearning.ai. If you want to break into Artificial intelligence (AI), this Specialization will help you. Deep Learning is one of the most highly sought after skills in tech. We will help you become good at Deep Learning.

#

also a good resource would be the book by Francois Chollet https://www.amazon.com/Learning-Python-Second-François-Chollet/dp/1617296864

Deep Learning with Python, Second Edition

teal mortar Nov 10, 2021, 11:26 AM

#

lapis sequoia or would it work to use a model that has been trained on faces and i train it on...

https://machinelearningmastery.com/how-to-perform-face-recognition-with-vggface2-convolutional-neural-network-in-keras/

Machine Learning Mastery

How to Perform Face Recognition With VGGFace2 in Keras

Face recognition is a computer vision task of identifying and verifying a person based on a photograph of their face. […]

sudden egret Nov 10, 2021, 11:51 AM

#

formal lava What guide is good for ml?

What do you mean by guide? A whole roadmap?

#

If so try checking out this video https://www.youtube.com/watch?v=pHiMN_gy9mk

This covers the whole range of tools and things you have to learn in order to be a machine learning engineer within a year or maybe more than that, it depends upon the effort you're putting in.

Goodluck!

YouTube

Daniel Bourke

2020 Machine Learning Roadmap (still valid for 2021)

Getting into machine learning is quite the adventure. And as any adventurer knows, sometimes it can be helpful to have a compass to figure out if you're heading in the right direction.

Although the title of this video says machine learning roadmap, you should treat it as a compass. Explore it, follow your curiosity, learn something and use what...

▶ Play video

sudden egret Nov 10, 2021, 11:56 AM

#

lapis sequoia do i need a pretrained model that has been trained on the exact same labels that...

Well, you can cut out the classification head of the pretrained model and append your head. For instance, resnet is trained on imagenet dataset and its classification head has alot of labels but in reality you dont need any of them, you might need only cats or dogs.

So in this case, you append your classification head you have with the pretrained model and fine tune for more epochs, which helps the pretrained model to learn your features.

#

Check this guide @lapis sequoia https://www.tensorflow.org/tutorials/images/transfer_learning

TensorFlow

Transfer learning and fine-tuning | TensorFlow Core

misty flint Nov 10, 2021, 1:00 PM

#

this is so true; just look at all the AI tech startups also coming out of the woodwork - all that venture capital money...

formal lava Nov 10, 2021, 1:05 PM

#

Ty

formal lava Nov 10, 2021, 1:25 PM

#

Do I havr to learn linear regression, KNN and SVM to be good at ML?

desert oar Nov 10, 2021, 2:02 PM

#

formal lava Do I havr to learn linear regression, KNN and SVM to be good at ML?

definitely yes to linear regression. the others are worth knowing about and part of a natural learning progression, even if they aren't commonly used in industry anymore

formal lava Nov 10, 2021, 2:04 PM

#

desert oar definitely yes to linear regression. the others are worth knowing about and part...

Ive watched a vid and it said that KNN is still used to this day, anyway maybe u can recomend where do I start?

desert oar Nov 10, 2021, 2:06 PM

#

formal lava Ive watched a vid and it said that KNN is still used to this day, anyway maybe u...

sure, it's still useful for some things. like i said, it's worth knowing about and part of a natural learning progression.

#

i don't know of any good general recommendations, check the pinned messages

#

if you have any specific interests or desires, maybe i can help with those

formal lava Nov 10, 2021, 2:08 PM

#

desert oar if you have any specific interests or desires, maybe i can help with those

Well my somewhat of a main goal is creating "bot" that could learn from the actions it does

desert oar Nov 10, 2021, 2:11 PM

#

formal lava Well my somewhat of a main goal is creating "bot" that could learn from the acti...

i see, then you might want to look into "reinforcement learning"

formal lava Nov 10, 2021, 2:14 PM

#

desert oar i see, then you might want to look into "reinforcement learning"

so i start with regression and go into reinforcment learning? and isnt q and reinforcment learning the same?

desert oar Nov 10, 2021, 2:16 PM

#

formal lava so i start with regression and go into reinforcment learning? and isnt q and rei...

that's a big jump, but you can try to start with some reinforcement learning material and see if it makes sense. you might have to go back and learn more things along the way. i believe q-learning is a specific algorithm for reinforcement learning. reinforcement learning itself is a general category of machine learning tasks.

formal lava Nov 10, 2021, 2:19 PM

#

desert oar that's a big jump, but you can try to start with some reinforcement learning mat...

okay, Ill do more research but should I start in .net (I hope I understand this correctly)?

desert oar Nov 10, 2021, 2:20 PM

#

formal lava okay, Ill do more research but should I start in .net (I hope I understand this ...

python is a lot more popular than .NET for machine learning, i recommend starting with python

#

unless you mean something else by ".net" other than the microsoft .NET language ecosystem (C#, F#)

formal lava Nov 10, 2021, 2:22 PM

#

desert oar unless you mean something else by ".net" other than the microsoft .NET language ...

Nop u understood it correctly. So now I go into yt, watch some guides and just start doing something (I use VSC)

serene scaffold Nov 10, 2021, 2:23 PM

#

(I use VSC btw)
FTFY

desert oar Nov 10, 2021, 2:23 PM

#

formal lava Nop u understood it correctly. So now I go into yt, watch some guides and just s...

i recommend also reading a book or following an online course. individual videos and small guides will not provide the same depth of education

#

again, check our pinned messages

formal lava Nov 10, 2021, 2:23 PM

#

desert oar i recommend also reading a book or following an online course. individual videos...

okay thanks

formal lava Nov 10, 2021, 2:23 PM

#

serene scaffold > (I use VSC btw) FTFY

whut?

serene scaffold Nov 10, 2021, 2:24 PM

#

formal lava whut?

it's a meme

formal lava Nov 10, 2021, 2:24 PM

#

oh

formal lava Nov 10, 2021, 2:36 PM

#

desert oar i recommend also reading a book or following an online course. individual videos...

do I need to install a new ide or smth?

#

and how do u define small guide like 3 hours?

desert oar Nov 10, 2021, 2:41 PM

#

there is nothing wrong with vs code

formal lava Nov 10, 2021, 2:46 PM

#

ok

misty flint Nov 10, 2021, 3:35 PM

#

vsc actually released a browser version

#

vscode.dev

#

ill probs still use my pycharm tho

#

logo_pycharm

formal lava Nov 10, 2021, 3:50 PM

#

desert oar i recommend also reading a book or following an online course. individual videos...

I want to die, is that common for beginners?

desert oar Nov 10, 2021, 3:51 PM

#

formal lava I want to die, is that common for beginners?

it probably happens to everyone once or twice

formal lava Nov 10, 2021, 3:51 PM

#

desert oar it probably happens to everyone once or twice

but does it happen when staring RL?

desert oar Nov 10, 2021, 3:52 PM

#

if you have that feeling when first starting something, maybe you are taking on too much at once

ashen umbra Nov 10, 2021, 3:53 PM

#

formal lava I want to die, is that common for beginners?

I relate to this.. also a beginner but I guess having a learning partner can help u stay on track

formal lava Nov 10, 2021, 3:53 PM

#

ashen umbra I relate to this.. also a beginner but I guess having a learning partner can hel...

true, then u can talk about your progression

ashen umbra Nov 10, 2021, 3:54 PM

#

Yep exactly. What are u currently working on. Always open to collab..

#

I am relearning python

formal lava Nov 10, 2021, 3:54 PM

#

ashen umbra Yep exactly. What are u currently working on. Always open to collab..

Im only learning all of the theory and "pseudo code" for RL and rn I want to die like no cap

ashen umbra Nov 10, 2021, 3:55 PM

#

ashen umbra I am relearning python

Taking some coursera courses..

formal lava Nov 10, 2021, 3:55 PM

#

why would u do it?

ashen umbra Nov 10, 2021, 3:56 PM

#

formal lava Im only learning all of the theory and "pseudo code" for RL and rn I want to die...

What is RL..? Also ya I guess the best way of learning is by doing it. Esp when comes to coding

formal lava Nov 10, 2021, 3:57 PM

#

ashen umbra What is RL..? Also ya I guess the best way of learning is by doing it. Esp when ...

you know this is for data science and ai right? RL is reinforcement learning

ashen umbra Nov 10, 2021, 4:02 PM

#

Ya I k. But trying to self learn smth new. Esp alone can get hard for anyone regardless which intersection of AI we are talking about

formal lava Nov 10, 2021, 4:02 PM

#

u alive? xd

ashen umbra Nov 10, 2021, 4:02 PM

#

I am multitasking lol

#

But I am sure there are plenty courses online that can provide u a good sense of direction

#

For RL

formal lava Nov 10, 2021, 4:08 PM

#

well there is too much to learn for RL

#

idk if I want to study that

brisk moth Nov 10, 2021, 4:17 PM

#

anyone have a few xeons and v100s i can borrow lmao i have a dataset of 773565 jpgs i need to use

#

my macbook pro is crying

formal lava Nov 10, 2021, 4:26 PM

#

anyone wanna try and work on a project and learn about RL at the same time?

rigid zodiac Nov 10, 2021, 4:56 PM

#

Hi everyone, Have you every use multiple npy file to train the model? If you have can you please teach me

brisk moth Nov 10, 2021, 4:57 PM

#

just build a chess bot

#

good way to do RL

#

even a simple one

formal lava Nov 10, 2021, 5:02 PM

#

who u talkin to btw?

desert oar Nov 10, 2021, 5:02 PM

#

rigid zodiac Hi everyone, Have you every use multiple npy file to train the model? If you hav...

i don't want to be rude here, but i think you have enough experience by now to be able to approach a problem like this without asking a "help me pls" type of question. you need to practice working through problems on your own and developing your own solutions, instead of just asking for help online.

rigid zodiac Nov 10, 2021, 5:11 PM

#

desert oar i don't want to be rude here, but i think you have enough experience by now to b...

I honestly dont know how to. Because traditionally we just use the cvs file. If I know how to I would not waste your time

desert oar Nov 10, 2021, 5:12 PM

#

rigid zodiac I honestly dont know how to. Because traditionally we just use the cvs file. If ...

but you don't put a csv file into a magic box and wait for results, right? you write some code to load the data into an array or dataframe, then you write more code to process the data, then you write more code that defines the model, etc. etc.

#

you already know how to do all of those things

#

so don't act like you don't know!

#

maybe your question is "how do i concatenate two numpy arrays" or maybe your question is "what are some established techniques for modeling >2-dimensional data"

#

but "how do i ml with 2 csv" is not an answerable question, and moreover you already know enough to get past that point and start formulating more specific questions

rigid zodiac Nov 10, 2021, 5:13 PM

#

desert oar but you don't put a csv file into a magic box and wait for results, right? you w...

For 1 big cvs file, yes i do know. In this case i have multiple small npy file in the folders. I have split it into train test val, i just dont know how to feed it into the ML DL

desert oar Nov 10, 2021, 5:14 PM

#

ok, that's a more specific question. are you asking specifically about loading data into a framework like pytorch or keras?

#

how do you normally do it with 1 csv file?

#

(note it is "CSV" not "CVS")

#

("Comma Separated Values")

#

("CVS" is a huge drug store chain in the USA)

hearty briar Nov 10, 2021, 5:15 PM

#

oh I see what you're getting at

#

withdrawn

rigid zodiac Nov 10, 2021, 5:23 PM

#

desert oar ok, that's a more specific question. are you asking specifically about loading d...

Man what is up with you today, also i have no idea about pytorch. For csv file i just split it to train test val. But this time i use npy

forest ledge Nov 10, 2021, 5:45 PM

#

Hello everyone, I have a python code that when I run it, it opens the camera and detects a face, and outputs the age and the gender, I also have a flutter app, my question is that, how can I integrate these two together?
like, the app has a button with a camera picture on it, and when the user clicks on the camera button, the phone camera should open and the python code should start running to detect the face and outputs the age and gender.

analog lantern Nov 10, 2021, 5:56 PM

#

Hello fellas!! I am working on a project that is sign language recognition using image processing and cnn and i'm stuck with some errors. If Any one can show up for help, DM me please .

formal lava Nov 10, 2021, 6:36 PM

#

How do I start writting RL code? I've watched like 1 hour of theory but have no clue where to start?

terse epoch Nov 10, 2021, 6:37 PM

#

terse epoch I have a script with pytrends matplotlib and pandas for trends everything works ...

Can someone help me with this? I've sent this message for so long

teal mortar Nov 10, 2021, 7:00 PM

#

terse epoch Can someone help me with this? I've sent this message for so long

try plt.xticks(rotation=70) or 90, if I understood correctly what you want to do

hearty briar Nov 10, 2021, 7:03 PM

#

rigid zodiac Man what is up with you today, also i have no idea about pytorch. For csv file i...

I think what he's saying is that the way you do that is the same with npy as it would be with csv

#

it's a workflow it's not so much about the format the data is in now

#

because you are going to be arranging and sanitizing the data do go in the pipeline

terse epoch Nov 10, 2021, 7:05 PM

#

teal mortar try plt.xticks(rotation=70) or 90, if I understood correctly what you want to do

I don't want to rotate it just if there's a way to make the bar graph move up or show all country names

hearty briar Nov 10, 2021, 7:07 PM

#

the rotation argument they said rotates the labels so they aren't 180 degrees vertically

#

not the whole chart

teal mortar Nov 10, 2021, 7:08 PM

#

terse epoch I don't want to rotate it just if there's a way to make the bar graph move up or...

https://matplotlib.org/2.0.2/examples/pylab_examples/subplots_adjust.html

hearty briar Nov 10, 2021, 7:08 PM

#

if the labels are too long and rotation doesn't help

#

you might try switching to horizontal bar

#

which would provide more space for your labels

teal mortar Nov 10, 2021, 7:09 PM

#

try plt.subplots_adjust(bottom=0.2) or bigger

rigid zodiac Nov 10, 2021, 7:09 PM

#

hearty briar I think what he's saying is that the way you do that is the same with npy as it ...

Thank you so much. But I have a doubt, since I never work with multiple files i dont know how to code it tbh

hearty briar Nov 10, 2021, 7:09 PM

#

in my experience this isn't a code thing this is more a arranging data to work in your pipeline thing

terse epoch Nov 10, 2021, 7:10 PM

#

Yeah one sec lemme try that

terse epoch Nov 10, 2021, 7:14 PM

#

teal mortar try plt.subplots_adjust(bottom=0.2) or bigger

Thanks that works

teal mortar Nov 10, 2021, 7:14 PM

#

terse epoch Thanks that works

you are welcome

sick burrow Nov 11, 2021, 1:29 AM

#

not sure if this is the place to ask.. but i've just installed modin using conda on a fresh ubuntu system and it gives me the ModuleNotFoundError: No module named 'modin' error when I try to import it..

#

there seems to be a total lack of information on how modin works with conda. the sum total is conda install -c conda-forge modin but that hasn't worked for me

placid tendon Nov 11, 2021, 2:09 AM

#

im usingthe code in https://pythonprogramming.net/reinforcement-learning-self-driving-autonomous-cars-carla-python/ to try out tensorflow on my amd gpu and i'm getting the error

Python Programming Tutorials

Python Programming tutorials from beginner to advanced on a massive variety of topics. All video and text tutorials are free.

charred umbra Nov 11, 2021, 2:51 AM

#

I am trying to make an algorithm to track this time series data. What would be a good fit?

main fox Nov 11, 2021, 5:23 AM

#

charred umbra I am trying to make an algorithm to track this time series data. What would be a...

Look into stats.model decomposition so that you may separate the trend and noise. After that, perhaps an ARIMA model.

night gorge Nov 11, 2021, 5:59 AM

#

what is the easiest method to produce a stacked bar graph. I don't want it to overlay, BUT on top of each other..
eg: first 20-19 then 30-39 on top of it

main fox Nov 11, 2021, 6:49 AM

#

night gorge what is the easiest method to produce a stacked bar graph. I don't want it to ov...

https://www.python-graph-gallery.com/stacked-and-percent-stacked-barplot

Is this what you're looking to do?

The Python Graph Gallery

Sctacked and Percent Stacked Barplot using Seaborn | The Python Gra...

How to plot a stacked and percent sctacked barplot using seaborn

night gorge Nov 11, 2021, 6:57 AM

#

main fox https://www.python-graph-gallery.com/stacked-and-percent-stacked-barplot Is thi...

thanks

somber prism Nov 11, 2021, 7:10 AM

#

can someone help me compute backpropagation ```
def sigmoid(z, prime = False):
if prime:
return sigmoid(z) * (1 - sigmoid(z))
return 1 / (1 + np.exp(-z))

def relu(z):
return np.maximum(0, z)

class Network(BaseEstimator):
def init(learning_rate = 0.01, epoches = 30, activations = [], layers = []):
self.learning_rate = learning_rate
self.layers = layers
self.n_layers = len(layers) - 1
self.activations = activations
self.epoches = epoches
self.weights, self.biases = self.initialize_parameters()
self.cache = self.initialize_cache()
self.costs = []

def initialize_parameters(self):
    w = [
        np.random.randn(next_layer, current_layer) for current_layer, next_layer in zip(layers[:-1], layers[1:]) * self.learning_rate
    ]
    b = [
        np.zeros((next_layer, 1)) for next_layer in layers[1:]
    ]
    
    return w, b

def initialize_cache(self):
    c = {}
    for i in range(len(self.weights)):
        c[f'z{i + 1}']
        c[f'activation{i + 1}'] = None
        c[f'w{i + 1}']  = None
        c[f'b{i + 1}']  = None
        
        c[f'dw{i + 1}']  = None
        c[f'db{i + 1}']  = None
        
    return c

def activate(self, z, activation):
    if activation == 'sigmoid':
        return sigmoid(z)
    elif activation == 'relu':
        return relu(z)

def forward(self, x):
    a_prev = x
    for i, params in enumerate(zip(self.weights, self.biases)):
        w, b = params 
        z = self.dot(w, a_prev) + b
        self.cache[f'z{i + 1}'] = z
        self.cache[f'activation{i + 1}'] = a_prev
        a_prev = activate(z, activation[i])
    
    self.cache[f'aL'] = a_prev
    return self.cache[f'aL']

def backward(self):
    pass

pure gull Nov 11, 2021, 7:46 AM

#

Hi, I am training a yolo model on pytorch and getting 15 GFLOPS of reported performance. Theoretical max for my GPU is reportedly over 2 TFLOPS - any idea what's with the massive performance difference? What's my bottleneck and how do I find out?

pure gull Nov 11, 2021, 7:55 AM

#

pure gull Hi, I am training a yolo model on pytorch and getting 15 GFLOPS of reported perf...

it's the smalles yolo model, does that influence reported performance?

fleet prism Nov 11, 2021, 8:43 AM

#

My algo works fine but these errors pop up in the terminal. Any idea how to resolve these? And will performance improve if resolved? In what way will it improve?

pure gull Nov 11, 2021, 9:00 AM

#

fleet prism My algo works fine but these errors pop up in the terminal. Any idea how to reso...

probably your algo cannot access the GPU due to missing libraries. The performance difference depends - is your CPU fast/slow? Is your GPU fast/slow?

#

Do you have an nvidia GPU? Which you need for CUDA

#

(cufft = CUDA FFT, etc)

fleet prism Nov 11, 2021, 9:21 AM

#

pure gull probably your algo cannot access the GPU due to missing libraries. The performan...

I have the latest AMD. So only processing time gets affected and no change to overall output? I mean the image won't look different with these dependencies set right?

pure gull Nov 11, 2021, 9:37 AM

#

CUDA only works for nvidia - AMD graphics cards are thus not usable with CUDA and there's nothing to do

pure gull Nov 11, 2021, 9:39 AM

#

fleet prism I have the latest AMD. So only processing time gets affected and no change to ov...

I don't know how to proceed but you can start your research here - you'll get some googleable terms at least https://medium.com/analytics-vidhya/install-tensorflow-2-for-amd-gpus-87e8d7aeb812

Medium

Install Tensorflow 2 for AMD GPUs

AMD has released ROCm, a Deep Learning driver to run Tensorflow-written scripts on AMD GPUs. However, many owners and I have encountered…

pure gull Nov 11, 2021, 9:40 AM

#

fleet prism I have the latest AMD. So only processing time gets affected and no change to ov...

if you're buying GPU specifically for deep learning, get an nvidia card - it's because of the software support I recommend this, hardware without software is irrelevant

fleet prism Nov 11, 2021, 9:42 AM

#

pure gull if you're buying GPU specifically for deep learning, get an nvidia card - it's b...

Tyvm

pure gull Nov 11, 2021, 9:56 AM

#

fleet prism Tyvm

Hope that link help you get your AMD card running though!

tight glacier Nov 11, 2021, 11:04 AM

#

Can someone help me interpret this confusion matrix

#

#

Q: What is the pair of classes that is most confusing for the 1 -nearest neighbour classifier trained in the previous sections?
My answer is the pair of classes that is most confusing is 4,9 we see here that these columns contain the most number of misclassifications
please can some let me know if my thinking is right?

uncut barn Nov 11, 2021, 11:38 AM

#

Does anyone know how matplotlib converts my array to this grayscale image?

#

#

tidal bough Nov 11, 2021, 11:50 AM

#

uncut barn

Are you using something like imshow?

uncut barn Nov 11, 2021, 11:55 AM

#

Yh

tidal bough Nov 11, 2021, 12:02 PM

#

tidal bough Are you using something like `imshow`?

normNormalize, optional

The Normalize instance used to scale scalar data to the [0, 1] range before mapping to colors using cmap. By default, a linear scaling mapping the lowest value to 0 and the highest to 1 is used. This parameter is ignored for RGB(A) data.

so by default, it detects the lowest and highest values of the data, and rescales the brightness to [0,1] based on that.

uncut barn Nov 11, 2021, 12:04 PM

#

tidal bough > normNormalize, optional > > The Normalize instance used to scale scalar d...

is there code that turns the array into this plot (with those values)?

#

my array is a (256, 256)

tidal bough Nov 11, 2021, 12:04 PM

#

What do you mean by an image?

#

like... do you to show it with matplotlib, or a file, or what?

uncut barn Nov 11, 2021, 12:05 PM

#

yh Im wondering how they determined certain pixels 255 and the others 0

tidal bough Nov 11, 2021, 12:06 PM

#

You're saying your data isn't this contrasting?

uncut barn Nov 11, 2021, 12:08 PM

#

no what im trying to do is convert the values myself but i get this

uncut barn Nov 11, 2021, 12:09 PM

#

tidal bough You're saying your data isn't this contrasting?

which isnt exactly the image that was shown before

hollow ember Nov 11, 2021, 12:10 PM

#

https://pastebin.com/jQWXU0iM

Pastebin

`def plot_conf_mat(conf_mat): """ Plot a condusion matrix usi...

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

tidal bough Nov 11, 2021, 12:10 PM

#

Perhaps they used a threshold smaller than 1, then?

hollow ember Nov 11, 2021, 12:10 PM

#

hollow ember https://pastebin.com/jQWXU0iM

AttributeError: 'QuadMesh' object has no property 'char'

#

Im using Seaborn to plot heatmap of confusion matrix

uncut barn Nov 11, 2021, 12:11 PM

#

tidal bough Perhaps they used a threshold smaller than 1, then?

yh but i cant find the threshold of the cmap gray in matplotlib

tidal bough Nov 11, 2021, 12:12 PM

#

You're applying the threshold yourself, here - arr[arr == 1] = 255, arr[arr < 1] = 0

uncut barn Nov 11, 2021, 12:12 PM

#

yh but what does matplotlib do for cmap gray, thats what im trying to find out/need help with

hollow ember Nov 11, 2021, 12:14 PM

#

hollow ember `AttributeError: 'QuadMesh' object has no property 'char'`

@tidal bough help

tidal bough Nov 11, 2021, 12:15 PM

#

uncut barn yh but what does matplotlib do for cmap gray, thats what im trying to find out/n...

If you pass it an array of only zeros and 255s, it just, well, colors the zeros black and the whites white

uncut barn Nov 11, 2021, 12:15 PM

#

yh but my array is between 0 and 1

#

hollow ember Nov 11, 2021, 12:20 PM

#

@serene scaffold AttributeError: 'QuadMesh' object has no property 'char'
Im using Seaborn to plot heatmap of confusion matrix

#

https://pastebin.com/jQWXU0iM

Pastebin

`def plot_conf_mat(conf_mat): """ Plot a condusion matrix usi...

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

placid tendon Nov 11, 2021, 12:34 PM

#

can anyone help me fix my carla simulator because my tutorial.py wont work due to world = client.get_world() not working

rotund trellis Nov 11, 2021, 1:03 PM

#

How do i load the images contained in the subdirectories Normal and Pneumonia, into python to its respectful folder?

hexed jasper Nov 11, 2021, 1:13 PM

#

Hi i have question regarding to object detection, does image size matter in labeling image? like do i need to convert each image size to 300x300 for example before labeling it

dusk iris Nov 11, 2021, 1:40 PM

#

What would be the best way to approach finding the dominant color of an image which comes from the sliced video frame?

#

Been attempting to make this work for quite some time now but didnt really find a decent solution

#

I've found some modules (colorthief, dominantcolor, dominantcolorfast etc) which work well if i source video from disk, but when i attempt to push frames into them they just dont give good enough results

tight glacier Nov 11, 2021, 2:46 PM

#

Is the unbiased estimate and the MLE of any statistic always the same?

lone drum Nov 11, 2021, 2:52 PM

#

Hello
I am working with 3 data frames
In which one data frame have date_time and strike_price column
I want to search that in other 2 data frames
I want to get rows from other 2 dataframes which matched with first dataframe

#

Ping me when replying

normal violet Nov 11, 2021, 3:09 PM

#

Has anyone here worked with ANN models? If so please ping me!

odd meteor Nov 11, 2021, 3:32 PM

#

dusk iris Been attempting to make this work for quite some time now but didnt really find ...

Idk if this would work since I know little about Computer Vision at the moment but give it a try.

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
import matplotlib.image as img
from scipy.cluster.vq import kmeans, vq
image = img.read('sea.jpg')
image.shape
r = []
g = []
b = [] 
for row in image:
    for pixel in row:
        temp_r, temp_g, temp_b = pixel
        r.append(temp_r)
        g.append(temp_g)
        b.append(temp_b)
pixel_df = pd.DataFrame({'red' : r, 'blue' : b, 'green' : g})
from sklearn.preprocessing import MinMaxScaler
scaler = MinMaxScaler() 
pixel_df[['scaled_red', 'scaled_green', 'scaled_green']] = scaler.fit_transform(pixel_df[['red', 'green', 'blue']]) 

#Creating an elbow plot
distortions = []
num_clusters = range(1, 11)
for i in num_clusters:
    cluster_centers, distortion = kmeans(pixel_df[['scaled_red', 'scaled_blue', 'scaled_green']], i)
    distortions.append(distortion)
elbow_plot = pd.DataFrame({'num_clusters' : num_clusters, 'distortions' : distortions})
sns.lineplot(x='num_clusters', y='distortions', data=elbow_plot)
plt.xticks(num_clusters)
plt.show() #assume this revealed the elbow plot to be at 2

#Finding Dominant Colours

cluster_centers = kmeans(pixel_df[['scaled_red', 'scaled_blioe', 'scaled_green']], 2)
colors = []
r_std, b_std, g_std = pixel_df[['red', 'blue', 'green']].std()
for cluster_center in cluster_centers:
    scaled_r, scaled_g, scaled_b = cluster_center
    colors.append((scaled_r * r_std/255, scaled_b * b_std/255, scaled_g* g_std/255))

#Display Dominant Colour
print(colors)
plt.imshow([colors])
plt.show()

rigid zodiac Nov 11, 2021, 3:33 PM

#

lone drum Hello I am working with 3 data frames In which one data frame have `date_time` ...

i think
df[df['strike_price'] == 3]

normal violet Nov 11, 2021, 3:34 PM

#

#help-orange , please read

dusk iris Nov 11, 2021, 3:35 PM

#

odd meteor Idk if this would work since I know little about Computer Vision at the moment b...

Eh whats sns and plt in this case

odd meteor Nov 11, 2021, 3:39 PM

#

dusk iris Eh whats sns and plt in this case

They are alias of Seaborn and Matplotlib visualization libraries respectively. You'd have to 1st import them at the beginning of the code
import seaborn as sns import matplotlib.pyplot as plt

lone drum Nov 11, 2021, 3:41 PM

#

rigid zodiac i think ``` df[df['strike_price'] == 3]```

hii

#data-science-and-ml

transforms.RandomResizedCrop(input_size),

transforms.RandomHorizontalFlip(),