hasty mountain Jan 13, 2023, 12:43 AM

#

But instead of using labels, the model compares the consequence of its actions to the expected consequences

iron basalt Jan 13, 2023, 12:43 AM

#

The idea is pretty straight forward. If the discriminator can't tell apart the expert from the novice, then the novice has mimicked the expert.

hasty mountain Jan 13, 2023, 12:45 AM

#

Interesting... I'll take a look. Thanks!

iron basalt Jan 13, 2023, 12:47 AM

#

If you are not trying to do IRL. Are you trying to use a generative model to maybe make the problem a bit easier (e.g. a generative model that reduces dimensionality)? Maybe do rollouts?

spiral barn Jan 13, 2023, 12:52 AM

#

im trying to do sentiment analysis of buisness articles. Ive tried to use both textblob and vaderSentiment but neither seem to work very well. Is there any alternatives?

lapis sequoia Jan 13, 2023, 12:53 AM

#

Yeah i can recommend 2

#

NLTK's SentimentIntensityAnalyzer and Genism

lapis sequoia Jan 13, 2023, 12:54 AM

#

spiral barn im trying to do sentiment analysis of buisness articles. Ive tried to use both ...

http://www.nltk.org/api/nltk.sentiment.html
https://radimrehurek.com/gensim/

Gensim: topic modelling for humans

Efficient topic modelling in Python

spiral barn Jan 13, 2023, 12:55 AM

#

thanks you ill try these

iron basalt Jan 13, 2023, 12:56 AM

#

hasty mountain Interesting... I'll take a look. Thanks!

https://arxiv.org/pdf/1709.06683.pdf

feral hedge Jan 13, 2023, 4:47 AM

#

Could someone recommend a data set? I'm seeking images that contain a single subject on a transparent background

arctic wedgeBOT Jan 13, 2023, 6:28 AM

#

@fleet sapphire Per Rule 6, your invite link has been removed. If you believe this was a mistake, please let staff know!

Our server rules can be found here: https://pythondiscord.com/pages/rules

hasty mountain Jan 13, 2023, 6:49 AM

#

feral hedge Could someone recommend a data set? I'm seeking images that contain a single sub...

There's Fashion MNIST, but it's grayscaled

feral hedge Jan 13, 2023, 6:49 AM

#

hmm, ok I saw this one on pytorch guide today

hasty mountain Jan 13, 2023, 6:49 AM

#

Take a look as tensorflow and Pytorch's Datasets pages

#

They might give you some ideas

mint palm Jan 13, 2023, 10:31 AM

#

i wanna try feeding saliency based segmented data to model, would pre-trained model be fine for that? if so, are there any recommendation of what model to use?

spiral glacier Jan 13, 2023, 10:49 AM

#

i am using plotly with a scatter plot. How can i prevent the x-axis from showing the gaps between ticks where no values are present? I tried setting the tickvals and ticktext.. but the result is not what i expected..

fig2.update_layout(
xaxis = dict(
tickmode = 'array',
tickvals = dff2.Auftragnummer.sort_values().unique(),
ticktext = dff2.Auftragnummer.sort_values().unique(),
)
)

supple wyvern Jan 13, 2023, 12:07 PM

#

Is it possible by any chance if anyone can help me with this? https://stackoverflow.com/questions/75108957/assertionerror-the-algorithm-only-supports-class-gym-spaces-box-box-as-acti

Stack Overflow

AssertionError: The algorithm only supports as action spaces but B...

So basically I tried converting this custom gym environment from https://github.com/Gor-Ren/gym-jsbsim to use farama foundation's gymnasium api. This is my repo whih I am working on: https://github...

vale lava Jan 13, 2023, 2:11 PM

#

I wrote some pytorch code to fit models once and for all. Introducing '𝙩𝙤𝙧𝙘𝙝𝙮', a PyTorch wrapper that preserves all of the functionality and code writing conventions of PyTorch, while adding the convenience of the '𝙢𝙤𝙙𝙚𝙡.𝙛𝙞𝙩()' method directly to the nn.Module class. With 'torchy', you can devote your time to making good models and preprocessing the data, while your training process becomes more efficient. Additional utility functions are also included in nn.Module and in torch.utils.data to make your PyTorch experience even more seamless.

You can find the torchy module on:
GitHub at https://lnkd.in/dDj3vV5r
PyPI at https://lnkd.in/dRn-BgQH
Docs at https://lnkd.in/dmnQHmJd

I hope you find torchy as useful as I do and I would love to hear your feedback.

PS: torchy is also my first python module in production.

hasty mountain Jan 13, 2023, 4:27 PM

#

When dealing with a dataset where the differences between each input are subtle, should I use a network with more features per layer or should I use more layers?
(I really don't want to make a monster with billions of parameters)

umbral ermine Jan 13, 2023, 5:13 PM

#

I need a power bi partner

shadow dew Jan 13, 2023, 5:55 PM

#

im editing source code from https://github.com/alxschwrz/dalle2_python
ive changed the function open_urls_in_browser to open a new qt window but i dont know how to display the image in it

def open_urls_in_browser(self, image_urls=None):
if image_urls is None:
image_urls = self.image_urls
for url in image_urls:
app = QApplication(sys.argv)
win = QMainWindow()
win.setGeometry(512,512,500,500)
win.setWindowTitle("Dall:E")
label = QLabel()
pixmap = QPixmap(url)
label.setPixmap(pixmap)
win.show()
sys.exit(app.exec())

GitHub

GitHub - alxschwrz/dalle2_python: DALLE2 in the command line.

DALLE2 in the command line. Contribute to alxschwrz/dalle2_python development by creating an account on GitHub.

lapis sequoia Jan 13, 2023, 8:53 PM

#

Hello everyone, I have a problem while practicing the fundamentals, why is this not working?? I wrote a name, and a seed=42, Ive tried importing tensorflow and numpy as well, and still didnt work.

#

I have a global and operational variable

austere swift Jan 13, 2023, 9:19 PM

#

try converting the masks into integers

#

or boolean

lapis sequoia Jan 13, 2023, 9:20 PM

#

help por fi

feral hedge Jan 13, 2023, 9:21 PM

#

hey thanks, I'm working on doing that preprocessing step I missed to normalize the data.

odd meteor Jan 13, 2023, 9:25 PM

#

lapis sequoia Hello everyone, I have a problem while practicing the fundamentals, why is this ...

The error message is pointing to the object not_shuffled

It's not a variable you've created anywhere, so python couldn't find it. Ensure you declared the variable / there's no typo in the variable name.

lapis sequoia Jan 13, 2023, 9:26 PM

#

@odd meteor okay will check, thanks

feral hedge Jan 13, 2023, 10:34 PM

#

train_dataloader = DataLoader(training_data, batch_size=64, shuffle=True)
for epoch in range(10):
    for images, masks in train_dataloader:

is it correct to say that images is a batch of 64 single images (which my get returns a single image and mask) that are then passed to the model? how is this different from one image at a time?

and is it safe to say if all my images are the same size, and one "batch" or iteration of the inner loop takes for example 10 seconds, than the total time I must wait for the model to finish running is total number of (images / 64) * the time it took for one batch

#

or am I misunderstanding entirely

serene scaffold Jan 13, 2023, 11:14 PM

#

feral hedge ```py train_dataloader = DataLoader(training_data, batch_size=64, shuffle=True) ...

are you using a CPU or a GPU?

#

In either case, the amount of math to be done per training instance is the same no matter what batch size you're doing. but a larger batch size means that there are fewer iterations of the for images, masks in train_dataloader loop and that more math is being done per pytorch operation. and letting pytorch do more work, and doing less pure Python work, is faster. And if you're using a GPU, the pytorch work is also being done in parallel.

#

but you also keep in mind how many training instances you want to account for before you adjust the weights.

feral hedge Jan 13, 2023, 11:46 PM

#

serene scaffold are you using a CPU or a GPU?

thanks, that clears some up. right now I noticed it's only using my cpu. I'll have to figure that out soon so I can use gpu. doesn't seem to be detected.

hasty mountain Jan 13, 2023, 11:48 PM

#

feral hedge thanks, that clears some up. right now I noticed it's only using my cpu. I'll ha...

Did you define your device?

serene scaffold Jan 13, 2023, 11:54 PM

#

feral hedge thanks, that clears some up. right now I noticed it's only using my cpu. I'll ha...

what GPU do you have?

feral hedge Jan 13, 2023, 11:56 PM

#

hasty mountain Did you define your device?

device = "cuda" if is_available() else "cpu"

hasty mountain Jan 13, 2023, 11:56 PM

#

Oh, ok

feral hedge Jan 13, 2023, 11:56 PM

#

serene scaffold what GPU do you have?

nvidia gtx with 4GB vram

#

have to check model

hasty mountain Jan 13, 2023, 11:56 PM

#

Usually, the tensors aren't initialized in your defined device, so you have to use .to(device)

serene scaffold Jan 13, 2023, 11:57 PM

#

feral hedge nvidia gtx with 4GB vram

if you do nvidia-smi at the terminal, you can see what your cuda version is. you might have the wrong pytorch wheel for your cuda version.

#

also device needs to be an instance of torch.device, not a string.

feral hedge Jan 13, 2023, 11:58 PM

#

serene scaffold if you do `nvidia-smi` at the terminal, you can see what your cuda version is. y...

I think I should determine if my OS is utilizing my gpu to begin with since I reformatted

#

I'm not confident it is

#

oh hmm

serene scaffold Jan 13, 2023, 11:59 PM

#

@feral hedge did you do nvidia-smi at the terminal? what happened?

feral hedge Jan 14, 2023, 12:01 AM

#

serene scaffold <@928250196889075762> did you do `nvidia-smi` at the terminal? what happened?

i don't have it so I'm just installing the drivers now.

feral hedge Jan 14, 2023, 12:34 AM

#

serene scaffold <@928250196889075762> did you do `nvidia-smi` at the terminal? what happened?

just had a hellish time trying to enable nvidia drivers. saving it for tomorrow. will update if you're around

supple wyvern Jan 14, 2023, 2:42 AM

#

650k to go but will 78ish be the highest it'll get?

supple wyvern Jan 14, 2023, 3:05 AM

#

oh it's going higher

keen meteor Jan 14, 2023, 4:21 AM

#

Hi~~
I'm editting Awesome ComputerVision.
Current content created includes Image Classification, Semantic Sementation, Object Detection, Fine grained Visual Categorization.

Unlike the existing Awesome, Kaggle and colab running code were added.
Thanks for lot of attention...!
https://github.com/kalelpark/Awesome-ComputerVision

GitHub

GitHub - kalelpark/Awesome-ComputerVision: Awesome-ComputerVision

Awesome-ComputerVision. Contribute to kalelpark/Awesome-ComputerVision development by creating an account on GitHub.

mint palm Jan 14, 2023, 7:48 AM

#

how to get saliency based segmented image of dataset? i see most of the literature on "detecting saliency" but not segmenting it

ripe sapphire Jan 14, 2023, 8:01 AM

#

One popular method is to use a saliency detection model, such as a deep neural network, to predict the saliency map of an image.

#

Once the saliency map is obtained, it can be thresholded to create a binary mask that segments the salient regions from the background.

mint palm Jan 14, 2023, 8:03 AM

#

ripe sapphire One popular method is to use a saliency detection model, such as a deep neural n...

actually the current model is transformer for video action recognition
Does that mean incorporating saliency based segmentation mean i will feed human with black backgroud to transformer?

tranquil jasper Jan 14, 2023, 1:17 PM

#

any good toturial for polars yet?

clear blaze Jan 14, 2023, 3:28 PM

#

i think taking datacamp/dataquest/<insert favorite vendor> and duplicating the questions using polars is my next step.

serene scaffold Jan 14, 2023, 3:53 PM

#

looks like you're promoting your own channel?

lapis sequoia Jan 14, 2023, 3:59 PM

#

I didn’t mean to

#

I just posted a video relevant to the topic

serene scaffold Jan 14, 2023, 4:00 PM

#

lapis sequoia I just posted a video relevant to the topic

if you're going to drop links, at least say why they're interesting. no one is just going to click random links.

lapis sequoia Jan 14, 2023, 4:02 PM

#

True . I can explain it

#

YOLOv8 is the newest version of the You only look once (YOLO) family . YOLO is a state-of-the-art, real-time object detection system and is used object detection, image classification, and instance segmentation tasks.

Here is a video i made about it

YOLO v8
https://youtu.be/jMvLCZBXbtc

YouTube

TechBro Mustafa

YOLO v8

YOLOv8 is the newest version of the You only look once (YOLO) family . YOLO is a state-of-the-art, real-time object detection system and is used object detection, image classification, and instance segmentation tasks.

▶ Play video

#

@serene scaffold check it out now

nova totem Jan 14, 2023, 5:02 PM

#

Good afternoon everyone, I'm new to the channel. I'm researching time series for an academic project. I have to create a model. I didn't get to work with it, if anyone can give me a direction to start I'd appreciate it.

lapis sequoia Jan 14, 2023, 5:07 PM

#

how do i get into ML and AI from the beginning

#

This is really neat, thanks!

stoic kettle Jan 14, 2023, 5:34 PM

#

Hi friends, I need a little help on how to build my classification model in TensorFlow using Python.

I have 7 classes and I want to identify each individual with one of them, but if it doesn't belong to any class mark it as Unidentified.

I would be very grateful 😭

tranquil jasper Jan 14, 2023, 5:34 PM

#

lapis sequoia how do i get into ML and AI from the beginning

Data science has three parts

tranquil jasper Jan 14, 2023, 5:34 PM

#

lapis sequoia how do i get into ML and AI from the beginning

Data engineering
Data analysis
Machine learning

#

Most companies want you to do all three

#

So it would make sense to start from data engineering and analysis

#

But if you want to go straight to ML
There are good courses on YouTube and...

mild dirge Jan 14, 2023, 5:37 PM

#

stoic kettle Hi friends, I need a little help on how to build my classification model in Tens...

You could see this as having 8 classes, 7 different kind of cats and "other"

#

As long as the data is balanced that should make it easier

#

Could also have a model to see if it is a cat, and then one to differentiate between different types of cats

tranquil jasper Jan 14, 2023, 5:39 PM

#

Any good introduction to numpy?

lapis sequoia Jan 14, 2023, 5:39 PM

#

I see

lapis sequoia Jan 14, 2023, 5:39 PM

#

tranquil jasper But if you want to go straight to ML There are good courses on YouTube and...

Ok

lapis sequoia Jan 14, 2023, 5:40 PM

#

stoic kettle Hi friends, I need a little help on how to build my classification model in Tens...

I can't even understand this , to me this looks like Schrodinger's cat

tranquil jasper Jan 14, 2023, 5:41 PM

#

lapis sequoia Ok

You can also check out edx and Coursera

#

Some big tech have courses there

wind lintel Jan 14, 2023, 7:54 PM

#

Does anybody use alpa and jax?

lapis sequoia Jan 14, 2023, 7:55 PM

#

ChatGPT (Generative Pre-trained Transformer)[1] is a chatbot launched by OpenAI in November 2022. It is built on top of OpenAI's GPT-3 family of large language models, and is fine-tuned (an approach to transfer learning[2]) with both supervised and reinforcement learning techniques. I made a video on some cool Facts about ChatGPT https://youtu.be/AiwjiSrMg0c

YouTube

TechBro Mustafa

ChatGPT Facts

ChatGPT enables users to submit questions using a natural language interface and receive responses written in conversational, if somewhat stilted, language.

The bot will respond based on the context of your conversation, considering your previous queries and answers. Its results come from analysing vast amounts of data collected from the World ...

▶ Play video

wind lintel Jan 14, 2023, 7:55 PM

#

I installed jax and jaxlib from alpa rep and jax functions like jax.random.PRNGKey(0) throw an error

pure sun Jan 14, 2023, 8:05 PM

#

how to make an AI:

#

?*

wind lintel Jan 14, 2023, 8:06 PM

#

?

pure sun Jan 14, 2023, 8:06 PM

#

wow

pure sun Jan 14, 2023, 8:06 PM

#

lapis sequoia ChatGPT (Generative Pre-trained Transformer)[1] is a chatbot launched by OpenAI ...

ChatGPT is OP

lapis sequoia Jan 14, 2023, 8:07 PM

#

SOP?

pure sun Jan 14, 2023, 8:08 PM

#

lapis sequoia SOP?

OP*

#

wallah OP

lapis sequoia Jan 14, 2023, 8:08 PM

#

pure sun OP*

It's pretty awesome

pure sun Jan 14, 2023, 8:08 PM

#

lapis sequoia It's pretty awesome

I want one like it but BUT BUT he answer everything

#

and Don't say its unethcal or something

lapis sequoia Jan 14, 2023, 8:10 PM

#

You can use ChatGPT iself

pure sun Jan 14, 2023, 8:11 PM

#

lapis sequoia You can use ChatGPT iself

I asked him how to hack he told me fuck off

lapis sequoia Jan 14, 2023, 8:11 PM

#

Don't use it for that. It will report you and ban you

pure sun Jan 14, 2023, 8:11 PM

#

sad

serene scaffold Jan 14, 2023, 8:55 PM

#

@lapis sequoia I'm going to have to ask you to stop positing your own videos.

#

Three in four hours is pretty excessive.

mint palm Jan 14, 2023, 8:56 PM

#

can i do saliency detection without finetuning?

serene scaffold Jan 14, 2023, 8:56 PM

#

mint palm can i do saliency detection without finetuning?

Can you explain what fine tuning is?

mint palm Jan 14, 2023, 8:57 PM

#

serene scaffold Can you explain what fine tuning is?

making a pretrain model learn to do similar task on similar dataset different to one is was trained on

#

actually i ask because i dont have ground truth for my video dataset

#

i wanna have a fixed backbone for saliency based segmentation
Are there any?

#

lemon_sentimental

heavy basin Jan 14, 2023, 11:24 PM

#

#

what does "Instead, open Anaconda3 with the Windows Start menu and select Anaconda(64-bit)" mean?

#

what am i supposed to do

serene scaffold Jan 14, 2023, 11:28 PM

#

@heavy basin adding a directory to PATH means that Windows will look in that directory for executables to run commands

#

Like, if you run a python command, the operating system needs to find an executable program named python that can do the command.

#

It says that it's not recommended. (And I don't recommend anaconda.)

heavy basin Jan 14, 2023, 11:30 PM

#

idk im trying to do a project with pandas and it recommended me to use anaconda

hasty mountain Jan 14, 2023, 11:31 PM

#

heavy basin idk im trying to do a project with pandas and it recommended me to use anaconda

Just use pip install pandas in the command prompt

#

It'll install pandas to your python directory and won't fulfill your computer with garbage

heavy basin Jan 14, 2023, 11:32 PM

#

is anaconda bad

serene scaffold Jan 14, 2023, 11:33 PM

#

heavy basin is anaconda bad

I think it's antiquated and that the data science community needs to let go of it.

heavy basin Jan 14, 2023, 11:33 PM

#

k

hasty mountain Jan 14, 2023, 11:33 PM

#

Yes. It makes things way more complicated than it should be, and might also mess up with your IDLE because it'll install another Python

heavy basin Jan 14, 2023, 11:34 PM

#

i will uninstall

serene scaffold Jan 14, 2023, 11:34 PM

#

> using idle

heavy basin Jan 14, 2023, 11:34 PM

#

ty

hasty mountain Jan 14, 2023, 11:34 PM

#

serene scaffold \> using idle

VS Code fan here hyperlemon

hard shoal Jan 15, 2023, 12:32 AM

#

hello everyone, any suggestions on methods to deal with imbalanced data..?

austere swift Jan 15, 2023, 12:40 AM

#

heavy basin is anaconda bad

i personally use conda since it handles cuda very well but I'm probably gonna switch over to a more classic env manager at some point

heavy basin Jan 15, 2023, 12:42 AM

#

do i ask pandas questions here

austere swift Jan 15, 2023, 12:42 AM

#

yes

heavy basin Jan 15, 2023, 12:43 AM

#

i want to create a dataframe and this is the txt file for each person's grade percentage, grade letter, and student ID

serene scaffold Jan 15, 2023, 12:43 AM

#

heavy basin i want to create a dataframe and this is the txt file for each person's grade pe...

is the student ID on a separate line, or is it just wrapping?

heavy basin Jan 15, 2023, 12:44 AM

#

separate line

serene scaffold Jan 15, 2023, 12:44 AM

#

that's really annoying

heavy basin Jan 15, 2023, 12:44 AM

#

yeah

serene scaffold Jan 15, 2023, 12:45 AM

#

heavy basin yeah

if you can read that file in as a string, and flatten each data point into one line, you can use pd.read_table

#

who gave you this data?

heavy basin Jan 15, 2023, 12:46 AM

#

it originally was an image and i used a image to text converter

#

https://media.discordapp.net/attachments/1044831441734860891/1063326433369411644/IMG_0560.png?width=253&height=547

serene scaffold Jan 15, 2023, 12:46 AM

#

oh okay

heavy basin Jan 15, 2023, 12:46 AM

#

actually i can ask my friend to send me the text of that

serene scaffold Jan 15, 2023, 12:46 AM

#

that's data cleaning for you, I guess.

#

the other thing you could use is a regular expression.

austere swift Jan 15, 2023, 12:48 AM

#

you could also replace every other \n with a

heavy basin Jan 15, 2023, 12:48 AM

#

austere swift you could also replace every other `\n` with a ` `

wait this is smart

#

ill try it ty

lapis sequoia Jan 15, 2023, 12:49 AM

#

hello everyone Im trying to solve this fundamental problem, I expected this to work in Google colab

austere swift Jan 15, 2023, 12:49 AM

#

lapis sequoia hello everyone Im trying to solve this fundamental problem, I expected this to w...

probably an issue with you not running the cells in order

serene scaffold Jan 15, 2023, 12:49 AM

#

lapis sequoia hello everyone Im trying to solve this fundamental problem, I expected this to w...

please don't do screenshots of text. you can copy and paste it into the chat.

lapis sequoia Jan 15, 2023, 12:49 AM

#

serene scaffold please don't do screenshots of text. you can copy and paste it into the chat.

okay

austere swift Jan 15, 2023, 12:49 AM

#

if not_shuffled was defined in another cell that you didn't run yet then it wouldn't be defined

#

common mistake with notebooks

lapis sequoia Jan 15, 2023, 12:50 AM

#

austere swift probably an issue with you not running the cells in order

Okay I see... Thank you

#

gonna try and solve it now

heavy basin Jan 15, 2023, 1:10 AM

#

austere swift you could also replace every other `\n` with a ` `

any way to do this inplace?

#

all the tutorials ive seen they create a list, do the .replace() and then write it back

serene scaffold Jan 15, 2023, 1:46 AM

#

heavy basin any way to do this inplace?

no. strings are immutable, so in-place operations are impossible.

heavy basin Jan 15, 2023, 1:48 AM

#

ok

#

@serene scaffold can you help me i kinda have no clue what im doing

#

#

i fixed the file and did pd.read_csv('newgrades.txt', delimiter = ' ') and this is what the dataframe looks like rn

serene scaffold Jan 15, 2023, 2:15 AM

#

heavy basin i fixed the file and did `pd.read_csv('newgrades.txt', delimiter = ' ')` and thi...

try pd.read_csv('newgrades.txt', delimiter='\s+', header=None, names=['percent', 'grade', 'student_id'])

#

a problem you're having is that it's interpreting the first row as the header

#

beyond that, what do you want to do with the data, now that you have it?

heavy basin Jan 15, 2023, 2:16 AM

#

i want to sort the dataframe by the percent column

serene scaffold Jan 15, 2023, 2:17 AM

#

so once you do df = pd.read_csv('newgrades.txt', delimiter='\s+', header=None, names=['percent', 'grade', 'student_id']), you can do df.sort_values('percent')

heavy basin Jan 15, 2023, 2:18 AM

#

wow! tysm

#

now

#

nvm

#

@serene scaffold how do i get the student IDs to be the row names

serene scaffold Jan 15, 2023, 2:25 AM

#

heavy basin <@253696366952316929> how do i get the student IDs to be the row names

pd.read_csv('newgrades.txt', delimiter='\s+', header=None, names=['percent', 'grade', 'student_id']).set_index('student_ids')

heavy basin Jan 15, 2023, 2:27 AM

#

that worked, thank you

#

how do you learn this stuff

hasty mountain Jan 15, 2023, 2:45 AM

#

How does a Conv2D works when the number of feature maps is reduced?
I know that, when I have a grayscaled image and I want to generate 64 feature maps, then my Conv layer will make 64 convolutions using 64 different kernels, right?
But what if I have 64 feature maps and I want to achieve a grayscaled image?

I just noticed that ResNet and Inception are way deeper than VGG, yet they have few parameters, so I got this curiousity...

serene scaffold Jan 15, 2023, 2:46 AM

#

heavy basin how do you learn this stuff

I was determined to figure out how to make this whole table without any loops or using .apply

heavy basin Jan 15, 2023, 2:46 AM

#

that's crazy

serene scaffold Jan 15, 2023, 2:47 AM

#

heavy basin that's crazy

every element is the result of calculations several dataframes deep. and I couldn't afford for there to be any mistakes. it probably took 50+ hours.

#

(but only because I didn't know pandas prior to that.)

heavy basin Jan 15, 2023, 2:48 AM

#

so impressive

serene scaffold Jan 15, 2023, 2:49 AM

#

my goal isn't to talk myself up, mind you. just that I learned-by-doing in a high-stakes setting.

#

whenever you want to do something with pandas, try to find a way to do it that doesn't involve loops, or apply.

#

and eventually you'll learn it pretty well.

heavy basin Jan 15, 2023, 2:50 AM

#

ok

serene scaffold Jan 15, 2023, 2:50 AM

#

hasty mountain How does a Conv2D works when the number of feature maps is reduced? I know that,...

bumping your question. does the fact that the image is greyscaled here matter, beyond there not being color channels (thus reducing the dimensionality of the image)?

hasty mountain Jan 15, 2023, 2:51 AM

#

serene scaffold bumping your question. does the fact that the image is greyscaled here matter, b...

Nah, it doesn't matter, it's just an example

#

Just to illustrate that from 1 channel to 64 channels, I'd have to make 64 convolutions(I guess)

#

Then, how is the process done when I have 64 channels and I want just 1 as output?

serene scaffold Jan 15, 2023, 2:52 AM

#

hmm? 64 channels?

hasty mountain Jan 15, 2023, 2:53 AM

#

I even have an idea of trying to combine a Conv to generate multiple images and then use a separate layer and a softmax to decide which image will be my output

#

But depending on how the Conv layer works, I suppose this would be redundant

serene scaffold Jan 15, 2023, 2:53 AM

#

I'm clearly out of my lane. going back to playing with words.

hasty mountain Jan 15, 2023, 2:54 AM

#

Don't you use convs when working with words, too?

#

At least the original idea of Transformer used Convs, too. I guess Conv2D with kernels 1x1

serene scaffold Jan 15, 2023, 2:56 AM

#

hasty mountain Don't you use convs when working with words, too?

speaking/writing are sequences of symbols, so most neural NLP techniques involve RNNs, not CNNs.

hasty mountain Jan 15, 2023, 2:57 AM

#

RNNs are lame. I always get vanishing gradients with them grumpchib

#

That's why I love attention layers

wooden sail Jan 15, 2023, 3:45 AM

#

an annoying bit here with tf is that it doesn't explicitly say what it does across channels when you do a 2d conv.

#

every 2d conv layer takes k matrices (k channels) of size m x n and spits out as many matrices as filters you specify, with sizes that depend on m,n and the filter sizes. the output is of size l x m x n, with l channels = l filters. not quite a 2d conv, but a sum or average of 2d convs. stack overflow says its just a sum, but it's the same thing modulo a scaling factor

wooden sail Jan 15, 2023, 3:55 AM

#

hasty mountain Then, how is the process done when I have 64 channels and I want just 1 as outpu...

this is what"s done if you use only 1 filter

#

as mentioned above, num filters = new num of channels

serene scaffold Jan 15, 2023, 4:02 AM

#

Does channels mean something different in this case than the RGB "layers"?

wooden sail Jan 15, 2023, 4:14 AM

#

past the 1st layer yes, since they no longer represent color after averaging

#

but also nothimg stops you from doing a 2d conv on data that isnt an age, where there is no color to speak of in the first place

#

i find calling it "channels" is misleading and hides the generality of the operation, but the name stuck due to people wanting to make networks accessible to people that dont wanna deal with the math

#

like calling it the "time axis" in rnns, space and channels in cnns, etc

echo orbit Jan 15, 2023, 4:17 AM

#

Hello, idk if that question fits in this channel (though it's about astrophysics), but does anyone know how to actualize the color of points in a 3D scatterplot in an animation please ? I'm having a hard time understanding why it doesn't work when i update the points of the scatterplot, even though it works fine if i replot the scatterplot each frame (which takes a lot more time and saving the animation keep all the previous plots, which isn't what i'm looking for)

For instance, i'm currently making an HR diagram (Luminosity VS Temperature) where the points change in color depending on the value of the temperature.

It looks like this if i replot every frame :

def updatefig_full(*args):
    global j
    if (j<len(logTe[:,0])-1):
        j += 1
    else:
        j=0
    index = np.where(logTe[j]!=0)[0]
    if (j>0 and np.size(index)!=0):
        Te_no_zeros = Te[j,index]
        L_no_zeros=L[j,index]
        size_no_zeros=size[j,index]
        scatplot = ax.scatter(Te_no_zeros, L_no_zeros, s=size_no_zeros,
                              c=Te_no_zeros,edgecolor='k',linewidth=.5,cmap='rainbow_r')
    else:
        scatplot = ax.scatter(Te[j], L[j], s=size[j],c=Te[j],edgecolor='k',linewidth=.5,cmap='rainbow_r')
    time_text.set_text('t = %.3f Myr'%round((10**Age[j])*1e-6,3))
    return scatplot,time_text

As you can see the c parameter needs the temperature array.

If i do it by updating the parameters of each point, i'll use this instead to generate the RGBA array (which, if i understand correctly, is automatically done by the ax.scatter function)

colormap = cm.rainbow_r
normalize = mcolors.Normalize(vmin=np.min(Te[j]), vmax=np.max(Te[j]))
s_map = cm.ScalarMappable(norm=normalize, cmap=colormap)
cmap=s_map.to_rgba(Te[j])
scatplot.set_sizes(size[j])
scatplot.set_facecolor(cmap)

with j the frame.

Could anyone explain to me what's wrong with that last portion please ?

#

The color change should be like this, but the other case keeps the color of each point (as if the temperature was the one at the first frame)

wooden sail Jan 15, 2023, 4:38 AM

#

echo orbit The color change should be like this, but the other case keeps the color of each...

what if you call scatplot.draw() after updating the oarams

echo orbit Jan 15, 2023, 4:39 AM

#

Right in the updating function ?

#

I think i should show the function i'm using for the 2nd case :

def updatefig(*args):
    global j
    if (j<len(logTe[:,0])-1):
        j += 1
    else:
        j=0
    index = np.where(logTe[j]!=0)[0]
    if (j>0 and np.size(index)!=0):
        Te_no_zeros = Te[j,index]
        L_no_zeros=L[j,index]
        size_no_zeros=size[j,index]
        pos = np.array([[Te_no_zeros[i],L_no_zeros[i]] for i in range(len(Te_no_zeros))])
        normalize = mcolors.Normalize(vmin=np.min(Te_no_zeros), vmax=np.max(Te_no_zeros))
        s_map = cm.ScalarMappable(norm=normalize, cmap=colormap)
        cmap=s_map.to_rgba(L_no_zeros)
        scatplot.set_sizes(size_no_zeros)
        scatplot.set_facecolor(cmap)
    else:
        pos = np.array([[Te[j,i],L_no_zeros[j,i]] for i in range(len(Te[j,i]))])
        normalize = mcolors.Normalize(vmin=np.min(Te[j]), vmax=np.max(Te[j]))
        s_map = cm.ScalarMappable(norm=normalize, cmap='rainbow_r')
        cmap=s_map.to_rgba(Te[j])
        scatplot.set_sizes(size[j])
        scatplot.set_facecolor(cmap)
        
    scatplot.set_offsets(pos)
    scatplot.draw() #Scatplot Draw here
    time_text.set_text('t = %.3f Myr \n n = %i'%(round((10**Age[j])*1e-6,3),j))
    return scatplot,time_text

This returns me an error :

TypeError: draw_wrapper() missing 1 required positional argument: 'renderer'

wooden sail Jan 15, 2023, 4:44 AM

#

hmm maybe i'm thinking of fig.canvas.draw() instead

#

do you have the figure stored in a variable?

echo orbit Jan 15, 2023, 4:45 AM

#

Still no change in color

#

I do

#

fig,ax=plt.subplots()
fig.suptitle('Hertzsprung-Russel Diagram')

wooden sail Jan 15, 2023, 4:46 AM

#

and how about
renderer = fig.canvas.renderer
scatplot.draw(renderer)

echo orbit Jan 15, 2023, 4:47 AM

#

No change again

#

For some reason the program doesn't want to renormalize with each frame

wooden sail Jan 15, 2023, 4:50 AM

#

and you're calling draw every frame? i would maybe try animation, but otherwise i'm not sure what the matter is

echo orbit Jan 15, 2023, 4:50 AM

#

I am

#

And i am animating

#

if fig_update: #Generate the figure each frame
    anim = animation.FuncAnimation(fig, updatefig_full,frames = len(Te[:,0]),
                          interval = 10, blit=True)
else: #Actualize the parameters of the points each frame
    anim = animation.FuncAnimation(fig, updatefig,frames = len(Te[:,0]),
                          interval = 10, blit=True)
Writer = animation.writers['ffmpeg']
writer = Writer(fps=30, bitrate=1800)

#

@wooden sail Found the issue

#

I had to use scatplot.set_array(Te[j]) instead of scatplot.set_facecolor(Te[j])

#

As it seems facecolor doesn't change the color of the points

#

Anyway thanks for your help 😳

wooden sail Jan 15, 2023, 5:18 AM

#

oof, aight

shell sequoia Jan 15, 2023, 8:05 AM

#

Hi guys

#

I am Mohit Narwani, A senior data analyst
I worked at tableau(sales force)

#

we generally used c++ at tableau but libraries like seaborn, matplotlib and plotly are very good for visualization. So I have decided to make a whatsapp group where i can add plots based on user's demand and also they can also help

#

https

atomic tide Jan 15, 2023, 8:09 AM

#

@shell sequoia We block WhatsApp links for a reason. Please get permission from the server admins before advertising or promoting within the server. Thanks.

shell sequoia Jan 15, 2023, 8:09 AM

#

okay

atomic tide Jan 15, 2023, 8:09 AM

#

You can contact the admins by DMing @sonic vapor

shell sequoia Jan 15, 2023, 8:09 AM

#

no i am not here to advertise, I am just here to make data vis more strong. I am not going to earn any money from it

shell sequoia Jan 15, 2023, 8:11 AM

#

shell sequoia we generally used c++ at tableau but libraries like seaborn, matplotlib and plot...

if someone is interested can dm e

#

me

hasty mountain Jan 15, 2023, 11:08 AM

#

wooden sail every 2d conv layer takes k matrices (k channels) of size m x n and spits out as...

pithink

wooden sail Jan 15, 2023, 11:08 AM

#

hmm?

hasty mountain Jan 15, 2023, 11:09 AM

#

Isn't a convolution, like... You pass a kernel(mxn array) through your input, column by column, row by row, applying element-wise multiplications which are summed to get your final feature map pixel?

#

So, I can imagine that, if your input is an array 1xHeightxWidth, and you want 64 feature maps, you'll just create 64 kernels and apply 64 convolutions, one for each kernel.

But what if you have 64 feature maps and you want as output a single array 1xHeightxWidth, how is this done?

wooden sail Jan 15, 2023, 11:11 AM

#

as i said, you add them up

#

note that this is not the only way of doing it, you're asking for an operation that can be defined in more than one way

#

tensorflow adds them up

hasty mountain Jan 15, 2023, 11:12 AM

#

So...it's a single convolution through each feature map, using the same kernel, and in the end tensorflow adds all the 64 outputs together, element-wise?

wooden sail Jan 15, 2023, 11:12 AM

#

yes

hasty mountain Jan 15, 2023, 11:12 AM

#

Strange...

#

Do you know how Pytorch does it?

wooden sail Jan 15, 2023, 11:13 AM

#

no idea

hasty mountain Jan 15, 2023, 11:13 AM

#

pithink

wooden sail Jan 15, 2023, 11:13 AM

#

https://pytorch.org/docs/stable/generated/torch.nn.Conv2d.html

#

i've never looked at it before

hasty mountain Jan 15, 2023, 11:14 AM

#

It's just that, in my GANs, it seems that Conv2D layers that output fewer channels than the input has much, much more parameters than Conv2D layers that output the same number of channels, or even more channels...

wooden sail Jan 15, 2023, 11:14 AM

#

aha, in pytorch there's an extra "groups" parameter with which you control this

#

it could also be that tf makes different kernels per channel and adds up those results

#

i do know it just adds up the results in the end. idk about the per channel behavior

hasty mountain Jan 15, 2023, 11:15 AM

#

Uh... I couldn't understand a single word in that "groups" parameter

wooden sail Jan 15, 2023, 11:15 AM

#

you should be able to test it out

#

well, all of the math there is pretty clear tbh. the pytorch documentation is a lot better

#

but as you noticed, you need to understand the math to read it 😛

#

tf makes decisions for you to make it easier to use

hasty mountain Jan 15, 2023, 11:16 AM

#

pithink

#

Tensorflow seems way more complicated, though

#

all inputs are convolved to all outputs.

#

What does it mean?

#

Each convolution through the input = each output, directly?

wooden sail Jan 15, 2023, 11:19 AM

#

hasty mountain Jan 15, 2023, 11:19 AM

#

pithink

wooden sail Jan 15, 2023, 11:19 AM

#

each line there is a convolution

#

(the question still remains whether converging lines are the same kernel in tf)

hasty mountain Jan 15, 2023, 11:20 AM

#

Regular convolution...makes a convolution through the Red channels to determine the output Blue channels?

#

yert

wooden sail Jan 15, 2023, 11:20 AM

#

as i said

#

adds up all the outputs

hasty mountain Jan 15, 2023, 11:21 AM

#

I could understand this for YUV channels, but...RGB?

wooden sail Jan 15, 2023, 11:21 AM

#

the names there R' don't actually mean color, as i said to stelercus

hasty mountain Jan 15, 2023, 11:21 AM

#

Oh...

wooden sail Jan 15, 2023, 11:21 AM

#

after the convolution layer, the outputs are no longer colors

#

even if you specify you want 3 output channels

hasty mountain Jan 15, 2023, 11:21 AM

#

For my GAN, they are py_guido

#

At least that's what matplotlib shows me

wooden sail Jan 15, 2023, 11:22 AM

#

that's because you then go out of your way to put the channels into the RGB channels of an image

#

that's your doing, not the network's

#

from the network's perspective, none of the inputs nor outputs have any interpretation other than being in vector spaces or manifolds

hasty mountain Jan 15, 2023, 11:25 AM

#

Still confused on why my Conv2d with 400 input channels with 3 output channels has so much more parameters than the ones with 3 inputs and 400 outputs pithink

wooden sail Jan 15, 2023, 11:26 AM

#

what sizes do they have

tranquil jasper Jan 15, 2023, 11:39 AM

#

any good introduction to numpy?

echo citrus Jan 15, 2023, 12:12 PM

#

fast cairn Jan 15, 2023, 12:14 PM

#

most important would be the one with the highest GINI, no?

echo citrus Jan 15, 2023, 12:15 PM

#

fast cairn most important would be the one with the highest GINI, no?

ohh so is it most impt: meanfun , least impt: sp.ent

#

something like that

fast cairn Jan 15, 2023, 12:16 PM

#

idk, i haven't worked with Gini index before, thats just from what i heard

dire verge Jan 15, 2023, 1:45 PM

#

Hello, I am trying to train myself to perform text clustering with gensim.Word2Vec and KMeans.

The vectorization and creation of clusters are both happening as expected. What I don't understand is that my dataframe has 900 rows, but my model generates 1700 vectors.

I thought it would generate a vector for each row, and as a result it is impossible to add them to my dataframe since the length is not the same.

Some people tell me not to worry and create the clusters from these vectors even if the length is different, but I would like to understand and even better have everything "organized" in my dataframe.

Any ideas ?

hasty mountain Jan 15, 2023, 1:51 PM

#

wooden sail what sizes do they have


        self.transconv1 = nn.ConvTranspose2d(100, 1000, 4, 1, 0, bias=False)
        self.transconv2 = nn.ConvTranspose2d(1000, 800, 4, 2, 1, bias=False)
        self.transconv3 = nn.ConvTranspose2d(800, 600, 4, 2, 1, bias=False)
        self.transconv4 = nn.ConvTranspose2d(600, 400, 4, 2, 1, bias=False)
        self.transconv5 = nn.ConvTranspose2d(400, 200, 4, 2, 1, bias=False)
        self.transconv6 = nn.ConvTranspose2d(200, 3, 3, 1, 1, bias=False)

Just a DCGAN with many, many feature maps

#

According to torchsummary:

----------------------------------------------------------------
        Layer (type)               Output Shape         Param #
================================================================
   ConvTranspose2d-1           [-1, 1000, 4, 4]       1,600,000
   ConvTranspose2d-2            [-1, 800, 8, 8]      12,800,000
             PReLU-3            [-1, 800, 8, 8]               1
   ConvTranspose2d-4          [-1, 600, 16, 16]       7,680,000
             PReLU-5          [-1, 600, 16, 16]               1
   ConvTranspose2d-6          [-1, 400, 32, 32]       3,840,000
             PReLU-7          [-1, 400, 32, 32]               1
   ConvTranspose2d-8          [-1, 200, 64, 64]       1,280,000
             PReLU-9          [-1, 200, 64, 64]               1
  ConvTranspose2d-10            [-1, 3, 64, 64]           5,400
             Tanh-11            [-1, 3, 64, 64]               0
================================================================
Total params: 27,205,404
Trainable params: 27,205,404

wooden sail Jan 15, 2023, 1:54 PM

#

idk what convtranspose is

hasty mountain Jan 15, 2023, 1:54 PM

#

Transposed Convolution

wooden sail Jan 15, 2023, 1:54 PM

#

what's that supposed to mean

hasty mountain Jan 15, 2023, 1:54 PM

#

pithink

#

It's a Deconvolution, but it isn't

#

It's complicated

wooden sail Jan 15, 2023, 1:55 PM

#

it's certainly not a deconvolution

hasty mountain Jan 15, 2023, 1:55 PM

#

https://pytorch.org/docs/stable/generated/torch.nn.ConvTranspose2d.html#torch.nn.ConvTranspose2d

This module can be seen as the gradient of Conv2d with respect to its input. It is also known as a fractionally-strided convolution or a deconvolution (although it is not an actual deconvolution operation as it does not compute a true inverse of convolution).

#

In a nutshell: a Convolution, but bigger

#

py_guido

#

However, while Convs with 2 strides usually make you get feature maps with half the height and width of your input, the Transposed Convolution with 2 strides make you double the height and width

wooden sail Jan 15, 2023, 2:05 PM

#

idk, after reading about it, i'd just call it upsampling, but ok

#

and which layers are you troubled by

hasty mountain Jan 15, 2023, 2:09 PM

#

I'm troubled by the fact that my first layer, which converts a random noise with sizes (Batch, 100, 1, 1) to (Batch, 1000, 4, 4) Has more than 10x more parameters than the second layer, which simply takes this (Batch, 1000, 4, 4) and converts to (Batch, 800, 8, 8)

#

I mean, ok, there's the upsampling, but...wow... 10x more parameters?

#

If I were to use a ConvBlock where the number of channels is all 1000, I could probably pile like 8 Conv layers until I reach 12 million parameters

wooden sail Jan 15, 2023, 2:14 PM

#

so i would just recall the image i posted earlier

wooden sail Jan 15, 2023, 2:14 PM

#

wooden sail

this one

#

and we're working with the case on the left, with groups = 1

#

that means every single input channel is involved in generating an output channel, and one convolution kernel is applied in doing each of these operations

#

that means in the first layer you will have 100*1000 filters of size 16

#

in the next one, you will have 800*1000 filters of size 16

hasty mountain Jan 15, 2023, 2:18 PM

#

pithink

#

input_channels * output_channels * (kernelM*kernelN) pithink

wooden sail Jan 15, 2023, 2:19 PM

#

yep

hasty mountain Jan 15, 2023, 2:20 PM

#

So... I just have to use kernels with size 1x1 py_guido

wooden sail Jan 15, 2023, 2:20 PM

#

so, scaling the input lmao

#

that's gotta be the most expensive scalar multiplication

hasty mountain Jan 15, 2023, 2:22 PM

#

Meh... I can't simply upsample because GAN things.
But I found Conv 1x1 a bit lame...I mean...you multiply element-wise a single number by your input array...wouldn't it be better if you just defined an array of weights and then multiplied element-wise your input?

wooden sail Jan 15, 2023, 2:23 PM

#

hasty mountain Meh... I can't simply upsample because GAN things. But I found Conv 1x1 a bit la...

as i said, a conv 1x1 is the same as multiplying the whole input by a scalar lol

hasty mountain Jan 15, 2023, 2:23 PM

#

There are some models that use that, though, like Self-Attention GAN

wooden sail Jan 15, 2023, 2:24 PM

#

probably because the API has no other built-in function for scalar multiplication

#

but as you say, that's what attention does

#

take some inputs, scale each of them, and add them up

#

that's the same as pytorch and tf decide to do by default when you do convolutions

hasty mountain Jan 15, 2023, 2:25 PM

#

wooden sail that's the same as pytorch and tf decide to do by default when you do convolutio...

Fun fact: the model was made in tensorflow

wooden sail Jan 15, 2023, 2:25 PM

#

this is the type of thing where using built-in layers obscures what you're doing. it's no longer a convolution at that point. you'd write clearer code if you wrote the math more directly

#

this whole "let's make AI accessible to everyone" has its downsides

hasty mountain Jan 15, 2023, 2:26 PM

#

wooden sail this is the type of thing where using built-in layers obscures what you're doing...

They might have made some fancy things that just complicated things...seems like most research papers do this.

#

Unless the chinese guys...they're op.

wooden sail Jan 15, 2023, 2:26 PM

#

it's not "fancy things", it's "exploiting the API to do what we actually want it to do"

hasty mountain Jan 15, 2023, 2:27 PM

#

Doubt

def _get_d_real_loss_KL(discriminator_on_data_logits):
  loss = tf.nn.softplus(-discriminator_on_data_logits)
  return tf.reduce_mean(loss)

wooden sail Jan 15, 2023, 2:27 PM

#

this is where i'd just say using jax is better. but that's neither here nor there. regarding the issues and questions you usually ask, i'd suggest to review the maths a bit more

wooden sail Jan 15, 2023, 2:29 PM

#

hasty mountain *Doubt* ```py def _get_d_real_loss_KL(discriminator_on_data_logits): loss = t...

hmm?

hasty mountain Jan 15, 2023, 2:29 PM

#

Just one of the functions they made for SAGAN

#

Which is...simply applying the loss from tensorflow

wooden sail Jan 15, 2023, 2:29 PM

#

that's like sum(exp(-inputs) + 1)

#

i would much rather read it that way

#

but they were limited by the api

hasty mountain Jan 15, 2023, 2:31 PM

#

They're literally using the API...

wooden sail Jan 15, 2023, 2:32 PM

#

that's exactly what i'm saying 👁️

hasty mountain Jan 15, 2023, 2:32 PM

#

yert

wooden sail Jan 15, 2023, 2:32 PM

#

it looks dumb because they're using the API

#

it is much easier and simpler than it looks there

#

but they have to write it in a specific way that looks dumb (e.g. 1x1 2D convolution kernel) because the API has a fixed set of tools

hasty mountain Jan 15, 2023, 2:34 PM

#

But... tensorflow is such a low-level API, they surely would be able to create a customized operation

wooden sail Jan 15, 2023, 2:34 PM

#

they COULD. but they aren't doing it

hasty mountain Jan 15, 2023, 2:35 PM

#

py_guido

#

That's why I don't read the official codes on GitHub anymore

#

Now I'm slightly curious, though...what if, instead of a Conv2D, I simply created an array with graphs with the same height and width as my input and simply multiplied element-wise this array of weights by each channel to get an output with the same number of channels?
Would it be at least as efficient as a Conv2D that returns an output with the same sizes as the input? Maybe even in a less expensive, faster, way?

wooden sail Jan 15, 2023, 2:50 PM

#

yes, but you lose the effect of having neighboring pixels affect each other, or being able to detect something regardless of where it is located in an image

#

which is the whole point of using convolutions

#

i.e. the network no longer enforces a "spatial invariance prior"

hasty mountain Jan 15, 2023, 2:52 PM

#

Not even through backpropagation?

wooden sail Jan 15, 2023, 2:52 PM

#

indeed

hasty mountain Jan 15, 2023, 2:54 PM

#

grumpchib

#

Sad. I'm getting annoyed by how much time it takes to make a conv that returns many channels

#

At least this appear to be a pattern in GANs...starting with a Conv that outputs like, 1024 channels, then the next layers filter those channels until you get 3

#

SRGAN uses a ResNet architecture, but I'm afraid fixed channels, height and width aren't effective for generating images, just for SuperResolution

#

Or I did something wrong, which is quite likely

river sapphire Jan 15, 2023, 2:58 PM

#

In the Asynchronous variant for n-step dqn I notice that they use theta instead of theta' when Initializing the network gradients. Does this mean they use one shared network gradient vector? Or is it separate for each thread?

#

hasty mountain Jan 15, 2023, 3:02 PM

#

river sapphire In the Asynchronous variant for n-step dqn I notice that they use theta instead ...

Which part are you referring to?

river sapphire Jan 15, 2023, 3:02 PM

#

hasty mountain Which part are you referring to?

Initialize network gradients dtheta <-- 0

hasty mountain Jan 15, 2023, 3:03 PM

#

I suppose that, if theta' is a target parameter, then it's the parameter the network must achieve

#

So there's no gradients to be initialized for theta'

river sapphire Jan 15, 2023, 3:03 PM

#

theta' is the thread-specific parameters

hasty mountain Jan 15, 2023, 3:04 PM

#

Oh, ok, there's theta' and theta-

river sapphire Jan 15, 2023, 3:04 PM

#

the way I understand it the threads don't necessarily update the main network at the same time hence why it's called asynchronous

hasty mountain Jan 15, 2023, 3:05 PM

#

Correct

river sapphire Jan 15, 2023, 3:05 PM

#

it just says "Perform asynchronous update of theta using dtheta" but that's really vague imo

#

I have no idea what that means

#

so do we like sum up the gradients then average them?

hasty mountain Jan 15, 2023, 3:06 PM

#

From what I'm seeing, theta' is more or less a variable to synchronize the networks, so theta' gradients is actually theta gradients

river sapphire Jan 15, 2023, 3:07 PM

#

well the way it works is you have multiple threads running at the same time exploring different parts of the state space right

hasty mountain Jan 15, 2023, 3:07 PM

#

Theta is the parameters of a specific thread, those parameters will be optimized.
After finishing the episode/state/idk, those parameters are synchronized to be theta'.
I suppose that, being an Asynchronous actor-learner, the better thread will provide the parameters for all threads

river sapphire Jan 15, 2023, 3:07 PM

#

my understanding was that different threads could encounter an episode termination at different times

hasty mountain Jan 15, 2023, 3:08 PM

#

river sapphire my understanding was that different threads could encounter an episode terminati...

That's correct. And this is why their parameters must be synchronized

river sapphire Jan 15, 2023, 3:08 PM

#

i'm confused on the Asynchronous update part though

hasty mountain Jan 15, 2023, 3:08 PM

#

If your thread A finishes your episode after 10 minutes, and thread B finishes only after 15 minutes, then thread A will be kept "dormant" until thread B finishes the episode

river sapphire Jan 15, 2023, 3:09 PM

#

so it's accumulating gradients... does it average them?

river sapphire Jan 15, 2023, 3:09 PM

#

hasty mountain If your thread A finishes your episode after 10 minutes, and thread B finishes o...

yeah

hasty mountain Jan 15, 2023, 3:09 PM

#

river sapphire so it's accumulating gradients... does it average them?

No, it sums all gradients and applies them during the optimization step

#

But this isn't exclusive to asynchronous algorithms, most RL algorithms do that

river sapphire Jan 15, 2023, 3:10 PM

#

that's interesting so then after summing all the gradients does it multiply it by the learning rate after?

hasty mountain Jan 15, 2023, 3:10 PM

#

Yes. In Stochastic Gradient Descent, yes.
If you use Adam, then it does its mathmagics there when applying gradients

river sapphire Jan 15, 2023, 3:11 PM

#

well the optimizer part is even more confusing because when I looked at how they implemented SGD with momentum into the algorithm

#

they said you keep separate gradient and momentum vectors for each thread

#

let me just find the section

hasty mountain Jan 15, 2023, 3:12 PM

#

The optimization occurs separately, probably

river sapphire Jan 15, 2023, 3:12 PM

#

hasty mountain Jan 15, 2023, 3:12 PM

#

The optimization occurs separately, indeed

river sapphire Jan 15, 2023, 3:13 PM

#

that is confusing

hasty mountain Jan 15, 2023, 3:13 PM

#

The optimization doesn't have to occur after you finish your episode

#

It can happen after certain number of steps, after your agent took certain number of actions

river sapphire Jan 15, 2023, 3:14 PM

#

so why does each thread have a separate momentum and gradient vector

hasty mountain Jan 15, 2023, 3:15 PM

#

Because you're actually training different networks at once to get the best parameters faster

river sapphire Jan 15, 2023, 3:16 PM

#

oh I see

#

I sort of understand it

#

so wait let's say for simplicity we perform an update on the main network after the episode terminates

#

so thread A performs an update on the main network after 10 minutes right

hasty mountain Jan 15, 2023, 3:16 PM

#

You're testing different networks in different configurations(different momentum, different learning rate).
After you had enough training(aka: the episode has ended), you see which one performed better, than applies that one parameters to all of them. Then restart training

river sapphire Jan 15, 2023, 3:17 PM

#

hasty mountain You're testing different networks in different configurations(different momentum...

oh we're using different configurations for each thread?

hasty mountain Jan 15, 2023, 3:17 PM

#

river sapphire so thread A performs an update on the main network after 10 minutes right

No, thread A finishes episode after 10 minutes. If thread B is still running its episodes, thread A will have to wait.

hasty mountain Jan 15, 2023, 3:18 PM

#

river sapphire oh we're using different configurations for each thread?

Yes, the image you sent say that they try different momentum vectors

#

And there's different gradients

river sapphire Jan 15, 2023, 3:18 PM

#

hasty mountain No, thread A finishes episode after 10 minutes. If thread B is still running its...

why does thread A have to wait though?

hasty mountain Jan 15, 2023, 3:18 PM

#

Because you don't know if thread B will actually perform better than A

#

If you update thread B based on A, you might sabotage thread B and lose performance

#

Thread B might even get confused, as it might have reached a state that thread A didn't manage to reach

river sapphire Jan 15, 2023, 3:19 PM

#

but thread A is updating the global network not thread B?

#

all threads are updating the global network

hasty mountain Jan 15, 2023, 3:20 PM

#

If thread A finished its episodes, it's not updating anymore

#

Think like this: you're playing a match of...idk... Valorant.
This match can have a duration of 10 minutes, 15 minutes or even 1 hour.
Your friend is playing a match of Valorant, too, but you both want to play together.
So, if her match ends after 10 minutes, but yours is still running, he'll have to wait.

#

If thread A finishes its episodes after 10 minutes, it finished its match, while B is still playing its match, so A have to wait until B finishes so they can synchronize(update parameters)

river sapphire Jan 15, 2023, 3:23 PM

#

but looking at the psuedocode it seems like after performing the asynchronous update it clears the gradients, synchronizes the thread-specific parameters with the global network parameters then sets t_start to t and gets state s_t

#

it doesn't seem to wait for the other thread

#

there is thread-locking though I think although I don't really understand it

#

it says it performs an asynchronous update meaning 2 updates cannot happen at the same time right?

#

from what I understand thread-locking might be used to prevent 2 threads updating the global network at the same time

hasty mountain Jan 15, 2023, 3:28 PM

#

Hm... "Asynchronous update" might refer to thread being updated

#

It refers to theta, not theta'

river sapphire Jan 15, 2023, 3:28 PM

#

right and theta refers to the global shared parameter vector

hasty mountain Jan 15, 2023, 3:29 PM

#

Oh, sure

#

It might be possible, then, that they actually try updates in real time

#

But it doesn't seem to make sense to me, though.
Asynchronous actors usually are for getting the best parameters, but you can't know which one is the best if one of them is still running and getting optimized

river sapphire Jan 15, 2023, 3:30 PM

#

I was confused on the thread-locking part because the only place where it sorta made sense to me was in the update part to prevent 2 threads updating the global shared parameter vector at the same time

river sapphire Jan 15, 2023, 3:31 PM

#

hasty mountain But it doesn't seem to make sense to me, though. Asynchronous actors usually are...

I think the point of asynchronous actors were for speeding up training not getting the best parameters

#

wait

hasty mountain Jan 15, 2023, 3:31 PM

#

It might be just because of that...avoid 2 updates at once.
And if the threads indeed update the main network in real time, then there might be a way to filter which one is better

river sapphire Jan 15, 2023, 3:31 PM

#

unless you're referring to the same thing?

#

wait you're saying parameters and not hyperparameters right

#

ok i'm just getting confused

hasty mountain Jan 15, 2023, 3:31 PM

#

river sapphire unless you're referring to the same thing?

It's more or less the same thing

hasty mountain Jan 15, 2023, 3:31 PM

#

river sapphire wait you're saying parameters and not hyperparameters right

Parameters are the weights

#

(I guess...Q-Learning models tend to be crazy)

river sapphire Jan 15, 2023, 3:32 PM

#

lol

#

but then it says in the sgd with momentum implementation it uses no thread-locking

#

which is another thing i'm confused about

hasty mountain Jan 15, 2023, 3:32 PM

#

pithink

#

There's an SGD optimizer for each thread, and there's an SGD optimizer for the global net

#

The global one has no locks, as it seems

river sapphire Jan 15, 2023, 3:34 PM

#

that is confusing

#

but if each thread has a separate SGD update why does it use theta?

hasty mountain Jan 15, 2023, 3:35 PM

#

If I understood it, then it's more or less

river sapphire Jan 15, 2023, 3:35 PM

#

see it defined theta as "the parameter vector that is shared across all threads"

hasty mountain Jan 15, 2023, 3:36 PM

#

The SGD for each thread is to apply the gradients in that thread and optimize the thread.
The global SGD is simply to apply the parameters you got from the threads.

hasty mountain Jan 15, 2023, 3:36 PM

#

river sapphire see it defined theta as "the parameter vector that is shared across all threads"

Shared across all threads upon initialization and upon synchronizing, not every moment

river sapphire Jan 15, 2023, 3:36 PM

#

hasty mountain Shared across all threads upon initialization and upon synchronizing, not every ...

right so this is the same thing as the global network right?

hasty mountain Jan 15, 2023, 3:37 PM

#

Yes

#

It seems you apply the parameters to the global network, and, upon synchronizing threads, you apply the global parameters to every thread

river sapphire Jan 15, 2023, 3:37 PM

#

so each thread is accumulating its own gradients then independently applies the SGD momentum update

hasty mountain Jan 15, 2023, 3:37 PM

#

Yep

river sapphire Jan 15, 2023, 3:38 PM

#

I see where I'm getting confused now

hasty mountain Jan 15, 2023, 3:38 PM

#

Each thread is optimized independently, they're autonomous. The synchronization is the only intervention they suffer

river sapphire Jan 15, 2023, 3:39 PM

#

it is possible for each thread to be using a different "version" of the global network correct?

hasty mountain Jan 15, 2023, 3:39 PM

#

No. Synchronization doesn't allow that

#

Unless the synchronization occurs while a thread didn't finish its episodes, which I find unlikely

river sapphire Jan 15, 2023, 3:41 PM

#

it says to repeat until terminal state or t - t_start == t_max

#

#

this seems to imply that it could be possible

hasty mountain Jan 15, 2023, 3:42 PM

#

That is for the thread

#

I guess the global thing would be the part if T mod I target == 0 then

river sapphire Jan 15, 2023, 3:43 PM

#

no that's for updating the global target network

hasty mountain Jan 15, 2023, 3:43 PM

#

This part you remarked is simply a normal step

#

Select action, check consequences, get reward, get grads, update

river sapphire Jan 15, 2023, 3:45 PM

#

so do they synchronize at the same time? that doesn't sound like it

hasty mountain Jan 15, 2023, 3:45 PM

#

This happens within a single thread

#

Each thread will perform the loop Get state --> get action ---> get new state ---> get reward ---> get grads ---> update if condition

river sapphire Jan 15, 2023, 3:46 PM

#

right

#

I am confused now

#

I'm pretty sure synchronization can happen while another thread is still running an episode

hasty mountain Jan 15, 2023, 3:48 PM

#

Well, it normally doesn't. But check the paper, perhaps they say something about that.

#

Synchronization while a thread is running an episode can be troublesome. You can't know if that thread will perform better than the others and that thread will change its parameters so abruptly that it can get unstable

#

And Reinforcement Learning is already remarked by unstability

river sapphire Jan 15, 2023, 3:50 PM

#

this is strange

#

so in this algorithm for one-step dqn it accumulates the gradients with respect to the global shared parameters

#

but in the one for n-step it accumulates gradients w.r.t the thread-specific parameters

hasty mountain Jan 15, 2023, 3:52 PM

#

The thread specific parameters are used as a basis to update the global shared parameters

#

As I said, probably the best thread gives origin to the global parameters

river sapphire Jan 15, 2023, 3:55 PM

#

the paper doesn't really say anything about the synchronization

#

https://arxiv.org/pdf/1602.01783.pdf

hasty mountain Jan 15, 2023, 3:56 PM

#

It's in the pseudocode

hasty mountain Jan 15, 2023, 3:59 PM

#

river sapphire

d0 < ---- d0 + ?(R-Q(s,a;0')²/?0''

#

I suppose that ?(R-Q(s,a;0')²/?0' would be the d0'

#

But I'm not so sure, now

#

As I said, Q-Learning algorithms tend to be crazy

#

That's why I prefer policy gradients and actor-critic

river sapphire Jan 15, 2023, 4:02 PM

#

it's adding gradients to d0 which are calculated w.r.t the thread-specific parameters

#

but then at the top of the repeat until T > T_max loop it has clear gradients dtheta <-- 0

#

so I'm assuming that if the gradients were cleared while another thread was accumulating gradients that wouldn't be good

#

I am confused

#

so then this goes back to my original question is d_theta shared or for each thread

#

the section where they implement SGD with momentum in an asynchronous setting implies that it was a shared vector

hasty mountain Jan 15, 2023, 4:08 PM

#

d_theta is probably global

#

if theta is global, d_theta should be global

river sapphire Jan 15, 2023, 4:08 PM

#

if it's global then there is no way the gradients should be cleared while another thread is accumulating gradients

#

then is it like you said it halts the thread until the other thread is done with the episode?

#

but that didn't make sense to me because it didn't mention that in the psuedocode

tranquil jasper Jan 15, 2023, 4:10 PM

#

any good introduction to numpy?

tidal bough Jan 15, 2023, 4:58 PM

#

tranquil jasper any good introduction to numpy?

There's this introduction to numpy and the rest of the data science stack: https://colab.research.google.com/github/yandexdataschool/Practical_RL/blob/coursera/week1_intro/primer/recap_ml.ipynb

#

(it's part of this course, but the rest of it is on RL: https://github.com/yandexdataschool/Practical_RL)

tranquil jasper Jan 15, 2023, 4:59 PM

#

tidal bough There's this introduction to numpy and the rest of the data science stack: <http...

thank you

tiny hedge Jan 15, 2023, 6:21 PM

#

I can't install openai I keep getting something wrong

serene scaffold Jan 15, 2023, 6:26 PM

#

tiny hedge I can't install openai I keep getting something wrong

if you need help, be sure to say what you did, and how you know that it's wrong.

warped berry Jan 15, 2023, 7:20 PM

#

Hi everyone, I'm on 35% of a DataScience career path, looking to find ppl to share ideas and some code reviews too, greetings!

drifting wagon Jan 15, 2023, 8:38 PM

#

so guys, for few days im in middle of trying to create unsupervised training algorithm
rn im stuck on creating network itself. I wanted to have rather simple inputs and outpus but it had issue with shape of tensor, so i checked it and it outputed "torch.Size([3, 81269])"

but this code is a little overkill, right? I belive inputs shouldnt be that big

    def __init__(self):
        super(Transformer, self).__init__()
        self.fc1 = nn.Linear(81269, 4).float()
        self.fc2 = nn.Linear(4, 3).float()
        self.fc3 = nn.Linear(3, 1).float()```

young granite Jan 15, 2023, 9:18 PM

#

warped berry Hi everyone, I'm on 35% of a DataScience career path, looking to find ppl to sha...

what u mean by 35% 🗿, how can u be so precise

warped berry Jan 15, 2023, 9:59 PM

#

young granite what u mean by 35% 🗿, how can u be so precise

for a Codecademy certification I mean

hasty mountain Jan 16, 2023, 12:44 AM

#

drifting wagon so guys, for few days im in middle of trying to create unsupervised training alg...

Which tensor has shape 3, 81269?

#

The inputs?

drifting wagon Jan 16, 2023, 12:45 AM

#

numpy array i created with bunch of ebooks

hasty mountain Jan 16, 2023, 12:45 AM

#

Also, you don't really need to use .float() in the linear layers. Pytorch initializes all floats in float32 by default

hasty mountain Jan 16, 2023, 12:45 AM

#

drifting wagon numpy array i created with bunch of ebooks

If it's a numpy array, then you need to convert it to a torch tensor with torch.from_numpy(). Then things should run fine.

#

(Though I suppose that, considering how much time passed since you posted that, you might've figured that out already)

#

I don't know if it's an overkill, actually. You probably would have to check through trial and error

drifting wagon Jan 16, 2023, 12:57 AM

#

you are right, i dont have this problem anymore but well
i have bunch of others

heavy basin Jan 16, 2023, 1:00 AM

#

can someone help me with pandas please

serene scaffold Jan 16, 2023, 1:05 AM

#

heavy basin can someone help me with pandas please

please always ask your actual question right away, without asking for a commitment, so that someone who knows the answer can start answering right away, with no follow-up questions.

#

to show the dataframe, do print(df.head().to_dict('list'))

heavy basin Jan 16, 2023, 1:12 AM

#

i have a dataframe df (sorted by percentage) that looks like this

#

i also have gender_grp = df.groupby('gender')

#

#

i tried sorting the thing by sorting the original dataframe and then doing sort=false

#

but that didnt fix the issue where B+ is under B

#

i think a possible solution would to be sort with percentages, but not show in the table. but i'm unsure how to do that

serene scaffold Jan 16, 2023, 1:18 AM

#

heavy basin

so you just want this, but where the letter index is sorted by grade order?

heavy basin Jan 16, 2023, 1:18 AM

#

yeah

serene scaffold Jan 16, 2023, 1:19 AM

#

I might be able to answer in a bit.

heavy basin Jan 16, 2023, 1:19 AM

#

👍

serene scaffold Jan 16, 2023, 1:19 AM

#

though I need to know if the expression in that screenshot is a DataFrame or a Series. if you had given the code as text, I could tell you how to figure that you.

drifting wagon Jan 16, 2023, 1:20 AM

#

I think i am trying to chew more than i am able to.
so this is my script, very crude attempt at unsupervies learning```import numpy as np
import torch
import torch.nn as nn
from numpy.core._simd import targets
from torch import Tensor
from torch.nn import MSELoss

load the input data

input_data = np.load("train_data.npy")

convert the input data to a torch tensor

inputs: Tensor = torch.from_numpy(input_data)

define the model

class Transformer(nn.Module):
def init(self):
super(Transformer, self).init()
self.fc1 = nn.Linear(81269, 20).float()
self.fc2 = nn.Linear(20, 5).float()
self.fc3 = nn.Linear(5, 1).float()

def forward(self, x):
    x = self.fc1(x)
    x = torch.relu(x)
    x = self.fc2(x)
    x = torch.relu(x)
    x = self.fc3(x)
    return x

model = Transformer()

define the loss function

criterion: MSELoss = nn.MSELoss()

define the optimizer

optimizer = torch.optim.Adam(model.parameters(), lr=0.01)

train the model

for epoch in range(100):
# Forward pass
outputs = model(inputs)
loss = criterion(outputs, targets)

# Backward and optimize
optimizer.zero_grad()
loss.backward()
optimizer.step()

if (epoch + 1) % 10 == 0:
    print('Epoch [{}/{}], Loss: {:.4f}'.format(epoch + 1, 100, loss.item()))
    torch.save(model.state_dict(), 'model_checkpoint.pth')

heavy basin Jan 16, 2023, 1:20 AM

#

gender  letter
boy     A         0.095238
        B         0.190476
        B+        0.142857
        B-        0.142857
        C         0.095238
        C+        0.095238
        C-        0.190476
        D/I       0.047619
girl    A         0.090909
        A-        0.272727
        B         0.181818
        B-        0.181818
        C         0.181818
        C+        0.090909```

serene scaffold Jan 16, 2023, 1:20 AM

#

do type(gender_grp.value_counts(['letter'], normalize = True, sort = False))

heavy basin Jan 16, 2023, 1:20 AM

#

series

hasty mountain Jan 16, 2023, 1:21 AM

#

Anyone has a tip on how to upgrade my DCGAN without tearing everything apart?

#

grumpchib

drifting wagon Jan 16, 2023, 1:21 AM

#

drifting wagon I think i am trying to chew more than i am able to. so this is my script, very ...

and i get this errors "```
Traceback (most recent call last):
File "C:\Users\Reny\PycharmProjects\crossoverwriter\trener.py", line 43, in <module>
outputs = model(inputs)
File "C:\Users\Reny\PycharmProjects\crossoverwriter\venv\lib\site-packages\torch\nn\modules\module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "C:\Users\Reny\PycharmProjects\crossoverwriter\trener.py", line 24, in forward
x = self.fc1(x)
File "C:\Users\Reny\PycharmProjects\crossoverwriter\venv\lib\site-packages\torch\nn\modules\module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "C:\Users\Reny\PycharmProjects\crossoverwriter\venv\lib\site-packages\torch\nn\modules\linear.py", line 114, in forward
return F.linear(input, self.weight, self.bias)
RuntimeError: mat1 and mat2 must have the same dtype

i belive line 43 is about input trying to get 2d tensor and reciving 3d tensor, but when i tried to convert my input data into 2d on it was saying "tulpe out of range" so it is 2d

hasty mountain Jan 16, 2023, 1:24 AM

#

The error says it's a dtype problem, so maybe your layer is dealing with float32 type and your input is in float64

tranquil jasper Jan 16, 2023, 1:24 AM

#

anyone familiar with polars can explain to me what Expressions means?

serene scaffold Jan 16, 2023, 1:24 AM

#

@heavy basin as soon as df is created, before you make gender_grp, do this:

df['grades'] = df['grades'].astype(pd.CategoricalDtype('A+ A A- B+ B B- C+ C C- D+ D/I'.split(), ordered=True))

hasty mountain Jan 16, 2023, 1:24 AM

#

Pytorch uses by default float32, but I guess numpy uses float64 by default

#

@drifting wagon

#

So converting from numpy directly to pytorch can cause those errors

drifting wagon Jan 16, 2023, 1:25 AM

#

i see

#

thanks, i will check that out

hasty mountain Jan 16, 2023, 1:25 AM

#

At least I know that when I convert a list of indices(int64) from numpy to Pytorch I get those errors

heavy basin Jan 16, 2023, 1:28 AM

#

serene scaffold <@532023333596823553> as soon as `df` is created, before you make `gender_grp`, ...

holy sh*t you are my savior

#

thank you 🙏

#

ive spent so many hours today trying to like add an invisible character with a unicide between + and - 😭

#

how come u needed to know if it was a series or not

tranquil jasper Jan 16, 2023, 1:29 AM

#

what is a standard deviation?

serene scaffold Jan 16, 2023, 1:30 AM

#

.wa standard deviation

#

.wiki standard deviation

strange elbowBOT Jan 16, 2023, 1:30 AM

#

Wikipedia Search Results

Standard deviation
statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the

Geometric standard deviation
preferred to the more usual standard deviation. Note that unlike the usual arithmetic standard deviation, the geometric standard deviation is a multiplicative

lapis sequoia Jan 16, 2023, 1:31 AM

#

whats a bulldog request

hasty mountain Jan 16, 2023, 1:31 AM

#

There's a geometric version? yert

serene scaffold Jan 16, 2023, 1:31 AM

#

all these "what is x" questions are really better answered by Google.

drifting wagon Jan 16, 2023, 1:32 AM

#

hasty mountain At least I know that when I convert a list of indices(int64) from numpy to Pytor...

i managed to train it for the 1st time, thanks

#

i had some code for supervised learning which caused problems

heavy basin Jan 16, 2023, 1:33 AM

#

heavy basin how come u needed to know if it was a series or not

@serene scaffold do you have to do something different if it's another datatype?

serene scaffold Jan 16, 2023, 1:37 AM

#

heavy basin <@253696366952316929> do you have to do something different if it's another data...

gender_grp.value_counts(['letter'], normalize = True, sort = False) looks kind of like a dataframe with three columns. but it's actually a series with two levels of indexing. I'd have to experiment to figure out how one would change the dtype of the letter index level

#

but you should do things like this as early in the process as possible. if the letters are categories, and they have an order, you should make them that way when the dataframe starts.

hasty mountain Jan 16, 2023, 1:38 AM

#

drifting wagon i had some code for supervised learning which caused problems

In the fine-tuning?

#

What happened?

drifting wagon Jan 16, 2023, 1:39 AM

#

i use chatgpt as my deep learning mentor and he well, sometimes he spews bullshit straight into my face.

serene scaffold Jan 16, 2023, 1:39 AM

#

drifting wagon i use chatgpt as my deep learning mentor and he well, sometimes he spews bullshi...

well, don't do that.

hasty mountain Jan 16, 2023, 1:39 AM

#

Oh, then I guess you'll have to try filtering it

#

I don't know how to use teacher-student learning...never tried it because of the crazy things I'd have to do...

serene scaffold Jan 16, 2023, 1:40 AM

#

there's no point using chatgpt to get factual information if you have to fact-check everything that it says

drifting wagon Jan 16, 2023, 1:42 AM

#

serene scaffold well, don't do that.

he is not that bad, i m in middle of watching "Learn PyTorch for deep learning in a day." by daniel Bourke, but in all honestly a lot of lessons i watched from him gpt summarized in few paragraphs without losing information

#

and with use of davinci-code and davinci 3 i can get actually a lot of troubleshooting right away.

sweet crypt Jan 16, 2023, 1:42 AM

#

everytime i run my program in remote server I get this error, F external/org_tensorflow/tensorflow/tsl/platform/default/env.cc:74] Check failed: ret == 0 (11 vs. 0)Thread host_executor creation via pthread_create() failed

#

but it works fine on my machine

#

I dont understand, I think it has to do with how multiprocessing process are created ig?

hasty mountain Jan 16, 2023, 1:44 AM

#

drifting wagon he is not that bad, i m in middle of watching "Learn PyTorch for deep learning i...

It's a bit sad how ChatGPT can make some things so much easier to understand than listening to a specialist...
I only managed to learn how to implement a Reinforcement Learning algorithm after going to ChatGPT

But then... I guess this is the curse of knowing too much.
My biochemistry teacher at the college can't do a rule of 3 calculation, for instance...despite it being useful to calculate some variables used in biochemistry

heavy basin Jan 16, 2023, 1:46 AM

#

i tried asking chatgpt my problem but it didnt work

#

but that's probably since i dont understand it enough to ask it good

hasty mountain Jan 16, 2023, 1:48 AM

#

heavy basin but that's probably since i dont understand it enough to ask it good

There's this problem. Once I asked ChatGPT about SRGAN structure and it gave me a vague answer...and it was wrong

wooden sail Jan 16, 2023, 1:48 AM

#

trying to learn new stuff from chatgpt is a pretty bad idea

#

its notion of correctness is good grammar and text appearing together with high likelihood in the wild. nothing to do with factual correctness. that means it's very difficult to catch its mistakes if you're not already familiar with a topic

#

its made to be convincing and nothing else

drifting wagon Jan 16, 2023, 1:49 AM

#

well, gpt is limited to knowledge from 2021 and if he doesnt know something he makes it up.
But there is plugin that makes him utilise web search results and he can summarise online articles with it

wooden sail Jan 16, 2023, 1:49 AM

#

getting correct answers is just coincidence

hasty mountain Jan 16, 2023, 1:52 AM

#

I guess if it makes web search it might do it only if you correct it and appoint the mistake.

iron basalt Jan 16, 2023, 1:52 AM

#

ChatGPT will give you elegant nonsense. If you tell it what is correct, it will just accept it as fact, which can be used, for example as shown in Wolfram's recent blog post: https://writings.stephenwolfram.com/2023/01/wolframalpha-as-the-way-to-bring-computational-knowledge-superpowers-to-chatgpt/

Stephen Wolfram Writings

Stephen Wolfram

Wolfram|Alpha as the Way to Bring Computational Knowledge Superpowe...

Accessing Wolfram|Alpha's computational knowledge with ChatGPT--an ideal combination of precise computation with human-like expression of ideas. Stephen Wolfram explains how.

heavy basin Jan 16, 2023, 1:52 AM

#

it confused the INEOS 1:59 marathon with the legit 2:01:39 berlin marathon

#

it's not that smart

hasty mountain Jan 16, 2023, 1:52 AM

#

heavy basin it confused the INEOS 1:59 marathon with the legit 2:01:39 berlin marathon

Oh, you don't have to go that far. Ask it about Diffusion Models and Stable Diffusion

wooden sail Jan 16, 2023, 1:53 AM

#

no no, it's pretty good. but not at what people are using it for

#

it's not meant to give you factually correct answers

#

that's not what it was trained for. that's the user's own fault

hasty mountain Jan 16, 2023, 1:54 AM

#

wooden sail it's not meant to give you factually correct answers

There might be a bunch of employees at OpenAI at this moment having to label millions of ChatGPT answers according to degree of "correctness" lemon_bald

#

Poor guys...labeling a dataset is so boring...and prone to tendonitis

drifting wagon Jan 16, 2023, 1:56 AM

#

heavy basin it confused the INEOS 1:59 marathon with the legit 2:01:39 berlin marathon

same question but with web results enabled

heavy basin Jan 16, 2023, 1:56 AM

#

oh ok

iron basalt Jan 16, 2023, 1:56 AM

#

It needs to be combined with something that does have the facts. But ChatGPT is not that on its own. You can think of ChatGPT as being the human-computer interface, but it's just that. With something else to provide reasoning (and world knowledge beyond just the text it had) it could be one of the best tools ever made.

wooden sail Jan 16, 2023, 1:57 AM

#

as a text model, what it values is the likelihood of certain chunks of text occurring together. whether those chunks really belong together is a different matter (that it does not care about). you literally depend on how biased the training data is.

#

though extending it with wolfram searches would be great

#

still, it has no notion of correctness, so 😛

iron basalt Jan 16, 2023, 1:58 AM

#

One of the reasons ChatGPT does better than previous models is injection of MORE bias into it. But it's just human preferences, there is not an actual world model.

#

(Wolfram Alpha can function as such a world model for math/science)

serene scaffold Jan 16, 2023, 1:59 AM

#

I want my hot takes to be part of the world model

wooden sail Jan 16, 2023, 2:00 AM

#

🥞 hot cakes

iron basalt Jan 16, 2023, 2:00 AM

#

If programming in English is what you want, nothing comes even close to Wolfram Alpha, but it's not open source...

serene scaffold Jan 16, 2023, 2:01 AM

#

iron basalt If programming in English is what you want, nothing comes even close to Wolfram ...

I thought that was COBOL

iron basalt Jan 16, 2023, 2:02 AM

#

serene scaffold I thought that was COBOL

COBOL is if you want to program by committee.

#

(made up of people that never did any programming)

serene scaffold Jan 16, 2023, 2:02 AM

#

iron basalt COBOL is if you want to program by committee.

my hot takes for the world model would be a set of propositions in the format "x is y, but bad".
"COBOL is executable English, but bad."

drifting wagon Jan 16, 2023, 2:03 AM

#

openai has english to python finetuned model

wooden sail Jan 16, 2023, 2:04 AM

#

that's not very difficult considering python is essentially pseudocode in english

#

(i'm kidding about it not being difficult)

serene scaffold Jan 16, 2023, 2:04 AM

#

drifting wagon openai has english to python finetuned model

"finetuned" has a specific meaning in machine learning, and it's not "very good at a narrow task".

hasty mountain Jan 16, 2023, 2:05 AM

#

serene scaffold "finetuned" has a specific meaning in machine learning, and it's not "very good ...

Don't say that... I'm trying to make an Unsupervised reward model fine-tuned with supervised learning grumpchib

#

But yes, it isn't working

iron basalt Jan 16, 2023, 2:05 AM

#

drifting wagon openai has english to python finetuned model

The reason Wolfram Alpha does so well is because it spits out Wolfram Language code, which is very high level (much more high level than Python) and much easier to generate. It also does not spit out text, but rather the abstract syntax tree directly, removing an entire layer of complexity and errors when compared to using a text model for programming.

drifting wagon Jan 16, 2023, 2:06 AM

#

damn, now i want to check it out

hasty mountain Jan 16, 2023, 2:06 AM

#

Even higher level than Python? Damn...

serene scaffold Jan 16, 2023, 2:06 AM

#

iron basalt The reason Wolfram Alpha does so well is because it spits out Wolfram Language c...

I have a coworker who calls Wolfram his secret weapon

iron basalt Jan 16, 2023, 2:07 AM

#

Wolfram Language is probably the most high level language ever made that actually works and is not just a gimmick.

#

It's also been around for a very long time.

#

(Older than Python IIRC)

hasty mountain Jan 16, 2023, 2:08 AM

#

Interesting...
Now I know what to do every time I have to calculate some integral...specially if there's geometric functions involved

iron basalt Jan 16, 2023, 2:08 AM

#

Its major downside is that it's not open source and costs a lot.

hasty mountain Jan 16, 2023, 2:09 AM

#

grumpchib

iron basalt Jan 16, 2023, 2:09 AM

#

Upside is that they give you a lot in return, cloud compute, giant database of algorithms for everything, datasets for everything updated in real time, etc.

#

This approach works, but there will always be some gaps, where open source could help out. Like how Python has a library for even obscure things that maybe only 1 other person is doing.

#

I would like to see some kind of open source Wolfram Alpha like thing though. I think it helps a lot with the discoverability problem, which is what ChatGPT seems to be mainly used for. That is, knowing which things are available and the general way things are done with X (this is a barrier to entry for any new framework or project being worked on / with). Autocomplete in IDEs are a weak form of this (what are the available functions/methods and a brief description of them in their comments).

echo orbit Jan 16, 2023, 2:29 AM

#

Hello, does anyone know what is the easiest way to return the coordinates of points such that those with the largest sizes are closest to the origin (pos (0,0,0)) in 3D please ?

I've been trying spherical coordinates but it keeps returning me a totally uniform distribution while totally neglecting the condition on size. Here is the code i'm using :

    for i,mass in enumerate(initial_masses):
        for j,pos_mass in enumerate(positions['Mass'][0]):
            if pos_mass == mass:
                positions['size'][:,j] = data_matrix_object['size'][:,i]
                # Generate random spherical coordinates
                theta = np.random.uniform(0, 2*np.pi)
                phi = np.random.uniform(0, np.pi)
                R = 1/positions['size'][0,j]

                #Generate positions
                positions['X'][:,j] = R*np.sin(theta)*np.cos(phi)
                positions['Y'][:,j] = R*np.sin(theta)*np.sin(phi)
                positions['Z'][:,j] = R*np.cos(phi)

i = 0 corresponds to the greatest size and the higher the value of i, the smaller the size.

With this, i get that plot (purple dots = greatest size, red dots = smallest size) :

#

I'm pretty sure i missed something but i can't tell what exactly

hasty mountain Jan 16, 2023, 2:46 AM

#

@wooden sail let's talk about mathmagic again.
When I pass an image through a Conv2D layer, that layer will create feature maps based on the convolution through the image, right?

People use to say that the first conv layers, in the complete image, usually extracts the shapes and objects in the image and the deeper ones extracts relations between pixels.

My question is: are those feature maps related to each other? If I pass an image through a conv layer, generating 64 feature maps, are those 64 feature maps related somehow in a way that, if I try to work with 10 of them separately, I might get bad results?

#

I'm thinking about making a GAN that, despite using conv layers to generate many feature maps, I'd also like to make it extract some of the best feature maps generated according to certain input

wooden sail Jan 16, 2023, 2:47 AM

#

it's almost 4 am, maybe later

hasty mountain Jan 16, 2023, 2:47 AM

#

Okay. Sweet dreams

austere swift Jan 16, 2023, 6:38 AM

#

wooden sail its notion of correctness is good grammar and text appearing together with high ...

Yeah what a lot of people don't quite understand about these types of language models in general is that they're not trained based on correctness or clarity, they're purely trained to mimic human language. Even during training, if the model says something that's completely incorrect, but the dataset also included this incorrect statement, it will be told it's correct. There's nothing that distinguishes fact from fiction other than the idea that the data it's trained on tends to have correct statements, although that's not always the case.

versed flame Jan 16, 2023, 7:47 AM

#

I have a question that might not be for this space but Ill try as someone doing analytics maybe has encountered it. In excel you can generate a map a that shows statistics, but it requires a connected to the internet to setup. Does anyone know what data is shared while doing this check?

sacred halo Jan 16, 2023, 10:14 AM

#

Hello Everyone, I am working on Hands-On-Machine-Learning-with-Scikit-Learn-Keras-and-Tensorflow book version 2. In chapter 10 page 294, there is an example about MNIST dataset and my result and the book result are far from each other. We are using same dataset and model but our result are completely different, one of the reason can be we are using different Tensorflow version but just this can create such a this difference? Another difference is I limited epochs up to 70 but in the book is 50. Is everything is related to epochs then why at the begging they are the same? Thank you

shell crest Jan 16, 2023, 11:21 AM

#

sacred halo Hello Everyone, I am working on Hands-On-Machine-Learning-with-Scikit-Learn-Kera...

Your initial accuracy seems to be very high. What is your validation accuracy by epoch 50?
I ran the same code on Colab (took about 5min with Colab free GPU) and got the graph you can see below. Colab uses TF 2.9.2 and keras 2.9.0. Colab reports this as the epoch 50 training step:

Epoch 50/50
1719/1719 [==============================] - 5s 3ms/step - loss: 0.1636 - accuracy: 0.9420 - val_loss: 0.2978 - val_accuracy: 0.8936

There might be some randomness in the state which could be further sorted out

#

lapis sequoia Jan 16, 2023, 2:15 PM

#

I have a bunch of high-dimensional embeddings (well, millions in fact) that each represent an image, does anyone know of some examples I could use as inspiration for making cool visualizations from them, like in terms of libraries used and stuff like that? But i dont mean something like Tensorboard but more like a nicely formatted image or video I could post on r/dataisbeautiful or something

sacred halo Jan 16, 2023, 2:53 PM

#

shell crest Your initial accuracy seems to be very high. What is your validation accuracy by...

Thank you so much for the response. It is about %89 at epoch 50. In your view, when is a good time stop the training? at epoch 20?

uncut roost Jan 16, 2023, 3:21 PM

#

hi guys , I need help . how can ı accessing the coordinates of the bounding boxes ?

#

I'm trying to get the labeled coordinates of the test data of the model we trained in yolov5.

#

ı saw this code . the use torch.hub.load() but with github code and yolov5s own dataset labels . but we have our own model and own dataset label txt . so ı dont know how to use . Sorry for the poor english ı am still learning

shell crest Jan 16, 2023, 3:41 PM

#

sacred halo Thank you so much for the response. It is about %89 at epoch 50. In your view, ...

Whenever the validation accuracy does not increase, and for you it seems flat so I don't see any benefit :\

river sapphire Jan 16, 2023, 3:57 PM

#

pseudocode from: https://arxiv.org/pdf/1602.01783.pdf

what does t_max mean? is it the maximum number of steps per episode? or is it the value for n-steps? if it was the value for n-steps that wouldn't make sense because it's performing an asynchronous update right after accumulating gradients, which would mean it would be updating the weights sequentially (experiences right next to each other are highly correlated)

#

if it wasn't referring to the value for n-steps, why is it called n-step Q-learning then?

arctic wedgeBOT Jan 16, 2023, 5:55 PM

#

Hey @spiral tangle!

It looks like you tried to attach a Python file - please use a code-pasting service such as https://paste.pythondiscord.com

spiral tangle Jan 16, 2023, 5:56 PM

#

I'm a newbie so my apologies if something I ask might be to obvious and my english is not quite good. I'm stuck in doing a custom grid search with cross validation with LightFM which does not come with those functions. Here the code https://paste.pythondiscord.com/agukakibad. It seem the way I split the dataset is wrong but I do not understand why since I've replicated the code of the function random_train_test_split https://making.lyst.com/lightfm/docs/_modules/lightfm/cross_validation.html#random_train_test_split to get the folds. The error I get is Incorrect number of features in item_features. I'm glad if someone could give me some tips or suggest some platform even paying to get this thing done since it is driving me crazy.

📎 embeddings.csv 📎 observations.csv

cosmic lynx Jan 16, 2023, 6:03 PM

#

what is the first AI related project I should start on?

north barn Jan 16, 2023, 6:10 PM

#

cosmic lynx what is the first AI related project I should start on?

the youtube video i posted above shows you how to make a recommendation engine for research articles, it's pretty simple and works as a nice introduction to AI and NLP

cosmic lynx Jan 16, 2023, 6:11 PM

#

okay thanks

hasty mountain Jan 16, 2023, 7:15 PM

#

Can someone help me to filter some feature maps from a convolution in Pytorch?
I have an input with sizes (Batch, 3, 64, 64) which is passed through a Conv2D, providing an output (Batch, 1000, 64, 64)
Then, I have 1000 indices obtained through a FeedForward layer and a logsoftmax function, from which I extract the 3 ones with highest value, providing me a variable with sizes (Batch, 3)

Now, I want to use this variable to get 3 channels from the Conv2D outputs, batch by batch, but I'm having some hard time figuring out how to do this.

I tried so far variations of

x = conv_output
channels = torch.ones((conv_output.size(0), conv_output.size(1))

selected_channels = self.logsoftmax(self.fc(channels))

_, indices = torch.sort(selected_channels, 1, descending=True)
indicesA = indices[:, 0]
indicesB = indices[:, 1]
indicesC = indices[:, 2]

for batch in range(x.size(0)):
    selected_features[batch] = torch.cat((x[batch, selected_channelA[batch]], x[batch, selected_channelB[batch]], x[batch, selected_channelC[batch]]), 1)

However, this throws the following error:
RuntimeError: The expanded size of the tensor (32) must match the existing size (96) at non-singleton dimension 2. Target sizes: [3, 32, 32]. Tensor sizes: [32, 96]
And I simply don't get why the function is concatenating the height and width dimensions, not the channels

lapis sequoia Jan 16, 2023, 7:52 PM

#

Does anyone know about AI machine learning?

#

I need help with a python project, if anyone has the time and is willing to help me I would be happy to talk

serene scaffold Jan 16, 2023, 8:00 PM

#

lapis sequoia Does anyone know about AI machine learning?

be sure to always ask your actual question(s). no one is a universal expert, so there's no way to know if someone will want/be able to help until they know what the question is.

young granite Jan 16, 2023, 8:03 PM

#

does one know if theres a custom made solution to use linear regression within multioutputregressor

serene scaffold Jan 16, 2023, 8:03 PM

#

young granite does one know if theres a custom made solution to use linear regression within m...

linear regression is a widely-applicable concept in ML, so your question is probably under-specified.

young granite Jan 16, 2023, 8:04 PM

#

serene scaffold linear regression is a widely-applicable concept in ML, so your question is prob...

got 2n_features and n_targets and i wanted to use LR for it to better understand and reproduce parameters instead of KNN or RF

fallen crown Jan 16, 2023, 9:35 PM

#

does anyone already did th snake game with NEAT algortihm ?
My model learns very badly, I think I don't have the right inputs, i take :

#

the distance from the head to each of the walls (4 inputs)

#

distance from head to food (x_distance and y_distance) (2 inputs)

acoustic monolith Jan 16, 2023, 9:37 PM

#

Hi can someone help me with decision tree please

#

I am trying to use grid search for model improving

fallen crown Jan 16, 2023, 9:38 PM

#

the direction of the snake, up, down, left, right (4 inputs)

acoustic monolith Jan 16, 2023, 9:38 PM

#

Not working

mild dirge Jan 16, 2023, 9:53 PM

#

fallen crown the direction of the snake, up, down, left, right (4 inputs)

You could alrdy make it simpler by only considering forward, left and right. The snake can only go 3 directions at every timestep. as inputs you could do distance to wall/snake body in the same 3 directions. Then you could have the x and y offset of the candy relative to the head and orientation.

fallen crown Jan 16, 2023, 9:56 PM

#

mild dirge You could alrdy make it simpler by only considering forward, left and right. The...

oke i see, do you mean the distance between head and food ?

mild dirge Jan 16, 2023, 9:56 PM

#

For the last part yeah

fallen crown Jan 16, 2023, 9:57 PM

#

mild dirge For the last part yeah

okey so it is a total of 3 (wall) + 3 (snake body) + food (2)

mild dirge Jan 16, 2023, 9:57 PM

#

jup

fallen crown Jan 16, 2023, 9:57 PM

#

mild dirge jup

but the snake need to know in whoch direction it goes ?

#

right, left, up or down ?

mild dirge Jan 16, 2023, 9:57 PM

#

Well the information you supply is based on the current orientation yes

worldly dawn Jan 16, 2023, 9:58 PM

#

fallen crown but the snake need to know in whoch direction it goes ?

It doesn't if it's all relative to its heading

fallen crown Jan 16, 2023, 9:59 PM

#

worldly dawn It doesn't if it's all relative to its heading

hmm i see

tranquil jasper Jan 16, 2023, 9:59 PM

#

https://pola-rs.github.io/polars/py-polars/html/reference/api/polars.Config.set_fmt_str_lengths.html

does anyone know how this method of polars work?

#

it's supposed to help with changing the amount of data dataframe shows when calling it

fallen crown Jan 16, 2023, 10:06 PM

#

mild dirge Well the information you supply is based on the current orientation yes

I have to apply conditions to calculate each input? for example to calculate the distance from the wall in front of the head, it is calculated in 4 different ways depending on whether the snake goes up, down, goes right or goes left

#

and it's the same for each input, I have to calculate according to the direction of the snake

fallen crown Jan 16, 2023, 10:13 PM

#

mild dirge Well the information you supply is based on the current orientation yes

direction = ''
if direction == 'up' :
    input1 = 
    input2 = 
    ...
if direction == 'down' :
    input1 = 
    input2 = 
    ...
if direction == 'left' :
    input1 = 
    input2 = 
    ...
if direction == 'right' :
    input1 = 
    input2 = 
    ...```

worldly dawn Jan 16, 2023, 10:14 PM

#

fallen crown ```python direction = '' if direction == 'up' : input1 = input2 = ...

You could also base your calculations more on vectors

fallen crown Jan 16, 2023, 10:14 PM

#

I think that could work because even if each set of input is calculeted in a different way, each set of input have the same meaning

worldly dawn Jan 16, 2023, 10:14 PM

#

ie. transform your heading into a vector

fallen crown Jan 16, 2023, 10:15 PM

#

that change a lot ?

worldly dawn Jan 16, 2023, 10:15 PM

#

the point is to reduce the code

#

but if your current solution works, don't mind me 🙂

fallen crown Jan 16, 2023, 10:16 PM

#

worldly dawn You could also base your calculations more on vectors

but it gives me a super big line of code?? I'd rather have multiple lines than one big line of code

worldly dawn Jan 16, 2023, 10:17 PM

#

fallen crown but it gives me a super big line of code?? I'd rather have multiple lines than o...

It wouldn't give you super big lines of code. But I am starting to think it would be a distraction for you at this stage. You can always revisit that later

fallen crown Jan 16, 2023, 10:17 PM

#

I haven't tested it yet! but it should work I think since even if each variable of each condition is calculated in a different way, ultimately regardless of the conditions, they all express the same thing

fallen crown Jan 16, 2023, 10:18 PM

#

worldly dawn It wouldn't give you super big lines of code. But I am starting to think it woul...

I don't understand 😅

#

but thank you for your help, i am gonna try this solution tomorrow

unique vale Jan 16, 2023, 10:57 PM

#

👋 hey, I want to auto-classify pytest failures. Anybody have recommendations for the approach? I'm planning to do bert sentence similarity https://www.kaggle.com/code/eriknovak/pytorch-bert-sentence-similarity/notebook but I'm afraid that the logs are much bigger then just sentences and it will not work well.. 😆
Any suggestions would be greatly appreciated!

unique vale Jan 16, 2023, 11:54 PM

#

omg, this didn't work well 😆 pretty terrible classificaion

hasty mountain Jan 17, 2023, 1:13 AM

#

If I were to use a Variational AutoEncoder with a GAN...would it help my Generator and my Discriminator to converge?
I think I've seen that VAEs can be used to generate specific noise for the generator to generate specific images(kinda like Conditional GAN, but I suppose it's more stable), but I don't know exactly how I could make this to make my Discriminator be less efficient

#

Even the most rubbish of rubbishest discriminators I can make turns out to simply laugh at my generator useless attempts to fool him

#

I shall make a discriminator with only 3 conv layers. If he still manage to properly differentiate fake images from real images, then I'll give up for the Diffusion Models...

or not. Diffusion Models are boring to train

hasty mountain Jan 17, 2023, 1:52 AM

#

And spectral normalization, which was supposed to help, actually makes things even worse yert

stone glacier Jan 17, 2023, 4:10 AM

#

hello, can anyone recommend a website to search for deep learning projects by "theme"?

#

Not looking for complete projects, but just something to give me a list of possible ideas

#

also, theme is business-oriented so the usual ideas I find generally are not usable

wild pagoda Jan 17, 2023, 4:11 AM

#

Hi, i'm starting to learn stable disfution and wanted to make a server with stable difusion, anyone know where to start?

foggy vigil Jan 17, 2023, 5:26 AM

#

Hello I wanna start like data scientist but I know whether to start studying mathematics, statistics or Python.

#

But I alredy know programming until OOP

#

In c++

worldly dawn Jan 17, 2023, 5:27 AM

#

foggy vigil But I alredy know programming until OOP

OPP?

#

Do you mean OOP?

foggy vigil Jan 17, 2023, 5:27 AM

#

And I know some descriptive statistics and calculus (of one variable).

foggy vigil Jan 17, 2023, 5:27 AM

#

worldly dawn Do you mean OOP?

Yep sorry xd

#

So What would you recommend?

worldly dawn Jan 17, 2023, 5:28 AM

#

foggy vigil And I know some descriptive statistics and calculus (of one variable).

I would recommend:

Make sure you have a CS degree. The more advanced positions would even go as far as looking for masters
There are some nice introduction books to get started like Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems

wooden sail Jan 17, 2023, 5:30 AM

#

i would just note that it doesn't have to be a CS degree. there are other programs that cover more math (but less programming)

foggy vigil Jan 17, 2023, 5:30 AM

#

worldly dawn I would recommend: * Make sure you have a CS degree. The more advanced positions...

But before all that isn't more basic things necessary? I'm in 3rd semester

worldly dawn Jan 17, 2023, 5:31 AM

#

foggy vigil But before all that isn't more basic things necessary? I'm in 3rd semester

Before what?

#

a job? able to program something? To do basic statistics? etc.

foggy vigil Jan 17, 2023, 5:34 AM

#

I mean, is it not necessary to have some basis to start in ML?

worldly dawn Jan 17, 2023, 5:35 AM

#

You can do anything you want

azure mulch Jan 17, 2023, 9:19 AM

#

foggy vigil Hello I wanna start like data scientist but I know whether to start studying ma...

Probably the conceptual bit

arctic wedgeBOT Jan 17, 2023, 9:26 AM

#

Hey @late shell!

It looks like you tried to attach file type(s) that we do not allow (.docx). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a, .csv, .json.

Feel free to ask in #community-meta if you think this is a mistake.

#

@split cedar Per Rule 6, your invite link has been removed. If you believe this was a mistake, please let staff know!

Our server rules can be found here: https://pythondiscord.com/pages/rules

hasty mountain Jan 17, 2023, 11:50 AM

#

Yes. Sometimes it can even perform better than in supervised learning

#

Yep

#

I think there's also something related to the Neural Network trying to decrease the entropy in the data

#

But, from what I remember reading, there's not really a definitive explanation, just some hypothesis

#

I don't remember how clustering works, but the NN identifies patterns in the input and tend to assign it to the closest labels as possible.

#

I think GPT-2 works through Unsupervised Learning in text. And there's also Neural Networks for labeling data

https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf

https://lilianweng.github.io/posts/2021-12-05-semi-supervised

Learning with not Enough Data Part 1: Semi-Supervised Learning

When facing a limited amount of labeled data for supervised learning tasks, four approaches are commonly discussed.
Pre-training + fine-tuning: Pre-train a powerful task-agnostic model on a large unsupervised data corpus, e.g. pre-training LMs on free text, or pre-training vision models on unlabelled images via self-supervised learning, and the...

#

In fact, I learned about unsupervised learning in NNs exactly because I wanted to make one to label my datasets

#

You might see more about that in the Pseudo-labeling part of this blog post, as it seems that working with pseudo-labels(labels generated by unsupervised learning, including NNs) rather than actual labels(human-made) tend to generate better results.

hasty mountain Jan 17, 2023, 12:08 PM

#

hasty mountain And spectral normalization, which was supposed to help, actually makes things ev...

Curious... Using more discriminators and making them duel against each other makes things work surprisingly well.
I shall now make a GAN called The Good, The Bad and the Generator py_guido

quaint rivet Jan 17, 2023, 12:34 PM

#

where to ask for help?

#

i tried #❓｜how-to-get-help

hasty mountain Jan 17, 2023, 12:45 PM

#

@serene scaffold since you're the NLP guy, tell me...
How does the Transformer works in eval mode?
Since the Decoder requires both the input sentences and the target sentences, training is ok, but what about evaluation mode, when I don't have the target sentences?

I couldn't find a link that explains this clearly. The best explanation I could find was "the Transformer predicts many possible outputs and selects the best one based on a language score"

#

I suppose that if I use BERT's version, since it tries to predict words from masked values, this might be easier to deal with...I'd just have to pass a mask as target in eval mode, right?

serene scaffold Jan 17, 2023, 3:44 PM

#

!mute 691597752345296957 "1 day" This server is not a place for you to promote your YouTube channel. You will likely be removed from the server if this continues after your mute expires.

arctic wedgeBOT Jan 17, 2023, 3:44 PM

#

:incoming_envelope: :ok_hand: applied mute to @lapis sequoia until <t:1674056667:f> (1 day).

cyan tiger Jan 17, 2023, 4:31 PM

#

Hey guys, i’m running into a keyerror trying to .at a specific index that I know is valid. This specific column has 4409 rows of data, but i can only .at up to index 3559 without getting a keyerror, anyone know why? This is so bizarre

boreal gale Jan 17, 2023, 4:35 PM

#

are you sure it exists in the index?

#

!e example of when it's missing, note i have more than 4 rows.

import pandas as pd
df = pd.DataFrame({"x": [1,2,3,4,5]}, index=[1,1,1,2,3])
df.at[4, 'x']

arctic wedgeBOT Jan 17, 2023, 4:35 PM

#

@boreal gale :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "/snekbox/user_base/lib/python3.11/site-packages/pandas/core/indexes/base.py", line 3803, in get_loc
003 |     return self._engine.get_loc(casted_key)
004 |            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
005 |   File "pandas/_libs/index.pyx", line 138, in pandas._libs.index.IndexEngine.get_loc
006 |   File "pandas/_libs/index.pyx", line 162, in pandas._libs.index.IndexEngine.get_loc
007 |   File "pandas/_libs/index.pyx", line 197, in pandas._libs.index.IndexEngine._get_loc_duplicates
008 | KeyError: 4
009 | 
010 | The above exception was the direct cause of the following exception:
011 | 
... (truncated - too many lines)

Full output: https://paste.pythondiscord.com/owijamerol.txt?noredirect

cyan tiger Jan 17, 2023, 4:36 PM

#

oh actually i just figured it out what the issue is

#

so when i import the dataframe into a csv, i can see the index of the dataframe vs the index on the excel sheet, and the index of the data frame is actually messed up, its not incrementing by 1, but there is some random pattern thats extremely weird. is there any way to reset that index on the actual dataframe?

boreal gale Jan 17, 2023, 4:38 PM

#

df.reset_index(drop=True)

cyan tiger Jan 17, 2023, 4:40 PM

#

beautiful thanks so much

grave frost Jan 17, 2023, 4:57 PM

#

iron basalt One of the reasons ChatGPT does better than previous models is injection of MORE...

technically it is a world model - just modelling the 1D world of tokens. So from its POV, everything it says is totally correct. However, we interpret those tokens in the context of the real world which it isn't capable of

hasty mountain Jan 17, 2023, 6:58 PM

#

hasty mountain I suppose that if I use BERT's version, since it tries to predict words from mas...

Better idea: using the same mask wouldn't make that much sense. So what if I passed an input from 0 to len(vocab_size) to a feedforward layer and the output would become the target sequence?

#

(I still didn't manage to discover how the Transformer generates sentences in eval mode)

iron basalt Jan 17, 2023, 7:36 PM

#

grave frost technically it *is* a world model - just modelling the 1D world of tokens. So fr...

I wrote something like this in a previous message.

#

(by "actual" I really mean "physical world model," I should have clarified)

#

(sensory fusion with text or via other means)

#

(Or maybe at least something like Wolfram Alpha, just math and science)

fallen crown Jan 17, 2023, 7:41 PM

#

@worldly dawn Hi, I have coded the distances to the walls in each direction but I cannot calculate the distance from the head to the body, for example if to the right of the head there is no body, what do I put cmme input because the distance does not even exist

exotic cypress Jan 17, 2023, 8:37 PM

#

Hi! I have Raspberry Pi 4 Model B 4 gb Ram 64-bit model. I am trying to use raspberry camera but it doesn't work. I guess 64 bit doesn't support camera yet. I installed 32 bit os to it. When I enabled legacy camera from configuration, a problem occurs that "Can't show the desktop". Is there any solution for this problem?

grave frost Jan 17, 2023, 8:39 PM

#

iron basalt (Or maybe at least something like Wolfram Alpha, just math and science)

like https://huggingface.co/spaces/JavaFXpert/Chat-GPT-LangChain

Chat GPT LangChain - a Hugging Face Space by JavaFXpert

wary breach Jan 17, 2023, 9:15 PM

#

mild dirge Jan 17, 2023, 9:16 PM

#

It does not only depend on data, it may also depend on the algorithm you are planning to use. Some algorithms desire input data that is normalized, others standardized, and some don't even care.

wary breach Jan 17, 2023, 9:17 PM

#

mild dirge It does not only depend on data, it may also depend on the algorithm you are pla...

I plan to use a TensorFlow NN as my baseline

#

I know models like XGBoost doesn't really care about the data in that sense

mild dirge Jan 17, 2023, 9:18 PM

#

So if you want to be on the safe side, normalization is often a pretty good pre-processing step. It is mostly important to not have extremely high/low values, and so that every feature is in the same range.

#

If you have very extreme outliers, you may want to think about dealing with those though, as they can heavily affect normalization

#

Having a single point with a feature value 99999 could make all other values very low after normalization f.e.

wary breach Jan 17, 2023, 9:23 PM

#

mild dirge Having a single point with a feature value 99999 could make all other values ver...

What's the best way to go about figuring out what outliers to remove (if any)? Here is the box plot for the top data or example:

Z5DuhgGXJKKMuCSVJQBl6SiDLgkFWXANdAi4gfAfwLviYgjEbGr3zNJbXkpvSQV5Rm4JBVlwCWpKAMuSUUZcEkqyoBLUlEGXJKKMuCSVNRfAe7eygXfWSWAAAAAElFTkSuQmCC.png

#

I know some people bin data to help deal with some of those extreme values

mild dirge Jan 17, 2023, 9:24 PM

#

This seems fine already, I'm more so talking about close to a magnitude higher than most other points

wary breach Jan 17, 2023, 9:24 PM

#

mild dirge This seems fine already, I'm more so talking about close to a magnitude higher t...

Like this? lol

#

That's the second set of data from my first pic ^

mild dirge Jan 17, 2023, 9:25 PM

#

Imo still fine, but I don't know if there is a objective way to tell if an outlier is too much for the normalization to have a bad impact on the network.

wary breach Jan 17, 2023, 9:27 PM

#

So the first set of data is "Age" and second set is "MonthlyIncome". Obviously those have very different scales. Do I need to normalize both then standardize or just standardize?

mild dirge Jan 17, 2023, 9:27 PM

#

Just normalize would be fine

wary breach Jan 17, 2023, 9:28 PM

#

mild dirge Just normalize would be fine

Why so? I'm fairly new and get really confused between the difference lol

#

in terms of when to use which ^

mild dirge Jan 17, 2023, 9:29 PM

#

Normalization is mostly used for NNs to get different features on the same scale. In your case the price is quite a few magnitudes bigger, so it could be biased to use that more over the age.

#

Standardization is used when your data seems normally distributed, and some algorithms expect your data to be normal and standardized.

wary breach Jan 17, 2023, 9:32 PM

#

mild dirge Standardization is used when your data seems normally distributed, and some algo...

So I should normalize the data first (if it isn't already). If it is normal already, I should should standardize it instead?

mild dirge Jan 17, 2023, 9:33 PM

#

Your data is not normalized, as the age ranges between 0 and 60 ish and the income betwen 0 and 20k

#

Normalized is between 0 and 1

wary breach Jan 17, 2023, 9:34 PM

#

mild dirge Your data is not normalized, as the age ranges between 0 and 60 ish and the inco...

I know I'm saying in general

mild dirge Jan 17, 2023, 9:34 PM

#

And I'm not sure when you would use standardization for neural networks. It seems to differ case by case when looking at projects

wary breach Jan 17, 2023, 9:34 PM

#

#

I'm assuming you would recommend to get rid of this outlier?

mild dirge Jan 17, 2023, 9:35 PM

#

Yeah that would probably be a good idea

wary breach Jan 17, 2023, 9:35 PM

#

Is it just best to drop them (I think it's like 5 points or something)? Or is there a better way to handle them?

mild dirge Jan 17, 2023, 9:36 PM

#

ehh, I'm not sure. If 5 points is not a lot of points you could just drop them. You could maybe also clamp them if you think they contain very useful info.

#

It might also still give good results if you simply just normalize anyways.

wary breach Jan 17, 2023, 9:37 PM

#

mild dirge ehh, I'm not sure. If 5 points is not a lot of points you could just drop them. ...

What's clamping?

mild dirge Jan 17, 2023, 9:38 PM

#

Basically just saying everything above value x will now be value x and everything below value y will now be y

#

For chosen x and y

cunning flame Jan 17, 2023, 9:38 PM

#

anyone knows how to make the graph in matplotlib fixed size and not flexible to the line?

mild dirge Jan 17, 2023, 9:38 PM

#

But this will treat every value above and equal to x the same

wary breach Jan 17, 2023, 9:39 PM

#

cunning flame anyone knows how to make the graph in matplotlib fixed size and not flexible to ...

https://www.geeksforgeeks.org/change-plot-size-in-matplotlib-python/ fig size you mean?

wary breach Jan 17, 2023, 9:40 PM

#

mild dirge Basically just saying everything above value `x` will now be value `x` and every...

Gotcha! Last question lol. Do I need to normalize each column by itself or everything at once? Guessing each individually

cunning flame Jan 17, 2023, 9:40 PM

#

,,,

mild dirge Jan 17, 2023, 9:40 PM

#

You should normalize every column separately

cunning flame Jan 17, 2023, 9:40 PM

#

tysm

wary breach Jan 17, 2023, 9:40 PM

#

mild dirge You should normalize every column separately

Kk thanks for the help!

mild dirge Jan 17, 2023, 9:41 PM

#

As a quick note, you should also normalize the test data according to min and max of the training data

#

So you should separate train and test before normalizing

#

And the normalize the training data according to train_max and train_min for each column

#

and use those for test as well

wary breach Jan 17, 2023, 9:42 PM

#

Yea 😄 although, this is for a Kaggle contest and I've heard it's fine to concat the two and preprocess them together since there won't be anymore data coming in. Is this true?

mild dirge Jan 17, 2023, 9:43 PM

#

Ehh, it is a bit more involved, I'll explain in a bit, bit busy atm

wary breach Jan 17, 2023, 9:43 PM

#

mild dirge Ehh, it is a bit more involved, I'll explain in a bit, bit busy atm

Sure np! Just @ me when you have time to explain a bit

cunning flame Jan 17, 2023, 9:44 PM

#

wary breach <https://www.geeksforgeeks.org/change-plot-size-in-matplotlib-python/> fig size ...

no wait it didnt quite work

wary breach Jan 17, 2023, 9:44 PM

#

cunning flame no wait it didnt quite work

Why not?

cunning flame Jan 17, 2023, 9:45 PM

#

wary breach Why not?

its still flexible to the line

#

but the size of the image itself

#

changes

#

but like the measurements are the same

wary breach Jan 17, 2023, 9:45 PM

#

Send me a pic? Idk what you mean xD

cunning flame Jan 17, 2023, 9:46 PM

#

oke

#

#

dont mind the numbers

#

see, the measurements are the same

#

but the size of the picture is different

wary breach Jan 17, 2023, 9:47 PM

#

Measurements as in what?

cunning flame Jan 17, 2023, 9:47 PM

#

like

wary breach Jan 17, 2023, 9:47 PM

#

y and x axis?

cunning flame Jan 17, 2023, 9:47 PM

#

yeah

wary breach Jan 17, 2023, 9:49 PM

#

Try using plt.xlim(__, __) plt.ylim(__, __)

cunning flame Jan 17, 2023, 9:49 PM

#

ok

#

yes

#

thats the thing!

#

tysm

wary breach Jan 17, 2023, 9:50 PM

#

Yay 😄

#

np

mild dirge Jan 17, 2023, 9:50 PM

#

wary breach Yea 😄 although, this is for a Kaggle contest and I've heard it's fine to concat...

So the reason you want to normalize just based on the training data is because you want to have a fair judgement of how well your model will perform on new/real world data. If you normalize based on all data, this means you are using information from data that you are testing on, which makes your performance measure on the test data less meaningful. It's a bit like cheating, your model will seem to perform better than it actually does.

#

It's also the reason why you only test on your test data once, and not modify the model anymore after getting the final performance measure on the test data

wary breach Jan 17, 2023, 9:51 PM

#

mild dirge So the reason you want to normalize just based on the training data is because y...

There won't be any new data though since they give a train/test set. Obviously in a real world application I would need to but wouldn't it be alright in this case?

mild dirge Jan 17, 2023, 9:52 PM

#

Would your model eventually be tested on other data to compare your model with others?

#

If so, then yes it could matter. In any case, your performance on the test set will be less meaningful if you normalize using your entire dataset, instead of just your training data.

wary breach Jan 17, 2023, 9:53 PM

#

mild dirge Would your model eventually be tested on other data to compare your model with o...

This is the note for the leaderboard: This leaderboard is calculated with approximately 20% of the test data. The final results will be based on the other 80%, so the final standings may be different.

mild dirge Jan 17, 2023, 9:53 PM

#

Eventually you would want to train on both train and test if you have done all steps of designing your model.

wary breach Jan 17, 2023, 9:53 PM

#

So I'm not sure if they use an extended set of test data or a completely different dataset

mild dirge Jan 17, 2023, 9:53 PM

#

Different datasets it seems

#

Of which 20% will be used for leaderboard, and 80% for actual score

wary breach Jan 17, 2023, 9:54 PM

#

So in that case I should normalize only the train data first

mild dirge Jan 17, 2023, 9:54 PM

#

When designing the model yes

wary breach Jan 17, 2023, 9:55 PM

#

mild dirge When designing the model yes

Let's say I normalize the train data + fit the model on the train test split data and it performs well. I'm guessing I need to then perform the same normalization on the test data that I did for the train data before I fit it to that?

#

Same normalization as in on the same columns

mild dirge Jan 17, 2023, 9:56 PM

#

You would fit on training, then probably use validation data to make decisions about your model like architecture and amount of layers etc.

#

Then after done you could test on tests set to see how well the final model does

wary breach Jan 17, 2023, 9:57 PM

#

Gotcha 😄 thank you for the help!

cunning flame Jan 17, 2023, 9:59 PM

#

is there any good tutorial on spacy?

hasty mountain Jan 17, 2023, 9:59 PM

#

hasty mountain (I still didn't manage to discover how the Transformer generates sentences in ev...

Beam search... I've studied this, but I had forgot...
Seems a technique a bit meh, though

#

#

(For a moment while he was processing the answer, I thought it would interpretate my input as if I was offended...this memory system it has is surely quite a thing)

wary breach Jan 17, 2023, 10:54 PM

#

mild dirge You would fit on training, then probably use validation data to make decisions a...

Do you know if sklearn's MinMaxScaler normalizes each column individually or the whole dataset when you feed it a dataframe?

#

It looks like it does each column individually but I can't tell for sure.

mild dirge Jan 17, 2023, 10:58 PM

#

https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.MinMaxScaler.html

scikit-learn

sklearn.preprocessing.MinMaxScaler

Examples using sklearn.preprocessing.MinMaxScaler: Release Highlights for scikit-learn 0.24 Release Highlights for scikit-learn 0.24 Image denoising using kernel PCA Image denoising using kernel PC...

#

individually "each feature individually"

wary breach Jan 17, 2023, 10:59 PM

#

mild dirge individually "each feature individually"

thank you 😅

cunning flame Jan 17, 2023, 11:37 PM

#

could someone please help

#

im being slow

#

wary breach Jan 18, 2023, 12:46 AM

#

cunning flame

Guessing one of your columns is a string insterad of int

#

Possible for gender?

cunning flame Jan 18, 2023, 12:47 AM

#

all of them are string

#

https://www.geeksforgeeks.org/python-gender-identification-by-name-using-nltk/

GeeksforGeeks

Python | Gender Identification by name using NLTK - GeeksforGeeks

A Computer Science portal for geeks. It contains well written, well thought and well explained computer science and programming articles, quizzes and practice/competitive programming/company interview Questions.

#

thats my "inspiration"

wary breach Jan 18, 2023, 12:48 AM

#

What are the values of gender?

#

Male and Female?

cunning flame Jan 18, 2023, 12:48 AM

#

yeah

wary breach Jan 18, 2023, 12:48 AM

#

Have you heard of one-hot-encoding before?

cunning flame Jan 18, 2023, 12:48 AM

#

nope

#

whats that

wary breach Jan 18, 2023, 12:50 AM

#

In your case, "gender" is a categorical feature (i.e. you're either male or female). One way to handle this is to turn the columns into integers (0 and 1). One-hot-encoding is a way to do this.

#

#

This is what it looks like

cunning flame Jan 18, 2023, 12:51 AM

#

oh yeah ik the idea

#

just didn;t know the fancy name

#

so i need to convert it

wary breach Jan 18, 2023, 12:52 AM

#

I would

#

makes it much easier to deal with

#

https://stackoverflow.com/questions/37292872/how-can-i-one-hot-encode-in-python

#

I personally use pd.get.dummies

cunning flame Jan 18, 2023, 12:53 AM

#

alright but can i ask one more question

wary breach Jan 18, 2023, 12:53 AM

#

sure

#

I'm still a noob though so I don't know as much as a lot of others xD

cunning flame Jan 18, 2023, 12:53 AM

#

could you explain how does nltk.NaiveBayesClassifier.train() exactly work

#

ik it uses Bayes theorem to classify

#

but what does it mean

#

ik the theorem but like how it works

wary breach Jan 18, 2023, 12:54 AM

#

Honestly I'm not sure I've never used it before xD

cunning flame Jan 18, 2023, 12:54 AM

#

oh alright

#

imma go research then

wary breach Jan 18, 2023, 12:55 AM

#

It uses Bayes' theorem that's all I know too lol

cunning flame Jan 18, 2023, 12:55 AM

#

welp one last question about my code

wary breach Jan 18, 2023, 12:55 AM

#

sure

cunning flame Jan 18, 2023, 12:55 AM

#

why does the same thing work herehttps://www.geeksforgeeks.org/python-gender-identification-by-name-using-nltk/

GeeksforGeeks

Python | Gender Identification by name using NLTK - GeeksforGeeks

A Computer Science portal for geeks. It contains well written, well thought and well explained computer science and programming articles, quizzes and practice/competitive programming/company interview Questions.

#

but not for me

wary breach Jan 18, 2023, 12:57 AM

#

Can you print test_set?

cunning flame Jan 18, 2023, 12:58 AM

#

sure

#

oh thats the error i think

#

it prints out last letter and then just "name"

wary breach Jan 18, 2023, 1:00 AM

#

your labeled_names has part of the code wrong

#

You have ([(name, "name)]) when in the example it's name, "male" and name, "female"

cunning flame Jan 18, 2023, 1:01 AM

#

oh

wary breach Jan 18, 2023, 1:02 AM

#

Also you're missing this line py featuresets = [(gender_features(n), gender) for (n, gender)in labeled_names]

#

Oh nvm I see you just called it "features" instead

cunning flame Jan 18, 2023, 1:03 AM

#

im not

#

yeah

#

so should i convert male to 1 and female to 0?

wary breach Jan 18, 2023, 1:03 AM

#

what happens now if you print test_set after you changed it?

#

It doesn't seem like you need to

cunning flame Jan 18, 2023, 1:03 AM

#

it prints out gender instead of name

#

but still teh same error

wary breach Jan 18, 2023, 1:04 AM

#

let me download the dataset and try it's hard to do it with no code xD sec

cunning flame Jan 18, 2023, 1:04 AM

#

oh um you want me to send txt files?>

wary breach Jan 18, 2023, 1:04 AM

#

no it's ok I see it in the geeksforgeeks page

#

Can you copy and paste your code?

#

!code

arctic wedgeBOT Jan 18, 2023, 1:05 AM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

cunning flame Jan 18, 2023, 1:05 AM

#

sure

#

import random as r
from nltk.corpus import names
import nltk
def genderFind(word):
  return(word[-2])
MaleNames = open("/content/Male.txt").readlines()
FemaleNames = open("/content/Female.txt").readlines()
labeled_names = ([(name, "Male") for name in MaleNames] + [(name, "Female") for name in FemaleNames])
r.shuffle(labeled_names)

features = [(genderFind(n), gender) for (n, gender) in labeled_names]

train_set, test_set = features[500:], features[:500]
classifier = nltk.NaiveBayesClassifier.train(test_set)

wary breach Jan 18, 2023, 1:08 AM

#

Ok I downloaded sec let me look through it

cunning flame Jan 18, 2023, 1:09 AM

#

ok

wary breach Jan 18, 2023, 1:13 AM

#

Change the function you have in line 4

#

to this

def genderFind(word):
    return {'last_letter':word[-1]}```

#

Works for me now

cunning flame Jan 18, 2023, 1:14 AM

#

OH GOSH

#

IT WORKS

wary breach Jan 18, 2023, 1:14 AM

#

😄

cunning flame Jan 18, 2023, 1:15 AM

#

but its innacurate

#

imma use bigger lists

#

to train it

wary breach Jan 18, 2023, 1:15 AM

#

Yea I think it's due to the way you're reading the lists

#

if you print either of the male names or female names you can see there's a \n in every name string

cunning flame Jan 18, 2023, 1:18 AM

#

is there a way to say that its wrong when its wrong so it learns more?

#

and so it remembers it somehow

#

btw tysm for the help

#

forgot to say ity

wary breach Jan 18, 2023, 1:21 AM

#

It's predicting the test name based on how you trained it

#

It's only showing male names for me I think because of the way you're reading files

cunning flame Jan 18, 2023, 1:22 AM

#

oh

#

wdym

wary breach Jan 18, 2023, 1:24 AM

#

if you print either the MaleNames or FemaleNames, you can see that the names haven't been stripped of the new line from the txt file py 'Wilson\n', 'Wilt\n', 'Wilton\n', 'Win\n', 'Windham\n', 'Winfield\n', 'Winford\n', 'Winfred\n', 'Winifield\n', 'Winn\n', 'Winnie\n',...

#

Replace the 2 lines where you're reading in the file with this:

with open("/content/Male.txt", "r") as file:
    MaleNames=file.read().splitlines()
with open("/content/Female.txt", "r") as file:
    FemaleNames=file.read().splitlines()```

#

You could write a function to do it in less than 2 lines but I'm too lazy to figure that out rn xD

#

Based on the training data you gave it should be accurate now

#

@cunning flame

cunning flame Jan 18, 2023, 1:31 AM

#

oh

#

thank you

wary breach Jan 18, 2023, 1:32 AM

#

cunning flame thank you

Np let me know if it works 😄

cunning flame Jan 18, 2023, 1:32 AM

#

alr it became more accurate but im actually doing it for a n order so i need like 95-99% so ill have to go ask a very smart guy i know

#

but again thank you so much

wary breach Jan 18, 2023, 1:33 AM

#

Np 😄

cunning flame Jan 18, 2023, 1:33 AM

#

also can you tell me what was changed?

wary breach Jan 18, 2023, 1:33 AM

#

gl with your model

cunning flame Jan 18, 2023, 1:33 AM

#

like what was the problem?

wary breach Jan 18, 2023, 1:33 AM

#

For the reading files part?

cunning flame Jan 18, 2023, 1:33 AM

#

yeah

wary breach Jan 18, 2023, 1:34 AM

#

https://media.geeksforgeeks.org/wp-content/uploads/male.txt see how in the file each name is on a new line? Well python recognizes that so for each name in the file it added \n which means new line

#

Splitlines tells python to ignore the new lines and only focus on the words

cunning flame Jan 18, 2023, 1:34 AM

#

ooooh

#

alright

wary breach Jan 18, 2023, 1:35 AM

#

You could also get it more accurate by using different datasets

#

The male/female dataset provided seems kind of bad

cunning flame Jan 18, 2023, 1:36 AM

#

yeah

#

imma go find like a 10 thousand line one or smthn

#

also, do you know if there is a way for ai to remember its training so it doesnt have to redo it everytime?

#

i found a dataset with 240000 names

wary breach Jan 18, 2023, 1:55 AM

#

cunning flame also, do you know if there is a way for ai to remember its training so it doesnt...

You shouldn't have to retrain it every time

cunning flame Jan 18, 2023, 1:55 AM

#

but it does it

#

in the code

wary breach Jan 18, 2023, 1:56 AM

#

Oh that's because you're running everything again

cunning flame Jan 18, 2023, 1:56 AM

#

yeah

#

a, I SHOULD

wary breach Jan 18, 2023, 1:56 AM

#

What are you using?

#

like vscode, colab, etc

cunning flame Jan 18, 2023, 1:56 AM

#

google colab

wary breach Jan 18, 2023, 1:57 AM

#

Split up the lines of code into chunks so that way you only have to rerun the cell that has the name

#

or actually just keep it as is

#

and move this to a new chunk py print(classifier.classify(genderFind('Walt')))

cunning flame Jan 18, 2023, 1:57 AM

#

ih thats smart

wary breach Jan 18, 2023, 1:57 AM

#

then you only have to run that chunk of code

strong sedge Jan 18, 2023, 3:26 AM

#

hello, I am currently learning how to work with time series data, v this is a sample of the data I am working with, there is no "periodicity" can you suggest how I should do time series analysis on this ?

cunning flame Jan 18, 2023, 3:46 AM

#

are you trying to smooth it out or?

strong sedge Jan 18, 2023, 3:50 AM

#

cunning flame are you trying to smooth it out or?

atm I havent thought about that, I can try that, but I want some advice on algorithms that I can try
the data I have worked with in the past is clearly seasonal/periodic, so I just use arima, but I dont think that will give good perf on this dataset

cunning flame Jan 18, 2023, 3:50 AM

#

i need to first fully understand what you are trying to do

strong sedge Jan 18, 2023, 3:50 AM

#

if you want some more detail, this is a custom/real life sales data set

#

my trainer has asked me to do sales forecasting

cunning flame Jan 18, 2023, 3:51 AM

#

oh you can do a regression

#

you know whats that?

strong sedge Jan 18, 2023, 3:51 AM

#

yeah

cunning flame Jan 18, 2023, 3:51 AM

#

so just do that

strong sedge Jan 18, 2023, 3:52 AM

#

do you usually do regression on time series data ?

#

in practice ?

cunning flame Jan 18, 2023, 3:52 AM

#

yeah

#

you use it anywhere where you want to predisct stuff

wary breach Jan 18, 2023, 3:52 AM

#

it doesn't look like there's any sort of correlation for regression

cunning flame Jan 18, 2023, 3:52 AM

#

hm

#

thats true

strong sedge Jan 18, 2023, 3:52 AM

#

wary breach it doesn't look like there's any sort of correlation for regression

that was my thinking also

#data-science-and-ml

load the input data

convert the input data to a torch tensor

define the model

define the loss function

define the optimizer

train the model