potent sky Jun 3, 2023, 4:25 PM

#

Possible. I generally try to build from scratch all the basic stuff

past meteor Jun 3, 2023, 4:26 PM

#

I could use them for my work (medical stuff, modelling something as a function of vital signs + behaviour)

potent sky Jun 3, 2023, 4:26 PM

#

It's not just the coding part, but moreso the questions that arise during the process that are more valuable imo

past meteor Jun 3, 2023, 4:26 PM

#

But I just don't get why people are using them for time series. You don't want permutation invariance there

potent sky Jun 3, 2023, 4:26 PM

#

I almost certainly go on deeper math rabbit holes than coding when implementing something from scratch

past meteor Jun 3, 2023, 4:27 PM

#

So they have temporal fusion transformers that take away the permutation invariance of transformers, why not use a regular RNN at that point etc...

potent sky Jun 3, 2023, 4:27 PM

#

I find that a lot of details I might've glossed over when just reviewing the theory become difficult to ignore once you're implementing it

potent sky Jun 3, 2023, 4:27 PM

#

past meteor But I just don't get why people are using them for time series. You don't want p...

We account for that using positional embeddings

#

The other benefits of Transformers are exceptional for time series. And positional embeddings are powerful enough to make it work

hasty mountain Jun 3, 2023, 4:28 PM

#

past meteor I could use them for my work (medical stuff, modelling something as a function o...

Spare me the work for my undergraduation and make a Transformer to predict probabilities of disease diagnoses according to the symptoms a patient has related brainmon

potent sky Jun 3, 2023, 4:29 PM

#

past meteor So they have temporal fusion transformers that take away the permutation invaria...

Transformers are not restricted to processing the input sequence one-at-a-time
This leads to arguably some of the biggest benefits of Transformers, modelling very long sequence dependencies (theoretically infinite) and single shot computation in parallel

#

gtg now ;-;

hasty mountain Jun 3, 2023, 4:30 PM

#

potent sky Transformers are not restricted to processing the input sequence one-at-a-time T...

I'd say that there's a trend on using Transformers for anything... pithink

#

There's the Transformer for the Stable Diffusion conditioning(text), there's Transformers for AlphaStar, the DeepMind's AI that achieved GrandMaster in StarCraft 2, there's Transformers for image classification, for video classification, face recognition, text classification...

#

Maybe there's also for working with audio. I just didn't find it yet yert

agile cobalt Jun 3, 2023, 4:33 PM

#

https://ai.googleblog.com/2022/10/audiolm-language-modeling-approach-to.html

AudioLM is a pure audio model that is trained without any text or symbolic representation of music. AudioLM models an audio sequence hierarchically, from semantic tokens up to fine acoustic tokens, by chaining several Transformer models, one for each stage. Each stage is trained for the next token prediction based on past tokens, as one would train a text language model. The first stage performs this task on semantic tokens to model the high-level structure of the audio sequence.

AudioLM: a Language Modeling Approach to Audio Generation

potent sky Jun 3, 2023, 4:34 PM

#

hasty mountain I'd say that there's a trend on using Transformers for anything... <:pithink:652...

I won't deny it. In fact I was going through this paper sometime ago (not sequence processing) and it frustrated me; looked like they'd just thrown a transformer each at 7 parts of the problem and hoped it'd work.
But I can't complain. It works. That paper reports SOTA performance and defined a completely new task and training procedure

#

And tbh often there's a lot of thinking and mathematical justification that goes into where to throw a transformer and what kind of transformer

potent sky Jun 3, 2023, 4:34 PM

#

agile cobalt https://ai.googleblog.com/2022/10/audiolm-language-modeling-approach-to.html > A...

Yeah this is fun lol

past meteor Jun 3, 2023, 4:35 PM

#

potent sky Transformers are not restricted to processing the input sequence one-at-a-time T...

Yeah, I just don't have enough finesse with transformers

hasty mountain Jun 3, 2023, 4:36 PM

#

potent sky I won't deny it. In fact I was going through this paper sometime ago (not sequen...

I'm quite prone to think that, in most applications, Transformers are thrown into a task and everything else is adjusted in order to make it work.

past meteor Jun 3, 2023, 4:36 PM

#

I'll try them out for my work if I have spare time

#

At the very least it'll be a learning experience lol

merry roost Jun 3, 2023, 4:41 PM

#

How would you guys suggest I learn pytorch? I want to train a simple model over the next few days, taking in ~140 inputs, im not sure how many center layers, but only 7 outputs. I have learned a bit about ai and how it works, the math behind it and all that, but I just never learned pytorch yet. Most tutorials on pytorch seam kinda confusing and the main docs are verry in depth for a starting guide.

data for input and output is repeated data of

type
2-4. positon
5-7. rotation

ABOVE IS COPY FROM #python-discussion to continue a conversation.

if anyone wants to help, I dont know how to do this well, I can generate the data in this form and would prefer to use this rather than a waveform colapse to generate a map for a game.

#

@nova pollen hola

nova pollen Jun 3, 2023, 4:41 PM

#

what would the training data be?

#

the more context I get the less I think deep learning is suitable 😅

merry roost Jun 3, 2023, 4:44 PM

#

nova pollen what would the training data be?

fine, here is the long version

I want to take a racing game that has track prices for a lot of things, use the previous track peices to generate the next one and so on. The training data will come from maps that have no boosters or any sort of speed increse and be sorted into the ai model using the distance from the start block. The reason I have to sort it is because all maps have blocks unsorted, and without any maps with speed boosters, I can assume everything goes in a down direction or at least away from the spawn point allowing me to get the next peice and gett a good training set.

#

using that training set I will try to generate the next block that was placed in the map assinging weights to how similar blocks are to eachother allowing the ai to still get points for being wrong

#

I will also when chosign points, if the selected peice is less than 20 away from the start just include 0's as the types and position

#

*and rotation, allowing me to slowly geneate a map using this ai after it has trained on enought data on the maps and for long enought

#

This aprroch is better over waveform colapse, becausei want to learn ai and I dont want to haveto find all the parts my self, allong with the fact that that would illeminate any jumps or obsticals if I should use waveform colapse

merry roost Jun 3, 2023, 4:47 PM

#

nova pollen the more context I get the less I think deep learning is suitable 😅

I may also add one more data point giving how many track peices exist in total, allowing the ai to end the track at a reasonable point.

#

I know this is possiable, its just how hard it is

nova pollen Jun 3, 2023, 4:49 PM

#

mm apart from the sequence modelling point i mentioned earlier

#

this is a generative problem

#

since there isn't really a "ground truth" next object

merry roost Jun 3, 2023, 4:50 PM

#

nova pollen since there isn't really a "ground truth" next object

There is for the training data, I can use the one that is placed afterwards as that object

#

I would train it on a lot of comunity made maps

#

*reciently there was a no power compotion where people could tag their maps as not using any powered objects like boosters

nova pollen Jun 3, 2023, 4:51 PM

#

right, but if I gave you the sentence
"I am currently eating a BLANK"
and asked you to complete it, there would be many possible valid continuations

merry roost Jun 3, 2023, 4:52 PM

#

nova pollen right, but if I gave you the sentence "I am currently eating a BLANK" and asked...

yes, thats why I hoped to try to solve it using weights for how much each peice was similer, but I dont know if that would help at all

nova pollen Jun 3, 2023, 4:53 PM

#

this makes training it as if there were only one correct answer difficult

merry roost Jun 3, 2023, 4:54 PM

#

nova pollen this makes training it as if there were only one correct answer difficult

yeah...

#

I just wanted to do this for the reasons specified before

nova pollen Jun 3, 2023, 4:54 PM

#

mm

merry roost Jun 3, 2023, 4:55 PM

#

as generaly wavefrom colase, only looks at its direct neibors, but I could have it extend its size ig, or make premade assets for each road section

#

but eithro way it would not be as good as training it directly...

merry roost Jun 3, 2023, 4:57 PM

#

nova pollen mm

I have a arm small computer that I already have on 24/7 running nothing rn, so I could just give it the task on the cpu for a few days.

nova pollen Jun 3, 2023, 4:58 PM

#

doesnt hurt to try i suppose

merry roost Jun 3, 2023, 4:59 PM

#

nova pollen doesnt hurt to try i suppose

yeah.... If we have 1 correct also, it would train, just not the best

#

it would have conflicting information

#

but it would work

potent sky Jun 3, 2023, 5:00 PM

#

merry roost How would you guys suggest I learn pytorch? I want to train a simple model over ...

The tutorials in the torch docs should be a good option

#

JavaLim in ai channel 👀

potent sky Jun 3, 2023, 5:01 PM

#

past meteor At the very least it'll be a learning experience lol

Yeah. They're very powerful and very pervasive in many sub-fields of ML

potent sky Jun 3, 2023, 5:01 PM

#

hasty mountain I'm quite prone to think that, in most applications, Transformers are thrown int...

Maybe, maybe not. Still seems to give better performance than any other option

merry roost Jun 3, 2023, 5:01 PM

#

potent sky The tutorials in the torch docs should be a good option

do you think this is a good way of trying to solve this, or do you suggest I do somthing diffrent

cold osprey Jun 3, 2023, 5:02 PM

#

u wanna replicate how the game generate track pieces?

potent sky Jun 3, 2023, 5:02 PM

#

The first rule of machine learning is: don't use machine learning
i.e. try to find a simpler solution, mathematical or algorithmic

#

I read your long version but tbh it wasn't clear to me what exactly you're trying to do

#

Like wdym by "sorted into the ai model"

merry roost Jun 3, 2023, 5:04 PM

#

cold osprey u wanna replicate how the game generate track pieces?

not replecate, I want to generate a track

cold osprey Jun 3, 2023, 5:05 PM

#

assuming the game ure using doesnt just randomly generate track pieces and stitch them together, ure just replicating that algorithm by training a model on that data no?

rose dagger Jun 3, 2023, 5:05 PM

#

Sorry to interrupt the lively discussion here: I have a problem with the training time of my convolutional neural network. The inputs are 512x512 (grayscale) images and i want to perform image segmentation. For this i am choosing the U-Net architecture. Now even for a reduced training set of only ~100 samples, a single epoch takes ~1h to finish. My total amount of training data is ~1600 samples (not even including additional data augmentation). What would be smarter to do, in order to cut down on training time while keeping some of the performance: (i) "Reduce" the images to 256x256 or even 128x128 by some kind of "blurring" , (ii) reducing the networks architecture by removing a few layers or (iii) something else.

cold osprey Jun 3, 2023, 5:07 PM

#

rose dagger Sorry to interrupt the lively discussion here: I have a problem with the trainin...

ure using gpu i hope?

merry roost Jun 3, 2023, 5:07 PM

#

potent sky Like wdym by "sorted into the ai model"

the track peices are compleatly out of order in the data, but i have their positions, then using the start block, I can try to find what next track peice there is based off sorting them by distance using only tracks that have no speed increse;

track peices are out of order, I am sorting by their distance from the start blocks, using only tracks that have no external power for your vheical, allowing me to know what order they are in, and use that as input data, as without sorting it it would be hard

merry roost Jun 3, 2023, 5:07 PM

#

cold osprey assuming the game ure using doesnt just randomly generate track pieces and stitc...

I am trying to train on comunity made maps, there is nothing in the game for this

cold osprey Jun 3, 2023, 5:07 PM

#

why not just write an algorithm (non ml) to generate tracks?

merry roost Jun 3, 2023, 5:08 PM

#

I am not using wafeformcollapse because its its not fun, I want to learn ai, and I cannot allow for thins souch as jumps with that.

rose dagger Jun 3, 2023, 5:08 PM

#

cold osprey ure using gpu i hope?

oh my god i just checked. I am beyond stupid lmao. Thanks for the quick reply

cold osprey Jun 3, 2023, 5:09 PM

#

rose dagger oh my god i just checked. I am beyond stupid lmao. Thanks for the quick reply

tensorflow on windows?

rose dagger Jun 3, 2023, 5:09 PM

#

yes

potent sky Jun 3, 2023, 5:09 PM

#

Pain

merry roost Jun 3, 2023, 5:09 PM

#

cold osprey why not just write an algorithm (non ml) to generate tracks?

the only reasonable one would be wafefrom colapse, or to make all jumps and transitions my self then have it basicly do waveform colapse on that data to get the output I wnat

cold osprey Jun 3, 2023, 5:09 PM

#

sounds like seq to seq

potent sky Jun 3, 2023, 5:10 PM

#

Why can't you do jumps with waveform collapse. What are jumps here

cold osprey Jun 3, 2023, 5:10 PM

#

not sure of the details if its non fixed length, etc tho

potent sky Jun 3, 2023, 5:10 PM

#

merry roost the track peices are compleatly out of order in the data, but i have their posit...

Makes sense

cold osprey Jun 3, 2023, 5:10 PM

#

and how u would restrict certain combination of track pieces. ig it will be learnt based on community made maps

merry roost Jun 3, 2023, 5:11 PM

#

cold osprey and how u would restrict certain combination of track pieces. ig it will be lear...

check the map, and there was a chalange reciently for best maps without any external power, so I can use those as a training set

#

disguartding any seneary

potent sky Jun 3, 2023, 5:11 PM

#

Are the "track pieces" to be selected from a finite, discrete, pre-determined set?

past meteor Jun 3, 2023, 5:12 PM

#

The thing with time series at least is that very very simple models (t = t-1) type things or exponential smoothing can outperform complex models

potent sky Jun 3, 2023, 5:12 PM

#

Yes

past meteor Jun 3, 2023, 5:12 PM

#

I think transformers will matter in my case is when I start doing long horizon with a large conditioning window

#

Because that's the space where basic models fall flat

merry roost Jun 3, 2023, 5:12 PM

#

potent sky Are the "track pieces" to be selected from a finite, discrete, pre-determined se...

Its trained on comunity maps and as I said later, it might be good for me to include one more value on there about the amount of track peices placed, so it can determin how far away the end should be

cold osprey Jun 3, 2023, 5:13 PM

#

if all tracks are fixed length, could just do fixed length seq to seq

potent sky Jun 3, 2023, 5:13 PM

#

Or when there is complex structure inherent in your sequence. Time series forecasting tasks work well with simpler methods, like extrapolating stock prices maybe.
But you'd be hard pressed to compete with transformers on say speech or language tasks

potent sky Jun 3, 2023, 5:14 PM

#

past meteor I think transformers will matter in my case is when I start doing long horizon w...

^

rose dagger Jun 3, 2023, 5:14 PM

#

cold osprey tensorflow on windows?

If i may ask: How does this come into play? I.e. why does it matter whether i use tensorflow and whether i'm on windows for training time?

cold osprey Jun 3, 2023, 5:14 PM

#

rose dagger If i may ask: How does this come into play? I.e. why does it matter whether i us...

tensorflow doesnt support gpu on windows anymore. would need wsl

foggy kestrel Jun 3, 2023, 5:14 PM

#

hey, not really a coding question but does anyone know where i can download the "tesseract executable"

rose dagger Jun 3, 2023, 5:15 PM

#

Ok, well i'm working on Kaggle notebook, so i should be good, right?

potent sky Jun 3, 2023, 5:15 PM

#

merry roost Its trained on comunity maps and as I said later, it might be good for me to inc...

What are these track pieces? Are they limited choices like {A, B, C} are they infinite choices like [1, INF) or are they a continuous variable...? Or smtg else?

cold osprey Jun 3, 2023, 5:15 PM

#

rose dagger Ok, well i'm working on Kaggle notebook, so i should be good, right?

yep, itll be linux

potent sky Jun 3, 2023, 5:15 PM

#

foggy kestrel hey, not really a coding question but does anyone know where i can download the ...

That's an OCR module ig check their project page?

merry roost Jun 3, 2023, 5:16 PM

#

potent sky What are these track pieces? Are they limited choices like {A, B, C} are they in...

there is about lets just say 10 peices of road, a start, a checkpoint, and a end peice. 13 possiable peices, then all rotations of those

potent sky Jun 3, 2023, 5:16 PM

#

Relent downloading executables from unauthorised kr untrusted sources

foggy kestrel Jun 3, 2023, 5:17 PM

#

potent sky That's an OCR module ig check their project page?

ur right, i did a scan of their github and of google's tesseract-ocr github but haven't found anything yet

potent sky Jun 3, 2023, 5:17 PM

#

merry roost there is about lets just say 10 peices of road, a start, a checkpoint, and a end...

Okay, so the track length will always be 13? Or it'll sometimes stop at 6?

foggy kestrel Jun 3, 2023, 5:17 PM

#

i will go through again thank you very much

potent sky Jun 3, 2023, 5:17 PM

#

Or 7 or 3

potent sky Jun 3, 2023, 5:18 PM

#

foggy kestrel ur right, i did a scan of their github and of google's tesseract-ocr github but ...

Is it not pip installable? That should give you the wheels

merry roost Jun 3, 2023, 5:18 PM

#

potent sky Okay, so the track length will always be 13? Or it'll sometimes stop at 6?

the length is generates is determined by when it places the end block, it is given the track length at the current point in time, and as part of the data its given to train on, it is given the amont of peices placed for it to determin when the end is

#

it only geneates one peice at at time so that should be fine

potent sky Jun 3, 2023, 5:18 PM

#

Yes but it isn't necessary for
each track generated to be 13 pieces long is it?

foggy kestrel Jun 3, 2023, 5:19 PM

#

potent sky Is it not pip installable? That should give you the wheels

yeah i installed it but it doesn't come with the executable and has to be seperately installed

#

kinda weird but i think i did find it, had to do some digging in the original tesseract-ocr engine page

merry roost Jun 3, 2023, 5:19 PM

#

potent sky Yes but it isn't necessary for each track generated to be 13 pieces long is it?

no, that is the possiable peices to select from, it can generate a track of any length given those peices

#

but based on the trained data, I want it to generate the end peice

potent sky Jun 3, 2023, 5:20 PM

#

This seems like you can just sample from two distributions, one containing you set of tracks and one to regulate when it ends. You can add a bias to tune it.
I don't think it requires ML but sure you can use it if you want

merry roost Jun 3, 2023, 5:20 PM

#

potent sky This seems like you can just sample from two distributions, one containing you s...

one sec let me rewrite this as a long thing

potent sky Jun 3, 2023, 5:21 PM

#

Look into sequence modelling, RNNs, etc. There should be tutorials on pytorch docs.
And remember, one of the most important parts of an ML problem is formulating the data and model inputs in the right manner. You could be stuck in a simple problem for ages if you don't do this right.
Don't rush to the modelling part, give data all the time it demands and you should be better off for it

potent sky Jun 3, 2023, 5:22 PM

#

foggy kestrel yeah i installed it but it doesn't come with the executable and has to be sepera...

huh that's weird.
You could build from source either way ig

potent sky Jun 3, 2023, 5:24 PM

#

potent sky This seems like you can just sample from two distributions, one containing you s...

If you want it to be like the tracks other players have generated, you can add those to a population and sample from that instead of sampling arbitrarily

merry roost Jun 3, 2023, 5:28 PM

#

I want to generate a track in a game consisting of only track peices, starts, ends, and checkpoints, lets just say this is then a array of those real in game object ids, mapped to 0-13, 0 being nothing and only occoring before the start block.

the tracks to train on will first have to start with being cleared or selected with only 0 boosters / external power to limit the direction downwards and away form the start block. This will allow us the then sort the track peices that are currently randomly placed in the file, into a neat set from start to end in a continual pattern.

This data then we use to train a model by taking a random peice from a random track of data, selecting that peice as the one to be generated, this can be anything but a start block (Start blocks will only ever exist once and will never be placed by the ai.) the ai then takes the flowing data about blocks:

1x -
current track length

20x -
type (mapped betwine 0-13)
position (x,y,z) (clamped to a 1/4 th grid tile)
rotation (x,y,z) (clamped to 45* increments)

this data is given to the model, who then has to guess the track peice that was selected. This will repeat over and over attempting to generate blocks in the positions that tracks have most relivent online.

this will not be verry accurate, but with enough training, it should be close enough.

Waveform colapse is not a good option as for things like jumps or the end it needs more information that is easier to provide to a ai model.

generated format:
type (mapped again)
position (clamped again)
rotation (clamped again)

merry roost Jun 3, 2023, 5:29 PM

#

potent sky If you want it to be like the tracks other players have generated, you can add t...

I think I wrote that better

merry roost Jun 3, 2023, 5:36 PM

#

potent sky Look into sequence modelling, RNNs, etc. There should be tutorials on pytorch do...

stargazer?

potent sky Jun 3, 2023, 5:50 PM

#

merry roost stargazer?

busy with some work

potent sky Jun 3, 2023, 5:50 PM

#

merry roost I want to generate a track in a game consisting of only track peices, starts, en...

What are jumps

merry roost Jun 3, 2023, 5:50 PM

#

potent sky busy with some work

ok, sorry just didnt know

potent sky Jun 3, 2023, 5:50 PM

#

No it's alright dw about it. I just check this when I can

merry roost Jun 3, 2023, 5:51 PM

#

potent sky What are jumps

its a car racing game thing, so you can jump spaces with enough speed

potent sky Jun 3, 2023, 5:52 PM

#

Why is this problematic for jumps

#

*for waveform collapse

merry roost Jun 3, 2023, 5:53 PM

#

potent sky Why is this problematic for jumps

for wafeform collapse, it normaly only checks the spaces sorrounding, and because jumps have multiple blank air spaces, we cant use that by its self

#

so you eithro make bigger setcions for waveform collapse

#

contaning multiple peices

#

or has to expand on waveform collase

potent sky Jun 3, 2023, 5:54 PM

#

But your task is only to create the track right? Why consider jumps

merry roost Jun 3, 2023, 5:54 PM

#

potent sky But your task is only to create the track right? Why consider jumps

thats part of the track

potent sky Jun 3, 2023, 5:55 PM

#

So there are jumps between certain pairs of track elements (say 2-7) and not between others?

#

Or jump is one of the 13 elements?

merry roost Jun 3, 2023, 5:55 PM

#

potent sky Or jump is one of the 13 elements?

the model, will not prodict the peice in a certain place, but rather predict a peice and a positoon

#

this makes jumps easily possiable

cold osprey Jun 3, 2023, 5:56 PM

#

the model doesnt necessarily need to predict piece and position

merry roost Jun 3, 2023, 5:56 PM

#

yes, but I would like it to

cold osprey Jun 3, 2023, 5:57 PM

#

just piece should be fine if u set it in such a way that the outputs are already in its designated position

merry roost Jun 3, 2023, 5:57 PM

#

yes, but that makes it so jumps cant be done

potent sky Jun 3, 2023, 5:57 PM

#

I feel there's a bunch of information here that isn't apparent to us as it is to you since we don't know the game you're working with

#

I believe you can try to go for sequence modelling using ML. If nothing else, the process of preparing and structuring the data for the model should help you gain a lot of clarity about the problem

merry roost Jun 3, 2023, 5:59 PM

#

potent sky I believe you can try to go for sequence modelling using ML. If nothing else, th...

here is the exact game i wanted to try to do it on
https://store.steampowered.com/app/1440670/Zeepkist/

Steam

Zeepkist

Zeepkist is a racing game for 1-4 players, or up to 64 online, in which players race down extreme downhill soapbox courses to set the best times possible!If you like weird physics, soapbox racing, and/or creating your own crazy tracks, then this is the game for you!🔸 Race against time itself in Adventure mode!🔸 Crash into your friends in 4-playe...

Price

$11.99

Recommendations

931

▶ Play video

#

Just remove all non track blocks

potent sky Jun 3, 2023, 5:59 PM

#

I do think this can be solved using some probability and statistics, without ML. But you can try it out and see

merry roost Jun 3, 2023, 5:59 PM

#

and use maps with no boostars

merry roost Jun 3, 2023, 6:00 PM

#

potent sky I do think this can be solved using some probability and statistics, without ML....

I dont think so, its a more difficult question

#

I guess I could use some sort of modified waveform collapse

#

but it would be diffuclt

cold osprey Jun 3, 2023, 6:00 PM

#

I'm thinking more like how a sentence is generated, previous words matter to the next word being generated

potent sky Jun 3, 2023, 6:00 PM

#

merry roost here is the exact game i wanted to try to do it on https://store.steampowered.co...

That's possible. I'm not familiar with the game (and so the problem) as you are

potent sky Jun 3, 2023, 6:01 PM

#

merry roost I dont think so, its a more difficult question

cold osprey Jun 3, 2023, 6:01 PM

#

From my brief reading about wave function collapse, I don't see why u don't wanna use it

merry roost Jun 3, 2023, 6:02 PM

#

cold osprey From my brief reading about wave function collapse, I don't see why u don't wann...

certain things like jumps would require structures made of multiple blocks, doing this would also mean checking multiple blocks and I just think that that would be harder

#

allong with the fact I want to learn about using pytorch

cold osprey Jun 3, 2023, 6:02 PM

#

Yeah like if a jump block has been selected for piece 5, piece 6 cannot be another jump right?

cold osprey Jun 3, 2023, 6:03 PM

#

merry roost allong with the fact I want to learn about using pytorch

I think if ure learning something for the first time (in this case pytorch), best to start with something simple too that is well documented on how to approach

potent sky Jun 3, 2023, 6:04 PM

#

Whether or not it can be solved without ML, it does look like something that can be usefully solved with ML. So if you want to use it as a project to dive into learning ML, go for it

merry roost Jun 3, 2023, 6:04 PM

#

It depends on how you generate this, and its hard to explain right now in short sentances but I belive ai is what I want for this ranther than wavefuntction clapse

merry roost Jun 3, 2023, 6:04 PM

#

potent sky Whether or not it can be solved without ML, it does look like something that can...

thats part of it, and I think the results will be cooler / better with ai than with wavefunction and me making it basicly all by hand

potent sky Jun 3, 2023, 6:05 PM

#

Look into sequence modelling

#

RNNs, GRUs, LSTMs and the like
Transformers might be overkill

#

Also look into some of the simpler mathematical sequence modelling functions before that. You can derive inspiration from them if nothing else

cold osprey Jun 3, 2023, 6:06 PM

#

Attention is all you need

merry roost Jun 3, 2023, 6:06 PM

#

potent sky RNNs, GRUs, LSTMs and the like Transformers might be overkill

yeah ok, I have to learn the diffrence betwine all them, and how I should do this in pytorch, but I think my explination earlier was pertty good about inputs and outputs

merry roost Jun 3, 2023, 6:06 PM

#

cold osprey Attention is all you need

what?

potent sky Jun 3, 2023, 6:06 PM

#

It refers to Transformers

cold osprey Jun 3, 2023, 6:06 PM

#

merry roost what?

Haha nothing it's a title of a paper on transformers

#

Or rather, the paper

potent sky Jun 3, 2023, 6:07 PM

#

A machine learning method for sequence modelling

potent sky Jun 3, 2023, 6:07 PM

#

merry roost yeah ok, I have to learn the diffrence betwine all them, and how I should do thi...

Mhm. The tutorials are pretty good imo

potent sky Jun 3, 2023, 6:08 PM

#

cold osprey Haha nothing it's a title of a paper on transformers

The paper xd

merry roost Jun 3, 2023, 6:08 PM

#

potent sky Mhm. The tutorials are pretty good imo

previous input and the length of the input is not fixed

I planed for them to be fixed, should I just ignore that part

#

only more recient track peices effect the outcome

cold osprey Jun 3, 2023, 6:08 PM

#

Start with what's simpler and easier to do

#

U can always build from there

sleek harbor Jun 3, 2023, 6:08 PM

#

is it just me or does it make more sense to use permutation_importance instead of fearure_importance_ or coef_ for the importance_getter of SelectFromModel? (sklearn)

potent sky Jun 3, 2023, 6:10 PM

#

merry roost `previous input and the length of the input is not fixed` I planed for them to ...

Yeah maybe start with simpler pieces to get a better idea

merry roost Jun 3, 2023, 6:10 PM

#

potent sky Yeah maybe start with simpler pieces to get a better idea

I was just gonna fix the length of the thing and only supply 20 last blocks

potent sky Jun 3, 2023, 6:10 PM

#

Also look into autoregression

errant bison Jun 3, 2023, 6:14 PM

#

would be soo helpful if u provide the yt link for yolo + ocr.

potent sky Jun 3, 2023, 6:15 PM

#

errant bison would be soo helpful if u provide the yt link for yolo + ocr.

I-
Nvm there you go:
https://youtu.be/FKGtdSJu3X4

Your exact project, have fun lol

YouTube

Theos AI

Real-time License Plate Recognition with YOLOv7 + OCR in Google Col...

🥳 Sign up now for free: https://theos.ai

👋🏻 Join our discord server: https://discord.gg/CKYYExqMuP

✅ Join our WhatsApp group: https://chat.whatsapp.com/CzlqpwU9rID3rCg0kWq9Gu

🚘 License Plate Detection Tutorial Video: https://www.youtube.com/watch?v=GVLUVxTpqG0

✅ Google Colab Notebook: https://colab.research.google.com/drive/1LbbTUXzgYT7dn3lQ...

▶ Play video

errant bison Jun 3, 2023, 6:17 PM

#

potent sky I- Nvm there you go: https://youtu.be/FKGtdSJu3X4 Your exact project, have fun ...

this uses some theos api

potent sky Jun 3, 2023, 6:18 PM

#

errant bison this uses some theos api

What's the issue with that

merry roost Jun 3, 2023, 6:18 PM

#

Oh last thing, is 141 inputs a good amount, is it large or small, also how many hidden layers / nodes should I have? Rember only 7 outputs.

errant bison Jun 3, 2023, 6:23 PM

#

potent sky What's the issue with that

but that would simply not be training with yolo right..? thanks but

potent sky Jun 3, 2023, 6:26 PM

#

errant bison but that would simply not be training with yolo right..? thanks but

Fair enough
Look, your whole solution is neatly divided into 2 models
YOLO to detect and extract the license plate
And OCR to convert that to digital text
Just look up a yolo tutorial even without OCR and you should be fine
There are tons of yolo training tutorials. I'm a little busy rn so I can't search but it should be easy enough to find

merry roost Jun 3, 2023, 6:27 PM

#

merry roost Oh last thing, is 141 inputs a good amount, is it large or small, also how many ...

@potent sky what do you think?

potent sky Jun 3, 2023, 6:27 PM

#

merry roost Oh last thing, is 141 inputs a good amount, is it large or small, also how many ...

It really depends on the problem. To get clarity about things like this is partly why I suggested you go for it.
Think about what information the input carries, what output you want, how much information is relevant and necessary, how much feature extraction you need etc

#

Have a meeting now gtg

merry roost Jun 3, 2023, 6:27 PM

#

potent sky Have a meeting now gtg

ttyl

rose dagger Jun 3, 2023, 6:44 PM

#

In a convolutional layer with 3x3 filter, why should the number of channels increase to 64? I understand that due to the filter being 3x3 a 572x572 image is mapped to a 570x570 image, but how come we now get 64 channels instead of just 1? (This is a snapshot from the U-Net architecture)

mild dirge Jun 3, 2023, 6:48 PM

#

Because we don't have one 3x3 kernel, but we have 64 independent 3x3 kernels

#

Each generating a new image that is 570x570

#

That get stacked together

#

@rose dagger

#

And only in the first to second layer is the kernel actually 3x3(x1) because the input image has 1 channel

#

In the second one the kernel is actually 3x3x64

rose dagger Jun 3, 2023, 6:51 PM

#

Oh i see. Thank you. Then in the remaining encoding block (left side), do we then have a 3x3x2 kernel in the second part (since we go from 64 to 128) or a 3x3x128 kernel?

mild dirge Jun 3, 2023, 6:52 PM

#

No, each kernel shifts over the entire input image from left to rigth, and top to bottom, because it's a 2d convolution

#

So when you go from 64 depth to 128, you have 128 kernels that each are 3x3x64

#

As each kernel will generate a single image

rose dagger Jun 3, 2023, 6:53 PM

#

Ok, now i understand what you mean. Thank you, that makes more sense!

crimson summit Jun 3, 2023, 7:46 PM

#

This is my first time building my own neural network from scratch I just wrote the training part if anybody sees anything wrong with it feel free to let me know. It is a 3 layer 3 neuron in each layer neural network.

foggy kestrel Jun 3, 2023, 7:49 PM

#

trying to use Voice_Cloning package, this error comes back:

Traceback (most recent call last):
  File "c:\Users\Code\Documents\GitHub\Test\ref.py", line 12, in <module>
    from voice_cloning.generation import *
  File "C:\Users\Code\AppData\Local\Programs\Python\Python310\lib\site-packages\voice_cloning\generation.py", line 27, in <module>
    from encoder import inference as encoder
ModuleNotFoundError: No module named 'encoder'

Looking at Voice_Cloning, inference.py is a script within the encoder folder, which is on the same directory level as generation.py
Is there a way I can just modify this import statement so that it imports the file correctly?

#

this may not be the right chat for this so if someone could direct me to the right chat that would be helpful as well

sterile wyvern Jun 3, 2023, 8:20 PM

#

How often should you retrain your model? Generally, lets say you trian and test on time serries data 70/30 split in days. After you deploy you would forward test for 30 days then retrain?

queen cradle Jun 3, 2023, 8:46 PM

#

merry roost I want to generate a track in a game consisting of only track peices, starts, en...

Everyone has suggested fancy generative models. But let me suggest a simple one: A Markov model. In the simplest Markov model, you track the last block that was placed. For each of these, you use your training data to find the probability distribution of next blocks. To generate a new track, you pick blocks one at a time: The initial state is the start block; you randomly pick a next block from the distribution of blocks that follow the start block; then you randomly pick a next block, and so on. One of your blocks should be an "end of track" block (maybe this is an actual block, or maybe you stick it onto the end of each track in your data); when you generate the end of track block, your track is over.

merry roost Jun 3, 2023, 8:47 PM

#

queen cradle Everyone has suggested fancy generative models. But let me suggest a simple one:...

not exactly what I had in mind but, I wanted it to get the position ect too

#

so kinda diffrent

queen cradle Jun 3, 2023, 8:47 PM

#

You can add extra information to the state space.

#

There's a trade-off between how detailed your state space is and how much training data you have.

#

Sometimes it helps to reparametrize (e.g., maybe there's a way to use relative positions?).

#

You can also create a hierarchical model. The traditional example of this is a hidden Markov model. In these, your states don't correspond to blocks. Your states are something abstract with no well-defined meaning. However, your states also have an "output distribution," which is a probability distribution over blocks. At each step, you pick a new state; using the output distribution you pick a block. Then you pick a new state (which depends on the current state but not on the block you just placed), and so on.

#

Another option is to use a higher-order Markov model, where the next block depends not just on the current block but on the current and previous blocks.

#

Markov models are not as strong as fancier and trendier models. Their advantages are that they require less data, are faster, are easier to implement, and their training has fewer gotchas.

copper crow Jun 3, 2023, 8:59 PM

#

hi guys, I have a question
I want to create a Python Tkinter application for plotting crypto charts. Do you have any idea what would be the best library for this?

regal vault Jun 3, 2023, 10:01 PM

#

no matter how i hard i try i cant impliment my code so it runs on the gpu
do you gus know any good wrappers or libraries to run on gpu
numba dosent work becuase it dosent support a lot of things i use
like child inheartence and such

hasty mountain Jun 3, 2023, 10:06 PM

#

regal vault no matter how i hard i try i cant impliment my code so it runs on the gpu do you...

Tensorflow and Pytorch

regal vault Jun 3, 2023, 10:07 PM

#

will it work in a project where I use differnt classes and such

#

all classes i made using no external libriaires

#

@hasty mountain

hasty mountain Jun 3, 2023, 10:09 PM

#

Pytorch is a framework that loves classes

regal vault Jun 3, 2023, 10:10 PM

#

k

hasty mountain Jun 3, 2023, 10:10 PM

#

In fact, I had to learn how they work so I could use Pytorch

regal vault Jun 3, 2023, 10:10 PM

#

i see

#

in my case i have a project where im making a 3d render and would like it to run on the gpu instead of the cpu

#

*raytracing

#

only thing im worried about is that a lot of these programs are ml based

hardy depot Jun 3, 2023, 10:24 PM

#

guys im a student and wanna do a good ai course , not a beginner

#

but all the courses in coursera and udacity with certificates are expensive asf, and i already have two courses from udemy so do u guys know any places ican get a cheap course?

simple tapir Jun 3, 2023, 10:28 PM

#

Does machine learning or deep learning come first, when it's willed to go through this field and learner is beginner?

agile cobalt Jun 3, 2023, 10:30 PM

#

deep learning is an area of machine learning that uses neural networks

simple tapir Jun 3, 2023, 10:31 PM

#

So I better take ml courses first then dl courses?

#

@agile cobalt .

serene scaffold Jun 3, 2023, 10:57 PM

#

hardy depot but all the courses in coursera and udacity with certificates are expensive asf,...

no one's going to care about AI/ML certificates from those websites anyway, but there's a plethora of free content on youtube.

#

you're a student. at university? can you take an AI course?

serene scaffold Jun 3, 2023, 10:58 PM

#

simple tapir Does machine learning or deep learning come first, when it's willed to go throug...

You can think of it as "deep learning is part of machine learning, and machine learning is part of AI"

simple tapir Jun 3, 2023, 10:59 PM

#

I see

#

I've a very basic knowledge of machine learning and I think I could learn some deep learning without any issue

serene scaffold Jun 3, 2023, 11:00 PM

#

simple tapir I've a very basic knowledge of machine learning and I think I could learn some d...

you will have issues.

simple tapir Jun 3, 2023, 11:00 PM

#

oh dang

serene scaffold Jun 3, 2023, 11:00 PM

#

to learn is to suffer.

#

but in all seriousness, machine learning and deep learning take a long time to understand. that's why you can make a lot of money once you do.

simple tapir Jun 3, 2023, 11:01 PM

#

I'm in my first year at university and studying computer science and engineering. Next year, i'll take artifical intelligence lecture but I'm willing to go through this field on my own aswell to improve myself. Would it be waste of time to take some machine learning classes online?

serene scaffold Jun 3, 2023, 11:02 PM

#

simple tapir I'm in my first year at university and studying computer science and engineering...

what courses are you taking right now? and what math courses will you have taken by th etime you start the AI course(s)?

#

(when I say "course", that might be what you call a "module")

simple tapir Jun 3, 2023, 11:03 PM

#

I've already taken Pytorch for deep learning and machine learning and got no problem at all. But it wasn't that theoric

serene scaffold Jun 3, 2023, 11:03 PM

#

your university teaches a course that's specifically about pytorch?

simple tapir Jun 3, 2023, 11:03 PM

#

nope, I took it online

#

not from my uni

#

In the first semester, we took calculus 1 and this semester we have calculus 2 classes

serene scaffold Jun 3, 2023, 11:05 PM

#

will you be taking linalg?

simple tapir Jun 3, 2023, 11:06 PM

#

yes

serene scaffold Jun 4, 2023, 12:04 AM

#

what was the loss for the first epoch?

#

how many epochs did you do?

#

hundreds, I see. what does this model do?

#

hmm, okay

#

anyway, it's hard to say if a given loss is "normal" or not

#

what you really care about is how it changes between epochs.

severe topaz Jun 4, 2023, 1:32 AM

#

#

i am trying to optimize a plan which reflects the contemporary skills needed...

bl2iqSAwKQAEoAAWgABSAAlCg4gp85rOfpv8Hl5ZxLIZRLEAAAAASUVORK5CYII.png

#

let me know what you guys think

tidal scroll Jun 4, 2023, 2:06 AM

#

hi guys, want to ask about naive bayes method processing, I have pre process every data and drop unused column but when it comes to detecting outliers with Z Score or IQR my result is empty or rather NaN, do you guys have idea why the result like that?

versed heron Jun 4, 2023, 3:34 AM

#

any reputable guides on ML to train an AI that can be used within a python script?

serene scaffold Jun 4, 2023, 3:38 AM

#

versed heron any reputable guides on ML to train an AI that can be used within a python scrip...

It's impossible to answer unless you specify what kind of ai. What do you want the AI to do?

versed heron Jun 4, 2023, 3:38 AM

#

serene scaffold It's impossible to answer unless you specify what kind of ai. What do you want t...

right, my bad

#

detect car plates (then, OCR)
and
see if a plant is a "bad" or "good" plant

#

like growing well or not, prolly needs some supervised training im guessing

serene scaffold Jun 4, 2023, 3:40 AM

#

You'd need a dataset of healthy and unhealthy plant images, yes

#

Though I think that would be difficult for a model to learn

#

Unless there's some visual property shared by all unhealthy plants

versed heron Jun 4, 2023, 3:41 AM

#

serene scaffold Unless there's some visual property shared by all unhealthy plants

probably is i believe

#

like if they're straight or not

serene scaffold Jun 4, 2023, 3:41 AM

#

Guess I'm an unhealthy plant

versed heron Jun 4, 2023, 3:42 AM

#

lmao

serene scaffold Jun 4, 2023, 3:45 AM

#

Anyway, I wouldn't follow any tutorials on towards data science. Those tend to be trash tier.

versed heron Jun 4, 2023, 3:47 AM

#

serene scaffold Anyway, I wouldn't follow any tutorials on towards data science. Those tend to b...

what would you recommend then?

#

i need some material to start lawl

potent sky Jun 4, 2023, 7:06 AM

#

machinelearningmastery is a good website

#

imo towardsdatascience has some quality write-ups.
But as a beginner if you don't know your way around it can be easy to get into the bad articles on there (and there are many of them) and consequently adopt wrong understanding, bad ways of approaching a problem etc. which can be difficult to unlearn.
So I agree with Stel here

potent sky Jun 4, 2023, 7:12 AM

#

serene scaffold Anyway, I wouldn't follow any tutorials on towards data science. Those tend to b...

Do you not think it's a useful resource?
It takes some filtering but I find quality write-ups on there sometimes

wooden sail Jun 4, 2023, 8:49 AM

#

what you both say is my general experience with it. you can certainly find very good content there sporadically, but there is poor quality control at best

past meteor Jun 4, 2023, 8:52 AM

#

i don't think there's any quality control. Someone I know writes for TWDS and honestly she started writing there when she was learning about data science

#

So her intentions were good but the things were just not correct as you would expect from someone beginning to learn anything

hasty mountain Jun 4, 2023, 2:57 PM

#

I'd say to prefer to search for tutorials in the docs of the frameworks you're using. Tensorflow/Keras and Pytorch got some interesting tutorials.

You can use Towards Data Science articles, but...eh...be careful. Usually the folks that write there also has a small bio. If you see someone that at least seems to understand ML, that could be a good start

#

The best tutorial I found about Variational AutoEncoders was in Towards Data Science, and it was written by an AI Engineer from Meta

dull flare Jun 4, 2023, 4:34 PM

#

hloww e_skulllaugh

#

#

ValueError                                Traceback (most recent call last)
<ipython-input-41-8236c67b5777> in <cell line: 15>()
     13                metrics = ["accuracy"])
     14 
---> 15 history = model4.fit(tf.expand_dims(x,axis = -1),y,epochs = 100,verbose = 0)

1 frames
/usr/local/lib/python3.10/dist-packages/keras/engine/training.py in tf__train_function(iterator)
     13                 try:
     14                     do_return = True
---> 15                     retval_ = ag__.converted_call(ag__.ld(step_function), (ag__.ld(self), ag__.ld(iterator)), None, fscope)
     16                 except:
     17                     do_return = False

ValueError: in user code:

    File "/usr/local/lib/python3.10/dist-packages/keras/engine/training.py", line 1284, in train_function  *
        return step_function(self, iterator)
    File "/usr/local/lib/python3.10/dist-packages/keras/engine/training.py", line 1268, in step_function  **
        outputs = model.distribute_strategy.run(run_step, args=(data,))
    File "/usr/local/lib/python3.10/dist-packages/keras/engine/training.py", line 1249, in run_step  **
        outputs = model.train_step(data)
    File "/usr/local/lib/python3.10/dist-packages/keras/engine/training.py", line 1051, in train_step
        loss = self.compute_loss(x, y, y_pred, sample_weight)
    File "/usr/local/lib/python3.10/dist-packages/keras/engine/training.py", line 1109, in compute_loss
        return self.compiled_loss(
    File "/usr/local/lib/python3.10/dist-packages/keras/engine/compile_utils.py", line 265, in __call__
        loss_value = loss_obj(y_t, y_p, sample_weight=sw)
    File "/usr/local/lib/python3.10/dist-packages/keras/losses.py", line 142, in __call__
        losses = call_fn(y_true, y_pred)
    File "/usr/local/lib/python3.10/dist-packages/keras/losses.py", line 268, in call  **
        return ag_fn(y_true, y_pred, **self._fn_kwargs)
    File "/usr/local/lib/python3.10/dist-packages/keras/losses.py", line 2156, in binary_crossentropy
        backend.binary_crossentropy(y_true, y_pred, from_logits=from_logits),
    File "/usr/local/lib/python3.10/dist-packages/keras/backend.py", line 5707, in binary_crossentropy
        return tf.nn.sigmoid_cross_entropy_with_logits(

    ValueError: `logits` and `labels` must have the same shape, received ((None, 2, 1) vs (None,)).

potent sky Jun 4, 2023, 4:44 PM

#

Yep but docs tutorials are fully code oriented. You preferably need math too. That's where twds comes in sometimes. Machinelearningmastery otherwise, pretty reliable

potent sky Jun 4, 2023, 4:44 PM

#

hasty mountain I'd say to prefer to search for tutorials in the docs of the frameworks you're u...

^^

crimson summit Jun 4, 2023, 5:06 PM

#

I am trying to understand why some people square the cost function of a neural network and some people dont square it. It seems to me that if you square the error when you are traning the network it will overcorrect because the error will be bigger that what it actually is

mild dirge Jun 4, 2023, 5:07 PM

#

You mean the loss function L = (f(x) - y) ^ 2? @crimson summit

#

That is because you want to minimize the function, so the minimum would be if f(x) and y are the same. And if there is a difference between the two (positive or negative) then it should be larger than 0. That way minimzing this function gives the best results.

#

And also remember that we have a learning rate that we use for correcting the weights, which should be set low enough to not overcorrect.

past meteor Jun 4, 2023, 5:14 PM

#

There's a bunch of reasons and the ones listed above are definitely part of them

#

Sometimes you also just don't want large errors so squaring it makes total sense. There's other loss functions that don't do this.

crimson summit Jun 4, 2023, 5:15 PM

#

mild dirge That is because you want to minimize the function, so the minimum would be if `f...

wouldnt you be making the error bigger if you square it not minimizing it ?

mild dirge Jun 4, 2023, 5:16 PM

#

It would mean that larger errors are more heavily penalized than smaller errors yes

#

You can also have absolute difference as loss pretty sure

#

Or smooth l1 loss is another one

past meteor Jun 4, 2023, 5:16 PM

#

Yup you can

crimson summit Jun 4, 2023, 5:16 PM

#

mild dirge It would mean that larger errors are more heavily penalized than smaller errors ...

oh oh makes sense

mild dirge Jun 4, 2023, 5:17 PM

#

past meteor Jun 4, 2023, 5:17 PM

#

You can also just predict the log of Y, that's a common trick

mild dirge Jun 4, 2023, 5:17 PM

#

Here's l1 f(x) - y, l2 (f(x) - y)^2 and smooth l1 (which is a bit more complicated)

#

As long as it's differentiable and continuous it can be used pretty much

past meteor Jun 4, 2023, 5:17 PM

#

I'm not sure but I think MSE is just a tradition that is carried over from statistics

crimson summit Jun 4, 2023, 5:18 PM

#

mild dirge Here's l1 `f(x) - y`, l2 `(f(x) - y)^2` and smooth l1 (which is a bit more compl...

what does that do diffrently than just squaring

mild dirge Jun 4, 2023, 5:18 PM

#

Which one?

#

the smooth l1?

crimson summit Jun 4, 2023, 5:18 PM

#

ya

#

both i guess

past meteor Jun 4, 2023, 5:18 PM

#

In statistics minimizing the sum of squared errors is equivalent to maximizing the likelihood, which has certain good properties.

potent sky Jun 4, 2023, 5:18 PM

#

past meteor There's a bunch of reasons and the ones listed above are definitely part of them

This ^^

mild dirge Jun 4, 2023, 5:18 PM

#

It's almsot like a mix of l1 and l2, when close to 0 it behaves like l2, and further from 0 it's basically linear

#

As to not penalize very large errors too much

past meteor Jun 4, 2023, 5:19 PM

#

But penalizing large errors can be really bad

potent sky Jun 4, 2023, 5:19 PM

#

past meteor Sometimes you also just don't want large errors so squaring it makes total sense...

This*

past meteor Jun 4, 2023, 5:19 PM

#

Look at: huber loss for example

#

Selecting loss functions and models is something you can / need to do based on your "knowledge" of the problem. If you're worried about large errors ruining you, you should be looking at techniques from robust regression

crimson summit Jun 4, 2023, 5:21 PM

#

mild dirge It's almsot like a mix of l1 and l2, when close to 0 it behaves like l2, and fur...

so its just kind of like a standard practice that works on a wide range of situations

#

would you square the cost of the hidden layer aswell or only the final layer in a 3 layer neural network ?

past meteor Jun 4, 2023, 5:25 PM

#

mild dirge Here's l1 `f(x) - y`, l2 `(f(x) - y)^2` and smooth l1 (which is a bit more compl...

wdym with this?

hasty mountain Jun 4, 2023, 5:38 PM

#

past meteor In statistics minimizing the sum of squared errors is equivalent to maximizing t...

Oh, so this explains the MSE for Variational AutoEncoders... pithink

#

Though I admit I'm really enjoying the Gaussian Likelihood because it appears to me more accurate...and more interesting...all that thing of the Decoder having to predict the most likely value between an infinite range of possibilities...

short moth Jun 4, 2023, 5:48 PM

#

Where can i start learning AI with python?

wooden sail Jun 4, 2023, 6:23 PM

#

past meteor wdym with this?

from the plot and comments above, seems like some discussion on L2 ignoring small errors unlike L1, and L1 not being differentiable at 0. i would mention that it's subdifferentiable though, and most autodiff libs use a subderivative of 0 or 1 at 0

past meteor Jun 4, 2023, 6:24 PM

#

It looked like they were equating f(x) - y to L1

wooden sail Jun 4, 2023, 6:24 PM

#

ah that's what you mean

mild dirge Jun 4, 2023, 6:25 PM

#

Oh I forgot the abs there yeah

past meteor Jun 4, 2023, 6:30 PM

#

smooth L1 is new to me though. Initially I thought it was just ML people renaming elasticnet but it's something else

wooden sail Jun 4, 2023, 6:30 PM

#

it's something else indeed

mild dirge Jun 4, 2023, 6:31 PM

#

https://pytorch.org/docs/stable/generated/torch.nn.SmoothL1Loss.html

wooden sail Jun 4, 2023, 6:31 PM

#

you see it in many places though. gradient-based methods are nice because for well-behaved functions, you can find local minima

mild dirge Jun 4, 2023, 6:31 PM

#

You can find the formula here, saw it used for a reinforcement learning project

wooden sail Jun 4, 2023, 6:31 PM

#

whenever you have good reason to use a non-differentiable cost but also want to use gradient methods, smooth approximations are interesting

#

stuff like softmax falls here when used as a smooth argmax

past meteor Jun 4, 2023, 6:34 PM

#

Also quite similar to Huber loss I see.

wooden sail Jun 4, 2023, 6:35 PM

#

ah, that does appear to be the case

keen gust Jun 4, 2023, 6:52 PM

#

@potent sky another question for you, so I have my streamlit app up and running and I'm having an issue w/ the st cache data ttl. It's set to 1 hour but it doesn't actually clear the cache after an hour. It's still loading the same df from last night but when I edit ttl to a few seconds and test this change locally, it clears just fine. Is the ttl only valid while the app is actually in use? I was assuming if I close it and reopen the next day that it would be cleared on rerun but maybe I misunderstand how that works

potent sky Jun 4, 2023, 7:02 PM

#

Wdym by close it and reopen? Are you shutting down the program? Streamlit cache is persisted on disk too iirc so it could repopulate if you're shutting the program and restarting it later, but this will reset the timer

hasty mountain Jun 4, 2023, 7:03 PM

#

Phew... Finally managed to make a functional VAE...
now...onward to ~~creating abominations~~ have some fun with the architecture brainmon

potent sky Jun 4, 2023, 7:03 PM

#

VAEs are fun

#

What're you using it for, LDMs?

hasty mountain Jun 4, 2023, 7:04 PM

#

I want to make an experiment with GANs using latent vectors

#

The idea is to try using a GAN to create latent vectors rather than creating an entire image.
An idea that came to me after seeing the latent diffusion idea, which applies diffusion into a latent vector to make an image

#

Oh wait... LDM = Latent Diffusion Model, right?

#

So...almost for that pithink

potent sky Jun 4, 2023, 7:07 PM

#

hasty mountain The idea is to try using a GAN to create latent vectors rather than creating an ...

Ooh we actually do this in RL-GAN-NET iirc

#

Very interesting paper, look it up if you want

potent sky Jun 4, 2023, 7:08 PM

#

hasty mountain Oh wait... LDM = Latent Diffusion Model, right?

Yep, a class of models

hasty mountain Jun 4, 2023, 7:08 PM

#

potent sky Ooh we actually do this in RL-GAN-NET iirc

Aw... Then they did it before me grumpchib
The idea was exactly train a GAN on latent vector and then try to make a GAN-RL

potent sky Jun 4, 2023, 7:08 PM

#

2019 ICLR I think

potent sky Jun 4, 2023, 7:09 PM

#

hasty mountain Aw... Then they did it before me <:grumpchib:552214257148887060> The idea was e...

Lmao this happens a lot, I relate with you. Feels like every good idea under the sun that strikes you has been done before

#

I was just beginning work on LDMs for music/audio when they published AudioLDM this year Feb I think

#

I think it's still worth trying tho, you might get a different idea to solving the problems you encounter

hasty mountain Jun 4, 2023, 7:12 PM

#

Yes. I'll take a look.
Maybe I could at least make something more simpler/cheaper and get an average performance, since those papers usually go for absurd things...

#

Hm... They didn't use PPO for it brainmon

#

Thanks for the recommendation!

potent sky Jun 4, 2023, 7:19 PM

#

hasty mountain Hm... *They didn't use PPO for it* <:brainmon:439516188771483658>

I was planning to-

hasty mountain Jun 4, 2023, 7:21 PM

#

Go for it, then.
My university vacation will end soon, so I may take a while to work on it yert

#

Maybe you'll give me some inspiration

plain jungle Jun 4, 2023, 7:44 PM

#

Follow up on my RNN from scratch

dull pike Jun 4, 2023, 8:23 PM

#

I have a question

#

I have some pervious programming knowledge like I know the basics of python so I was wondering if I should get this course first: https://www.udemy.com/course/100-days-of-code/ or just find a course that is specific to machine learning and get into it right away

Udemy

100 Days of Code: The Complete Python Pro Bootcamp for 2023

Master Python by building 100 projects in 100 days. Learn data science, automation, build websites, games and apps!

#

I think if i dive into a course that's specific to machine learning it would be way harder to get finish/get into

past meteor Jun 4, 2023, 8:26 PM

#

That's a good course to get a baseline understanding of Python, which will definitely help if you go towards ML later on

keen gust Jun 4, 2023, 8:31 PM

#

potent sky Wdym by close it and reopen? Are you shutting down the program? Streamlit cache ...

meant when a user just closes the web page for the app. That explains it though. Thought that this is what was occurring

molten atlas Jun 4, 2023, 9:46 PM

#

I finally got through the ChatGPT noise and found a book that goes beyond Prompt Engineering and talks about OpenAI API integration

https://www.amazon.com/dp/1805123335

Modern Generative AI with ChatGPT and OpenAI Models: Leverage the c...

Modern Generative AI with ChatGPT and OpenAI Models: Leverage the capabilities of OpenAI's LLM for productivity and innovation with GPT3 and GPT4

agile cobalt Jun 4, 2023, 9:50 PM

#

a book for that sounds like a waste to me? specially at this point in time in which things are still moving ultra fast, to the point that something from 6 months ago may already be outdated

plain jungle Jun 4, 2023, 10:59 PM

#

molten atlas I finally got through the ChatGPT noise and found a book that goes beyond Prompt...

Agreed, it would be more beneficial to learn how transformers work, rather than how a specific transformer reacts

vale idol Jun 4, 2023, 11:24 PM

#

Hey everyone, I'm looking for some help to connect different dataframes using pandas for a uni project I am woring on. If anyone has experience here and can help please reach out, thanks in advance 🙂

mild dirge Jun 4, 2023, 11:27 PM

#

If the question doesn't require hours of guidance, it's probably best to just directly ask it here so people can inmediatly answer. People generally don't dm to find out what the question even is 😛

vale idol Jun 4, 2023, 11:30 PM

#

Yeah that's on me hahah, a bit desperate to find a solution so forgot to provide details 😆

#

So have 3 differnet dataframes that contain 4 simmilar varibles which are a yearly time series data for companies (multiple comanies can have multiple scores). What I've been trying to do here is make a function that assigns a label (high,low,mid) every year for each company depending if its value is below or above a certain quantile and store it in a seperate column. Don't have a lot of experience with python and couldn't really find a simmilar issue on stackoverflow

#

vale idol Jun 4, 2023, 11:39 PM

#

vale idol

Ignore the last two lines since its copy pasted

uncut ember Jun 5, 2023, 12:04 AM

#

vale idol So have 3 differnet dataframes that contain 4 simmilar varibles which are a year...

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.applymap.html

vale idol Jun 5, 2023, 12:11 AM

#

uncut ember https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.appl...

Would this work even if I have to iteratively (every year) write all the labels to a single column in the dataframe? Additionally, I'm using a dataframe as an input where as I've only encountered applymap being used with dictionaries or lists as input

uncut ember Jun 5, 2023, 12:16 AM

#

I think this work

agile cobalt Jun 5, 2023, 12:17 AM

#

I don't think that there's a need for apply/applymap at all?

agile cobalt Jun 5, 2023, 12:17 AM

#

vale idol So have 3 differnet dataframes that contain 4 simmilar varibles which are a year...

are the variables similar or exact the same for each dataframe though? (same metrics / column names for different values, or actually different columns in each df)

vale idol Jun 5, 2023, 12:21 AM

#

agile cobalt are the variables _similar_ or exact _the same_ for each dataframe though? (same...

metrics are different, the lists before the function lists the relevant columns and because some of the values are inverted (higer values are worse instead of other way around) I know I'll have to take that in consideration when making the labels

agile cobalt Jun 5, 2023, 12:21 AM

#

oh, this might be useful (from googling pandas map quantile)

#

!d pandas.qcut

arctic wedgeBOT Jun 5, 2023, 12:21 AM

#

pandas.qcut


pandas.qcut(x, q, labels=None, retbins=False, precision=3, duplicates='raise')```
Quantile-based discretization function.

Discretize variable into equal-sized buckets based on rank or based on sample quantiles. For example 1000 values for 10 quantiles would produce a Categorical object indicating quantile membership for each data point.

vale idol Jun 5, 2023, 12:23 AM

#

agile cobalt oh, this might be useful (from googling `pandas map quantile`)

I'll look into this, thanks!

agile cobalt Jun 5, 2023, 12:24 AM

#

without it you could do some tricks to get which quantile each record fits into, but that function seems to just do it for you with a much simpler api than check which bucket each record fits yourself

vale idol Jun 5, 2023, 1:00 AM

#

agile cobalt without it you could do some tricks to get which quantile each record fits into,...

Thanks a lot for the suggestion, looks like this should solve the issue! Just a quick follow up, I noticed that since I create a temp dataframe that gets the yearly data and use it to assign values to the original I get nothing. Do I need to use a sepperate function like df[].apply to do this?

serene scaffold Jun 5, 2023, 1:01 AM

#

vale idol Thanks a lot for the suggestion, looks like this should solve the issue! Just a ...

share code as text. not screenshots

#

!code

arctic wedgeBOT Jun 5, 2023, 1:01 AM

#

Formatting code on discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

vale idol Jun 5, 2023, 1:03 AM

#

# Generate yearly rankings labels for each provider based on ESG score
top_quantile_30 = 0.3
bottom_qunatile_30 = 0.7
top_quantile_10 = 0.1
bottom_qunatile_10 = 0.9

years = [2013, 2014, 2015, 2016, 2017, 2018, 2019]
reprisk_scores = ['peak_yearly_RRI', 'yearly_environmental_score', 'yearly_social_score', 'yearly_governance_score']
sustainalytics_scores = ['total_esg_score', 'environment_score', 'social_score', 'governance_score']
capitaliq_scores = ['ESG_score', 'Environmental_score', 'Social_score', 'Governance_score']

labels_30th_p = ['LScores', 'LScores', 'LScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'HScores', 'HScores', 'HScores']
labels_30th_p = ['LScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'MScores', 'HScores']

def get_score_rankings(df, score_type):
    #Match relevant provider with correct score type!
    #Rankings for top/bottom 30%
    df['ESG_measure_sorts_30'] = ''
    df['env_measure_sorts_30'] = ''
    df['gov_measure_sorts_30'] = ''
    df['soc_measure_sorts_30'] = ''
    #Rankings for top/bottom 10%
    df['ESG_measure_sorts_10'] = ''
    df['env_measure_sorts_10'] = ''
    df['gov_measure_sorts_10'] = ''
    df['soc_measure_sorts_10'] = ''

    for year in years:
        yearly_df = df.loc[df['year'] == year, ['isin'] + score_type]
        for score in score_type:
            if score == 0:
                df['ESG_measure_sorts_30'] = pd.qcut(yearly_df[score], q=10, labels=labels_30th_p)
            if score == 1:
                df['env_measure_sorts_30'] = pd.qcut(yearly_df[score], q=10, labels=labels_30th_p)
            if score == 2:
                df['gov_measure_sorts_30'] = pd.qcut(yearly_df[score], q=10, labels=labels_30th_p)
            if score == 3:
                df['soc_measure_sorts_30'] = pd.qcut(yearly_df[score], q=10, labels=labels_30th_p)

get_score_rankings(sustainalytics, sustainalytics_scores)

agile cobalt Jun 5, 2023, 1:07 AM

#

vale idol ```py # Generate yearly rankings labels for each provider based on ESG score top...

when you do df[...] = ... or series[...] = ... in pandas, it tries to align the index of the objects for you

#

idk how you are creating each dataframe so I have no idea what their indexes look like, but that could be an issue

#

oh wait pithink

#

why the ['isin']?

vale idol Jun 5, 2023, 1:10 AM

#

Right after the for loop you can see where I define a temp dataframe which just get the data for each year and the relevant columns I need quantiles for. The dataframe I want to assign them to has the data for the entire range of the years

agile cobalt Jun 5, 2023, 1:11 AM

#

hmm ok it sounds like you are iterating over a list of strings and checking if the value is equal to a ~~string~~ number?

vale idol Jun 5, 2023, 1:11 AM

#

agile cobalt why the `['isin']`?

This is because I knew indexing would be an issue and is an identifier for a company

agile cobalt Jun 5, 2023, 1:11 AM

#

# ['total_esg_score', 'environment_score', 'social_score', 'governance_score']
for score in score_type:
    if score == 0:

agile cobalt Jun 5, 2023, 1:12 AM

#

vale idol This is because I knew indexing would be an issue and is an identifier for a com...

just what?.....

#

that is not going to work how you want

vale idol Jun 5, 2023, 1:13 AM

#

agile cobalt hmm ok it sounds like you are iterating over a list of strings and checking if t...

Yeah this might be very unnecessary since I could have just renamed the columns to have the same name in each dataframe but since I also need to set a few conditions that are specifc for each data frame I kept it as is

vale idol Jun 5, 2023, 1:15 AM

#

agile cobalt just what?.....

Its not in use so dw about it

agile cobalt Jun 5, 2023, 1:16 AM

#

I recommend either converting everything to one standard format, or creating one separate script for each different input data you want to transform

#

after you decide on that, start (from scratch, not copy/pasting what you have right now) prototyping in something interactive like a Jupyter Notebook or an IPython terminal

#

only after you get the operations right try to organize it into a function

vale idol Jun 5, 2023, 1:19 AM

#

agile cobalt I recommend either converting everything to one standard format, or creating one...

Is this what causing the issue when trying to send the labels back to the reference df?

agile cobalt Jun 5, 2023, 1:20 AM

#

what do you think that the score variable contains when you are doing score == 0 / score == 2 etc?

vale idol Jun 5, 2023, 1:21 AM

#

agile cobalt what do you think that the `score` variable contains when you are doing `score =...

position of item in list no?

agile cobalt Jun 5, 2023, 1:22 AM

#

what do you think that would happen if you did yearly_df[0]?

#

you have to organize your process in your head first, and only after that start coding - and even then, doing it in small steps, testing each part.

vale idol Jun 5, 2023, 1:23 AM

#

I'm still fairly new to python so might be messing up basic stuff

agile cobalt Jun 5, 2023, 1:23 AM

#

!e ```py
strings = ['a', 'b', 'c']
for string in strings:
print(string)
for i in range(len(strings)):
print(i)
for i, string in enumerate(strings):
print(i, string)

arctic wedgeBOT Jun 5, 2023, 1:23 AM

#

@agile cobalt :white_check_mark: Your 3.11 eval job has completed with return code 0.

agile cobalt Jun 5, 2023, 1:24 AM

#

for x in thing: iterates over each value in the thing, not over each position

vale idol Jun 5, 2023, 1:32 AM

#

Yeah I understand but the score == 0, 1 etc.. is just to match the column names of the yearly_df and the original dataframe column. Like I said before, I know this might not even be needed if the column names were the same for each dataframe I'm applying the function for

vale idol Jun 5, 2023, 1:41 AM

#

agile cobalt `for x in thing`: iterates over each _value_ in the thing, not over each _positi...

Nvm I'm stupid 😂

#

Got what you mean

wanton laurel Jun 5, 2023, 8:15 AM

#

Has anyone used the mask rcnn model - im trying to set it up on windows machine (https://github.com/matterport/Mask_RCNN/blob/master/samples/demo.ipynb) please dm to screenshare so that i can get it up and running.

GitHub

Mask_RCNN/demo.ipynb at master · matterport/Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow - Mask_RCNN/demo.ipynb at master · matterport/Mask_RCNN

quaint loom Jun 5, 2023, 8:35 AM

#

Any suggestion on graphs that I should pick when it comes to having a lot of paramters? I have tried to make double y axis but it seems like it still looks like a mess.

pseudo spire Jun 5, 2023, 8:51 AM

#

dull pike I have some pervious programming knowledge like I know the basics of python so I...

If you were sure you know how to program you wouldn't need it.
If you are not sure you know how to program you better learn programming first. Not specifically by this course but it looks nice. However, you'll need only "beginner" and "intermediate" lessons, which is roughly 1/3

molten atlas Jun 5, 2023, 9:13 AM

#

agile cobalt a book for that sounds like a waste to me? specially at this point in time in wh...

Lol 6 months is a long timeline considering how changes are fast paced now. I think it does warrant content that can skill the reader up and eventually help our bridge gaps.

celest vine Jun 5, 2023, 10:38 AM

#

Any data engineers here?

past meteor Jun 5, 2023, 10:41 AM

#

celest vine Any data engineers here?

What's your question?

celest vine Jun 5, 2023, 11:19 AM

#

past meteor What's your question?

How is exactly SQL used by data engineers? Like mainly for what purpose?

past meteor Jun 5, 2023, 11:21 AM

#

To transform data and do ad-hoc analysis.

#

There's also more and more tools that let you do dataviz with SQL

quaint loom Jun 5, 2023, 11:27 AM

#

Can someone look at my codes and tell me how to remove this graph 6? There is no data there.

https://paste.pythondiscord.com/obexahejec

mint palm Jun 5, 2023, 11:46 AM

#

i have these projects on my resume(applying for ML/DS role FULL Time), how do they look? in terms of difficulty, required time, how impressive are they?

#

and should i add more? or 3 are enough?

uncut ember Jun 5, 2023, 11:49 AM

#

quaint loom Can someone look at my codes and tell me how to remove this graph 6? There is no...

plt.delaxes(axes[2,1])

quaint loom Jun 5, 2023, 11:50 AM

#

uncut ember ```py plt.delaxes(axes[2,1]) ```

Would you describe what I should do with this? just add it into my code?

uncut ember Jun 5, 2023, 11:51 AM

#

just add this to your code

obtuse flax Jun 5, 2023, 12:17 PM

#

Hey, what are some great ways to run concurrent request in python? I'm working on https://github.com/apolloapi/apolloapi and want to structure concurrent request for our request wrappers. We'll probably implement some call_api method but I noticed python has an async keyword but I've heard python isn't the most friendly language for concurrency.

Apollo is a model management tool for training AI models, automating tasks and catching regressions. I'm currently working on adding a new provider this week that allows for LLM based grading against LLM generated output to produce a grading system for the regression testing feature of the project.

GitHub

GitHub - apolloapi/apolloapi: A radically simple LLMOps framework f...

A radically simple LLMOps framework for automation, monitoring and management for performance. - GitHub - apolloapi/apolloapi: A radically simple LLMOps framework for automation, monitoring and man...

serene scaffold Jun 5, 2023, 12:36 PM

#

obtuse flax Hey, what are some great ways to run concurrent request in python? I'm working o...

While Python doesn't support concurrent programs as well as, say, go, async is how you do it in python. there isn't some other way in python that's going to be better.

lapis sequoia Jun 5, 2023, 12:36 PM

#

Hello, does anyone have any resources on Multi Task Learning in general to recommend or on Multi-Head architectures more specifically?

serene scaffold Jun 5, 2023, 12:38 PM

#

serene scaffold While Python doesn't support concurrent programs as well as, say, go, `async` is...

you might implement the request handlers with FastAPI

ripe sapphire Jun 5, 2023, 12:39 PM

#

Hi everyone

#

I wanted to ask that what things do I have to learn in AI field in python programming language, I am little confused as this field is very vast and beyond my knowledge. Hope you guide me

serene scaffold Jun 5, 2023, 12:43 PM

#

ripe sapphire I wanted to ask that what things do I have to learn in AI field in python progra...

most of what you need to learn has nothing to do with python. I would start with a book like "data science from scratch" to start wrapping your head around what "data" is in the context of AI.

ripe sapphire Jun 5, 2023, 12:44 PM

#

serene scaffold most of what you need to learn has nothing to do with python. I would start with...

Ok thank you I will check out that book

#

But Can you just tell me what things I have to learn in programming, like I know basics of tensorflow, keras, numpy,

serene scaffold Jun 5, 2023, 12:45 PM

#

ripe sapphire But Can you just tell me what things I have to learn in programming, like I know...

No, because that ultimately isn't the point. If you just "learn tensorflow", you will have accomplished nothing in terms of understanding neural networks.

ripe sapphire Jun 5, 2023, 12:46 PM

#

serene scaffold No, because that ultimately isn't the point. If you just "learn tensorflow", you...

Then waht more should I learn

past meteor Jun 5, 2023, 12:46 PM

#

ripe sapphire Then waht more should I learn

I don't want to sound gatekeepey but as for data science, getting a solid background in traditional statistics will help a lot

serene scaffold Jun 5, 2023, 12:46 PM

#

ripe sapphire Then waht more should I learn

the book I recommended would take a few weeks to work though

past meteor Jun 5, 2023, 12:47 PM

#

Most of ML is turbocharged statistics. Knowing basic regression well helps you understand neural nets better later on

serene scaffold Jun 5, 2023, 12:47 PM

#

past meteor I don't want to sound gatekeepey but as for data science, getting a solid backgr...

it's not gatekeepy to make a universally accepted statement about how it be.

ripe sapphire Jun 5, 2023, 12:48 PM

#

past meteor Most of ML is turbocharged statistics. Knowing basic regression well helps you u...

Yes, I think learning statistics first make sense

alpine rain Jun 5, 2023, 12:59 PM

#

is it possible to update an NLP model like Stanza to fix certain incorrect dependency parsing values?

plain jungle Jun 5, 2023, 1:19 PM

#

ripe sapphire Then waht more should I learn

Calc and stats is what machine learning is built off of. Discrete mathematics is also heavily used in the algorithm side of AI (graph theory, etc)

#

#

This is kinda the under hood of a simple NN and why calc is important

alpine rain Jun 5, 2023, 1:27 PM

#

.92*1=0.92 heavy math indeed 😄

lapis sequoia Jun 5, 2023, 1:27 PM

#

ripe sapphire Hi everyone

i was in your place at some point, so i decided to document everything i learned from when i started till now in thie git repo: https://github.com/ahmedbelgacem/awesome-datascience i hope it helps you. It isn't a list of technologies and frameworks its a list of topics with the articles, books and courses i used to learn that topic. It isn't exhaustive as this is what i have learned up till now. I just recently landed a job as a Deep Learning engineer focusing on vision problems so much of this is on computer vision but you can find enough to learn. There's some french courses since i understand french and you may not. Hope this helps

GitHub

GitHub - ahmedbelgacem/awesome-datascience: A curated list of aweso...

A curated list of awesome python, machine learning, computer vision and data science resources, articles, guides, courses and books. - GitHub - ahmedbelgacem/awesome-datascience: A curated list of ...

lapis sequoia Jun 5, 2023, 1:28 PM

#

past meteor I don't want to sound gatekeepey but as for data science, getting a solid backgr...

i also agree with this

old echo Jun 5, 2023, 1:37 PM

#

should i be a software engineer or AI scientist ?

kind jetty Jun 5, 2023, 1:39 PM

#

do what you enjoy

serene scaffold Jun 5, 2023, 1:40 PM

#

old echo should i be a software engineer or AI scientist ?

you seem to have strong opinions about which career tracks are more future proof that I don't think anyone can change, so I don't think we can entertain this question.

#

and that's the end of that.

old echo Jun 5, 2023, 1:41 PM

#

will data scientists role replaced by gpt-4 ?

serene scaffold Jun 5, 2023, 1:42 PM

#

old echo will data scientists role replaced by gpt-4 ?

if you want to talk about that, go to an off-topic channel, or another server entirely.

#

this will be your only warning.

lapis sequoia Jun 5, 2023, 1:43 PM

#

old echo should i be a software engineer or AI scientist ?

depends on what you like most and what you're good at. I initially started as a software engineer and i studied software for 5 years. Then i added a masters degree in AI engineering. I thought that i'm good at computer science and found that easy enough. I also liked maths but it was more challenging for me and felt like doing maths was making me think and try hard while i wasn't trying hard in computer science alone. That's why i switched. Today I really like what i do (deep learning engineering) . I find that most of my work on a day to day base is pure software and coding but everything needs intuition, mathematical background and critical thinking. I find hard aspects on a day to day basis and i like the challenge. And no, it won't be replaced by gpt-4.

toxic mortar Jun 5, 2023, 1:45 PM

#

Hi everyone, I am new here, and this is my first message on this Discord server community.

I'm a software engineering student considering taking a neural networks course next semester. I've seen a few presentations on the course presentation, and it seemed a bit heavy on the theoretical side. I'm trying to see its applicability to real-life situations, but I think I fail to. I had a similar experience with discrete math that I took last semester; I thought it is beneficial for AI/ML, but I didn't find it particularly useful since we did only the pure theoretical part of it.

I'm curious if studying neural networks is a prerequisite for diving into other areas of AI? And how strongly correlated are the concepts covered in this course to the wider field of AI? I consciously used the term 'AI' in the messages above, as I don't want to decide what part I want to delve into before I inspect each aspect and possibility. I hope that makes sense and give you some overview of my question

Thank you 😄

alpine rain Jun 5, 2023, 1:45 PM

#

as long as the hallucination part is not fixed in the LLMs, they should not replace anything

past meteor Jun 5, 2023, 1:52 PM

#

toxic mortar Hi everyone, I am new here, and this is my first message on this Discord server ...

Discrete math can be relevant for AI/ML but it's definitely more abstract than taking a neural network course

#

I'd say a NN course is definitely a good idea for most CS majors even if you don't want to go into AI propper. A lot of chance you'll be working on/with a service that uses AI in the future.

potent sky Jun 5, 2023, 2:01 PM

#

toxic mortar Hi everyone, I am new here, and this is my first message on this Discord server ...

Apart from what others have said and a little off the point, but I think in time you'll find discrete math to be useful for software engineering and other types of problem solving in general

potent sky Jun 5, 2023, 2:05 PM

#

toxic mortar Hi everyone, I am new here, and this is my first message on this Discord server ...

Neural networks are overwhelmingly the concept on which most of modern deep learning is based (note: not all)
Deep Learning is a subset of Machine Learning. It has seen great visibility recently in powering technologies like voice assistants, recommendation systems (think "The Algorithm"), better camera quality, and a load of other things.
Machine Learning is one of the ways of manifesting AI and currently the most popular and successful one by far.
Hope this series of associations gives you some clarity!

lapis sequoia Jun 5, 2023, 2:13 PM

#

toxic mortar Hi everyone, I am new here, and this is my first message on this Discord server ...

Yes, i think that neural networks are prerequisite for modern ai but not sufficient. So if you're planning to study more AI courses in the future neural networks are a must but if you're going to study only that it can be beneficial for your culture but nothing more in my opinion

lapis sequoia Jun 5, 2023, 2:14 PM

#

potent sky Apart from what others have said and a little off the point, but I think in time...

i also agree with this

toxic mortar Jun 5, 2023, 2:15 PM

#

I really appreciate your replies guys. I think of myself that I am a hard-working guy willing to put in the work, and I'm not demoralized by the course, even if it is tough or abstract, as long as it'll benefit me in the long run. Given that, do you think it would be a good idea for me to start independently studying the neural networks course material over the summer before the formal semester begins? Because I think it might make the learning experience smoother when the actual lectures start, as I won't be encountering the topics for the first time if that makes sense

past meteor Jun 5, 2023, 2:25 PM

#

toxic mortar I really appreciate your replies guys. I think of myself that I am a hard-workin...

(Here I go again) Start with statistics if you want to learn something independently

#

And then connect the ideas you see in your neural nets course to the ideas you saw in stats. It'll make your knowledge a lot stronger in the long run

toxic mortar Jun 5, 2023, 2:27 PM

#

Agree, I am taking Probability and statistics course as we speak. I mean, I will finish it in couple of weeks.

past meteor Jun 5, 2023, 2:28 PM

#

Then a second prereq before going into neural nets is imo traditional machine learning methods

#

It's a hot take but I'd say all of ML is statistics but it tends to be called ML if it's done by someone from a comp sci/engineering background. Traditional stats, "traditional" ML and NN's are imo all part of a big toolbox you can use to solve many problems. Different problems will need different techniques so knowing a bit of everything helps. 🙂 Reason being that if you "skip" regular ML then you might overengineer things (especially on tabular datasets).

lapis sequoia Jun 5, 2023, 2:31 PM

#

toxic mortar I really appreciate your replies guys. I think of myself that I am a hard-workin...

if you really have time, i would suggest you refresh/study linear algebra its a must for neural networks

past meteor Jun 5, 2023, 2:32 PM

#

But tbh, I think there's a lot of people now that are working exclusively on speech, text, images, video, ... and I think these profiles can get away with not having a super in-depth knowledge of the traditional stuff. It's more specialised and nearly exclusively deep learning now.

potent sky Jun 5, 2023, 2:34 PM

#

past meteor But tbh, I think there's a lot of people now that are working exclusively on spe...

I still think it's useful to have a good understanding of traditional ML, even though you might not use the exact techniques

#

LinAlg, stats, probability and information theory (maybe vector spaces too if you're interested)

past meteor Jun 5, 2023, 2:35 PM

#

I think you can get away with vector spaces unless you're going for the theoretical route

dusk bear Jun 5, 2023, 2:42 PM

#

hey guys..
i am very much interested in ml/dl
but idk where to learn or how to learn🥲
i am good with math like linalg, prob and stats..
can anyone please help me
i have done some random courses.. but idk how much i have learnt and stuff.. i didnt do any projects and stuff too.. guide me pls🥲

past meteor Jun 5, 2023, 2:43 PM

#

dusk bear hey guys.. i am very much interested in ml/dl but idk where to learn or how to ...

Kaggle.com

dusk bear Jun 5, 2023, 2:44 PM

#

past meteor Kaggle.com

yea bro. i did some courses in kaggle.com

#

learn ones

#

but the thing is i dont get a pathway kinda.. like how to develop

past meteor Jun 5, 2023, 2:45 PM

#

Assuming you already have the prerequisite linalg, prob, stats then you should to "easy" Kaggle competitions (tabular playground series)

dusk bear Jun 5, 2023, 2:45 PM

#

have seen many utube tutorials. have all fundamentals but cant map them and learn 🥲

past meteor Jun 5, 2023, 2:45 PM

#

Solve the case yourself, submit your predictions and then look at other people's notebooks

dusk bear Jun 5, 2023, 2:45 PM

#

past meteor Assuming you already have the prerequisite linalg, prob, stats then you should t...

oh.. i never heard. lemme check

past meteor Jun 5, 2023, 2:45 PM

#

Beware that Kaggle only trains a subset of the skills you need to work in data though

dusk bear Jun 5, 2023, 2:46 PM

#

past meteor Solve the case yourself, submit your predictions and then look at other people's...

ok this way i can do for practice.. what about learning? like how to learn new things? like for free.. i can't afford for courses so yea..

dusk bear Jun 5, 2023, 2:46 PM

#

past meteor Beware that Kaggle only trains a subset of the skills you need to work in data t...

didnt get u... can u come again?

past meteor Jun 5, 2023, 2:47 PM

#

there's much more to data science than training models

past meteor Jun 5, 2023, 2:47 PM

#

dusk bear ok this way i can do for practice.. what about learning? like how to learn new t...

Books 🙂

dusk bear Jun 5, 2023, 2:47 PM

#

past meteor Books 🙂

implementation?🥲

past meteor Jun 5, 2023, 2:48 PM

#

These are all free: https://mml-book.github.io/ https://www.statlearning.com/ and http://www.mmds.org/

potent sky Jun 5, 2023, 2:49 PM

#

For theory https://www.deeplearningbook.org is also free

dusk bear Jun 5, 2023, 2:50 PM

#

potent sky For theory https://www.deeplearningbook.org is also free

yea i read this. (some parts)

dusk bear Jun 5, 2023, 2:50 PM

#

past meteor These are all free: https://mml-book.github.io/ https://www.statlearning.com/ an...

thanks bro!

#

actually, last week i started aeroplane object detection using RCNN. like ik what is cnn, how cnn works, architecture of cnn but idk how to code for it

#

how to build model for it.. so how to learn all these? this is what i wanted to know actually

slender kestrel Jun 5, 2023, 2:53 PM

#

hello i have a question about a deep learning model can anyone help me with that ?

past meteor Jun 5, 2023, 2:53 PM

#

dusk bear actually, last week i started aeroplane object detection using RCNN. like ik wha...

More books: https://d2l.ai/

#

This one in particular covers the theory and implementation of most, if not all, common architectures

dusk bear Jun 5, 2023, 2:54 PM

#

past meteor This one in particular covers the theory and implementation of most, if not all,...

oh cool

past meteor Jun 5, 2023, 2:55 PM

#

After that (reading these 4 I sent will take a very very long time if you do it properly) then what's left is the cutting-edge in papers + actually using what you've learnt in those to do projects

potent sky Jun 5, 2023, 2:56 PM

#

GitHub and docs are your friends

#

ofc be sure not to simply copy

past meteor Jun 5, 2023, 2:56 PM

#

The docs of Tensorflow / Pytorch / MXnet have examples that are typically well explained indeed

dusk bear Jun 5, 2023, 2:56 PM

#

past meteor After that (reading these 4 I sent will take a very very long time if you do it ...

actually im now in 3rd year of graduation, just 2 years left. i see in linkedin all my friends are doing lots and lots of things.. idk why am unable to 🥺 its a depressing btw

slender kestrel Jun 5, 2023, 2:56 PM

#

in this model 2 lstm layers are added in sequence with using return state=True so does it make it a stacked lstm network or not ?

past meteor Jun 5, 2023, 2:57 PM

#

But fundamentally - you need to decide if you want to be designing novel architectures or if your interest is in applying say the cutting-edge on specific problems

#

Imo these are wildly different skillsets

dusk bear Jun 5, 2023, 2:57 PM

#

past meteor Imo these are wildly different skillsets

yea true

past meteor Jun 5, 2023, 2:58 PM

#

dusk bear actually im now in 3rd year of graduation, just 2 years left. i see in linkedin ...

LinkedIn is 80 % inflating the truth 10 % straight up lies and 10 % factual

#

I remember I did a workshop track on computer vision in the cloud with a fancy consulting when I was a student. We got a (useless) certificate on the end.

Afterwards I was browsing LinkedIn and I saw someone post about a super cool thing they did. Turns out I was in exactly the same track as them but they inflated it so much I had no idea I even attended the same thing as them.

dusk bear Jun 5, 2023, 3:01 PM

#

past meteor I remember I did a workshop track on computer vision in the cloud with a fancy c...

lol

past meteor Jun 5, 2023, 3:02 PM

#

I don't know if it's a good idea to essentially dox yourself haha

dusk bear Jun 5, 2023, 3:03 PM

#

past meteor I don't know if it's a good idea to essentially dox yourself haha

nah just asking ur opinion

past meteor Jun 5, 2023, 3:04 PM

#

I'm a books person so my suggestion is to take https://www.statlearning.com/ and read it diagonally and experiment with the techniques you're learning there in Kaggle competitions.

dusk bear Jun 5, 2023, 3:04 PM

#

dusk bear actually, last week i started aeroplane object detection using RCNN. like ik wha...

btw for this
i have used selective search for plane detection
and this is what i get
so many borders.. idk how to correct them..

dusk bear Jun 5, 2023, 3:04 PM

#

past meteor I'm a books person so my suggestion is to take https://www.statlearning.com/ and...

ok.. thanks..

lapis sequoia Jun 5, 2023, 3:05 PM

#

dusk bear hey guys.. i am very much interested in ml/dl but idk where to learn or how to ...

If you're good with the math I would then recommend to study python in depth. After that start with the bases of machine learning (Andrew Ng haw a really good free course on coursera with Stanford University if you want to start). Then i would recommend some deep learning, neural networks etc after that it would be nice to try different things and choose what you like the most and play around different project you find, for example try computer vision thing, study it specifically then try to build a classifier for something you like. Then study for example nlp and try to build something with it etc

dusk bear Jun 5, 2023, 3:06 PM

#

lapis sequoia If you're good with the math I would then recommend to study python in depth. Af...

cool

lapis sequoia Jun 5, 2023, 3:08 PM

#

lapis sequoia i was in your place at some point, so i decided to document everything i learne...

i can redirect you to my previous answer. I can send you some specific links for something in particular you want to start learning. For python you can start with some easy book called Automate the boring stuff with python then go to more advanced things. If you like exercices along the way datacamp is really cool.

dusk bear Jun 5, 2023, 3:09 PM

#

lapis sequoia i can redirect you to my previous answer. I can send you some specific links for...

sure
please do send
and yea i did datacamp for somedays. statistical thinking with python thing ig..

dusk bear Jun 5, 2023, 3:10 PM

#

dusk bear btw for this i have used selective search for plane detection and this is what i...

ss.setBaseImage(imtest)
ss.switchToSelectiveSearchFast()
ssresults = ss.process()
imout = imtest.copy()
for e,result in enumerate(ssresults):
    if e < 2000:
        x,y,w,h = result
        timage = imout[y:y+h,x:x+w]
        resized = cv2.resize(timage, (224,224), interpolation = cv2.INTER_AREA)
        img = np.expand_dims(resized, axis=0)
        out = model_final.predict(img)
        if out[0][0] > 0.97:
            cv2.rectangle(imout, (x, y), (x+w, y+h), (0, 255, 0), 1, cv2.LINE_AA)
plt.figure()
plt.imshow(imout)```
This is the code for detection part btw..

dusk bear Jun 5, 2023, 3:13 PM

#

lapis sequoia If you're good with the math I would then recommend to study python in depth. Af...

andrew ng ml this one?

lapis sequoia Jun 5, 2023, 3:14 PM

#

wait i will send you the link

dusk bear Jun 5, 2023, 3:14 PM

#

lapis sequoia wait i will send you the link

ok

lapis sequoia Jun 5, 2023, 3:17 PM

#

https://www.coursera.org/learn/machine-learning
@dusk bear

Coursera

Supervised Machine Learning: Regression and Classification

In the first course of the Machine Learning Specialization, you will: • Build machine learning models in Python using popular machine ... Enroll for free.

#

they changed the name

dusk bear Jun 5, 2023, 3:19 PM

#

lapis sequoia they changed the name

ok.. and u said some other links.. what r those?

lapis sequoia Jun 5, 2023, 3:19 PM

#

i said if you have something in particular you are looking for, say you tell me i want to learn reinforcement learning, i would send you a link for that particular subject

potent sky Jun 5, 2023, 3:22 PM

#

dusk bear btw for this i have used selective search for plane detection and this is what i...

Are you familiar with Non Maximal Suppression

cold osprey Jun 5, 2023, 3:24 PM

#

what kinds of things can i do to improve image classification tasks?

Currently, im just trying out various models and fine tuning them on my dataset. Not sure what else I can explore to improve performance

somber panther Jun 5, 2023, 3:25 PM

#

recommend any starter courses for ds ml? the one i picked out on udemy is pretty dated

past meteor Jun 5, 2023, 3:26 PM

#

somber panther recommend any starter courses for ds ml? the one i picked out on udemy is pretty...

Scroll up, we had this discussion just now haha 🙂

past meteor Jun 5, 2023, 3:26 PM

#

cold osprey what kinds of things can i do to improve image classification tasks? Currently...

How are your train and val curves looking like?

#

Augmentation and/or other regularization strategies might be a good idea

#

If you have the time for it you can also just hyperparam tune

cold osprey Jun 5, 2023, 3:28 PM

#

past meteor How are your train and val curves looking like?

#

dont have a val set which may be a mistake now

past meteor Jun 5, 2023, 3:29 PM

#

Well, your test is your validation isn't it

#

so you'd need another dataset

sullen kernel Jun 5, 2023, 3:29 PM

#

hi, I'm having problems with my project and I would appreciate if anyone could get on a call with me and help me maybe?

cold osprey Jun 5, 2023, 3:29 PM

#

rightt

#

i have auto transforms that i get from the pre trained model itself

#

ImageClassification(
    crop_size=[288]
    resize_size=[288]
    mean=[0.485, 0.456, 0.406]
    std=[0.229, 0.224, 0.225]
    interpolation=InterpolationMode.BICUBIC
)```

past meteor Jun 5, 2023, 3:30 PM

#

I think at some point you are beginning to overfit so you can play around with adding dropout inyour FC layers, augmentation, ...

cold osprey Jun 5, 2023, 3:32 PM

#

there is one dropout layer already but i can increase the proba

past meteor Jun 5, 2023, 3:32 PM

#

Yeah if you have the compute for it, I'd do it with some sort of hyper parameter tuner

cold osprey Jun 5, 2023, 3:33 PM

#

hmm, would u do it across diff models too? like effnet b0 to b4 and with various hyperparameter values

shut yoke Jun 5, 2023, 3:33 PM

#

cold osprey ``` ImageClassification( crop_size=[288] resize_size=[288] mean=[0....

what language is that

cold osprey Jun 5, 2023, 3:34 PM

#

shut yoke what language is that

output of ```py

Get the transforms used to create our pretrained weights

auto_transforms = weights.transforms()
auto_transforms

shut yoke Jun 5, 2023, 3:34 PM

#

ah alright

past meteor Jun 5, 2023, 3:35 PM

#

cold osprey hmm, would u do it across diff models too? like effnet b0 to b4 and with various...

I used KerasTuner a bunch in the past and it allows you to have a "context" for hyperparameters so you can search in a better way

#

So across models and also "remembering" that hyperparam1_1 is related to model1 and hyperparam1_2 is related to model2 etc

#

Maybe Optuna has this too - my issue with KerasTuner is that it depends on Tensorflow and installing TF just to get this is crazy 😛

cold osprey Jun 5, 2023, 3:37 PM

#

ah hmmm

#

im not even sure if putting this much effort on just a project to showcase i know how to work with image classification stuff is worth it

dusk bear Jun 5, 2023, 4:10 PM

#

lapis sequoia i said if you have something in particular you are looking for, say you tell me ...

ah.. ok.. can u share me related to neural networks? ann, cnn, rnn, etc etc

dusk bear Jun 5, 2023, 4:11 PM

#

potent sky Are you familiar with Non Maximal Suppression

no..

potent sky Jun 5, 2023, 4:14 PM

#

dusk bear no..

Look it up. Should help with this problem

celest vine Jun 5, 2023, 4:45 PM

#

past meteor To transform data and do ad-hoc analysis.

But isn't the analysis part for the data analysts?

past meteor Jun 5, 2023, 4:46 PM

#

People wear multiple hats. It's common to be a data engineer that also does analysis / data science

celest vine Jun 5, 2023, 4:46 PM

#

past meteor People wear multiple hats. It's common to be a data engineer that also does anal...

Okay, so it will dependent on the company I work for?

past meteor Jun 5, 2023, 4:46 PM

#

But yeah, even if you don't do it yourself the analyst that is working downstream relative to yourself might do their analysis with SQL

past meteor Jun 5, 2023, 4:46 PM

#

celest vine Okay, so it will dependent on the company I work for?

yes

celest vine Jun 5, 2023, 4:47 PM

#

past meteor But yeah, even if you don't do it yourself the analyst that is working downstrea...

Also, which is heavily used for the ETL? Python or sql?

past meteor Jun 5, 2023, 4:47 PM

#

Probably SQL?

celest vine Jun 5, 2023, 4:48 PM

#

past meteor Probably SQL?

Why though? Because what can be done in sql can also be done in python.

past meteor Jun 5, 2023, 4:49 PM

#

Many data engineers don't know Python

celest vine Jun 5, 2023, 4:50 PM

#

past meteor Many data engineers don't know Python

Ohh. So, SQL and database knowledge is top priority if I want to become a data engineer?

past meteor Jun 5, 2023, 4:51 PM

#

celest vine Ohh. So, SQL and database knowledge is top priority if I want to become a data e...

imo yes

celest vine Jun 5, 2023, 4:55 PM

#

past meteor imo yes

You work as a data engineer yourself?

past meteor Jun 5, 2023, 4:56 PM

#

celest vine You work as a data engineer yourself?

I'm an applied AI engineer. I'm the one that does all the data engineering on the team though

#

In the past I did internships in data engineering specifically

celest vine Jun 5, 2023, 5:00 PM

#

past meteor I'm an applied AI engineer. I'm the one that does all the data engineering on th...

Nice. Can you provide your opinion on the roadmap I am following for data engineering?

past meteor Jun 5, 2023, 5:01 PM

#

Probably better placed people to do that than me :/ maybe @boreal gale

#

If not, try Reddit

celest vine Jun 5, 2023, 5:04 PM

#

#

Give your opinions on the roadmap

cold osprey Jun 5, 2023, 5:17 PM

#

if u plan on reading kimball, we can discuss it too

#

im on chapter 2

dull pike Jun 5, 2023, 5:18 PM

#

Do you guys think it’s worth it to get a teacher for learning python and machine learning?

celest vine Jun 5, 2023, 5:20 PM

#

cold osprey if u plan on reading kimball, we can discuss it too

I am currently on udemy course for data warehousing

past meteor Jun 5, 2023, 5:25 PM

#

The roadmap is fine so long as you do enough projects

#

I wouldn't spend time on Inmon, Data vault, data mesh or what have you. Just good ol' star schema's are fine for entry level

celest vine Jun 5, 2023, 5:31 PM

#

past meteor I wouldn't spend time on Inmon, Data vault, data mesh or what have you. Just goo...

Star schema and snowflake schema, right?

past meteor Jun 5, 2023, 5:32 PM

#

Yeah just star schema's are fine to focus on in the beginning

#

Maybe people that do data engineering full time might disagree so I'd go on r/dataengineering and ask their opinion

celest vine Jun 5, 2023, 5:33 PM

#

Got it. I appreciate all the suggestions you gave.

cold osprey Jun 5, 2023, 5:49 PM

#

galaxy schema xD

small heron Jun 5, 2023, 5:52 PM

#

How are you. ...... I'm create a shopping app using python kivy, if I send information from user interface to SQLite its going but not updating on my app at the real time. for example, on marketplace page if I add item to my cart its saying its added but going to my cart its not appearing but if rebuild the app it will be showing, so how can I make things update at the real time

boreal gale Jun 5, 2023, 6:19 PM

#

celest vine Give your opinions on the roadmap

err.. replying because i got pinged heh.

is solid, it's where i would start if i were to start over
is okay, i mean it's nice to know the concepts and all, but imo the value is limited unless you put it into practice
spark is good to know, but imo is optional, people abuse spark way too often (when you have a hammer, everything looks like nail type of thing), i would just ignore hadoop hive pig, only research them if the job you are applying requires it/you have an unnatural interest in them
can always help you job hunt, it's a plus but not essential
5):

airflow is not a must, but sure you need to come to grips with some orchestration tooling, prefect and dagster are viable contenders (heck even luigi depending on your usecases)
compute: no comment really, but if you know spark then this is probably not a big step up, again not essential imo
cicd: only CI is relevant to your core duties, knowing how to test your code is a big plus
docker: hell yes. you can't escape them containers these days.
6): 10000% yes, put it all into practice, do something original, it's the best way to drill some core concept into your brain and it serves well inside a portfolio

but i must say, imo data engineer is not a job you can easily land without some experience in other dev related role, companies that hire junior DE is few and far between.

also this is quoted often in the DE discord https://github.com/datastacktv/data-engineer-roadmap
and DDIA is almost a religous text in DE https://dataintensive.net/

good luck!

proud beacon Jun 5, 2023, 6:49 PM

#

Hi guy, I have a piece of coding instructions and I am using anaconda3, should I type these into the anaconda prompt of into my VScode application? ``` start Anaconda3

type:

cd E:\Xfer\NC\MCT2000_LOG_FILE

Press Enter

rose dagger Jun 5, 2023, 7:07 PM

#

Is there some way to reduce/manage the needed memory for a neurel net in tensorflow? I'm building a network with roughly 30 million parameters and i'm using the GPU provided on Kaggle, which roughly has 16 GB of GPU memory. When initializing the model it immediately runs out of memory. Any tips?

#

(I know one obvious option would be to reduce the complexity of the neural net, i.e. remove a few layers / connections, but say i want to improve the memory usage for a given fixed neural network)

tacit knot Jun 5, 2023, 7:51 PM

#

@rose dagger I don't have an actual answer for you, but i know there are several memory optimization things especially around Stable Diffusion (popular/open source) that you MIGHT be able to apply in some way? I'm guessing you are already familiar with some, but there are things like xformers, cunumeric, and several other things. Have you looked into any of those?

#

I'm actually trying to look into if/how I could potentially convert the ZoeDepth models to use TensorRT for performance boost...lol but so far I've only been trying to "use" AI stuff, not even sure where to start yet.

rose dagger Jun 5, 2023, 7:55 PM

#

tacit knot <@695668075886018580> I don't have an actual answer for you, but i know there ar...

I have not heard of those yet, but will look into them. Thanks!

tacit knot Jun 5, 2023, 7:56 PM

#

Ah ok, then there is hope for you yet lol...good news is this is a common problem, bad news is that it is really hard to find quality information.

#

https://huggingface.co/docs/diffusers/optimization/xformers

Installing xFormers

#

That will probably be your single biggest gain. I tried to get it working early on and failed many times. Finally got a better understanding of python environments and such, but it is a near drop in improvement. It DOES have a potential downside, certain things (not sure exactly what all) are not deterministic.

#

but also check out cunumeric, drop in replacement for some core python stuff that I've read can give performance/memory improvements

rose dagger Jun 5, 2023, 8:00 PM

#

tacit knot That will probably be your single biggest gain. I tried to get it working early ...

Interesting. It potentially not being deterministic probably won't bother me too much. I'll try it, but it'll probably take quite some time to implement

tacit knot Jun 5, 2023, 8:01 PM

#

Any idea how far out you are on memory?

#

like do you need to shave a bit or cut it in half?

crimson summit Jun 5, 2023, 8:06 PM

#

I just finished coding my first Neural Network. It is a simple 3 layer neural network. For some reason it is not working. I double checked the math part and everything seems right. If anybody sees any glaring errors please let me know.

#

https://github.com/cgx-ai/First-Neural-Network

GitHub

GitHub - cgx-ai/First-Neural-Network

Contribute to cgx-ai/First-Neural-Network development by creating an account on GitHub.

#

here is my code ^

#

https://pjreddie.com/projects/mnist-in-csv/

MNIST in CSV

MNIST is a great dataset in awful packaging. Here's a CSV instead of that crazy format they are normally available in. Enjoy!

#

this is the data that I am working with ^

#

I am supposed to get something similar to this as my answer for the #7 which is the first number in the test data set

#

instead I am just getting this

rose dagger Jun 5, 2023, 8:36 PM

#

tacit knot like do you need to shave a bit or cut it in half?

Oh well i cut my number of parameters in half and was still out of memory lol. I'll explore it some more tomorrow and try to get a more precise estimate.

tacit knot Jun 5, 2023, 8:38 PM

#

there is a good and extensive write up about optimization on HuggingFace

mild dirge Jun 5, 2023, 8:44 PM

#

@crimson summit

#

Here you calculate output_errors but don't use it?

crimson summit Jun 5, 2023, 8:47 PM

#

mild dirge <@900966051280474122>

i was calculating it to show the steps

#

could that mess up the network if it is not being used ?

mild dirge Jun 5, 2023, 8:47 PM

#

No, just a waste of processing time but it won't affect anything

#

Appareantly all outputs are very high, which means the weights might be very high

#

You could check if that is the case

#

Might not even be that the network is broken, but f.e. too high learning rate (0.3 is quite high for general models)

crimson summit Jun 5, 2023, 8:50 PM

#

mild dirge Might not even be that the network is broken, but f.e. too high learning rate (0...

i messed with the learning rate but that didnt do anything

#

i have not messed with the weights yet though

mild dirge Jun 5, 2023, 8:50 PM

#

Did you try values like 0.001 ?

crimson summit Jun 5, 2023, 8:50 PM

#

oh no I went down to 0.1

#

let me try the real quick

mild dirge Jun 5, 2023, 8:50 PM

#

Try something like 0.001 see if that makes any difference at all

#

Checking the manual gradients calculations would take quite a while for me as well, so if it's anything else that would be nice ;P

#

Btw, why do you have this inputs = (numpy.asfarray(all_values[1:]) / 255.0 * 0.99) + 0.01

#

Are you scared of zeros or something?

crimson summit Jun 5, 2023, 8:57 PM

#

its supposed to scale and shift the inputs between 0.1 and 1

mild dirge Jun 5, 2023, 8:58 PM

#

Normally you'd normalize to values between 0 and 1

#

Did the book suggest this (may be because you don't have a bias in your NN)

crimson summit Jun 5, 2023, 8:59 PM

#

crimson summit its supposed to scale and shift the inputs between 0.1 and 1

sorry between 0.01 and 1

crimson summit Jun 5, 2023, 9:00 PM

#

mild dirge Did the book suggest this (may be because you don't have a bias in your NN)

yea i am just following along in the book

#

but the guy in the book did some super wierd math that is inorrect so I trained my neural network diffrently

#

i am not to surprised that the results are different just trying to figure out what I need to adjust

mild dirge Jun 5, 2023, 9:02 PM

#

What is incorrect about it?

#

Was it the derivative of the sigmoid?

crimson summit Jun 5, 2023, 9:03 PM

#

https://github.com/makeyourownneuralnetwork/makeyourownneuralnetwork/blob/master/part2_neural_network.ipynb

GitHub

makeyourownneuralnetwork/part2_neural_network.ipynb at master · mak...

Code for the Make Your Own Neural Network book. Contribute to makeyourownneuralnetwork/makeyourownneuralnetwork development by creating an account on GitHub.

#

if you look in the train section he calculates the cost of the hidden layer output by just multiplying the weights times the output cost or "error" how he calls it

#

I also just tried making the weights way smaller but that did not do anything

mild dirge Jun 5, 2023, 9:11 PM

#

Do you have the csv, can you send in dm?

crimson summit Jun 5, 2023, 9:11 PM

#

mild dirge Do you have the csv, can you send in dm?

yea sure

crimson summit Jun 5, 2023, 9:12 PM

#

mild dirge Do you have the csv, can you send in dm?

sent you a friend request

mild dirge Jun 5, 2023, 9:15 PM

#

Alright let me just check some stuff out then

#

So the values grow big after the hidden layer, so from hidden to output they get to like 14 on average

#

When pulling those values through sigmoid they will basically all be close to 1

#

Not sure why those weights are so high yet though

crimson summit Jun 5, 2023, 9:24 PM

#

mild dirge Not sure why those weights are so high yet though

are the weight values that i have between -0.5 and 0.5 super big ?

mild dirge Jun 5, 2023, 9:24 PM

#

Nah shouldn't be

#

lol

#

found it

#

[2.31742179e-03 4.87518635e-06 6.80298229e-04 7.63022959e-05
 1.15368135e-07 2.46824449e-05 4.67039119e-08 9.99906227e-01
 2.01392314e-07 2.47477905e-05]

#

Getting this output now, with 0.99999 at index 7

#

The way I found it was by printing out the output_errors_deriv, and found that almost all derivative where positive

#

Which means that the model would try to correct the weigths to increase those values, but it wanted to increase all values but the one that was the correct target

#

You swapped targets and final_outputs in your error derivative

#

output_errors_deriv = 2 * (targets - final_outputs)

#

And not
output_errors_deriv = 2 * (final_outputs - targets)

#

@crimson summit

#

Or actually...

#

That was correct

crimson summit Jun 5, 2023, 9:30 PM

#

mild dirge `output_errors_deriv = 2 * (targets - final_outputs)`

doing final_outputs- targets helps me cancel out the -1 i think

mild dirge Jun 5, 2023, 9:30 PM

#

But swapping those also fixed it, you should actually change

self.who += self.lr * numpy.dot((output_errors_deriv * final_outputs_deriv), final_inputs_deriv2)

to

self.who -= self.lr * numpy.dot((output_errors_deriv * final_outputs_deriv), final_inputs_deriv2)

instead of swapping targets and final outputs

#

Because atm you are doing gradient ascent instead of gradient descent

crimson summit Jun 5, 2023, 9:32 PM

#

should I swap the sign to negative on the other weight calculating formula aswell

mild dirge Jun 5, 2023, 9:33 PM

#

Yeah I'm just checking that

crimson summit Jun 5, 2023, 9:36 PM

#

I am now getting the correct largest value for the number 7 so it is working fine now

#

I made them both negative btw

#

I just need to make the numbers decimals

#

I think

mild dirge Jun 5, 2023, 9:38 PM

#

Hmm, still something wrong even after swapping, getting 1k of 10k correct (basically random guessing)

crimson summit Jun 5, 2023, 9:42 PM

#

yea never mind when I try the second number in the data set its incorrect

mild dirge Jun 5, 2023, 9:46 PM

#

Yeah I'm not sure atm, it takes me too long to find too. I'd probably have to write it from scratch myself to see how I would do it and then compare it with your solution, but that takes a bit too long right now.

#

I don't think I can really help much further :/

crimson summit Jun 5, 2023, 9:46 PM

#

No worries bro

#

thank you for the help

mild dirge Jun 5, 2023, 9:51 PM

#

I'm doing a deep learning project, and my partner tried out all kinds of hyper params, these were the learning rates he tried out for the grid search ... :/

serene scaffold Jun 5, 2023, 10:12 PM

#

mild dirge I'm doing a deep learning project, and my partner tried out all kinds of hyper p...

gotta stay within the same order of magnitude, or the computer will explode /s

mild dirge Jun 5, 2023, 10:43 PM

#

Also set learning rate decay to 0.97, with training taking about 10k update steps (learning rate of 10^-14 after 1000 steps or so)

plain jungle Jun 5, 2023, 10:46 PM

#

mild dirge I'm doing a deep learning project, and my partner tried out all kinds of hyper p...

At least it’s not the other way of hyper 5e4 6e4 7e4 😎

mild dirge Jun 5, 2023, 10:46 PM

#

I actually forgot the minus at some point in this project, caused a big head ache haha

plain jungle Jun 5, 2023, 10:47 PM

#

Lmao, oh I could only imagine

rare fog Jun 5, 2023, 11:04 PM

#

How would I make a list that follows a distribution that looks something like this, for a given minimum, maximum, and number of items?

agile cobalt Jun 5, 2023, 11:25 PM

#

rare fog How would I make a list that follows a distribution that looks something like th...

most scientific computing libraries with random modules allow for you to specify which distribution to use, for example in numpy's case it would be using one of these methods https://numpy.org/doc/stable/reference/random/generator.html#distributions

#

(as for which one exactly fits your particular use case, I have no clue though)

rare fog Jun 5, 2023, 11:38 PM

#

agile cobalt most scientific computing libraries with random modules allow for you to specify...

Thanks

rancid widget Jun 6, 2023, 1:38 AM

#

I' am learning data science and learning statistics. Can anyone shed some light on 2 histograms I have along with how I determine the bins and tell me if it is normal distribution? It's confusing when its not perfect and never seems to be lol

rancid widget Jun 6, 2023, 2:32 AM

#

nm, I figured out how to plot against QQ plot

sweet crypt Jun 6, 2023, 6:16 AM

#

In search algorithms in game, how do we know we have taken good actions?

dense crane Jun 6, 2023, 7:56 AM

#

transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5]) is this normalization a thing?

#

like is someone ever use that or just it is useable?

winged lance Jun 6, 2023, 10:36 AM

#

i'm not from coding background . can i learn data science get a job pls give suggestion ?

lapis sequoia Jun 6, 2023, 10:54 AM

#

sweet crypt In search algorithms in game, how do we know we have taken good actions?

when you frame the problem, you generally have some kind of reward. Can you elaborate your question so i can give you a better answer?

lapis sequoia Jun 6, 2023, 10:56 AM

#

sweet crypt In search algorithms in game, how do we know we have taken good actions?

take a look at the slides part here: https://www.lamsade.dauphine.fr/~cazenave/MonteCarloSearch.html
this is the course i followed at my Uni

sonic creek Jun 6, 2023, 10:59 AM

#

I need helppp

#

@agile cobalt

#

@stable wing

lapis sequoia Jun 6, 2023, 11:01 AM

#

sonic creek I need helppp

whats your question?

sonic creek Jun 6, 2023, 11:02 AM

#

in discord.py
You know it?

lapis sequoia Jun 6, 2023, 11:03 AM

#

please write a full sentence framing your problem

sonic creek Jun 6, 2023, 11:03 AM

#

But it is very simple thig

#

Ok !

#

I have error

#

Can I send it?

#

@lapis sequoia

uneven thunder Jun 6, 2023, 12:13 PM

#

General question. I'm starting to learn ML and i'm wondering if training a ML model to determine even and odd numbers is a smart beginning. Is this a hard goal? is this a simple process?

Essentially:
Feed the model 100'000 numbers between 0 and 60'000,
Train it for idk, 14 epochs,

Save the model and test it with 10'000 numbers between 70'000 and 120'000.

Would that be a doable beginner project?

lost pier Jun 6, 2023, 12:13 PM

#

Hi peeps, wondering if anyone can help me with something. I have a pandas dataframe and I'm running a function through it, but it's getting tripped up by null values. The problem is I can't remove the null values, I just want to skip those rows, I can't find a way to do that, there just seems to be dropna() or fillna() but those null values are supposed to be there, I'm just not working on those bits, is there an ignore null and move on method in pandas?

cold osprey Jun 6, 2023, 12:15 PM

#

uneven thunder General question. I'm starting to learn ML and i'm wondering if training a ML mo...

Why do I feel like someone has already asked this before here

cold osprey Jun 6, 2023, 12:15 PM

#

lost pier Hi peeps, wondering if anyone can help me with something. I have a pandas datafr...

What specifically r u trynna do? Maybe show some code examples

#

!code

arctic wedgeBOT Jun 6, 2023, 12:15 PM

#

Formatting code on discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

lost pier Jun 6, 2023, 12:16 PM

#

cold osprey What specifically r u trynna do? Maybe show some code examples

Sure one second

uneven thunder Jun 6, 2023, 12:16 PM

#

cold osprey Why do I feel like someone has already asked this before here

I belive it's my first time writing in this channel.

I'm a beginner at ML and i wonder if that is a smart beginner project.

cold osprey Jun 6, 2023, 12:16 PM

#

uneven thunder I belive it's my first time writing in this channel. I'm a beginner at ML and ...

U can give it a try ig

#

U want to use a NN?

uneven thunder Jun 6, 2023, 12:17 PM

#

I was thinking about it yes. I feel like a decision tree would do fine, but i'd like to try a NN, yes

cold osprey Jun 6, 2023, 12:17 PM

#

What features will u pass in that makes u think a decision tree model will work?

uneven thunder Jun 6, 2023, 12:19 PM

#

I belive with enough trial and error it might figure out to follow the simple rules of "if odd: else:" which would result in a 1.0 accuracy.

The training data would consit of random numbers like i explained before and the correct answer for each current number it's training on

cold osprey Jun 6, 2023, 12:19 PM

#

uneven thunder I belive with enough trial and error it might figure out to follow the simple ru...

I mean, think about it tho

#

There needs to be some sense in how the model will work right

uneven thunder Jun 6, 2023, 12:20 PM

#

Yes.

cold osprey Jun 6, 2023, 12:21 PM

#

So if ure a decision tree, how would u 'split' the data?

#

In decision trees, numerical features are treated as 'Is X > 5?' for e.g.

#

Will any form of >, >=, <= or < work?

lost pier Jun 6, 2023, 12:22 PM

#

@cold osprey I have a largish dataset, 130,000 odd rows. there are two columns I am working with, one has an array which I have exploded they they are now single strings on separate rows, the other column is a key value pair, looks like JSON though to be fair it's in single quotes, but I can deal with that bit. So once stripping off any excess white space and they applying json.dumps and json.loads, I am now trying to apply the following line:

df[["workflow", "cost_centre"]] = df[["workflow", "cost_centre"]].applymap(ast.literal_eval)

after narrowing all this down, it works as expected untill it gets to a row where both of these columns are null values. I need to just skip them not remove or alter them if at all possible

uneven thunder Jun 6, 2023, 12:22 PM

#

cold osprey Will any form of >, >=, <= or < work?

No

#

well, i already pieced together a simple feedforward MLP, just to see what happens, but since i have no clue what i'm doing it has an incredible accuracy of 0.5.

I can show you if you'd like.

cold osprey Jun 6, 2023, 12:23 PM

#

Accuracy of 0.5 is no better than randomly guessing

gentle zenith Jun 6, 2023, 12:23 PM

#

AI is so cool!

uneven thunder Jun 6, 2023, 12:23 PM

#

cold osprey Accuracy of 0.5 is no better than randomly guessing

I'm aware.

cold osprey Jun 6, 2023, 12:24 PM

#

Which is what I would expect

#

What activation functions r u using?

#

I think u would need some non linear stuff to get it to work, not sure

uneven thunder Jun 6, 2023, 12:25 PM

#

Okay more general question. What model would be suited for such a task. I'm currently playing around with something like this:


import numpy as np
import tensorflow as tf
import matplotlib.pyplot as plt

# GENERATE
training_numbers = np.random.randint(0, 60001, size=100000)

# LABEL
training_labels = np.where(training_numbers % 2 == 0, 1, 0)

# DEFINE
model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation='relu', input_shape=(1,)),
    tf.keras.layers.Dense(32, activation='relu'),
    tf.keras.layers.Dense(16, activation='relu'),
    tf.keras.layers.Dense(1, activation='sigmoid')
])

model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

# TRAIN
model.fit(training_numbers, training_labels, epochs=14, batch_size=10)

# GEN-TEST
test_numbers = np.random.randint(0, 60001, size=10000)

# LABEL-TEST
test_labels = np.where(test_numbers % 2 == 0, 1, 0)

# TEST
_, accuracy = model.evaluate(test_numbers, test_labels)
print('Average Accuracy:', accuracy)

# ANALYZE
#removed for discord
model.save("model.h5")

But like i said, i'm just experimenting around, not really knowing what i'm doing

cold osprey Jun 6, 2023, 12:25 PM

#

Problems like odd even where there is defined way to calculate it isn't usually solved by ML

uneven thunder Jun 6, 2023, 12:26 PM

#

Yes, but it seemed like an easy "enviroment" with simple rules and it's easy to test.

cold osprey Jun 6, 2023, 12:26 PM

#

The rule is modulus

#

So u would need to teach a model how to do modulus

uneven thunder Jun 6, 2023, 12:27 PM

#

cold osprey The rule is modulus

you're talking about % 2 == 0, 1, 0 i assume?

cold osprey Jun 6, 2023, 12:27 PM

#

Why not just use some dataset on kaggle?

#

Yeah

#

Remainder of modulus to be specific

uneven thunder Jun 6, 2023, 12:27 PM

#

cold osprey Why not just use some dataset on kaggle?

Do you have one in mind that offers a beginner goal?

cold osprey Jun 6, 2023, 12:28 PM

#

Idk I mean Titanic dataset for classification?

#

Iris datasets

uneven thunder Jun 6, 2023, 12:28 PM

#

Titanic?

#

lemme look it up rq

mild dirge Jun 6, 2023, 12:28 PM

#

Iris, or mnist, or fashion mnist

cold osprey Jun 6, 2023, 12:28 PM

#

These are like the typical first project datasets before moving onto something that interests u more and u have some domain knowledge over to apply

#

Fashion mnist was my intro to CNNs

uneven thunder Jun 6, 2023, 12:30 PM

#

somethink like this?

mild dirge Jun 6, 2023, 12:30 PM

#

Can probably also use a regular MLP for (fashion) mnist because the images are so small

cold osprey Jun 6, 2023, 12:30 PM

#

Ye tbh just pick any that interests u

uneven thunder Jun 6, 2023, 12:31 PM

#

Okay. I'll give it a shot.

#

is this something that's best done in a notebook?

cold osprey Jun 6, 2023, 12:31 PM

#

lost pier <@342346882800025600> I have a largish dataset, 130,000 odd rows. there are two ...

Braining this rn, maybe u could show some example rows before the applymap step?

#

Otw home rn, will look in more detail in a bit

cold osprey Jun 6, 2023, 12:32 PM

#

uneven thunder is this something that's best done in a notebook?

Yeah u can use a notebook

uneven thunder Jun 6, 2023, 12:42 PM

#

what is the random_state variable in X_train, X_test, y_train, y_test = train_test_split(features, labels, test_size=0.2, random_state=42)

cold osprey Jun 6, 2023, 12:42 PM

#

Just a number to randomize the split

#

For consistency between runs

uneven thunder Jun 6, 2023, 12:42 PM

#

okay

#

okay i solved it with a decision tree since it's just numbers.

#

I wonder if

#

hm

cold osprey Jun 6, 2023, 12:46 PM

#

Haha wdym solve

potent sky Jun 6, 2023, 12:47 PM

#

uneven thunder Okay more general question. What model would be suited for such a task. I'm curr...

try much lesser parameters

lost pier Jun 6, 2023, 12:48 PM

#

cold osprey Braining this rn, maybe u could show some example rows before the applymap step?

sure, here is a sample:

                      workflow       cost_centre
220     56860820     "ott"     {"ott": "2000920243", "txt": " "}
221     56860822     "txt"     {"txt": " "}
222     56860823     "txt"     {"ott": "2000920243", "txt": " "}
223     56860823     "ott"     {"ott": "2000920243", "txt": " "}
224     56860824     "txt"     {"txt": " "}
225     56860825     "txt"     {"txt": " "}
226     56860827     "txt"     {"txt": " "}
227     15694706     "txt"     {"txt": " "}
228     9877816     "txt"     {"txt": " "}
229     56860828     nan     {"processing": "DE"}
230     56860828     nan     {"processing": "DE"}
231     56860830     nan     {"processing": "DE"}
232     56860831     "txt"     {"txt": " "}

for the record, the result should come out as the following and does untill it sees "nan", which I had no idea was there untill I drilled into the data:

                      workflow       cost_centre
220     56860820     "ott"     {"ott": "2000920243"}
221     56860822     "txt"     {"txt": " "}
222     56860823     "txt"     {"ott": "2000920243"}
223     56860823     "ott"     {"ott": "2000920243"}
224     56860824     "txt"     {"txt": " "}
225     56860825     "txt"     {"txt": " "}
226     56860827     "txt"     {"txt": " "}
227     15694706     "txt"     {"txt": " "}
228     9877816     "txt"     {"txt": " "}
229     56860828     nan     {"processing": "DE"}
230     56860828     nan     {"processing": "DE"}
231     56860830     nan     {"processing": "DE"}
232     56860831     "txt"     {"txt": " "}

For more context I've labelled the columns though this was taken from the start of the fail row 229. So it is using the value in workflow to match the key in the cost_centre column, which can have up to 5 key value pairs in. They do match up, as the workflow has been exploded so that there is a workflow per row, this is just the last piece of the puzzle so that the correct cost centre is also showing on that row.

uneven thunder Jun 6, 2023, 12:49 PM

#

cold osprey Haha wdym solve

"the challange"

#

maybe solve wasn't the right word

#

this seems about right

#

Okay, i now watched a bunch of libraries create a tree which works .

cold osprey Jun 6, 2023, 12:58 PM

#

u can add on to it

#

create more features

#

tune hyper parameters

rose dagger Jun 6, 2023, 1:01 PM

#

A bit of an odd error: I trained a neural net with the following architecture (see image) and called model.predict(x) on one of the training data points and got the following error. The training worked without any errors, so what's the issue here?

#

The data point x is a 512x512 array

cold osprey Jun 6, 2023, 1:03 PM

#

lost pier sure, here is a sample: ``` workflow cost_centre 220...

hmm, need to see what literal_eval does under the hood

tidal bough Jun 6, 2023, 1:03 PM

#

rose dagger A bit of an odd error: I trained a neural net with the following architecture (s...

The error says the input must be 4-dimensional. Usually that's datapoint_index, height, width, channel, so I'd guess you want to reshape it to (1,512,512,1), assuming you only have one sample and the images are one-channel.

cold osprey Jun 6, 2023, 1:03 PM

#

https://stackoverflow.com/questions/52232742/how-to-use-ast-literal-eval-in-a-pandas-dataframe-and-handle-exceptions seems related

Stack Overflow

How to use ast.literal_eval in a pandas dataframe and handle except...

I have a dataframe with a column containing a tuple data as a string. Eg. '(5,6)'. I need to convert this to a tuple structure. One way of doing it is using the ast.literal_eval(). I am using it in...

rose dagger Jun 6, 2023, 1:04 PM

#

tidal bough The error says the input must be 4-dimensional. Usually that's `datapoint_index,...

Thanks, i'll try that. Why exactly is a datapoint_index necessary? What if i want to predict new data points, for which i may not have an index?

cold osprey Jun 6, 2023, 1:06 PM

#

rose dagger Thanks, i'll try that. Why exactly is a datapoint_index necessary? What if i wan...

u can pass 2 images at once (dependant on model) so thats another dimension

#

thats how i see it

rose dagger Jun 6, 2023, 1:07 PM

#

Oh i see.

#

meaning if i wanted to predict 3 data points x1,x2,x3 simultaneously i'd have the input shape as (3,512,512,num_channels), where the first dimension is merely indexing the data points i give as an input

uneven thunder Jun 6, 2023, 1:13 PM

#

cold osprey tune hyper parameters

time to learn what that is

cold osprey Jun 6, 2023, 1:14 PM

#

rose dagger meaning if i wanted to predict 3 data points x1,x2,x3 simultaneously i'd have th...

not 100% sure, i was thinking more of like having a model that takes 2 images as inputs

uneven thunder Jun 6, 2023, 1:25 PM

#

Okay so but like why can't we task an AI to build a better AI?

#

trol

#

sorry.

tidal bough Jun 6, 2023, 1:37 PM

#

rose dagger meaning if i wanted to predict 3 data points x1,x2,x3 simultaneously i'd have th...

Yup, it's just that the input has to be 4d even if you only have one sample (in which case the shape along axis 0 should be 1).

manic cave Jun 6, 2023, 2:43 PM

#

how difficult would it be to train a model with pytorch that detects bugs and inserts a print statement after that line

serene scaffold Jun 6, 2023, 2:59 PM

#

manic cave how difficult would it be to train a model with pytorch that detects bugs and in...

exceptionally

#

that's like god tier

hazy knot Jun 6, 2023, 3:43 PM

#

Is there a go-to or default method for model explainability?

tall tulip Jun 6, 2023, 3:43 PM

#

I've standarized the data and also trained model with that data:

data_mean = data.mean()
data_std = data.std()

norm_data = (data - data_mean) / data_std```
**Now I want to inverse the predicted values **
```inverse_data = (predicted_arr * data_) + data_mean```
**But it gives me the below error how can i handle this?**
```ValueError: Length of values (14604) does not match length of index (2)```

fleet heath Jun 6, 2023, 3:59 PM

#

tall tulip **I've standarized the data and also trained model with that data:** ```data = d...

what does your dataset_std and dataset_mean look like?

hasty mountain Jun 6, 2023, 4:01 PM

#

serene scaffold that's like god tier

An AI capable of debugging itself?
Is this the next step towards Skynet? brainmon

#

Hey @potent sky, since you're into trying some things on latent generative models, maybe this may be useful to you:
https://arxiv.org/pdf/2006.10273.pdf

It's a tutorial on Variational AutoEncoders, where it's explained more about the theory and mathematics around VAEs. It also talk about the confusion around the Decoder Loss (MSE or Likelihood).
My professor sent me this yesterday. Seems interesting.

#

I just don't really get one thing, though: if the ELBo Loss is more accurately applied when using a Likelihood metric(like Gaussian Likelihood)...why does it works with MSELoss in Diffusion Models?
I mean...I remember the sampling function for diffusion probabilistic models is based on ELBo... pithink

potent sky Jun 6, 2023, 4:14 PM

#

hasty mountain Hey <@833644804670750750>, since you're into trying some things on latent genera...

Hey thanks! I'll check it out

potent sky Jun 6, 2023, 4:15 PM

#

hasty mountain I just don't really get one thing, though: if the ELBo Loss is more accurately a...

Yeah I read into it long ago I don't remember all the details. I do remember being very satisfied tho the math was goood xd

potent sky Jun 6, 2023, 4:15 PM

#

hasty mountain I just don't really get one thing, though: if the ELBo Loss is more accurately a...

https://arxiv.org/abs/2107.00630 this might help a bit I think

arXiv.org

Variational Diffusion Models

Diffusion-based generative models have demonstrated a capacity for
perceptually impressive synthesis, but can they also be great likelihood-based
models? We answer this in the affirmative, and introduce a family of
diffusion-based generative models that obtain state-of-the-art likelihoods on
standard image density estimation benchmarks. Unlike o...

hasty mountain Jun 6, 2023, 4:16 PM

#

Thanks!

tall tulip Jun 6, 2023, 4:53 PM

#

fleet heath what does your `dataset_std` and `dataset_mean` look like?

sorry I've edit the question, it's the same 'data_mean' and 'data_std' and their values are mention below

column1    22.346957
column2     21.629736
dtype: float64
data_std value: 
column1    6.098700
column2     4.249352
dtype: float64```

#

@fleet heath here is the complete question kindly look at it

https://stackoverflow.com/questions/76416813/inverse-standardization-of-predicted-values

Stack Overflow

Inverse Standardization of predicted values

I've dataset with two columns. At first I've split the data into train, val and test and after that I've standardized all the data (train, val and test).
train_mean = train_data.mean()
train_std =

manic cave Jun 6, 2023, 5:04 PM

#

serene scaffold exceptionally

What if it will only detect Exceptions? It'll only be for Java code

cunning vector Jun 6, 2023, 5:33 PM

#

Hello all, qq. is it fine to run on old pandas version forever, as new pandas versions throwing merge error

#

this merge error was just a warning in older versions

agile cobalt Jun 6, 2023, 5:35 PM

#

ideally you should adjust your code that it does not gives you neither warnings nor errors

#

if it works on an old version, technically you can just never update anything and keep using it exactly as is, but if you ever need to add new features to it, or if security is a concern (e.g. web servers), you may want to update things

past meteor Jun 6, 2023, 5:38 PM

#

How do you guys decide where you're going to publish especially if you're doing more applied stuff (like in my case personal health)?

#

Like what helps you decide if you're going for an AI journal or a health (or any other) journal?

cunning vector Jun 6, 2023, 5:38 PM

#

agile cobalt if it works on an old version, technically you can just never update anything an...

its in my Local, im never gonna move it to prod.

coral field Jun 6, 2023, 7:32 PM

#

What's the difference between Tensorflow's .numpy() and Numpy's np.array()? How does functionality change if I choose one over the other?

hasty mountain Jun 6, 2023, 7:57 PM

#

I suppose Tensorflow will simply call np.array() while manipulating the data so the operation can be as efficient as possible

tidal bough Jun 6, 2023, 8:11 PM

#

coral field What's the difference between Tensorflow's ```.numpy()``` and Numpy's ```np.arra...

i'd expect no difference at all

#

maybe .numpy can avoid copying the data.

night kernel Jun 6, 2023, 8:15 PM

#

anyone hear about 'openchatkit' from redpajama? https://twitter.com/togethercompute/status/1666067674382888961

Together (@togethercompute)

Announcing RedPajama 7B trained on 1T tokens! 🚀

• Instruct, chat, base, and interim checkpoints on
@huggingface
• The instruct model outperforms all open 7B models on HELM benchmarks
• The 5TB dataset has been used to train over 100 models

Details👇

https://t.co/oUNKqYBmlS

Likes

358

Retweets

106

#

released this morning, is apparently one of the best open source chat models to-date. if you had to say, what do you believe is the best open source LLM

crimson summit Jun 6, 2023, 9:07 PM

#

I coded my first neural network and I finally got it to work lets goooo

#

97% accuracy

mild dirge Jun 6, 2023, 9:19 PM

#

What was the mistake in the end? @crimson summit

toxic mortar Jun 6, 2023, 9:20 PM

#

Has anyone watched this ? https://www.youtube.com/watch?v=pdJQ8iVTwj8&list=PL4_UwQwZnULUCwyjPOczIE3wE5FAH1Tfl&index=4&ab_channel=LexFridman

YouTube

Lex Fridman

Chris Lattner: Future of Programming and AI | Lex Fridman Podcast #381

Chris Lattner is a legendary software and hardware engineer, leading projects at Apple, Tesla, Google, SiFive, and Modular AI, including the development of Swift, LLVM, Clang, MLIR, CIRCT, TPUs, and Mojo. Please support this podcast by checking out our sponsors:

iHerb: https://lexfridman.com/iherb and use code LEX to get 22% off your order
N...

▶ Play video

crimson summit Jun 6, 2023, 9:21 PM

#

mild dirge What was the mistake in the end? <@900966051280474122>

I had to multiply the cost with respect to a2 by -2 instead of 2 and I had to cut my learning rate in half

mild dirge Jun 6, 2023, 9:21 PM

#

I can't stand lex's voice, he sounds high as a kite and his questions are so weird(?) sometimes

crimson summit Jun 6, 2023, 9:22 PM

#

toxic mortar Has anyone watched this ? https://www.youtube.com/watch?v=pdJQ8iVTwj8&list=PL4_U...

I saw some clips. Is mojo going to be the new lang 👀 ?

past meteor Jun 6, 2023, 9:32 PM

#

Lex also has some hot takes but who am i to judge on that front

vale idol Jun 6, 2023, 10:27 PM

#

Hi, I have a question regarding how to assign values to series in dataframes. I have a (main) dataframe that is divided into multiple years which also contains various kinds of scores. Each year has 1 unique score attached to an identifier. I would like to calculate deciles for every year (shown in code below) and do this using the yearly dataframe. Unfortunately, I have issues assigning this back to the original dataframe. Additionally, although the code below is only for 1 year, I would like to make a for loop function that does each year in the original dataframe. Any help is really appreciated 🙂

'''py
sustainalytics_scores = ['total_esg_score', 'environment_score', 'social_score', 'governance_score']
sustainalytics_c = sustainalytics.copy()
labels_30th_p = ['1', '2', '3', '4', '5', '6', '7', '8', '9', '10']

yearly = sustainalytics.loc[sustainalytics['year'] == 2014, ['isin'] + sustainalytics_scores]

sustainalytics_c.loc[sustainalytics_c['year'] == 2014, ['ESG_measure_sorts_30']] = pd.qcut(yearly['total_esg_score'], q=10, labels=labels_30th_p)
'''

plain jungle Jun 6, 2023, 10:52 PM

#

crimson summit I coded my first neural network and I finally got it to work lets goooo

Congrats!

night kernel Jun 6, 2023, 11:21 PM

#

crimson summit I coded my first neural network and I finally got it to work lets goooo

how did you do it? trying to learn myself

#

i was watching andrej karpathy's let's build gpt' from january, but i feel that models have progressed since then

hasty mountain Jun 6, 2023, 11:46 PM

#

night kernel i was watching andrej karpathy's let's build gpt' from january, but i feel that ...

Go forth. The models might have evolved, but if you know the fundamentals, you may be able to adapt pretty fine.

thorn swift Jun 7, 2023, 12:13 AM

#

im so bored, im desperate for a project if anyone is working on something

somber panther Jun 7, 2023, 1:05 AM

#

where might a look for some open source projects i might be able to contribute to while i'm learning ds?

agile cobalt Jun 7, 2023, 1:06 AM

#

you can play around with open datasets on Kaggle and try participating in their Competitions

somber panther Jun 7, 2023, 1:07 AM

#

is an idea, i don't really thrive in competitive settings

#

feel like id be more motivated if it was something i could invest myself in

#

that's a useful lead though, seeing a lot of libraries in use that i'm currently studying

potent sky Jun 7, 2023, 2:07 AM

#

somber panther is an idea, i don't really thrive in competitive settings

Kaggle competitions are generally months long so ig you could invest yourself

faint marten Jun 7, 2023, 4:33 AM

#

vale idol Hi, I have a question regarding how to assign values to series in dataframes. I ...

Hi I hope you had found a solution. If not, could you clarify what your data frame looks like? So you have a main data frame, with columns [‘year’ , ‘total_score’, ‘env_score’, ‘soc_score’, ‘gov_score’], so each year is one row? Or you have an additional column like ‘city’, so each year is N rows, where N is the number of cities?

lapis sequoia Jun 7, 2023, 4:43 AM

#

guys why does precision have two values when produced using a classification report in scikit learn

#

#

1 and 0

#

I thought precision = TP/(TP + FP) where TP = True positive, FP = false positive

#

how are there seperate values for 1 and 0

#

is it that the classification_report function is not assuming that 1 means positive and 0 means negative and is thus calculating the precision twice

#

once for 1 as positive and then 0 for positive

agile cobalt Jun 7, 2023, 4:50 AM

#

it is made to support multiclass classification, not just binary classification

#

take a look at the documentation https://scikit-learn.org/stable/modules/model_evaluation.html#classification-report

scikit-learn

3.3. Metrics and scoring: quantifying the quality of predictions

There are 3 different APIs for evaluating the quality of a model’s predictions: Estimator score method: Estimators have a score method providing a default evaluation criterion for the problem they ...

dusk bear Jun 7, 2023, 7:01 AM

#

guys...
a doubt regarding how to use precision and recall
actually i am building a cnn model for plane detection
i got this precision and recall values but it didnt recognise the third aeroplane only. so is this right or wrong? or .. any comments
please suggest something..
first list is predicted boxes
second list is ground truth boxes

odd meteor Jun 7, 2023, 7:44 AM

#

lapis sequoia guys why does precision have two values when produced using a classification rep...

The metrics is shown per-class basis. This is because we might want to know how the model performed per class in the response variable (Y); since it's possible for one to be more interested in really seeing the model's performance on either the positive / negative class (for a binary classification problem) separately.

For example (Assume class label 1 is the positive class here and this is a titanic dataset), this affords you the liberty to infer that:

Precision: Out of all the passengers the model predicted would survive, only 84% actually survived.
Recall: Out of all the passengers that actually did survive, the model only predicted this outcome correctly for 89% of those passengers.

(Now you can also make such inference for the negative class with ease by focusing on the label 0)

Finally, you can as well get a general overview of each metric performance (not per-class level this time) by looking their respective average score.

You can find the complete documentation for the classification_report function here https://scikit-learn.org/stable/modules/generated/sklearn.metrics.classification_report.html

scikit-learn

sklearn.metrics.classification_report

Examples using sklearn.metrics.classification_report: Recognizing hand-written digits Recognizing hand-written digits Faces recognition example using eigenfaces and SVMs Faces recognition example u...

shadow quiver Jun 7, 2023, 7:47 AM

#

I have a data that 50m rows in Postgres. I could easily manage 1m part of this data even with Pandas.

Now I want to write this data to parquet using Pyspark. But it gives memory error (java heap). I even partitioned the data by 100. Why Spark can't handle it and do it in small batches?

rose dagger Jun 7, 2023, 9:10 AM

#

I'm working on an image segmentation task where i'm currently trying out the U-Net architecture. When making predictions with the trained model, i am getting images of the following form (see attached image). The boundaries seem to be causing some issues here. My guess is that the cause is a combination of (a) the convolution blocks "downsizing" the images together with (b) the decoding part of the U-Net then "upsizing" the images again.
What are some steps to take to remedy this problem? Note that the inputs are WxH sized and the outputs are WxH sized as well, i.e. the same size as the input images. One idea i had was to slightly "crop" the output/train images so that they are of size (W-n)x(H-n), so that i have removed the boundary. It seems that in this Kaggle competition (https://www.kaggle.com/competitions/hubmap-kidney-segmentation/discussion/238198) the winning solution did exactly that. Any thoughts?

HuBMAP - Hacking the Kidney

Identify glomeruli in human kidney tissue images

lost pier Jun 7, 2023, 10:02 AM

#

Hi there, wonder if anyone can help, I've been trying to drop na values from a dataframe and it just will not go. I've tried the following:

df = df.dropna(subset=['workflow'])

I've tried :

df.dropna(subset=['workflow'], inplace=True)

I've tried:

test_df = df[['workflow']]
test_df = test_df.dropna()

and I've tried:

test_df = df[['workflow']]
test_df.dropna(inplace=True)

Bonus round, I've tried

df = df[df['workflow'].notna()]

In fact, the nan values in the dataframe do not even show up as True if isna() is applied. What else can I do to rid my data frame of this plague please?

boreal gale Jun 7, 2023, 10:08 AM

#

lost pier Hi there, wonder if anyone can help, I've been trying to drop na values from a d...

show us your dataframe, that should have worked

lost pier Jun 7, 2023, 10:10 AM

#

boreal gale show us your dataframe, that should have worked

Hi, here:

224    "soip"
225    "soip"
226       nan
227    "soip"
228    "soip"

#

This is now just the single column and it still won't go, I'm actually just trying to check for na to see if this is what is messing up my function on the larger dataframe, but I just don't understand why I can never get this to work without a fight

potent garnet Jun 7, 2023, 10:14 AM

#

Hello everyone, is anyone interested in Kaggle competitions?

cold osprey Jun 7, 2023, 10:16 AM

#

np.nan()

boreal gale Jun 7, 2023, 10:16 AM

#

lost pier Hi, here: ``` 224 "soip" 225 "soip" 226 nan 227 "soip" 228 "so...

colum dtype?

cold osprey Jun 7, 2023, 10:16 AM

#

prob a dtype thingy

lost pier Jun 7, 2023, 10:17 AM

#

224    "soip"
225    "soip"
226       nan
227    "soip"
228    "soip"
Name: Workflow, dtype: object

cold osprey Jun 7, 2023, 10:17 AM

#

cold osprey `np.nan()`

replace with this to None

dusk bear Jun 7, 2023, 10:17 AM

#

lost pier Hi there, wonder if anyone can help, I've been trying to drop na values from a d...

capital W

#

u did workflow

#

do Workflow

boreal gale Jun 7, 2023, 10:18 AM

#

good eye, if that doesn't work then see what type(col.loc[227]) gives you

lost pier Jun 7, 2023, 10:18 AM

#

yes sorry, i've just changed the name of the column as it's work data and I don't want to get into trouble, that was just a typo

dusk bear Jun 7, 2023, 10:18 AM

#

lost pier yes sorry, i've just changed the name of the column as it's work data and I don'...

ah then nvm

cold osprey Jun 7, 2023, 10:19 AM

#

try checking with .isnull()

#

see if it returns True

lost pier Jun 7, 2023, 10:20 AM

#

type(new_df.loc[227])

returns str

dusk bear Jun 7, 2023, 10:20 AM

#

boreal gale good eye, if that doesn't work then see what `type(col.loc[227])` gives you

but if there r multiple data types in a column it will give that particular dtype na?

dusk bear Jun 7, 2023, 10:20 AM

#

lost pier ``` type(new_df.loc[227]) ``` returns str

yea thats what

#

well ig its nan not np.nan

#

so replace nan with np.nan

#

and then do dropna

lost pier Jun 7, 2023, 10:20 AM

#

ah, ok, how can I fix that the original dataframe is 139,000 rows lol

cold osprey Jun 7, 2023, 10:21 AM

#

^

#

replace then drop

dusk bear Jun 7, 2023, 10:22 AM

#

new_df.replace('nan',np.nan)

#

and then the dropna code u wrote with subset

boreal gale Jun 7, 2023, 10:22 AM

#

lost pier ``` 224 "soip" 225 "soip" 226 nan 227 "soip" 228 "soip" Name: ...

i see, your data is really weird..
you have value of "soip" which is a string of literally "soip" including the "
and you also have value of nan which is also a string of literally nan

#

the above suggestions should work, but i would look into why your data is like that in the first place

dusk bear Jun 7, 2023, 10:22 AM

#

boreal gale the above suggestions should work, but i would look into *why* your data is like...

yea true..

#

which dataset are you working on btw? @lost pier

alpine temple Jun 7, 2023, 10:25 AM

#

Anyone here a PyTorch whisperer?

I've attempted to build a SqueezeNet, and it blows.

lost pier Jun 7, 2023, 10:25 AM

#

dusk bear which dataset are you working on btw? <@398356726656794637>

@dusk bear I have a large data set that with two columns I am working with, one is and array which has been exploded the other is a json object that I am trying to map with the result of the exploded column, but I hadn't seen the nan values till yesterday, so now I am trying to find a way to skip over the nan values as this is just a pipeline transformation for financial data, so nothing can be dropped

alpine temple Jun 7, 2023, 10:25 AM

#

Wondering if I could talk through my hyperparameters with someone, along with a sanity check.

lost pier Jun 7, 2023, 10:28 AM

#

boreal gale i see, your data is really weird.. you have value of `"soip"` which is a string...

Yes this data is very nasty it seems, I have put it through JSON.dumps and JSON.loads in an attempt to clean it but that might be what is causing the problem now I look at it

dusk bear Jun 7, 2023, 10:28 AM

#

ahh.. ok..

boreal gale Jun 7, 2023, 10:30 AM

#

lost pier <@917039383067103242> I have a large data set that with two columns I am working...

I am working with, one is and array which has been exploded the other is a json object that I am trying to map with the result of the exploded column
could you share some redacted examples? maybe there is a better way than json.dumps/loads?

lost pier Jun 7, 2023, 10:31 AM

#

The Json loads and dumps was an attempt yesterday, i've removed that now, I'll show you the code that works up to the nan values, one second

boreal gale Jun 7, 2023, 10:33 AM

#

oh my apologies, i somehow took it as the json.dumps/loads caused this weirdness.
but yes, showing what you have got would be useful

lost pier Jun 7, 2023, 10:37 AM

#

file = glob(f"{file_path}*.csv")[0]
df = pd.read_csv(file, encoding='utf-8')
df = df.replace({'\'': '"'}, regex=True)
df["Workflow"] = df["Workflow"].str.strip("[]").astype(str)
df["Workflow"] = df["Workflow"].str.split(",")
df = df.assign(Item_Cost_This_Month=df["Cost This Month"] / df["Workflow"].str.len())
df = df.assign(Item_Cost_Next_Month=df["Cost Next Month"] / df["Workflow"].str.len())
df = df.explode("Workflow").reset_index(drop=True)
df[["Workflow", "Cost_Centre"]] = df[["Workflow", "Cost_Centre"]].applymap(ast.literal_eval)

So the above code, works really well so long as there are no null values, here is a sample row of the whole data:

04/30/2023 23:24:26     1242360.0     LongForm     04/30/2023 23:24:26     05/30/2023 00:00:00     True     0     1     29     0.0     0.12     3.34     ['soip', 'ott']     uk     {'ott': '1234567890', 'soip': ' '}     abc    xyz    prd     NaN     NaN     NaN

workflow is the array, and cost code it the key value pair

rose dagger Jun 7, 2023, 10:38 AM

#

rose dagger I'm working on an image segmentation task where i'm currently trying out the U-N...

I've never posted on ai stackexchange, so could someone who is more active on that site tell me whether a question like this would be appropriate to ask there? Or is the site not meant for such questions?

lost pier Jun 7, 2023, 10:39 AM

#

The above only trips up when it gets to a row where the value in the array field is nan, as this value is used to map the key value pair, I just didn't expect skipping over it would be such a battle

boreal gale Jun 7, 2023, 10:40 AM

#

lost pier ``` file = glob(f"{file_path}*.csv")[0] df = pd.read_csv(file, encoding='utf-8')...

👍 this is a very good start to nailing what issue is plaguing you, now could you post some problematic (and some normal) rows? redact info if necessary

tidal bough Jun 7, 2023, 10:41 AM

#

boreal gale the above suggestions should work, but i would look into *why* your data is like...

df["Workflow"] = df["Workflow"].str.strip("[]").astype(str)

this is mildly concerning to me; I suspect it might be what's stringifying everything

lost pier Jun 7, 2023, 10:45 AM

#

yes, so here is a row after the explode, but before the ast line:

True     0     1     29     0.0     0.12     3.34     'ott'     uk     {'ott': '2000920243', 'soip': ' '}

and here is a line that is causing an issue:

False     0     0     0     0.0     0.0     0.0     nan     de     {'content_processing': 'XYZ'}

the above line is the first one that fails, and after looking at the csv and manually copying it out, I saw the issue and then after more digging, found that this was what was stopping it. Today I thought if I dropped all the nan's I could validate that theory lol

lost pier Jun 7, 2023, 10:51 AM

#

tidal bough ```py df["Workflow"] = df["Workflow"].str.strip("[]").astype(str) ``` this is mi...

Ah right let me explain that one for you, the data looks like an array, but is not, so I convert to a string to remove the square brackets so I can then split it back into an array, but yes I see what you are referring to

#

That's the column that's causing the problem too

tidal bough Jun 7, 2023, 10:52 AM

#

Possibly you want to do something like a json.loads followed by pandas.json_normalize

boreal gale Jun 7, 2023, 10:52 AM

#

can we have the header as well so we are on the same page? or just highlight which column is which (the ones you have used anyway)

lost pier Jun 7, 2023, 10:54 AM

#

Yes it's the columns that have nan and the key value pair, these ones:

workflow      cost_centre
'ott'      {'ott': '2000920243', 'soip': ' '}
nan        {'content_processing': 'XYZ'}

#

I have to split this up for these two columns and produce another csv that can then carry on down the pipeline into google big query I think it goes

boreal gale Jun 7, 2023, 10:58 AM

#

i understand now.
is workflow really 'ott'?
or is it "ott"?
only the later is valid JSON

lost pier Jun 7, 2023, 10:59 AM

#

It actually comes in from the csv as ['ott'], but I dont' know if that's python doing that

#

same with the key value pair, it looks like json with single quotes, which I thought was not valid json,

#

I did put this in there:

df = df.replace({'\'': '"'}, regex=True)

not sure if it was in the above code, I've tried all sorts of things to clean this up, I'm getting a little lost

#

So after that I tried the json.dumps and json.loads, and that did get the cost centre column into a valid json format, howerver literal_eval was working on the single quote dictionary version to be fair.

boreal gale Jun 7, 2023, 11:06 AM

#

hopefully this gives you some inspiration.

#

!e

import pandas as pd
import ast
df = pd.DataFrame({"itemgetters": ["['a', 'b']", "['a']"], "lookup": ["{'b': 'quack', 'a': 'meow'}", "{'b': 'quack', 'a': 'meow2'}"]})
df_parsed = df.applymap(ast.literal_eval).explode('itemgetters').reset_index()

lookup_values = pd.concat(
[
  df_group['lookup'].str[key]
  for key, df_group in df_parsed.groupby('itemgetters')
]
)
res = pd.concat([
lookup_values,
df_parsed,
],axis=1)

print(df)
print(df_parsed)
print(lookup_values)
print(res)

arctic wedgeBOT Jun 7, 2023, 11:06 AM

#

@boreal gale :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 |   itemgetters                        lookup
002 | 0  ['a', 'b']   {'b': 'quack', 'a': 'meow'}
003 | 1       ['a']  {'b': 'quack', 'a': 'meow2'}
004 |    index itemgetters                        lookup
005 | 0      0           a   {'b': 'quack', 'a': 'meow'}
006 | 1      0           b   {'b': 'quack', 'a': 'meow'}
007 | 2      1           a  {'b': 'quack', 'a': 'meow2'}
008 | 0     meow
009 | 2    meow2
010 | 1    quack
011 | Name: lookup, dtype: object
... (truncated - too many lines)

Full output: https://paste.pythondiscord.com/xadogowike.txt?noredirect

boreal gale Jun 7, 2023, 11:06 AM

#

i gotta bail now, good luck!

lost pier Jun 7, 2023, 11:08 AM

#

@boreal gale thank you for your help sir, it has been very inspiring for sure

#

@tidal bough thanks very much for your help also, you have indeed correctly identified what was causing that problem. I am able to dropna() straight after ingest

rose dagger Jun 7, 2023, 12:02 PM

#

Ok, i have posted my question on StackExchange: https://ai.stackexchange.com/questions/40742/convolutional-neural-network-struggling-at-the-boundary-of-images
I hope some of you might be able to help, but even if not, i'd appreciate an upvote on the question, if you think it is well-posed and interesting, in order to increase its visibility.

Artificial Intelligence Stack Exchange

Convolutional Neural Network struggling at the Boundary of Images

The setting is the following: As input data we are given $512\times 512$ images, in which we are supposed to identify certain regions in the image.
Input Image
Output Image (Binary)
For this, one...

brave sand Jun 7, 2023, 2:30 PM

#

can someone help me with object detection? how do I convert xml to csv for tf records?

frail dune Jun 7, 2023, 2:46 PM

#

Hey, I'm currently researching about digital twins and simulation and I wanted to ask whether someone here has some knowledge and could answer me some questions and give me a small overview on the topic (pm if its ok)

serene scaffold Jun 7, 2023, 2:51 PM

#

frail dune Hey, I'm currently researching about digital twins and simulation and I wanted t...

people aren't likely to want to DM you to find out of the questions are ones that they know the answer to, so you should ask your questions here.

crystal obsidian Jun 7, 2023, 3:22 PM

#

import nltk
# nltk.download()
from nltk.tokenize import word_tokenize
from spellchecker import SpellChecker
from gingerit.gingerit import GingerIt
from transformers import AutoTokenizer, T5ForConditionalGeneration

# Step 1: Tokenization
def tokenize_text(text):
    return word_tokenize(text)

# Step 2: Spell Checking
def correct_spelling(tokens):
     spell = SpellChecker()
     corrected_tokens = [spell.correction(token) for token in tokens]
     return corrected_tokens

# # Step 3: Grammar Correction
def correct_grammar(text):
     parser = GingerIt()
     result = parser.parse(text)
     corrected_text = result['result']
     return corrected_text

# # Step 4: Missing or Extra Words
def correct_missing_or_extra_words(text):

    tokenizer = AutoTokenizer.from_pretrained("grammarly/coedit-large")
    model = T5ForConditionalGeneration.from_pretrained("grammarly/coedit-large")
    input_text = text
    input_ids = tokenizer(input_text, return_tensors="pt").input_ids
    outputs = model.generate(input_ids, max_length=256)
    edited_text = tokenizer.decode(outputs[0], skip_special_tokens=True)


    return edited_text


# Example usage
input_text = "Thiiss is aa testt sentnce with spelng mistakas."
tokens = tokenize_text(input_text)
corrected_tokens = correct_spelling(tokens)
corrected_text = ' '.join(corrected_tokens)
corrected_text = correct_grammar(corrected_text)
corrected_text = correct_missing_or_extra_words(corrected_text)

print(corrected_text)
# for i in range (0, len(corrected_tokens)):
#   print(corrected_tokens[i])

this code is extremely slow bcz of the 4th function
also the output was expected:
This is a test sentence with spelling mistakes
but what I got:
This is a test sentence to see if I can spot mistakes.

lapis sequoia Jun 7, 2023, 3:58 PM

#

which is more likely to cause overfitting in random forests. high number of estimators or low.

lapis sequoia Jun 7, 2023, 4:16 PM

#

import math
import time
from pynput import keyboard, mouse

is_active = False
last_toggle_time = 0

def on_press(key):
    global is_active, last_toggle_time
    try:
        if key.char.lower() == 'c':
            current_time = time.time()
            if current_time - last_toggle_time > 0.5:
                is_active = not is_active
                last_toggle_time = current_time
                if is_active:
                    start_spinbot()
    except AttributeError:
        pass

def start_spinbot():
    screenSize = mouse.Controller().position
    centerX = screenSize[0] / 2
    centerY = screenSize[1] / 2
    radius = 200
    angularSpeed = 0.1

    mouseController = mouse.Controller()

    angle = 0
    while is_active:
        x = centerX + radius * math.cos(angle)
        y = centerY + radius * math.sin(angle)

        mouseController.position = (x, y)

        angle += angularSpeed

        time.sleep(0.01)

def on_release(key):
    if key == keyboard.Key.esc:
        return False

def main():
    print('Press "c" to activate/deactivate the spinbot. Press "Esc" to exit.')
    with keyboard.Listener(on_press=on_press, on_release=on_release) as listener:
        listener.join()

if __name__ == '__main__':
    main()

i can't find a channel for my issue really but my code is meant to spin the cursor around a 1440p native screen but well not only does it not spin it in the middle but it also doesn't stop after repressing C

cold osprey Jun 7, 2023, 4:29 PM

#

lapis sequoia ```py import math import time from pynput import keyboard, mouse is_active = Fa...

should it work on a UW screen?

#

trynna run it rn

lapis sequoia Jun 7, 2023, 4:30 PM

#

if i run it it works fine but like i said just doesnt even spin in the middle of the screen and it does not stop no matter what i press or well until i alt f4 out of it

cold osprey Jun 7, 2023, 4:30 PM

#

ok sec lemme try

frail dune Jun 7, 2023, 4:31 PM

#

Does anybody know whether its possible to simulate a digital twin of a CAD model in python?

#

and if yes does anybody have a paper or link to a readme or w.e.

lapis sequoia Jun 7, 2023, 4:37 PM

#

cold osprey ok sec lemme try

did it work

cold osprey Jun 7, 2023, 4:38 PM

#

sec setting up env

#

wanna install pynput in separate venv

#

hmm

#

mouse aint spinning

#

i can press esc to exit tho

#

wait nbvm

#

i didntr press c lol

cold osprey Jun 7, 2023, 4:47 PM

#

lapis sequoia did it work

im thinking coz when start_spinbot() is running, it doesnt register any keystrokes?

lapis sequoia Jun 7, 2023, 5:02 PM

#

cold osprey im thinking coz when `start_spinbot()` is running, it doesnt register any keystr...

very possible

brave sand Jun 7, 2023, 5:18 PM

#

what do I put as a checkpoint for tensorflow?

#

https://github.com/tensorflow/models/blob/master/research/object_detection/configs/tf2/ssd_mobilenet_v2_320x320_coco17_tpu-8.config

GitHub

models/ssd_mobilenet_v2_320x320_coco17_tpu-8.config at master · ten...

Models and examples built with TensorFlow. Contribute to tensorflow/models development by creating an account on GitHub.

#

on line 145

past meteor Jun 7, 2023, 5:24 PM

#

brave sand what do I put as a checkpoint for tensorflow?

Why do you want checkpoints? Do you know exactly what they are?

brave sand Jun 7, 2023, 5:24 PM

#

past meteor Why do you want checkpoints? Do you know exactly what they are?

the guide said to use a checkpoint

past meteor Jun 7, 2023, 5:24 PM

#

Can we take a step back for a second, what are you trying to do?

brave sand Jun 7, 2023, 5:25 PM

#

i am trying to train an object detector with mobilenet-ssd v2 320x320

#

#

i have these files, im not sure which one to use

#

https://towardsdatascience.com/custom-object-detection-using-tensorflow-from-scratch-e61da2e10087
this was the guide I am using, it doesn't go in depth though

Medium

Custom Object Detection using TensorFlow from Scratch

Custom Dataset Training for Object Detection using TensorFlow | Dog Detection in Real time Videos | Perfect Guide for Object Detection

past meteor Jun 7, 2023, 5:26 PM

#

Is the object you're trying to detect not part of the coco classes?

brave sand Jun 7, 2023, 5:26 PM

#

no, it isnt

#

i already labelled my data

#

and converted to csv for tfrecords

past meteor Jun 7, 2023, 5:27 PM

#

Okay great, sorry for asking. Just wanted to be sure 🙂

brave sand Jun 7, 2023, 5:27 PM

#

i'm on the last step, training the model

#

i'm unsure on what the checkpoint is

past meteor Jun 7, 2023, 5:28 PM

#

In all honesty I don't know either by I'm going to have a look as well

cold osprey Jun 7, 2023, 5:28 PM

#

iirc checkpoint of the model during training?

past meteor Jun 7, 2023, 5:29 PM

#

Yeah but they seem to have multiple check point files

cold osprey Jun 7, 2023, 5:29 PM

#

oh its starting form the ssd_mobilenet_v2_coco checkpoint to train

brave sand Jun 7, 2023, 5:30 PM

#

on step 8, if you download the zip file, there are 3 checkpoint files
https://towardsdatascience.com/custom-object-detection-using-tensorflow-from-scratch-e61da2e10087

Medium

Custom Object Detection using TensorFlow from Scratch

Custom Dataset Training for Object Detection using TensorFlow | Dog Detection in Real time Videos | Perfect Guide for Object Detection

#

am I missing something?

past meteor Jun 7, 2023, 5:32 PM

#

brave sand on step 8, if you download the zip file, there are 3 checkpoint files https://to...

This answers it: https://stackoverflow.com/questions/41265035/tensorflow-why-there-are-3-files-after-saving-the-model

Stack Overflow

TensorFlow, why there are 3 files after saving the model?

Having read the docs, I saved a model in TensorFlow, here is my demo code:

Create some variables.

v1 = tf.Variable(..., name="v1")
v2 = tf.Variable(..., name="v2")
...

Add an op to initialize ...

brave sand Jun 7, 2023, 5:33 PM

#

so basically the meta file is the checkpoint file i'm looking for

#

@past meteor that doesn't work

potent sky Jun 7, 2023, 6:13 PM

#

If you want to use the checkpoint for training, all of them are important
The meta file describes the graph structure etc. The .data file has the actual model Weights

potent sky Jun 7, 2023, 6:15 PM

#

brave sand so basically the meta file is the checkpoint file i'm looking for

You can create a checkpoint object with tf.train.latest_checkpoint and then load weights in using the model.load_weights() method
This will probably be the simplest way

past meteor Jun 7, 2023, 6:15 PM

#

@potent sky can I use you as a sounding board for a second?

potent sky Jun 7, 2023, 6:16 PM

#

Or atleast used to be last I used it
TF undergoing too many changes atm ;-;

potent sky Jun 7, 2023, 6:16 PM

#

past meteor <@833644804670750750> can I use you as a sounding board for a second?

Sure

past meteor Jun 7, 2023, 6:18 PM

#

I want to make synthetic data (tabular use cases).

I was thinking of going with graphical models because I can specify how everything is related to each other first. Afterwards I sample from it and send it through a (V)AE to add a bit of unpredictable/non-boring noise.

Am I severly overengineering/overthinking this?

brave sand Jun 7, 2023, 6:18 PM

#

potent sky You can create a checkpoint object with `tf.train.latest_checkpoint` and then lo...

alright, let me try that

past meteor Jun 7, 2023, 6:19 PM

#

If all the relationships are linear and everything is independent w.r.t. each other then I'd obvs just sample my N variables and make a predetermined function that determines f(X_1, ..., X_N) but that's just too boring

brave sand Jun 7, 2023, 6:19 PM

#

i just activated tensorflow for my gpu

#

now the old code won't work

#

how the hell

potent sky Jun 7, 2023, 6:21 PM

#

past meteor I want to make synthetic data (tabular use cases). I was thinking of going wit...

Hmm I think it depends on the eventual requirements of the synthetic data, what level of information you want it to carry, what it's going to be used for no?

#

Your reasoning for using graphical models seems pretty sound. If I wanted data realism and had to capture relationships between different variables, this would be a good option

potent sky Jun 7, 2023, 6:22 PM

#

brave sand alright, let me try that

Be sure to check out the docs

potent sky Jun 7, 2023, 6:22 PM

#

brave sand how the hell

Are you on windows? TF GPU is not supported on windows anymore I think.
Overall tf is undergoing many changes

brave sand Jun 7, 2023, 6:22 PM

#

nvm, i'm back at the same error

brave sand Jun 7, 2023, 6:22 PM

#

potent sky Are you on windows? TF GPU is not supported on windows anymore I think. Overall ...

linux

#

do I just wait for this?

#data-science-and-ml

Get the transforms used to create our pretrained weights

Create some variables.

Add an op to initialize ...