#data-science-and-ml | Python | Page 75

twilit tundra Jul 31, 2023, 7:54 PM

#

So problem solved?

umbral charm Jul 31, 2023, 7:54 PM

#

twilit tundra So problem solved?

Yea Thanks you so much

#

Im still new to this library

#

numpy so much easier

twilit tundra Jul 31, 2023, 7:57 PM

#

You're welcome, np

umbral charm Jul 31, 2023, 7:57 PM

#

twilit tundra So problem solved?

One quick question, Can i ask why to print only certain colums its

print(df[['MA1', 'MA2']]) and not
print(df['MA1', 'MA2'])

twilit tundra Jul 31, 2023, 7:59 PM

#

df[col] is the way to select a subseries inside of the dataframe
df[[cols]] is the way to select a subdataframe
In the first case, the argument is a string, in the second case it's a list

umbral charm Jul 31, 2023, 8:00 PM

#

twilit tundra df[col] is the way to select a subseries inside of the dataframe df[[cols]] is t...

thank you!

void veldt Jul 31, 2023, 8:10 PM

#

umbral charm numpy so much easier

same. I hate pandas with a passion. It appears to be the standard people use, but I try to avoid it at all costs and just use numpy or just straight python if possible

left tartan Jul 31, 2023, 8:19 PM

#

void veldt same. I hate pandas with a passion. It appears to be the standard people use, bu...

I hate it but I’ve just learned to accept the terrible ergonomics.

umbral charm Jul 31, 2023, 8:20 PM

#

void veldt same. I hate pandas with a passion. It appears to be the standard people use, bu...

Yea ITs sucks ive never seen anything like it

lapis sequoia Jul 31, 2023, 8:29 PM

#

void veldt same. I hate pandas with a passion. It appears to be the standard people use, bu...

interesting, i find pandas to be the only thing I enjoy doing in python.

tidal bough Jul 31, 2023, 8:32 PM

#

umbral charm One quick question, Can i ask why to print only certain colums its ``` print(df...

To expand on that explanation, the reason df['MA1', 'MA2'] doesn't get you two columns is because ('MA1','MA2') (a tuple of two strings, yes) is a valid column name in pandas. Yeah.

void veldt Jul 31, 2023, 8:33 PM

#

lapis sequoia interesting, i find pandas to be the only thing I enjoy doing in python.

it just comes across to me as unintuitive and I don't really see it offering any advantage over say numpy. Everything pandas does in terms of data organization and modification can easily be done using just python (no dependencies required), and while various libraries like numpy, scipy, matplotlib, etc. work well together, the same cannot be said for pandas.

tidal bough Jul 31, 2023, 8:34 PM

#

i mean, you can replicate pandas's capabilities in numpy, but with quite a lot of effort - either structured arrays, or an array per column. Also, the moment you want a groupby, problems will start.

#

(people who dislike pandas might want to take a look at polars, though - it is similar but makes some different choices, like not having indexes)

umbral charm Jul 31, 2023, 8:35 PM

#

lapis sequoia interesting, i find pandas to be the only thing I enjoy doing in python.

Idk maybe coz im New to it but its just whole new world to me

void veldt Jul 31, 2023, 8:35 PM

#

I personally prefer how matlab organizes and treats data. Love it's syntax and usage with linear algebra

umbral charm Jul 31, 2023, 8:36 PM

#

tidal bough i mean, you *can* replicate pandas's capabilities in numpy, but with quite a lot...

I was told i could do this, but i did some research and found for a lot of data, pandas is a lot more efficient than numpy

twilit tundra Jul 31, 2023, 8:36 PM

#

Having worked almost exclusively with pandas for more than a year, I can say that it's very natural to me and it has so many capabilities

#

It's not made for linear algebra, just data analysis and transformation

fleet granite Jul 31, 2023, 8:37 PM

#

I am facing an error in my code. can anyone help me to remove this error?

umbral charm Jul 31, 2023, 8:38 PM

#

fleet granite I am facing an error in my code. can anyone help me to remove this error?

send it these guys r good

fleet granite Jul 31, 2023, 8:47 PM

#

I am facing error in line 144 of SHAP summary plot. https://paste.pythondiscord.com/CT5A

left tartan Jul 31, 2023, 8:48 PM

#

I like numpy for numpy stuff and sql for everything else: pandas angers me because it is less capable than both. (I’m conflating sql and dbs but you get the idea)

desert oar Jul 31, 2023, 8:54 PM

#

I literally can't ask for help with my code when smth doesn't work, cus it's such a mess
are you sure? if your code is messy but functional, then it should be fine

desert oar Jul 31, 2023, 8:54 PM

#

left tartan I like numpy for numpy stuff and sql for everything else: pandas angers me becau...

pandas is literally numpy internally, the things that you can do with a typical pandas dataframe is mostly a superset of what you can do with the underlying numpy array (including access the underlying array if needed). if the "data frame" concept isn't useful for you then you don't need to use it. but it has a long successful history among statisticians and other data analysts in python, as well as R for many years before pandas came out

desert oar Jul 31, 2023, 8:57 PM

#

umbral charm One quick question, Can i ask why to print only certain colums its ``` print(df...

df[thing] selects the "thing", whatever that thing is. if it's a string or a list of strings, it selects a column or multiple columns. hence you need df[[col1, col2]]. it's completely logical, don't be thrown off by people who hate pandas because they didn't bother to learn how it works.

#

pandas does have a few big design flaws however, e.g. you can write df[thing] for boolean masks as well

#

that is, sometimes df[thing] is df.loc[:, thing] and sometimes it's df.loc[thing, :] depending on what thing is. imo that's bad design, and df[thing] should be reserved for onlye one of those cases. personally i always use it for the former and flatly reject any code that uses it for the latter.

umbral charm Jul 31, 2023, 8:58 PM

#

desert oar `df[thing]` selects the "thing", whatever that thing is. if it's a string or a l...

Maybe ill learn to love pandas, but right now its a pain in the ass

desert oar Jul 31, 2023, 8:58 PM

#

the docs honestly aren't great. that's the worst part.

#

the reference docs are good, but the "howto" material is kind of chaotic.

#

it's somewhat easier if you already know data frames from R

umbral charm Jul 31, 2023, 8:59 PM

#

Aight well Corey Shcafer it is

desert oar Jul 31, 2023, 8:59 PM

#

who?

fresh harbor Jul 31, 2023, 8:59 PM

#

ort.InferenceSession.get_inputs() equivalent in cv2.dnn.Net?

umbral charm Jul 31, 2023, 8:59 PM

#

??

#

That youtuber who teaches python

#

Hes really good

desert oar Jul 31, 2023, 8:59 PM

#

i don't recommend learning programming from youtube in general

umbral charm Jul 31, 2023, 8:59 PM

#

Does matplotlib numpy django and pandas

desert oar Jul 31, 2023, 8:59 PM

#

he might be good, but my overall trust in "youtube programming educators" is very low

umbral charm Jul 31, 2023, 9:00 PM

#

Eh if you think that, i learnt all my file handling and matplotlib from him

desert oar Jul 31, 2023, 9:00 PM

#

in any case we have some very experienced pandas users here. feel free to ask or search stackoverflow

umbral charm Jul 31, 2023, 9:00 PM

#

my teachers hate stackover flow

desert oar Jul 31, 2023, 9:00 PM

#

why?

twilit tundra Jul 31, 2023, 9:00 PM

#

desert oar the reference docs are good, but the "howto" material is kind of chaotic.

I've started reading that part recently, it's indeed a mess

umbral charm Jul 31, 2023, 9:01 PM

#

Idk i think they think of it kind of like the wiki pedia for prgramming

#

its just blurts out the answer with no explanation

#

but i learnt it myself if i really have too

desert oar Jul 31, 2023, 9:01 PM

#

twilit tundra I've started reading that part recently, it's indeed a mess

imagine learning from that and that only, back in 2015. what's a little unnerving is that the core tutorial and howto material is largely unchanged since then. i once tried to start revising it but i felt a little out of touch with what the core devs wanted for it and gave up. i'd need to be in closer contact w/ someone on the core team to make good progress on it.

#

i spent a while on it though. i should have saved what i did.

#

i think there are some good books on using pandas

desert oar Jul 31, 2023, 9:02 PM

#

umbral charm its just blurts out the answer with no explanation

the good answers do provide an explanation

#

at least, i always try to provide an explanation with mine

umbral charm Jul 31, 2023, 9:02 PM

#

desert oar at least, i always try to provide an explanation with mine

I feel like you just like helping people

#

props to you

desert oar Jul 31, 2023, 9:03 PM

#

if i didn't have R background knowledge i don't know how i'd have learned pandas tbh

desert oar Jul 31, 2023, 9:03 PM

#

umbral charm I feel like you just like helping people

i do indeed. i wouldn't be here otherwise

umbral charm Jul 31, 2023, 9:03 PM

#

Well i probably should start using R, but no clue where to start

desert oar Jul 31, 2023, 9:03 PM

#

nah, don't spend time on it. learn one thing at a time

twilit tundra Jul 31, 2023, 9:03 PM

#

I didn't have any R experience but I could probably have been way more efficient

desert oar Jul 31, 2023, 9:04 PM

#

learning another programming language is low value among the many other things out there to be learned

left tartan Jul 31, 2023, 9:04 PM

#

umbral charm Idk i think they think of it kind of like the wiki pedia for prgramming

For me, it's not SO (or GPT) that's the issue... its how people using it. Instead of asking: "What's the API for XYZ? What parameters does it take? How should I use it?", people jump straight to: "here's how you get the top 5 results from a dataframe".

#

And thus, they get caught in a vicious cycle of solution seeking, rather than understanding.

umbral charm Jul 31, 2023, 9:05 PM

#

left tartan For me, it's not SO (or GPT) that's the issue... its how people using it. Instea...

But the thing is, in my experience, thats exactly how we are taught 'this is how you get XYZ' 'This is how you plot'

twilit tundra Jul 31, 2023, 9:05 PM

#

left tartan And thus, they get caught in a vicious cycle of solution seeking, rather than un...

It's even worse with copilot

left tartan Jul 31, 2023, 9:06 PM

#

twilit tundra It's even worse with copilot

Oh, that's interesting.. I haven't tried, but I can totally imagine.

void veldt Jul 31, 2023, 9:06 PM

#

umbral charm Well i probably should start using R, but no clue where to start

the only major advantage I've found R to have is it has a lot of pre-set libraries for very common data analysis for various scientific fields. Otherwise from my experience u can use both interchangeablt

umbral charm Jul 31, 2023, 9:06 PM

#

twilit tundra It's even worse with copilot

I REALLY WANT CO PILOT

#

i emailed Github but they havent responded :(

twilit tundra Jul 31, 2023, 9:06 PM

#

You don't even need to write a proper google search or prompt

#

It just fills your code

umbral charm Jul 31, 2023, 9:06 PM

#

Yea

twilit tundra Jul 31, 2023, 9:07 PM

#

Can't you just buy the subscription?

umbral charm Jul 31, 2023, 9:07 PM

#

All you have to do is give a function the actual name of the function and it fills the function

umbral charm Jul 31, 2023, 9:07 PM

#

twilit tundra Can't you just buy the subscription?

You can but im trying to go through my uni account coz i might be able to get it for free

twilit tundra Jul 31, 2023, 9:07 PM

#

Oh, right

umbral charm Jul 31, 2023, 9:08 PM

#

In my uni its only avaible for the Engineering students and COmp sci students or Stats students

#

im neither so i have to send a personal request

twilit tundra Jul 31, 2023, 9:08 PM

#

I thought they would give out licenses to every student, that's surprising

umbral charm Jul 31, 2023, 9:09 PM

#

Honestly with the amount of python on my course i expected it too

#

But GOOD NEWS I got pycharm pro

#

so thats good at least

twilit tundra Jul 31, 2023, 9:09 PM

#

Like I'm pretty sure the free azure credits is available for students regardless of major

umbral charm Jul 31, 2023, 9:10 PM

#

twilit tundra Like I'm pretty sure the free azure credits is available for students regardless...

Azure crdits?

#

is this for VS?

past meteor Jul 31, 2023, 9:10 PM

#

desert oar pandas is literally numpy internally, the things that you can do with a typical ...

it's not, Pandas has switched to Arrow internally

twilit tundra Jul 31, 2023, 9:10 PM

#

What does pycharm provide that VSCode doesn't?

past meteor Jul 31, 2023, 9:10 PM

#

Pandas' Syntax is godawful though. It's the only data frame library I've used that feels wrong and I've used several in multiple languages.

twilit tundra Jul 31, 2023, 9:10 PM

#

umbral charm Azure crdits?

For cloud services

umbral charm Jul 31, 2023, 9:11 PM

#

twilit tundra What does pycharm provide that VSCode doesn't?

I have no clue, i havent really used any IDE's apart from python IDLE and Pycharm

#

so i just grew up with pycharm

past meteor Jul 31, 2023, 9:11 PM

#

Polars is a better option, both performance and syntax wise, but it definitely doesn't tie in as well with the data ecosystem

umbral charm Jul 31, 2023, 9:11 PM

#

used it for all my codeing career so far

left tartan Jul 31, 2023, 9:11 PM

#

past meteor it's not, Pandas has switched to Arrow internally

No exactly, but pandas does support both numpy and arrow backends... but there are many workflows that require numpy... so I'm more curious what happens longer term with the broader community.

past meteor Jul 31, 2023, 9:12 PM

#

Pandas will remain king because of sunk cost

twilit tundra Jul 31, 2023, 9:12 PM

#

I didn't expect pandas to be such a controversial topic lol

past meteor Jul 31, 2023, 9:12 PM

#

But personally I'm not writing any new code in Pandas

#

Unless I really have to for compatability reasons

bronze flint Jul 31, 2023, 9:13 PM

#

Quick question, i had an issue now where my training on Google colab stopped after idling with T4 graphics card
Was training a massive CNN but at the point it wasn't using T4

Do you guys know what the cooldown is?

#

I am using free Google colab

void veldt Jul 31, 2023, 9:17 PM

#

twilit tundra I didn't expect pandas to be such a controversial topic lol

eh it's really preference. I prefer to reinvent the wheel half the time and try to do everything in python so there are zero dependency concerns (ugly bloated code, but I like it). There r tons of libraries that all do the same thing, all just comes down to preference

left tartan Jul 31, 2023, 9:17 PM

#

past meteor But personally I'm not writing *any* new code in Pandas

Lately, I've just been gutting pandas code and replacing with duckdb sql, it's been much cleaner and more flexible. but, I love sql.

twilit tundra Jul 31, 2023, 9:24 PM

#

past meteor Polars is a better option, both performance and syntax wise, but it definitely d...

Polars does look cool, I might try it out for personal projects

desert oar Jul 31, 2023, 9:24 PM

#

past meteor it's not, Pandas has switched to Arrow internally

not completely. it's still provisional and numpy backend will not be removed

desert oar Jul 31, 2023, 9:25 PM

#

twilit tundra Polars does look cool, I might try it out for personal projects

more verbose than pandas but worth learning. main downside right now is lack of indexes and devs hostility to them

twilit tundra Jul 31, 2023, 9:25 PM

#

Polar bears are more deadly than pandas

worn stratus Jul 31, 2023, 9:26 PM

#

desert oar more verbose than pandas but worth learning. main downside right now is lack of ...

what disadvantages does this come with? just the lack of natural joins?

void veldt Jul 31, 2023, 9:26 PM

#

twilit tundra Polar bears are more deadly than pandas

this has convinced me to use polars

desert oar Jul 31, 2023, 9:26 PM

#

worn stratus what disadvantages does this come with? just the lack of natural joins?

filtering on things other than the sorting key

worn stratus Jul 31, 2023, 9:27 PM

#

desert oar filtering on things other than the sorting key

I don't really understand - filtering is much nicer in polars

past meteor Jul 31, 2023, 9:28 PM

#

desert oar more verbose than pandas but worth learning. main downside right now is lack of ...

Lack of indexes is amazing

#

They're horrible

worn stratus Jul 31, 2023, 9:28 PM

#

desert oar filtering on things other than the sorting key

do you have example of what you mean?

twilit tundra Jul 31, 2023, 9:29 PM

#

Is there something equivalent to query in polars?

bronze flint Jul 31, 2023, 9:30 PM

#

bronze flint Quick question, i had an issue now where my training on Google colab stopped aft...

Anyone maybe has experience with this? 😅

worn stratus Jul 31, 2023, 9:30 PM

#

twilit tundra Is there something equivalent to query in polars?

you can get an SQL context, or you can just chain filters/selects. you don't really need it

past meteor Jul 31, 2023, 9:31 PM

#

I've also used spark and Dataframes in R and MATLAB

#

Only pandas is "different"

twilit tundra Jul 31, 2023, 9:34 PM

#

worn stratus you can get an SQL context, or you can just chain filters/selects. you don't rea...

Makes sense

#

I've been wondering: does anyone know the technical reason xlsx files are so slow to load on python/pandas?

left tartan Jul 31, 2023, 9:56 PM

#

twilit tundra I've been wondering: does anyone know the technical reason xlsx files are so slo...

Don’t know your specific issue and never benchmarked this in pandas, as they’re acceptable for me, but: xlsx files as zips of xml files, which tend to be large and slow for very large datasets. Plus the side caching of shared strings leads to a lot of lookups.

civic elm Jul 31, 2023, 10:01 PM

#

is RNN a viable model for anything? or Transformers have been the norm?

#

Am I wasting time and effort in deep diving RNNs?

twilit tundra Jul 31, 2023, 10:14 PM

#

afaik, RNNs are still relevant for time series data

#

But other than that, it's mostly still a hot topic for research because there is a lot of untapped potential

tepid tartan Jul 31, 2023, 11:11 PM

#

@civic elm should I do khan linear and stats or do Coursera on the math 🤔

desert oar Aug 1, 2023, 12:01 AM

#

tepid tartan <@500487685153226752> should I do khan linear and stats or do Coursera on the ma...

hard to beat MIT 18.06 for an intro to linear algebra

desert oar Aug 1, 2023, 12:02 AM

#

twilit tundra I've been wondering: does anyone know the technical reason xlsx files are so slo...

because the xlsx format is relatively slow to parse. if you used the underlying library (openpyxl) and did it yourself with for loops, it'd be just as slow or slower.

fresh harbor Aug 1, 2023, 12:19 AM

#

Do model inferences get slowed down when you do A, B, C, D in a cycle? Do I instead use multiprocessing and queues to speed this up, or will it just cause resource starvation?

tepid tartan Aug 1, 2023, 12:26 AM

#

desert oar hard to beat MIT 18.06 for an intro to linear algebra

MIT that good for linear algebra pithink

desert oar Aug 1, 2023, 12:26 AM

#

tepid tartan MIT that good for linear algebra <:pithink:652247559909277706>

dr strang is a legend

desert oar Aug 1, 2023, 12:26 AM

#

fresh harbor Do model inferences get slowed down when you do A, B, C, D in a cycle? Do I inst...

what do you mean by "A, B, C, D"?

fresh harbor Aug 1, 2023, 12:26 AM

#

Model inferences

#

A is face detector
B is face recognition
C is magik
D is more magik

desert oar Aug 1, 2023, 12:30 AM

#

fresh harbor A is face detector B is face recognition C is magik D is more magik

unless all the models are being trained simultaneously like in a GAN, i would not try to train them all simultaneously

#

that seems like a giant waste of effort to get it working right, and it doesn't seem like you actually get anything out of it. if anything it's worse because you can't adjust the training processes individually

fresh harbor Aug 1, 2023, 12:31 AM

#

Not train, its for inference only

#

Wouldn't be too much work probably https://stackoverflow.com/a/45231035

Stack Overflow

How to set up a pipeline using Queues in multiprocessing

Here's my code, it's supposed to do something very similar to what this other question is trying to do, in particular this diagram is relavent:
with f1 = produce, f2 = f3 = worker, f4 = consumer.

I

#

@desert oar btw i wanted to ask whether yunet would be a decent enough replacement for retinaface.

#

Its built right into opencv so no extra deps

desert oar Aug 1, 2023, 12:48 AM

#

fresh harbor <@389497659087650836> btw i wanted to ask whether yunet would be a decent enough...

i wouldn't know, i have no practical experience with facial recognition in particular

serene scaffold Aug 1, 2023, 12:49 AM

#

desert oar i wouldn't know, i have no practical experience with facial recognition in parti...

is that because you're a lump of rock with a light bulb inside you?

desert oar Aug 1, 2023, 12:49 AM

#

fresh harbor Not train, its for inference only

i see. does B input require A output? or are they all completely independent?

desert oar Aug 1, 2023, 12:49 AM

#

serene scaffold is that because you're a lump of rock with a light bulb inside you?

yes, the entire concept of a "face" as you humans understand it is alien to my species, and i still have family members who don't really get it

#

so i try to stay away from those problems at work. no good intuition for it

serene scaffold Aug 1, 2023, 12:50 AM

#

Fascinating

fresh harbor Aug 1, 2023, 12:51 AM

#

desert oar i see. does B input require A output? or are they all completely independent?

Yes

desert oar Aug 1, 2023, 12:52 AM

#

fresh harbor Yes

🤔

fresh harbor Aug 1, 2023, 12:52 AM

#

Its a pipeline

desert oar Aug 1, 2023, 12:52 AM

#

so you need the output from each step as the input to the next step?

fresh harbor Aug 1, 2023, 12:52 AM

#

The question is whether I should operate it concurrently, in batches or serially right now?

fresh harbor Aug 1, 2023, 12:52 AM

#

desert oar so you need the output from each step as the input to the next step?

Yea

desert oar Aug 1, 2023, 12:53 AM

#

i see, you are asking about running the pipeline on multiple images

#

that's actually a very good question

fresh harbor Aug 1, 2023, 12:53 AM

#

Its actually a video

#

But video no good for my model

#

Need to break it down to frames

desert oar Aug 1, 2023, 12:54 AM

#

i see, is the model pipeline sequential across frames? or can you batch/chunk/re-order arbitrarily and it won't change the results?

fresh harbor Aug 1, 2023, 12:54 AM

#

Uhh i don't think it depends on previous results, if that's what u mean

desert oar Aug 1, 2023, 12:56 AM

#

okay. instinctively i would imagine that if i need to analyze something in a video i would want some window of past frames as input to my current inference. that would limit how you can run this pipeline

fresh harbor Aug 1, 2023, 12:56 AM

#

Fyi its YUNet > SFace > INSwapper > (some ONNX upscaler that opencv can run, not decided)

desert oar Aug 1, 2023, 12:56 AM

#

i think in general the answer to this question is why ML engineers get paid the big bucks. but i think the short answer is that it depends on what hardware you have available and how the models are implemented.

fresh harbor Aug 1, 2023, 12:57 AM

#

desert oar okay. instinctively i would imagine that if i need to analyze something in a vid...

its not like that, but i dunno if the model internally does some stuff like that

desert oar Aug 1, 2023, 12:57 AM

#

if the underlying implementation is already multithreaded/parallel, you can probably do ok by running it serially. you wouldn't want to combine that with parallelism in your application because everything will get gunked up

fresh harbor Aug 1, 2023, 12:57 AM

#

I am super new (3 days old) to this stuff

desert oar Aug 1, 2023, 12:58 AM

#

fresh harbor its not like that, but i dunno if the model internally does some stuff like that

you would know if it did, i think. it sounds like you're planning to use pre-trained off the shelf models to analyze each video frame as a separate image, so it's probably not a concern

fresh harbor Aug 1, 2023, 12:59 AM

#

Yea

desert oar Aug 1, 2023, 12:59 AM

#

you might want to check the opencv docs to see if it says anything about threading or parallelism

#

it's very easy to run into situations where paralyzation actually slows things down because your program is spending too much time sending data between processes

fresh harbor Aug 1, 2023, 1:00 AM

#

The bigger concern here is that I don't run into resource starvation

#

Most likely RAM / VRAM

#

Serial pipeline won't cause it

desert oar Aug 1, 2023, 1:00 AM

#

so if opencv has a way to run each inference in threads or processes, i recommend starting there and benchmarking

fresh harbor Aug 1, 2023, 1:01 AM

#

But if I parallelize it, what little control do I have over how much RAM the model chooses to eat?

desert oar Aug 1, 2023, 1:01 AM

#

eg numpy includes openmpi support

desert oar Aug 1, 2023, 1:01 AM

#

fresh harbor But if I parallelize it, what little control do I have over how much RAM the mod...

probably just inference batch size

#

i would start by just running everything serially and profiling + looking into threads/processes within opencv

fresh harbor Aug 1, 2023, 1:02 AM

#

Yea I'd need to run the serial pipeline with multiprocessing + queue anyways because it shouldn't block the gui

#

which makes me wonder, how does gradio achieve non blocking UI when everything is happening in the main thread?

desert oar Aug 1, 2023, 1:06 AM

#

fresh harbor which makes me wonder, how does gradio achieve non blocking UI when everything i...

are you sure everything is happening in the main thread?

fresh harbor Aug 1, 2023, 1:07 AM

#

desert oar are you sure everything is happening in the main thread?

That's what I have seen in many gradio apps

#

Or maybe they do use something after all?

#

I have definitely not seen any code using queues

empty furnace Aug 1, 2023, 1:29 AM

#

is plotly optimal for large datasets?

#

idk if consumes much memory just to make a chart

left tartan Aug 1, 2023, 1:40 AM

#

empty furnace is plotly optimal for large datasets?

optimal? I dunno if anything is optimal, but I do some large stuff with it.. but for complex diagrams, make sure to statically render rather than interactive.

#

matplotlib is still the baseline everything is compared against. Generally, for large datasets, the first step should be reducing the complexity of the plot, whether from quantizing/sampling/aggregation/smoothing/whatever

simple mirage Aug 1, 2023, 4:53 AM

#

How do u mathematically determine whether a distribution is skewed or not

#

I’ve tried plotting the histogram, and it looks skewed but I heard that median is better than mean for skewed graphs and so I tried comparing the mean and median results and the median result actually gave a result that leaned more towards the skew

#

Does that mean my data isnt actually skewed?

cold osprey Aug 1, 2023, 5:01 AM

#

skew and kurtosis

simple mirage Aug 1, 2023, 5:26 AM

#

Oh I get it now. Imputing with median is not meant to address the skew. It’s just suppose to make the distribution more robust

ashen axle Aug 1, 2023, 6:20 AM

#

Does anyone know if there is a straight-forward method of adding hover-text to a seaborn generated line plot? The plot is busy enough that a legend is not useful.

I've experimented with all of the major plotting libraries, and Id rather stick with the seaborn/matplotlib ecosystem if possible

rare quest Aug 1, 2023, 6:46 AM

#

Hi, this may be the wrong channel but, generating an image with pytorch takes a very long time, with CUDA enabled:

with autocast("cuda"):
    image = model("An image of a hand with a ball of ice levitating above it.")

This takes about 4 minutes with my RTX 2070S

#

Is there something I need to enable in windows11?

supple plover Aug 1, 2023, 8:05 AM

#

hello everyone, I'm looking to try and make an app for cameras in vehicles so that it can immediately detect and count the amount of passengers inside. Are there known examples of this that I can study from? I'm very new to AI/ML, I only managed to make a custom YOLOv4 model to do palm oil fruit classification deployed in an android app (I put the model as a .tflite inside the app itself) recently. I'm thinking can I use YOLO models for this passenger detection & counting? And if I want to have the AI model to be in a web API/cloud to be consumed through a website, are there examples on how to do that?

desert oar Aug 1, 2023, 8:33 AM

#

supple plover hello everyone, I'm looking to try and make an app for cameras in vehicles so th...

it's very typical to deploy a model in the cloud like you describe. often you can just do it with a basic web framework like fastapi or flask. but there are also platforms that can do it for you

supple plover Aug 1, 2023, 8:35 AM

#

desert oar it's very typical to deploy a model in the cloud like you describe. often you ca...

can you mention some of those platforms?

#

and another question. How's everyone's opinions on ML.NET?

desert oar Aug 1, 2023, 8:36 AM

#

sagemaker can do it for example. or mlflow

past meteor Aug 1, 2023, 8:46 AM

#

MLflow gives you many features (and complexity!) that you may (or may not) need

#

For the easiest case I'd start out with a simple container running your model with sanic / fastAPI, maybe CI/CD to easily update the model

supple plover Aug 1, 2023, 8:49 AM

#

I'll try that. thanks

supple plover Aug 1, 2023, 9:07 AM

#

since there's a lot of YOLO models now, which one is the best one for detecting and counting just one type of object?

heavy bay Aug 1, 2023, 10:32 AM

#

I made a simple neural network to predict y = 2x + 1
but the output is off by 0.002 lemon_thinking
what is the reason for this?

lapis sequoia Aug 1, 2023, 10:36 AM

#

heavy bay I made a simple neural network to predict `y = 2x + 1` but the output is off by ...

the dataset contains no noise? ie: every entry in the dataset is exactly y = 2x+1 or there is some error value?

lapis sequoia Aug 1, 2023, 10:37 AM

#

supple plover since there's a lot of YOLO models now, which one is the best one for detecting ...

since its easier then having multiple objects etc what I did is that I took the yolo8n and changed the head for something simpler

fleet granite Aug 1, 2023, 10:39 AM

#

Hi everyone, I am facing this error in the code line 144 "IndexError: index 2 is out of bounds for axis 1 with size 1" the code is https://paste.pythondiscord.com/CT5A

heavy bay Aug 1, 2023, 10:40 AM

#

lapis sequoia the dataset contains no noise? ie: every entry in the dataset is exactly y = 2x+...

there's no noise in the dataset

def gen_data(start, stop):
    x = np.array([])
    y = np.array([])
    for i in range(start, stop):
        x = np.append(x, i)
        y = np.append(y, (i*2)+1)

    return x, y


X, y = gen_data(1, 100)```

lapis sequoia Aug 1, 2023, 10:40 AM

#

heavy bay there's no noise in the dataset ```py def gen_data(start, stop): x = np.arra...

and what is the architecture of the model?

supple plover Aug 1, 2023, 10:41 AM

#

lapis sequoia since its easier then having multiple objects etc what I did is that I took the ...

I'm pretty new to AI/ML, so how did you change the head? If you don't mind explaining

serene scaffold Aug 1, 2023, 10:41 AM

#

heavy bay there's no noise in the dataset ```py def gen_data(start, stop): x = np.arra...

if you didn't already know, the way you've written this code is incredibly inefficient

lapis sequoia Aug 1, 2023, 10:42 AM

#

supple plover I'm pretty new to AI/ML, so how did you change the head? If you don't mind expla...

I pulled the yolo repo, and wrote in pytorch a new model using the yolo backbone and the new head. but if you need yolo you can directly used in my case I had some issues with inference time and overheating so it was needed to have a lighter model.

heavy bay Aug 1, 2023, 10:43 AM

#

lapis sequoia and what is the architecture of the model?

model = tf.keras.Sequential([
    keras.layers.Dense(units=1, input_shape=[1]),
    keras.layers.Dense(units=5),
    keras.layers.Dense(units=10),
    keras.layers.Dense(units=5),
    keras.layers.Dense(units=1),
])```
I didn't really think much about it, just messing around

heavy bay Aug 1, 2023, 10:43 AM

#

serene scaffold if you didn't already know, the way you've written this code is incredibly ineff...

lemon_sweat

#

i just realized that np.append returns a copy

supple plover Aug 1, 2023, 10:44 AM

#

lapis sequoia I pulled the yolo repo, and wrote in pytorch a new model using the yolo backbone...

right I really have to learn about how to improve inference time too, lots of homework. Thank you

lapis sequoia Aug 1, 2023, 10:45 AM

#

supple plover right I really have to learn about how to improve inference time too, lots of ho...

i'm working on yolo for an edge device so that's important for me

lapis sequoia Aug 1, 2023, 10:45 AM

#

heavy bay <:lemon_sweat:754441881718620281>

can you share the whole code? i'm not seeing where this is going wrong

heavy bay Aug 1, 2023, 10:45 AM

#

heavy bay i just realized that np.append returns a copy

maybe i should use a regular python array and case it to a numpy array when returning?

serene scaffold Aug 1, 2023, 10:47 AM

#

heavy bay maybe i should use a regular python array and case it to a numpy array when retu...

a list is not a "python array"--it's a list.

you could also do this

x = np.arange(start, stop)
y = (x * 2) + 1

heavy bay Aug 1, 2023, 10:47 AM

#

lapis sequoia can you share the whole code? i'm not seeing where this is going wrong

https://paste.pythondiscord.com/TTNQ

heavy bay Aug 1, 2023, 10:48 AM

#

serene scaffold a list is not a "python array"--it's a list. you could also do this ```py x = n...

oh ok thanks

slim bone Aug 1, 2023, 12:27 PM

#

So lets say I've made some basic neural network - and now I wish to download the model. Am I essentially downloading the (now adjusted) weights and biases of* the currently trained model?

potent sky Aug 1, 2023, 12:38 PM

#

download it? where from?

serene scaffold Aug 1, 2023, 12:39 PM

#

slim bone So lets say I've made some basic neural network - and now I wish to download the...

download, or just save?

slim bone Aug 1, 2023, 12:39 PM

#

serene scaffold download, or just save?

I think save is a more appropriate word

#

"The thing that will enable me to continue training it later" maybe?

serene scaffold Aug 1, 2023, 12:40 PM

#

if you save the model, the information that gets written to your hard drive is some representation of the weights and biases, yes.

potent sky Aug 1, 2023, 12:42 PM

#

depending on your requirement you may also choose to save the state of the optimizer (to pick up where you left off) as well as model configuration

hasty mountain Aug 1, 2023, 12:42 PM

#

I think I've never seen a "downloadable model", only the weights and biases. Then you have to rebuild its architecture in your code pithink

potent sky Aug 1, 2023, 12:43 PM

#

if you save the model configuration as well then you don't have to, for example .h5 models

#

saving just a state dict for the parameters only is quite useful tho, so it's popular

slim bone Aug 1, 2023, 1:00 PM

#

serene scaffold if you save the model, the information that gets written to your hard drive is s...

And I could simply feed the machine those saved weights and biases, instead of the initial, random weights and biases?

slim bone Aug 1, 2023, 1:00 PM

#

potent sky depending on your requirement you may also choose to save the state of the optim...

Ah, I haven't gotten much into optimizers yet. So I'm not sure what this means

#

I'm just trying to figure out how I should construct my program - I'm just building a simple neural network with NumPy as a starting project

fallow frost Aug 1, 2023, 2:54 PM

#

is there a way I can avoid doing a full scan on a pandas dataframe when filtering?
I want to get the first 5k filtered rows (when there are 500k), I dont want pandas to keep filtering the dataframe once it found 5k rows, is there a way to do this? and are there any alternatives?

boreal gale Aug 1, 2023, 3:05 PM

#

fallow frost is there a way I can avoid doing a full scan on a pandas dataframe when filterin...

need more context before i can comment on this.

fallow frost Aug 1, 2023, 3:15 PM

#

boreal gale need more context before i can comment on this.

what kind of context you want to know?

boreal gale Aug 1, 2023, 3:17 PM

#

what is the data that you are filtering on?
what kind of "filter" is it?
what is your data's cardinality?
why first 5k?
how often do you need this?
does data change?
what performance do you have now and what do you expect?

fallow frost Aug 1, 2023, 3:20 PM

#

hmmm thats alot of questions, my question is fairly simple, can I do this:

from more_itertools import take

big_data: Iterable[...]
filterd_generator = (x for x in big_data if predicate(x))
print(take(5000, filterd_generator))

instead of:

filterd_list = [x for x in big_data if predicate(x)]
print(take(5000, filterd_list))

#

you see the difference?

#

pandas does the latter.

#

I want to do the former

boreal gale Aug 1, 2023, 3:26 PM

#

hmmm thats alot of questions, my question is fairly simple, can I do this:
i don't ask question just for the sake of asking question, it's all for the ultimate goal of helping you.
if you simply require an answer to that, then no, you can't do that as far as i know.

left tartan Aug 1, 2023, 3:28 PM

#

fallow frost I want to do the former

I don't know of a way to do that in pandas (but doesn't mean it cant be done) without some sort of iteration, and iteration is generally an antipattern with pandas.

fallow frost Aug 1, 2023, 3:28 PM

#

if you simply require an answer to that, then no, you can't do that as far as i know.
thanks, thats what I wanted to know 🙂

left tartan Aug 1, 2023, 3:28 PM

#

You could map, for instance, accumulate and perhaps throw an exception when bucket is "full"

#

Or, perhaps do the filtered list over smaller windows of the data

fallow frost Aug 1, 2023, 3:30 PM

#

left tartan Or, perhaps do the filtered list over smaller windows of the data

and what if the sliced window dosent have enough to fulfill the 5k threshold

#

(which is just an example, but it could be any number)

left tartan Aug 1, 2023, 3:30 PM

#

you'd just keep moving the window until you fill.

#

ie: check first 1million, then next 1 mill, etc

fallow frost Aug 1, 2023, 3:30 PM

#

I see

#

I'm thinking of using Polars with their lazy API theyre supposed to have this functionality

#

but I dont want to add that dependency to my project

left tartan Aug 1, 2023, 3:31 PM

#

but, what kind of condition do you have where this is important? Like, df['col'] == something is not an expensive operation, and df[condition].head(5000) is only returning the first 5000 rows

#

(to be specific: I don't know where you could do df['col'] == something but only return the first 5000 indices that match the condition in a single operation)

fallow frost Aug 1, 2023, 3:33 PM

#

left tartan but, what kind of condition do you have where this is important? Like, df['col']...

col.isin(list_of_vals) for 3 columns, and a very expensive: df.col.apply(lambda lst: my_set.intersection(lst))

#

the latter is for a filter, not creating a new column

left tartan Aug 1, 2023, 3:36 PM

#

fallow frost `col.isin(list_of_vals)` for 3 columns, and a very expensive: `df.col.apply(lamb...

for what it's worth, what I do is: ```py
import duckdb
filtered_df = duckdb.execute("select * from df where col in (?) limit 5000", [list_of_vals])

#

but, I'm a sql guy.

fallow frost Aug 1, 2023, 3:37 PM

#

duckdb has got to have a query optimizer

left tartan Aug 1, 2023, 3:37 PM

#

fallow frost duckdb has got to have a query optimizer

indeed.

fallow frost Aug 1, 2023, 3:37 PM

#

which wont keep scanning after it found 5k rows

#

I will try that

left tartan Aug 1, 2023, 3:37 PM

#

tag me if you have any query questions, this is my jam.

fallow frost Aug 1, 2023, 3:38 PM

#

I do actually

fallow frost Aug 1, 2023, 3:39 PM

#

left tartan for what it's worth, what *I* do is: ```py import duckdb filtered_df = duckdb.ex...

if a column is a list of integers, can you actually check that each row is a subset of another list of strings?

boreal gale Aug 1, 2023, 3:39 PM

#

fallow frost `col.isin(list_of_vals)` for 3 columns, and a very expensive: `df.col.apply(lamb...

this is the sort of things that i wanted out of you, because without this it's pretty hard to point out alternative ways for achieving the same thing in a hopefully more efficient way.

left tartan Aug 1, 2023, 3:40 PM

#

fallow frost if a column is a list of integers, can you actually check that each row is a sub...

Can you give an example? I don't follow.

fallow frost Aug 1, 2023, 3:50 PM

#

left tartan Can you give an example? I don't follow.

yeah its a bit confusing:

df = pd.DataFrame({'col': [(1, 2, 3), (1, 2), (1, 2, 3), (1, 2, 3, 4), (1, 2, 3, 5)]})
# I want to find all the rows that contain: 1, 2, 3, and 4.
to_keep = {1, 2, 3, 4}

>>> df['col'].apply(lambda x: to_keep.issubset(x))
Out[150]: 
0    False
1    False
2    False
3     True
4    False
Name: col, dtype: bool

#

mask = df['col'].apply(lambda x: to_keep.issubset(x))
df = df[mask]

>>> df
Out[154]: 
            col
3  (1, 2, 3, 4)

left tartan Aug 1, 2023, 3:56 PM

#

This is my preferred approach (given your statement that pandas is too slow for your filter/limit): py import duckdb import pandas as pd df = pd.DataFrame({'col': [(1, 2, 3), (1, 2), (1, 2, 3), (1, 2, 3, 4), (1, 2, 3, 5)]}) duckdb.execute("select * from df where col = ? limit 1000", [(1,2,3,4)]).df() if looking for set intersection, need to get a little cleverer (function added to make this simpler): py CREATE OR REPLACE FUNCTION "@>"(haystack, needle) AS (select c == len(needle) from (select count(*) c from (SELECT UNNEST(haystack) INTERSECT SELECT UNNEST(needle)))) ; select col, col @> [1,2,5] b from df where b = True

desert oar Aug 1, 2023, 5:06 PM

#

fallow frost pandas does the latter.

note that pandas does the latter because it doesn't have a lazy query engine or a query optimizer. note also that you sometimes need/want to just use plain python instead of doing everything inside pandas. python for loops can be reasonably fast if you build them carefully.

#

i am gathering that you have an array/list-valued column, and you want to find the first row where the array/list contains some certain values?

#

the right solution definitely depends on how much data you have, memory vs. cpu constraints, etc. but that duckdb unnest operation above looks very elegant

#

you could also consider re-encoding your data as an integer bitfield and using binary &. that's a good leetcode trick for lookups on fixed-size sets

fallow frost Aug 1, 2023, 5:44 PM

#

left tartan This is *my* preferred approach (given your statement that pandas is too slow fo...

to be honest I dont understand how it works, but it looks interesting, I will test it tomorrow

fallow frost Aug 1, 2023, 5:45 PM

#

left tartan This is *my* preferred approach (given your statement that pandas is too slow fo...

just out of curiosity, in the line:

select col, col @> [1,2,5] b from df
what do the numbers 1, 2, and 5, reppresent?

left tartan Aug 1, 2023, 5:45 PM

#

fallow frost to be honest I dont understand how it works, but it looks interesting, I will te...

I like slr's bitfield suggestion, that's probably going to be the optimal approach tbh

left tartan Aug 1, 2023, 5:45 PM

#

fallow frost just out of curiosity, in the line: > select col, col @> [1,2,5] b from df what ...

[1,2,5] is your to_keep list

fallow frost Aug 1, 2023, 5:46 PM

#

left tartan [1,2,5] is your to_keep list

oh so I can change that. and I can add any number of items in the array, correct?

left tartan Aug 1, 2023, 5:46 PM

#

yup

fallow frost Aug 1, 2023, 5:47 PM

#

got it thanks

fallow frost Aug 1, 2023, 5:49 PM

#

desert oar note that pandas does the latter because it doesn't have a lazy query engine or ...

as I learn more, I realize that pandas is very ineficcient for filtering, and thats why I think a simple Python generator-expression might faster, altough I havent tried it yet

fallow frost Aug 1, 2023, 5:50 PM

#

desert oar i am gathering that you have an array/list-valued column, and you want to find t...

not just the first row, but the first N rows

fallow frost Aug 1, 2023, 5:51 PM

#

desert oar you could also consider re-encoding your data as an integer bitfield and using b...

I'm not familiar with whathever you mentioned, you would ve able to link some example?
the only optimzation that I did was dictionary encoding, so instead of having sets of strings, I know have sets of integers which are lighter

agile cobalt Aug 1, 2023, 5:53 PM

#

sounds like the issue is more about your data modelling than pandas itself?

you should not store tuples, lists and other arbitrary objects in pandas dataframes
you should avoid using apply as much as possible

#

seriously, don't go around complaining about pandas performance if you're using apply(). That alone kills any benefits you might hope to get from using pandas.

fallow frost Aug 1, 2023, 5:56 PM

#

if I only had a cent for each time somebody said that, I would be very wealthy

#

but seriously, whats the alternative?

agile cobalt Aug 1, 2023, 5:56 PM

#

and about the previous thing you mentioned about not doing a full lookup: Yeah, pandas is not the right tool for that job

agile cobalt Aug 1, 2023, 5:57 PM

#

desert oar you could also consider re-encoding your data as an integer bitfield and using b...

^

fallow frost Aug 1, 2023, 5:57 PM

#

agile cobalt and about the previous thing you mentioned about not doing a full lookup: Yeah, ...

ehm coughs in pandas performance

agile cobalt Aug 1, 2023, 5:58 PM

#

pandas is good for medium sized datasets
if it's large enough to justify stopping early on the example case you gave, you might as well use an actual database instead

left tartan Aug 1, 2023, 6:10 PM

#

fallow frost I'm not familiar with whathever you mentioned, you would ve able to link some ex...

Imagine you used a bit mask to represent your tuple: where each bit represents a unique member of the tuple. So, 0001 means (1) and 0101 means (1,3), got that part?

fallow frost Aug 1, 2023, 6:10 PM

#

agile cobalt pandas is good for medium sized datasets if it's large enough to justify stoppin...

pandas is good for ~~medium~~ small sized datasets

fallow frost Aug 1, 2023, 6:11 PM

#

left tartan Imagine you used a bit mask to represent your tuple: where each bit represents a...

ok?

left tartan Aug 1, 2023, 6:12 PM

#

Then, array & mask == mask means that all mask entries are in the input array

#

And this is a very efficient vectorizable operation

fallow frost Aug 1, 2023, 6:16 PM

#

@left tartan I'm not following you at all, can you link some tutorial or something that I can read up on how this stuff works?

left tartan Aug 1, 2023, 6:18 PM

#

fallow frost <@738234281146712084> I'm not following you at all, can you link some tutorial o...

I don’t know off the top of my head any tutorials, just a quick google shows this one perhaps https://towardsdatascience.com/understanding-bitmask-for-the-coding-interview-b1643f4b0e24

Medium

Understanding Bitmask for the Coding Interview

A practical guide to a challenging topic

#

(Altho I hate that site for requiring account)

twilit tundra Aug 1, 2023, 6:34 PM

#

I miss the days bypass paywall actually worked on that site

umbral charm Aug 1, 2023, 6:46 PM

#

in Pandas, if i got a column with numbers, 0's and with NaN's, if i wanna replace all the numbers withs 1's and leave the 0's as 0, but leave the NaN's as NaN's, Can i just replace it like df['Boo'] = [1 if df.loc[i, 'Boo'] > 0 else 0 for i in df.index]

#

or would that also changes the NaN's

whole rock Aug 1, 2023, 6:52 PM

#

hello

#

i need assitance asap

#

anyone experienced with hacking please shoot me a dm

desert oar Aug 1, 2023, 6:56 PM

#

agile cobalt sounds like the issue is more about your data modelling than pandas itself? - yo...

i think it's reasonable to have array-valued data in general. pandas however does not have any optimization for it

#

storing this data as sets might also help

#

df = pd.DataFrame({'things': [{'a', 'b'}, {'b', 'c', 'e'}]})

important_things = {'q', 'c'}
df['has_important_things'] = df['things'].map(lambda s: bool(s & important_things))

#

there are a couple of things going on here. yes, pandas has no support for "partial" or "lazy" filtering, and yes i suspect that a plain python loop might be faster (which you can implement in a single pass).

desert oar Aug 1, 2023, 6:58 PM

#

umbral charm in Pandas, if i got a column with numbers, 0's and with NaN's, if i wanna replac...

no, if and else are not "vectorized" over pandas series

#

@umbral charm

boo_is_notna_notzero = df['Boo'].notna & (df['Boo'] != 0)
df.loc[boo_is_notna_notzero, 'Boo'] = 1

umbral charm Aug 1, 2023, 6:59 PM

#

desert oar no, `if` and `else` are not "vectorized" over pandas series

Yea tried it and failed miserably

#

But i found this solution df['Boo'] = pd.notna(TSLA['Boo']).astype(int)

#

doesnt change the NaN's

#

took me a good 4 mins to realise not to use the & symbol but and instead

desert oar Aug 1, 2023, 7:06 PM

#

umbral charm But i found this solution df['Boo'] = pd.notna(TSLA['Boo']).astype(int)

did you see the code i posted directly above?

#

notna returns True if the value is not null, and False if null

#

so that will set both 0 and 1 to True, then astype(int) converts True to 1

umbral charm Aug 1, 2023, 7:08 PM

#

desert oar so that will set both 0 and 1 to True, then astype(int) converts True to 1

Thank you!

night kernel Aug 1, 2023, 7:47 PM

#

I want to put my own dataset into this recommendation system. anyone know how i can replace the classic movie lens dataset?https://github.com/microsoft/recommenders/blob/main/examples/02_model_collaborative_filtering/cornac_bpr_deep_dive.ipynb

twilit tundra Aug 1, 2023, 7:52 PM

#

night kernel I want to put my own dataset into this recommendation system. anyone know how i ...

Check the format of the input data and transform your own dataset to fit that format

umbral charm Aug 1, 2023, 8:18 PM

#

Suppose we have a dataframe, and a coloumn that has got True's and False's i want to know the index of all the True values, However if there more than 1 Trues togeather, i only want the index of the first one, how would i do this? Im so lost on iterating throught columns

left tartan Aug 1, 2023, 8:19 PM

#

umbral charm Suppose we have a dataframe, and a coloumn that has got True's and False's i wan...

Can you share an example of what you mean?

umbral charm Aug 1, 2023, 8:21 PM

#

left tartan Can you share an example of what you mean?

i have a column called 'Boo' in a dataframe called df, now this column Boo is full of True's and False's values (Boolean) but mostly Falses (90%). i want to find out at what index the True values occure, But if there is more than 1 True value togeather (so like True on index 95 and True on index 96) i only want to retrun the first True index (In this case 95)

twilit tundra Aug 1, 2023, 8:21 PM

#

Drop_duplicates(Boo)

#

🙂

night kernel Aug 1, 2023, 8:23 PM

#

twilit tundra Check the format of the input data and transform your own dataset to fit that fo...

i cant figure that out. can i dm?

twilit tundra Aug 1, 2023, 8:23 PM

#

My DMs are not open sorry

umbral charm Aug 1, 2023, 8:24 PM

#

twilit tundra Drop_duplicates(Boo)

Would this still leave The True values by itself alone?

#

or would it drop themm too

twilit tundra Aug 1, 2023, 8:24 PM

#

It would just leave 2 rows, one with true and one with false

#

And it keeps the first instance of both

#

Alternatively if you really just want that one true index, you can do df.query(boo==True).index[0]

left tartan Aug 1, 2023, 8:27 PM

#

I was going to suggest something like: a cumsum() of boo == False, then grouping and computing cumcount, and then keeping only those where cumcount()==1 (eliminating any runs)

umbral charm Aug 1, 2023, 8:30 PM

#

twilit tundra It would just leave 2 rows, one with true and one with false

I see, but i dont think its what im looking for

#

my dataframe only cosits of 1 column and 1 index

night kernel Aug 1, 2023, 8:31 PM

#

twilit tundra My DMs are not open sorry

i understand. it looks like the dataset is being imported from another folder. but this folder isnt part of my download. can i import another way?

#

from recommenders.datasets import movielens

umbral charm Aug 1, 2023, 8:31 PM

#

i want another column where the index in which there are True values are copied over, but if there are more than 1 True value tgoeather, i want it to be just in the first index they were seen togeahter

twilit tundra Aug 1, 2023, 8:32 PM

#

I'm not sure I understand, it would be something like None,... Until index 95 where it's equal to 95 and then 95 at index 96 because it's still true?

twilit tundra Aug 1, 2023, 8:33 PM

#

night kernel i understand. it looks like the dataset is being imported from another folder. b...

Have you installed the recommenders library?

#

It seems to be a dataset that is part of the library

umbral charm Aug 1, 2023, 8:33 PM

#

Example:

1 False False
2 True  True
3 False False
4 True True
5 True False
6 False False
7 True True
8 False False
9 True True
10 True False
11 True False
12 False False

night kernel Aug 1, 2023, 8:34 PM

#

twilit tundra It seems to be a dataset that is part of the library

no the dataset i want is not a part of that library

#

its not from the same project. this is the example from the project:

from recommenders.datasets import movielen

twilit tundra Aug 1, 2023, 8:35 PM

#

Referencing the notebook, it looks like the format is userid itemid rating

umbral charm Aug 1, 2023, 8:35 PM

#

You See how Its basically the same columns until there comes a consecutive True's in the first column, in which i only need the 2nd column to produe one True for the start of that consecutive run

twilit tundra Aug 1, 2023, 8:35 PM

#

Oh gotcha

#

Probably something with a rolling window of size 2

umbral charm Aug 1, 2023, 8:37 PM

#

That could work

left tartan Aug 1, 2023, 8:38 PM

#

So my solution is; calculate cumcount() over False. Then cumcount over Trues for each group from step 1. Then eliminate any count > 1

umbral charm Aug 1, 2023, 8:38 PM

#

twilit tundra Probably something with a rolling window of size 2

but wouldnt that just make like
True True
True False
True True
True False

#

if there were 4 togeahter

#

Idk what my max True's are consecutively

twilit tundra Aug 1, 2023, 8:38 PM

#

True true = false
True false = false
False true = true
False false = false

#

Not A and B

left tartan Aug 1, 2023, 8:39 PM

#

Oh easier; just drop where lag() == True and Val == true

umbral charm Aug 1, 2023, 8:39 PM

#

left tartan Oh easier; just drop where lag() == True and Val == true

lag and val? what r these

left tartan Aug 1, 2023, 8:39 PM

#

(Lag=shift)

#

I just mean; compare boo to previous boo, using shift. Like df[boo]==df[boo].shift()

twilit tundra Aug 1, 2023, 8:41 PM

#

I have to say, I appreciate boo over foo

left tartan Aug 1, 2023, 8:41 PM

#

And maybe & df[boo] otherwise you’d drop consecutive falses

agile cobalt Aug 1, 2023, 8:41 PM

#

I would just use series.diff() == 1

umbral charm Aug 1, 2023, 8:41 PM

#

Boo is just my go to name for throwawy columns

#

it used to be boob

left tartan Aug 1, 2023, 8:42 PM

#

agile cobalt I would just use `series.diff() == 1`

I’m assuming they want to eliminate runs, not just single changes

agile cobalt Aug 1, 2023, 8:43 PM

#

.diff() would be False => False : 0 True => True : 0 True => False : -1 False => True : 1 their (original) question was identifying where it goes from False => True

umbral charm Aug 1, 2023, 8:44 PM

#

That couldwork

twilit tundra Aug 1, 2023, 8:44 PM

#

It definitely does

agile cobalt Aug 1, 2023, 8:44 PM

#

sounds like at some point you tried to remove entire runs of 2+ consecutive Trues?

umbral charm Aug 1, 2023, 8:44 PM

#

agile cobalt sounds like at some point you tried to remove entire runs of 2+ consecutive True...

mhm

#

max 11 consecutives

night kernel Aug 1, 2023, 8:46 PM

#

twilit tundra Referencing the notebook, it looks like the format is userid itemid rating

how can i acheive this format if i dont have a rating?

#

i think the dataset i have has the user and item id's, however

left tartan Aug 1, 2023, 8:46 PM

#

agile cobalt `.diff()` would be ``` False => False : 0 True => True : 0 True => False : -1 Fa...

Op said more or less: more than one true together. Maybe there were two questions in thread tho

twilit tundra Aug 1, 2023, 8:47 PM

#

night kernel i think the dataset i have has the user and item id's, however

What kind of information do you have other than that?

#

Does it represent clicks? Or buying?

night kernel Aug 1, 2023, 8:48 PM

#

tweets dataset

umbral charm Aug 1, 2023, 8:48 PM

#

agile cobalt `.diff()` would be ``` False => False : 0 True => True : 0 True => False : -1 Fa...

Did this and something fishy happend

#

https://gyazo.com/e59f20d99d51bfb073ee597054733a92 this is with my df

Gyazo

#

this is with your series.diff
https://gyazo.com/511dbeb486e7fe49568cf2099a253920

Gyazo

agile cobalt Aug 1, 2023, 8:49 PM

#

did you do == 1 or != 0

umbral charm Aug 1, 2023, 8:49 PM

#

== 1

twilit tundra Aug 1, 2023, 8:49 PM

#

night kernel tweets dataset

Does the link represent a yser posting?

agile cobalt Aug 1, 2023, 8:50 PM

#

show what exactly you did? (code)

night kernel Aug 1, 2023, 8:50 PM

#

twilit tundra Does the link represent a yser posting?

this is where i got it https://ktype.net/wiki/research:articles:progress_20110209

#

it looks like weights are calculated

#

and i think thats the rating

agile cobalt Aug 1, 2023, 8:50 PM

#

derp oh wait .diff() with bools seems to be just XOR

#

sorry, you'll have to .astype(int).diff() instead of just .diff()
you can specify np.int8 instead of int if you want

twilit tundra Aug 1, 2023, 8:52 PM

#

Or you can add an and with the same column if you want to stay in full boolean for some reason

umbral charm Aug 1, 2023, 8:52 PM

#

agile cobalt sorry, you'll have to `.astype(int).diff()` instead of just `.diff()` you can sp...

Works like a charm holy shit

#

How do you guys think of these

agile cobalt Aug 1, 2023, 8:53 PM

#

seen similar problems a few times in the past

umbral charm Aug 1, 2023, 8:53 PM

#

all you people seem like proper smart

agile cobalt Aug 1, 2023, 8:53 PM

#

if you haven't yet, check out the pandas User Guides and take a look over all the different functions in the documentation, or at least ones that catch your eye

umbral charm Aug 1, 2023, 8:53 PM

#

be working for apple or some shi

twilit tundra Aug 1, 2023, 8:53 PM

#

night kernel and i think thats the rating

I don't have the time to read the full article but I'm pretty sure this is an n-gram representation, not a recommender dataset

agile cobalt Aug 1, 2023, 8:54 PM

#

StackOverflow is also pretty useful if you know how to search effectively

umbral charm Aug 1, 2023, 8:55 PM

#

agile cobalt StackOverflow is also pretty useful if you know how to search effectively

I love stackover flow, apart from the fact answers are a decade old, they still work somehow.

twilit tundra Aug 1, 2023, 8:57 PM

#

Surprisingly enough, they often edit their answer to correct it if a new version breaks it

slim bone Aug 1, 2023, 9:28 PM

#

Hey fellas, quick question about gradients:
I'm using MSE as my cost function
Now I'm trying to calculate the gradient, but I'm at a bit of an intuitive crossroad:
On one hand, the gradient should consist of all of my weights, each one being its own variable (So in my case of a 28x28 image, 784 variables)
On the other hand, the gradient of MSE is just:
2/n * (prediction_vector - target_vector)
And my prediction has 10 variables.

What am I missing?

agile cobalt Aug 1, 2023, 9:29 PM

#

you should just about never calculate the gradient yourself, but rather leave it up for the library you're using to determine it for you (pytorch, tensorflow, jax etc)

slim bone Aug 1, 2023, 9:29 PM

#

This is on purpose

agile cobalt Aug 1, 2023, 9:30 PM

#

take a look at https://pytorch.org/tutorials/beginner/blitz/autograd_tutorial.html - while being focused on torch, it explains the concept in general

slim bone Aug 1, 2023, 9:30 PM

#

And also not entirely the point of the question - there's clearly a knowledge gap here

slim bone Aug 1, 2023, 9:30 PM

#

agile cobalt take a look at https://pytorch.org/tutorials/beginner/blitz/autograd_tutorial.ht...

I've read this

agile cobalt Aug 1, 2023, 9:31 PM

#

backpropagation takes the loss of the output of an operation and broadcasts it to the input

slim bone Aug 1, 2023, 9:31 PM

#

I know...

desert oar Aug 1, 2023, 9:34 PM

#

slim bone Hey fellas, quick question about gradients: I'm using MSE as my cost function No...

the gradient with respect to what?

slim bone Aug 1, 2023, 9:34 PM

#

desert oar the gradient _with respect to what_?

Pardon? I'm not sure I understand

#

Gradient descent is what I'm after

desert oar Aug 1, 2023, 9:35 PM

#

yeah, you want the vector of partial derivatives with respect to each parameter in your model

slim bone Aug 1, 2023, 9:36 PM

#

desert oar yeah, you want the vector of partial derivatives with respect to each parameter ...

So, if I have 784 pixels as my input, and 0 layers (e.g., just input/output) - would the relevant gradient be a column vector of size 784?

#

Well, just a vector of shape (1,784) or whatever.

desert oar Aug 1, 2023, 9:36 PM

#

yes, if you're treating them as 784 individual features

slim bone Aug 1, 2023, 9:36 PM

#

Right. But how do I compute said gradient?

#

If I'm using MSE as my loss function

desert oar Aug 1, 2023, 9:37 PM

#

using the chain rule + rearranging terms to get the usual backpropagation formula

agile cobalt Aug 1, 2023, 9:37 PM

#

which loss function you are using does not influences this part at all btw

desert oar Aug 1, 2023, 9:37 PM

#

+1

slim bone Aug 1, 2023, 9:37 PM

#

Hmm? I thought we calculate the gradient of the loss function?

#

at a certain point*

desert oar Aug 1, 2023, 9:38 PM

#

yes, but specifically the gradient with respect to the parameters of the model

slim bone Aug 1, 2023, 9:38 PM

#

I'm not sure what "with respect" means in this context

desert oar Aug 1, 2023, 9:38 PM

#

the loss function is usually something like loss(prediction(parameters), data)

#

so you need the chain rule to get at the gradient with respect to the parameters

slim bone Aug 1, 2023, 9:38 PM

#

def MSE(prediction, ideal): is what I have

desert oar Aug 1, 2023, 9:38 PM

#

slim bone I'm not sure what "with respect" means in this context

it means that's what you're treating as the "input" to your function

desert oar Aug 1, 2023, 9:39 PM

#

slim bone `def MSE(prediction, ideal):` is what I have

i'm talking about the math, forget the code

slim bone Aug 1, 2023, 9:39 PM

#

Oh, apologies

desert oar Aug 1, 2023, 9:39 PM

#

if you have an expression like f(x,y,z) = ax + by + cz then you're implying that x,y,z are the "inputs" to the function

slim bone Aug 1, 2023, 9:39 PM

#

Right

desert oar Aug 1, 2023, 9:40 PM

#

so the gradient of the function with respect to x,y,z would be the vector of partial derivatives with respect to each input

slim bone Aug 1, 2023, 9:40 PM

#

Right

desert oar Aug 1, 2023, 9:40 PM

#

but you could also talk about the gradient with respect to a,b,c, reversing the roles of x,y,z and a,b,c

slim bone Aug 1, 2023, 9:40 PM

#

Huh, the scalers?

desert oar Aug 1, 2023, 9:40 PM

#

it's just math jargon to specify which variables are "inputs" and which aren't

slim bone Aug 1, 2023, 9:41 PM

#

I mean if you calculate the gradient relative to the scalers you'd just get nonsense no?

desert oar Aug 1, 2023, 9:41 PM

#

slim bone I mean if you calculate the gradient relative to the scalers you'd just get nons...

let's assume they're all scalars, and no

#

a and x are identical here except that one is treated as an "input" and the other is treated as given

#

but you can just swap the symbols

#

when you say "with respect to", it's telling me which symbols represent "inputs"

slim bone Aug 1, 2023, 9:42 PM

#

Oh, so if you "calculate a gradient relative to a,b,c" are you assuming those are the inputs now?

desert oar Aug 1, 2023, 9:42 PM

#

right. so if i have an expression like loss(prediction(parameters), data) the gradient with respect to prediction(parameters) is different from the gradient with respect to parameters

#

i think you're thinking you need the former, but you need the latter

slim bone Aug 1, 2023, 9:43 PM

#

Can you maybe explain what prediction(parameters) is exactly? I might be missing the point here

desert oar Aug 1, 2023, 9:43 PM

#

thinking about this properly kind of requires you to flip around what the "inputs" are

#

the prediction that you produce is literally a function of the parameters of the model + the data

slim bone Aug 1, 2023, 9:43 PM

#

Right

desert oar Aug 1, 2023, 9:43 PM

#

i guess i should write it loss(prediction(parameters, x), y)

slim bone Aug 1, 2023, 9:44 PM

#

loss being our loss function I assume?

desert oar Aug 1, 2023, 9:44 PM

#

yes

slim bone Aug 1, 2023, 9:44 PM

#

Just trying to make sure I understand you

desert oar Aug 1, 2023, 9:44 PM

#

thinking about this as an optimization problem requires you to flip around what you understand the inputs to be

#

the optimization problem treats the data as fixed. x and y are handed to us as-is and do not change.

slim bone Aug 1, 2023, 9:45 PM

#

Right

desert oar Aug 1, 2023, 9:45 PM

#

we are now interested in finding the parameters (model weights, coefficients, whatever) that minimize loss

#

that is, we are maximizing the loss as a function of the parameters

#

so when we talk about the gradient of the loss function, we are talking about the gradient of the loss function with respect to the model parameters

slim bone Aug 1, 2023, 9:46 PM

#

Again, "model parameters" = weights/biases?

desert oar Aug 1, 2023, 9:46 PM

#

yes

#

also i'd suggest maybe start with something simpler than images. imagine linear regression on a dataset like iris. 4 features, 1 response variable, all continuous numeric data.

#

forget even the notion that linear regression is a special case of a "neural network". just think about minimizing loss as a smooth differentiable function of some parameters

slim bone Aug 1, 2023, 9:47 PM

#

Yeah I do somewhat regret not going with something simpler, but this has been a learning experience all-in-all
It'd be a rather shame to stop now as I've sunk a few hours into this now

desert oar Aug 1, 2023, 9:48 PM

#

you'll get back to it

#

you'll be happy you spent time on the basics

slim bone Aug 1, 2023, 9:48 PM

#

Ah, this is the basics as far as I'm concerned

desert oar Aug 1, 2023, 9:48 PM

#

it's something i never had the discipline to do when i was younger and i'm still paying for it 10+ years later

slim bone Aug 1, 2023, 9:48 PM

#

Fair enough. I'll keep this in mind

#

Can I perhaps type out a concrete example (relating to what I'm trying to program) to see if I got the memo? I'll make it brief.

desert oar Aug 1, 2023, 9:49 PM

#

maybe? but it seems more like a matter of understanding the math than writing out code

slim bone Aug 1, 2023, 9:49 PM

#

Oh no, I'm not talking about the code

#

But I'm not entirely sure this is a math issue either. Intuitively this does make sense to a certain degree, and I do understand what you're saying

#

I'll try to type it out

iron basalt Aug 1, 2023, 9:50 PM

#

Try a single input and output (no hidden layers). Can code that without any libraries. Then 2 inputs, 1 output.

#

See if it can learn some logic gates.

desert oar Aug 1, 2023, 9:51 PM

#

iron basalt Try a single input and output (no hidden layers). Can code that without any libr...

i was suggesting iris but that'd work too

#

coming from a (social) science background i find the idea of learning logic gates abstract and very unlike anything i'd expect to encounter in real work

slim bone Aug 1, 2023, 9:52 PM

#

desert oar maybe? but it seems more like a matter of understanding the math than writing ou...

So I have my loss function, which takes 10 outputs and does a nice little trick with them to calculate the loss itself:
f(x_1, ..., x_10) = MSE(x_1, ..., x_10)
But each of those x values were given to me by another function:
x_1 = w_1*a_1 + w_2*a_2 + ... + w_784*a_784 (a representing the pixels here, w the weights)

Is this correct so far?

desert oar Aug 1, 2023, 9:53 PM

#

slim bone So I have my loss function, which takes 10 outputs and does a nice little trick ...

10 outputs? you have 10 images, or something else?

slim bone Aug 1, 2023, 9:53 PM

#

iron basalt See if it can learn some logic gates.

This sounds complicated for some reason

slim bone Aug 1, 2023, 9:53 PM

#

desert oar 10 outputs? you have 10 images, or something else?

a single image that's 28x28 and 10 possible numbers (The image represents a digit, so something between 0 and 9)

desert oar Aug 1, 2023, 9:53 PM

#

ah, mnist

slim bone Aug 1, 2023, 9:53 PM

#

Right

desert oar Aug 1, 2023, 9:53 PM

#

why are you using MSE on this?

slim bone Aug 1, 2023, 9:54 PM

#

I'm simply following 3blue1brown's video

#

I used RMSE but I was too lazy to calculate that gradient

slim bone Aug 1, 2023, 9:54 PM

#

slim bone I'm simply following 3blue1brown's video

Rather, a watered down version* no layers for now

desert oar Aug 1, 2023, 9:54 PM

#

it's just one step in the chain rule. i'd suggest working through it. that's essential and worth drilling until it's natural.

slim bone Aug 1, 2023, 9:54 PM

#

Just input, weights, and output
I'm not even sure what the bias does here

desert oar Aug 1, 2023, 9:55 PM

#

yeah i think this needs to be dialed back

#

from what i remember that 3b1b video is meant to be illustrative and relatively nontechnical

slim bone Aug 1, 2023, 9:55 PM

#

Right

#

Nevertheless, I wanted to see if I understood what was being said - and generally speaking, considering the fact that I've gotten rid of the layers, I figured this should be fairly simple

And yet, there's a lot of gaps in my knowledge

desert oar Aug 1, 2023, 9:56 PM

#

there are still some complicating factors here that i'd like to strip out

slim bone Aug 1, 2023, 9:56 PM

#

desert oar it's just one step in the chain rule. i'd suggest working through it. that's ess...

Can you perhaps explicitly type what I should do? There's a bit of a language barrier problem here (for me) I suspect

desert oar Aug 1, 2023, 9:56 PM

#

so let's dial it back to something simpler. imagine a single continuous output like body mass, and 3 inputs: height, waist size, and chest size.

slim bone Aug 1, 2023, 9:57 PM

#

Err, okay

#

Something that approximates body mass, got it

desert oar Aug 1, 2023, 9:57 PM

#

yeah. let's say we are interested in whether we can determine body mass from those 3 measurements

slim bone Aug 1, 2023, 9:58 PM

#

Alright

desert oar Aug 1, 2023, 9:58 PM

#

so we propose a simple model of the form y = b1*x1 + b2*x2 + b3*x3 + b0, where the xs are the 3 measurements and y is body mass

slim bone Aug 1, 2023, 9:58 PM

#

So far so good

#

What's b0 in this context btw?

desert oar Aug 1, 2023, 9:59 PM

#

this is the standard linear regression model. among many many other things, we can interpret it as 1 input layer and 1 output layer.

slim bone Aug 1, 2023, 9:59 PM

#

desert oar this is the standard linear regression model. among many many other things, we c...

Right, similar to what I'm trying to do

desert oar Aug 1, 2023, 9:59 PM

#

slim bone What's `b0` in this context btw?

it just sets the y intercept

slim bone Aug 1, 2023, 9:59 PM

#

Like a +C with antiderivatives?

desert oar Aug 1, 2023, 10:00 PM

#

that's what the machine learning people call "bias" because it kind of resembles bias in an electrical circuit. it's unrelated to the statistical term bias. statisticians call it an "intercept" to avoid the confusion & because it's literally the y intercept.

#

imagine setting all x1, ..., x3 to 0. then what's y?

slim bone Aug 1, 2023, 10:00 PM

#

just b0

desert oar Aug 1, 2023, 10:00 PM

#

right

#

if you didn't have b0, that forces y to be 0 as well when all xs are 0, which forces the entire line/plane you fit to pass through the origin, which is restrictive and makes your model worse for no benefit

slim bone Aug 1, 2023, 10:02 PM

#

Not sure I understand why, but feel free to skip this if it's not crucial

desert oar Aug 1, 2023, 10:02 PM

#

it's worth thinking about. having good geometric intuition for the math can help a lot

slim bone Aug 1, 2023, 10:02 PM

#

I do agree, I'm just not sure what this does in the context of ML

desert oar Aug 1, 2023, 10:02 PM

#

let me draw a picture

slim bone Aug 1, 2023, 10:03 PM

#

Sure thing

desert oar Aug 1, 2023, 10:03 PM

#

slim bone I do agree, I'm just not sure what this does in the context of ML

i'm attempting to build up some kind of foundation quickly so that you can proceed in your study 🙂

slim bone Aug 1, 2023, 10:03 PM

#

You probably couldn't figure but I do have some academic mathematical background ^^; It's just hard for me to process math in English for whatever reason

#

So you might be able to skim on some explanations

slim bone Aug 1, 2023, 10:04 PM

#

desert oar i'm attempting to build up some kind of foundation quickly so that you can proce...

Much appreciated

desert oar Aug 1, 2023, 10:04 PM

#

ok, i'll keep going and hopefully you can work up to understanding why the b0 is useful

#

let's assume for now that it's useful and that we usually want it

slim bone Aug 1, 2023, 10:04 PM

#

Sure thing

desert oar Aug 1, 2023, 10:05 PM

#

so we have our simple model y = b1*x1 + b2*x2 + b3*x3 + b0

#

now we want to find b0, ..., b3 that produce the best line/plane to describe this relationship

#

the relationship could be totally wrong, but we want to produce the best possible estimate among all relationships of this shape

#

we do so by coming up with a loss function and minimizing that

slim bone Aug 1, 2023, 10:06 PM

#

Makes sense

#

I mean, in theory at least

desert oar Aug 1, 2023, 10:07 PM

#

just to avoid messy notation, let's call our model prediction p, so we have the following task:

minimize l(p, y) with respect to b0,...,b3 where p = b1*x1 + b2*x2 + b3*x3 + b0

slim bone Aug 1, 2023, 10:08 PM

#

l is the cost function
y is the body mass
Mostly typing this for myself

desert oar Aug 1, 2023, 10:08 PM

#

so how do we do that? we note that p is differentiable with respect to the bs, so as long as l is differentiable and convex, we have the whole wide world of convex differentiable optimization techniques available to us

slim bone Aug 1, 2023, 10:09 PM

#

Sorry, convex?

desert oar Aug 1, 2023, 10:09 PM

#

https://en.wikipedia.org/wiki/Convex_function

Convex function

In mathematics, a real-valued function is called convex if the line segment between any two distinct points on the graph of the function lies above the graph between the two points. Equivalently, a function is convex if its epigraph (the set of points on or above the graph of the function) is a convex set. A twice-differentiable function of a si...

#

basically, it's a bowl, and there is a bottom of the bowl. we need to find the bottom of the bowl.

slim bone Aug 1, 2023, 10:10 PM

#

Oh, sure lol

#

But only a part of it is uh... "convex", no?

#

Or rather parts* of it

desert oar Aug 1, 2023, 10:11 PM

#

well in this case the whole thing is, but yeah the real life loss surfaces are enormously complicated

slim bone Aug 1, 2023, 10:11 PM

#

Right

desert oar Aug 1, 2023, 10:11 PM

#

we aren't always guaranteed to have a global minimum. gradient descent only finds a global minimum under certain nice conditions, otherwise it finds a local minimum and we hope it's a good one

slim bone Aug 1, 2023, 10:11 PM

#

Right

desert oar Aug 1, 2023, 10:12 PM

#

in this particular case there happens to be an exact analytical solution (which you'll spend quite a lot of time reasoning about in a statistics class, it turns out to be just an orthogonal projection), but you can also use gradient descent, so that's what we'll use because it's what neural networks use

slim bone Aug 1, 2023, 10:13 PM

#

orthogonal projection
Oh god, those are relevant to statistics? :(

#

Nevermind, sidetracked

desert oar Aug 1, 2023, 10:13 PM

#

yes, linear algebra is essential in stats and machine learning

warm sage Aug 1, 2023, 10:13 PM

#

Hi, I was hoping to get some advice here.

I am working on a digital text sentiment analysis tool in python. I was hoping to achieve this using machine learning and an amazon review dataset.

First of all, I'm not sure what type of model i will need to create (eg Linear Regression Model) so I could use some help deciding that.
Second of all, I have a 100gb file full of reviews and im not sure of the best way to go about importing and training on this data.

Thanks in advance

slim bone Aug 1, 2023, 10:14 PM

#

To machine learning - sure. I was just hoping to be done with it in my remaining math-oriented academic courses
Nevermind though. Gradient descent. Sure

desert oar Aug 1, 2023, 10:14 PM

#

for gradient descent, we need the gradient. but be careful: we specifically want the gradient of l with respect to the bs

slim bone Aug 1, 2023, 10:14 PM

#

Yes

desert oar Aug 1, 2023, 10:14 PM

#

remember, we are trying to minimize l over all bs

#

so we treat l as a function of the bs

#

does that make sense?

slim bone Aug 1, 2023, 10:15 PM

#

So you get the partial derivative of (for example) b_1 * x_1 where b_1 is the variable, so its just... b_1?

#

I'm probalby jumping the gun

#

Better to just understand by example, perhaps

desert oar Aug 1, 2023, 10:16 PM

#

slim bone I'm probalby jumping the gun

no, this was my next question for you. it's calculus time. what's the gradient?

slim bone Aug 1, 2023, 10:17 PM

#

The gradient are the partial derivatives with respect(?) to the variable you're looking for

desert oar Aug 1, 2023, 10:17 PM

#

slim bone So you get the partial derivative of (for example) `b_1 * x_1` where `b_1` is th...

the partial derivative of b1 * x1 with respect to b1 is x1

slim bone Aug 1, 2023, 10:17 PM

#

Oh, right

desert oar Aug 1, 2023, 10:17 PM

#

slim bone The gradient are the partial derivatives with respect(?) to the variable you're ...

yeah sorry. i mean it's time to compute it

#

use the chain rule

slim bone Aug 1, 2023, 10:17 PM

#

3x + 2y -> 3 partial derivative with x

#

Yeah I forgot

#

All clear

#

So the gradient would be (x_1, x_2, x_3) since there are no duplicate b's or whatever

desert oar Aug 1, 2023, 10:18 PM

#

that's the gradient of p with respect to the b's yes

slim bone Aug 1, 2023, 10:19 PM

#

Right

desert oar Aug 1, 2023, 10:19 PM

#

let's assume l is (p - y)^2. and p = b1*x1 + b2*x2 + b3*x3 + b0 as before. what's the gradient of l with respect to the bs?

slim bone Aug 1, 2023, 10:19 PM

#

Err just a moment, I need to go back to the original equation

#

Err... isn't that 0

#

Because p = y?

#

Or am I misreading

#

Probably misreading, you probably want me to use the chain rule with f(x) = x^2

desert oar Aug 1, 2023, 10:21 PM

#

slim bone Because p = y?

i adjusted the notation. p is our prediction, y is the true body mass in the dataset. x and y are given to us and we treat them as fixed

slim bone Aug 1, 2023, 10:21 PM

#

Ah apologies

desert oar Aug 1, 2023, 10:22 PM

#

i need to go make dinner. ponder this for now, because i think it's the core of what you were struggling with originally

slim bone Aug 1, 2023, 10:23 PM

#

desert oar i need to go make dinner. ponder this for now, because i think it's the core of ...

Could be. I think something clicked at the very least
Bon appetit~

#

Oh and, many thanks for your patience and help of course*

desert oar Aug 1, 2023, 10:23 PM

#

i strongly suggest working through the actual calculation here to get an analytical closed-form expression for the gradient

#

it's a drill that should feel easy

slim bone Aug 1, 2023, 10:24 PM

#

Indeed, but its crazy how quickly the human mind forgets things - I finished calc2 less than a month ago haha

#

I'll work through this

#

so:
l = (p - y)^2
l = (b1*x1 + b2*x2 + b3*x3 + b0 - y)^2
the partial derivative of b1 would be, err...
f(g(x))' = f'(g(x)) * g'(x) ->
f(x) = x^2, g(x) = (p - y) ->
2*(b1*x1 + b2*x2 + b3*x3 + b0 - y) * x1 =
2*x1(b1*x1 + b2*x2 + b3*x3 + b0 - y)? (partial derivative of b1)

Will pop something similar into wolfram real quick just to sanity check

#

Looks about right. I'll ponder on this for a little while longer
Thanks again, on the offchance you're reading this

tepid hazel Aug 1, 2023, 10:30 PM

#

hi there, I trained a model on a good bit of text based data and the model seems to give really odd results. Even when I copy and paste a sample of the training data into the model to be predicted it will return 1 despite the piece of data given having been labled as 0 when the model was trained. Is this a syntom of overfitting? I didn't add any sort of dropoff or regulation to tensorflow so I suspect it may be but would such cause the model to not even be able to identify data which was inside it's training data?

twilit tundra Aug 1, 2023, 10:35 PM

#

tepid hazel hi there, I trained a model on a good bit of text based data and the model seems...

Overfitting would mean on the contrary that samples from the training set are almost always classified correctly

#

Do you have a loss curve to check or something?

void veldt Aug 1, 2023, 10:38 PM

#

I'm looking for a way to determine the probability (something akin to a p value) of getting a particular set of residuals (i.e. chi2) from a set of fitted solutions to my model (non linear least squares). I've seen a number of tests (e.g. pearsons chi squared test) but don't know which one is correct, and these also don't appear to be %s either

twilit tundra Aug 1, 2023, 10:41 PM

#

You'd need to define the model you're using, the labels you're trying to classify, and what you'd define as reasonable results

#

I'd suggest using already available datasets such as Fashionpedia

#

And if you want good performances, the best way would probably be to use a prerrained computer vision model and use transfer learning

tepid hazel Aug 1, 2023, 11:12 PM

#

twilit tundra Do you have a loss curve to check or something?

Sadly not sorry, I have the loss value which I believe to be 1.4~

desert oar Aug 1, 2023, 11:25 PM

#

slim bone so: `l = (p - y)^2` `l = (b1*x1 + b2*x2 + b3*x3 + b0 - y)^2` the partial derivat...

correct. see page 4 for a slightly clearer way to write this https://see.stanford.edu/materials/aimlcs229/cs229-notes1.pdf

#

actually this document is very good and i suggest working through it

#

it seems right at your level

desert oar Aug 1, 2023, 11:35 PM

#

void veldt I'm looking for a way to determine the probability (something akin to a p value)...

are these continuous or discrete data? it sounds like you want the joint distribution of the set of residuals for a given dataset

void veldt Aug 1, 2023, 11:36 PM

#

desert oar are these continuous or discrete data? it sounds like you want the joint distrib...

discrete

desert oar Aug 1, 2023, 11:36 PM

#

which unless i am misunderstanding your intention, is just whatever error distribution is built into your model

#

ah. what kind of model?

#

it still sounds like you want something along the lines of an error distribution, which is pretty much exactly what most statistical models try to estimate

void veldt Aug 1, 2023, 11:38 PM

#

I've seen sometimes in their fits people report they got a chi2=1.5 with a 0.01% p value. Similar to how in F tests you report a p value (except there it's the probability of increasing adjustable parameters gives you a better fit)

#

here I'm looking more for the probability of my current fit given my data and model

#

and solutions

desert oar Aug 1, 2023, 11:49 PM

#

void veldt here I'm looking more for the probability of my current fit given my data and mo...

that just sounds like P(Y = y | X = x, Θ = θ) right?

void veldt Aug 1, 2023, 11:51 PM

#

desert oar that just sounds like `P(Y = y | X = x, Θ = θ)` right?

I'm afraid I don't quite understand this terminology. From my reading it's more P(x^2|v) where v is degrees of freedom

desert oar Aug 1, 2023, 11:52 PM

#

void veldt I'm afraid I don't quite understand this terminology. From my reading it's more ...

i am very literally talking about the probability distribution of the random variable Y | X = x, Θ = θ

#

are you interested in the probability of your exact model predictions, among all possible model predictions?

void veldt Aug 1, 2023, 11:54 PM

#

desert oar are you interested in the probability of your *exact* model predictions, among a...

I believe so

desert oar Aug 1, 2023, 11:55 PM

#

what kind of model is this?

void veldt Aug 1, 2023, 11:57 PM

#

desert oar what kind of model is this?

it's a custom non linear model

#

unless I'm misunderstanding ur question

desert oar Aug 1, 2023, 11:58 PM

#

this sounds like a hard task in general unless your model is parametric with a specific data distribution

#

i'd be tempted to solve this by simulation

#

generate realistic data, fit the model, repeat many times

void veldt Aug 2, 2023, 12:05 AM

#

I'm more so looking for given this chi2 from my minimized solution, how unique is this chi2? Can I get this by another random set of solutions?

supple plover Aug 2, 2023, 1:55 AM

#

I wonder how do you sort and refine a large amount of data for image classification/computer vision models? Are there automatic image labelling tools?

agile cobalt Aug 2, 2023, 2:46 AM

#

supple plover I wonder how do you sort and refine a large amount of data for image classificat...

if you rely on a tool to create your dataset automatically, any models trained from that data will perform at best as poorly as those tools do.

the best 'quality' datasets are typically labelled manually, by a lot of people hired specifically for that (see: human annotators)

for some purposes, you can just use images from Bing's API and alike, but typically you should prefer using curated datasets if any exist for the task you're trying to do

supple plover Aug 2, 2023, 2:47 AM

#

I see so there's no going around annotating manually for the best/cleanest datasets huh

#

is that why people keep saying AI/ML is like 90-99% spent on the data and only the remaining for the actual model? kekl

agile cobalt Aug 2, 2023, 2:47 AM

#

if you haven't yet, take a look at ImageNet and all the work that went behind the dataset used by it

#

part of, but not all of it

#

not just collecting/labelling data, but also dealing with issues like missing data, making sure you didn't misunderstand anything, checking some statistical properties sometimes

supple plover Aug 2, 2023, 2:50 AM

#

this field is really hard....

tepid hazel Aug 2, 2023, 3:18 AM

#

hello! my model as shown below seems to be suffering from what I can only assume to be overfitting. After retraining it and adding some regulation via L2 and disabling 50% of neurons during training with a 0.5 dropout. I'm not sure what I'm doing wrong here but whenever the model is tested on any text it will return something like 0.998~ however it seems to perform very well on the training data as when passed in it gets it correct. Here's my model ```py
model = tf.keras.Sequential([
tf.keras.layers.Embedding(input_dim=len(
tokenizer.word_index) + 1, output_dim=128, input_length=max_seq_length),
tf.keras.layers.LSTM(64),
tf.keras.layers.Dropout(0.5),
tf.keras.layers.Dense(1, activation='sigmoid',
kernel_regularizer=tf.keras.regularizers.l2(0.01))
])

and here are the epochs and final loss/accuracy, (I'm not sure why the final accuracy is so high) ```
loss: 0.0272 - accuracy: 0.9967
loss: 0.0134 - accuracy: 0.9992
loss: 0.0113 - accuracy: 0.9996
loss: 0.0096 - accuracy: 1.0000
loss: 0.0097 - accuracy: 0.9998
loss: 0.0095 - accuracy: 0.9999
loss: 0.0091 - accuracy: 1.0000
loss: 0.0091 - accuracy: 1.0000
loss: 0.0092 - accuracy: 1.0000
loss: 0.0090 - accuracy: 1.0000

Loss: 0.021111026406288147, Accuracy: 0.996656596660614

lapis sequoia Aug 2, 2023, 4:14 AM

#

Beginner here. Have some experience with using Tensorflow and Keras though at a novice level. What's one thing I cam do to go to the next level?

twilit tundra Aug 2, 2023, 5:40 AM

#

tepid hazel hello! my model as shown below seems to be suffering from what I can only assume...

What does the labels of your training data look like? My first instinct is that the class 1 label is overrepresented

twilit tundra Aug 2, 2023, 5:47 AM

#

lapis sequoia Beginner here. Have some experience with using Tensorflow and Keras though at a ...

Not sure what beginner level means. If you want to improve your ML skills, you can:
A) Train/fine-tune more complex models
B) Use your models on "real" use cases
C) Reimplement the basic bricks from scratch to learn how they work
It's not an exhaustive list and it really depends on what kind of skills you want/need to develop

hasty mountain Aug 2, 2023, 9:50 AM

#

supple plover I see so there's no going around annotating manually for the best/cleanest datas...

I haven't seen the process behind ImageNet, but I've been doing some (personal) researches on dataset labeling around that (and exactly to make my own datasets)
You should try taking a look at Unsupervised Learning and, specially, Self-Learning(which may provide you with better results).

This blog post may also help you:

https://lilianweng.github.io/posts/2021-12-05-semi-supervised/

Learning with not Enough Data Part 1: Semi-Supervised Learning

When facing a limited amount of labeled data for supervised learning tasks, four approaches are commonly discussed.
Pre-training + fine-tuning: Pre-train a powerful task-agnostic model on a large unsupervised data corpus, e.g. pre-training LMs on free text, or pre-training vision models on unlabelled images via self-supervised learning, and the...

#

In a nutshell, there's no escape from having to manually label your dataset, but you can spare some work ~~and anti-inflammatories~~ if you can make a model (and a method, maybe? Like SimCLR?) that's able to properly learn from few labeled samples and generate good quality pseudolabels (or labels automatically generated) for the rest of your dataset.

supple plover Aug 2, 2023, 9:54 AM

#

I'll look into it. To be fair, I'm only looking into it bcs I'm just studying all this alone, surely companies have human annotators to do the labeling.

hasty mountain Aug 2, 2023, 9:55 AM

#

Poor guys...

sleek harbor Aug 2, 2023, 10:14 AM

#

Anyone a pro in Plotly Dash?

I wanted to know:
1 - is it true that all callbacks get called automatically at the start, when the app is booted? If so, then in what order? Can that be checked/changed somehow?
2 - does that mean that there's no point in setting the value of a parameter/property inside the layout definition, if that parameter/property is the output of a callback, because it'll immediately be replaced by the output of the first automatic callback call?

lusty lotus Aug 2, 2023, 11:07 AM

#

I have an RL question. In this video https://youtu.be/my207WNoeyA?list=PLZbbT5o_s2xoWNVdDudn51XM8lOuZ_Njv&t=242, i understand previously that:

a function taking state s and action a can be mapped as f(s\sub{t}, a\sub{t}) = r\sub{t+1}
RL is mostly based on a loop State -> Action -> Reward
However i don't understand this, i don't understand how (and what) the transition probability is, even though i understand that the current action picked from a state determines the reward.

tidal bough Aug 2, 2023, 11:08 AM

#

There exist environments which aren't deterministic - the same action in the same state may result in varied next states.

lusty lotus Aug 2, 2023, 11:08 AM

#

right

#

but i don't understand the text and the math notation in the video ive just linked above

#

like i don't know how does the information this page tie into contents discussed previously

tidal bough Aug 2, 2023, 11:09 AM

#

This slide?

lusty lotus Aug 2, 2023, 11:10 AM

#

yes this one. i don't get it

tidal bough Aug 2, 2023, 11:15 AM

#

That's the distribution over possible next states - if you're in state s and take action a of the allowed ones in that state, you may end up in state s' with reward r with probability p(s',r | s,a). The equation on the bottom is just rewriting the same thing - it's defined as the probability that S_t, the state at time t, and R_t, the reward at time t, are s' and r respectively, conditional on the state at time t-1 being s and the action taken at time t-1 being a.

lusty lotus Aug 2, 2023, 11:22 AM

#

tidal bough That's the distribution over possible next states - if you're in state `s` and t...

i have a few questions on this:

what does the | mean
so does p(s',r | s,a) really just mean "i do something at state s', which is a (allowed by the state), which gives me reward r at state s'?

#

idk what the lower rewrite is

#

i mean the second part of the p(s',r | s,a)

tidal bough Aug 2, 2023, 11:24 AM

#

lusty lotus i have a few questions on this: - what does the | mean - so does p(s',r | s,a) ...

That's the "conditional" notation - e.g. P(A|B) is "probability A happens, conditional on B having happened"

so does p(s',r | s,a) really just mean "i do something at state s', which is a (allowed by the state), which gives me reward r at state s'?
If you do action a at state s, you can, in the general case, get any reward and end up in any state - and that's governed by a probability distribution. Specifically, the probability of getting reward r and ending up in state s' is p(s',r | s,a).

#

E.g. if your environment is fully deterministic, then for each s,a, there'll be just one specific s',r pair the probability of which will be 1, and the probabilities of all other states-rewards pairs will be 0.

lusty lotus Aug 2, 2023, 11:28 AM

#

tidal bough That's the "conditional" notation - e.g. `P(A|B)` is "probability A happens, con...

im not sure if i understand you correctly:

That's the "conditional" notation - e.g. P(A|B) is "probability A happens, conditional on B having happened"
does it mean "the probablility of A happening after B happens"?

and what you mean in your second part here is that after taking an action, the state and reward is like random but the chances of a SPECIFIC state and reward occuring is whatever is on the other side of the equal sign of p(s',r | s,a)? like a "spin a wheel" where the wheel has like sections with different colour?

lusty lotus Aug 2, 2023, 11:29 AM

#

tidal bough E.g. if your environment is fully deterministic, then for each `s,a`, there'll b...

and what you mean here is that if the game is deterministic, for action s,a will always yield s',r? so p(s',r | s,a) = 1 and everything else 0?

tidal bough Aug 2, 2023, 11:31 AM

#

lusty lotus and what you mean here is that if the game is deterministic, for action s,a will...

Well, sure. Of course, not the same pair for every state (or that'd be a very boring game where any action in any state gets you the same reward and gets you into the same state).

lusty lotus Aug 2, 2023, 11:32 AM

#

wdym same pair

tidal bough Aug 2, 2023, 11:32 AM

#

lusty lotus im not sure if i understand you correctly: > That's the "conditional" notation -...

does it mean "the probablility of A happening after B happens"?
Sure. That's probability-theory notation.

lusty lotus Aug 2, 2023, 11:32 AM

#

but in other words, should i replicate action a at state s, i'll yield the same rewards every time right?

lusty lotus Aug 2, 2023, 11:33 AM

#

tidal bough > does it mean "the probablility of A happening after B happens"? Sure. That's p...

nice, thanks for your clarification :D

#

im learning RL basics since im implementing AlphaZero (i set up the search alg and NN already, just need to implement the training loop since the paper implies i know this alr)

#

can't wait to learn this and implement training loop and train on my new GPU (excited)

tidal bough Aug 2, 2023, 11:34 AM

#

lusty lotus but in other words, should i replicate action a at state s, i'll yield the same ...

Yup, and will end up in the same next state.

lusty lotus Aug 2, 2023, 11:34 AM

#

right

#

may i ping you when i see something idk?

tidal bough Aug 2, 2023, 11:34 AM

#

probably just ask here, I am not always online.

lusty lotus Aug 2, 2023, 11:34 AM

#

sure

#

but like a lot of times my question is ignored or get pointed to an SO link

#

:/

lusty lotus Aug 2, 2023, 11:51 AM

#

what exactly is expected return? is it the sum of all future (anticipated) rewards from current time step t, all the way to future final time step T? https://youtu.be/a-SnJtmBtyA?list=PLZbbT5o_s2xoWNVdDudn51XM8lOuZ_Njv&t=65

tidal bough Aug 2, 2023, 11:56 AM

#

well, sure.

lusty lotus Aug 2, 2023, 12:01 PM

#

then the thing is

#

i don't understand why discounted reward exists and why it's useful lol

#

it says here but i don't get it https://youtu.be/a-SnJtmBtyA?list=PLZbbT5o_s2xoWNVdDudn51XM8lOuZ_Njv&t=145

tidal bough Aug 2, 2023, 12:06 PM

#

because if you don't do any discounting, for many games the expected reward is clearly infinity no matter what you do, so not much to optimize.

lusty lotus Aug 2, 2023, 12:06 PM

#

tidal bough because if you don't do any discounting, for many games the expected reward is c...

well yeah but i don't get how discounting it would ever help

tidal bough Aug 2, 2023, 12:06 PM

#

discounting will make the reward finite always, even if the game will be infinite.

lusty lotus Aug 2, 2023, 12:06 PM

#

like is it like infinity*80% or some shit?

#

this concept doesn't make sense

tidal bough Aug 2, 2023, 12:07 PM

#

no? like the sum of decaying exponential progression being finite.

boreal gale Aug 2, 2023, 12:07 PM

#

sleek harbor Anyone a pro in Plotly Dash? I wanted to know: 1 - is it true that all callback...

not a pro, used it a couple of times, actually still getting up to speed with it this week after not using it for years.
re. 1)
https://dash.plotly.com/advanced-callbacks#when-a-dash-app-first-loads
yes, it is called automatically
order is determined by a dependency tree (i think of it as a DAG 🤷‍♂️ )
see https://community.plotly.com/t/what-is-the-execution-order-of-callbacks/6858/2 for an answer from the author himself.

no comment on 2)

lusty lotus Aug 2, 2023, 12:08 PM

#

so my questions are:

how does making it discounted make it possible for continuous action (ik youve explained it but i still don't get it)
wouldn't making the discounted reward make it "less accurate"? like the agent is getting less reward (sad)
what does the equation here mean https://youtu.be/a-SnJtmBtyA?list=PLZbbT5o_s2xoWNVdDudn51XM8lOuZ_Njv&t=196

tidal bough Aug 2, 2023, 12:11 PM

#

Well, that's just the equation for discounted reward - it's expected reward except will multiply the terms by 1, γ^2, γ^3, ... - a decaying exponential progression. It's a math fact that if you take most series (those that don't grow unboundedly, or even do grow unboundedly but not exponentially fast) and construct a "discounted" series like that, the sum of it will be finite. So this makes the reward finite even for infinite games.

#

wouldn't making the discounted reward make it "less accurate"? like the agent is getting less reward (sad)
kind of, yeah, but this is more of a philosophical point. Note that you can make γ arbitarily close to 1 if you want the agent to consider the future more - as long as it's below 1, the discounting will work.

lusty lotus Aug 2, 2023, 12:15 PM

#

tidal bough Well, that's just the equation for discounted reward - it's expected reward exce...

then i have more questions in addition to the original ones:

why is there a expected return in the first place? even for episodic tasks? why (and how) would the AI make the most of all anticipated rewards? i know the reason is along the lines of not being as myopic and focused solely on short term gains
what would the agent do with the discounted expected return?

tidal bough Aug 2, 2023, 12:17 PM

#

what would the agent do with the discounted expected return?
the value itself? nothing, it just should be maximized - so in practice, RL usually consists of getting a good idea of what sets of actions will give what rewards in the long run, then doing them.

#

why is there a expected return in the first place? even for episodic tasks? why (and how) would the AI make the most of all anticipated rewards?
I don't really understand what you mean by these.

jaunty lion Aug 2, 2023, 12:19 PM

#

hey guys, what are ways i can analyze audio data (mp3files), in such a way that the resulting extracted data would always be in the same shape, so it would be suitable for machine learning purposes. Thanks for any answers.

sleek harbor Aug 2, 2023, 12:20 PM

#

boreal gale not a pro, used it a couple of times, actually still getting up to speed with it...

I have 3 callbacks that depend on each other 1->2->3. The 1st callback creates a global variable, and the 3rd one uses it to create a table. But for some reason, when u load the app - it doesn't work, there's nothing there, no table. The strange part is that if I then relaunch the app, then it does work (p.s. doing this in a jupyter notebook, so variables carry over from one app launch (cell execution) to the next, but when I reset the kernel - it doesn't work again). So all callbacks do work, the one that creates the global variable does work, and so does the one that uses it, but something must be wrong with the order, even tho it should be correct. I just can't get it

lusty lotus Aug 2, 2023, 12:21 PM

#

tidal bough > what would the agent do with the discounted expected return? the value itself?...

how should the reward be maximised when the expected return is not really used?

lusty lotus Aug 2, 2023, 12:22 PM

#

tidal bough > why is there a expected return in the first place? even for episodic tasks? wh...

i was just asking about the purpose of expected return (and then i said that i think i know the reason of having them, yk for you to check and elaborate on my understanding)

tidal bough Aug 2, 2023, 12:24 PM

#

lusty lotus how should the reward be maximised when the expected return is not really used?

Suppose I have a function, f(x) = x^2 + 5. I want to maximize it. I calculate the symbolic derivative of it, f'(x) = 2x. I note that symbolic derivative has one zero, at x=0, where it crosses from positive to negative. Hence, f(x) is maximized at x=0. There, I maximized my function, without ever explicitly calculating its values at any points.

#

similarly, you can introduce the concept of "discounted reward" to talk about "what actions to take to maximize this", but that doesn't necessarily mean you're actually going to be evaluating that function; maybe you'll just analytically determine the best strategy from analyzing it.

tidal bough Aug 2, 2023, 12:27 PM

#

lusty lotus i was just asking about the purpose of expected return (and then i said that i t...

Well, sure, it's just how we formalize looking into the future. An agent that doesn't do that will only care about getting the best reward at the current state, which isn't necessarily the same as maximizing reward in the long run.

lusty lotus Aug 2, 2023, 12:27 PM

#

tidal bough Suppose I have a function, `f(x) = x^2 + 5`. I want to maximize it. I calculate ...

so the expected return in this cause would be the function and you'd try and maximise by finding the derivative, setting to 0 and solving?

tidal bough Aug 2, 2023, 12:29 PM

#

Kind of. It'd be a much more complicated function - a function of your strategy (what actions you take depending on the state), and you'd want to find the optimal strategy. In almost no real games will you be able to just derive the optimal strategy (for instance, because you might not even know the form of the reward). But as you'll probably soon see in the course, that still allows you to derive some important properties the optimal actions must have.

lusty lotus Aug 2, 2023, 12:29 PM

#

shit im getting confused with the math

#

:/

lusty lotus Aug 2, 2023, 12:31 PM

#

tidal bough Kind of. It'd be a much more complicated function - a function of your *strategy...

can i do this using grad descent?

wooden sail Aug 2, 2023, 12:32 PM

#

strategies are often discrete if you choose/are able to represent them numerically. otherwise they're algorithms. neither lends themself to differentiation

tidal bough Aug 2, 2023, 12:34 PM

#

lusty lotus can i do this using grad descent?

Ehh, sure, for some kinds of games you can "just" numerically optimize a very high-dimensional function. But, well, remember how a strategy is basically "what action you take in a state"? Well, the state space isn't necessarily discrete, it may be continuous. So you're finding a function that optimizes a certain value, and that's getting very complicated to represent numerically.

boreal gale Aug 2, 2023, 12:35 PM

#

sleek harbor I have 3 callbacks that depend on each other 1->2->3. The 1st callback creates a...

i think i had something similiar happened to me before, can't remember how i fixed it.
do you have a minimal reproducible example?

lusty lotus Aug 2, 2023, 12:35 PM

#

tidal bough Ehh, sure, for some kinds of games you can "just" numerically optimize a very hi...

i see. this still sounds really confusing :/

burnt oxide Aug 2, 2023, 12:36 PM

#

is this channel not so beginner friendly. 🃏

wooden sail Aug 2, 2023, 12:37 PM

#

it is if you ask beginner-friendly questions 😛

tidal bough Aug 2, 2023, 12:37 PM

#

lusty lotus i see. this still sounds really confusing :/

Well, consider the fact that this formalism is, arguably, powerful enough to describe any agent, humans included, so if acting optimally was trivial, your existence would be very boring indeed 😛

lusty lotus Aug 2, 2023, 12:38 PM

#

now i have an extential crisis 🫠

wooden sail Aug 2, 2023, 12:38 PM

#

that is optimal in some sense, i.e. the worst possible

lusty lotus Aug 2, 2023, 12:38 PM

#

questioning my purpose as a simple being when i don't understand how decisions and rewards are supposed to be maximised 🫠

tidal bough Aug 2, 2023, 12:38 PM

#

decision theory does cause that as a side effect 🙂

twilit tundra Aug 2, 2023, 12:41 PM

#

It's easy to always make the optimal decision when you know the state and position of every particle in existence and you can predict accurately human behavior

tidal bough Aug 2, 2023, 1:05 PM

#

citation needed; i think there'll be some issues even then :p

left tartan Aug 2, 2023, 1:05 PM

#

tidal bough citation needed; i think there'll be some issues even then :p

Citation: Albert Einstein: "God does not play dice".

tidal bough Aug 2, 2023, 1:07 PM

#

qualifications of Albert Einstein:

hero of many anecdotes
being famously wrong about quantum mechanics

warm sage Aug 2, 2023, 1:27 PM

#

Hi, I was hoping to get some advice here.

I am working on a digital text sentiment analysis tool in python. I was hoping to achieve this using machine learning and an amazon review dataset.

First of all, I'm not sure what type of model i will need to create (eg Linear Regression Model) so I could use some help deciding that.
Second of all, I have a 100gb file full of reviews and im not sure of the best way to go about importing and training on this data.

Thanks in advance

simple tapir Aug 2, 2023, 1:33 PM

#

Can ML engineers also work as data scientist? Do they have to do anything extra to be a data scientist?

serene scaffold Aug 2, 2023, 1:34 PM

#

simple tapir Can ML engineers also work as data scientist? Do they have to do anything extra ...

there's no consistency in what all these job titles actually mean.

simple tapir Aug 2, 2023, 1:34 PM

#

uhh

serene scaffold Aug 2, 2023, 1:35 PM

#

uhh?

simple tapir Aug 2, 2023, 1:36 PM

#

When I searched for machine learning engineer positions, I couldn't find any for some companies. That's why I wondered whether I could apply for a data scientist position. (I'm not searching for a job rn, I've just finished my freshman year and want to go through this field)

serene scaffold Aug 2, 2023, 1:39 PM

#

simple tapir When I searched for machine learning engineer positions, I couldn't find any for...

there aren't regulations around who is allowed to call their employees ML engineers or data scientists. and there isn't really even a consensus around what a "data scientist" is. you'll have to look at the job description and requirements to get a sense for what the job actually involves.

#

have you started looking for internships for next summer and beyond?

simple tapir Aug 2, 2023, 1:42 PM

#

serene scaffold have you started looking for internships for next summer and beyond?

I've just completed my first year at university, dont have any internships yet

serene scaffold Aug 2, 2023, 1:42 PM

#

right, you probably wouldn't have gotten one this summer

simple tapir Aug 2, 2023, 1:43 PM

#

yeah, can't apply for one now

#

So, it's better to look at job descriptions instead of job titles, right?

serene scaffold Aug 2, 2023, 1:44 PM

#

pretty much

simple tapir Aug 2, 2023, 1:44 PM

#

Gotcha, thanks a lot

serene scaffold Aug 2, 2023, 1:44 PM

#

at least in the context of AI/ML/DS positions

simple tapir Aug 2, 2023, 1:45 PM

#

I see, will keep that in mind

left tartan Aug 2, 2023, 1:56 PM

#

simple tapir I see, will keep that in mind

Also, don't overspecialize early: build a strong foundation. You'll need broad skills... not just "ML" skills... to thrive in any position.

#

I've seen a lot of posts saying things like: "I don't need to learn XYZ because all I want is AI/ML"

simple tapir Aug 2, 2023, 1:57 PM

#

may you give an example?

left tartan Aug 2, 2023, 2:04 PM

#

simple tapir may you give an example?

I dunno, yesterday someone was saying something about not needing to learn anything about web development because they didn't want to be a front-end developer.

#

And someone else said they didn't like data analysis but wanted to do AI/ML, which I thought was hilarious.

past meteor Aug 2, 2023, 2:11 PM

#

100 % agree with Stel

#

Additionally, just give it time tbh. Enjoy life, enjoy school, take courses that you like and do internships

vestal widget Aug 2, 2023, 2:23 PM

#

I want to create a conversational chatbot that can generate text like GPT-3 or GPT-4 and can be trained with custom data, where should i start?

lapis sequoia Aug 2, 2023, 2:26 PM

#

what is wrong with my tacotron2 training model this is supposed to be spongebob😭

mild dirge Aug 2, 2023, 2:27 PM

#

kill it

lapis sequoia Aug 2, 2023, 2:28 PM

#

😂

serene scaffold Aug 2, 2023, 2:28 PM

#

lapis sequoia what is wrong with my tacotron2 training model this is supposed to be spongebob�...

I mean it sounds like him at the beginning

lapis sequoia Aug 2, 2023, 2:28 PM

#

I need to fix it because I want to stream ai sponge

serene scaffold Aug 2, 2023, 2:29 PM

#

so you're trying to make a synthetic voice of spongebob. what was the input for that audio? "hahahaha"?

lapis sequoia Aug 2, 2023, 2:29 PM

#

Hi I am spongebob

#

this is the input text😭

serene scaffold Aug 2, 2023, 2:29 PM

#

what was the total duration of your training data?

lapis sequoia Aug 2, 2023, 2:30 PM

#

I am not sure but it's 2000 samples

#

trained for 12 hours on 3090

serene scaffold Aug 2, 2023, 2:30 PM

#

welp

lapis sequoia Aug 2, 2023, 2:30 PM

#

it's cursed

serene scaffold Aug 2, 2023, 2:31 PM

#

you might have to keep training it.

#

but it might also be that you don't have enough data, or that the quality isn't pristine enough

lapis sequoia Aug 2, 2023, 2:31 PM

#

I don't think so it's just giving results like this no matter the training time

#

12 hours is quite long and it should at least be understandable

#

I just want to understand the issue

serene scaffold Aug 2, 2023, 2:32 PM

#

I've heard of people running tacotron for weeks

hasty mountain Aug 2, 2023, 2:33 PM

#

Have you trained it from scratch?

lapis sequoia Aug 2, 2023, 2:33 PM

#

yes

hasty mountain Aug 2, 2023, 2:33 PM

#

Then Stelercus is probably right

lapis sequoia Aug 2, 2023, 2:33 PM

#

I need to train it for longer?

serene scaffold Aug 2, 2023, 2:33 PM

#

> assuming Stelercus could be partially wrong

hasty mountain Aug 2, 2023, 2:33 PM

#

Tacotron 2 was originally trained on... I think...around 40.000 audio samples?

#

And for quite some time... I don't remember the details...been a while since I've read the paper pithink

serene scaffold Aug 2, 2023, 2:34 PM

#

wish they had named it sushitron

hasty mountain Aug 2, 2023, 2:34 PM

#

Why?

serene scaffold Aug 2, 2023, 2:34 PM

#

that was the other name they considered

#

but the taco camp won

hasty mountain Aug 2, 2023, 2:34 PM

#

Oh

lapis sequoia Aug 2, 2023, 2:35 PM

#

i've seen someone training a model using 10 audio samples and for 1 hour and it's understandable

hasty mountain Aug 2, 2023, 2:35 PM

#

Wish my Audio GAN would work without killing my GPU grumpchib

lapis sequoia Aug 2, 2023, 2:35 PM

#

on youtube

hasty mountain Aug 2, 2023, 2:35 PM

#

lapis sequoia i've seen someone training a model using 10 audio samples and for 1 hour and it'...

Fine-tuned the model

#

They probably used a pre-trained model and applied training on their custom data

lapis sequoia Aug 2, 2023, 2:35 PM

#

oh

#

he used google collab

hasty mountain Aug 2, 2023, 2:36 PM

#

I've also used a pre-trained model on 150 audio samples and it worked quite fine after 2~3 hours

lapis sequoia Aug 2, 2023, 2:36 PM

#

omg

#

so how long do you think I need to train it

#

is it possible to train spongebob voice on pre trained model?

hasty mountain Aug 2, 2023, 2:38 PM

#

On pre-trained model, you may need around 2~3 hours...maybe less, since you got a reasonable dataset size

lapis sequoia Aug 2, 2023, 2:38 PM

#

I am using this command ```
python train.py --output_directory=outdir --log_directory=logdir

#

how can I use pre trained model

hasty mountain Aug 2, 2023, 2:39 PM

#

You need to have the model weights already downloaded

lapis sequoia Aug 2, 2023, 2:39 PM

#

where can I get one

hasty mountain Aug 2, 2023, 2:39 PM

#

Maybe Tacotron2's GitHub will have the pretrained weights

lapis sequoia Aug 2, 2023, 2:40 PM

#

can you please tell me what is the prompt to use the pretrained model

hasty mountain Aug 2, 2023, 2:40 PM

#

I can't give more details, though. I always thought using pre-trained models was boring...specially since those tech companies tend to make their models GitHub a bit confusing...

hasty mountain Aug 2, 2023, 2:40 PM

#

lapis sequoia can you please tell me what is the prompt to use the pretrained model

They might have the prompt in the README

lapis sequoia Aug 2, 2023, 2:40 PM

#

ok

hasty mountain Aug 2, 2023, 2:40 PM

#

An easy way to train it may be using Uberduck, too

#

That was the way I used it.

lapis sequoia Aug 2, 2023, 2:41 PM

#

uberduck is very expensive if I need to stream 24/7

#

$120/day

#

anyway thank you for the information

past meteor Aug 2, 2023, 2:42 PM

#

lapis sequoia what is wrong with my tacotron2 training model this is supposed to be spongebob�...

wtf

#

This sounds like the stuff of nightmares

oblique quarry Aug 2, 2023, 2:42 PM

#

Good afternoon, Im reading up on layer normalization. And the concept checks out and makes sense, given that it combats issues like gradient explosion. But what bothers me is that I'd have to constantly take the mean and subtract it from my layer and then divide it by its std so im permanently altering my values in the layer so that they center around zero but dont i run into the danger of having too many zeros and subsequently killing the net?

#

Would appreciate if sb could link me a resource to help me understand the concept better

twilit tundra Aug 2, 2023, 2:49 PM

#

oblique quarry Would appreciate if sb could link me a resource to help me understand the concep...

https://proceedings.neurips.cc/paper_files/paper/2018/file/905056c1ac1dad141560467e0a99e1cf-Paper.pdf probably the most in-depth analysis

simple tapir Aug 2, 2023, 2:53 PM

#

left tartan I dunno, yesterday someone was saying something about not needing to learn anyth...

Well, front end development doesn't really interest me either, but backend development. I studied some html and css though

left tartan Aug 2, 2023, 2:54 PM

#

simple tapir Well, front end development doesn't really interest me either, but backend devel...

Yah, that's all I'm saying... you should still study it a little, not ignore it.

simple tapir Aug 2, 2023, 2:54 PM

#

Right, thanks for the suggestions 🙏

oblique quarry Aug 2, 2023, 3:07 PM

#

twilit tundra https://proceedings.neurips.cc/paper_files/paper/2018/file/905056c1ac1dad1415604...

Appreciate you!

gentle horizon Aug 2, 2023, 3:12 PM

#

hey !

lapis sequoia Aug 2, 2023, 3:16 PM

#

I made a monster

#

cold osprey Aug 2, 2023, 3:18 PM

#

first second or so is passable

lapis sequoia Aug 2, 2023, 3:18 PM

#

is this the way to use pretrained model to train a tacotron2 model : ```
python train.py --output_directory=outdir --log_directory=logdir -c tacotron2_statedict.pt --warm_start

lapis sequoia Aug 2, 2023, 3:34 PM

#

there is improvement guys

#

potent sky Aug 2, 2023, 4:10 PM

#

anyone here attending IAIM'2023?

humble shore Aug 2, 2023, 4:22 PM

#

Any one here had an internship

#

If so what projects do the require

#

Or what good resume

mint palm Aug 2, 2023, 4:36 PM

#

helpppp

#

#

i am using ssh
lsof -i :<port_number>
prints nothing

i tried to run this code before it ran "FINE", but now i get this

98 means port is busy, how to change it

tepid hazel Aug 2, 2023, 6:04 PM

#

twilit tundra What does the labels of your training data look like? My first instinct is that ...

it's a near 50/50 split

#

I believe there are more 0 entries than there are 1

#

im also not sure why model.evalutate gives it a 0.99, I use model.predict on the testing data and most of it is wrong

twilit tundra Aug 2, 2023, 6:08 PM

#

What loss are you using?

mild dirge Aug 2, 2023, 6:09 PM

#

tepid hazel im also not sure why model.evalutate gives it a 0.99, I use model.predict on the...

Did you process both datasets the same?

#

As in you normalized the test data as well f.e.

#

And you didn't accidentally flipped the labels at some point

tepid hazel Aug 2, 2023, 6:11 PM

#

mild dirge And you didn't accidentally flipped the labels at some point

I posted my model code above, (#data-science-and-ml message), as far as I know the preprocessing was identical and when I print off some entries at random with their respective label it seems to be in check

Discord

Discord - A New Way to Chat with Friends & Communities

Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.

mild dirge Aug 2, 2023, 6:11 PM

#

So when does it provide "wrong" results then?

#

Only with .predict but not with .evaluate?

tepid hazel Aug 2, 2023, 6:13 PM

#

Here's the loss and accuracy of my model during training. The last printout is the loss and accuracy returned by model.evalutate

mild dirge Aug 2, 2023, 6:14 PM

#

On the test data right?

tepid hazel Aug 2, 2023, 6:14 PM

#

yes

mild dirge Aug 2, 2023, 6:16 PM

#

What is the shape of your dataset you pass to predict and to evaluate?

tepid hazel Aug 2, 2023, 6:20 PM

#

mild dirge What is the shape of your dataset you pass to predict and to evaluate?

the values on the x axis were just a list of strings and on the y it was a numpy array of integers marking the flags. Both single dimensional

#

if that's what you're asking

#

hm, the model seems to perform well on the test data, however, when it receives data that isn't really positive nor negative like hey! it will always form a bias to negative and return 0.99 as far as I can tell

oblique quarry Aug 2, 2023, 6:32 PM

#

Good evening guys is anyone familiar with batchNormalization?

serene scaffold Aug 2, 2023, 6:37 PM

#

oblique quarry Good evening guys is anyone familiar with batchNormalization?

be sure to ask your actual question right out of the gate

void veldt Aug 2, 2023, 6:40 PM

#

When discussing reduced chi2, is the sum of squared residuals normalized against degrees of freedom or number of fitted data points? Because I've seen both used. Or is it only number of fitted data points if data points >>>> number of adjustable parameters

umbral charm Aug 2, 2023, 6:49 PM

#

die = pd.DataFrame([1, 2, 3, 4, 5, 6])
trial = 10000
sum = [die.sample(2, replace = True).sum().loc[0] for i in range(trial)]
freq = pd.DataFrame(sum)[0].value_counts()
print(freq.sort_index())
Relfreq = freq.sort_index() / trial
Relfreq.plot(kind = 'bar')
plt.show()

is there a way i can make this faster, it takes about 4 seconds to do, but i need to go up to 1 million trials

#

maybe use numba or sometin

left tartan Aug 2, 2023, 6:55 PM

#

maybe ```py
import numpy as np
a = np.array([1, 2, 3, 4, 5, 6])
trial = 10000
s = np.sum(np.random.choice(a, size=(trial, 2), replace=True), axis=1)

#

See https://numpy.org/doc/stable/reference/random/generated/numpy.random.choice.html

umbral charm Aug 2, 2023, 6:56 PM

#

left tartan maybe ```py import numpy as np a = np.array([1, 2, 3, 4, 5, 6]) trial = 10000 s ...

you have no idea how much i want to use numpy

#

but i have to use pandas in this task

oblique quarry Aug 2, 2023, 6:56 PM

#

alright can somebody take a look at it ```py
class BatchNormalization():
def init(self):
"""
Please note that I'm substituting gamma and beta for weight and bias to make this module compa-
tible with the rest of the libary.
"""
self.weight = 1
self.bias = 0

def forward(self, inputs):
    """
    Subtracting from the input its mean before dividing by the standard deviation of the input.
    Finally multiplying it by the self.weight parameter and adding self.bias to it.
    """
    self.inputs = inputs
    self.mean = np.mean(inputs, axis=0)
    self.variance = np.var(inputs, axis=0)
    self.stdDev = np.sqrt(self.variance + 1e-8)
    self.normalizedInputs = (inputs - self.mean) / self.stdDev
    return self.weight * self.normalizedInputs + self.bias

def backward(self, gradient):
    """
    Backpropagation through the layer. We first compute the gradients of the loss with respect to
    the normalized inputs, variance, and mean. Then we apply the chain rule to derive dweight,
    dInput and dbias. As per usual, dbias is just the gradient as its derivative of the sum op-
    eration is one.
    """
    N, D = gradient.shape
    dNormalizedInputs = gradient * self.weight
    dVariance = np.sum(dNormalizedInputs * (self.inputs - self.mean) * -0.5 * (self.variance + 1e-8)**(-1.5), axis=0)
    dMean = np.sum(dNormalizedInputs * -1 / self.stdDev, axis=0) + dVariance * np.mean(-2 * (self.inputs - self.mean), axis=0)
    dInput = dNormalizedInputs / self.stdDev + dVariance * 2 * (self.inputs - self.mean) / N + dMean / N
    self.dweight = np.sum(gradient * self.normalizedInputs, axis=0)
    self.dbias = np.sum(gradient, axis=0)
    return dInput

left tartan Aug 2, 2023, 6:57 PM

#

umbral charm ```py die = pd.DataFrame([1, 2, 3, 4, 5, 6]) trial = 10000 sum = [die.sample(2, ...

well, find a way to avoid iterating over trials.

jovial swift Aug 2, 2023, 6:57 PM

#

Yes

umbral charm Aug 2, 2023, 6:57 PM

#

left tartan well, find a way to avoid iterating over trials.

thats the plan

left tartan Aug 2, 2023, 6:59 PM

#

umbral charm you have no idea how much i want to use numpy

perhaps ```py

die = pd.DataFrame([1, 2, 3, 4, 5, 6])
trial = 10000

samples = die.sample((trial * 2), replace=True).values
samples_2_2 = samples.reshape(trial, 2)
sums = samples_2_2.sum(axis=1)

print(samples)
print(samples_2_2)
print(sums)

#

die.sample(trial*2) gives you 20000 samples, then you reshape it to trial,2, then sum each row

sleek harbor Aug 2, 2023, 7:02 PM

#

Any easy way of sharing a jup notebook, since uploading here isn't allowed?

sleek harbor Aug 2, 2023, 7:09 PM

#

boreal gale i think i had something similiar happened to me before, can't remember how i fix...

https://filetransfer.io/data-package/lPNgCRmp#link

So, not necessarily minimal, but should be reproducible, I think. If u just run the whole notebook, then the last cell will give the error in the first screenshot. If u then open the app by clicking the link, and simply close it again, and rerun the last cell, then you'll get what I expect immediately. When you open the link for the first time, there is no table at the top right. If you then close the link and reopen it, without running any extra cells, the table will suddenly appear (as the variable is now properly initialized and filled, as seen by the last cell of the notebook). I don't understand this behavior.

Ignore the terrible formatting, that's how it's "supposed" to be (haven't worked on it yet). I added some thicc markdown so you could navigate my extremely messy code more easily. Sorry about that.. 😅 I can't seem to figure out why it doesn't work on the first start. I think it might have something to do with order of initial execution of callbacks, but honestly no idea. After you reopen the link, everything works as it should (when you change the sector dropdown, the ticker dropdown options change, and when you change those, the table changes)

Screenshot_2023-08-02-21-48-08-507_com.brave.browser.jpg

Screenshot_2023-08-02-21-48-23-947_com.brave.browser.jpg

Screenshot_2023-08-02-21-48-49-351_com.brave.browser.jpg

FileTransfer.io

Download Data package from August 2nd.

Size of the data package: 4.39 MB. Free transfer of up to 6 GB of photos, videos and documents. Send large files via email or a link to share. No registration, no ads, just simple file sharing!

gentle creek Aug 2, 2023, 7:12 PM

#

is anybody here presently working on object detection?

opal pike Aug 2, 2023, 7:12 PM

#

Yes

hasty spear Aug 2, 2023, 7:12 PM

#

nope but i'd love to hear some about it

molten onyx Aug 2, 2023, 7:21 PM

#

Hello, im new in the machine lerning field. i recently wrote the foundation of a neural network in c++. Currently im stuck at implementing the backpropagation method, and I just wanted to ask you guys if you have a good source where I can learn the math behind it.

edit:
I should note that this a supervised nn

hasty mountain Aug 2, 2023, 7:23 PM

#

The backpropagation is basically the Chain Rule from calculus.

#

There's one or another trick, like having to transpose the weights matrices when doing the chain rule, but in general it's just the chain rule, beginning at the loss function and going backwards until you get to the first layer.

#

Take a look at my references. The code itself is in python, but some references are more generalistic. They might help you:

https://github.com/Martyn0324/NumpyNetwork

molten onyx Aug 2, 2023, 7:26 PM

#

cool, thanks! ill have a look

hasty mountain Aug 2, 2023, 7:26 PM

#

A class about chain rule will also be a must. I don't recommend the ones I've used because they're not in english

plush jungle Aug 2, 2023, 8:25 PM

#

does anyone have any resources that show examples of how RNN hidden states encode patterns?

#

for example, my NLP professor mentioned that RNNs can learn open/close parenthesis and showed how the hidden state could encode that kind of thing, but it was a while ago and I don't remember it as well as I wish I did

gilded kestrel Aug 2, 2023, 8:38 PM

#

is colab memory usage garbage?

twilit tundra Aug 2, 2023, 8:43 PM

#

plush jungle for example, my NLP professor mentioned that RNNs can learn open/close parenthes...

Maybe it was based on this article? https://nlp.stanford.edu/~johnhew/rnns-hierarchy.html

plush jungle Aug 2, 2023, 8:44 PM

#

twilit tundra Maybe it was based on this article? https://nlp.stanford.edu/~johnhew/rnns-hiera...

that looks like exactly what he was talking about, thanks!

robust cliff Aug 2, 2023, 9:59 PM

#

hello cool people I need help choosing what to use for sentiment analysis

#

I am currently in a node enviroment and am using natural, but results are trash no matter how much I preprocess the data

#

I can probably get python to run in there, but what do I use?

#

the data comes from what a user has written in an obsidian note

#

so could be pretty much anything

twilit tundra Aug 2, 2023, 10:24 PM

#

robust cliff I can probably get python to run in there, but what do I use?

Check out huggingface, it has good documentation and a hub to select different pretrained models

robust cliff Aug 2, 2023, 10:25 PM

#

okay many thanks, didn't know about it

twilit tundra Aug 2, 2023, 10:26 PM

#

https://huggingface.co/blog/sentiment-analysis-python#2-how-to-use-pre-trained-sentiment-analysis-models-with-python seems beginner-friendly enough

Getting Started with Sentiment Analysis using Python

fresh harbor Aug 2, 2023, 11:47 PM

#

OpenCV seems to have a really shitty ONNX reader

vestal spruce Aug 3, 2023, 3:57 AM

#

I've asked about this before but I didn't follow up on the discussion, so I would like to ask again. Has there been any attempt to make a speech recognition model capable of distinguishing monologue and dialogue? So far I've found that it's possible and have a basic understanding of what I'm trying to achieve in a step by step which in this case determine how many person speaking in a given audio > figure out their segment in the audio > lastly convert the speech to text accordingly.

#

Would implementing NLP instead be better though?

left tartan Aug 3, 2023, 4:06 AM

#

vestal spruce I've asked about this before but I didn't follow up on the discussion, so I woul...

I don’t know but reminds me of this thing I read a while back from Google: I think this is it: https://cloud.google.com/speech-to-text/docs/multiple-voices

Google Cloud

Detect different speakers in an audio recording | Cloud Speech-to...

vestal spruce Aug 3, 2023, 4:23 AM

#

left tartan I don’t know but reminds me of this thing I read a while back from Google: I thi...

Ah wonderful, I didn't know google provided such package ^^. much obliged.

#

I will try this out

left tartan Aug 3, 2023, 4:25 AM

#

vestal spruce Ah wonderful, I didn't know google provided such package ^^. much obliged.

Curious how it works out for you, let me/us know! I haven’t tried that feature

vestal spruce Aug 3, 2023, 4:39 AM

#

left tartan Curious how it works out for you, let me/us know! I haven’t tried that feature

Oh it's nothing special, I wanted to build this model for audio only podcast so that people with impaired hearing can understand it with given context of monologue and dialogue

#

I don't know the formal term for "deaf" people, but to me this term felt condescending for some reason, so I'll refer it as impaired hearing even though I know it's not accurate (I'm ESL)

twilit tundra Aug 3, 2023, 5:22 AM

#

vestal spruce I will try this out

There is also a pytorch toolkit that is specialized for this kind of task if you want to have a more hands-on approach
https://github.com/pyannote/pyannote-audio

GitHub

GitHub - pyannote/pyannote-audio: Neural building blocks for speake...

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding - GitHub - pyannote/pyannote-audio: Neural build...

hollow citrus Aug 3, 2023, 11:17 AM

#

Hi! I am working on a project for work, where I have to use an LLM and fine-tune it based on user input. I have been asked to provide a general system configuration for a multi-server setup where this system would run for me to tinker with it and test it. I would like to get some suggestions for this. It will probably be a CPU-only server cluster, but GPU-based system recommendations are welcome. The systems would run some flavour of Linux, suggestions are welcome for that as well. I would also like to know how I go about using multiple systems to train a single model at the same time.

desert oar Aug 3, 2023, 11:19 AM

#

hollow citrus Hi! I am working on a project for work, where I have to use an LLM and fine-tune...

are you talking about how to set up the system (users, permissions, etc) or physically what hardware to buy, or both?

civic elm Aug 3, 2023, 11:23 AM

#

Hi any datascience substacks you read once in a while?

civic elm Aug 3, 2023, 11:25 AM

#

molten onyx Hello, im new in the machine lerning field. i recently wrote the foundation of a...

Have you already watched Andrej Kaparthy micrograd?

queen vector Aug 3, 2023, 11:26 AM

#

hi everyone
i was wondering, can we use ai ml in automating API testing?
if yes how ?, i would like to do a small implementation of it

near basin Aug 3, 2023, 11:29 AM

#

Hello, people, I was recommended to ask here, but the question is not specifically related to python.
I am again working with Reinforcement Learning.
This time I use Neural Networks for the Q-Table.
My Agent is playing a game against another independent Agent. The Reward policy in the middle of the game always produces 0, and in the end of the game the Reward is either -1 for losing, or +1 for winning. This reward gets backpropagated over all the states achieved in the game.
But here is my question:
If the Q-Table were just a Lookup Table - Q-Value adjustment would be as easy as going over every accumulated state and performing the adjustment.
However when the Q-Table is a Neural Network: Adjusting Q-Value for one state changes the whole network. In which order should the adjustments be made? Reversed order from the end of the game to the beginning, or from beginning to the end?

desert oar Aug 3, 2023, 11:29 AM

#

queen vector hi everyone i was wondering, can we use ai ml in automating API testing? if yes ...

haven't seen anything like this, but i imagine you could use some kind of AI to perform fuzzing or other kinds of "exploratory" testing

queen vector Aug 3, 2023, 11:33 AM

#

desert oar haven't seen anything like this, but i imagine you could use some kind of AI to ...

okie, thanks for this, will research

hollow citrus Aug 3, 2023, 11:39 AM

#

desert oar are you talking about how to set up the system (users, permissions, etc) or phys...

just the configuration

#

Also, how to use multiple systems to perform the training, if possible

#

It'd just be an ubuntu system or something and I would just run my code on the servers. I just want the config that can handle that

serene scaffold Aug 3, 2023, 12:06 PM

#

(They also asked this in #1136612354914799647)

mild dirge Aug 3, 2023, 12:52 PM

#

near basin Hello, people, I was recommended to ask here, but the question is not specifical...

https://datascience.stackexchange.com/questions/20535/what-is-experience-replay-and-what-are-its-benefits

Data Science Stack Exchange

What is "experience replay" and what are its benefits?

I've been reading Google's DeepMind Atari paper and I'm trying to understand the concept of "experience replay". Experience replay comes up in a lot of other reinforcement learning papers (particul...

left tartan Aug 3, 2023, 1:00 PM

#

mild dirge https://datascience.stackexchange.com/questions/20535/what-is-experience-replay-...

Oh, nice reference. You had me at Atari!

mild dirge Aug 3, 2023, 1:01 PM

#

The idea is that you don't just use the sample at a given time, but take multiple random previous samples and train the model on that

#

That way you can also use the same sample multiple times, and the batches are not as "correlated"

near basin Aug 3, 2023, 1:03 PM

#

This is not relevant in my case, I am asking about adjusting the NN in my mentioned way.

#

Thank you for the reference tho

mild dirge Aug 3, 2023, 1:04 PM

#

I think it is relevant to your question, you ask in what order you feed the data into your model for training right? @near basin

near basin Aug 3, 2023, 1:07 PM

#

More on the context (should probably mentioned it before):
I make a simple RL for playing Tic-Tac-Toe. The "experience replay" sounds promising, except it does not fit the use case here, because the Agent has "no" experience on a made step, and makes this "experience" only when the game is finished

#

So taking random batches won't really help

mild dirge Aug 3, 2023, 1:08 PM

#

What algorithm are you using to update the model, is it online or offline?

near basin Aug 3, 2023, 1:08 PM

#

I am not very advanced in this, what is the difference between these two?

mild dirge Aug 3, 2023, 1:08 PM

#

There are online methods that don't require future states to update the model

#

Such as deep-Q learning (probably most popular method)

near basin Aug 3, 2023, 1:10 PM

#

mild dirge What algorithm are you using to update the model, is it online or offline?

It is online

mild dirge Aug 3, 2023, 1:10 PM

#

So the experience replay buffer would be perfectly viable

near basin Aug 3, 2023, 1:12 PM

#

mild dirge Such as deep-Q learning (probably most popular method)

This is what I am doing, except earlier I only had models, where after each step the model is updated

mild dirge Aug 3, 2023, 1:12 PM

#

With deep-Q learning you can also update the model after each step

#

but not necessarily with the trajectory from that step

#

But with random trajectories from the previous 1000 or so steps

near basin Aug 3, 2023, 1:13 PM

#

I am reading https://towardsdatascience.com/deep-q-learning-tutorial-mindqn-2a4c855abffc right now, based on your recommendations

Medium

Deep Q-Learning Tutorial: minDQN

A Practical Guide to Deep Q-Networks

#

I will report later if I have any questions

mild dirge Aug 3, 2023, 1:13 PM

#

Sure

#

https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html

#

This was a tutorial I used when I did a RL project

#

But it is with pytorch, not tensorflow

near basin Aug 3, 2023, 1:16 PM

#

I do not care about implementation, only theory. Because I do not even make this project in python

mild dirge Aug 3, 2023, 1:17 PM

#

It helped me because code is a very exact way to describe how an algorithm works. So even if you don't care about the implementation, it may be good to look at still 🙂

near basin Aug 3, 2023, 1:22 PM

#

Hhmmm, my biggest mistake for now was, that I was using only one network, instead of Main+Target pair

mild dirge Aug 3, 2023, 1:23 PM

#

It helps with stability, but iirc I updated the target model after every 10 steps, and it still worked, so it is not 100% necessary for simpler problems

near basin Aug 3, 2023, 1:31 PM

#

Wdyt, if I use main+target, then using the target network I will play one round, then update the main network using the saved states and then put the weights to the target network. Sounds like a plan, doesn't it? But will this approach work?

#

Then, I don't have to care about in which direction the backpropagation is going

mild dirge Aug 3, 2023, 1:36 PM

#

In that case both networks would be the same

#

As you replace the target network with the policy network after updating the policy network every round

#

Typically you use the policy network to make decisions (but also add some random decisions to explore the state space)

#

And the target network is just used to calculate the temporal difference target

#

So this value is predicted with the target network

near basin Aug 3, 2023, 1:39 PM

#

Yeah

mild dirge Aug 3, 2023, 1:39 PM

#

In this formula

near basin Aug 3, 2023, 1:39 PM

#

mild dirge So this value is predicted with the target network

Was talking about this

mild dirge Aug 3, 2023, 1:39 PM

#

And the Q(S_t, A_t) is the policy network

near basin Aug 3, 2023, 1:41 PM

#

Alright thanks! Is there any '/thank' command for helping points like in [World of Coding] server?

oblique quarry Aug 3, 2023, 2:20 PM

#

Good afternoon I've been trying to get my convolutional Layer to run but it is performing poorly compared to the mlp implementation. This is my test set up i know the learningrate decay is kinda aggressive but it yields the best result

#

from framework.nn.Dense import DenseLayer, FlattenLayer
from framework.nn.ActivationFunction import ReLU, Softmax
from framework.nn.Loss import CategoricalCrossEntropyLoss
from framework.nn.Metrics import Metrics
from framework.nn.Optimizer import Adam
from framework.visualProcessing.convolution import Convolution
from framework.nn.Utils import sparseToOneHotEncoded, visualize, shuffle
from sklearn import datasets
import matplotlib.pyplot as plt

digits = datasets.load_digits()
bilder = digits.images
tar = digits.target
bilder, tar = shuffle(bilder, tar)
bilder, tar = bilder, tar
tar = sparseToOneHotEncoded(tar, 10)
batchSize = 64
conv = Convolution((batchSize, 8, 8))
f = FlattenLayer()
l1, l2, l3 = DenseLayer(64, 128), DenseLayer(128, 128), DenseLayer(128, 10)
relu1, relu2, softmax = ReLU(), ReLU(), Softmax()
loss, optim = CategoricalCrossEntropyLoss(), Adam(lernrate=5e-3, lernRateDecay=1e-2)
acc, l, lr = [], [], []
from tqdm import tqdm
for i in tqdm(range(1000)):
    for step in range(len(bilder) // batchSize):
        batchX = bilder[step * batchSize:(step + 1) * batchSize]
        batchY = tar[step * batchSize:(step + 1) * batchSize]
        #convOutput = f.forward(conv.forward(batchX))
        l1Output = relu1.forward(l1.forward(batchX.reshape(batchSize, -1)))
        l2Output = relu2.forward(l2.forward(l1Output))
        l3Output = softmax.forward(l3.forward(l2Output))
        if i % 10 == 0:
            acc.append(Metrics.accuracyClassifier(l3Output, batchY))
            l.append(loss.calculate(l3Output, batchY))
            lr.append(optim.getLearningRate)
        l3grad = l3.backward(softmax.backward(loss.backward(l3Output, batchY)))
        l2grad = l2.backward(relu2.backward(l3grad))
        l1grad = l1.backward(relu1.backward(l2grad))
        #conv.backward(f.backward(l1grad))
        optim.learningRateDecay()
        optim.step(l3)
        optim.step(l2)
        optim.step(l1)
        #optim.step(conv)
visualize(acc, l, lr, optim)```

#

This is the run with the conv layer

#

this is without so it pretty obvious that the net has memorized the data set(i believe 2k images)

#

but this only proves that my backward pass in the conv layer is messed up as i just added the conv layer ontop of the other net

#

here's the conv net https://paste.pythondiscord.com/FYVA

#

would appreciate if sb could check if i implemented the backward pass correctly(cuz there must be an logical error which is messing up my net)

lapis sequoia Aug 3, 2023, 2:37 PM

#

near basin Alright thanks! Is there any '/thank' command for helping points like in [World ...

I guess Tictactoe is not a good exemple for you to learn since it's a solved game and you would only need minmax algorithm. It makes sense to try the RL methods with things having uncertainty. I would recommend this book: Reinforcement Learning: An Introduction by Richard S. Sutton. This is what i used to learn.

#

I am using universal sentence encoder tensorflow, How can I speed it up, its currently only using CPU not GPU for some reason

serene scaffold Aug 3, 2023, 3:00 PM

#

lapis sequoia I am using universal sentence encoder tensorflow, How can I speed it up, its cur...

the only thing that will make any noticable difference is getting it onto the GPU, but I only know how to do that in pytorch.

lapis sequoia Aug 3, 2023, 3:01 PM

#

serene scaffold the only thing that will make any noticable difference is getting it onto the GP...

I have a rtx 2060 super

#

It says that the current version is more cpu bound something like that, one sec let me show you

#

2023-08-03 14:51:19.521843: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.

near basin Aug 3, 2023, 3:03 PM

#

lapis sequoia I guess Tictactoe is not a good exemple for you to learn since it's a solved gam...

I am not actually learning, just having fun

#

but thank you

serene scaffold Aug 3, 2023, 3:04 PM

#

lapis sequoia ``` 2023-08-03 14:51:19.521843: I tensorflow/core/platform/cpu_feature_guard.cc:...

My point is that the question isn't "how can I speed it up?" because we already know the answer. The real question is "how do I move this computation to the GPU?". And I'm not sure how to help you with that.

Even if that TensorFlow binary is optimized to use the available CPU instructions, the available CPU instructions won't be enough for what you're trying to do.

lapis sequoia Aug 3, 2023, 3:04 PM

#

serene scaffold My point is that the question isn't "how can I speed it up?" because we already ...

oka

lapis sequoia Aug 3, 2023, 3:05 PM

#

serene scaffold My point is that the question isn't "how can I speed it up?" because we already ...

can you help me with another problem please 👀

serene scaffold Aug 3, 2023, 3:06 PM

#

lapis sequoia can you help me with another problem please 👀

I have to know what the problem is before I can decide that.

lapis sequoia Aug 3, 2023, 3:08 PM

#

serene scaffold I have to know what the problem is before I can decide that.

recommenders = {}


@shared_task
def process_question(question, instance_id):
    if instance_id in recommenders:
        print("Using existing recommender")
        answer, recommender = use_main(
            question, instance_id, recommender=recommenders[instance_id]
        )
    else:
        print("Creating new recommender")
        answer, recommender = use_main(question, instance_id)
    recommenders[instance_id] = recommender
    return {"answer": answer}

I am using django and it this api route, How can I make a shared object of recommenders dictionary between all the workers, the recommenders dict has key value pairs of the instances of this class:

class SemanticSearch:
    def __init__(self):
        self.use = hub.load("./Universal Sentence Encoder/")
        self.fitted = False

    def fit(self, data, batch=1000, n_neighbors=5):
        self.data = data
        self.embeddings = self.get_text_embedding(data, batch=batch)
        n_neighbors = min(n_neighbors, len(self.embeddings))
        self.nn = NearestNeighbors(n_neighbors=n_neighbors)
        self.nn.fit(self.embeddings)
        self.fitted = True

    def __call__(self, text, return_data=True):
        inp_emb = self.use([text])
        neighbors = self.nn.kneighbors(inp_emb, return_distance=False)[0]

        if return_data:
            return [self.data[i] for i in neighbors]
        else:
            return neighbors

    def get_text_embedding(self, texts, batch=1000):
        print("Generating embeddings...")
        embeddings = []
        for i in range(0, len(texts), batch):
            text_batch = texts[i : (i + batch)]
            emb_batch = self.use(text_batch)
            embeddings.append(emb_batch)
        embeddings = np.vstack(embeddings)
        return embeddings

serene scaffold Aug 3, 2023, 3:10 PM

#

lapis sequoia ```python recommenders = {} @shared_task def process_question(question, instan...

I don't know enough about django to give you an informed answer; try asking in #web-development

lapis sequoia Aug 3, 2023, 3:11 PM

#

serene scaffold I don't know enough about django to give you an informed answer; try asking in <...

it more of a python thing ig than django but okay, thanks

haughty nest Aug 3, 2023, 3:53 PM

#

I am using GridSearchCV and for some reason it thinks an accuracy score of 96.47 is better than 96.57???

#

Can someone explain

winter sedge Aug 3, 2023, 3:55 PM

#

Not sure this is the right place to ask, if not please point me in the right direction. So I have a list with 6911 values, it looks like the attached image. I want to make a new list every time the value drops by x amounts so I can do a regression analysis and calculate the slope on each list. Where do I start? What do I need to learn to do something like this?

haughty nest Aug 3, 2023, 3:57 PM

#

when I do 'C': [1, 0.2236, 0.1] it gives me that 0.1 is the best with 96.47
when I do 'C': [1, 0.2236] it will tell me that 0.2236 is the best with 96.57.
I tried it multiple times and it gives me the same result

left tartan Aug 3, 2023, 4:03 PM

#

winter sedge Not sure this is the right place to ask, if not please point me in the right dir...

What's the source data, a dataframe? You need to define the conditions on which you want to partition the data. ie: "(current value - last value) / last_value < -10%". Once you can define the formula, then you can label the dataframe, and compute a regression on each label.

winter sedge Aug 3, 2023, 4:06 PM

#

Currently just a list, but I should absolutely import it to a dataframe to speed up the process.

slim bone Aug 3, 2023, 4:18 PM

#

So if I'm using:

MSE as my loss function l
Sigmoid as my activation function o
Some input layer a and an output layer p (and nothing else)
in order to find the partial derivative of w_alpha (some random weight) would it be right to do:
(l(p,t))' = 1/m * Sigma(1,m)[(o(w1a1 + w2a2 + ... + w_n*a_n) - t)^2)]'?
Trying to understand how to reach the gradient but I can't understand it for the life of me.

twilit tundra Aug 3, 2023, 4:19 PM

#

haughty nest when I do `'C': [1, 0.2236, 0.1]` it gives me that `0.1` is the best with 96.47...

Does it always give the same accuracy? Ie is your model using a set random seed?

haughty nest Aug 3, 2023, 4:20 PM

#

twilit tundra Does it always give the same accuracy? Ie is your model using a set random seed?

ye 42

#

always same accuracy

#

gridsearch just thinks 96.47 is better than .57

twilit tundra Aug 3, 2023, 4:22 PM

#

What if you change the order

#

1,0.1, 0.2236

haughty nest Aug 3, 2023, 4:23 PM

#

twilit tundra What if you change the order

same result, 96.47 is better

twilit tundra Aug 3, 2023, 4:25 PM

#

Maybe try putting verbose =3 so you have the full logs

#

Or 4

haughty nest Aug 3, 2023, 4:26 PM

#

ok

#

[LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear]

#

@twilit tundra it shows this

twilit tundra Aug 3, 2023, 4:37 PM

#

haughty nest [LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][LibLinear][Li...

That's very weird, it should output something like this:

haughty nest Aug 3, 2023, 4:37 PM

#

i tried both verbose = 3 and 4

twilit tundra Aug 3, 2023, 4:41 PM

#

Oh you put it in the definition of your model

#

I meant in the gridsearch

haughty nest Aug 3, 2023, 4:45 PM

#

[CV 1/5] END C=1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.946 total time= 0.0s
[CV 2/5] END C=1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s
[CV 3/5] END C=1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.949 total time= 0.0s
[CV 4/5] END C=1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.949 total time= 0.0s
[CV 5/5] END C=1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s
[CV 1/5] END C=0.1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s
[CV 2/5] END C=0.1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s
[CV 3/5] END C=0.1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.949 total time= 0.0s
[CV 4/5] END C=0.1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.949 total time= 0.0s
[CV 5/5] END C=0.1, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s
[CV 1/5] END C=0.2336, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s
[CV 2/5] END C=0.2336, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s
[CV 3/5] END C=0.2336, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.949 total time= 0.0s
[CV 4/5] END C=0.2336, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.949 total time= 0.0s
[CV 5/5] END C=0.2336, class_weight=None, multi_class=ovr, penalty=l1, solver=liblinear, tol=0.0001;, score=0.947 total time= 0.0s

#

hmm

twilit tundra Aug 3, 2023, 4:49 PM

#

Did your 96.47 come from evaluating on a set split? Then it makes sense that cv would have a different order

haughty nest Aug 3, 2023, 4:50 PM

#

twilit tundra Did your 96.47 come from evaluating on a set split? Then it makes sense that cv ...

yeah

#

i think

#

ye it is

#

20 80 split

#

so should I still take C=1 as the best parameter

#

or C=0.1

twilit tundra Aug 3, 2023, 4:53 PM

#

According to the cv, it would be 0.1

haughty nest Aug 3, 2023, 4:53 PM

#

ok but the accuracy score is lower when I call.score

twilit tundra Aug 3, 2023, 4:54 PM

#

On the 20/80 split?

haughty nest Aug 3, 2023, 4:54 PM

#

ye