Pastebin
Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.
1 messages · Page 103 of 1
Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.
can anyone tell me how much time it will take if i have to complete this project?
I might have missed it along the way, but the 'error', the distance between X(ᶦ) and the regression, does it have it's own letter?(ᶦ)
In scikit-learn, linear regression models are typically trained using the least squares method by default. When you train a LinearRegression model in scikit-learn, it automatically fits the model using the least squares method to minimize the residual sum of squares (RSS). Afterwards, you can calculate and check this error metric: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.mean_absolute_error.html
Examples using sklearn.metrics.mean_absolute_error: Lagged features for time series forecasting Poisson regression and non-normal loss Quantile regression Tweedie regression on insurance claims
Sorry, forgot to respond with my answer attached, just check my upper message
wow dude
I learned about MAE, but there is not a letter designated to it I guess? What I mean is, when a formula has a SUM function, instead of typing SUM, you can type ∑
The letter for the error is epsilon
So if the quoefficient has letter A and the intercept has letter B
They would have epsilon(A) or epsilon(B)
And as for MAE, it would be epsilon(Y)
ahhh yes, I knew I saw something somewhere haha thank you
Np
The NIST handbook is a great resource
yo please look into this would really appretiate your help #1208723301967470673
In my job ,
I have worked on data asset management system with java, mongodb and open source framework: Nuxeo .
I have also worked on event streaming plattform using kafka, mongodb, java
how to write this in resume so that it looks better. Preferably in one line
I want to make a database for training an AI model to predict the value of a house
Im thinking of doing the following
Year the house was built ---date
Age of the house --integer
safety scores -- double
Total square footage of the house --Double
Number of bedrooms --integer
Number of bathrooms --integer
Lot size --double
Number of floors --integer
Presence of a garage (yes/no) and its size(double) ---unsure
Presence of a basement or attic(yes/no)
Presence of amenities(Text) as theyre could be multilpe ammenities like a fireplace, swimming pool
Based on the tutorials I've done so far, it seems that features (columns) with format other than numeric should be avoided. My knowledge is limited so this might not be true at all. But in this case you could replace the garage to 0 for no garage, 1 for small, 2 for large (this way you combine the boolean with the double), the attic with 1 and 0, and the amenities with numbers. The amenities might cause issues because the model could interpret the number as a weight, for example 1 is fireplace is more valuable than 5 is swimming pool because 1 is earlier than 5, or 5 is more valuable because it's a larger number. I had a similar situation in which I wanted to use simple numbers as my unique identifier, and someone explained that I need to convert it into vectors so the model doesn't get mixed up.
Actually for the garage it can be either a 0 or anything larger than 0
in general models literally only work with numbers, so yes you have to encode text and potentially other categorical features
it can either be Ordinal (one column with numerical values for sequential categories like no = 0, small = 1, large = 2 for the garage) or One-Hot (one column for each category, when they are not necessarily downwards compatible with each other, e.g. a boolean column for has fireplace True/False, a boolean column for has swimming pool True/False)
thank you
garage size could be just one double though, with no feature for "has garage", no = 0, yes = >0
also we call them "floats" in python rather than doubles
I wonder what will happen when text is fed as a feature into the model, will that simply not run the code, or worse, it runs but skewes the results?
usually it should just error
ok great, I rather have that then results that I can't trust
the models literally can only operate on numbers, if you had some very weird setup it might be possible for something to automatically cast them as numbers by using something like ord(), but that is not the sort of thing that just happens automatically
@stable grove are you by any chance doing the Coursera course? Because Andrew indeed introduced us to the housing example 😉
No, its a project idea I came up with.
could i convert all my text fields to ASCII values?
with a delimeter?
69,100,101,110,118,97,108,101 --Edenvale
technically it is possible but it is a pretty bad way of converting text to numbers your model can use - you want for the data you give to be as meaningful as possible, and ascii is pretty arbitrary
I would strongly recommend just using one-hot encoding first to get used to it before trying anything fancier
btw, dates also have to be encoded
technically you could use a unix timestamp, or just separate year/month/day into their own numerical columns (or do more operations like iso calendar week, summer/winter/fall/spring etc.)
but for this case, you might as well not include it whatsoever considering it would be 'overlapping' with age
thank you for the help
Hello, I'm wondering in what cases is it needed to use the following techniques on my RandomForestClassifier script: MinMaxScaler, CalibratedClassifierCV and GridSearchCV (without it slowing too much the load time of the script).
My first guess is that we don't need MinMaxScaler; we just need to use GridSearchCV once for each dataset and use CalibratedClassifierCV to check where the log loss is lower (either on not calibrated, sigmoid or isotonic)
Also if you have any more suggestions to improve the results of my predict_proba output, I'll be really thankful
will 20240218 work as an index, for example a transactional datestamp?
by the way, my math is rusty as hell super embarassing, but what does this sign means? (tried to find it in the Greek alphabet, wasn't there?)
it could work, specially for a tree based model, but it would be pretty weird for a regression task given the way it distort the distances between things like 20231230, 20231231 and 20240000
like... you might as well just use a timestamp if you want to turn it into a single number
yeah oke, I guess Time Series is a bit more on the advanced side of things, will come to that eventually 😄
in which context?
This one
this usually denotes either partial derivative or the boundary of a set
yeah, there it's a partial derivative
Ah alright, does this symbol have a name?
i'm actually not sure, looks like some sort of calligraphic d
in most places you'll find it as \partial
It must be a very widely used example, we did something similar in Matt Harrison's Statistics course
.latex $\partial$ $\del$ $\delta$ $\nabla$
!charinfo ∂
\u2202 : PARTIAL DIFFERENTIAL - ∂
^
oof
Andrew's are the housing prices in California, where is Matt's? 😄
Ok I made a note on that partial derivative/ partial differential, thank you.
I'll go back to another one of Andrew's courses I'm watching now lol
edit; 2.5/10 wouldn't recommend that mini course
ping me if you have a follow up question on what I said earlier
I absolutely love his course and the way he teaches, by drawing, which works really well for me. The last time I had math, algebra, calculus was ages ago, always loved it, but didn't keep it up since there wasn't anything that required me to do so. Now starting with ML, I'm slowly getting that feeling back of joy seeing and understanding formulas. I'm sure that at some point I'll want to bang my head into the wall, perhaps when I arrive at matrices. Seems to be a rough part based on some Reddit posts I saw.
Using the Ames, Iowa Housing Data https://jse.amstat.org/v19n3/decock.pdf
Oh cool. Well let us know if it all worked out with the database and properly feeding it into the model.
I'm gonna try this thing https://www.prefect.io/opensource
Works extremely well locally
But it says it can automate the infra, which would be really really good rn
You read my mind, I was looking at platforms to make ML pipelines instead of just the experiment.
It's really good, worrying about the infra has been slowing me down a lot, I think this lets you get a pipeline going, test it locally and then push it to a serverless compute worker
I already have that, but coded manually by me, which is cool but also a lot to manage for one person
What do y'all think of this roadmap?
https://roadmap.sh/ai-data-scientist
Looks nice, I'd try to do some coding in parallel as you learn the math
A lot of math can be learned much better with the help of python, where you can just do your own experiments
Probability in particular can get super abstract if you go into the details of prob spaces and whatnot, code can keep you grounded, which is important when self studying
I'm in an odd place because I have over a decade of programming experience, in particular game dev, so I have exposure to linear algebra, I wonder if there's a topic or project that can fast forward my "onboarding" into data/ai
Linear Algebra and calculus already put you in a good spot for ML imo, there's tools in stats that are important but if you know the others you can quickly pickup stats as you go
Or if there's any value to learning through training LLMs
I just got a job in data, starting out more on the "analyses" side and getting introduced to more stats and what not
Training LLM will get you in direct contact with the thing and it will make you make all sort of mistakes
But when it comes to AI, I have a lot of unknown unknowns and how they related to my current skillset
Which is good experience, but also probly the hard route
If you know math I'd go directly for doing a project that interests you and use it to learn
Ok, I view training LLMs as a top-down approach to learning AI, my other gut instinct is that learning neural networks will be a bottom-up approach to learning AI. Do you think this is accurate?
Does it make sense what I'm saying?
I think it's good to first have an understanding of the basics, like the gradient descent algorithm, mini batching, motivation for why NNs work in the first place
I have a book
Like a lot of it after the theory will be a sort of dark art where you kinda just experiment til you get it right
And the way you do it, the tools you use, etc, it's all important stuff you only learn by making mistakes
Since my previous background was in games and graphics, I'm way more inclined to learn from this book, but only the latter chapters are dedicated to the topic, but maybe it's a good enough intro
I also like that it would bridge concepts for me
Looks really cool
I also come from a sort of graphics-y background
I have one project in mind, but because I have a lot of unknown unknowns, I have no idea how feasible by just me
I used a lot of computer graphics concepts to do physics simulations
Oh dude, you're my new best friend
Hehe, I just had an interview where the first questions were all computer graphics
Nice, which graphic libs are you familiar with?
For MLE
Oh, none, I've implemented stuff from scratch and came up with new algos
Ig I've used SDL once to do a ray tracer engine
Nice, SDL is nice to use
So here's my first project idea; I like watching basketball, but I don't have time to sit through a full televised game. It would be great if I could get AI to take a game recording and cut out all the boring parts like free throws and players walking the ball up the court
Like usually the data I got back was not a visual output per say, it was physics data of energy deposition, prob distributions and whatnot
So, I reckon you're gonna be using a pret trained model for that, unless you got a lot of GPU with you and a lot of data
No wait, you can build a classifier right
And apply it to every frame
Could I just set something up on together.ai?
Is basketball, or is not basketball
They seem to be gen AI models
Ok, what do I need? An M3 macbook?
But I'd recommend getting an Encoder CNN with two output classes
Will the new laptop core ultra-9 work?
You need GPU, and I'd avoid ARM
How many gigs of VRAM?
Idk how the ARM support is rn, but last time it wasn't very good, might've changed
I have a 3070 that's not doing anything
But realistically, how long of a project is this if done on nights and weekends?
The more the better, I've been able to train 100M param language models on 16Gb, but it's frustrating and text data is more forgiving
I have no idea tbh, the thing that usually decides success is data quality
And compute power
I thought sentiment analysis was gonna be easy, been like a month and ton of infra
I'd have to really research into how to go about doing this. Somehow I have to train it to identify what "boring" moments are
Using audio and video
I'd go for image data
Like, videos are just images
And multimodal modals are very likely harder to train
Right
I assume that you want to cut out ads and such
Exactly
Then image should workout fairly well
a 2 and a half hour recording should turn into a 40-45 min thing
Like, if you require temporal understanding, it's harder
What I mean is like, if you want to detect the times when someone scores
It might be harder than just detect the ad bits
You might want to consider starting with a simpler classification problem and build up
I also highly recommend using this for pipelining https://www.prefect.io/opensource
Even locally works wonders
Ok, that's a good suggestion
Description will go into a meta tag in
And also this, it's indispensable
First thing you gotta do is find a dataset and evaluate it. The larger the better. You then split it into three:
Train dataset is the one that the network learns from, the network will perform forward pass, backwards pass and then gradient optimization
The test dataset contains data that the network has not seen during training
This is crucial because neural networks are functions with high capacity for memorization. So you want your network to perform generalization, or perhaps even better, you can call it compression.
To test that your network is not memorizing that dataset you test it against another dataset it had no chance of memorizing from
If you accidentally use data from test to train, that's called a data leakeage
The validation dataset is a bit more subtle, but it's the same as the test data set
When you select a model, you have all these "hyper parameters", which decide the shape of the model, the shape of the batches, and a bunch other stuff
So like, in order to see which hyper parameters work best, you do a sort of search, you train the model on different combinations of the hyper parameters
And this is the subtle part
If you use your test data set to choose the best hyper parameters
That still constitutes a data leakeage, because in a way, information is still flowing from the test dataset, but through you
So you get this validation dataset to choose your hyper parameters. Then you train the thing for longer periods of time, and then you test it against the test dataset
The test dataset is then used to measure how well the model will perform in the real world
Interesting
There's a ton of stuff, but this is the gist of it. The challange, at least for me, has been the GPU shortage and finding the hyper parameters that dont overfit my model
In terms of intuition for why models work at all
The reason is that they are universal function approximators. Same way you can build up any function with tailor series, or any image with Fourier components.
Everything in the world can be described with functions
And models are like these machines that you can tune to approximate any function. So they can do all these amazing things.
Nice
I got my work cut out, I guess I'll start going through that book I showed you tomorrow and see how far that gets me
can you drop some screenshots when you have it running, curious how clean and clear it is
still work in progress, but this is how my code is turning out, I'll run it in a sec
@flow
def main() -> None:
model = load_model()
loss_function = nn.CrossEntropyLoss()
mlflow.set_tracking_uri("sqlite:///mlruns.db")
mlflow.set_experiment("asa")
eval_interval = 1
save_interval = 1
for epoch_step in range(100):
cleanup_memory()
for training_loop_step, model, loss in training_loop(model, loss_function, accumulation_steps=1):
# observability into the training loop:
mlflow.log_metric("training/loss", loss.item(), step=training_loop_step, synchronous=False)
if training_loop_step % eval_interval == 0:
metrics = perform_evaluation(model, loss_function, total_count=5)
log_metrics(metrics, training_loop_step)
if training_loop_step % save_interval == 0:
mlflow.pytorch.log_model(model, "models")
if __name__ == "__main__":
main.serve("train")
with the serve it runs a sort of worker that awaits for me to trigger, now I run the UI
it appears right here
I still haven't benchmark performance, but haven't noticed any sgnificant differnece
around 55% max it seems
not bad, I've also been looking at KNIME, which is a low-code pipeline platform, also open source and local, but the nodes can be fully SQL and Python
no like, the max is 15gb
@final kiln this is how KNIME looks like
also looks good, does it deploy to spot ?
oh it watches the layers themselves ?
if you want to model the pipeline like that, sure
you can also just flunk everything in sequence like a notebook
To quickly build intelligent data-driven workflows, organizations need business analysts to work with data scientists and development teams to unlock useful insights from unstructured or semi-structured data. Learn how KNIME’s end-to-end data science product portfolio helps bridge the gap between the ideation and productionalization steps of dat...
@task
def forward_pass(model, loss_function, rating_b, text_bw) -> Tensor:
""" Forward pass through the model. """
predicted_logits_b5 = model(text_bw.int())
loss = loss_function(predicted_logits_b5, rating_b)
return predicted_logits_b5, loss
@task
def backward_pass(loss_train, multiplier: float = 1.) -> None:
""" Backward pass through the model. """
(loss_train * multiplier).backward()
@task(log_prints=False)
def gradient_descent(optimizer: torch.optim.Optimizer) -> None:
""" Perform a gradient descent step. """
optimizer.step()
optimizer.zero_grad()
im a bit wary of having these in their own task
seems it can
but code looks better for some reason
I think Prefect is a smaller install? KNIME is almost 1Gb
I think it doesn't get much attention because it markets itself as low-code solution, even though you can put full code into every node
visual programming languages are still code imo, labview can get pretty complicated, I think visual is better for pipelines
yep I agree, it makes learning ML much easier if I can see how it fits into the whole story. I understand that most courses focus on experiments, simply to avoid you getting lost, luckily I work in data so I know how to be lost properly 🤣
hi guys, what is the expectation from a applicat applying for a a junior NLP developer?
Eh, it's easy to get lost yeah
Uhm, what do you mean by NLP developer
naure language processing dveloper
for gold text processing and prediction
so what is the base line for a data scientist junior role or deep learning engineer role?
I got the take home assignment for code filtering a large text talk aobut gold, and they asked you need calculate the probabilities about the imcoming text, but i realize i don't know shit about how to code a linear regression, so what is the expecation for a junior role?
def training_loop(model: SentimentAnalysisModel, loss_function: callable, accumulation_steps = 10):
optimizer = create_optimizer(model)
training_data = load_dataset_from_npz("data/asa/test.npz")
for step, rating_batch_b, text_batch_bw in yield_batches(training_data[0], training_data[1], 15):
model.train()
_, loss = forward_pass(model, loss_function, rating_batch_b, text_batch_bw)
backward_pass(loss, multiplier=1/accumulation_steps)
if step % accumulation_steps == 0:
gradient_descent(optimizer)
yield step, model, loss
def evaluation_loop(model: SentimentAnalysisModel, loss_function: callable) -> Iterator[int, float, float]:
model.eval()
test_data = load_dataset_from_npz("data/asa/test.npz")
with torch.no_grad():
for step, rating_batch_b, text_batch_bw in yield_batches(test_data[0], test_data[1], 15):
predicted_logits_b5, loss = forward_pass(model, loss_function, rating_batch_b, text_batch_bw)
accuracy = calculate_accuracy(predicted_logits_b5, rating_batch_b)
yield step, loss, accuracy
I think I got myself a nice little pattern
my understanding
is that those roles don't exist, there are no entry level positions for MLE and data scientists
Or at least, they are not positions that lend themselves very well to being filled by entry level people
@final kiln Prefect is orchestration in general right, not specifically for ML? I'm turning the website inside out and don't see anything about experiment tracking or how the model behaves, etc?
It's general yeah, it takes care of logging and observability
you code just did a baseline copy from any website that do a trainning, and its only for machine learning model i think, so i have n idea where it is impressive.
And my hope is that it helps with the infra part, they mention it a lot in one section
Yeah I guess we are all implementing the same Algo isn't it
But the part that I like is the yield statement
Which neatly seperates my observability from the ML stuff
okay, move away
sorry ?
hey guys, anyone know what models to pick for object detection live camera feed on my mac from hugging face and how to set it up locally? im a newbie with setting up models
i know and download yolo models but dont know how to test it on the front mac camera
Check media pipe, they have good pre trained stuff
first you need to make sure you read the data correctly from your camera, check openCV topcis
then feed the live data to your models
for supervised leanring: you first prepare a large image data set from your camera, and lable them when the object is in the frame
for unsupervides learning: i have no idea
I think dropout has done the trick, but can only be sure after more 3 or 4h, this gonna be one of my last runs before revamping all the infra
Batch size/LR combination give a lot of instability but I don't really see a lot of trouble in the eval so I'm just gonna let it be
So what is the baseline expectation for a junior data scientist role? such for mechien learning and deep learning? what is the project make you outstand from this position? I know there are no junior role for it, but there are less expeirenced data scientist know basic stuffs
what you need to do and know?
Like I said, these roles are usually not entry level, I think people usually start with data analyst or junior software
But knowing what a regression is oughta be a good start
I know there are no junior role for it, but there are experience limit roles
I've seen internships for ML Research scientist, but theyre usually for PhDs and stuff like that
In terms of projects, theres a lot you can do to stand out
Just pick something that you are passionate about and make it impressive
Tbh, I have this hunch. I'm gonna pre train the embedder module and pre compute the positional encoding. I recall during early testing that this was the part that would cause the model to overfit more rapidly
There's also the possibility of training it on next token prediction and use the in between signals to train a classifier.
An advantage of that approach is that I can reuse the model for the other tasks I have to do to replicate the MetaFormer study
I lack of structured learning and i don't the water level
Personally I find that learning outside a structured environment is way more fun
But it very much depends on each person's learning style
There's a ton of resources online, I like to recommend Khan academy for building up the math knowledge
3blue1brown has a great primer on ML too
just for a understanding, i understand all machine learning concerpts, from linear to K-nearest to VSM, to linear algebra to CNN, RNN and TTS model design, as well spectrogram transforming (audio processing for TTS), is this deep ?
what is the water level of this
i think the only thing i don't understand is what the water level is a machine learning and deep learning engineer
what required for the job
How come you understand linear algebra and CNNs but you don't know how to code a linear regression ?
because i skipped the machine learning project, i learned all in 2 months, ML by nature is worse than deep learning
but i undertand now he many company maybe want a ML project as even worse
People usually learn regression very early in college, which is why I thought it was weird
Ig you can bunch it with ML, but first time most people hear of it it's just an optimization in stats class or wtv
i understand regression just like all other concerpts, but i don't understand how to code them
....
"What I can't create I don't understand"
that's ture
i have hands on experience in hot encoding to CNN stack, very limited, but i don't understand how to code a liner regression
LOL
Idk, you can just apply gradient descent to any linear regression problem
Tho there's a ton other optimization techniques
My fav is the monte Carlo one + local refinement with gradient estimation
There's ones with Jacobian and whatnot
It's a whole thing
we need a undersanding of this (we don't)
would pydantic questions fit here?
ye, that's where my 2 months college lack of , remember bunch cauasions guys' name
ye, i believe you
Well this is my script I’m writing to create a strategy that’s built to trade strict criteria and focused on exiting the first signs of market reversals or the conditions not being true anymore
Consider doing a degree if you can, it's worth it for many reasons
Locking in profit and minimizing risk
what degree?
Ah, that's a good question actually. I don't know. I come from Physics for example. I'd wait for other people to answer that question
But I reckon a lot of people do Computer Science, others do an actual ML degree
that's a good question, how did you get the job and how did you learned?
Uhm, I got a lot of hands on experience during my masters thesis, which ended up being a very successful project that produced research and even got me to do a talk and a couple poster presentations and published abstracts.
I got a job in software fairly easily, it was during the peak hiring frenzy at the end of COVID. Did a bit of everything, frontend, backend, infra, etc etc. My last role in particular, started as a fullstack but quickly grew into an MLE role, I was training models, etc.
Right now I'm in between jobs more or less, I'm interviewing for companies in Switzerland, MLE roles only, I got all this experience behind me which makes me a safer hire, ML is very expensive.
bro from full stack to MEL and thinking ML is experiensive
And I'm still learning quite a lot. This last one I thought was gonna be easy. Just simple text classification. But it turns out it ain't easy
great
ML is super expensive, since there's a huge GPU shortage and it requires a lot of experimentation
What is your master degree, i was wanted to do one
Actually haven't defended my thesis yet, but it's more of a formality by now
Master is super worth because you get to do a thesis
If you choose an hands on thing, your time is being used very well rite
You're both getting XP in software, making all the mistakes you need to make to learn
But you're also getting a degree and possibly other cool stuff
is media pipe a model? looks like its not on hugging face.
I think you don't understand what you saying, apprantly you from full stack to MLE with a different degree
Mediapipe is a python package with a bunch of ML stuff like object detection
what you learned monte Carlo one + local refinement, in bachelor?
I'm not from fullstack, I just worked several roles throughout the last two years, on startups this is natural
ah ah, well my boss for this project wants me to look multiple models on hugging face and find the best models and put all the metrics on an excel spreadsheet if that makes sense. like specifically on hugging face.
Uhm, I got a lot of hands on experience during my masters thesis, which ended up being a very successful project that produced research and even got me to do a talk and a couple poster presentations and published abstracts.
I got a job in software fairly easily, it was during the peak hiring frenzy at the end of COVID. Did a bit of everything, frontend, backend, infra, etc etc. My last role in particular, started as a fullstack but quickly grew into an MLE role, I was training models, etc.
?
I used it during my thesis to fit data to be used during particle transport simulations, I had a lot of data on various materials which had to be used effectively during the simulation
?
Medical Physics, I specialized in monte Carlo simulations of particle transport for predicting energy deposition of radiation treatments
and doing full stack
What about it ?
lol
i guess i will being a wonderfull ML engineer
You don't believe which part exactly
It's a pretty natural path I'd say
and ask people for a degree
MLE requires you to have understanding of software
and saying MLE is expensive
you transfer from full stack to MLE in same company, means ML is not expensive
becuase its cheap
What
they don't matter cost
Have you ever worked in the industry ?
nope
Makes sense
There's a lot of role fluidity in startups
You're expected to do a bit of everything
so startups affored expensive ML
your logic didn't make sense from frontend to MLE as physic grad
=> ML is expensive
=> do a degree
Less people means generalists are preferred
That's not a logic, that's my employment and educational history
Which do not say much about ML being expensive or not
What determines the cost of ML is the hardware and the nature of the task
Not if a random person has transitioned from full stack to MLE within a startup
What kind of argument is that even >.>
while you should stufy pholosphy first, i thought smart people like i am doing physics
Okay, you are now just saying random stuff, I'm gonna attend to my training loop
i mean, i guess the MLE dosn't need much experiences, as i guess from this conversation, so thanks
at least i know the water here
You strike me as a pretty hostile person. I'd work on that.
You have no idea how far you can get in life by just striving to be a good person
because i got all the easy interviews from data scientist or NLP or DL, i guess you really don't need experience on this one
i mean i will learn the optimizing tho, even for deep learning there is no much small tweaks, for machine leanring i guess there little bit?
like your fav My fav is the monte Carlo one + local refinement with gradient estimation as your only did project in your school
so do u think i go use openCV or media pipe like suggested above?
i don't have experience in company wise project.
Open CV works if I'm not mistaken, but mediapipe comes with the stuff all setup, easier for you just starting out I assume
for the porject you describled, use openCV read the camera images, store the image data set, lable the images has targets, you need to deal with with image transforming, such as what size of the image you want, and then you need a model for trainning
lol its not really a company project. more like a professor trying to test me before I can join his lab, but ig u can say he is my boss later on.
and the model u get from hugging face?
for what purpose? jus object dection? what objects
Just mix both
Mediapipe is good, I've used it for a PoC recently
It fails here and there, if you want better performance you'll have to train your own CNN
if you don't know what is a openCV, quit the topic
actually all objects really. the overall project is to get the camera feed. postprocess it and pass the feed running multithreaded through different models so different models can focus on detect their "purposed" objects better than just using one model.
Could you please explain what you mean ?
Anyway
and how come openCV and mediapipe can just "just mix both"?
I think you should be a bit more humble
Pleas open the link
@hollow furnace I just sent you the two links you need, the first one is the package that will do object detection for you, the second is the code that you can use to read from your camera
@vernal quartz Please stop being hostile towards me.
you should be more humble, this is the harmer to humble you
I guess YOLO is a overall better chocie, but you can try different models. I guess if you only use the APIs for model trainning, the main problem will be adjust the camera image so you can have a standarlized image database
The more important part is how you capture the images and then standardlize them such as size to feed all the models, also multithreading coding stuff
This is not impressive or humbling
i guess you didn't have an IQ to look through it
Sigh
i give you 1 million for using image detection techniqes for these kind of CAPTCHA recoginition
try and do it
this is called Lemin CAPTCHA
this is my own created algorithm for image recognition, it doesn't even used ML and deep learning
I'm sure your technical skills are good for someone without experience, but people are also looking for someone they can work with, it's horrible to work with people who are hostile and think they know more than the experienced folks
i guess you can't even see how many anti-recognition layers on there, this is not your common life image, and i guess you even don't have the skill for nature world images processing and recoginition
from skills good for someone without experience, then half sentence drop the skill talking about personality,
again, this is a multi-layer anti recognition CAPTCHA
this is not a common even real world image
i guess someone from physic transfer to MLE don't understand it
It's time for my coffee I suppose.
Yesterday I found out my model was overfitting, which I thought wouldn't be possible on such a. Large dataset >.>
That’s not really the way we do things in this server. The whole point is the welcome any question, not gatekeep who can ask. The less you know about a topic, the more reason to ask. “Everyone you meet knows something you don’t”
Hiya, fella's. I'm kind of eager to figure out why data science is interesting and what benefits it provides.
Does anyone mind sharing their insight?
And, if someone asks something that’s wrong, either take the time to explain… or just don’t respond. It’s all good.
GOT IT
Can you be more specific? Data science is a broad field
It’s like asking ‘why math is interesting’, very hard to answer succinctly
Would you be so kind to elaborate the most common topics within data science, and what purpose they might serve?
Data Science is only two parts, more tradiontional data anaylzing part that most replaced by GPT, and other great data scienctist doing there job i think, but mostly 300K+ plus just my future, others reporting their basic insights to the captlislim companies that don't know how to or too lazy to use GPT. The second part the only fun part is deep learning, no gurantee income, company don't understand it, but they want it
answer good?
So the process of analyzing sets of data and extracting conclusions from them in a nutshell?
Data science is hard to define the edges of. On one end, it starts with statistical techniques for the analysis of data and on the other, machine learning and AI topics, which include NLP, LLMs (like GPT), LSTM and other techniques.
Not necessarily conclusions: sometimes just an understanding of the data
This def seems pretty good:
Data science is an interdisciplinary academic field that uses statistics, scientific computing, scientific methods, processes, algorithms and systems to extract or extrapolate knowledge and insights from potentially noisy, structured, or unstructured data.Data science also integrates domain knowledge from the underlying application domain (e.g.,...
Could you summarize practical use-cases that are most common in the industry?
heavy or easy process part maybe given to data engineer, data scientist has two parts as i mentioned, really depends on what level of them, most them = GPT and others are great, these excluded DL engineers
Uhm, within ML you get the fancy stuff like semantic segmentation, text summarization, all the generative AI stuff
Are you writing a paper? Feels like these are paper prompts.
I've heard someone say Data Science can be thought of as an evolution of statistics
oh gush
please documented what i said and write on your paper
that will be so ture and funny as hell
I like to think of it as a mashup of CS, Stats and often some domain (ie; finance or physics or whatever)
Yeah makes sense, I always wonder if I should learn more stats, so far I haven't needed a lot of the advanced stuff when doing ML
do you know what is a good data scientist?
and where to learn?
#data-science-and-ml message is where to start
You can never learn too much stats (half joking): it’s completely worth learning a little stats every week. You’ll be amazed how often stats problems come in real world
In particular, the subfield of EDA is very useful.
No, I'm genuinely curious on why people think it's an interesting field.
its creating GPT!!! Like i do i will create terminator!!!
Oh I think Ive always done it in one form or another, like I just know what to look for when I get a bunch of data in front of me.
I wonder how close what I do is to what is supposed to be done tho >.>
Why DS? Because you can discover new information from old information, and new information makes you smarter. Eli5 version
or from when America created
I think ML is fascinating because it touches on stuff I like, math, philosophy and computer science. On top of it, it'w usually applied to some domain, so I can actually mix all the stuff I like into one job
What could the results of ML contribute towards to? Like more efficient ways of doing generic tasks?
That does sound like an assignment question
ML is trash, its based on data analyzing
Perhaps it makes an airplane more efficient. Or prevents a country from an agricultural disaster. Or answers your homework question. Anything, I guess
Hmm, how could it prevent a country from an agricultural disaster?
Comparing previous data with existing current data and forecasting some sort of scenario?
But it has the potential to disrupt entire industries. I'm not convinced about the singularity actually, recently changed my mind about it
Sure. Predicting weather perhaps? Or crop cycles?
Or finding more efficient planting methods, etc
My next project is gonna be an image classifier that tells me when my pizza is ready, cuz I need that in my life
Are there any dynamic changes it could bring towards the development of an individual. Ex; would you be able to use your skillset as a way to enhance your lifestyle?
Interesting.
Depends on what you mean, but there's a lot that can be said about that
Could you define a few examples?
farmers give us better food
And is forecasting the only type of byproduct of ML/DS or are there more?
Iirc, the very first web order site was for pizza.
you can anaylzing how many nicotine you inhalred, and predicte when you will die based on your speices
LOL, that's quite an example.
i guess that's what called death are coming
Just look into what open AI is doing, they're doing so much cool stuff rn
This topic is also https://en.m.wikipedia.org/wiki/Actuarial_science
Was ML or DS the major contributing factor of OpenAI's new tools?
What’s the difference?
Zero clue.
I believe ML is a subset of DS, or something like that
I think you should probably start with those Wikipedia pages, and find a good explanation and read it thoroughly. These are broad topics that need a more thorough explanation
I’m not sure we can adequately explain the entire industry effectively here
Ah, well.. I appreciate the insight(s) and guidance.
depends on new tools, openAI GPT model is using DL creating and analazing, sora is another transoforming model, in the nxet few years we can transfer babies to food

Theyre good questions, it just sounds like you should read a basic ‘what is DS’ page to get the terminology
Perfect, HBS. Source (https://online.hbs.edu/blog/post/what-is-data-science) for learners that want to tune along.
Yeah Medical is huge
Thanks, just a bit dazzled as an full stack dev.
This entire industry seems like a new digital world opening up for me.
please remember data science only two parts, one part is GPT stuff, the other part is DL
The DS terminology gets very confusing quickly, since there’s so many branches and techniques.
ChatGPT depends on DL, so that isn't a coherent partitioning of DS.
Actually one of my collegues was working on similar stuff, detecting cancer in PET scans or something of the sort
no, deep leanring developer is a different role, but most industry don't have enough courage or people to hire as a deep learning developer
Are DS'ers involved in building the algorithms, i.e. developing the software? Or are they limited in some sort of way? (Had a quick skim, couldn't get to the bottom of it) So far, I've seen most is data gathering.
There is no consistency in what a "data scientist" actually does.
Uhm, a lot of the data scientists I worked with seemed confined to jupyter notebooks and such, but I also heard that there's a trend towards requiring software skills from data scientists
I laughed at how the HBS link above separated DS and DA (the distinction is very muddy in real world)
So, pretty transformative. You just get to dabble in everything that's necessary? Quite chill, imo.
what I mean is, companies can ascribe whatever job responsibilities they want to a data scientist role. it doesn't mean that data scientists get to "dabble in everything" in the context of their empolyment.
This question is for the employed and or fascinating hobbyists, what responsibilities do you get to adhere to?
I wish there was enough time in a human life to do it, there's a ton of cool stuff to do in this industry
I wish, these time constraints on our lifespan as humans are quite tedious atleast.
My last role was at a small startup, I did a bit of everything from frontend, to platforming and to ML and even research (customer was keen on one of the projects)
I think at larger companies you'll be a lot more confined to your lane, but I've been in small places only so far
im tryna run this code on vs code but it seems like cv2 didnt detect any feed: cv2.error: OpenCV(4.9.0) /Users/xperience/GHA-OpenCV-Python2/_work/opencv-python/opencv-python/opencv/modules/highgui/src/window.cpp:971: error: (-215:Assertion failed) size.width>0 && size.height>0 in function 'imshow'
Looks like a validation error, did you notice anything in your camera
I lost interests when i read here
I'd say in general, it's normal to not have interest or a positive bias in abstract knowledge. Say you were to make it tangible and realized it into a project, you'd have more pleasure.
oh and im using the front mac camera btw, not some plugin external camera to the side
i dont think so, didnt see any windows pop up or any green light flashing
That’s why school is important: to ‘force’ you through these foundational topics
Wait did you check the answer ? I think there was a bug in that code
Quite rigurous, you could use some positive reinforcement or a different mental model to tackle your problem. Gets you through the boring stuff.
Matrices are just crazy and keep getting crazier the more you study them
no i understadn what is matric and what is linear, i just don't have courage to read the mathatics
i just lost the interest when i read N
Try this one
import cv2
# define a video capture object
vid = cv2.VideoCapture(0)
while(True):
# Capture the video frame
# by frame
ret, frame = vid.read()
# Display the resulting frame
cv2.imshow('frame', frame)
# the 'q' button is set as the
# quitting button you may use any
# desired button of your choice
if cv2.waitKey(1) & 0xFF == ord('q'):
break
# After the loop release the cap object
vid.release()
# Destroy all the windows
cv2.destroyAllWindows()
yeah. hmm maybe its a user password one at the bottom but 1) i dont know if my camera has credentials and 2) mac password has an @ symbol so dont know if it messes with the '@192..."
Yah, academic writing like this takes a lot of practice and familiarity with the boilerplate stuff. If you’re out of practice, it’s painful.
I just searched online
i mean after read it i know its talking about metrix, but what is EN?
what is this shit?
We're on the same boat.
in the Natural number field
This one is the official docs
Like, I know the terms, etc here so I can just skim and understand what it’s saying. But take a paper on a topic you’re not familiar with, you’ll get stuck on the definitional stuff
n is an element in the set of natural numbers
The e means; an element of
It's like any programming language really
Except people keep adding features to it
There's actually an isomorphism https://en.m.wikipedia.org/wiki/Curry–Howard_correspondence
In programming language theory and proof theory, the Curry–Howard correspondence (also known as the Curry–Howard isomorphism or equivalence, or the proofs-as-programs and propositions- or formulae-as-types interpretation) is the direct relationship between computer programs and mathematical proofs.
It is a generalization of a syntactic analogy ...
dang, its asking me permission for camera. i think that might work. ill wait until it runs completely
Sets are actually a good start to study math. Everything is sets. The number zero is the empty tuple, which is a set with the empty set, something crazy like that
why we don't just say it MN is in nature numbers?
In mathematics, the surreal number system is a totally ordered proper class containing not only the real numbers but also infinite and infinitesimal numbers, respectively larger or smaller in absolute value than any positive real number. Research on the Go endgame by John Horton Conway led to the original definition and construction of surreal n...
like say it in english like what ur saying here?
This is the coolest number system ever
because that takes 5 times more characters to write
cuz math needs precise definitions and should leave no doubt for the reader. at least thats what my math professor said.
this entire paper is bsting about rows and columns
and then i realize its matrx adding
fk
help
what is this
i think its also bc u want math to transcend beyond the language barrier. so a french math nerd can still understand some papers written in english from uc berkeley folks for example.
i know its talking about a matrix, but i felt discouraged
????????????
????????????
do you have a particular question about this?
here they're just introducing notation
is this saying ur multiplying elements diagonally and then summing them up?
waht is the reverse A?
it says in english: for all matrices A, B, C containing real numbers and with shapes m x n, n x p, and p x q, respectively, multiplication is associative
this is what ML scientists have been demanding our respect for?
we've been played for absolute fools.
the angle?
"all". and upside down E is "exists"
forall
backwards E, I should say
for all A in Real number field of size m times n
m times n are dimensions if i remember correctly
.latex $\forall$ means for all'' and $\exists$ means there exists''
.latex $\in$ means ``is an element of''
so "all" means A or all means A,B,C, and why they formated like A,B,C:, does they connected?
i always get jumpscared when I remember this is how one writes quotes in latex
i'd much rather csquotes, thank you.
not sure i understood your question
they are connected. but the for all means for each of the elements specifically
they're connected tgt by the last part
ah it means all A, B, and C with the given properties
(AB)C MUST be equal to A(BC)
forall ..., it's true that (A B)C = A (B C).
so for A in R, B in R, C in R, (AB)C=A(BC)
with the given shapes
not just "in R". the dimensions matter.
since matrix multiplication is not defined for arbitrarily shaped matrices, the shapes are important
R^(a x b) means "an array of real numbers of shape a-by-b"
well, this specific statement is true as long as it even makes sense. matrix multiplication is associative for all shapes for which it's even defined.
.latex Have you ever seen $\mathbb{Z}^{p \times q}$?
it would mean that both of A and B are m-by-n arrays.
.latex formally, the $\times$ symbol denotes the cartesian product. consider the tuple $(a, b)$ where $a$ and $b$ are both arbitrary real numbers. we can describe this as $\real \times \real$, or equivalently $\real^2$
(that is, they are both elements of the set of all m-by-n arrays)
is that just an alias for \mathbb{R}?
.latex formally, the $\times$ symbol denotes the cartesian product. consider the tuple $(a, b)$ where $a$ and $b$ are both arbitrary real numbers. we can describe this as $\mathbb{R} \times \mathbb{R}$, or equivalently $\mathbb{R}^2$
yeah. also some modulo stuff, like Z/nZ
but isn't A,B are elements in m-by-n arrays? why they became the m-by-n array?
.latex $A \in \mathbb{R}$ and $a \in A$ mean different things
oops, forgot the m xn
anyway, one says A is a multidimensional array
the other says that lowercase a is an item inside of the matrix A
so waht is
A is array or element
it's telling you right there
that's exactly what that says
you might wanna invest some time in familiarizing yourself with the symbols
you already know "for all" and "is an element of", so you're able to read this
so A is an array and a is element?
in what you shared, yes, but symbols are not universal. different authors will use different symbols
the symbols for "is an element of", "for all" and "exists" are usually kept standard, but anything else will vary depending on where you read it from, so it's in your best interest to learn how to read it, not memorize the symbols
i will use a shit symbol when i publish articals
you can use whatever you like as long as the notation is made clear
some of the more cursed stuff uses musical symbols
but, just want to confirm, what is actually A,B in Rmxn means, because it should be mean element right??
all you have to do is scroll up to where i told you exactly
but really, you're not reading
what you're asking me is what the image is telling you
sit down calmly and translate the expressions into english or whatever language you like
if I use like, a super agressive dropout on the first layer, would that work to prevent overfitting ?
but you said it can be element or an array
i don't know if that make seens in the notation it means array
∀A, B ∈ Rm×n, C, D ∈ Rn×p : (A + B)C = AC + BC (2.19a)
A(C + D) = AC + AD
you mean code?
that's python code
it will create a list, from 0 to 9, and read from it
and what is i?
this is off topic
no, it's an example of the same notation
that line of code tells you i is a number in the range from 0 to 10
Let A = {apple, strawberry, grape, 7}, then a∈A is either an apple, a strawberry, a grape, or 7. @vernal quartz
the line you shared tells you to consider some object A, doesn't matter what it is, that is in the set R^mxn
i is a memory, bytes, when it read it be assingend with bytes, represent each 0 to 9, its never been a number
the variable i will have different values in different iterations, but the value will always be in the range from 0 to 9
This notation is set notation, dealing with sets of things. E.g. the set of all real numbers (the fancy R).
How aggressive? Something in the range of 0.7 - 0.9? Well, I guess the only way to know is to try it out. Maybe throw in BatchNorm + Dropout
i said its off topic beacause its a bytes, nothing realted to the question
I'm gonna give it a try, but tbh, the solution will likely be to steal lamma's embedder, or the one from uncased bard from hugging face
even if you want to look at bytes, they're saying exactly the same thing
you're failing to see the connection
look also at the example squiggle gave you
positional encoding I can just pre-compute with the sine's fucntion thing from the 2017 paper
math is a representation, it is not existing, while bytes and coding stuff is machanical stuff, it should never be reprenseted as non-existing meanings, such as number
i have bad news for you if you intend on working with AI, then
a bytes is a physical stuff
it's also definitely not mechanical
and number is not
well, good luck
wasn't penrose who published a book about all math being real and we just live in the chunk of it that allows for sentience to exist
wait so for this portion set up in google mediapipe, do u want this to run while the camera is on (in the while loop)? and what goes in the detector initialization part at the bottom?
Math is a way to communicate ideas (including to oneself, by augmenting ones own memory via external devices such as paper). It also happens to be the best known way to communicate these kinds of ideas as efficiently as possible. Prior to this kind of notation proofs were in plain words, and they were long and really hard to follow. Trust me, you don't want that.
I think you need to code the while true (also called event loop) inside the with
if i write for i in range(10), what is i?
jjjj — Today at 1:14 AM
you mean code?
Edd — Today at 1:14 AM
that's python code --- i just saying this is totally two questions, as one is coding behending machanical, such as logical gate and can't simplified as fictional - numbers
you are describing a design machinal and designed physical situation within memories compare with a pure mathmatical (fictional) situation
back again, so A,B is array in R or A,B is element in R?
you're gonna have a really bad time if you don't learn to abstract real world phenomena
array or element or both
scroll up to where i already answered the question
because if A,B means array i don't understand the notation
wait til you get to whatever the heck category theory is
that does not mean array and has nothing to do with that
so you mean everything in this R
no
what is it
Just to drop a Wikipedia link: https://en.m.wikipedia.org/wiki/Universal_quantification
why don't you just read the line and replace the symbols we discussed with their meaning in english?
i think the hardest part about math is the notation is not as descriptive as code
A is element in R or array?
if you don't know what you're reading it might as well be hieroglyphics
In this conversation, I think the notation and code are interchangeable.
it's the same
and you mean math notation, because maths is independent of the notation
you could just as easily do it in english if you use precise language
right, but everyone uses math notation
yes, but not the same one
Category theory is a general theory of mathematical structures and their relations
I thought study of mathematical structures and their relations was maths, so like category theory is meta math ?
Where are you coming up with ‘array’?
every single book and paper starts with a description of what the symbols will mean throughout their work
Well, actually, it depends on which notation you use. Some is just straight up code, and it's also one of the preferred ways to provide a rigorous proof (since it can be checked by a machine).
(See formal methods)
I'm not talking about math papers I'm talking about papers that use math
same thing
most of the time people aren't proving their results or anything in model papers
they will start with a definition of the symbols, and they will also have text and figures and pseudo-code
be it pure math papers or engineering ones
because a mixture of all those things is perfectly valid, and the symbols are not universal
AI papers are generally bad is my 2 cents for you
For any A,B where each are elements of Rmxn
all you had to do was scroll up to where i had already told you that
i just past your anwser, you told me you are not sure
what?
you said one says its an array, one says its an element
yes
????
The math for this kind of stuff does not really fully exist yet, nor do the AI/ML papers often make any sense. The notation / meaning tends to fall apart once they start using notation involving probability, often due to a lack of analysis (the mathematical branch).
you shared an image with an element of a matrix, and with a matrix
it doesn't make sense if it is an array
you're not reading and you're refusing to understand
sit down calmly for a second and read carefully
.latex the image you shared spoke of a scalar element $a \in A$ and a matrix element $A \in \mathbb{R}^{m \times n}$
I thought it was me having a problem good to know that other people think a lot of the papers don't make any sense
and i told you to read the symbols carefully so you can distinguish them, because authors will use different symbols
nowhere in what i wrote was there any "i'm not sure", there was only "learn to read the notation, don't memorize"
.latex going back to the for loop, you can write it in the form $i \in [0, 10)$ which is of the same kind of what we're discussing here
I think I understand part of the confusion: the element type depends on the what the set contains. Could be anything. Could be the set of all adults in Kansas. Could be the set of all Real numbers. Could be a set of all sets containing at least one prime number, etc
The element would thus be any single member of the set
i also explained the notation R^{m x n} above, but they skipped that, it seems
Papers are not perfect, it's a messy thing, lack of knowledge for the math, or even trying to force it in to somehow make the paper feel more valid (instead of just "IDK, it works in practice").
i am bereft of patience now so i'll be on my way
‘Gotta publish something this semester’
Trying to remember how to index into my numpy tensor
ndarray whatever
3d array
I ended up fixing my polars so I only collect once but now I have this big honking numpy array I have to index manually for different columns
I guess I could just use the eager frame
The reality is that a lot of things start out as "IDK, it works in practice," and stay that way for a long time. This is not a bad thing. It's how all inventions (even in math) tend to start. But academia does not like that one bit, because it requires admitting a lack of knowledge (that does not exist yet), that you are on the bleeding edge.
does someone have a one liner to get the upper triangle only in a flattened numpy covariance matrix
im about to ask chatgpt
i will listen to Bobby
In a flattened array ?
That’s a bad idea. Edd actually knows this field, I’m just a visitor.
triu would've been my suggestion
so the single member does contain an array, because array at least 2 elements, wait, array can be 1 element, but when we talking about the elements in array, there are only elments, there are no array
The universe is the set of all (m,n) dimensional arrays. M and N could be whatever you want.
but in a array, it only containes elements
You’re confused.
self.observation_space = spaces.Box(
low=-np.inf, high=np.inf,
shape=(n_lags, 17 * self.no_symbols + 39 + self.no_symbols*(self.no_symbols+1)//2)
)
I need to write a comment on the shape of this thing at this point it's gotten monstrous
You’re using the term element in an ambiguous way, which is your confusion, I think
X element of Y means any member of the set Y
let's say this array, it can only contain elements. ∀A, B ∈ Rm×n, C, D ∈ Rn×p : when we use this A and B means anything in side this array as a single element, so it will be element
Apple, strawberry, grape, and 7 are all elements of A. @vernal quartz
yes, i know
No.
how come this became a array for A?
Associativity:
∀A ∈ Rm×n, B ∈ Rn×p, C ∈ Rp×q : (AB)C = A(BC) (2.18)
distributivity
Distributivity:
∀A, B ∈ Rm×n, C, D ∈ Rn×p : (A + B)C = AC + BC (2.19a)
A(C + D) = AC + AD (2.19 ----- so you saying A,B can be an array in nxp shape R array?
What is "this" in this sentence?
I'm not understanding the confusion, this is pretty simple to interpret I feel
the question is A,B can be an array in this notation?
They're matrices
as an element in nxp shape R array?
The confusion is whether the ‘element of’ is an element of an array or is it an member of the universe of all arrays
I don’t know how to explain differently
If you had something like: Let T = {[1, 2, 3], [4, 5, 6]}, and a∈T, then "a" is either [1, 2, 3] or [4, 5, 6].
consider reading ∈ as "belongs to the set", for less confusion. "element" is an ambigous term.
∀A, B ∈ Rm×n, my question is, as Rmxn is an array, so array only contains single digital numbers or anything in this array, so can A,B be an array
Rmxn is not an array
It’s the set of all m,n arrays
(Real valued arrays)
∀A, B ∈ Rm×n, C, D ∈ Rn×p : (A + B)C = AC + BC ===> because if you think A,B can be an array, i don't understand the following: (A + B)C = AC + BC, so this can aslo equaltion to array additiaon?
(A + B)C = AC + BC so this is array addition?
That's a matrix equation
... uh, yes? this identity involves matrix multiplication and matrix addition.
It's stating that the equation holds for all such matrices
Every time you write a math sentence you're implicitly stating it to be true
(A + B)C = AC + BC, if A,B are matixs, is this notation right? or equation
If a and b have the same shape
That’s why they’re both members of Rmxn, to say: they both have same dimensions.
how do I flatten the upper triangle only though
that's still the issue
np.triu gives you nans elsewhere but if you call .ravel() or something it still gives you the nan values
you could make a boolean mask with ones in the upper triangle, and do arr[mask]. that'd be a 1d array.
You could just mask it with … ^
I guess you could invert it and triu?
Oh, tril duh
a oneliner would be arr[np.triu(np.ones_like(arr))!=0], I guess
covariance = np.triu(np.eye(self.no_symbols))[np.triu_indices(self.no_symbols)]
"It is true that for all A in Rmn and for all B in Rnp, (A+B)C = AC + etc ... "
triu(eye)? isn't that always just an eye?
i got it here, the author called it matrix
that's just the case where it doesn't have enough observations
also I need to remove the triu call
I just need the indices
but yeah there's a triu_indices method too that's handy
purpose of whom
Do you need to keep any order when flattening ?
Then just sort them and slice from the index of the first Nan
nah that won't be consistent
Wdym you'd be literally sorting them
λ ∈ R. Let A ∈ Rm×n any idea of difference between R and Rmxn? this scalar λ in anything?
the mapping from i, j of the covariance matrix to the flattened array can be in any order, but it has to be the same mapping every time
Ah you can just kinda use index trickery for it I think
But it says that A is a matrix, it’s just saying for any matrix of real numbers
Because the dimensions aren’t important to these properties
λ ∈ R you mean this scalar is any real number?
You’re mixing multiple links here but that’s what that says yes
i got it
This is all just describing the basic ways in which matrix algebra works.
You could list all this with real numbers, or whatever you are used to, and it will look similar (list of rules).
The first line here, I can show you something similar you are familiar with: a(1/a) = 1 = (1/a)a.
thanks
is analytic gemotrey and matrix decomposition important?
Few things are more important.
?
Very important.
https://mml-book.github.io/book/mml-book.pdf can i change a book
this book feels off
i need a pre-education book
what s is this, its even write weird
why it needs to be twisted like that
What’s twisted? Have you taken stats before?
Bayes theorem is in college stats, right?
That's why I suggested you to consider a college degree earlier, a two month long course can't possibly have given you the tools to do ML properly
Yes, should be in undergraduate.
i took
I went to college decades ago. This stuff isn’t fresh in my mind, but it’s just part of my base knowledge.
In a review of probability before really getting to the actual stats part.
At the start of the first semester.
thanks sharing, i did bussiness analytic degree
i'm not sure that's ture, my first semester stats is 83
And you haven't learned neither Bayes theorem nor linear regression ?
We’re just trying to figure out what you need to review: as (mostly?) CS students, we take some of this stuff for granted.
Business analytics (BA) is a set of disciplines and technologies for solving business problems using data analysis, statistical models and other quantitative methods.
tell me everything
the applications of bayes theorem aren't always immediately obvious
everyone should remember the basic idea though
A good stats background is definitely important for DS
The three important courses: Linear Algebra, Stats (calculus based/engineering stats), and hmm, discrete Id guess
I may be missing something, but this looks like a good start
linear Algenra i got it, so Stats
Discrete mathematics / proof writing / mathematical notation seems missing. Counting and probability.
Also review the progression section here: https://en.m.wikipedia.org/wiki/Mathematical_maturity
And understand you’re just at the first step… it’s ok, just appreciate that there’s more ‘maturity’ to develop
With the basics
Sounds like a business stats course (non calculus based)
Many colleges have two stats courses, one for engineers and one for non-engineers
I have a PPO implementation with Actor and Critic both having 2 Dense layers 256 each with RelU activation and an output Dense layer with size 2 I thik with softmax. Unfortunately the average score for Cartpole v0 peaks at 100 and I don't know why. Can anybody advise if my layers are good enough?
Is this reinforcement learning ?
yeah PPO reinforcement learning
do not discriminate business degree student, we are smart ass
Ah, I haven't done any of it, if you could put in terms a analogous to supervised learning I might be able to help
But if it plateus it might be that the model is at capacity and you gotta make it bigger
what's a good reward on cartpole
possibly, I'm very new to reinforcement learning, never made a model before
I spend most of my time comparing rewards on my own custom environments not benchmarks
I never touched it, gradient man might be able to help tho
thoughts?
i'm using gym, so assuming time survived, the higher the better
If you can, I strongly suggest a college degree.
are you using sb3 for the PPO
there's very sparse documentation of gym environments to give concrete answers
I think you seem to be struggling with the notation, so anything that gets you used to that. One such thing is a discrete mathematics book, often used to get people in CS used to it (and it's relevant to writing algorithms).
I mean you can see benchmarks for cartpole on a lot of sites about RL
I just don't remember what a good score is
stopped here, bootstrap
There is going to obviously be a limit to how far you can get with a given environment
is this stable benchmarks 3?
stable baselines
never used that library
ah ok they have an implementation of PPO was just curious
i glued it together from https://www.youtube.com/watch?v=9DO63MSGeNA this video
i think is my bussiness stats 108
also do you think stacking more dense layers in parallel into Actor and Critic would improve my score?
Being able to apply existing methods / tools is great, and what a business degree should focus on, but in this case you are trying to understand how those work fundamentally, at a mathematical level. To do so you need to be able to read something like the Bayes theorem notation. This can be learned by any number of mathematical books on probability. But also you want to be comfortable with the set notation earlier, and the linear algebra notation. In addition, ML makes heavy use of calculus.
any types in particular apart from Dense?
not afaik
could an LSTM layer help? since you need to balance a cartpole and previous information could be useful
there's people researching different network architectures for policy networks
yes, i unsderstand, i want to figure out where i should get rerestarted, i already got linear algebra, so calculus then?
there is a version of PPO with an LSTM layer but empirical results aren't that great in most applications compared to simple framestacking
What do you mean when you say that you have linear algebra ?
I definitely don't have linear algebra I need to get the linear algebra brought in
Linear, Discrete and Stats are the holy trinity for CS… calculus is assumed, altho I don’t think is as important, but part of the assumption is that you’ve grinded through years of math and developed the math maturity to cope.
what level stats is this?
I'm probly biased, but I think calculus is the most important cuz of gradient descent
what effect does widening layers has as opposed to making them deeper?
I'm honestly less hung up about what method belongs in what box
I don't really have an interpretation for you besides the actual thing
I’m conflicted on this point: I think it’s simplest to explain the eli5 of this, but maybe only because of a few years of calc? I dunno
they lead to different results though
To what extent do you understand linear algebra? Have you done matrix factorization? Calculus, yes (including multivariate (cross over with linear algebra)). Set theory (set notation is everywhere). Probability (and counting) theory. Statistics (after probability and calculus review) with calculus in it (and set notation again). Then back to that ML book.
for example if I were to process images and stack conv2d layers, would making them wider help network learn better?
do they teach people statistics without calculus?
Yes
yikea
Business stats has no calc. Most colleges offer it to non engineering majors
calculus is assumed a lot of the time
My SO took 7, yes seven stats classes
I took statistics for economics and it had calculus
And they only gave her math after like 4 or 5?
AP stats, for instance, has no calc
we had to do real analysis in 1st year before diving deep into statistics
I studied business engineering. My path was a lot more linear. Year 1 had lin alg in semester 1, with some programming course, semester 2 had calculus.
i argue real analysis is calculus on steroids
I didn't take a real real analysis course I took a course called mathematics for economics that was about optimizing in convex spaces and hessians and all that jazz
Year 2, semester 1 brought stats
Year 3 brought econometrics and so on
I think econometrics was actually the most valuable data course I took because it was the one that taught you the "finesse" of working with data
That's so missing in most
i miss the funny questions like can an infinite union of countably infinite sets be countably infinite
If i'm being honest, with regards to what you are trying to get into, level 0.
I feel like my econometrics course was much ado about multiple linear regression
That’s something I find myself repeating a lot: that ‘calculus’ is the beginning not end of the math journey. Students were built up to believe that calculus was the pinnacle, rather than the pit stop at the foot of the mountain.
Yeah, it started slow but ramped up. By the end we were covering logit/probit and so on in detail
By my masters I was exclusively taking CS faculty ML courses and they felt half as rigorous. Many of my cohort were lacking that "finesse"
did anyone do Measure Theory?
Probability and stats don't really work without it.
Hah, I took a stats class in my CS masters and it kicked my ass. All the Cs majors were dying.
Like, half the class was stats majors
measure theory was breaking my brain. It's like real analysis for stats
just plane, multi planes and dot product, maybe more but i don't remember
They were so complementary. I took a course called "data mining" and there we had to do logistic regression by hand
It's great, and then you can look at all the stats online and realize most of it is busted.
I feel like all the courses I've taken have been for stupid people that's the only way I could have passed them
Probability i remember i did this
It was a nice eye opener to do SGD, chain rule etc. to really get it
Yeah, so linear algebra goes really deep, leading into / part of abstract algebra.
where i should go
The previous semester, econometrics, was really all about residual analysis and "modelling" moreso than "Yeah SGD is scalable!!!" (which was more the CS faculty approach)
It's probably the most applicable branch of math other than calculus (maybe even tied with it).
I did a lot of measure theory in terms of proving theorems and answering weird questions, but we never applied it in machine learning, so I forgot everything haha
Doesn’t matter, pick any one topic you don’t know and learn it. Repeat until you know everything (joke).
I learned backprop really well once and I have completely forgotten the actual math behind it tbh
anyone give me a road map for math
I forget everything
You can learn in any order, linear, stats, discrete, calc
school->college->uni
I must have done too many drugs or something
I'm never going to forget, data mining had this one exam question that was basically
What happens to the weights of a logistic regression if you have no regularization and the classes are perferctly linearly seperable. Answer in 15 words or less
i think i learned probabilty, and my stats i don't know what level is it
completely depends on your level
I remember they go to infinity and I used L'hopital's rule to prove it
I don't remember why tho
ah L'Hopital's rule
which areas of maths are you interested in?
I remember using that
They’re trying to build up to broad understanding of ML/DS
i interested whatever the shit is for me to build a deep leanring model comfortably
coding alongside linear algebra
My 2 cents, if you're like most people then you should iteratively go between math and coding
I'm gonna just admit it it's a real problem that I don't do anything by hand because I learn it all and then I forget how it works and at the end i'm no different than someone using a library they don't understand
deep learning is not coding
I'd not cope with doing math for the sake of it
calculus is important
Calculus, linear algebra, multivariate calculus, discrete, probability, statistics. Maybe this order is alright. Thoughts anyone?
I did multivariate calculus before linear algebra believe it or not
would real analysis be useful?
Sounds rough.
I just went calculus -> multivariate calculus -> linear algebra
I don't think we should overstate the amount of math you need tho
Yes, but at the end, it's like level 2. You don't really need it, but it will take your understanding to the next level.
Actually I found multivariate calculus relatively easy compared to Linear
What it mostly gives you is certainty that you're not doing stuff incorrectly
I think Linear algebra requires a completely different part of your brain or something
I think the part you should really get as a practice focused ML professional is how to evaluate models correctly
so for math, should i calculate everything by myself, after leanred, or just watching it
because I was fine at calculus but I suck at linear algebra
But I always say this 😄
i crammed all the exams with energy drinks and 7 days of revision so i probably did multivatiate calculus, cant remember xD
Yah. But that was the third in your series of calc courses too
for math, should i calculate everything by myself, after leanred, or just watching it
Implement stuff in code. For example, can you do Gauss-Jordan elimination in Python?
If you know how to evaluate models correctly, that means you could technically do no harm in using the models incorrectly if you don't know how they work
you need to attempt as many problems as physically possible
i don't know what that shit is.
dpeends if you are applied, visual, or whatever learner
i only know gussian
even I dont know what that is lmao
I couldn't code guass-jordan elimination 🤷
I know what it is
But idt it's particularly relevant to be honest
Being able to go from math to code is a great skill and will also solidy your understanding of the math too.
It's the same story with CS proper
Not every programmer needs to be a systems programmer
i don't question my coding skills, i just don't understand math
oh is that where you subtract rows and columns from each other?
how to calculate them
Not that gauss-jordan in particular is important to implement, but being able to do stuff like that makes you very valuable labor.
In that vein, not every ML professional needs a deep understanding of math
valualbe what?
A working knowledge is enough for the vast majority of roles
Everyone will want to employ you.
in my experience people care more about your experience doing commercial things
i will let every recruiter know i know guass-jordan elimination
Yes, but with enough experience, you become the number one target for hiring. People with these math + coding skills are highly valuable (and rare).
what area of maths is Kullback–Leibler divergence in?
This is handy to know, intuitively
Information theory
Yeah. I'm kind of talking within CS roles here.
If you’re aiming for a DS role, you’re certainly expected to ‘know’, not just ‘do’
We have several CS/Physics PhDs at work
Calculus, linear algebra, multivariate calculus, discrete, probability, statistics. can you order it from high scholl to uni? i want to know here to start
And I'm sure if I roll in tomorrow and ask everyone to code up Gauss-Jordan elimination