Help with MLE Estimators | Mathematics | Page 1

woeful dune May 6, 2024, 2:01 AM

#

Frankly, I have no idea what I have to do here, and I am in the brink of tears. I just don't get anything.

heady quailBOT May 6, 2024, 2:01 AM

#

woeful dune Frankly, I have no idea what I have to do here, and I am in the brink of tears. ...

#

Ask your question and show the work you've done so far. If you've posted a screenshot of a question, specify which part you need help with.
Wait patiently for a helper to come along.
Once someone helps you, say thank you and close the thread with:
```
+close
```
Feel free to nominate the person for helper of the week in #helper-nominations
Do not ping the mods, unless someone is breaking the rules.
If you're happy with the help you got here, and the server overall, you can contribute financially as well:

woeful dune May 6, 2024, 2:02 AM

#

I don't understand what the estimator of MLE in this case, is this just finding the common MLE for sigma? but what is r in this case?

broken plank May 6, 2024, 5:33 AM

#

woeful dune I don't understand what the estimator of MLE in this case, is this just finding ...

Here, you have two random variables V and X

#

You suppose that there is a nearly linear relationship between them

#

So there is a slope r such that V = r X + epsilon, where epsilon ~ N(0, sigma²)

#

So here usually it is easier to provide an estimation for the slope rather than sigma²

#

This is just a linear regression case

#

You could think that conditionally to X, V follows a normal distribution N(rX, sigma²)

#

Hence the conditional log likelyhood of V is given by $\ln f_{V|X,r, \sigma^2}(v|x, r, \sigma^2) = \sum_{i=1}^n \frac{(v-rx_i)^2}{\sigma^2} - \frac{n}{2}(\ln \sigma^2 + \ln 2\pi)$

idle sierraBOT May 6, 2024, 5:49 AM

#

Rion

broken plank May 6, 2024, 5:50 AM

#

Where x denotes the vector of all observations (x1, ..., xn), which are independent

#

So now you have an optimization problem

#

With 2 variables

#

How do you solve optimization problems with 2 variables?

woeful dune May 6, 2024, 12:25 PM

#

broken plank How do you solve optimization problems with 2 variables?

I honestly don't know, this course was done very very very rushed and i kinda lost the thread

#

I just want to know how to do the things in that exam and thats it ig

broken plank May 6, 2024, 12:28 PM

#

woeful dune I honestly don't know, this course was done very very very rushed and i kinda lo...

Compute the gradient?

#

And the critical points

woeful dune May 6, 2024, 12:32 PM

#

broken plank Compute the gradient?

solve the partial derivatives from the log likely function?

broken plank May 6, 2024, 12:33 PM

#

woeful dune solve the partial derivatives from the log likely function?

Well you equate both to zero

#

You see that the function there is a function of r and sigma²

#

(instead of sigma² I will use v)

woeful dune May 6, 2024, 12:33 PM

#

okay

broken plank May 6, 2024, 12:34 PM

#

So say $g(r, v)$

idle sierraBOT May 6, 2024, 12:34 PM

#

Rion

woeful dune May 6, 2024, 12:34 PM

#

okay

broken plank May 6, 2024, 12:35 PM

#

You compute $\nabla g (r, v)$ and equate it to 0

idle sierraBOT May 6, 2024, 12:35 PM

#

Rion

broken plank May 6, 2024, 12:35 PM

#

In other words, it's a system of equations, which are

#

$$\frac{\partial g}{\partial r} (r,v) = 0$$

$$\frac{\partial g}{\partial v} (r,v) = 0$$

idle sierraBOT May 6, 2024, 12:35 PM

#

Rion

broken plank May 6, 2024, 12:35 PM

#

2 equations, 2 unknowns r and v

#

your turn now

#

i need to eat

woeful dune May 6, 2024, 12:46 PM

#

The first would be something like Summation 2xi (v-rxi) / sigma^2

The second would be something like Summation -2(v- rxi)/sigma^3 + n/sigma?

broken plank May 6, 2024, 1:20 PM

#

woeful dune The first would be something like Summation 2xi (v-rxi) / sigma^2 The second w...

The second one is wrong

#

almost correct

woeful dune May 6, 2024, 1:20 PM

#

huh

#

let me double check second

broken plank May 6, 2024, 1:21 PM

#

also since the computations are fairly complex, if you want me to check I ask you to write with tex

woeful dune May 6, 2024, 1:21 PM

#

i dont know how to work the bot, let me see if i can figure it out

broken plank May 6, 2024, 1:21 PM

#

it's fairly intuitive

#

take example from my texts

#

$\sum_{i=1}^{n} i = \frac{n(n+1)}{2}$

idle sierraBOT May 6, 2024, 1:22 PM

#

Rion

broken plank May 6, 2024, 1:23 PM

#

When you enclose your math text between dollar signs, tex will render and display your message

#

And the round d is given by the command \partial

#

$\frac{\partial g}{\partial r}$

idle sierraBOT May 6, 2024, 1:24 PM

#

Rion

woeful dune May 6, 2024, 1:24 PM

#

$\sum_{i=1}^n \frac{2xi(v-rxi)}{sigma^2}$

idle sierraBOT May 6, 2024, 1:24 PM

#

Jacques

woeful dune May 6, 2024, 1:24 PM

#

i didnt know how to do sigma

broken plank May 6, 2024, 1:24 PM

#

Yeah, don't forget the subscripts

#

and for greek letters, add a backslash

#

$\sigma$

idle sierraBOT May 6, 2024, 1:24 PM

#

Rion

woeful dune May 6, 2024, 1:24 PM

#

gotcha

broken plank May 6, 2024, 1:24 PM

#

$x_i$ too

idle sierraBOT May 6, 2024, 1:24 PM

#

Rion

woeful dune May 6, 2024, 1:25 PM

#

$\sum_{i=1}^n \frac{2x_i(v-rx_i)}{\sigma^2}$

idle sierraBOT May 6, 2024, 1:25 PM

#

Jacques

broken plank May 6, 2024, 1:25 PM

#

Also, I apologize

woeful dune May 6, 2024, 1:25 PM

#

huh

#

what for

broken plank May 6, 2024, 1:25 PM

#

I shouldn't have used v for the variance, it's a symbol already in use

#

I wanted to replace $\sigma^2$ with $v$ but $v$ is already in use

idle sierraBOT May 6, 2024, 1:25 PM

#

Rion

woeful dune May 6, 2024, 1:25 PM

#

yeah, got that, used t in my head

broken plank May 6, 2024, 1:26 PM

#

Let's say instead that it's $\nu$

idle sierraBOT May 6, 2024, 1:26 PM

#

Rion

broken plank May 6, 2024, 1:26 PM

#

or t if you want

#

So again:

woeful dune May 6, 2024, 1:26 PM

#

i think i found my mistake

#

let me see if i can write it out

#

second

#

$\sum_{i=1}^n \frac{-(v-rx_i)^2}{\sigma^4} + frac{-n}{2\sigma^2}$

idle sierraBOT May 6, 2024, 1:27 PM

#

Jacques

woeful dune May 6, 2024, 1:28 PM

#

oh shit i fucked the frac up

#

$\sum_{i=1}^n \frac{-(v-rx_i)^2}{\sigma^4} + \frac{-n}{2\sigma^2}$

idle sierraBOT May 6, 2024, 1:28 PM

#

Jacques

woeful dune May 6, 2024, 1:28 PM

#

there

broken plank May 6, 2024, 1:28 PM

#

$g(r, \nu) = \sum_{i=1}^{n} \frac{(v_i - r x_i)^2}{\nu} - \frac{n}{2} (\ln(\nu) + \ln(2\pi))$

#

I think

idle sierraBOT May 6, 2024, 1:28 PM

#

Rion

woeful dune May 6, 2024, 1:28 PM

#

is my solution not correct?

broken plank May 6, 2024, 1:29 PM

#

I just rewrote the original function to optimize for you

woeful dune May 6, 2024, 1:29 PM

#

idle sierra **Jacques**

the fraction is not working idk why

broken plank May 6, 2024, 1:29 PM

#

idle sierra **Jacques**

What's this supposed to be?

broken plank May 6, 2024, 1:29 PM

#

woeful dune the fraction is not working idk why

you forgot the backslash

#

before frac

woeful dune May 6, 2024, 1:29 PM

#

idle sierra **Jacques**

is this not correct

#

the partial derivative wrt to sigma^2

broken plank May 6, 2024, 1:30 PM

#

Yeah, that's good, except for the fact that it's v_i instead of v

#

What about wrt r?

woeful dune May 6, 2024, 1:30 PM

#

idle sierra **Jacques**

this, no?

broken plank May 6, 2024, 1:30 PM

#

idle sierra **Jacques**

Same remark

woeful dune May 6, 2024, 1:30 PM

#

$\sum_{i=1}^n \frac{2x_i(v-rx_i)}{\sigma^2}$

idle sierraBOT May 6, 2024, 1:30 PM

#

Jacques

broken plank May 6, 2024, 1:30 PM

#

Also you're missing a -

woeful dune May 6, 2024, 1:31 PM

#

$\sum_{i=1}^n \frac{-2x_i(v-rx_i)}{\sigma^2}$

idle sierraBOT May 6, 2024, 1:31 PM

#

Jacques

broken plank May 6, 2024, 1:31 PM

#

v_i, not v

#

since it's a target per observation

#

one v_i per x_i

woeful dune May 6, 2024, 1:31 PM

#

$\sum_{i=1}^n \frac{2x_i(v_i-rx_i)}{\sigma^2}$

idle sierraBOT May 6, 2024, 1:31 PM

#

Jacques

woeful dune May 6, 2024, 1:31 PM

#

shit yeah, you're right

#

mb

broken plank May 6, 2024, 1:31 PM

#

Yeah, now the minus

woeful dune May 6, 2024, 1:31 PM

#

$\sum_{i=1}^n \frac{-2x_i(v_i-rx_i)}{\sigma^2}$

idle sierraBOT May 6, 2024, 1:31 PM

#

Jacques

woeful dune May 6, 2024, 1:33 PM

#

$\sum_{i=1}^n \frac{-(v_i-rx_i)^2}{\sigma^4} + \frac{-n}{2\sigma^2}$

idle sierraBOT May 6, 2024, 1:33 PM

#

Jacques

broken plank May 6, 2024, 1:33 PM

#

To recap what you said:
$$\frac{\partial g}{\partial r} (r, \nu) = -\sum_{i=1}^n \frac{2x_i(v_i-rx_i)}{\nu} = 0$$

$$\frac{\partial g}{\partial \nu} (r, \nu) = \sum_{i=1}^n \frac{-(v_i-rx_i)^2}{\nu^2} + \frac{-n}{2\nu} = 0$$

woeful dune May 6, 2024, 1:33 PM

#

yes

idle sierraBOT May 6, 2024, 1:33 PM

#

Rion

broken plank May 6, 2024, 1:33 PM

#

There we go

#

Now, 2 unknowns, 2 equations

#

You need to solve this

#

hint: start by finding the optimum r from the first equation

#

it should be fairly ok

#

then sub in the 2nd

woeful dune May 6, 2024, 1:35 PM

#

okay, give me a moment, i've never solved equations with summation before

#

our math courses were seriously lacking, i am finding

#

for the first one, intuitively $r = \frac{-x_i}{v_i}$

#

ah i have to write the second one out, second

idle sierraBOT May 6, 2024, 1:39 PM

#

Jacques

broken plank May 6, 2024, 1:41 PM

#

idle sierra **Jacques**

Not quite no

woeful dune May 6, 2024, 1:41 PM

#

damn it

broken plank May 6, 2024, 1:42 PM

#

woeful dune our math courses were seriously lacking, i am finding

Don't lose hope

#

I intend to commit to helping you when I first answered this post

#

so it's fine

woeful dune May 6, 2024, 1:42 PM

#

isn't that only 0 when $v_i - rx_i = 0$

idle sierraBOT May 6, 2024, 1:42 PM

#

Jacques

broken plank May 6, 2024, 1:42 PM

#

I will do it a demonstration for you with the first one

#

and you will try with the second one

woeful dune May 6, 2024, 1:42 PM

#

okay thank you

broken plank May 6, 2024, 1:43 PM

#

Here we just split and distribute the sum

#

$$\frac{\partial g}{\partial r} (r, \nu) = -\sum_{i=1}^n \frac{2x_i(v_i-rx_i)}{\nu} = 0$$

idle sierraBOT May 6, 2024, 1:43 PM

#

Rion

broken plank May 6, 2024, 1:44 PM

#

Assuming that we look for solutions $(r, \nu)$ such that $\nu > 0$, then we can maintain equivalence by multiplying by $\nu$

idle sierraBOT May 6, 2024, 1:44 PM

#

Rion

broken plank May 6, 2024, 1:44 PM

#

$\Longleftrightarrow \sum_{i=1}^n x_i(v_i-rx_i)= 0$

woeful dune May 6, 2024, 1:45 PM

#

makes total sense to me, yeah

idle sierraBOT May 6, 2024, 1:45 PM

#

Rion

broken plank May 6, 2024, 1:45 PM

#

Now, we split the sum

#

$\Longleftrightarrow \sum_{i=1}^n (x_i v_i-rx_i^2)= 0$

idle sierraBOT May 6, 2024, 1:45 PM

#

Rion

woeful dune May 6, 2024, 1:46 PM

#

then we split on both sides

broken plank May 6, 2024, 1:46 PM

#

$\Longleftrightarrow \sum_{i=1}^n x_i v_i - \sum_{i=1}^n rx_i^2= 0$

idle sierraBOT May 6, 2024, 1:46 PM

#

Rion

woeful dune May 6, 2024, 1:46 PM

#

move to the other side

#

sorry, i am saying this because it helps me follow

broken plank May 6, 2024, 1:47 PM

#

$\Longleftrightarrow \sum_{i=1}^n rx_i^2= \sum_{i=1}^n x_i v_i$

idle sierraBOT May 6, 2024, 1:47 PM

#

Rion

broken plank May 6, 2024, 1:47 PM

#

Now, see that the r doesn't depend on the sum index

#

we can factorize it

#

$\Longleftrightarrow r \sum_{i=1}^n x_i^2= \sum_{i=1}^n x_i v_i$

idle sierraBOT May 6, 2024, 1:47 PM

#

Rion

broken plank May 6, 2024, 1:48 PM

#

Finally:

#

$\Longleftrightarrow r = \left( \sum_{i=1}^n x_i^2 \right)^{-1}\sum_{i=1}^n x_i v_i$

idle sierraBOT May 6, 2024, 1:48 PM

#

Rion

broken plank May 6, 2024, 1:49 PM

#

For a critical point $(r, \nu)$ such that $\nabla g(r, \nu) = 0$, it necessarily holds that $r$ has the above expression

idle sierraBOT May 6, 2024, 1:49 PM

#

Rion

broken plank May 6, 2024, 1:50 PM

#

Conversely (since we only used $\Longleftrightarrow$), that expression of $r$ will provide $\frac{\partial g}{\partial r}(r, \nu) = 0$

idle sierraBOT May 6, 2024, 1:50 PM

#

Rion

woeful dune May 6, 2024, 1:51 PM

#

can't we simplify that to $r= \frac{v_i}{x_i}$

#

?

idle sierraBOT May 6, 2024, 1:51 PM

#

Jacques

broken plank May 6, 2024, 1:51 PM

#

No

#

Here it is not clear what is i

#

so your expression does not make sense

#

here the i that I used belongs to the summation

woeful dune May 6, 2024, 1:52 PM

#

so I need to use that entire thing you wrote out in the second one?

#

holy madlad

broken plank May 6, 2024, 1:52 PM

#

$r = \frac{x_1 v_1 + ... + x_n v_n}{x_1^2 + ...x_n^2}$

idle sierraBOT May 6, 2024, 1:53 PM

#

Rion

broken plank May 6, 2024, 1:53 PM

#

woeful dune so I need to use that entire thing you wrote out in the second one?

It's a long expression yeah

woeful dune May 6, 2024, 1:53 PM

#

jesus

#

okay

broken plank May 6, 2024, 1:53 PM

#

But it's not the most complicated estimator of the slope

#

MLE estimator is fairly simple

#

in comparison to other ones

#

if there are any

#

Anyway, use my example to try to figure out the expression of $\nu$

idle sierraBOT May 6, 2024, 1:54 PM

#

Rion

broken plank May 6, 2024, 1:54 PM

#

Since now you can just substitute $r$

idle sierraBOT May 6, 2024, 1:54 PM

#

Rion

woeful dune May 6, 2024, 1:54 PM

#

considering we covered Monte Carlo Simulation, Data Analysis, PCA and Multilinear Regression in 4 days, i am sure there are more we've missed in our lectures

broken plank May 6, 2024, 1:55 PM

#

sounds like data science to me

woeful dune May 6, 2024, 1:55 PM

#

it is

broken plank May 6, 2024, 1:55 PM

#

or stats i guess

woeful dune May 6, 2024, 1:55 PM

#

data science

#

but doing the entire thing in 4 days was pretty stupid

broken plank May 6, 2024, 1:55 PM

#

Then you gotta up your calc because monte carlo is going to hurt

#

pca is not too hard

woeful dune May 6, 2024, 1:56 PM

#

i know madlad

#

oh trust me, i know

#

sigh

broken plank May 6, 2024, 1:56 PM

#

Anyway, linear regression is one of the simplest models that can be demanded of you if you ever do a data scientist interview

#

so it'd be really beneficial for you to really understand that

#

Since I don't know if I will be here in a couple of hours, I will give you the answer so that you can check your MLE estimators

#

$\hat{r} = \left( \sum_{i=1}^n x_i^2 \right)^{-1}\sum_{i=1}^n x_i v_i$

idle sierraBOT May 6, 2024, 1:59 PM

#

Rion

broken plank May 6, 2024, 2:00 PM

#

$\hat{\nu} = \frac{1}{n}\sum_{i=1}^{n} (v_i - \hat{r} x_i)^2$

idle sierraBOT May 6, 2024, 2:00 PM

#

Rion

woeful dune May 6, 2024, 2:00 PM

#

shouldn't there be a 2

broken plank May 6, 2024, 2:00 PM

#

No, pretty sure it's this

#

I cross checked with internet

woeful dune May 6, 2024, 2:00 PM

#

i trust you with my entire life

broken plank May 6, 2024, 2:01 PM

#

You can try to carry out the computations

#

And see if you land the same result

woeful dune May 6, 2024, 2:01 PM

#

I landed on

#

$\hat{\nu} = \frac{2}{n}\sum_{i=1}^{n} (v_i - \hat{r} x_i)^2$

idle sierraBOT May 6, 2024, 2:01 PM

#

Jacques

woeful dune May 6, 2024, 2:02 PM

#

but i will take your word for it

#

So bear with me for just 5 minutes, we just found question 1 right

#

just to confirm (I am losing my mind)

broken plank May 6, 2024, 2:03 PM

#

woeful dune I landed on

Hmm

#

Well I will check again just to be sure

#

Ah, ok, I see the problem

#

We were asked to estimate $\sigma$, not $\sigma^2$

idle sierraBOT May 6, 2024, 2:04 PM

#

Rion

broken plank May 6, 2024, 2:04 PM

#

The answer I gave you was the estimate of sigma, which I squared

#

So, that's my bad

woeful dune May 6, 2024, 2:05 PM

#

so the asnwer is everything you had/ sqrt?

broken plank May 6, 2024, 2:05 PM

#

Yes, but we also wrote the wrong equations

#

Though you can keep my answer for r, it's correct

woeful dune May 6, 2024, 2:05 PM

#

oh fuck

broken plank May 6, 2024, 2:05 PM

#

Don't worry

#

it's a quick fix

#

$$\frac{\partial g}{\partial r} (r, \sigma) = -\sum_{i=1}^n \frac{2x_i(v_i-rx_i)}{\sigma^2} = 0$$

$$\frac{\partial g}{\partial \sigma} (r, \sigma) = \sum_{i=1}^n \frac{-2(v_i-rx_i)^2}{\sigma^3} - \frac{-n}{\sigma} = 0$$

woeful dune May 6, 2024, 2:07 PM

#

oh my first solution was correct!

idle sierraBOT May 6, 2024, 2:07 PM

#

Rion

woeful dune May 6, 2024, 2:07 PM

#

massive win for me, this is my first win in life

broken plank May 6, 2024, 2:07 PM

#

Yeah, my bad, I messed up because I thought it was the estimation of sigma²

woeful dune May 6, 2024, 2:07 PM

#

please never apologize

#

again

#

you're my savior, my new christ

broken plank May 6, 2024, 2:07 PM

#

Though it still doesn't solve the mystery of the hanging 2

#

when differentiating 1/sigma²

woeful dune May 6, 2024, 2:07 PM

#

no, now it does

#

because in the other one, we had n/2sigma

#

right?

broken plank May 6, 2024, 2:08 PM

#

because the 2 just appeared in the sum instead

woeful dune May 6, 2024, 2:08 PM

#

oh shit

#

oh you're right

broken plank May 6, 2024, 2:08 PM

#

since differentiating 1/sigma² wrt sigma

#

gives -2/sigma^3

woeful dune May 6, 2024, 2:08 PM

#

yeah yeah

#

maybe the 2 is just supposed to be there in this case, idk?

broken plank May 6, 2024, 2:09 PM

#

Hmmm

broken plank May 6, 2024, 2:10 PM

#

woeful dune maybe the 2 is just supposed to be there in this case, idk?

That's unlikely

#

Ah no I know why

#

It's because I'm a moron

woeful dune May 6, 2024, 2:12 PM

#

idle sierra **Rion**

maybe youshouldn't have used sigma square here?

broken plank May 6, 2024, 2:12 PM

#

broken plank Hence the conditional log likelyhood of V is given by $\ln f_{V|X,r, \sigma^2}(v...

Here

#

It's supposed to be, divided by 2 sigma²

#

so the 2 cancels out

#

again, it doesn't affect our reasoning with the finding of r

woeful dune May 6, 2024, 2:13 PM

#

why divided by 2sigma^2

#

or is just the loglikelyhood

broken plank May 6, 2024, 2:13 PM

#

For that, I'll need a bit of backtracking

#

You know that an observation $V_i$ follows, conditionally to $X_i$, a normal distribution

idle sierraBOT May 6, 2024, 2:13 PM

#

Rion

broken plank May 6, 2024, 2:13 PM

#

Except all $X_i$ are iid in our hypothesis

idle sierraBOT May 6, 2024, 2:13 PM

#

Rion

woeful dune May 6, 2024, 2:13 PM

#

yeah

broken plank May 6, 2024, 2:14 PM

#

So the distribution of $V = (V_1, ..., V_n)$ is

idle sierraBOT May 6, 2024, 2:14 PM

#

Rion

broken plank May 6, 2024, 2:15 PM

#

a product

#

since they're all independent

woeful dune May 6, 2024, 2:16 PM

#

okay

#

question

#

when given to find sigma^2, shouldn't the MLE be the biased sample variance

#

just asking, thats what i read online

broken plank May 6, 2024, 2:17 PM

#

woeful dune when given to find sigma^2, shouldn't the MLE be the biased sample variance

yes

#

indeed

#

Well, the thing is

#

Since you have a product

woeful dune May 6, 2024, 2:18 PM

#

okay, so when asked to find sigma^2 in the Gaussian case, always the biased smaple vairance

#

what will be the result in our case then

broken plank May 6, 2024, 2:19 PM

#

The distribution of the $\epsilon_i$ will be both:

idle sierraBOT May 6, 2024, 2:19 PM

#

Rion

woeful dune May 6, 2024, 2:19 PM

#

will it just be the biased sample variance squared root/

broken plank May 6, 2024, 2:20 PM

#

$f_{\epsilon}(v - r x) = \prod_{i=1}^{n} f_{\epsilon_{i}}(v_i - rx_i)$

#

Where $f_{\epsilon_i}$ is the cdf of a normal distribution of mean $0$ and variance $\sigma^2$

idle sierraBOT May 6, 2024, 2:22 PM

#

Rion

broken plank May 6, 2024, 2:22 PM

#

So that explains why when you take the log

idle sierraBOT May 6, 2024, 2:23 PM

#

Rion

broken plank May 6, 2024, 2:23 PM

#

$g(r, \sigma) = \sum_{i=1}^{n} \ln f_{\epsilon_i}(v_i - rx_i)$

idle sierraBOT May 6, 2024, 2:23 PM

#

Rion

woeful dune May 6, 2024, 2:24 PM

#

okay

#

i am with you so far

broken plank May 6, 2024, 2:25 PM

#

$g(r, \sigma) = \sum_{i=1}^{n} \ln f_{\epsilon_i}(v_i - rx_i) = \sum_{i=1}^{n} \ln \left( \frac{1}{\sqrt{2\pi \sigma^2}} \exp\left( \frac{(v_i - r x_i)^2}{2\sigma^2}\right)\right)$

#

And by using the properties of the log

#

$g(r, \sigma) = \sum_{i=1}^{n} \left( - \frac{1}{2} \ln(2 \pi) - \ln \sigma + \frac{(v_i - rx_i)^2}{2\sigma^2}\right)$

idle sierraBOT May 6, 2024, 2:26 PM

#

Rion

woeful dune May 6, 2024, 2:27 PM

#

okay so thats where we were last

#

then we derived

idle sierraBOT May 6, 2024, 2:27 PM

#

Rion

broken plank May 6, 2024, 2:27 PM

#

So here, if you take out the terms and group them

#

you end up with

#

$g(r, \sigma) = - \frac{n}{2} \ln(2\pi) - n \ln \sigma + \sum_{i=1}^{n} \frac{(v_i - r x_i)^2}{2\sigma^2}$

idle sierraBOT May 6, 2024, 2:28 PM

#

Rion

broken plank May 6, 2024, 2:28 PM

#

So this is the function to optimize

#

called the log-likelihood

#

since it's literally the log of the likelihood

woeful dune May 6, 2024, 2:29 PM

#

gotcha, so just to finish (iv'e been following so far)

#

we got r

broken plank May 6, 2024, 2:29 PM

#

Yep

#

We know how to find the r that maximizes the likelihood

woeful dune May 6, 2024, 2:29 PM

#

now to get sigma, is is just the biased sample variacne sqrt?

#

or is it actually different

broken plank May 6, 2024, 2:29 PM

#

Using similar computations, yes

#

you find that it is the biased sample variance

woeful dune May 6, 2024, 2:29 PM

#

thats for sigma^2

#

not for sigma tho

#

but sigma should be just that sqrt ig

broken plank May 6, 2024, 2:29 PM

#

I think it's the same thing, let me check

#

$\frac{\partial g}{\partial \sigma} (r, \sigma) = 0 = -\frac{n}{\sigma} + \sum_{i=1}^{n} \frac{-2 (v_i - rx_i)^2}{2 \sigma^3}$

idle sierraBOT May 6, 2024, 2:30 PM

#

Rion

broken plank May 6, 2024, 2:31 PM

#

So grouping stuff together, we find that:

#

Argh I did a mistake again

#

I'm so mad

woeful dune May 6, 2024, 2:32 PM

#

thats okay dw

#

rion can i ask you something else real quick

#

its still about this exercise

broken plank May 6, 2024, 2:33 PM

#

The $f_{\epsilon_i}(t) = \frac{1}{\sqrt{2 \pi \sigma^2}} \exp\left( -\frac{t^2}{2 \sigma^2}\right)$ with a minus in the exponential

idle sierraBOT May 6, 2024, 2:33 PM

#

Rion

broken plank May 6, 2024, 2:33 PM

#

so yeah you should find what you want

#

and sure do ask

woeful dune May 6, 2024, 2:33 PM

#

when it says find the confidence interval, how the shit would I approach that?

broken plank May 6, 2024, 2:33 PM

#

$g(r, \sigma) = - \frac{n}{2} \ln(2\pi) - n \ln \sigma - \sum_{i=1}^{n} \frac{(v_i - r x_i)^2}{2\sigma^2}$

idle sierraBOT May 6, 2024, 2:33 PM

#

Rion

broken plank May 6, 2024, 2:33 PM

#

woeful dune when it says find the confidence interval, how the shit would I approach that?

For what?

#

The estimators?

woeful dune May 6, 2024, 2:33 PM

#

yes

#

for r

broken plank May 6, 2024, 2:34 PM

#

$\hat{r} = \left( \sum_{i=1}^{n} x_i^2 \right) \sum_{i=1}^{n} x_i v_i$

idle sierraBOT May 6, 2024, 2:34 PM

#

Rion

broken plank May 6, 2024, 2:35 PM

#

$\hat{\sigma}^2 = \frac{1}{n} \sum_{i=1}^{n} (v_i - \hat{r} x_i)^2$

#

To think about confidence intervals

#

You should think of whether $\hat{r}$ is far from $r$

idle sierraBOT May 6, 2024, 2:35 PM

#

Rion

broken plank May 6, 2024, 2:36 PM

#

Considering that your $x_i$ and $v_i$ are random variables

idle sierraBOT May 6, 2024, 2:36 PM

#

Rion

broken plank May 6, 2024, 2:36 PM

#

then $\hat{r}$ and $\hat{\sigma}$ are also random variables

idle sierraBOT May 6, 2024, 2:36 PM

#

Rion

broken plank May 6, 2024, 2:36 PM

#

So 1. what distribution do they follow?

#

Can you deduce a confidence interval?

woeful dune May 6, 2024, 2:36 PM

#

normal distribution

#

we would need to use the student table with t

broken plank May 6, 2024, 2:37 PM

#

I'm not exactly sure if $\hat{r}$ is a student rv

idle sierraBOT May 6, 2024, 2:37 PM

#

Rion

woeful dune May 6, 2024, 2:38 PM

#

is it not?

#

since they follow normal dist, i thought they'd follow student

broken plank May 6, 2024, 2:39 PM

#

For the $\hat{\sigma}^2$

idle sierraBOT May 6, 2024, 2:39 PM

#

Rion

broken plank May 6, 2024, 2:39 PM

#

I can see it being the case

idle sierraBOT May 6, 2024, 2:39 PM

#

Rion

woeful dune May 6, 2024, 2:39 PM

#

fischer, then?

broken plank May 6, 2024, 2:40 PM

#

But then I don't have absolute guarantees

woeful dune May 6, 2024, 2:40 PM

#

what the fuck then

#

i am so stupid istg

broken plank May 6, 2024, 2:40 PM

#

Let me think for a sec

woeful dune May 6, 2024, 2:44 PM

#

Give a confidence interval for r of significance level α, with σ
2 = 1.

#

this is the question

#

thats sigma^2

broken plank May 6, 2024, 2:45 PM

#

For $\hat{r}$, conditionally to the $x_i$'s, maybe you can get away with some things

idle sierraBOT May 6, 2024, 2:45 PM

#

Rion

broken plank May 6, 2024, 2:46 PM

#

So if you consider the x_i constant (and not random)

woeful dune May 6, 2024, 2:46 PM

#

mhmm

broken plank May 6, 2024, 2:46 PM

#

idle sierra **Rion**

Mainly, here

#

You can consider the x_i constant

#

but the v_i are normal

#

N(r x_i, sigma²)

woeful dune May 6, 2024, 2:47 PM

#

okay

broken plank May 6, 2024, 2:47 PM

#

so divided by the sum of squares

#

$x_i v_i \sim \mathcal{N} \left( r x_i^2, x_i^2 \sigma^2 \right)$

idle sierraBOT May 6, 2024, 2:49 PM

#

Rion

woeful dune May 6, 2024, 2:51 PM

#

ok

broken plank May 6, 2024, 2:51 PM

#

And they are all independent

woeful dune May 6, 2024, 2:51 PM

#

mhmm

broken plank May 6, 2024, 2:51 PM

#

so the sum is still a normal distribution

woeful dune May 6, 2024, 2:51 PM

#

so it follows student then?

broken plank May 6, 2024, 2:52 PM

#

No, as said it is a normal distribution

#

$\hat{r} \sim \mathcal{N}(..., ...)$

idle sierraBOT May 6, 2024, 2:52 PM

#

Rion

broken plank May 6, 2024, 2:52 PM

#

we just gotta fill the blanks

woeful dune May 6, 2024, 2:52 PM

#

thats

#

z value

#

right

broken plank May 6, 2024, 2:53 PM

#

Well, only if the parameters are 0 and 1

#

but they're probably not

#

Let's check

#

anyhow, for a sum of independent normals, it follows a normal

#

which mean is the sum of means

#

and the variance is the sum of variances

#

$\sum_{i=1}^{n} x_i v_i \sim \mathcal{N}(r \sum_{i=1}^{n} x_i^2, \sigma^2 \sum_{i=1}^{n} x_i^2)$

idle sierraBOT May 6, 2024, 2:54 PM

#

Rion

broken plank May 6, 2024, 2:54 PM

#

Now, if you divide that by the sum of xi²

woeful dune May 6, 2024, 2:55 PM

#

1 and sigma^2

#

r and sigma&^2

broken plank May 6, 2024, 2:55 PM

#

$\hat{r} \sim \mathcal{N}\left(r, \sigma^2 \frac{1}{\sum_{i=1}^n x_i^2} \right)$

idle sierraBOT May 6, 2024, 2:55 PM

#

Rion

broken plank May 6, 2024, 2:56 PM

#

So now you know the distribution of $\hat{r}$

idle sierraBOT May 6, 2024, 2:56 PM

#

Rion

broken plank May 6, 2024, 2:56 PM

#

you can do your appropriate tests

woeful dune May 6, 2024, 2:56 PM

#

makes sense

#

thank you so much!

#

i already nominated you once, idk if i can twice

#

you're amazing

broken plank May 6, 2024, 2:57 PM

#

woeful dune you're amazing

I mean, we still haven't done shit about the other one

#

Which is at least ten times more difficult

#

because I think we need a theorem or something

woeful dune May 6, 2024, 2:58 PM

#

which other one

#

the third one?

broken plank May 6, 2024, 2:58 PM

#

No, the distribution of $\hat{\sigma}^2$

idle sierraBOT May 6, 2024, 2:58 PM

#

Rion

broken plank May 6, 2024, 2:58 PM

#

This one is pretty hardcore to prove and to be completely honest I don't have the toolbox to prove it

#

or at least it needs quite elaborate thinking

woeful dune May 6, 2024, 2:59 PM

#

honestly, between you and I, I don't think I have the toolbox to understand it even if you did

#

I am genuinely thankful for your help so far, this is way more than i knw before so

#

exam is tomorrow anyway, this is a joke

#

4 day to finish a data science course my uni went crazy

broken plank May 6, 2024, 3:01 PM

#

I don't have a good proof

#

But $\frac{n \hat{\sigma}^2}{\sigma}$ follows a $\chi^2_{n-1}$

idle sierraBOT May 6, 2024, 3:02 PM

#

Rion

broken plank May 6, 2024, 3:11 PM

#

https://online.stat.psu.edu/stat415/lesson/7/7.5

PennState: Statistics Online Courses

7.5 - Confidence Intervals for Regression Parameters | STAT 415

Enroll today at Penn State World Campus to earn an accredited degree or certificate in Statistics.

#

@woeful dune Check the book they mention for the proof

woeful dune May 6, 2024, 3:12 PM

#

will do, thank you so much

broken plank May 6, 2024, 3:14 PM

#

Wait I'm a moron

#

they never asked for the confidence interval of $\hat{\sigma}$

idle sierraBOT May 6, 2024, 3:14 PM

#

Rion

broken plank May 6, 2024, 3:14 PM

#

4am

woeful dune May 6, 2024, 3:14 PM

#

madlad

#

i wont lie, i didnt notice either

#

so really, i am the bigger moron

broken plank May 6, 2024, 3:14 PM

#

Well good thing they don't

#

I can't prove it

#

chad

woeful dune May 6, 2024, 3:15 PM

#

lmao

#

great

broken plank May 6, 2024, 3:16 PM

#

Anyway, a list of takeaways for you for your exam tomorrow

#

knowing how to compute a MLE is kind of really important

woeful dune May 6, 2024, 3:16 PM

#

mhmm

broken plank May 6, 2024, 3:16 PM

#

So first takeaway

#

the likelihood is just a product of likelihoods when you have iid samples

#

which justifies taking the log likelihood, since you go from product to sum

woeful dune May 6, 2024, 3:17 PM

#

ok

broken plank May 6, 2024, 3:17 PM

#

second takeaway:
the parameters that maximize the log-likelihood also maximize the likelihood

#

so that also justifies using the log-likelihood instead of the likelihood

#

third takeaway:
you need multivariate differential calculus to compute optima

#

just like how you take the derivative of a function to check where it's either max or min

woeful dune May 6, 2024, 3:19 PM

#

yeah, makes sense

#

and i guess pray to god

broken plank May 6, 2024, 3:20 PM

#

don't

#

i speak from experience, that doesn't work

#

i can't explain my losing streak in gacha games with statistics because i'm already outside the interval of confidence

#

chad

woeful dune May 6, 2024, 3:22 PM

#

lmao

#

i will pray regardless

#

tbh i am pretty terrified

broken plank May 6, 2024, 3:22 PM

#

Why

woeful dune May 6, 2024, 3:22 PM

#

#

#

this is what they had last year

#

I've solved everything but 1. b part 2. 4

broken plank May 6, 2024, 3:24 PM

#

1b?

#

the monte carlo thing?

woeful dune May 6, 2024, 3:25 PM

#

yes

#

i didnt get that

broken plank May 6, 2024, 3:25 PM

#

I think they just ask you to sample a bunch of x_i, and use the monte carlo estimator of the mean

woeful dune May 6, 2024, 3:26 PM

#

i dont know tbh

broken plank May 6, 2024, 3:26 PM

#

well you know what a monte carlo estimator is right

woeful dune May 6, 2024, 3:26 PM

#

yes

#

i mean i know the algorithm

broken plank May 6, 2024, 3:26 PM

#

Yeah

woeful dune May 6, 2024, 3:27 PM

#

this case its a discrete one, so i was thinking of just asnwering that with the theoretical thing

#

like just check h(x) + h(x) + h(x) all over n, until it becomes bigger than the interval, but i dont think thats correct

broken plank May 6, 2024, 3:28 PM

#

Well it is possible to compute E[h(X)] exactly using the transfer theorem

#

but they want an approximation, not an exact value

#

How did you define monte carlo methods in your lecture

woeful dune May 6, 2024, 3:28 PM

#

uh

#

do you want the file

#

its a powerpoint

broken plank May 6, 2024, 3:29 PM

#

just send me pictures of it here

#

parts you think are relevant

woeful dune May 6, 2024, 3:29 PM

#

#

#

#

woeful dune May 6, 2024, 3:30 PM

#

woeful dune

we did answered that algorithm question in this one , massive rip

#

and then for the continous rv, we have explanations with the inverse and rejection methods

#

#

#

and thats it

broken plank May 6, 2024, 3:35 PM

#

woeful dune we did answered that algorithm question in this one , massive rip

can't you just use that as an example then

woeful dune May 6, 2024, 3:36 PM

#

we didn't*

#

he just skipped over it lol

broken plank May 6, 2024, 3:36 PM

#

Ah ok

#

Well I mean

#

You can still do something

#

First of all, you can estimate $\mathbb{E}[Y]$ with an empirical mean of iid samples that follow the distribution of $Y$

idle sierraBOT May 6, 2024, 3:37 PM

#

Rion

broken plank May 6, 2024, 3:38 PM

#

So $\frac{1}{n} \sum_{i=1}^{n} Y_i$

idle sierraBOT May 6, 2024, 3:38 PM

#

Rion

broken plank May 6, 2024, 3:38 PM

#

So now I guess what you need is to find how to generate $Y_i$

idle sierraBOT May 6, 2024, 3:38 PM

#

Rion

broken plank May 6, 2024, 3:39 PM

#

This is not all that hard

#

You generate $X_i$ with said distribution

idle sierraBOT May 6, 2024, 3:40 PM

#

Rion

broken plank May 6, 2024, 3:40 PM

#

And you just compute $Y_i = h(X_i)$

idle sierraBOT May 6, 2024, 3:40 PM

#

Rion

broken plank May 6, 2024, 3:40 PM

#

And here, to generate $X_i$, you use the method they show in the slides

idle sierraBOT May 6, 2024, 3:40 PM

#

Rion

broken plank May 6, 2024, 3:40 PM

#

@woeful dune

#

So you cut the interval [0, 1] into bins

#

of size p1, p2, ..., pk

woeful dune May 6, 2024, 3:41 PM

#

so basically, just choose the other stuff, so instead of .32, maybe .5

broken plank May 6, 2024, 3:41 PM

#

(which should sum up to 1)

woeful dune May 6, 2024, 3:41 PM

#

and compute different stuff?

broken plank May 6, 2024, 3:41 PM

#

you generate a uniform rv

#

and you see in which bin it falls into

#

and you take the corresponding value

woeful dune May 6, 2024, 3:41 PM

#

#

so this exam is written

#

since i can't exactly generate

#

should i just take random values

#

over and over

broken plank May 6, 2024, 3:42 PM

#

No, here they ask you to create a scheme

#

so a method

#

you don't use the method but you explain that you can

woeful dune May 6, 2024, 3:42 PM

#

so basically write out an explanation?

broken plank May 6, 2024, 3:42 PM

#

That's what a scheme is isn't it?

#

Just a theoretical framework

#

Though then I don't quite understand the error part

#

the precision I mean

#

But I assume that it has to do with the number of samples you create, which is n

woeful dune May 6, 2024, 3:44 PM

#

something like

rand U
X ~ U(w)
Find h(x)
Keep doing this until E[Summation H(x)] > e

#

broken plank May 6, 2024, 3:45 PM

#

The standard deviation of $\bar{Y}n = \frac{1}{n}\sum{i=1}Y_i$ is $\frac{V(Y1)}{n^2}$

idle sierraBOT May 6, 2024, 3:45 PM

#

Rion

broken plank May 6, 2024, 3:45 PM

#

Ah yeah that's fine

#

Using the Chebyshev inequality

woeful dune May 6, 2024, 3:46 PM

#

so basically I should say

#

as long as theta < Var [h(x)]/ ne^2, keep generating

#

got it

broken plank May 6, 2024, 3:47 PM

#

Well, essentially, the idea is

#

you need $n$ to be large enough

idle sierraBOT May 6, 2024, 3:47 PM

#

Rion

broken plank May 6, 2024, 3:47 PM

#

But the question asks for how large

woeful dune May 6, 2024, 3:47 PM

#

yeah, thats question c

broken plank May 6, 2024, 3:48 PM

#

Oh ok

#

I didn't see

#

yeah my bad

woeful dune May 6, 2024, 3:49 PM

#

yeah question c is eaasier since he basically tells us to use a C

broken plank May 6, 2024, 3:49 PM

#

Just keep in mind

#

I think you are expected to compute Var(h(X1))

#

or rather Var(h(X))

#

but you can do that by hand

woeful dune May 6, 2024, 3:50 PM

#

how would i even compute var(h(x)) in that case

broken plank May 6, 2024, 3:50 PM

#

It's just E[(h(X) - E[h(X)])²], and given that X is discrete with just 5 or 6 values

#

it's not a long computation

woeful dune May 6, 2024, 3:51 PM

#

-2, -1, 0, 2, 3

#

that would be what, E[h(x) - E[h(x)^2]

#

let me ask you the dumbest question yet

#

how would I find the E of it for only one value

#

madlad

broken plank May 6, 2024, 3:52 PM

#

wdym

#

for a discrete rv X, the expectation of X is just the sum of x_i p_i

#

where p_i is P(X = x_i)

#

and using the transfer theorem

#

the expectation of h(X) is the sum of h(x_i) p_i

#

$E[h(X)] = \sum_{i=1}^{K} h(x_i) P(X = x_i)$

idle sierraBOT May 6, 2024, 3:54 PM

#

Rion

broken plank May 6, 2024, 3:54 PM

#

and here you have like 5 values

#

it's just a sum of 5 terms

#

likewise $E[h(X)^2] = \sum_{i=1}^{K} h(x_i)^2 P(X = x_i)$

idle sierraBOT May 6, 2024, 3:55 PM

#

Rion

woeful dune May 6, 2024, 3:55 PM

#

then what is X - E[h(X)^2]

broken plank May 6, 2024, 3:57 PM

#

The variance of a rv Y is the expectation of (Y - E[Y])²

#

so E[(Y - E[Y])²]

#

keep in mind that E[Y] is a number here

woeful dune May 6, 2024, 3:57 PM

#

in that, case, what do I substitute for Y

#

like

#

one of the values?

broken plank May 6, 2024, 3:58 PM

#

There is a formula that says that V(Y) = E[Y²] - E[Y]²

#

so you can compute those if it's easier for you

broken plank May 6, 2024, 3:58 PM

#

woeful dune in that, case, what do I substitute for Y

Y = h(X)

woeful dune May 6, 2024, 3:58 PM

#

and then the mean of that, will be found how?

#

so its number - number

#

its E[number] which means what exactly

broken plank May 6, 2024, 3:59 PM

#

Y is a random variable

#

E[Y] is a number (or, I guess, in perfectly accurate terms, a constant random variable that is always equal to the same number)

woeful dune May 6, 2024, 4:00 PM

#

broken plank There is a formula that says that V(Y) = E[Y²] - E[Y]²

this formula seems easier

#

so in my case i would do var (h(x)) = E[h(x^2)) - E[h(x)]^2

broken plank May 6, 2024, 4:00 PM

#

No

woeful dune May 6, 2024, 4:00 PM

#

oh

broken plank May 6, 2024, 4:00 PM

#

E[h(X)²] - E[h(X)]²

#

it's h(X) that is squared in the first one

#

not h(X²)

woeful dune May 6, 2024, 4:01 PM

#

gotcha

#

thats way esier

#

in the least romantic terms, i could kiss you right now

broken plank May 6, 2024, 4:02 PM

#

I will pass on that but I appreciate the gratitude

woeful dune May 6, 2024, 4:02 PM

#

can I give you a nitro as thanks or is that against the rules

broken plank May 6, 2024, 4:03 PM

#

I don't know, I mean I don't think anyone will complain if you message me privately

#

I complain when people ask me for help in private, because I don't like it and the rules explicitly say not to do so

#

I didn't help you to receive anything in return though

woeful dune May 6, 2024, 4:05 PM

#

meh, after days of asking for help in math discords, finally someone who could actually help me (and somehow had fucking infinite patience)

#

plus, i wouldn't be surprised if i asked another question in the future, so whatever

#

gratitude is good with words, but i like sending gifts too

#

shit, ny card isn't working

#

i will DM you it later, thank you so much again @broken plank

crisp rockBOT May 6, 2024, 4:07 PM

#

@woeful dune has given 1 rep to @broken plank

broken plank May 6, 2024, 4:11 PM

#

woeful dune i will DM you it later, thank you so much again <@276638393923010560>

sure we can talk later

#

I'm helping someone else with another q

woeful dune May 7, 2024, 4:14 AM

#

@broken plank shit, whenever you see this, we never actually get the estimators to the end and i am lost on how to do it

#

if you see this and write it out, thank you

broken plank May 7, 2024, 5:22 AM

#

woeful dune <@276638393923010560> shit, whenever you see this, we never actually get the est...

Wdym?

woeful dune May 7, 2024, 5:33 AM

#

broken plank Wdym?

We only partiall derived by sigma

#

But never actually got what sigma is

broken plank May 7, 2024, 5:34 AM

#

We did

#

We got both estimators

#

The MLE estimator of sigma is the biased empirical variance of the samples

woeful dune May 7, 2024, 5:35 AM

#

Isn't-that sigma^2

#

This is only sigma

broken plank May 7, 2024, 5:35 AM

#

Ah yeah mb

#

Well the square root of that

woeful dune May 7, 2024, 5:35 AM

#

Okay thank you

#

30 hours of no sleep and I couldnt find where we did that

#Help with MLE Estimators