L29 | Coconote

Hello and welcome to today's lecture. In the last two classes, we discussed the central limit theorem. It is one of the most important theorems, because it establishes a link between the theory of probability and statistical inference. The most powerful statement of the central limit theorem is that, even for non-normal populations, the sampling distributions of essential statistics such as the mean or the sum of random variables are approximately normal if the sample size is large.

So, in the last class we discussed two things. First, if I define $X = \sum_{i=1}^{n} x_i$, where the $x_i$ are independent and each $x_i$ has mean $\mu$ and variance $\sigma^2$, then for large $n$ the sum $X$ follows approximately a normal distribution with mean $n\mu$ and variance $n\sigma^2$. We can convert this normal variable to a standard one by defining $z = (X - n\mu)/(\sigma\sqrt{n})$. So, $z$ gives us a standard normal variable.
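The statement about the standardized sum can be checked with a small simulation. This is only a sketch under assumptions that are not in the lecture: the population is taken to be Uniform(0, 1), which is clearly non-normal, and the function name `standardized_sums`, the sample size of 30, and the seed are all illustrative choices.

```python
import random
import statistics

def standardized_sums(n, trials, seed=0):
    """Draw `trials` sums of n Uniform(0, 1) variables and standardize each one."""
    rng = random.Random(seed)
    mu = 0.5                 # mean of Uniform(0, 1)
    sigma = (1 / 12) ** 0.5  # standard deviation of Uniform(0, 1)
    zs = []
    for _ in range(trials):
        x = sum(rng.random() for _ in range(n))        # X = sum of the x_i
        zs.append((x - n * mu) / (sigma * n ** 0.5))   # z = (X - n*mu) / (sigma*sqrt(n))
    return zs

zs = standardized_sums(n=30, trials=20000)
z_mean = statistics.fmean(zs)       # should be close to 0
z_var = statistics.pvariance(zs)    # should be close to 1
frac_within = sum(abs(z) <= 1.96 for z in zs) / len(zs)  # should be close to 0.95
print(z_mean, z_var, frac_within)
```

If the central limit theorem applies, the standardized values should have a mean near 0, a variance near 1, and roughly 95 percent of them should fall between -1.96 and 1.96, even though the individual draws are uniform rather than normal.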

A standard normal variable is one whose mean is 0 and whose variance is 1. Similarly, if I define $\bar{x}$ as the simple arithmetic mean, $\bar{x} = \sum x_i / n$, then we found that $z = (\bar{x} - \mu)/(\sigma/\sqrt{n})$ is a standard normal variable. Now, what can we say about the sample variance as another metric?

So, how do we define the sample variance? Suppose $x_1, x_2, \ldots, x_n$ is a random sample from a distribution with mean $\mu$ and variance $\sigma^2$.

I will define my sample variance as $S^2 = \sum (x_i - \bar{x})^2 / (n-1)$. So, can I say anything about the link between the sample variance and the population-level variance? Essentially, the question is how $S^2$ and $\sigma^2$, the population variance, are related. Since $S^2$ is defined as $\sum (x_i - \bar{x})^2 / (n-1)$, I can write $(n-1)S^2 = \sum (x_i - \bar{x})^2$.

Now, this I can expand, and as we have done before, it can be shown that it comes out to be $\sum x_i^2 - n\bar{x}^2$. So, I can take the expectation on both sides of this equation and write $(n-1)E[S^2] = E\left[\sum x_i^2\right] - nE[\bar{x}^2]$.

Now, we know that the variance of $x$ is defined as $\mathrm{Var}(x) = E[x^2] - (E[x])^2$. From this I can write $E[x^2] = \mathrm{Var}(x) + (E[x])^2$. Also, because the expectation of a sum is the sum of the expectations, the first term is $\sum E[x_i^2]$, so the equation becomes $(n-1)E[S^2] = \sum E[x_i^2] - nE[\bar{x}^2]$.

Since each $x_i$ has the same distribution, $\sum E[x_i^2] = nE[x_1^2]$, so I can simplify it as $(n-1)E[S^2] = nE[x_1^2] - nE[\bar{x}^2]$. Using the variance identity on both terms, this becomes $(n-1)E[S^2] = n\mathrm{Var}(x_1) + n(E[x_1])^2 - n\mathrm{Var}(\bar{x}) - n(E[\bar{x}])^2$.

Now, $n\mathrm{Var}(x_1)$ is $n\sigma^2$, and $n(E[x_1])^2$ is $n\mu^2$. The variance of $\bar{x}$ is $\sigma^2/n$, which we derived for independent $x_i$, so $n\mathrm{Var}(\bar{x}) = \sigma^2$. And $E[\bar{x}]$ is simply equal to $\mu$, so the last term is $-n\mu^2$.

The two $n\mu^2$ terms cancel each other out. So, we have $(n-1)E[S^2] = n\sigma^2 - \sigma^2 = (n-1)\sigma^2$,

implying $E[S^2] = \sigma^2$. So, on average, the sample variance is equal to the population variance. This is another important equation.
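The result about the expectation of the sample variance can be illustrated numerically. The sketch below rests on assumptions that are not in the lecture: a normal population with mean 10 and standard deviation 2 (so the population variance is 4), samples of size 5, and the hypothetical helper name `average_variance_estimates`. It averages the sample variance over many samples, once dividing by n - 1 and once dividing by n.

```python
import random

def average_variance_estimates(n, trials, seed=1):
    """Average two variance estimates over many samples of size n."""
    rng = random.Random(seed)
    total_n_minus_1 = 0.0
    total_n = 0.0
    for _ in range(trials):
        sample = [rng.gauss(10.0, 2.0) for _ in range(n)]  # population variance = 4
        xbar = sum(sample) / n
        ss = sum((x - xbar) ** 2 for x in sample)  # sum of squared deviations from x bar
        total_n_minus_1 += ss / (n - 1)            # S^2 as defined in the lecture
        total_n += ss / n                          # same sum, divided by n instead
    return total_n_minus_1 / trials, total_n / trials

avg_s2, avg_div_n = average_variance_estimates(n=5, trials=20000)
print(avg_s2, avg_div_n)
```

Under these assumptions the average with the n - 1 divisor should land close to 4, while the average with the n divisor should land close to 4 × (n - 1)/n = 3.2, which is why the n - 1 divisor is used in the definition of the sample variance.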

So, coming back to the central limit theorem, where can we make use of it? This brings us to the idea of inferential statistics. Why is inferential statistics important? Let us take some examples. A government often predicts the short-term or long-term growth rate, or growth forecast, of the country.

This might be, let us say, that the GDP of the country will grow at 5%, or at 7%, and so on. Here you have what is called a point estimate, meaning the statistic is used to predict a single value. Versus, let us say, you have a house that you want to sell, and you want to estimate its sale price. This can be a range: what is the minimum you can expect to get, and what is the maximum you can expect to get? In this case, what you come up with is called an interval estimate.

So, now consider the first example, which is a point estimate. Imagine you have a dartboard, and the center is the true value of what you want. You want to hit the bull's eye, the center point, but each time you draw a sample of data it is like throwing a dart at the board. Let us say you throw repeatedly, and this is how your points land.

So, these points are estimates of the bull's eye. In one case, you see that the values are mostly below this axis. Or you can have a situation where the points are all over the place. Or, let us take another case, where the points are here.

So, how do we discriminate between these three cases? When you want to come up with a point estimate, what should be your yardsticks? Number one, let us say this is the true value of the parameter that you are trying to estimate.

So, this is some axis, and this is the true value of the parameter you want to estimate. You want an estimator which is unbiased. In other words, on average it hits the true value: it is as likely to predict slightly higher values as slightly lower ones.

So, the estimates would be spread evenly around the true value, and this kind of estimator is an unbiased estimator. Versus the example we have drawn here, which you can draw like this, with this being the true value.

Most of the time, the values here lie underneath the true value. So, this is an example of a biased estimator. Also, compare this one and this one: both are centered on the true value, but one has this as its spread

and the other has this as its spread. You can clearly see that this case is better, because the variance of the estimate is lesser. So, that brings us to the idea, to write it down, of what we want when we come up with a point estimate.

There are two rules: first, you want an unbiased estimator, and second, the variance should be low. So, in our case, what we found was that the sample mean is one of the best estimators of the true value of the population mean.

Why? Because we showed that the sample mean follows a normal distribution centered on the population mean, and if your sample size is larger, then your variance also comes down, because the variance of the sample mean is $\sigma^2/n$. But in the general case, as opposed to predicting a single value, it is better to come up with a range, and this kind of range is called an interval estimate.
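The two yardsticks for a point estimate, low bias and low variance, can be compared by simulation. This sketch is not from the lecture: it assumes an exponential population with true mean 2 (so the variance is 4), samples of size 25, and three illustrative estimators of the mean named `sample_mean`, `first_value`, and `smallest_value`, chosen to mimic the three dartboard pictures.

```python
import random

def bias_and_variance(estimator, n, trials, true_mean, seed=2):
    """Estimate the bias and variance of `estimator` over repeated samples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(trials):
        # expovariate takes the rate, which is 1 / mean
        sample = [rng.expovariate(1.0 / true_mean) for _ in range(n)]
        estimates.append(estimator(sample))
    avg = sum(estimates) / trials
    var = sum((e - avg) ** 2 for e in estimates) / (trials - 1)
    return avg - true_mean, var

def sample_mean(sample):     # unbiased, variance sigma^2 / n
    return sum(sample) / len(sample)

def first_value(sample):     # unbiased, but variance stays at sigma^2
    return sample[0]

def smallest_value(sample):  # tightly clustered, but well below the true mean
    return min(sample)

mean_bias, mean_var = bias_and_variance(sample_mean, 25, 20000, 2.0)
first_bias, first_var = bias_and_variance(first_value, 25, 20000, 2.0)
min_bias, min_var = bias_and_variance(smallest_value, 25, 20000, 2.0)
print(mean_bias, mean_var, first_bias, first_var, min_bias, min_var)
```

Under these assumptions, the sample mean should show a bias near 0 and a variance near 4/25 = 0.16, the single observation should show a bias near 0 but a variance near 4, and the smallest value should be consistently too low, which corresponds to the biased dartboard picture.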

For a normal distribution, we know that roughly 95 percent of the data lies within two standard deviations of the mean. To be exact, instead of 2 it is roughly 1.96. So, if I were to draw this:

This is the distribution that you obtained. Let us say this is your sample mean, and this distance is 1.96 times SE, the standard error.

Both this and this are 1.96 times SE, and this total area covers 95 percent of the data, which means that the probability that any estimate falls within this range is 95 percent. So, if you are working with the sample mean, I can write down this statement: the probability that your standard normal variable lies between $-1.96$ and $1.96$ is 0.95, that is, $P(-1.96 < z < 1.96) = 0.95$ with $z = (\bar{x} - \mu)/(\sigma/\sqrt{n})$. This is what it means to have a 95 percent chance that your estimate is within about 2 standard errors of the mean.
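The 1.96 figure can be checked directly against the standard normal distribution. The snippet below is a small sketch using `NormalDist` from Python's standard `statistics` module; the variable names are illustrative and not part of the lecture.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal: mean 0, standard deviation 1

# Probability that a standard normal variable lies within +/- 1.96
prob_within_196 = z.cdf(1.96) - z.cdf(-1.96)

# Probability within exactly +/- 2, the "two standard deviations" rule of thumb
prob_within_2 = z.cdf(2.0) - z.cdf(-2.0)

# The cutoff that leaves 2.5 percent in each tail, i.e. 95 percent in the middle
z_crit = z.inv_cdf(0.975)

print(round(prob_within_196, 4), round(prob_within_2, 4), round(z_crit, 4))
```

The first probability should come out very close to 0.95 and the second slightly above it, which is why 1.96 rather than 2 is used for an exact 95 percent range.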

So, I can rewrite this statement. I can multiply through by $\sigma/\sqrt{n}$ and write $P(-1.96\,\sigma/\sqrt{n} < \bar{x} - \mu < 1.96\,\sigma/\sqrt{n}) = 0.95$.

I can simplify it again by multiplying by a negative sign, which reverses the inequalities, so I can write $-1.96\,\sigma/\sqrt{n} < \mu - \bar{x} < 1.96\,\sigma/\sqrt{n}$. Adding $\bar{x}$ to all parts gives $P(\bar{x} - 1.96\,\sigma/\sqrt{n} < \mu < \bar{x} + 1.96\,\sigma/\sqrt{n}) = 0.95$.

So, in other words, the 95% confidence interval for the population mean is the range from $\bar{x} - 1.96\,\sigma/\sqrt{n}$ to $\bar{x} + 1.96\,\sigma/\sqrt{n}$.

This is called the 95% confidence interval. So, let us do an example. Imagine a scientist is studying the effect of global warming on wildlife in the Arctic.

As part of this, he samples the weight of polar bears. From a random sample of 50 polar bears, he finds an average weight of 1000 pounds and a standard deviation of 100 pounds. So, the question is: based on this, can we estimate the average weight of all the polar bears?

All the polar bears means that you want to estimate $\mu$, the population mean. What you have been given is the sample mean $\bar{x}$, which is equal to 1000 pounds, and the sample standard deviation, which we use in place of $\sigma$, equal to 100 pounds. So, for creating the 95% confidence interval, I want $\bar{x} \pm 1.96\,\sigma/\sqrt{n}$.

This term comes out to be $1.96 \times 100/\sqrt{50}$, which, if you calculate it, is about 27.7 pounds, so roughly 30 pounds. So, what you can say with 95 percent confidence is that the sample estimate of 1000 pounds lies within roughly plus or minus 30 pounds of the population mean, implying the population mean lies between about 970 and 1030 pounds.
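The polar bear arithmetic can be reproduced in a few lines. This is a minimal sketch: the function name `confidence_interval` is hypothetical, and the sample standard deviation of 100 pounds is used in place of the population value, as in the lecture.

```python
import math

def confidence_interval(xbar, sigma, n, z=1.96):
    """Return (low, high, margin) for the interval xbar +/- z * sigma / sqrt(n)."""
    margin = z * sigma / math.sqrt(n)
    return xbar - margin, xbar + margin, margin

# Values from the polar bear example: 50 bears, mean 1000 lb, standard deviation 100 lb
low, high, margin = confidence_interval(xbar=1000.0, sigma=100.0, n=50)
print(f"margin = {margin:.1f} lb, interval = ({low:.1f}, {high:.1f}) lb")
```

The margin works out to about 27.7 pounds, which the lecture rounds to roughly 30 pounds, so the unrounded interval is slightly narrower than 970 to 1030 pounds.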

So, with 95 percent confidence you can say that the mean lies between these two values. So, what exactly does this 95 percent confidence interval mean? What do we mean by saying 95% confidence? Imagine this is your true population mean.

Let us say you sample once and you find the interval to be between this and this. Similarly, another time you sample and find a range which is somewhere like this, with values between these two, and so on and so forth. 95 percent confidence means that only about once out of 20 times

will you get an interval like this one, which does not contain the population mean. So, in all of these three cases, the range contains the population mean, but this is an example where the range does not contain the population mean.
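The repeated-sampling picture described above can be simulated. The sketch below assumes, purely for illustration, a normal population with mean 1000 and standard deviation 100, samples of size 50, and a known standard deviation; the function name `coverage` and the seed are hypothetical. It counts how often the interval around the sample mean contains the true mean.

```python
import math
import random

def coverage(mu, sigma, n, trials, z=1.96, seed=3):
    """Fraction of repeated samples whose interval contains the true mean mu."""
    rng = random.Random(seed)
    margin = z * sigma / math.sqrt(n)  # half-width of each interval
    hits = 0
    for _ in range(trials):
        xbar = sum(rng.gauss(mu, sigma) for _ in range(n)) / n
        if xbar - margin < mu < xbar + margin:
            hits += 1
    return hits / trials

fraction_covering = coverage(mu=1000.0, sigma=100.0, n=50, trials=10000)
print(fraction_covering)
```

Under these assumptions the fraction should come out close to 0.95, meaning that roughly 1 interval in 20 misses the population mean.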

So, when we say 95% confidence interval, this means that only about 1 out of 20 times will you have a scenario where the population mean does not lie in that interval. Let us take another example, this one about opinion polls. Let us say you have taken a random sample of 100 adults for an opinion poll about global warming.

Of the 100 adults, 70 percent believe in global warming. We want to estimate the true proportion of the population who believe in it, and we want to find the margin of error.

So, here you are talking about a proportion. If $x$ is the number of people in the sample of $n$ who believe in it, the sample proportion is $x/n$, and for a large sample it follows approximately a normal distribution with mean $p$, which we estimate in our case as 70 percent, and standard deviation $\sqrt{pq/n}$, where $q = 1 - p$ and $n$ is your sample size. So, in this case the margin of error becomes $1.96\sqrt{pq/n}$. If you plug in the values, it comes out to about 0.09, implying the true population proportion would lie between 0.61 and 0.79.
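The margin of error for the opinion poll can be computed as follows. This is a small sketch with a hypothetical function name, `proportion_margin`, using the sample proportion of 0.70 and the sample size of 100 from the example.

```python
import math

def proportion_margin(p_hat, n, z=1.96):
    """Margin of error z * sqrt(p * q / n) for a sample proportion."""
    q_hat = 1.0 - p_hat
    return z * math.sqrt(p_hat * q_hat / n)

p_hat = 0.70  # 70 percent of the 100 adults polled
margin = proportion_margin(p_hat, n=100)
low, high = p_hat - margin, p_hat + margin
print(f"margin = {margin:.3f}, interval = ({low:.2f}, {high:.2f})")
```

With these inputs the margin is about 0.09, giving an interval of roughly 0.61 to 0.79, in agreement with the figures quoted in the lecture.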

With that, I would like to conclude our class for today. We saw how you can make use of the central limit theorem as a link between probability and statistical inference, and we made use of the idea of confidence intervals to get a range within which the population mean should lie. Thank you for your attention. I look forward to the next class.
