Transcript for:
Essentials of Probability Concepts

here are the top 10 most important things to know about probability must know number one experimental probability let's start off by doing an example where we conduct an experiment where we flip a coin 10 times and then we calculate what the experimental probability of flipping tails is so let's conduct this experiment I'll flip 10 coins and before we write down what the experimental probability of flipping Tales was from this experiment let me give you a couple quick definitions about probability and experimental probability probability is simply How likely something is to happen the value of a probability always falls somewhere on a scale between 0 and 1 a probability of zero means an event is impossible and a probability of one means an event is certain to happen and any probability let's say a probability of 0.9 can be expressed as a decimal or a fraction so in this case 9 out of 10 or a percentage 90% now an experimental probability is a probability that is determined based on the results of an experiment and let me make some room to add the formula for how we calculate an experimental problem probability let's say we wanted the experimental probability of some event happening we'll call it event a to calculate the probability of event a happening you do the number of times event a occurred and divide that by the total number of Trials of the experiment and there's a shorter notation for writing that where na stands for the number of times event a occurred and NS means the total number of elements in the sample space which in this case means the total number of Trials of the experiment now back to our experiment where we flipped a coin 10 times we wanted the experimental probability of flipping tails to calculate the experimental probability of flipping Tails I would need to do the number of times that Tails occurred divided by total number of elements in the sample space or the total number of Trials of this experiment So based on our experiment we got three tals out of 10 total flips of the coin so our experimental probability of flipping tails is 3 out of 10 or we could write that as a decimal 0.3 or as a percentage 30% and let me add a note before moving on the more trials you do of an experiment the closer the experimental probability is likely to be to the theoretical probability so if we were to continue flipping this coin hundreds and thousands of times you would notice that the relative frequency or experimental probability of flipping Tails would be approaching 0.5 and that leads us right in to our next must know must know number two theoretical probability theoretical probability uses reason or Math instead of an experiment to calculate the likelihood of an event occurring in the future and the formula for calculating the theoretical probability of any event we'll say event a is you do the total number of possible favorable outcomes and you divide that by the total number of possible outcomes and in a condensed notation we could write that as the number of ways that event a can happen divided by the total number of elements in the sample space which just means the total number of possible outcomes let's now do three quick examples that are common types of simple theoretical probability questions we'll do questions about rolling a die drawing a card and picking a marble let's start with rolling a die for any rolling a die question we'll assume that we have a standard six-sided die and let's say we roll that standard six-sided die one time let's calculate first the probability of rolling a four to calculate the theoretical probability of rolling a four we would have to do the number of fours that are on the six-sided die and then divide that by the total number of elements in the sample space which means how many total possible outcomes are there when we roll the die well well on the six side of die there is only one side that has a four on it so there's one way we can roll a four and there are six different outcomes when we roll the die so the total number of elements in the sample space is six so the theoretical probability of rolling a four is one out six let's also calculate the probability of rolling an odd number we would have to do the number of outcomes that are an odd number and divide that by the total number of possible outcomes well there are three odd numbers on six-sided die 1 3 and five and six total possible outcomes so the theoretical probability of rolling an odd number is three out of six which reduces to one out of two or we could write that as 0.5 or 50% let's move on to drawing a card for any question involving playing cards we'll assume it's a standard deck of 52 cards where there are the numbers 2 through 10 Jack queen king and Ace for each of the four suits hearts diamonds Spades and clubs if we were to draw a single card from a deck of cards let's calculate the probability that that card is the seven of Hearts the theoretical probability that being a seven of Hearts we would do the total number of seven of hearts that are in the deck of cards divided by the total number of possible outcomes so the total number of cards in our sample space well there's only one card that is a seven of hearts and there are 52 total cards in the deck of cards so there is a one out of 52 chance of being dealt the seven of cards what about calculating the probability that the single card you draw is a king we would have to do the number of kings that are in a deck of cards and divide that by the total number of elements in our sample space the total number of cards in the deck of cards well in the deck of cards there are four Kings one for each of the four suits and the total number of possible outcomes there's still 52 total cards 4 out of 52 could reduce to 1 out of 13 and let's move on to the last theoretical probability question I'll do with you picking a marble suppose you have a bag containing three red two blue and five green marbles if you draw one marble at random from the bag let's calculate the probability that you draw a red marble to calculate that theoretical probability we would do the total number of red marbles in the bag and divide that by the total number of marbles that are in the bag there are three red marbles and the total number of marbles in the bag 3 + 2 + 5 is 10 so your chance of drawing a red marble is 3 out of 10 let's do one more question it's going to look a little bit different let's calculate the probability that you don't draw a Blue Marble this notation here this apostrophe after blue means the complement of the probability of drawing a Blue Marble that means the probability of not drawing a Blue Marble so I'll make a little note of that and how you calculate the probability of an event not not happening is by doing 1 minus the probability that it does happen so for this question that would be 1 minus the probability of blue well the probability of drawing a Blue Marble two of the 10 marbles are blue so that' be 1 minus 2 out of 10 and 2 out of 10 can reduce to 1 out of 5 and 1 - 1 over 5 is 4 over 5 which we could instead right as 0.8 or 80% must know number three probability using sets let's say a survey was conducted where three questions were asked do you like hockey do you like soccer and do you like basketball the results of that survey could be shown in a VIN diagram this ven diagram helps us understand the relationship between the different answers to this survey inside the rectangle is the entire sample space so everyone who answered the survey is represented inside that rectangle we see all of the numbers in this circle represent all of the people that said they like hockey and this one everyone that said they like basketball this five in the very middle would be the people that said they like all three Sports and this four on the outside would represent the four people that said they don't like any of those three Sports what I want to do with this ven diagram is help you understand two important Concepts the intersection of sets and the union of sets let's start with the intersection of sets so let's say we had two sets set a and set B the intersection of those two sets is the set of elements that are in common to both A and B so in the v diagram that would be the overlap of the two circles and the notation that represents the intersection of the two sets would look like this A and B or that upside down U we read that as and it means the intersection of the sets so let's do a probability example where we look at the intersection of two sets the example says to calculate the probability that a respondent likes hockey and basketball so that word and remember means the intersection of the two sets so we are looking for the probability that a respondent likes hockey and basketball where this upside down u means and it's the intersection of the sets and then when calculating the probability you do the number of elements that are in the event space and then you have to divide that by the total number of elements in the sample space so the total number of people that took the survey let's start by finding the intersection of hockey and basketball so the number of people that like both hockey and basketball would be the overlap of those two circles which is right here I see that there are five plus three so there are eight people that like both hockey and basketball so my probability calculation it would be8 divided by how many ever total people took this survey and to find the total number of people that took the survey we would have to add every single number that we see in the v diagram and you'll see that that equals 100 so the probability of a person liking hockey and basketball is 8 out of 100 which could reduce to 2 out of 25 or we could write it as a decimal as 0.08 now let's do an example for the union of sets let's start off with what is the union of two sets I'll draw two sets once again I'll draw set a and set B and then if I wanted to represent the union of those two sets I would need to show the elements that are inside of a or b or both so that would be anything anywhere inside of this shaded region and the notation for the union of two sets looks like this a u b and for the union of sets we read this as a or b so the elements are anywhere inside of a or b and there's actually a formula for counting the number of elements that are inside of a or b what we do is we take the number of elements that are inside of a and to the that we add the number of elements that are inside of B so we take everything that's inside of a and to that we add everything that's inside of B but notice if we do that we would have double counted all of the elements that are in the intersection of those two sets so for that reason we need to subtract the number of elements that we double counted which are the number of elements that are inside of a and b the intersection of the two let's now see if we can do an example involving the union of two sets the examp example says to calculate the probability that a respondent likes basketball or soccer that word or tells me that we're looking at the union of two sets so I'll go ahead and write the probability formula the probability of a respondent liking basketball or soccer would equal the number of elements that are in the event space and divide that by the total number of elements inside the sample space now there are a couple different ways we could figure out the number of people that like basketball or soccer we could use this formula if I did that it would equal the number of people that like basketball plus the number of people that like soccer but when adding those two together I would have double counted the people that like both basketball and soccer so I have to then subtract the number of people that I double counted the number of people that like basketball and soccer divided by the number of elements in the sample space and if I do this formula let's look back up at our vend diagram the number of people that like basketball inside of the basketball Circle I see 48 elements plus the number of people that like soccer inside of the soccer Circle I see 42 elements and then I have to subtract the number of people that like basketball and soccer so that would be the intersection of the basketball and soccer circles I see 15 people inside that intersection those are the people that I counted as both the basketball Circle and the soccer circles so they're double counted so I have to subtract that 15 and then the size of the sample space was still 100 if I simplify this I get 75 over 100 which reduces to 3/4 or as a decimal 075 now it might have been easier to get this 75 people that like basketball or soccer just by using the V diagram looking at those two sets the basketball and the soccer set and then just adding together one time all of the numbers that fall anywhere inside of those two circles right because the Union of two sets is anything anywhere in those two and if I added all those numbers together I would get 75 must know number four conditional probability conditional probability is the probability of an event occurring given that another event has already occurred now let me show you the formula for how we calculate a conditional probability let's say we wanted the probability of event B happening given that event a has already occurred in this notation this vertical line we read it as given to calculate this conditional probability before I give you the formula let me give you a visual representation of what this conditional probability would look like so I'll draw two sets set a and set B inside of a sample space so let me zoom in on this we've got set a we've got set B and then the the overlap of those two sets is the intersection of the two of them which we can write like this the set of elements that are in a and b now if we were just finding the probability of event B happening I would simply do the number of elements that are inside of event B divided by the total number of elements inside of the sample space but we don't just want the probability of B we want the probability of B given that a has happened so I'm going to have to change both of these my sample space is no longer everything inside of this rectangle because we're given that a has occurred I can shrink my sample space to only being the elements inside of event a so I'll change my denominator to the number of elements inside of a and I'll shade in that new sample space so if this is our new sample space what are the number of elements inside of set B well that would be just these elements here that are in the overlap of A and B so I'll change my numerator to the number of elements that are in both a and B so this is my conditional probability formula let me zoom out and let's try an example to see how it works in this example we have a table that shows the results to the question do you like school and then it asks us to determine the probability that a respondent likes School given they are female this word given tells me this is a conditional probability I'm interested in finding the probability that they like School given they're female fale since we are given the respondent is a female I can shrink the sample space to just be the respondents that are female so in my probability calculation I'll be dividing by just the number of females and since we're given that they're a female the only way that they could like school is if they like school and are a female so my numerator is the number of people in the survey that like school and are a female so basically all the question is saying of the 14 females what's the probability that they like school well 10 of the 14 females like school right the number of students that like school and are female are these 10 people right here and that's divided by the total number of females which is 14 and that could be reduced to 5 over 7 must know number five the multiplication law if we're trying to find the probability of multiple events happening in sequence the formula we use will depend on if the two events are independent or if the events are dependent now what would make two events independent would be that if the occurrence of one event does not affect the occurrence of the other so let's say we had two events event a and event B and I wanted to find the probability that event a happens and then event B happens if the events are independent of each other meaning the occurrence of event a does not affect the occurrence of event B then I can calculate the probability of the two of them happening in sequence just by doing the probability of event a happening and then multiplying that by the probability that event B happens and sometimes instead of using the intersection of set symbol which means and sometimes you'll see it written as a comma between the two events probability of a and then B which means the same thing just multiply the probabilities of each event in the sequence now let me just shrink this a bit and let's make some room for an example of how to use this multiplication law for independent events the example says suppose you roll a die and flip a coin what is the probability of rolling a three and flipping Tails because I want two different events to happen I want to roll a three and flip tails I know I need to use the multiplication law to find the probability of event a and event B happening so if I write a formula for this to find the probability of rolling a three and then flipping Tails I would need to do the probability of rolling a three multipli by the probability of flipping tails the probability of rolling a three well one of the six sides of a die are a three so you have a one out of six chance of that happening multiplied by the probability of flipping Tails one of the two sides of a coiner tail you have a 1 out of two chance of that happening so the probability of both of those things happening is the product of those two 1 6 * 1 over2 that's just 1 over 12 in this example we just did rolling a three on the die did not affect the probability of flipping Tails but let's look at what would happen if the two events in our sequence are dependent on each other dependent events are two events where the occurrence of one of them affects the occurrence of the other so the formula for finding the probability of event a happening and event B happening if the two events are dependent on each other we would find the probability of event a and then multiply that by not just the probability of event B but the probability of event B given that a has occurred because that affects the probability of B so let's do an example where we need this multiplication formula that involves a conditional probability the question reads what is the probability of drawing two kings in a row without replacement so we're trying to figure out the probability of the first card you draw being a king and the second card you draw is also a king in probability when you see a symbol for and you want to think multiplication so we'll have to multiply the probability of the first card being a king by the probability of the second card being a a king given that the first card was a king the reason why this second probability has to be a conditional probability is because it says when you draw the cards you do it without replacement so after you take the first card out of the deck that affects the probability of the second card being a king so the events are dependent on each other and now let's figure out those probabilities what's the probability from a standard deck of 52 cards that the first card you draw is a king well there are Four Kings in a deck of cards out of the 52 cards so the probability of that happening is four out of 52 and that gets multiplied by the probability that the second card you draw as a king given the first card you drew was a king so we've already removed a king from the deck of cards meaning there are only three kings left in the deck and the deck doesn't have all 52 cards anymore you've already taken out one of the Kings so there are only 51 cards left to draw from and if we do this multiplication and simplify we would get 1 out of 221 as the probability of drawing two kings in a row without replacement must know number six permutations sometimes when counting the number of elements in a sample space or event space for a probability we need to consider the number of permutations of objects there are where permutation are just ordered Arrangements of objects the number of permutations of n distinct objects is just equal to n factorial and in case you don't know what a factorial is n factorial would just mean the sum of all the positive integers up to the value of n so for example 5 factorial would be 5 * 4 * 3 * 2 * 1 Let's do an example now where we calculate how many different orders can the letters a a b and c be arranged in since we're doing the number of ordered Arrangements of three objects that just means how many permutations of three objects can we make since the number of permutations of n objects equals n factorial I could say the number of permutations or ordered Arrangements of three objects would just be equal to 3 factorial which means 3 * 2 * 1 which equals 6 so there are six different orders I can rearrange those three letters into and let's take a look at why that makes sense let me write those letters a b and c and let's say we're placing them into these three spots we need to start by choosing a letter to go into the first spot let's say we choose B well how many options do we have for what could go there there were three options now we need to choose something for what goes in the second spot since we've used a letter in the first spot there's only two options remaining let's say we choose a for that spot and now there's only one spot left and and only one letter we can choose for it so we only have one option for what goes in the last spot and hopefully you can see the relationship between what I wrote here 32 and one and how we count the number of ordered Arrangements of three objects we just multiply the number of options we had at each step in the sequence when choosing the order of the letters so BAC was one of the six possible permutations I'll just write the other five quickly just so you can see them let me make a little bit of room and let's look at what happens if we're just doing ordered Arrangements of part of a set of objects the number of ordered Arrangements of n items taken r at a time could be calculated using this formula you say the permutations of n items taken r at a time would equal n factorial / nus R factorial and let's see how that works with an example let's say there are 10 people in a race how many different ways could they finish first second and third so basically we only want to find how many ordered Arrangements of three I can make from 10 people to calculate that I do the number of permutations of 10 taken three at a time would equal the formula tells me to do 10 factorial but then divide that by 10 - 3 factorial and I'll simplify 10 - 3 to 7 and then if I were to start expanding the 10 factorial in the numerator that' be 10 * 9 9 * 8 * 7 and then all the way down until I get to 1 but I can stop the expansion of a factorial by putting a factorial symbol that means that it continues all the way down to one and the reason I'm stopping at seven is because there is a 7 factorial in the denominator so what happens is those S factorials cancel and what I'm left with is just 10 * 9 * 8 and why does that make sense well from the 10 people we're only filling three spots we're only making an ordered arrangement of three from the 10 people so how many choices do we have for who could come first we had 10 choices how many options for then who could come second well there's nine people left that could come second and for third there would be eight people left so we do 10 * 9 * 8 and we figure out that there are 720 different ways the people in the race can finish first second and third and let me do one last example where we see how this could come into play when calculating a probability this example says that a lock opens if the right order of three numbers from 0 to 59 is input numbers can't be repeated what's the probability of guessing the correct passcode on your first try so for this example I want to calculate the probability of guessing the correct passcode I need to do the number of correct passcodes well there's only one actually correct passcode that will open the lock and then I have to divide that by the total number of possible different passcodes well there are 60 numbers between 0 and 59 including 0 and 59 because I'm doing an ordered arrangement of items that can't be repeated I know this is a permutations problem so I can just use the permutation formula to figure out how many permutations of 60 objects there are if I take them only three at a time and if I use the permutation formula for that it would simplify to just 60 * 59 * 58 and that's because you have 60 choices for the first number then 59 for the 2 then 58 for the 3D and that would mean our total probability is 1 over 25,32 must know number seven combinations a combination is a selection of all or part of a set of objects where the order of the objects does not matter and you can calculate the number of combinations of n items taken r a time using this formula we see the number of combinations of n items taken r at a time which we can write like that or we can write it like this and we usually pronounce this as n choose R and that's equal to n factorial / nus R factorial and then there's another R factorial in the denominator so it looks really similar to the permutations formula but there's this additional R factorial in the denominator so let's see how this form works and when to use it in this example it says how many groups of three can be made from Five People based on this question it doesn't seem like the order of the people in the group would matter so we're just wondering how many combinations of three can we make from five people using the combinations formula we would say that 5 choose 3 would equal 5 factorial / 5 - 3 factorial * 3 factorial this 5 - 3 change to two and then I'll simplify this by expanding the 5 factorial to 5 * 4 * 3 * 2 instead of wrting times 1 I'll just stop the expansion with a factorial symbol and I'll do that because I see that it cancels with the two factorial that's in the denominator and now what I have in the numerator 5 * 4 * 3 will give me the number of ordered Arrangements of three I can make from a group of five so that would be 60 so why isn't that our answer well let me show you with letters let's say from the five people uh one of the groups of three we make is person a with person B with person C we wouldn't care what order those people are in it could be ABC or ACB or any of the six different permutations I could make with those three people I wouldn't want to count those six different permutations as different groups I would want to count them as just one combination of people which is why we divide by the number of different permutations of those three people in the group that we can make we divide by three factorial that divides this answer by six to give us the total number of unique groups where the order does not matter and that would tell us that there are only 10 different groups of three people that we can make from five people and let's do one more example where we have to calculate a probability that's going to involve combinations this question says from a group of seven kids and 11 adults if you're making a team of six what's the probability there is only one kid on the team to calculate the probability that there's exactly one kid on the team I would have to do the number of teams that have exactly one kid and divide that by the total number of teams so divide that by the number of elements in the sample space well the total number of teams possible there are 18 people to choose from and I'm making a team of six where the order doesn't matter so I use combinations I can just do 18 choose and in the numerator I need to figure out the number of groups that have one kid well if I'm making a group of six that has one kid that means of the seven kids I need to choose only one of them so I can do seven choose one and from the seven kids if I'm only choosing one of them for my group of six that means from the 11 adults I have to fill the other five spots for the team so I would have to do 11 choose five to figure out the number of different groups of five adults I can make from the 11 adults and then if I do that multiplication I get 3,234 that's the total number of groups that have exactly one kid on the team and the total number of different teams of six that could have been made is 18564 if I evaluate that as a decimal it's about 01742 so there's about a 17% chance that there's exactly one kid on the team must know number eight continuous probability distributions now there are lots of different types of continuous probability distributions a few types are the normal distribution exponential Kai squ and uniform to explain continuous probability distributions to you I'm just going to focus on normal distributions and before I go into a full explanation let me give you a sketch of what the distribution of a set of normally distributed data would look like this function that I drew right here this function f x this is what we call a probability density function and what it does is it models the probabilities of outcomes of some continuous random variable X now there's a couple things I should comment on in more detail first of all a continuous random variable is a variable that can take on within a certain interval it can take on an infinite number of uncountable values so for example somebody's height let's say we look at the interval between 1.6 and 1.7 m there's an infinite number of heights somebody could be within that interval they could be 1.6 5 43 M there would be no way to count all the possible Heights within that interval so that makes it a continuous random variable so because there's an infinite number of values a continuous random variable can take the probability of any one specific value happening is going to be zero but what we do is use this probability density function to calculate probabilities over an interval by using the area that's underneath the function now the PDF function of any normally distributed data is calculated solely based on its mean and its standard deviation so depending on the mean and standard deviation of the data the position or shape of this function might change a little bit but it's going to follow the same group of properties and let me show you what those properties are let me make a little bit more room here the mean of the PDF function is always going to be right in the middle right in line with the highest peak of the function and then 68% of the data so 68% of the area under this curve is going to fall within one standard deviation of the mean and because this function is always symmetrical that would mean that that 68% could be broken in half into these two sections and then if I move two standard deviations out from the mean within two standard deviations of the mean lies 95% of the data and then within three standard deviations of the mean you would find 99.7% of the data assuming it's normally distributed so some approximate values for these little areas here uh I know this isn't going to add up to 9 9.7 exactly because there's been a lot of rounding happening here um but it's about 2.25% in each of those sections one other important piece of information that I haven't written down yet is that the total area underneath a PDF curve is equal to one that would mean that the definite integral of the PDF function FX between negative infinity and infinity would be 1 now there's a very special normal distribution it's called the standard normal distribution let me make a bit of room and we'll write about that a standard normal distribution has a mean that is equal to Z and a standard deviation that equals 1 so down here on my graph of my PDF function 0 would be in the middle and then each unit I increase or decrease by would be one so to the right would be 0 1 2 and 3 and to the left -1 -2 -3 because the mean is zero and the standard deviation is one the actual value of the variable communicates how many standard deviations you are to the right or left of the mean this value of two means you are two standard deviations to the right of the mean and this value of ne1 means you are one standard deviation to the left of the mean so these values have special names these are called zores and there is something called a zcore table where we could look up z-scores and the table would tell us what percentage of the data is less than or equal to the zcore that we're looking up so it will give us the area under the curve to the left of the value we look up and we'll use that in our next example in this example it says the lifespan of regular smokers follows a normal distribution with a mean of 68 and a standard deviation of 10 what percentage of smokers will live beyond 76 let me start by drawing a rough sketch of a normal distribution the question tells me that the mean is 68 so I know that goes in the middle and then I'll label these three units to the right going up by the standard deviation each time and then to the left label these three spots by going down by the standard deviation of 10 each time and what the question is asking it's saying what percentage of smokers will live beyond 76 to find what percentage of smokers live beyond 76 we would have to find the area under this curve that is to the right of 76 there are a couple different ways to do that the first way I'll show you is using a zcore table so what we have to do is we have to figure out what is the zcore of 76 and what a zcore is is just telling you how many standard deviations this value is away from the mean so it's telling you this Point's relative position over here on the standard normal curve and you can calculate a zcore right the number of standard deviations from the mean just by doing the x value minus the mean and dividing by the standard deviation so the zcore for 76 I would do 76 minus the mean of 68 and then divide by the standard deviation of 10 and I figure out that the zcore is 0.8 and notice 0.8 on this standard normal graph 0.8 would be right about here if I found the area to the right on the standard normal graph it would match exactly the area to the right of 76 on this graph now like I said we want the area to the right of it but what a zcore table does a z score table is only capable of telling you the area to the left of the zcore that you look up so what we'll first do is find the probability of the zcore being less than or equal to 0.8 if we look up 0.8 in the zcore table it'll tell us that the area to the left is about 07881 so we figured out that this area to the left is 7881 but we want the area over here on the right well because the total area remember equals 1 under the curve if I do 1 minus this area it'll give me that area so the probability of a zcore being greater than 0.8 would just be 1 minus the probability that's less than or equal to 0.8 so 1 minus that probability which is 02119 we can now answer our final question which says that the probability that a smoker lives beyond 176 so this area is about 21.9% and now there are calculator functions that you can use to not have to use a zcore table for example on a graphing calculator the ti84 you could find the option for the normal CDF function which is the cumulative density function and then within that option for normal CDF you input a lower boundary which is 76 an upper boundary for the area we want to type Infinity but we can just type um a really big number so like 1 time 10 to the^ of 99 and then input the mean and standard deviation and it'll give you the area within the interval that you asked for must know number nine binomial probability distributions a binomial probability distribution describes the probability of the number of possible successes in an experiment referred to be a binomial probability distribution this experiment that I'm talking about has to follow a set of criteria there has to be a fixed number of trials for each of the trials there's only two possible outcomes either a success or a failure the probability of success stays constant which means each trial is independent and there's a formula for calculating the probability of having K successes in N trials formula is the probability of the number of successes equaling K is n choose K * P the power of K * 1 - P to the^ of nus K and I'll explain to you what each of those variables stands for p is the probability of success so 1 minus P would then be the probability of failure K is our number of successes and N is the total number of Trials and then I suppose I could add but I'm out of room n minus K would be the number of Trials minus the number of successes so that would just be the number of failures and this function that we generated here that can find us the probability of K successes in N trials is called a probability Mass function it can find us the probability of any number of successes happening and because our variable is number of successes that's very countable right there's a discrete number of successes we could have so this is called a discrete probability distribution this is different than the continuous probability distribution we did in the last section so let's do an example of a binomial probability distribution the example says suppose you roll a six-sided Die four times create a theoretical probability distribution for the number of Threes rolled now because this is a discrete probability distribution we can typically communicate the probability of any number of Threes being rolled in a table and in the table our variable is the number of Threes that you roll so how many threes could you roll when you roll the Die four times well you could roll no threes or 1 or two or 3 or four and now what we need to do is find the probability associated with each of those values of X and I should mention in this experiment rolling a three is what we consider a success so we could have no successes or one or so on let's figure out first of all the probability if we are going to have let's say two successes if we are going to have two successes let's try and use this formula in the formula we do n choose K first the number of Trials choose the number of successes four rolls of the die and we want two of them to be successful that gets multiplied by the probability of success to the exponent of the number of successes well the probability of rolling a three is 1 out of six and we want that to happen two times and that gets multiplied by the probability of failure to the exponent of the number of failures well the probability of not rolling a three would be five out of six and if we're going to have two successes out of four rolls that means we're going to have two failures as well so if we have a look at this formula if we specifically look at this part it does the probability of success times the probability of success times the probability of failure times the probability of fail failure so what that considers is the probability of having a success then a success then a failure then a failure but that's not the only way you could have two successes four choose two tells you the number of different ways we could choose two of the four roles to be successful so finding the product of all three of those will give us the probability of having two successes out of four roles and if we calculate that it's about 0.157 let's now find the probability of having three successes that means we roll a three three times using the probability Mass function for a binomial distribution I would do 4 choose three right I have four roles I want three successes the probability of success I want that to happen three times and the probability of failure well I want to fail only one time if I'm succeeding three times calculating that would be 0154 so hopefully you can see how this formula works I'll just fill in these other three spots for you so we have the complete distribution so there's the complete probability distribution for any probability distribution the total of all the probabilities should equal one and also if we wanted to calculate What's called the expected value of the number of Threes we would roll there are two ways you can do that for any probability distribution you can just find the sum of each x value with its probability so we do 0 * its probability plus 1 * its probability and so on but for a binomial probability distribution all you have to do is the number of Trials times the probability of success so there are four trials and the probability of success on any one trial was one out of six so that equals 2/3 or 67 so what does this mean 67 an expected value is just if we were to conduct this experiment over and over and over again the average number of Threes that we would roll we would expect the average to be 67 must know number 10 a geometric probability distribution a geometric probability distribution models the probability of the number of Trials needed to achieve the first success in an experiment now just like a binomial distribution the experiment has some conditions there can only be two possible outcomes for each trial a successor a failure there has to be a constant probability of success meaning the trials are independent now even though it shares all three of these conditions with a binomial experiment our variable of interest is different we're not finding the number of successes in a fixed number of Trials we're instead finding out how many trials we need to do until we get the first success so the formula that can calculate that the probability Mass function for a geometric distribution is the probability of the number of Trials until the first success which we call the waiting time is equal to K is 1 minus P to the kus1 * p and let me tell you uh what these variables stand for and why this makes sense our variable of interest X is number of Trials till first success p is the probability of success so that would mean that one minus P would be the probability of failure and K is equal to X so it it's the trial number of the first succcess so what's happening in this formula the probability that K is our first success we would do 1 minus P right that's probably a failure we'd have to fail one less than K times and then after that we'd have to have our success let's now do an example of a geometric probability distribution question if you continue rolling a pair of dice until you get doubles part A says what is the problem ility that it takes four tries this is a geometric probability question because you keep rolling the dice until you get doubles so it just keeps going until you get your first success and the probability of getting doubles is constant on each trial the proba of rolling doubles well if you roll two dice there's six options on the first dice six options on the second dice so there are 36 total different outcomes that could happen so the probability of doubles the denominator the sample space is 36 and there are six different doubles that you can get doubles of ones twoos 3es fours fives or sixes so our probability of success in this experiment is 6 out of 36 or 1 out of 6 so in part A when I want to know what's the probability that the waiting time is equal to 4 based on the probability Mass function for a geometric distribution I would do the probability of failure so 1 minus the probability of success that would be 5 out of 6 to the exponent of K minus 1 to the exponent of 4 minus 1 which is 3 right if my first success is going to happen on the fourth try I'd have to fail three times and then on the fourth try I'd have to succeed and the probably success is one out of six and evaluating this I get 0965 let's do a Part B if we want the probability it takes fewer than three tries to get doubles so probability the waiting time is less than three well that would mean that the waiting time is either one or two either happens on your first try or your second try so if I add the probability that the waiting time is one with the probability that the waiting time is two that will answer this question and for the waiting time to be one that would mean that we would fail zero times and succeed right away on the first try and the probably the waiting time is two we'd have to fail on the first try and then succeed on our second try and if we evaluate this it's about 03056 and the last question we'll do is we will calculate the expected waiting time the expected value of a geometric distribution the expected waiting time before you get your first success it's always 1 iD the probability of success so you just have to do 1 / 1 over 6 which equals six so on average if you continued this experiment multiple times it would average out that it would take you six times to get doubles sometimes it'll take you less sometimes more but on average six so that's the end of the top 10 must knows for probability make sure to stay tuned to the channel because I'm going to put out in my next video a sequence of 10 probability questions that get increasingly harder where you can test out all of this knowledge J