4.2 Experiments and Probability of Events

We need to come to an understanding of probability and how it is used in statistics. This understanding will be developed over the course of this entire class. For now, keep in mind that probability is a number that is associated with how certain we are of the outcomes of an experiment or activity. While the word “experiment” might bring up a vision of a laboratory with bubbling solutions, when we talk about an experiment, we mean something different.  An experiment is a planned operation carried out under controlled conditions. Results of an experiment are not known before the experiment is conducted, so the outcome is uncertain.

A result of an experiment is called an outcome. If we make a list of all the possible outcomes for an experiment, we create the sample space. We typically are interested in a particular collection of the outcomes, so an event is any combination or subset of outcomes of an experiment.

The uppercase letter S  is used to denote the sample space. Because the sample space is a list of all the outcomes that are possible, we need a compact way to denote the list. In mathematics, we use curly brackets to list sets, this is called set notation, so the sample space for an experiment would be given by  S = {list of all outcomes}.

Upper case letters like A and B are typically used to represent events. Because an event is a collection or subset of the sample space, we will use set notation to list outcomes associated with events as well.  Any event should be described in words first and then the outcomes that make up the event should be listed as A = {list of outcomes corresponding to the event}.

Example 1: Describing a Sample Space and Events

Craps is a dice game, which uses two six-sided dice, as shown in the photograph. When each die is rolled, the top face will show any of the number of dots: 1, 2, 3, 4, 5, or 6. Together the dice can show sums from 2 to 36. The most basic roll in craps is the “come out roll.” One player will roll the dice and the sum of the dice is observed. Some players will win if a 7 or 11 shows and lose if a 2, 3, or 12 shows. Consider the experiment of rolling the dice on the come out roll and create the following related to this experiment:

Decorative photo of two dice.
Two Six-Sided Dice
  1. Describe the experiment.
  2. Define the sample space and list the outcomes.
  3. Define event A as the winning roll and list the elements in event A.
  4. Define event B as the losing roll and list the elements in event B.

Answers:

  1. The experiment consists of rolling two six-sided dice and observing the sum of the dots (pips).
  2. The sample space consists of all of the possible outcomes when two dice are rolled. S = {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36}.
  3. Let event A = getting a 7 or 11; A = {7, 11}.
  4. Let event B = getting a 2, 3, or 12; B = {2, 3, 12}.

Combining Events

We often construct events by combining simpler events. All events are subsets of sample spaces, so we have seen how we use set notation to describe sample spaces and events. We can use set notation to combine sets as well.

The Union of two events, [latex]A \cup B[/latex], is the set of outcomes that belong to A, to B, or to both A and B at the same time. In words,  “A or B” means the same thing as [latex]A \cup B[/latex].

The Intersection of two events, [latex]A \cap B[/latex], is the set of outcomes that belong both to A and to B. In words, “A and B” means the same thing as [latex]A \cap B[/latex].

The Compliment of an event A is the set of outcomes that are not in A, In words, “not A” means the same thing as A’.

  • Union of Events: Sometimes we are interested in one event occurring or another event occurring. Event “A or B”  will be made up of all outcomes in A, or in B, or in both A and B at the same time. We can describe this idea by using the word “or” but it is also common to use the union symbol, [latex]\cup[/latex]. Event “A or B” means the same thing as [latex]A \cup B[/latex]. For example, let A = {1, 2, 3, 4, 5} and B = {4, 5, 6, 7, 8}. If we create the event [latex]A \cup B[/latex] we will list the outcomes in A, or in B, or both A and B. In this case, [latex]A \cup B[/latex] = {1, 2, 3, 4, 5, 6, 7, 8}. Notice that 4 and 5 are NOT listed twice. 
  • Intersection of Events: Sometimes we are interested in two events occurring at the same time, so one event occurs and the other event occurs. We can describe this idea by using the word “and” but it is also common to use the intersection symbol, [latex]\cap[/latex]. Event “A and B” means the same thing as [latex]A \cap B[/latex]. For example, let A = {1, 2, 3, 4, 5} and B = {4, 5, 6, 7, 8}. Then [latex]A \cap B[/latex] = {4, 5}.
  • Complement of an Event: Sometimes we are interested in outcomes that do not belong in an event A, so we are interested in what happens when A does not occur. The complement of event A is denoted A′ . We would read this as “A prime”. A′ consists of all outcomes that are not in A. An event and its compliment will not have outcomes in common and together would make up the entire sample space. For example, let S = {1, 2, 3, 4, 5, 6} and let A = {1, 2, 3, 4}. Then A’ = {5, 6}.

Example 2: Combining Events

An experiment is conducted in which we flip a two-sided coin and then roll a six-sided die. Let event A = the number on the die is a 1 or 2. Let event B = the coin shows heads. Find each of the following:

  1. List S, A, and B using set notation.
  2. Find [latex]A \cup B[/latex]
  3. Find [latex]A \cap B[/latex]
  4. Create the sets A’, B’, and [latex]A' \cap B'[/latex]

Answers:

  1. S = {H1, H2, H3, H4, H5, H6, T1, T2, T3, T4, T5, T6}, A = {H1, H2, T1, T2}, B = {H1, H2, H3, H4, H5, H6}
  2. [latex]A \cup B[/latex] = {H1, H2, H2, H4, H5, H6, T1, T2}
  3. [latex]A \cap B[/latex] = {H1, H2}
  4. A’ = {H3, H4, H5, H6, T3, T4, T5, T6}, B’ = {T1, T2, T3, T4, T5, T6}, [latex]A' \cap B'[/latex] = {T3, T4, T5, T6}

We study statistics because when we take a sample, we want to use the information to infer something about the population. Sampling from a population is a random process, when we use a random sampling method. Probability concepts then form a foundational part of our understanding of statistics. Probability is the tool we use to decide if a particular sample result can be valid for a certain population. We will consider our sample results and decide how likely they would be to occur. This idea of likelihood is where we will spend some time, so you get a foundation in probability thinking.

Probability:

The probability of any outcome is the long-term relative frequency of that outcome.

  • Probability refers to future events, and the notation P(event) is the probability the event occurs.
  • A probability is a proportion and always takes on a value between 0 and 1, inclusive, so
    0 [latex]\leq[/latex] P(event) [latex]\leq[/latex] 1
  • If the probability of an event is zero, P(event) = 0, then the event cannot happen.
  • If the probability of an event is one, P(event) = 1, then the event will happen for sure.
  • If an experiment is conducted, an outcome in the sample space will happen for sure, so P(S) = 1.

There are situations in which the only way we can estimate a probability is to repeat the experiment many, many times and then calculate the proportion of times the event occurs. This important characteristic of probability experiments is known as the Law of Large Numbers which states that as the number of repetitions of an experiment is increased, the relative frequency obtained in the experiment tends to become closer and closer to the theoretical probability. Even though the outcomes do not happen according to any set pattern or order, overall, the long-term observed relative frequency will approach the theoretical probability. (The word empirical is often used instead of the word observed.)

Coin Flip App

Go to the coin flip app and simulate flipping a coin many times to get a sense of the empirical probability related to observing heads. This is the Law of Large Numbers in action. The empirical probability approaches the theoretical probability when the coin is flipped a large number of times.

For example, if you flip one coin six times, you could observe the outcome: H, H, H, H, H, T. On the other hand, you could observe the outcome H, T, H, T, H, T. What happens if you continue flipping and keep track of the number of heads you observe? Try it by going to the Coin Flip App to run a simulation.

We know in the long run, a fair coin will land heads up about 50% of the time, so does our coin flipping result contract the theoretical probability? Not at all! Probability is not defined by what happens in the short term. However, if you repeatedly flip the coin, 20 times, 2000 times, 20,000 times, etc., the relative frequency of heads approaches 0.5, which is the true theoretical probability of heads occurring in the flip of a fair coin. P(head) = 0.5 means the event of observing a head in a future coin flip is equally likely to occur or not to occur. Equally likely means that each outcomes of an experiment occurs with equal probability. For example, if you toss a fair, six-sided die, each of the six faces (1, 2, 3, 4, 5, or 6) is as likely to occur as any other. If you toss a fair coin, a Head (H) and a Tail (T) are equally likely to occur. If you randomly guess the answer to a true/false question on an exam, you are equally likely to select a correct answer or an incorrect answer.

Probability of Equally-Likely Events

To calculate the probability of an event A when all outcomes in the sample space are equally likely, count the number of outcomes for event A and divide by the total number of outcomes in the sample space.

[latex]\displaystyle{P}{(A)} = \frac{{\text{Number of Outcomes in A}}}{{\text{Total Number of Outcomes}}}[/latex]

Example 3: Probability of Equally-Likely Events

  1. If you toss a fair dime and a fair nickel, list the outcomes in the sample space and calculate the probability of A, where A is the event of observing two heads.
  2.  Suppose you roll one fair six-sided die, with the numbers {1, 2, 3, 4, 5, 6} on its faces. Let event E = rolling a number that is at least five. Calculate the probability of observing event E.

Answers:

  1. There are only four possible outcomes in the sample space, which can be listed as S = {HH, TH, HT, TT}, where T = tails and H = heads. Because both coins are fair, each of the four outcomes is just as likely to occur as the others. The event of observing two heads corresponds to outcome HH, which occurs exactly one time. There are four equally-likely outcomes in the sample space. Therefore, [latex]\displaystyle{P}{({A})}=\frac{{1}}{{4}}={0.25}[/latex].
  2. There are two outcomes {5, 6} that correspond to event E. Because the die is fair, each of the six sides has the same probability of being rolled. Therefore, [latex]\displaystyle{P}{({E})}=\frac{{2}}{{6}}=\frac{{1}}{{3}}[/latex] as the number of repetitions grows larger and larger.

It is important to realize that in many situations, the outcomes are not equally likely. A coin or die may be unfair, or biased. Two math professors in Europe had their statistics students test the Belgian one Euro coin and discovered that in 250 trials, a head was obtained 56% of the time and a tail was obtained 44% of the time. The data seem to show that the coin is not a fair coin; more repetitions would be helpful to draw a more accurate conclusion about such bias. Some dice may be biased. Look at the dice in a game you have at home; the spots on each face are usually small holes carved out and then painted to make the spots visible. Your dice may or may not be biased; it is possible that the outcomes may be affected by the slight weight differences due to the different numbers of holes in the faces. Gambling casinos make a lot of money depending on outcomes from rolling dice, so casino dice are made differently to eliminate bias. Casino dice have flat faces; the holes are completely filled with paint having the same density as the material that the dice are made out of so that each face is equally likely to occur. Later we will learn techniques to use to work with probabilities for events that are not equally likely.

Mutually Exclusive Events

Two events are mutually exclusive if they cannot both occur at the same time. The intersection of two mutually exclusive events would have no elements in it and would be empty. Any set that is empty is defined as the empty set. For example, we have already discussed the complement of an event. Any event and its complement would not share any elements, so they would be mutually exclusive.  If we are curious about the probability of two mutually exclusive events happening, then we can add their probabilities.

Finding probabilities for events that are not mutually exclusive requires more care. If we just add the individual probabilities, then we are double counting probability where the two events share elements. Double counting occurs where the two events overlap. To correct for this issue, we can add the probabilities of the individual events as long as we subtract the probability where the intersection occurs. This brings us to several formulas for finding probabilities of combinations of events.

The Empty Set is a set with no outcomes: Empty Set = {  } or [latex]\varnothing[/latex].

Two sets with no outcomes in common are called Mutually Exclusive, or disjoint.

Probabilities of Combinations of Events

If A and B are Mutually Exclusive Events:

  • [latex]P( A \cup B ) = P(A) + P(B)[/latex]
  • [latex]P( A \cap B ) = 0[/latex]

If A and B are not Mutually Exclusive Events:

  • [latex]P( A \cup B ) = P(A) + P(B) - P(A \cap B)[/latex]. Note: For events that are not mutually exclusive, we will count twice any outcome that is a member of both events A and B, so we must take care to subtract the probability related to the intersection of A and B.

For an event A and its Complimentary Event A’ .

  • [latex]P(A) + P(A' ) = 1[/latex]
  • [latex]P(A) = 1 - P(A' )[/latex]

Example 4: Probability of Combinations of Events

A random number generator will produce a whole number between 1 and 19, inclusive. An experiment consists of generating one random number.

  1. List the sample space for the experiment.
  2. If event A = the number is even and event B = the number is greater than 13, then list elements in each event.
  3. Find the individual probabilities of event A and event B.
  4. List the elements in [latex]A\cap B[/latex] and list the elements in [latex]A \cup B[/latex].
  5. Find [latex]P(A \cap B)[/latex] and [latex]P(A \cup B)[/latex].
  6. List the elements in A′  and find P(A′ ).
  7. P(A) + P(A′ ) =

Answers:

  1. S = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19}
  2. A = {2, 4, 6, 8, 10, 12, 14, 16, 18}, B = {14, 15, 16, 17, 18, 19}
  3. [latex]{P}{({A})}=\frac{{9}}{{19}},{P}{({B})}=\frac{{6}}{{19}}[/latex]
  4. [latex]A \cap B[/latex] = {14, 16, 18}, [latex]A \cup B[/latex] = {2, 4, 6, 8, 10, 12, 14, 15, 16, 17, 18, 19}
  5. [latex]{P}{(A \cap B)} = \frac{{3}}{{19}},{P}{(A \cup B)} = \frac{{12}}{{19}}[/latex]
  6. A′  = {1, 3, 5, 7, 9, 11, 13, 15, 17, 19}; [latex]{P}{\left({A}′\right)} = \frac{{10}}{{19}}[/latex]
  7. P(A)+P(A’)=[latex]\frac{9}{19}+\frac{10}{19} = 1[/latex]

Example 5: Probabilities of Combinations of Events

Consider the experiment where we flip two fair coins. If we let events T = observe a tail and H = observe a head, then the possible outcomes of this experiment are HHHTTH, and TT . The sample space is {HHHTTHTT}. Note, the outcomes HT and TH are different. The HT means that the first coin showed heads and the second coin showed tails. The TH means that the first coin showed tails and the second coin showed heads.

  1. Find the probability of getting at most one tail.
  2. Find the probability of getting all tails.
  3. Find the probability of getting all heads.
  4. Find the probability of getting more than one tail.
  5. Find the probability of getting a head on the first roll.
  6. Find the probability of getting at least one tail.

Answers:

  1. Let A = the event of getting at most one tail. (At most one tail means zero or one tail.) Then A can be written as {HHHT,TH }. The outcome HH shows zero tails. HT and TH each show one tail. The probability for A is found as P(A) =
    34
     

    = 0.75.

  2. Let B = the event of getting all tails. B can be written as {TT }. B is the complement of A, so B = A′. Also, P(A) + P(B) = P(A) + P(A′ ) = 1. The probability for  is found as P(B) = 
    14
     

    = 0.25.

  3. Let C = the event of getting all heads. C = {HH }. Since B = {TT }, P(B [latex]\cap[/latex] C) = ∅. Events B and C are mutually exclusive, as B and have no elements in common because you cannot have all tails and all heads at the same time.
  4. Let D = the event of getting more than one tail. D = {TT }. P(D ) = 
    14
     

    = 0.25.

  5. Let E = the event of getting a head on the first roll. (This implies you can get either a head or tail on the second roll.) E = {HTHH }. P(E) = 
    24
     

    = 0.5.

  6. Find the probability of getting at least one (one or two) tail in two flips. Let F = event of getting at least one tail in two flips. F = {HTTHTT }. P(F) = 
    34
     

    = 0.75.

Example 6: Probability of Mutually Exclusive Events

The U.S. Census Bureau surveyed 100,000 households where 305,688 people lived. The Census Bureau wanted to get social and economic information about people living in the United States. The two-way table below summarizes the people by the region of their primary residence in the United States who had no health insurance and also those who have some type of health insurance.

Region No Health Insurance Some Health Insurance Total
Northeast 6761 47,957 54,718
Midwest 8598 57,408 66,006
South 21,639 91,498 113,137
West 12,837 58,990 71,827
Total 49,835 255,853 305,688
  1. Are the events “living in the South” and “living in the Midwest” mutually exclusive? Explain.
  2. Find the probability that a randomly selected person lives in the South or lives in the Midwest.
  3. Are the events “living in the South” and “having no health insurance” mutually exclusive? Explain.
  4. Find the probability that a randomly selected person lives in the South or has no health insurance. This probability can be represented as P(South or No Health Insurance). Write your answer as a decimal, rounded to two decimal places.

Answers:

  1. The events are mutually exclusive because no one can live in two regions at the same time. The events would have no elements in common.
  2. If we label the events as S = living in the South and M = living in the Midwest, then find P(S [latex]\cup[/latex] M) = P(S) + P(M). Using the table values, we see P(S [latex]\cup[/latex] M) = [latex]\frac{113,127}{305,688}+ \frac{66,006}{305,688}[/latex] = [latex]\frac{179,143}{305,688} \approx 0.586[/latex].
  3. The events are not mutually exclusive. It is possible for a person who is living in the South to not have health insurance. Notice in the table, there are 21,639 people who live in the South and who do not have health insurance. If two events can share elements, then the events are not mutually exclusive.
  4. If we label the events as S = living in the South and N = having no health insurance, then find P(S [latex]\cup[/latex] N) = P(S) + P(N) – P(S [latex]\cap[/latex] N). Using the table values, we see P(S [latex]\cup[/latex] N) = [latex]\frac{113,127}{305,688} + \frac{49,835}{305,688} - \frac{21,639}{305,688}[/latex] = [latex]\frac{141,323}{305,688} \approx 0.462[/latex].

In the previous example, the information about region and health insurance were presented in a table. A contingency table provides a way of portraying data that can facilitate calculating probabilities. The table helps in determining conditional probabilities quite easily. The table displays sample values in relation to two different variables that may be dependent or contingent on one another.

Example 7: Using a Contingency Table

Texting and driving is a risky behavior. According to the National Highway Traffic Safety Administration (NHTSA), driving at a rate of 55 miles per hour while sending or reading a text message for five seconds is the same as driving the entire length of a football field with your eyes shut.1 Suppose data was collected at an intersection in a small town and, over the course of a week, 755 cars were observed passing through the intersection. The contingency table summarizes fictitious data about the drivers who were or were not using a cell phone and whether or not they were speeding.

Speeding  Not Speeding  Total
Using A Cell Phone 25 280 305
Not Using A Cell Phone 45 405 450
Total 70 685 755

Looking over the table, we see 305 drivers who passed through the intersection used a cell phone and 25 of them were speeding. Of the 450 drivers who were not using a cell phone, 45 of them were speeding.

Calculate the following probabilities using the information from the table.

  1. Find P(Using a Cell Phone).
  2. Find P(Speeding).
  3. Find P(Using a Cell Phone AND Speeding) = P(Using a Cell Phone [latex]\cap[/latex] Speeding).
  4. Find P(Using a Cell Phone OR Speeding) =P(Using a Cell Phone [latex]\cup[/latex] Speeding).

Answers:

    1. P(Using a Cell Phone) = [latex]\frac{\text{Total Using a Cell Phone}}{\text{Total Number of Cars}} = \frac{305}{755}[/latex]
    2. P(Speeding) = [latex]\frac{\text{Total Speeding}}{\text{Total Number of Cars}} = \frac{70}{755}[/latex]
    3. P(Using a Cell Phone [latex]\cap[/latex] Speeding) = [latex]\frac{25}{755}[/latex]
    4. P(Using a Cell Phone [latex]\cup[/latex] Speeding)
      = P(Using a Cell Phone) + P(Speeding) – P(Using a Cell Phone [latex]\cap[/latex] Speeding)
      = [latex]\frac{305}{755}+\frac{70}{755}-\frac{25}{755}=\frac{350}{755}[/latex]

Videos

This YouTube video gives more examples of basic probabilities: Introduction to Probability

This YouTube video gives examples of probabilities involving AND.

This YouTube video gives examples of probabilities involving OR.

This YouTube video gives examples of finding probabilities related to mutually exclusive events.

This YouTube video gives an example of finding probabilities using a contingency table.

Sources

Example 4 was used from Statway College Module 2, by Carnegie Math Pathways is licensed under CC BY NC 4.0.

1NHTSA “Distracted Driving”. Available online at https://www.nhtsa.gov/risky-driving/distracted-driving (accessed 3/14/20240

definition

License

Icon for the Creative Commons Attribution 4.0 International License

Introduction to Statistics for Engineers Copyright © by Vikki Maurer & Jeff Crabill & Linn-Benton Community College is licensed under a Creative Commons Attribution 4.0 International License, except where otherwise noted.