Frequency distribution of attitudes towards the death penalty | ||
---|---|---|
Black and White respondents in GSS 2000 | ||
f | Percent | |
Support Death Penalty | 719 | 80.16 |
Oppose Death Penalty | 178 | 19.84 |
TOTAL | 897 | 100 |
SOC 221 • Lecture 8
Wednesday, July 30, 2025
Have used one sample to draw inferences about one population
Have drawn inferences about the difference between two populations
New goal: test statistical significance for associations in two-way tables
Association
Two variables are “associated” with each other when variation in one variable corresponds with variation in the other variable.
Synonym: relationship
Independent Variable
An independent variable is assumed to influence a dependent variable. The assumed “cause” in an association
Dependent Variable
A dependent variable is affected by one or more other variables. The assumed “effect” in an association
Generally avoid using language of “cause” and “effect” since establishing a case for causality is always difficult and rarely certain.
Say you were presented with the following two tables. Can you tell whether there is an association between race and support for the death penalty?
Frequency distribution of attitudes towards the death penalty | ||
---|---|---|
Black and White respondents in GSS 2000 | ||
f | Percent | |
Support Death Penalty | 719 | 80.16 |
Oppose Death Penalty | 178 | 19.84 |
TOTAL | 897 | 100 |
Frequency distribution of race | ||
---|---|---|
Black and White respondents in GSS 2000 | ||
f | Percent | |
Black | 104 | 11.59 |
White | 793 | 88.41 |
TOTAL | 897 | 100 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 | 659 | 719 |
Oppose Death Penalty | 44 | 134 | 178 |
TOTAL | 104 | 793 | 897 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 | 659 | 719 |
Oppose Death Penalty | 44 | 134 | 178 |
TOTAL | 104 | 793 | 897 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 | 659 | 719 |
Oppose Death Penalty | 44 | 134 | 178 |
TOTAL | 104 | 793 | 897 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 | 659 | 719 |
Oppose Death Penalty | 44 | 134 | 178 |
TOTAL | 104 | 793 | 897 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 | 659 | 719 |
Oppose Death Penalty | 44 | 134 | 178 |
TOTAL | 104 | 793 | 897 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 | 659 | 719 |
Oppose Death Penalty | 44 | 134 | 178 |
TOTAL | 104 | 793 | 897 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
57.69% = (60/104) * (100)
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
Can detect the ASSOCIATION between variables by comparing conditional distributions…
STRENGTH
How strong is the tendency for certain values of Y to go with particular values of X?
DIRECTION
Is the association positive or negative?
STATISTICAL SIGNIFICANCE
How certain can we be that the association exists in the population?
STRENGTH
How strong is the tendency for certain values of Y to go with particular values of X?
STRENGTH
How strong is the tendency for certain values of Y to go with particular values of X?
Conditional distributions are completely dissimilar (maximum difference in column %s across values of the IV)
Two-way table of vote on Affordable Care Act (ACA) by political party | |||
---|---|---|---|
US Senators in 2009 | |||
Democrat | Republican | TOTAL | |
Voted for ACA | 60 (100.00%) |
0 (0.00%) |
60 (60.61%) |
Voted against ACA | 0 (0.00%) |
39 (100.00%) |
39 (39.39%) |
TOTAL | 60 (100.00%) |
39 (100.00%) |
99 |
No conditional variation (all cases with a particular IV value have the same DV value)
STRENGTH
How strong is the tendency for certain values of Y to go with particular values of X?
Conditional distributions are exactly the same (no difference in column %s across values of the IV; all match the marginal %s)
Two-way table of transportation to work by gender | |||
---|---|---|---|
Alltech Corp. workers 2014 | |||
Female | Male | TOTAL | |
Drive | 100 (71.43%) |
150 (71.43%) |
250 (71.43%) |
Public Transportation | 30 (21.43%) |
45 (21.43%) |
75 (21.43%) |
Walk/bike | 10 (7.14%) |
15 (7.14%) |
25 (7.14%) |
TOTAL | 140 (100.00%) |
210 (100.00%) |
350 (100.00%) |
High conditional variation (lots of different values of DV for cases with same IV value)
Two-way table: Number of Crimes Committed by Education | |||
---|---|---|---|
Sample of parolees from Florida Prisons | |||
Low Education | High Education | TOTAL | |
0 Crimes | 80 (50.00%) |
130 (86.67%) |
210 (67.74%) |
1 Crime | 24 (15.00%) |
15 (10.00%) |
39 (12.58%) |
2+ Crimes | 56 (35.00%) |
5 (3.33%) |
61 (19.68%) |
TOTAL | 160 (100.00%) |
150 (100.00%) |
310 (100.00%) |
YES: Conditional distributions are different
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
How STRONG is the association?
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
How STRONG is the association?
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
RISK RATIO (a.k.a. relative risk)
The ratio of the probability of some outcome among one group to the probability of the outcome among a different group.
How STRONG is the association?
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
For black respondents:
\(P(SUPPORT) = 0.5769\)
For white respondents:
\(P(SUPPORT) = 0.8310\)
RISK RATIO = \((0.8310)\) \(/\) \((0.5769)\) \(= 1.44\)
The probability of supporting the death penalty is 1.44 times greater for white than for black respondents
STRENGTH
How strong is the tendency for certain values of Y to go with particular values of X?
DIRECTION
Is the association positive or negative?
Two-way table: Number of Crimes Committed by Education | |||
---|---|---|---|
Sample of parolees from Florida Prisons | |||
Low Education | High Education | TOTAL | |
0 Crimes | 80 (50.00%) |
130 (86.67%) |
210 (67.74%) |
1 Crime | 24 (15.00%) |
15 (10.00%) |
39 (12.58%) |
2+ Crimes | 56 (35.00%) |
5 (3.33%) |
61 (19.68%) |
TOTAL | 160 (100.00%) |
150 (100.00%) |
310 (100.00%) |
Negative:
Higher education associated with lower number of crimes
What is the direction of this association?
What is the direction of this association?
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
Not relevant since these are nominal variables (no higher or lower values)
STRENGTH
How strong is the tendency for certain values of Y to go with particular values of X?
DIRECTION
Is the association positive or negative?
STATISTICAL SIGNIFICANCE
How certain can we be that the association exists in the population?
There is an association in this sample
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
Key question: Is this association in the sample strong enough to convince us that there is a real association in the POPULATION from which the sample was drawn?
Used to test statistical significance of associations in a two-way table (so, between categorical variables)
Intended to test whether a pattern or association observed in a set of sample data:
Based on a comparison of our observed frequencies to expected frequencies.
What would the table look like if there were no association between the variables?
Two-way table of attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 60 (57.69%) |
659 (83.10%) |
719 (80.16%) |
Oppose Death Penalty | 44 (42.31%) |
134 (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
Chi-square test is based on comparison of the counts/frequencies we actually observe in the sample to what the table would look like if there were no association between the variables
What would the table look like if there were no association between the variables?
Conditional distributions of the DV would match across the values of the IV (same as marginals)
EXPECTED FREQUENCIES for attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | (80.16%) |
(80.16%) |
719 (80.16%) |
Oppose Death Penalty | (19.84%) |
(19.84%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
Chi-square test is based on comparison of the counts/frequencies we actually observe in the sample to what the table would look like if there were no association between the variables
What would the table look like if there were no association between the variables?
Conditional distributions of the DV would match across the values of the IV (same as marginals)
EXPECTED FREQUENCIES for attitudes towards the death penalty by race | |||
---|---|---|---|
Black and White respondents in GSS 2000 | |||
Black | White | TOTAL | |
Support Death Penalty | 83.36 (80.16%) |
635.64 (80.16%) |
719 (80.16%) |
Oppose Death Penalty | 20.64 (19.84%) |
157.36 (19.84%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
Chi-square test is based on comparison of the counts/frequencies we actually observe in the sample to what the table would look like if there were no association between the variables
\[
\chi^2 = \Sigma\frac{(f_o - f_e)^2}{f_e}
\] where
\(f_o =\) \(observed\) \(\text{frequency in a given}\) cell
\(f_e =\) \(\text{frequency in the}\) cell expected \(\text{under the assumption of the null hypothesis}\)
\(\text{(no association in the population)}\)
\[ f_e = \frac{\text{(row marginal)(column marginal)}}{n} \]
Note: Chi-square will take a value of 0 if there is no association in the sample.
OBSERVED and EXPECTED Frequencies | |||
---|---|---|---|
Black | White | TOTAL | |
Support Death Penalty | \(f_0 = 60\) \(f_e = 83.36\) (57.69%) |
\(f_0 = 659\) \(f_e = 635.64\) (83.10%) |
719 (80.16%) |
Oppose Death Penalty | \(f_0 = 44\) \(f_e = 20.64\) (42.31%) |
\(f_0 = 134\) \(f_e = 157.36\) (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
\(f_0\) | \(f_e\) | \((f_0 - f_e)\) | \((f_0 - f_e)^2\) | \(\frac{(f_0 - f_e)^2}{f_e}\) | |
---|---|---|---|---|---|
Cell #1 | 60 | 83.36 | -23.36 | 545.69 | 6.55 |
Cell #2 | 659 | 635.64 | 23.36 | 545.69 | 0.86 |
Cell #3 | 44 | 20.64 | 23.36 | 545.69 | 26.44 |
Cell #4 | 134 | 157.36 | -23.36 | 545.69 | 3.47 |
\[ \chi^2 = \Sigma\frac{(f_0 - f_e)^2}{f_e} \]
OBSERVED and EXPECTED Frequencies | |||
---|---|---|---|
Black | White | TOTAL | |
Support Death Penalty | \(f_0 = 60\) \(f_e = 83.36\) (57.69%) |
\(f_0 = 659\) \(f_e = 635.64\) (83.10%) |
719 (80.16%) |
Oppose Death Penalty | \(f_0 = 44\) \(f_e = 20.64\) (42.31%) |
\(f_0 = 134\) \(f_e = 157.36\) (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
\(f_0\) | \(f_e\) | \((f_0 - f_e)\) | \((f_0 - f_e)^2\) | \(\frac{(f_0 - f_e)^2}{f_e}\) | |
---|---|---|---|---|---|
Cell #1 | 60 | 83.36 | -23.36 | 545.69 | 6.55 |
Cell #2 | 659 | 635.64 | 23.36 | 545.69 | 0.86 |
Cell #3 | 44 | 20.64 | 23.36 | 545.69 | 26.44 |
Cell #4 | 134 | 157.36 | -23.36 | 545.69 | 3.47 |
\[ \chi^2 = \Sigma\frac{(f_0 - f_e)^2}{f_e} \]
\(\Sigma\) \(= 37.31\)
Chi-square score summarizes the difference between what we observe in the sample and what would expect to observe if there was no association between the variables.
Question: Is that difference big enough to convince us that it did not just happen by chance (sampling error)?
Need a hypothesis test
OBSERVED and EXPECTED Frequencies | |||
---|---|---|---|
Black | White | TOTAL | |
Support Death Penalty | \(f_0 = 60\) \(f_e = 83.36\) (57.69%) |
\(f_0 = 659\) \(f_e = 635.64\) (83.10%) |
719 (80.16%) |
Oppose Death Penalty | \(f_0 = 44\) \(f_e = 20.64\) (42.31%) |
\(f_0 = 134\) \(f_e = 157.36\) (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
\(H_0\): No association between race and attitudes toward death penalty in the population
\(H_a\): A real association between race and attitudes toward death penalty in the population
The default alpha level is \(0.05\) (\(0.01\) and \(0.001\) are tougher alternatives)
\(df\) \(= (R-1)(C-1) = (2-1)(2-1) =\) \(1\)
Now we can go to the chi-square distribution table to see what critical value is associated with an alpha level of 0.05 and 1 degree of freedom
Our critical value is \(3.84\)
OBSERVED and EXPECTED Frequencies | |||
---|---|---|---|
Black | White | TOTAL | |
Support Death Penalty | \(f_0 = 60\) \(f_e = 83.36\) (57.69%) |
\(f_0 = 659\) \(f_e = 635.64\) (83.10%) |
719 (80.16%) |
Oppose Death Penalty | \(f_0 = 44\) \(f_e = 20.64\) (42.31%) |
\(f_0 = 134\) \(f_e = 157.36\) (16.90%) |
178 (19.84%) |
TOTAL | 104 (100.00%) |
793 (100.00%) |
897 |
\(f_0\) | \(f_e\) | \((f_0 - f_e)\) | \((f_0 - f_e)^2\) | \(\frac{(f_0 - f_e)^2}{f_e}\) | |
---|---|---|---|---|---|
Cell #1 | 60 | 83.36 | -23.36 | 545.69 | 6.55 |
Cell #2 | 659 | 635.64 | 23.36 | 545.69 | 0.86 |
Cell #3 | 44 | 20.64 | 23.36 | 545.69 | 26.44 |
Cell #4 | 134 | 157.36 | -23.36 | 545.69 | 3.47 |
\[ \chi^2 = \Sigma\frac{(f_0 - f_e)^2}{f_e} \]
\(\Sigma\) \(= 37.31\)
Since the obtained chi-square (\(37.31\)) is greater than the critical value (\(3.84\)), I can reject the null hypothesis
This supports the research hypothesis that there IS a real association between race and attitudes toward the death penalty IN THE POPULATION
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
In this sample, is there an association between diet and feelings towards Thanksgiving? How do you know?
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 |
80 | |
Indifferrent | 36 |
111 | |
Likes | 184 | ||
TOTAL | 100 | 275 | 375 |
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
In this sample, is there an association between diet and feelings towards Thanksgiving? How do you know?
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 |
55 |
80 |
Indifferrent | 36 |
75 |
111 |
Likes | 39 |
145 |
184 |
TOTAL | 100 | 275 | 375 |
Fill in the missing observed frequencies
(note that once two cells are completed (and you have the marginals) you can complete the table)
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
In this sample, is there an association between diet and feelings towards Thanksgiving? How do you know?
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 (25.00%) |
55 (20.00%) |
80 |
Indifferrent | 36 (36.00%) |
75 (27.27%) |
111 |
Likes | 39 (39.00%) |
145 (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
Add column percentages to better understand conditional distributions
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
In this sample, is there an association between diet and feelings towards Thanksgiving? How do you know?
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 (25.00%) |
55 (20.00%) |
80 |
Indifferrent | 36 (36.00%) |
75 (27.27%) |
111 |
Likes | 39 (39.00%) |
145 (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
Difference in conditional distributions indicate that there IS an association in the sample
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
In this sample, is there an association between diet and feelings towards Thanksgiving? How do you know?
\(\frac{0.53}{0.39} = 1.\)\(36\)
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 (25.00%) |
55 (20.00%) |
80 |
Indifferrent | 36 (36.00%) |
75 (27.27%) |
111 |
Likes | 39 (39.00%) |
145 (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
Use risk ratios to quantify the association. For example, the probability of liking Thanksgiving is 36% higher for carnivores than for vegetarians
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Is this association statistically significant?
\[ \chi^2 = \Sigma\frac{(f_o - f_e)^2}{f_e} \]
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 (25.00%) |
55 (20.00%) |
80 |
Indifferrent | 36 (36.00%) |
75 (27.27%) |
111 |
Likes | 39 (39.00%) |
145 (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
\(H_a\): IN THE POPULATION, there is an association
between diet and attitudes towards Thanksgiving.
\(H_0\): There is no association between diet and
attitudes towards Thanksgiving IN THE POPULATION.
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Is this association statistically significant?
\[ \chi^2 = \Sigma\frac{(f_o - f_e)^2}{f_e} \]
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 \(f_e = 21.33\) (25.00%) |
55 \(f_e = 58.67\) (20.00%) |
80 |
Indifferrent | 36 \(f_e = 29.60\) (36.00%) |
75 \(f_e = 81.40\) (27.27%) |
111 |
Likes | 39 \(f_e = 49.07\) (39.00%) |
145 \(f_e = 134.93\) (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
Find EXPECTED FREQUENCIES:
\(f_e = \frac{\text{(row marginal)(column marginal)}}{n}\)
For example: \(f_e\) for Cell 5 = \(\frac{(184)(100)}{375} =\) \(49.07\)
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Is this association statistically significant?
\[ \chi^2 = \Sigma\frac{(f_o - f_e)^2}{f_e} \]
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 \(f_e = 21.33\) (25.00%) |
55 \(f_e = 58.67\) (20.00%) |
80 |
Indifferrent | 36 \(f_e = 29.60\) (36.00%) |
75 \(f_e = 81.40\) (27.27%) |
111 |
Likes | 39 \(f_e = 49.07\) (39.00%) |
145 \(f_e = 134.93\) (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
Expected frequencies reflect how the table would look if there were no association between the variables (i.e., if the null hypothesis were true)
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Is this association statistically significant?
\[ \chi^2 = \Sigma\frac{(f_o - f_e)^2}{f_e} \]
\(f_0\) | \(f_e\) | \((f_0 - f_e)\) | \((f_0 - f_e)^2\) | \(\frac{(f_0 - f_e)^2}{f_e}\) | |
---|---|---|---|---|---|
Cell 1 | 25 | 21.33 | 3.67 | 13.44 | 0.63 |
Cell 2 | 55 | 58.67 | -3.67 | 13.44 | 0.23 |
Cell 3 | 36 | 29.60 | 6.40 | 40.96 | 1.38 |
Cell 4 | 75 | 81.40 | -6.40 | 40.96 | 0.50 |
Cell 5 | 39 | 49.07 | -10.07 | 101.34 | 2.07 |
Cell 6 | 145 | 134.93 | 10.07 | 101.34 | 0.75 |
Expected frequencies reflect how the table would look if there were no association between the variables (i.e., if the null hypothesis were true)
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Is this association statistically significant?
\[ \chi^2 = \Sigma\frac{(f_o - f_e)^2}{f_e} \]
\(f_0\) | \(f_e\) | \((f_0 - f_e)\) | \((f_0 - f_e)^2\) | \(\frac{(f_0 - f_e)^2}{f_e}\) | |
---|---|---|---|---|---|
Cell 1 | 25 | 21.33 | 3.67 | 13.44 | 0.63 |
Cell 2 | 55 | 58.67 | -3.67 | 13.44 | 0.23 |
Cell 3 | 36 | 29.60 | 6.40 | 40.96 | 1.38 |
Cell 4 | 75 | 81.40 | -6.40 | 40.96 | 0.50 |
Cell 5 | 39 | 49.07 | -10.07 | 101.34 | 2.07 |
Cell 6 | 145 | 134.93 | 10.07 | 101.34 | 0.75 |
\(\chi^2 =\) | 5.56 |
Expected frequencies reflect how the table would look if there were no association between the variables (i.e., if the null hypothesis were true)
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
\[ \text{obtained } \chi^2 = 5.56 \]
\(\text{critical value of }\)
\(\chi^2 = 5.99\)
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 \(f_e = 21.33\) (25.00%) |
55 \(f_e = 58.67\) (20.00%) |
80 |
Indifferrent | 36 \(f_e = 29.60\) (36.00%) |
75 \(f_e = 81.40\) (27.27%) |
111 |
Likes | 39 \(f_e = 49.07\) (39.00%) |
145 \(f_e = 134.93\) (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
Since obtained value of chi-square is LESS EXTREME than the critical value we FAIL TO REJECT THE NULL HYPOTHESIS. The association observed is NOT statistically significant. Cannot be confident that the association exists in the population.
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 25 \(f_e = 21.33\) (25.00%) |
55 \(f_e = 58.67\) (20.00%) |
80 |
Indifferrent | 36 \(f_e = 29.60\) (36.00%) |
75 \(f_e = 81.40\) (27.27%) |
111 |
Likes | 39 \(f_e = 49.07\) (39.00%) |
145 \(f_e = 134.93\) (52.73%) |
184 |
TOTAL | 100 | 275 | 375 |
What happens if the sample is doubled, with the same conditional distributions?
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Since the conditional distributions are the same, and do not match, there still appears to be an association IN THE SAMPLE.
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 50 (25.00%) |
110 (20.00%) |
160 |
Indifferrent | 72 (36.00%) |
150 (27.27%) |
222 |
Likes | 78 (39.00%) |
290 (52.73%) |
368 |
TOTAL | 200 | 550 | 750 |
What happens if the sample is doubled, with the same conditional distributions?
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
Since the conditional distributions are the same, and do not match, there still appears to be an association IN THE SAMPLE.
Feelings about Thanksgiving | |||
---|---|---|---|
Vegetarian | Carnivore | TOTAL | |
Dislikes | 50 \(f_e = 42.67\) (25.00%) |
110 \(f_e = 117.33\) (20.00%) |
160 |
Indifferrent | 72 \(f_e = 59.2\) (36.00%) |
150 \(f_e = 162.8\) (27.27%) |
222 |
Likes | 78 \(f_e = -98.13\) (39.00%) |
290 \(f_e = 269.87\) (52.73%) |
368 |
TOTAL | 200 | 550 | 750 |
What happens if the sample is doubled, with the same conditional distributions?
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
\(f_0\) | \(f_e\) | \((f_0 - f_e)\) | \((f_0 - f_e)^2\) | \(\frac{(f_0 - f_e)^2}{f_e}\) | |
---|---|---|---|---|---|
Cell 1 | 50 | 42.67 | 7.33 | 53.73 | 1.26 |
Cell 2 | 110 | 117.33 | -7.33 | 53.73 | 0.46 |
Cell 3 | 72 | 59.2 | 12.8 | 163.84 | 2.77 |
Cell 4 | 150 | 162.8 | -12.8 | 163.84 | 1.01 |
Cell 5 | 78 | 98.13 | -20.13 | 405.22 | 4.13 |
Cell 6 | 290 | 269.87 | 20.13 | 405.22 | 1.5 |
What happens if the sample is doubled, with the same conditional distributions?
You collect data from a random sample of 375 individuals to look at whether feelings toward Thanksgiving differ by dietary preferences. The partial data are in the table to the right.
\[ \text{obtained } \chi^2 = 11.13 \]
\(\text{critical value of }\)
\(\chi^2 = 5.99\)
\(f_0\) | \(f_e\) | \((f_0 - f_e)\) | \((f_0 - f_e)^2\) | \(\frac{(f_0 - f_e)^2}{f_e}\) | |
---|---|---|---|---|---|
Cell 1 | 50 | 42.67 | 7.33 | 53.73 | 1.26 |
Cell 2 | 110 | 117.33 | -7.33 | 53.73 | 0.46 |
Cell 3 | 72 | 59.2 | 12.8 | 163.84 | 2.77 |
Cell 4 | 150 | 162.8 | -12.8 | 163.84 | 1.01 |
Cell 5 | 78 | 98.13 | -20.13 | 405.22 | 4.13 |
Cell 6 | 290 | 269.87 | 20.13 | 405.22 | 1.5 |
\(\chi^2 =\) | 11.13 |
The obtained value of the chi-square goes way up. Now, the obtained value of chi-square \(\gt\) critical value of chi-square. The association observed in the sample IS statistically significant. We REJECT THE NULL HYPOTHESIS and find SUPPORT FOR THE ALTERNATIVE HYPOTHESIS that there is an association in the population.
What happens if the sample is doubled, with the same conditional distributions?