1. Fundamental Concepts
- Joint Frequencies: These refer to the specific data values in the cells of a two-way frequency table, representing the number of observations that fall into the combination of two categories from different variables. For example, in a survey about "11th and 12th graders playing golf or tennis", the "number of 11th graders who play golf" is a joint frequency.
- Marginal Frequencies: These are the total values in the rows or columns of a two-way frequency table, indicating the total number of observations in a single category of one variable, regardless of the other variable. For instance, "the total number of all 11th graders" or "the total number of all students who play tennis" are marginal frequencies.
2. Key Concepts
- Joint frequencies are located in the inner cells of the two-way table, reflecting the combined characteristics of two variables; marginal frequencies are located at the edges of the table (row totals or column totals), reflecting the overall characteristics of a single variable.
- Marginal frequencies can be obtained by summing the joint frequencies in the same row or column. For example, the sum of the joint frequencies in a row is the marginal frequency of that row.
- Both are used to describe the distribution of data and form the basis for analyzing two-way frequency tables, and can be further used to calculate relative frequencies (such as the proportion of the total).
3. Examples
The following examples are all based on a two-way frequency table of "11th graders and 12th graders participating in golf (G) and tennis (T)" (assuming the data is as follows):
| Golf (G) | Tennis (T) | Row Total | |
|---|---|---|---|
| 11th Grade | 10 | 15 | 25 |
| 12th Grade | 8 | 12 | 20 |
| Column Total | 18 | 27 | 45 (Total number of people) |
Easy Level
- Find the joint frequency of "11th graders playing golf": 10 (the value at the intersection of the 11th grade row and golf column in the table).
- Find the marginal frequency of "all students playing tennis": 27 (the total value of the tennis column).
Medium Level
- Calculate the proportion of the marginal frequency of "12th graders" to the total number of people: 20 ÷ 45 ≈ 44.4% (12th grade row total ÷ total number of people).
- Given that the joint frequency of 11th graders playing tennis is 15 and that of 12th graders playing tennis is 12, verify whether the marginal frequency of the tennis column is correct: 15 + 12 = 27 (consistent with the total of the tennis column in the table, so it is correct).
Hard Level
- If the joint frequency of 11th graders playing golf increases by 5, with other data unchanged, find the new marginal frequency of the 11th grade row and the marginal frequency of the golf column:
- New joint frequency of 11th graders playing golf: 10 + 5 = 15;
- New marginal frequency of the 11th grade row: 15 + 15 = 30;
- New marginal frequency of the golf column: 15 + 8 = 23.
- Analyze why the sum of the marginal frequency of "11th graders" (25) and the marginal frequency of "students playing golf" (18) is greater than the total number of people (45): Because there is an overlapping part (the 10 students who are 11th graders and play golf are counted twice).
4. Problem-Solving Techniques
- Identify the table structure: Clarify the row variables (e.g., grade) and column variables (e.g., sport type) of the two-way table, and distinguish between inner cells (joint frequencies) and edge totals (marginal frequencies).
- Calculate marginal frequencies: If joint frequencies are known, obtain marginal frequencies by "summing the joint frequencies in the same row/column"; conversely, if marginal frequencies and some joint frequencies are known, unknown joint frequencies can be found by subtraction (e.g., "row total - known joint frequency = unknown joint frequency").
- Verify data consistency: Use the rule that "the sum of all row marginal frequencies = the sum of all column marginal frequencies = total number of people" to check the correctness of calculations.
- Analyze with relative frequencies: When it is necessary to analyze proportional relationships, joint frequencies or marginal frequencies can be divided by the total number of people to obtain relative frequencies (such as the percentage of a joint frequency in the total), which makes it more intuitive to compare data distributions.