A | B |
Case | An individual about whom or which we have data. |
Categorical variable | A variable that names categories with words or numbers. |
Context | Tells the Who, What, How, Where, When, Why a study was performed. |
Data | Systematically recorded information, whether numbers or labels together with its context. |
Data table | An arrangement of data where rows represent a case and each column represents a variable. |
Population | All of the cases we wish we knew about. |
Quantitative variable | A variable in which the numbers act as numerical values; they always have units. |
Sample | The cases actually examined in seeking to understand the much larger population. |
Units | A quantity or amount adopted as a standard of measurement, such as meters, seconds, grams. |
Variable | Holds information about the same characteristics for many cases. |
statistics | Calculations made from data |
Simpson's Paradox | Taking averages across groups that appear to contradict overall averages. |
Frequency Table | Lists the categories in a categorical variable and gives the count of observations for each category |
Distribution | Gives the possible values of a variable and the relative frequency of each value |
Area principle | Represent each data value by the same amount of area in a statistical display. |
Bar chart | Uses a bar whose area represents the count of observations for each category of a categorical variable. |
pie chart | Shows the "whole" divided into categories with a wedge of a circle whose area corresponds to the proportion of each category. |
Categorical data condition | Data are counts or percentages of individual cases in categories (not quantitative data) |
Contingency table | Displays counts or percentages of individuals falling into named categories on two or more variables. |
Marginal distribution | The distribution of either variable alone in a contingency table. |
Conditional distribution | The distribution of a variable restricting the WHO to consider only a smaller group of individuals. |
Independence | When the conditional distribution of one variable is the same for each category of the other variable. |
Segmented bar chart | Displays conditional distribution of a categorical variable within each category of another variable. |