| Term | Definition |
| --- | --- |
| statistics | collection, display, and analysis of data |
| population | all the people eligible for a data collection investigation |
| sample | part of a population selected so as to give information about the population as a whole |
| frequency | the number of times an event occurs |
| range | upper extreme minus lower extreme |
| survey | asking people for their opinions |
| mode | the most frequently observed value of the measurements in the sample |
| mean | the sum of all the results included in the sample divided by the number of observations |
| median | the middle value of all the numbers in the sample when they are arranged in order |
| quartile | divides the total distribution into four parts of equal frequency |
| inference | conclusion |
| bias | an unwanted influence on a sample |
| axis | a line drawn through the center of a figure |
| scale | a sequence of marks, usually along a line, used in making measurements |
| proportional | one variable is proportional to another if the ratio of corresponding values remains constant |
| interpolation | to estimate a value by following a pattern and staying within the values already known |
| extrapolation | to estimate a value by following a pattern and going beyond the values already known |
| discrete | variable where measurements are distinct, periodic, and unconnected between data points |
| continuous | variable where measurements are uninterrupted and connected between data points |
| scatter plot | a graph that relates data from two different sets (2 words) |
| trend | relationship between two sets of data |
| correlation | relationship between sets of data |
| lower | minimum data value is the __________ extreme |
| upper | maximum data value is the __________ extreme |
| cluster | a particular segment of the population |
| gaps | spaces in the data set without a segment of the population |
| outlier | a point separated from the main body of the data |
| central | point within the range about which the rest of the data is considered balanced is the ____________ tendency |
| positive | both sets of data increase together is a ___________ correlation |
| negative | one set of data decreases as the other set of data increases is a ___________ correlation |
| no | the two data sets are not related is a ___________ correlation |
| weak | when the data is not clustered along an obvious line is a ___________ correlation |
| strong | when the data is clustered along an obvious line is a ___________ correlation |
| convenience sampling | an easily accessible group of people is chosen, and everyone in that group is surveyed |
| self-selective sampling | population provides information by volunteering their opinions |
| cluster sampling | a particular segment of the population is sampled using existing lists (Constituencies, Wards, Households, ...) |
| systematic sampling | every nth member of the population is sampled. The list being sampled may be ordered (alphabetical, seniority, street number, etc.) |
| stratified random sampling | the population is divided into groups (strata) and data are collected from each stratum by simple random sampling |
| simple random sampling | the sample is chosen randomly from the population. Here each sample of size n from the population of size N has an equal chance of selection |
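
Several of the terms above (mean, median, mode, range, and three of the sampling methods) can be tried out in a few lines of Python. This is only a minimal sketch: the `scores` list, the 20-member `population`, the two strata, and the sample sizes are all invented for illustration and do not come from the table.

```python
import random
import statistics

# Hypothetical sample of scores, invented purely for illustration.
scores = [4, 7, 7, 8, 10, 12, 15]

mean = statistics.mean(scores)          # sum of the results / number of observations
median = statistics.median(scores)      # middle value once the data is in order
mode = statistics.mode(scores)          # most frequently observed value
data_range = max(scores) - min(scores)  # upper extreme minus lower extreme

# Hypothetical population of 20 numbered members.
population = list(range(1, 21))
rng = random.Random(0)  # fixed seed so repeated runs give the same samples

# Simple random sampling: each sample of size n = 5 is equally likely.
simple_sample = rng.sample(population, 5)

# Systematic sampling: every nth member, here n = 4, starting at the 4th.
systematic_sample = population[3::4]

# Stratified random sampling: split the population into strata (assumed here
# to be the lower and upper halves), then take a simple random sample of 2
# from each stratum.
strata = {"lower": population[:10], "upper": population[10:]}
stratified_sample = [
    member for group in strata.values() for member in rng.sample(group, 2)
]

print(mean, median, mode, data_range)
print(simple_sample, systematic_sample, stratified_sample)
```

For this made-up data the mean is 9, the median is 8, the mode is 7, and the range is 11. The systematic sample is always members 4, 8, 12, 16, and 20, while the simple and stratified samples depend on the random seed.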