| A | B |
| reliability | dependabilty or consistency or an instrument across time or items |
| correlation | a statistical method of observing the degree of a relaitonship between two sets of data or two variables |
| correlation coefficient | expression of a relationship between two variables |
| scattergram | graphic representation of a correlation |
| Pearson's r | a statistical formula for determining strength and direction of correlation |
| internal consistancy | the consistency of the items on an instrument to measure a skill, trait, or domain |
| test-retest reliabiltiy | study that employs the readminstration of a single instrument to check for consistency across time |
| equivalent forms reliabitiy | consistency of a test to measure some domain, trait, or skill using like forms of the same instrument |
| alternate forms reliabitiy | synonymous term for queivalent forms reliabity |
| split-half reliability | method of checking the consistency across items by halving a test and administering two half-forms of same test |
| Kuder-Richardson (K-R) 20 | formula used to check consistency across items of instrumetn with right/wrong responses |
| coefficiant alpha | a formula used to check consistency across items of instrument with repsonses with varying credit |
| interrater reliabitiy | the consistency of a test to measure a skill, trait, or domain across examiners |
| true score | the students actual score |
| standard error of measurement | amount of error determined to exist using a specific instrument, claculated using the instrumets standard deviation and areliabity |
| obtained score | observed score of a student on a particular test on a given day |
| confidence interval | range of scores for an obtained score determined by adding and subtracting standard error of measurement units |
| estimated true score | method of calculating the amount of error correlated with the distance of the score from the mean of the group |
| validity | quality of a test; the degree to which an instrument measures what it was desinged to measure |
| criterion-related validity | statistical method of comparing an instruments ability to measure a skill, trait, or domain with an existing instrument or other criterion |
| concurrent validity | a comparison of one instrument with another within a short period of time |
| predictive validity | measure of how well an instrument can predict future on some other variable |
| content validity | occurs when the items contained withint the test are representative of the content pruported to be measured |
| presentation format | metod in which items of an instrument are presented to a student |
| response mode | method required for the examinee to answer items of an instrumet |
| construct validity | ability of an instrument to measure psychological constructs |
| validity of test use | appropriate use of a specific instrument |