Preview

Reliability Exercise

Good Essays
Open Document
Open Document
631 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Reliability Exercise
This test of novel problem solving is a measure of fluid intelligence (Doubleday, King, & Papageorgiou, 2002). People’s ability to solve novel problems is a stable characteristic, as it is largely genetically determined (Nairne, 2009). Test-retest is typically appropriate for measures with stable attributes, but this test’s novel nature makes it an inappropriate technique in regard to reliability. In effect, its novelty diminishes after the initial testing, producing difficulties due to practice effects, reactivity, or both. Since it has just 20 questions, furthermore, it is easier for examinees to remember a significant portion of its items and therefore either to remember the answers during the retest or to seek them out during the interval, resulting in erroneous score improvements (Yu, 2005). As it is impossible to discern the precise influences of any one factor, the interpretation of a test-retest coefficient is challenging, and with more appropriate reliability measures available temporal stability should not be used for this test.
Alternate-forms reliability eliminates some of the reactivity associated with test-retest, but it is nonetheless an inappropriate reliability measure for this test due to the possible carryover effects of strategy. Even when each specific item’s content is novel or unfamiliar, examinees may accustom themselves to the test’s style and subsequently apply the same principle used to solve one problem to another (Groth-Marnat, 2009). Truly equivalent forms are already difficult to develop, but together with the increasing difficulty of items in this test, assuming that no two items are the same, it makes generating a reliable alternate form unfeasible.
This test’s dichotomous scoring protocol is designed to assess problem-solving ability objectively with questions being answered either correctly or incorrectly. Such a standardised procedure independently considerably eliminates subjective influence, and assessing inter-rater

You May Also Find These Documents Helpful

  • Good Essays

    Reliability

    • 514 Words
    • 2 Pages

    Validity: Look at the population used for the VMQ and the populations for the tests used to evaluate the VMQ’s validity. Do you believe that the populations of the other tests are comparable…

    • 514 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    This paper defines and critiques the Wide Range Achievement Test-4 (WRAT-4). The first test edition was developed by Sidney Bijou and Joseph Jastak in 1941, and was published in 1946 (Wilkinson, Robertson, 2006). The WRAT-4 was developed and published by Dr. Gary S. Wilkinson and Dr. Gary J. Robertson in 2006 (Hasinger,…

    • 53 Words
    • 1 Page
    Satisfactory Essays
  • Good Essays

    Using standardized tests to assess a person’s cognitive and learning ability is a common practice in all kinds of institutions and has been debated for years. The primary purpose of such tests is to screen out large number of applications that don’t meet the minimum requirements. The key to correct use of such tests is to ensure the content, format and process of taking the test matches with the requirements of the job.…

    • 1125 Words
    • 5 Pages
    Good Essays
  • Good Essays

    2003 Dbq Analysis

    • 479 Words
    • 2 Pages

    Document 2 states, “Here is Gerald Bracey’s list of some of the biggies that we generally don’t even try to use standardized test to measure: creativity, critical thinking, resilience, motivation, persistence, enthusiasm, empathy, self-discipline, resourcefulness, honesty, and integrity-to name a few.” It is evidently shown that Document 2 addressed a common issue with standardized test and this acts as a counterclaim when supporters of standardized test say that it covers everything. As a result, this allots Document 2 great credibility and…

    • 479 Words
    • 2 Pages
    Good Essays
  • Better Essays

    Test Review: Wjiii

    • 1165 Words
    • 5 Pages

    The authors of the Woodcock-Johnson III battery, created the assessment to determine an individual’s cognitive strengths and weaknesses, the nature of any impairments, and to aid in diagnosis (Child-trends, 2004). However, it has also been used to make decisions concerning educational achievement and scholastic aptitude for school aged individuals (Riverside publishing, 2012). It is a full battery assessment, which consists of two separate tests; the test of cognitive abilities and the test of achievement (Riverside, 2012). The Test of cognitive abilities measures both general and specific cognitive functions, and the test of achievement is used to determine and describe one’s academic strengths and weaknesses (Child-trends, 2004). There are extended versions of each test (Child-trends, 2004). The authors of the WJIII are Richard Woodcock, Kevin McGrew, Nancy Mather, and Fredrick Schrank. The test is published by Riverside Publishing Company (Riverside publishing, 2012). It is designed to measures general and specific cognitive abilities, scholastic aptitude,…

    • 1165 Words
    • 5 Pages
    Better Essays
  • Better Essays

    Intro to Psych

    • 4855 Words
    • 20 Pages

    - tests that evaluate your overall cognitive ability to learn and solve problems general aptitude can be seen as intelligence…

    • 4855 Words
    • 20 Pages
    Better Essays
  • Powerful Essays

    Cited: Anderson, Scarvia B., and John S. Helmick. On Educational Testing. San Francisco: Jossey-Bass, 1983. Print.…

    • 2569 Words
    • 7 Pages
    Powerful Essays
  • Powerful Essays

    The Values and Motives Questionnaire, also known as the Values and Motives Inventory, is designed to examine a person’s motivation in relation to his values and activities. In order to ensure a comprehensive understanding of values, the VMQ assess three distinct areas, including: interpersonal, intrinsic, and extrinsic. Interpersonal values, according to the VMQ refer to one’s relationships with others. Intrinsic values contain one’s personal beliefs and attitudes. Finally, extrinsic values are one’s motivating factors at the workplace. Each of these three areas contain twelve topics addressed during the test. While the VMQ can be used for a variety of reasons, it is typically used in the workplace as a guidance tool. When exploring the Values and Motives Questionnaire, it is important to understand its reliability and validity. This paper will address the measurement’s reliability and validity, including its coefficients, strengths, and weaknesses.…

    • 1068 Words
    • 5 Pages
    Powerful Essays
  • Powerful Essays

    Intelligence is an intrapersonal phenomenon, that is inside a person and it is generally agreed that the nature of this energy is unknown. Nevertheless, it may be known by its mental products (Groth-Marnet, 1997; Wechsler, 1939). Because there are many different ways to be intelligent there have also been many different definitions proposed (see Neiser, et al., 1996 for summary). A consensus on what constitutes intelligence is generally lacking. Alfred Binet (1908), the author of one of the first modern intelligence tests, defined intelligence as the inclination to take and maintain a specific direction, and capacity to adapt to achieve a goal outcome, and the power of autocriticism (Kaplan, & Saccuzzo, 2005). In contrast, David Wechsler, the developer of the Wechsler scales, defined intelligence as the aggregate capacity to act purposefully, think rationally, and deal effectively with the environment (Wechsler, 1958 as cited in Kaplin, & Saccuzzo). A review by Sternberg, (2005) of intelligence literature over the past century by psychologists and intelligence experts reveals two…

    • 4122 Words
    • 17 Pages
    Powerful Essays
  • Good Essays

    1. Find a 95% confidence interval for the true proportion of the professor’s students who were…

    • 770 Words
    • 6 Pages
    Good Essays
  • Satisfactory Essays

    Ethical Speaking Analysis

    • 423 Words
    • 2 Pages

    I’m not really sure about this test, because I don’t believe I have ever taking one before. I feel that IQ isn’t really a measure of how good you are in school. It is a direct reflection of how quickly you learn and the potential depth of thought you are capable of. This extends into creativity and every facet of interaction with reality; it certainly goes beyond the scope of knowledge and education. IQ test is an accurate measure of a person’s intelligence, only that there are certain environmental factors that can affect it. It has also been proven that results from the score of a standard IQ test may vary up to 15 points, when the person being tested is affected by factors such as mood, anxiety, emotions and biochemistry. In order to lessen the effects of these factors, many people choose to take multiple IQ tests instead of single standard IQ test, simply because the former test gives a more accurate perception.…

    • 423 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Intellectual Power

    • 637 Words
    • 3 Pages

    Gottfredson L. & Saklofske D. (2009). Intelligence: Foundations and Issues in Assessment. Canadian Psychology © 2009 Canadian Psychological Association. Vol. 50, No. 3, 183–195…

    • 637 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    Camara, W. J., Nathan, J. S, & Puente, A. E. (2000). Psychological test usage: Implications in…

    • 1650 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    Child Psychology

    • 517 Words
    • 3 Pages

    Another problem with IQ tests is that the scoring might be too subjective. A number of alternative IQ tests have been put forward to measure intelligent behaviour. These include elementary cognitive tasks, visual illusions and the Raven’s standard Progressive matrices. This last test was created to determine a person’s non-verbal intelligence. This test requires a person to identify missing elements in a series of patterns, with each pattern becoming increasingly more difficult. The test measures the ability to make sense of complex data and the ability to retain…

    • 517 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    As a further check on the validity of EPPS results, Edwards included a consistency scale with 15 pairs of statement repeated in identical form. In other words, the 210 pairs of statements, only 195 are unique. The 15 that occur twice are presented more or less randomly throughout the test. With this format the number of times a subject make the identical choice can be converted to a percentile based on a normative data. Inventories consisted of 225 pairs of statements in which items from each of the 15 scales paired with other items from the 14 plus pairs of twelve other items to check consistency optional. This leaves the number of items (14x15) at 210. Edwards has used 15…

    • 2096 Words
    • 9 Pages
    Powerful Essays

Related Topics