Journal of Human Resources
Research Article

The High Stakes of Bad Exams

Jack Rossiter, Might Kojo Abreh, Aisha Ali and Justin Sandefur
Journal of Human Resources, November 2025, 60 (6) 2008-2037; DOI: https://doi.org/10.3368/jhr.0621-11739R1
Jack Rossiter
Jack Rossiter is a freelance education researcher.
  • For correspondence: jack.rossiter{at}barcelonagse.eu
Might Kojo Abreh
Might Kojo Abreh is at the Institute for Educational Planning and Administration, University of Cape Coast.
Aisha Ali
Aisha Ali is at the Center for Global Development.
Justin Sandefur
Justin Sandefur is at the Center for Global Development.

References

  1. Abreh, Might Kojo, Kofi Acheaw Owusu, and Francis Kodzo Amedahe. 2018. “Trends in Performance of WASSCE Candidates in the Science and Mathematics in Ghana: Perceived Contributing Factors and the Way Forward.” Journal of Education 198(1):113–23.
  2. AERA. 2014. Standards for Educational and Psychological Testing. Washington, DC: American Educational Research Association.
  3. Agwu, Prince, Charles T Orjiakor, Aloysius Odii, Chinyere Onalu, Chidi Nzeadibe, Pallavi Roy, Obinna Onwujekwe, and Uzoma Okoye. 2022. “‘Miracle Examination Centres’ as Hubs for Malpractices in Senior Secondary School Certificate Examination in Nigeria: A Systematic Review.” International Journal of Educational Development 88:102538.
  4. Ajayi, Kehinde F. 2024. “School Choice and Educational Mobility: Lessons from Secondary School Applications in Ghana.” Journal of Human Resources 59(4):1207–43.
  5. Anzene, Solomon J. 2014. “Trends in Examination Malpractice in Nigerian Educational System and its Effects on the Socio-Economic and Political Development of Nigeria.” Asian Journal of Humanities and Social Sciences 2(3):1–8.
  6. Ballou, Dale, and Matthew G. Springer. 2017. “Has NCLB Encouraged Educational Triage? Accountability and the Distribution of Achievement Gains.” Education Finance and Policy 12(1):77–106.
  7. Barlevy, Gadi, and Derek Neal. 2012. “Pay for Percentile.” American Economic Review 102(5):1805–31.
  8. Bashir, Sajitha, Marlaine Lockheed, Elizabeth Ninan, and Jee-Peng Tan. 2018. “Facing Forward: Schooling for Learning in Africa.” Washington, DC: The World Bank.
  9. Bergbauer, Annika B., Eric A. Hanushek, and Ludger Woessmann. 2024. “Testing.” Journal of Human Resources 59(2):349–88.
  10. Birnbaum, Allan. 1968. “Some Latent Trait Models and Their Use in Inferring an Examinee’s Ability.” In Statistical Theories of Mental Test Scores, ed. F.M. Lord and M.R. Novick, 397–479. Reading, MA: Addison-Wesley.
  11. Bolt, Daniel M. 1999. “Evaluating the Effects of Multidimensionality on IRT True-Score Equating.” Applied Measurement in Education 12(4):383–407.
  12. Braun, Henry. 2004. “Reconsidering the Impact of High-stakes Testing.” Education Policy Analysis Archives 12(1).
  13. Burdett, Newman. 2017. “Review of High Stakes Examination Instruments in Primary and Secondary School in Developing Countries.” RISE Working Paper 17/018. Oxford: Research on Improving Systems of Education.
  14. Burdett, Newman, Emily Houghton, Claire Sargent, and Jo Tisi. 2013. “Maintaining Qualification and Assessment Standards: Summary of International Practice.” Report. Slough, UK: National Foundation for Educational Research.
  15. Burge, Bethan, and Louise Benson. 2020. “National Reference Test Results Digest 2020.” Report. Slough, UK: National Foundation for Educational Research.
  16. Campbell, Donald T. 1976. “Assessing the Impact of Planned Social Change.” Occasional Paper Series 8. Kalamazoo, MI: Western Michigan University.
  17. Dillard, Mary E. 2003. “Examinations Standards, Educational Assessments, and Globalizing Elites: The Case of the West African Examinations Council.” Journal of African American History 88(4):413–28.
  18. Duflo, Esther, Pascaline Dupas, and Michael Kremer. 2021. “The Impact of Free Secondary Education: Experimental Evidence from Ghana.” NBER Working Paper Series 28937. Cambridge, MA: NBER.
  19. Foy, Pierre, and Liqun Yin. 2016. “Scaling the TIMSS 2015 Achievement Data.” In Methods and Procedures in TIMSS 2015, ed. M.O. Martin, I.V.S. Mullis, and M. Hooper, 13.1–13.62. Boston, MA: Boston College, TIMSS & PIRLS International Study Center.
  20. Furuta, Jared. 2021. “Western Colonialism and World Society in National Education Systems: Global Trends in the Use of High-stakes Exams at Early Ages 1960 to 2010.” Sociology of Education 94(1):84–101.
  21. Glewwe, Paul, Nauman Ilias, and Michael Kremer. 2010. “Teacher Incentives.” American Economic Journal: Applied Economics 2(3):205–27.
  22. Gneezy, Uri, John A. List, Jeffrey A. Livingston, Xiangdong Qin, Sally Sadoff, and Yang Xu. 2019. “Measuring Success in Education: The Role of Effort on the Test Itself.” American Economic Review: Insights 1(3):291–308.
  23. Ho, Andrew Dean. 2008. “The Problem with ‘Proficiency’: Limitations of Statistics and Policy Under No Child Left Behind.” Educational Researcher 37(6):351–60.
  24. IEA. 2018. “Rosetta Stone: Measuring Global Progress Towards SDG4 by Linking Assessments Results to TIMSS and PIRLS International Benchmarks of Achievement.” Technical Report. Amsterdam: International Association for the Evaluation of Educational Achievement.
  25. Jacob, Brian A. 2005. “Accountability, Incentives and Behavior: The Impact of High-Stakes Testing in the Chicago Public Schools.” Journal of Public Economics 89(5–6):761–96.
  26. Kellaghan, Thomas, and Vincent Greaney. 2019. “Public Examinations Examined.” Report. Washington, DC: The World Bank.
  27. Kolen, Michael, and Robert Brennan. 2014. “Test Equating, Scaling, and Linking: Methods and Practices.” Third edition. New York: Springer.
  28. Le Nestour, Alexis, Laura Moscoviz, and Justin Sandefur. 2022. “The Long-Run Decline of Education Quality in the Developing World.” CGD Working Paper 608. Washington, DC: Center for Global Development.
  29. Leschnig, Lisa, Guido Schwerdt, and Katarina Zigova. 2022. “Central Exams and Adult skills: Evidence from PIAAC.” Economics of Education Review 90:102289.
  30. Malik, Ikram Ali, Muhammad Sarwar, and Aqeel Imran. 2017. “Quality and Standardization: A Twin-Dilemma of Public Examinations at Higher Secondary School Level in Pakistan.” Report. Islamabad: Federal Board of Intermediate and Secondary Education.
  31. MBSSE. 2019. “West African Senior School Certificate Examination (WASSCE) 2019.” Report. Freetown: Ministry of Basic and Senior Secondary Education.
  32. Ministry of Education. 2018. “National Pre-Tertiary Education Curriculum Framework.” Report. Ministry of Education. Accra: National Council for Curriculum and Assessment.
  33. Muraki, Eiji. 1992. “A Generalized Partial Credit Model: Application of an EM Algorithm.” Applied Psychological Measurement 16(2):159–76.
  34. Neal, Derek. 2010. “Aiming for Efficiency Rather Than Proficiency.” Journal of Economic Perspectives 24(3):119–32.
  35. Neal, Derek. 2013. “The Consequences of Using One Assessment System to Pursue Two Objectives.” Journal of Economic Education 44(4):339–52.
  36. Newton, Paul E. 2007. “Contextualising the Comparability of Examination Standards.” In Techniques for Monitoring the Comparability of Examinations Standards, ed. P. Newton, J. Baird, H. Goldstein, H. Patrick and P. Tymms, 9–42. London: Qualifications and Curriculum Authority.
  37. Ozier, Owen. 2018. “The Impact of Secondary Schooling in Kenya: A Regression Discontinuity Analysis.” Journal of Human Resources 53(1):157–88.
  38. Patel, Dev, and Justin Sandefur. 2020. “A Rosetta Stone for Human Capital.” CGD Working Paper 550. Washington, DC: Center for Global Development.
  39. Phelps, Richard P. 2012. “The Effect of Testing on Student Achievement 1910–2010.” International Journal of Testing 12(1):21–43.
  40. Pointer, William. 2014. “Setting the Grade Standards in the First Year of the New GCSEs.” Report. QA Centre for Education Research and Practice.
  41. Rossiter, Jack, Might Kojo Abreh, Aisha Ali, and Justin Sandefur. 2023. “Replication Data for: The High Stakes of Bad Exams.” Harvard Dataverse. https://doi.org/10.7910/DVN/VM5BOQ
  42. Rossiter, Jack, and Maimouna Konate. 2022. “Studying School Exams: A New Database.” Report. Washington, DC: Center for Global Development.
  43. Singh, Abhijeet. 2020. “Myths of Official Measurement: Auditing and Improving Administrative Data in Developing Countries.” RISE Working Paper 20/042. Oxford: Research on Improving Systems of Education.
  44. Smith, William C. 2016. “An Introduction to Global Testing Culture.” In The Global Testing Culture: Shaping Education Policy, Perceptions, and Practice, ed. William C. Smith, 7–24. Oxford: Symposium Books Ltd.
  45. U.K. House of Commons. 2020. “Oral Evidence Taken before the Education Committee on 16 September 2020, on Accountability Hearings, HC 262.” House of Commons of the United Kingdom of Great Britain and Northern Ireland.
  46. Vanguard. 2011. “NECO Releases Results, Records Another Mass Failure.” Vanguard, September 23.
  47. WAEC. 2019. “Manual of Procedures of Activities for Test Development Division.” Report. Accra, Ghana: West African Examination Council.
  48. Wingersky, Marilyn S., and Frederic M. Lord. 1984. “An Investigation of Methods for Reducing Sampling Error in Certain IRT Procedures.” Applied Psychological Measurement 8(3):347–64.
  49. Woessmann, Ludger. 2018. “Central Exit Exams Improve Student Outcomes.” IZA World of Labor Working Paper. Bonn, Germany: IZA.
  50. World Bank. 2020. “The Human Capital Index 2020 Update: Human Capital in the Time of COVID-19.” Report. Washington, DC: The World Bank.
  51. Zimmerman, Seth D. 2014. “The Returns to College Admission for Academically Marginal Students.” Journal of Labor Economics 32(4):711–54.
Journal of Human Resources
Vol. 60, Issue 6
1 Nov 2025
Keywords

  • I25
  • I26
  • J24
  • O15
  • O55

© 2026 Board of Regents of the University of Wisconsin System
