News Feeds

Standardized Admission Tests Are Not Biased. In Fact, They’re Fairer Than Other Measures

Skeptic.com feed - Thu, 05/22/2025 - 3:08pm
“It ain’t what you know that gets you into trouble. It’s what you know for sure that just ain’t so.” —Mark Twain

When it comes to opinions concerning standardized tests, it seems that most people know for sure that tests are simply terrible. In fact, a recent article published by the National Education Association (NEA) began by saying, “Most of us know that standardized tests are inaccurate, inequitable, and often ineffective at gauging what students actually know.” [1] But do they really know that standardized tests are all these bad things? What does the hard evidence suggest? In the same article, the author quoted a first-grade teacher who advocated teaching to each student’s particular learning style, another ill-conceived educational fad [2] that, unfortunately, draws as much praise as standardized tests draw damnation.

Indeed, a typical post in even the most prestigious of news outlets [3, 4] will make several negative claims about standardized admission tests. In this article, we describe each of those claims and then review what mainstream scientific research has to say about them.

Claim 1: Admission tests are biased against historically disadvantaged racial/ethnic groups.

Response: There are racial/ethnic average group differences in admission test scores, but those differences do not qualify as evidence that the tests are biased.

The claim that admission tests are biased against certain groups is an unwarranted inference based on differences in average test performance among groups.

The differences themselves are not in question. They have persisted for decades despite substantial efforts to ameliorate them [5]. As shown in the table above and reviewed more comprehensively elsewhere [6, 7], average group differences appear on just about any test of cognitive performance, even those administered before kindergarten. Gaps in admission test performance among racial groups mirror other achievement gaps (e.g., high school GPA) that also manifest well before high school graduation. (Note: these group differences are differences between the averages, or technically the means, for the respective groups. The full range of scores is found within all the groups, and there is significant overlap between groups.)

Group differences in admission test scores do not mean that the tests are biased. An observed difference does not explain itself, and to presume that a group difference is due to a biased test is to presume an explanation of the difference. As noted recently by scientists Jerry Coyne and Luana Maroja, the existence of group differences on standardized tests is well known; what is not well understood is what causes the disparities: “genetic differences, societal issues such as poverty, past and present racism, cultural differences, poor access to educational opportunities, the interaction between genes and social environments, or a combination of the above.” [8] Test bias, then, is just one of many potential factors that could be responsible for group disparities in performance on admission tests. As we will see in addressing Claim 2, psychometricians have a clear empirical method for confirming or disconfirming the existence of test bias, and they have failed to find any evidence for its existence. (Psychometrics is the branch of psychology concerned with the theory and technique of measuring cognitive abilities and personality traits.)

Claim 2: Standardized tests do not predict academic outcomes.

Response: Standardized tests do predict academic outcomes, including academic performance and degree completion, and they predict with similar accuracy for all racial/ethnic groups.

The purpose of standardized admission tests is simple: to predict applicants’ future academic performance. Any metric that fails to predict is useless for making admission decisions. The Scholastic Assessment Test (now simply called the SAT) has predictive validity if it predicts outcomes such as college grade point average (GPA), whether the student returns for the second year (retention), and degree completion. Likewise, the Graduate Record Examination (GRE) has predictive validity if it predicts outcomes such as graduate school GPA, degree completion, and the important real-world measure of publications. In practice, predictive validity, for example between SAT scores and college GPA, implies that if you pull two SAT-takers at random off the street, the one who earned a higher score on the SAT is more likely to earn a higher GPA in college (and is less likely to drop out).

The predictive utility of standardized tests is solid and well established. In the same way that blood pressure is an important but not perfect predictor of stroke, cognitive test scores are an important but not perfect predictor of academic outcomes. For example, the correlation between SAT scores and college GPA is around .5 [9, 10, 11], the correlations between GRE scores and various measures of graduate school performance range between .3 and .4 [12], and the correlation between Medical College Admission Test (MCAT) scores and licensing exam scores during medical school is greater than .6 [13]. Using aggregate rather than individual test scores yields even higher correlations: a college’s graduation rate can be predicted from the ACT/SAT scores of its incoming students. Based on 2019 data, the correlations between six-year graduation rate and a college’s 25th percentile ACT or SAT score are between .87 and .90 [14].
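To make the “two random test-takers” reading of a correlation concrete, here is a minimal sketch. It assumes, purely for illustration, that test scores and the later outcome follow a bivariate normal distribution; under that assumption the chance that the higher scorer also has the higher outcome is 1/2 + arcsin(r)/π. The function name is hypothetical and the model is not taken from the article.

```python
import math


def prob_higher_scorer_has_higher_outcome(r: float) -> float:
    """Chance that, of two randomly drawn people, the one with the higher
    test score also has the higher outcome.

    Assumes a bivariate normal model with correlation r (an illustrative
    assumption, not something the article states).
    """
    if not -1.0 <= r <= 1.0:
        raise ValueError("a correlation must lie between -1 and 1")
    return 0.5 + math.asin(r) / math.pi


# Correlations in the range the article reports, plus the no-relationship case.
for r in (0.0, 0.3, 0.5, 0.6):
    print(f"r = {r:.1f}: {prob_higher_scorer_has_higher_outcome(r):.3f}")
```

Under this model a correlation of .5 works out to a two-in-three chance: clearly better than a coin flip, clearly short of certainty, which is the sense in which the predictor is important but not perfect.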

Research confirming the predictive validity of standardized tests is robust and provides a stark contrast to popular claims to the contrary [15, 17, 18]. The latter are not based on the results of meta-analyses [19, 20] nor on studies conducted by psychometricians [21–25]. Rather, those claims are based on cherry-picked studies that rely on select samples of students who have already been admitted to highly selective programs (partially because of their high test scores) and who therefore have a severely restricted range of test scores. For example, one often-mentioned study [26] investigated whether admitted students’ GRE scores predicted PhD completion in STEM programs and found that students with higher scores were not more likely to complete their degree. In another study of students in biomedical graduate programs at Vanderbilt [27], links between GRE scores and academic outcomes were trivial. However, because the samples of students in both studies had a restricted range of GRE scores (all scored well above average [28]), the results are essentially uninterpretable. This situation is analogous to predicting U.S. men’s likelihood of playing college basketball based on their height, but only including in the sample men who are well above average. If we want to establish the link between men’s height and playing college ball, it is more appropriate to begin with a sample of men who range from 5'1" (well below the mean) to 6'7" (well above the mean) than to begin with a restricted sample of men who are all at least 6'4" (two standard deviations above the mean). In the latter context, what best differentiates those who play college ball from those who do not is unlikely to be their height, since they are all quite tall to begin with.
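The range-restriction problem described above can be sketched with a small simulation. Everything here is synthetic and assumed for illustration: standardized scores, a true correlation of .5 in the full applicant pool, and an admission cutoff one standard deviation above the mean. The function names are hypothetical.

```python
import math
import random


def pearson(xs, ys):
    """Plain Pearson correlation between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def simulate(n=20000, true_r=0.5, cutoff=1.0, seed=0):
    """Return (r in full pool, r among admitted only, number admitted)."""
    rng = random.Random(seed)
    noise_sd = math.sqrt(1 - true_r ** 2)  # keeps the outcome at unit variance
    scores = [rng.gauss(0, 1) for _ in range(n)]
    outcomes = [true_r * s + rng.gauss(0, noise_sd) for s in scores]
    r_full = pearson(scores, outcomes)
    # Keep only "admitted" applicants: those scoring above the cutoff.
    admitted = [(s, o) for s, o in zip(scores, outcomes) if s > cutoff]
    r_restricted = pearson([s for s, _ in admitted], [o for _, o in admitted])
    return r_full, r_restricted, len(admitted)


r_full, r_restricted, n_admitted = simulate()
print(f"full applicant pool: r = {r_full:.2f}")
print(f"admitted only (n = {n_admitted}): r = {r_restricted:.2f}")
```

The test predicts the outcome equally well for every simulated person, yet the correlation computed only among high scorers comes out much weaker than the one computed in the full pool. That is the basketball-height situation in numerical form.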

Given these demonstrated facts about predictive validity, let’s return to the first claim, that admission tests are biased against certain groups. This claim can be evaluated by comparing the predictive validities for each racial or ethnic group. As noted previously, the purpose of standardized admission tests is to predict applicants’ future academic performance. If the tests serve that purpose similarly for all groups, then, by definition, they are not biased. And this is exactly what scientific studies find, time and time again. For example, the SAT is a strong predictor of first-year college performance and retention to the second year, and it predicts with essentially equal accuracy for students of varying racial and ethnic groups [29, 30]. Thus, regardless of whether individuals are Black, Hispanic, White, or Asian, if they score higher on the SAT, they have a higher probability of doing well in college. Likewise, individuals who score higher on the GRE tend to have higher graduate school GPAs and a higher likelihood of eventual degree attainment; and these correlations manifest similarly across racial/ethnic groups, males and females, academic departments and disciplines, and master’s as well as doctoral programs [31–34]. When differential prediction does occur, it is usually in the direction of slightly overpredicting Black students’ performance (such that Black students perform at a somewhat lower level in college than would be expected based on their test scores).
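One common way to run the comparison described above is a regression-based check: fit a single prediction line to everyone, then ask whether any group’s actual outcomes sit systematically above or below that line. The sketch below uses synthetic data only. The groups “A” and “B”, the GPA scale, and all coefficients are assumptions chosen so that the two groups differ in average score while the test predicts identically for both.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return slope, my - slope * mx


def mean_residual_by_group(records, slope, intercept):
    """Average of (actual - predicted) per group under the common line."""
    totals = {}
    for group, score, outcome in records:
        running_sum, count = totals.get(group, (0.0, 0))
        residual = outcome - (intercept + slope * score)
        totals[group] = (running_sum + residual, count + 1)
    return {g: s / c for g, (s, c) in totals.items()}


def simulate_unbiased_test(n_per_group=5000, seed=1):
    """Two groups with different mean scores but the same score-to-GPA rule."""
    rng = random.Random(seed)
    records = []
    for group, mean_score in (("A", 0.0), ("B", -0.5)):
        for _ in range(n_per_group):
            score = rng.gauss(mean_score, 1.0)
            gpa = 3.0 + 0.4 * score + rng.gauss(0, 0.5)
            records.append((group, score, gpa))
    return records


records = simulate_unbiased_test()
slope, intercept = fit_line([r[1] for r in records], [r[2] for r in records])
for group, resid in sorted(mean_residual_by_group(records, slope, intercept).items()):
    print(f"group {group}: mean residual = {resid:+.3f}")
```

Mean residuals near zero for both groups indicate that the common line neither under- nor overpredicts for either, even though the groups’ average scores differ. A clearly negative mean residual for a group would correspond to the overprediction pattern mentioned in the text.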

Claim 3: Standardized tests are just indicators of wealth or access to test preparation courses.

Response: Standardized tests were designed to detect (sometimes untapped) academic potential, which is very useful; and controlling for wealth and privilege does not detract from their utility.

Some who are critical of standardized tests say that their very existence is racist. That argument is not borne out by the history and expansion of the SAT. One of the long-standing purposes of the SAT has been to lessen the use of legacy admissions (set-asides for the progeny of wealthy donors to the college or university) and thereby to draw college students from more walks of life than elite high schools of the East Coast [35]. Standardized tests have a long history of spotting “diamonds in the rough”: underprivileged youths of any race or ethnic group whose potential has gone unnoticed or who have under-performed in high school (for any number of potential reasons, including intellectual boredom). Notably, comparisons of Black and White students with similar 12th grade test scores show that Black students are more likely than White students to complete college [36]. And although most of us think of the SAT and comparable American College Test (ACT) as tests taken by high school juniors and seniors, these tests have a very successful history of identifying intellectual potential among middle-schoolers [37] and predicting their subsequent educational and career accomplishments [38].

Students of higher socioeconomic status (SES) do tend to score higher on the SAT and fare somewhat better in college [39]. However, this link is not nearly as strong as many people, especially critics of standardized tests, tend to assume: 17 percent of the top 10 percent of ACT and SAT scores come from students whose family incomes fall in the bottom 25 percent of the distribution [40]. Further, if admission tests were mere “wealth” tests, the association between students’ standardized test scores and performance in college would be negligible once students’ SES is accounted for statistically. Instead, the association between SAT scores and college grades (estimated at .47) is essentially unchanged (moving only to .44) after statistically controlling for SES [41, 42].
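“Statistically controlling for SES” can be illustrated with the standard first-order partial correlation formula. In the sketch below only the .47 comes from the text; the two SES correlations are hypothetical placeholders chosen for illustration, so the output is not an attempt to reproduce the cited .44.

```python
import math


def partial_correlation(r_xy: float, r_xz: float, r_yz: float) -> float:
    """Correlation between x and y after removing what each shares with z."""
    denom = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    if denom == 0:
        raise ValueError("z is perfectly correlated with x or y")
    return (r_xy - r_xz * r_yz) / denom


R_SAT_GPA = 0.47  # SAT score vs. college grades, as quoted in the text
R_SES_SAT = 0.25  # HYPOTHETICAL placeholder: SES vs. SAT score
R_SES_GPA = 0.15  # HYPOTHETICAL placeholder: SES vs. college grades

adjusted = partial_correlation(R_SAT_GPA, R_SES_SAT, R_SES_GPA)
print(f"SAT-GPA correlation controlling for SES: {adjusted:.2f}")

# A pure "wealth test" would look different: if the SAT-GPA link existed only
# because both track SES, then r_xy would equal r_xz * r_yz and the partial
# correlation would collapse to zero.
print(f"wealth-test scenario: {partial_correlation(0.06, 0.3, 0.2):.2f}")
```

With these placeholder inputs the adjusted value stays close to the unadjusted .47, whereas the “wealth test” scenario drops to zero. That contrast is the logic behind the argument in the paragraph above.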

A related common criticism of standardized tests is that higher SES students have better access to special test preparation programs and specific coaching services that advertise their potential to raise students’ test scores. The findings from systematic research, however, are clear: test preparation programs, including semester-long, weekly, in-person structured sessions with homework assignments [43], produce limited gains, and this is the case for the ACT, SAT, GRE, and LSAT [44–47]. Average gains are small, approximately one-tenth to one-fifth of a standard deviation. Moreover, free test preparation materials are readily available at libraries and online; and for tests such as the SAT and ACT, many high schools now provide, and often require, free in-class test preparation sessions during the year leading up to the test.

Claim 4: Admission decisions are fairer without standardized tests.

Response: The admissions process will be less useful, and more unfair, if standardized tests are not used.

According to the fairtest.org website, in 2019, before the pandemic, just over 1,000 colleges were test-optional. Today, there are over 1,800. In 2022–2023, only 43 percent of applicants submitted ACT/SAT scores, compared to 75 percent in 2019–2020 [48]. Currently, there are over 80 colleges that do not consider ACT/SAT scores in the admissions process even if an applicant submits them. These colleges are using a test-free or test-blind admissions policy. The same trend is occurring for the use of the GRE among graduate programs [49].

The movement away from admission tests began before the COVID-19 pandemic but was accelerated by it, and there are multiple reasons why so many colleges and universities are remaining test-optional or test-free. First, very small colleges (and programs) have taken enrollment hits and suffered financially. By eliminating the tests, they hope to attract more applicants and, hopefully, enroll more students. Once a few schools go test-optional or test-free, other schools feel they have to as well in order to be competitive in attracting applicants. Second, larger, less-selective schools (and programs) can similarly benefit from relaxed admission standards by enrolling more students, which, in turn, benefits their bottom line. Both types of schools also increase their percentages of minority student enrollment. It looks good to their constituents that they are enrolling young people from historically underrepresented groups and giving them a chance at success in later life. Highly selective schools also want a diverse student body but, similar to the previously mentioned schools, will not see much of a change in minority graduation rates simply by lowering admission standards if they also maintain their classroom academic standards. They will get more applicants, but they are still limited by the number of students they can serve. Rejection rates increase (due to more applicants) and other metrics become more important in identifying which students can succeed in a highly competitive academic environment.

There are multiple concerns with not including admission tests as a metric to identify students’ potential for succeeding in college and advanced degree programs, particularly those programs that are highly competitive. First, the admissions process will be less useful. Other metrics, with the exception of high school GPA as a solid predictor of first-year grades in college, have lower predictive validity than tests such as the SAT. For example, letters of recommendation are generally considered nearly as important as test scores and prior grades, yet letters of recommendation are infamously unreliable: there is more agreement between two letters about two different applicants from the same letter-writer than there is between two letters about the same applicant from two different letter-writers [50]. (Tip to applicants: make sure you ask the right person to write your recommendation.) Moreover, letters of recommendation are weak predictors of subsequent performance. The validity of letters of recommendation as a predictor of college GPA hovers around .3; and although letters of recommendation are ubiquitous in applications for entry to advanced degree programs, their predictive validity in that context is even weaker [51]. More importantly, White and Asian students typically get more positive letters of recommendation than students from underrepresented groups [52]. For colleges that want a more diverse student body, placing more emphasis on admission metrics that also reveal race differences will not help.

This brings us to our second concern. Because race differences exist in most metrics that admission officers would consider, getting rid of admission test scores will not solve any problems. For example, race differences in performance on Advanced Placement (AP) course exams, now used as an indicator of college readiness, are substantial. In 2017, just 30 percent of Black students’ AP exams earned a qualifying score compared to more than 60 percent of Asian and White students’ exams [53]. Similar disparities exist for high school GPA; in 2009, Black students averaged 2.69, whereas White students averaged 3.09 [54], even with grade inflation across U.S. high schools [55, 56]. Finally, as mentioned previously, race differences even exist in the very subjective letters of recommendation submitted for college admission [57].

Without the capacity to rely on a standard, objective metric such as an admission test score, some admissions committee members may rely on subjective factors, which will only exacerbate any disparate representation of students who come from lower-income families or historically underrepresented racial and ethnic groups. For example, in the absence of standardized test scores, admissions committee members may give more attention to the name and reputation of students’ high school, or, in the case of graduate admissions, the name recognition of their undergraduate research mentor and university. Admissions committees for advanced degree programs may be forced to pay greater attention to students’ research experience and personal statements, which are unfortunately susceptible to a variety of issues, not the least being that students of high socioeconomic backgrounds may have more time to invest in gaining research experience, as well as the resources to pay for “assistance” in preparing a well-written and edited personal statement [58].

So why continue to shoot the messenger?

If scientists were to find that a medical condition is more common in one group than in another, they would not automatically presume the diagnostic test is invalid or biased. As one example, during the pandemic, COVID-19 infection rates were higher among Black and Hispanic Americans compared to White and Asian Americans. Scientists did not shoot the messenger or engage in ad hominem attacks by claiming that the very existence of COVID tests or support for their continued use is racist.

Sadly, however, that is not the case with standardized tests of college or graduate readiness, which have been attacked for decades [59], arguably because they reflect an inconvenient, uncomfortable, and persistent truth in our society: There are group differences in test performance, and because the tests predict important life outcomes, the group differences in test scores forecast group differences in those life outcomes.

The attack on testing is likely rooted in a well-intentioned concern that the social consequences of test use are inconsistent with our social values of equality [60]. That is, there is a repeated and illogical rejection of what “is” in favor of what educators feel “ought” to be [61]. However, as we have seen in addressing misconceptions about admission tests, removing tests from the process is not going to address existing inequities; if anything, it promises to exacerbate them by denying the existence of actual performance gaps. If we are going to move forward on a path that promises to address current inequities, we can best do so by assessing as accurately as possible each individual to provide opportunities and interventions that coincide with that individual’s unique constellation of abilities, skills, and preferences [62, 63].

Categories: Critical Thinking, Skeptic

A faster, more reliable method for simulating the plasmas used to make computer chips

Computers and Math from Science Daily Feed - Thu, 05/22/2025 - 1:27pm
Researchers developed a faster, more stable way to simulate the swirling electric fields inside industrial plasmas -- the kind used to make microchips and coat materials. The improved method could lead to better tools for chip manufacturing and fusion research.
Categories: Science

An artificial protein that moves like something found in nature

Matter and energy from Science Daily Feed - Thu, 05/22/2025 - 1:26pm
Proteins catalyze life by changing shape when they interact with other molecules. The result is a muscle twitching, the perception of light, or a bit of energy extracted from food. The ability to engineer shapeshifting proteins opens new avenues for medicine, agriculture, and beyond.
Categories: Science

A new approach could fractionate crude oil using much less energy

Matter and energy from Science Daily Feed - Thu, 05/22/2025 - 1:25pm
Engineers developed a membrane that filters the components of crude oil by their molecular size, an advance that could dramatically reduce the amount of energy needed for crude oil fractionation.
Categories: Science

New dwarf planet spotted at the edge of the solar system

New Scientist Feed - Thu, 05/22/2025 - 1:00pm
The unusual orbit of a possible dwarf planet, known as 2017 OF201, makes it less likely that our solar system contains a hidden ninth “Planet X”
Categories: Science

Ultracold atoms have been 'hyperentangled' for the first time

New Scientist Feed - Thu, 05/22/2025 - 12:00pm
By exerting unprecedented control over extremely cold atoms, researchers have put them in a state with several simultaneously quantum-entangled properties
Categories: Science

Giant ground sloths evolved three different times for the same reason

New Scientist Feed - Thu, 05/22/2025 - 12:00pm
An analysis of the sloth family tree suggests three different groups of the animals evolved to gigantic sizes in response to cold and dry conditions
Categories: Science

HERMES-PF's 6 CubeSats Watch The Entire Sky For High-Energy Bursts

Universe Today Feed - Thu, 05/22/2025 - 11:57am

Multi-messenger astronomy has been all the rage lately. It involves capturing data on the gravitational and electromagnetic signals from catastrophic cosmic events. However, with that newfound interest comes required updates to infrastructure. Gravitational wave detectors have been upgraded and will be even more sensitive soon. But to realize the promise of multi-messenger astronomy, scientists must have a fleet of spacecraft watching the entire sky for high-energy signals indicative of the events that cause gravitational waves. At least, that is the team's long-term plan behind the High Energy Rapid Modular Ensemble of Satellites Pathfinder (HERMES-PF) mission, which successfully launched in March and is currently undergoing commissioning.

Categories: Science

Our Solar System May Have a New Planetary Sibling: Another Dwarf Planet

Universe Today Feed - Thu, 05/22/2025 - 11:53am

Our understanding of our Solar System is still evolving. As our telescopes have improved, they've brought the Solar System's deeper reaches into view. Pluto was disqualified as a planet because of it. Now, new research says another dwarf planet may reside at the edge of the Solar System. Its presence supports the Planet X hypothesis.

Categories: Science

AI is here to stay, let students embrace the technology, experts urge

Computers and Math from Science Daily Feed - Thu, 05/22/2025 - 10:35am
A new study says students appear to be using generative artificial intelligence (GenAI) responsibly, and as a way to speed up tasks, not just boost their grades.
Categories: Science

New atom-swapping method applied to complex organic structures

Matter and energy from Science Daily Feed - Thu, 05/22/2025 - 9:54am
Chemists have developed an efficient skeletal editing method for frequently used heteroaromatic structures. The technique could serve as a means to chemically modify biologically active compounds.
Categories: Science

ALMA measures evolution of monster barred spiral galaxy

Space and time from Science Daily Feed - Thu, 05/22/2025 - 9:54am
Astronomers have observed a massive and extremely active barred spiral galaxy in the early Universe and found that it has important similarities and differences with modern galaxies. This improves our understanding of how barred spiral galaxies, like our own Milky Way Galaxy, grow and evolve.
Categories: Science

Mathematical prediction of seismic wave propagation in magma containing crystals and bubbles

Matter and energy from Science Daily Feed - Thu, 05/22/2025 - 9:54am
Researchers have mathematically elucidated how the presence of crystals and gas bubbles in magma affects the propagation of seismic P-waves. A novel equation was derived to describe the travel of these waves through magma, demonstrating how varying proportions of crystals and bubbles influence wave velocity and waveform characteristics.
Categories: Science

Developing a pressure-induced water producing material

Matter and energy from Science Daily Feed - Thu, 05/22/2025 - 9:53am
Researchers have discovered a phenomenon -- applying pressure to a copper-chromium Prussian blue analog, which is a compound featuring crystal voids, causes the discharge of water retained within these voids. This material is expected to serve as a novel onsite water production platform for extraction of water solely through pressure application, without temperature or humidity control, even in arid regions.
Categories: Science

Saturn's moon: Mysterious wobbling atmosphere like a gyroscope

Space and time from Science Daily Feed - Thu, 05/22/2025 - 9:52am
The puzzling behavior of Titan's atmosphere has been revealed. The team has shown that the thick, hazy atmosphere of Saturn's largest moon doesn't spin in line with its surface, but instead wobbles like a gyroscope, shifting with the seasons.
Categories: Science

'Green' ammonia powered by sunlight

Matter and energy from Science Daily Feed - Thu, 05/22/2025 - 9:51am
Ammonia is a chemical essential to many agricultural and industrial processes, but its mode of production comes with an incredibly high energy cost. Various attempts have been, and are being, made to produce ammonia more efficiently. For the first time, a group has combined atmospheric nitrogen, water and sunlight, and using two catalysts, produced sizable quantities of ammonia without a high energy cost. Their processes mirror natural processes found in plants utilizing symbiotic bacteria.
Categories: Science

How property owners can work to prevent flooding

Matter and energy from Science Daily Feed - Thu, 05/22/2025 - 9:48am
The risk of heavy rainfall and severe flooding increases with climate change. But property owners -- regardless of size -- often underestimate their own responsibility and are unaware of what preventive measures they can take themselves.
Categories: Science

Breakthrough AI model could transform how we prepare for natural disasters

Computers and Math from Science Daily Feed - Thu, 05/22/2025 - 9:48am
From deadly floods in Europe to intensifying tropical cyclones around the world, the climate crisis has made timely and precise forecasting more essential than ever. Yet traditional forecasting methods rely on highly complex numerical models developed over decades, requiring powerful supercomputers and large teams of experts. According to its developers, Aurora offers a powerful and efficient alternative using artificial intelligence.
Categories: Science

Could AI understand emotions better than we do?

Computers and Math from Science Daily Feed - Thu, 05/22/2025 - 9:47am
Is artificial intelligence (AI) capable of suggesting appropriate behavior in emotionally charged situations? A team put six generative AIs -- including ChatGPT -- to the test using emotional intelligence (EI) assessments typically designed for humans. The outcome: these AIs outperformed average human performance and were even able to generate new tests in record time. These findings open up new possibilities for AI in education, coaching, and conflict management.
Categories: Science

Subscribe to The Jefferson Center aggregator