cff-version: 1.2.0 abstract: "

The choice between Student’s t-test (IT) and Welch’s t-test (WT) represents a central debate in statistical practice. This paper provides a re-evaluation of their performance by examining the combined effects of unequal variances, skewed distributions, and disparate sample sizes. For normal distributions, we confirm that the WT maintains the false positive rate close to the nominal level when sample sizes and standard deviations are unequal. However, the WT was found to yield inflated false positive rates under skewed distributions with unequal sample sizes. A complementary empirical study based on gender differences in two psychological scales corroborated these findings. Finally, we contend that the null hypothesis of unequal variances together with equal means is often implausible to begin with, and that empirically, a difference in means typically coincides with differences in variance and skewness. An additional analysis using the Kolmogorov-Smirnov and Anderson-Darling tests shows that examining entire distributions, rather than just their means, can provide a suitable alternative when facing unequal variances or skewed distributions. Given these results, researchers should remain cautious with software defaults favoring Welch’s test.

" authors: - family-names: de Winter given-names: Joost orcid: "https://orcid.org/0000-0002-1281-8200" title: "Supplementary data for the paper "Why psychologists should not default to Welch’s t-test instead of Student’s t-test (and why the Anderson–Darling test is an underused alternative)"" keywords: version: 5 identifiers: - type: doi value: 10.4121/e8e6861a-7ab0-4b6d-bd67-5f95029322c5.v5 license: CC BY 4.0 date-released: 2025-07-20