TY - JOUR
T1 - Many Labs 2: Investigating Variation in Replicability Across Samples and Settings
AU - Many Labs 2 Collaboration
AU - Klein, Richard A.
AU - Vianello, Michelangelo
AU - Hasselman, Fred
AU - Adams, Byron G.
AU - Adams, Reginald B.
AU - Alper, Sinan
AU - Aveyard, Mark
AU - Axt, Jordan R.
AU - Babalola, Mayowa T.
AU - Bahník, Štěpán
AU - Batra, Rishtee
AU - Berkics, Mihály
AU - Bernstein, Michael J.
AU - Berry, Daniel R.
AU - Bialobrzeska, Olga
AU - Binan, Evans Dami
AU - Bocian, Konrad
AU - Brandt, Mark J.
AU - Busching, Robert
AU - Rédei, Anna Cabak
AU - Cai, Huajian
AU - Cambier, Fanny
AU - Cantarero, Katarzyna
AU - Carmichael, Cheryl L.
AU - Ceric, Francisco
AU - Chandler, Jesse
AU - Chang, Jen-ho
AU - Chatard, Armand
AU - Chen, Eva E.
AU - Cheong, Winnee
AU - Cicero, David C.
AU - Coen, Sharon
AU - Coleman, Jennifer A.
AU - Collisson, Brian
AU - Conway, Morgan A.
AU - Corker, Katherine S.
AU - Curran, Paul G.
AU - Cushman, Fiery
AU - Dagona, Zubairu K.
AU - Dalgar, Ilker
AU - Dalla Rosa, Anna
AU - Davis, William E.
AU - De Bruijn, Maaike
AU - De Schutter, Leander
AU - Devos, Thierry
AU - De Vries, Marieke
AU - Doğulu, Canay
AU - Dozo, Nerisa
AU - Dukes, Kristin Nicole
AU - Kovacs, Carrie
N1 - Funding Information:
This research was supported by the Center for Open Science and by a grant through the Association for Psychological Science from the Laura and John Arnold Foundation. The research in Chile was supported by Fondap Grant 15130009 from the Center for Social Conflict and Cohesion Studies.
Publisher Copyright:
© The Author(s) 2018.
PY - 2018/12/1
Y1 - 2018/12/1
N2 - We conducted preregistered replications of 28 classic and contemporary published findings, with protocols that were peer reviewed in advance, to examine variation in effect magnitudes across samples and settings. Each protocol was administered to approximately half of 125 samples that comprised 15,305 participants from 36 countries and territories. Using the conventional criterion of statistical significance (p <.05), we found that 15 (54%) of the replications provided evidence of a statistically significant effect in the same direction as the original finding. With a strict significance criterion (p <.0001), 14 (50%) of the replications still provided such evidence, a reflection of the extremely highpowered design. Seven (25%) of the replications yielded effect sizes larger than the original ones, and 21 (75%) yielded effect sizes smaller than the original ones. The median comparable Cohen’s ds were 0.60 for the original findings and 0.15 for the replications. The effect sizes were small (< 0.20) in 16 of the replications (57%), and 9 effects (32%) were in the direction opposite the direction of the original effect. Across settings, the Q statistic indicated significant heterogeneity in 11 (39%) of the replication effects, and most of those were among the findings with the largest overall effect sizes; only 1 effect that was near zero in the aggregate showed significant heterogeneity according to this measure. Only 1 effect had a tau value greater than.20, an indication of moderate heterogeneity. Eight others had tau values near or slightly above.10, an indication of slight heterogeneity. Moderation tests indicated that very little heterogeneity was attributable to the order in which the tasks were performed or whether the tasks were administered in lab versus online. Exploratory comparisons revealed little heterogeneity between Western, educated, industrialized, rich, and democratic (WEIRD) cultures and less WEIRD cultures (i.e., cultures with relatively high and low WEIRDness scores, respectively). Cumulatively, variability in the observed effect sizes was attributable more to the effect being studied than to the sample or setting in which it was studied.
AB - We conducted preregistered replications of 28 classic and contemporary published findings, with protocols that were peer reviewed in advance, to examine variation in effect magnitudes across samples and settings. Each protocol was administered to approximately half of 125 samples that comprised 15,305 participants from 36 countries and territories. Using the conventional criterion of statistical significance (p <.05), we found that 15 (54%) of the replications provided evidence of a statistically significant effect in the same direction as the original finding. With a strict significance criterion (p <.0001), 14 (50%) of the replications still provided such evidence, a reflection of the extremely highpowered design. Seven (25%) of the replications yielded effect sizes larger than the original ones, and 21 (75%) yielded effect sizes smaller than the original ones. The median comparable Cohen’s ds were 0.60 for the original findings and 0.15 for the replications. The effect sizes were small (< 0.20) in 16 of the replications (57%), and 9 effects (32%) were in the direction opposite the direction of the original effect. Across settings, the Q statistic indicated significant heterogeneity in 11 (39%) of the replication effects, and most of those were among the findings with the largest overall effect sizes; only 1 effect that was near zero in the aggregate showed significant heterogeneity according to this measure. Only 1 effect had a tau value greater than.20, an indication of moderate heterogeneity. Eight others had tau values near or slightly above.10, an indication of slight heterogeneity. Moderation tests indicated that very little heterogeneity was attributable to the order in which the tasks were performed or whether the tasks were administered in lab versus online. Exploratory comparisons revealed little heterogeneity between Western, educated, industrialized, rich, and democratic (WEIRD) cultures and less WEIRD cultures (i.e., cultures with relatively high and low WEIRDness scores, respectively). Cumulatively, variability in the observed effect sizes was attributable more to the effect being studied than to the sample or setting in which it was studied.
KW - Cognitive psychology
KW - Culture
KW - Individual differences
KW - Meta-analysis
KW - Open data
KW - Open materials
KW - Preregistered
KW - Registered Report
KW - Replication
KW - Sampling effects
KW - Situational effects
KW - Social psychology
UR - http://www.scopus.com/inward/record.url?scp=85116789185&partnerID=8YFLogxK
U2 - 10.1177/2515245918810225
DO - 10.1177/2515245918810225
M3 - Article
SN - 2515-2459
VL - 1
SP - 443
EP - 490
JO - Advances in Methods and Practices in Psychological Science
JF - Advances in Methods and Practices in Psychological Science
IS - 4
ER -