Communications in Kinesiology
A Survey on the Attitudes Towards and Perception of Reproducibility and Replicability in Sports and Exercise Science
Authors: Jennifer Murphy, Cristian Mesquida, Joe P. Warne
Editor: David Borg
DOI: 10.51224/cik.2023.53
Last Updated: May 10, 2023
Abstract
There are formal calls for increased reproducibility and replicability in sports and exercise science, yet there is minimal information on the overall knowledge of these concepts at a field-wide level. Therefore, we conducted a survey on the attitudes and perceptions of sports and exercise science researchers towards reproducibility and replicability. Descriptive statistics (e.g., proportion of responses) and thematic analysis were used to characterize the responses. Of the 511 respondents, 42% (n = 217) believe there is a significant crisis of reproducibility or replicability in sports and exercise science, while 36% (n = 182) believe there is a slight crisis; 3% (n = 15) believe there is no crisis, and 19% (n = 95) did not know. Four themes were generated in the thematic analysis: the research and publishing culture, educational barriers to research integrity, research responsibility to ensure reproducibility and replicability, and current practices facilitating reproducibility and replicability. Researchers believe that engaging in open science can be detrimental to career opportunities due to a lack of incentives. They also feel journals are a barrier to reproducible and replicable research due to high publication charges and a focus on novelty. Statistical expertise was identified as a key factor for improving reproducibility and replicability in the future, particularly a better understanding of study design and of different statistical techniques. Statistical education should be prioritized for early career researchers, which could positively affect publication and peer review. Researchers must accept responsibility for reproducibility and replicability through thorough project design, appropriate planning of analyses, and transparent reporting practices.
1 Introduction
The concept of replication has recently gained attention in psychology due to failures to replicate studies (Klein et al., 2014; Open Science Collaboration, 2015). Attention has also spread to other fields such as social science (Camerer et al., 2018), economics (Camerer et al., 2016), and cancer biology (Errington et al., 2021), where similar large replication projects suggested a crisis of confidence in research findings (Pashler & Wagenmakers, 2012). This replication crisis led to discussions around the replicability, reproducibility[1], and transparency of research practices, which helped inspire the open science movement (Munafò et al., 2017).
The replication crisis was met with mixed reactions. Those in favor of replication studies believe they can increase (or decrease) confidence in research findings, update the boundaries of findings (i.e., their external validity) (Nosek & Errington, 2020), identify type I errors, and control for sampling error (Schmidt, 2009). However, there are arguments that concerns regarding replication are overblown, as replicability is not an ideal for all disciplines in science and cannot be universally applied (Guttinger, 2020). Others believe it is misguided, and a waste of valuable resources, to undertake large replication efforts (Eufemia et al., 2018; Stroebe & Strack, 2014).
Due to the contrasting views on the value of replication and reproducibility, a Nature survey explored the opinions of researchers in different fields (Baker, 2016). Of 1576 researchers, 52% believed there was a significant reproducibility crisis and 38% believed there was a slight crisis in science. A similar survey of psychologists was conducted to understand the community's opinion on the importance of replication (Buttliere & Wicherts, 2018); results showed the community viewed replications as an essential aspect of the research process to determine "what effects are real". Although replication is "one of the most obvious ingredients of science" (Schmidt, 2009, p. 91), it is not the norm across all scientific disciplines, causing a period of unrest amongst those who advocate for it.
The issues of replication have yet to be examined in sports and exercise science, despite several publications identifying methodological and statistical concerns and advocating for increased replication studies within the discipline (Caldwell et al., 2020; Halperin et al., 2015; Heneghan et al., 2012; Knudson, 2017). Some single-study replication attempts have been published in the field (Chalmers et al., 2018; Morin et al., 2019; Pitsch & Emrich, 2011), and there is an ongoing large replication project (Murphy et al., 2022). Additionally, research groups have been formed to improve the way research is conducted in the field (e.g., STORK, the Society for Transparency, Openness and Replication in Kinesiology). Yet, as replication has not grasped the attention of sports and exercise science the way it has other fields (e.g., psychology, social science, cancer biology and economics), there is limited field-wide discussion of the concept. Consequently, there is no understanding to date of the attitudes towards, and perception of, reproducibility and replication in sports and exercise science. It is therefore difficult to gauge how accepting sports and exercise science researchers are of reproducibility and replicability at a field-wide level and, if they are opposed, the reasons for reluctance to embrace change.
Publishers and journals are accused of prioritizing novel findings over replication studies for higher impact and to increase their journal metrics (Chambers et al., 2014; Nosek et al., 2012), which detracts from replication efforts. Replication is also considered by some to be a less creative, inferior method of research (Makel & Plucker, 2014). Other researchers are opposed to replication because they feel it is a personal attack on their work and a "hostile action" (Nosek et al., 2022, p. 20). Thus, it is crucial to understand the barriers to the open science movement, particularly replication, for this field, as this movement is affecting all areas of science. By identifying the barriers to undertaking replication, changes can be implemented to incentivize researchers to adapt their methods and improve research practices. This information is essential to facilitate an increased number of replication studies, build awareness of current practices, and increase collaboration and transparency amongst researchers and statisticians alike (Caldwell et al., 2020; Sainani et al., 2020).
The purpose of this survey is to explore the attitudes and perceptions of researchers towards reproducibility and replicability in the field of sports and exercise science by adapting the established Nature survey (Baker, 2016). The objectives of this study are to understand the community's awareness of the terms reproducibility and replicability and its attitudes towards these concepts, and to identify potential barriers to reproducibility and replicability in sports and exercise science.
2 Methods
2.1 Recruitment Strategy
To be included in this study, participants had to be active researchers; therefore, the sample was limited to researchers who had published in a sports and exercise science journal in the 5 years prior to survey distribution (2016-2021). As per the preregistration, we aimed for a final sample size close to 2000, based on surveys similar to ours (Baker, 2016; Buttliere & Wicherts, 2018; Ross-Hellauer et al., 2017). All participants were informed through the survey website that the survey was anonymous and voluntary, and that the study results and underlying data would be published. Participants provided consent using a digital informed consent form completed prior to beginning the survey.
2.2 Participants
There were 511 responses to the survey, representing a response rate of 2.7%. Regarding demographics, 38% of respondents were from North America, 37% from Europe, 12% from Australasia, 6% from Asia, 5% from South America, and 2% from Africa. 31% of respondents were aged 25 to 34 years, 36% were 35 to 44 years, and 18% were 45 to 54 years. Most respondents selected "Associate Professor" as their main job role (27%), followed by "Professor" (21%), "Post-doctoral Fellow" (10%), and "PhD student" (8%).
2.3 Preregistration Deviation
We originally planned to contact 10,000 sports and exercise science researchers via a mailing list of corresponding authors who had published in sports and exercise science journals according to the Web of Science research database (www.webofknowledge.com). However, we deviated from the preregistration due to very low response rates and contacted a total of 23,690 researchers instead. Emails were sent between May and July 2021, and 18,854 were delivered. The undelivered emails (N = 4836) were due to researchers moving institutions, university spam filters, and other unknown reasons. We hypothesize that the low response rate could be a result of the survey length (mean completion time = 68:21 minutes), the time of distribution (summer/university holidays), and the absence of a follow-up reminder.
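For transparency, the 2.7% response rate reported in the Participants section follows from the number of delivered emails rather than the total number contacted:

$$\text{response rate} = \frac{511}{18{,}854} \times 100\% \approx 2.7\%$$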
2.4 Experimental Design
The survey was adapted from a previously published Nature survey that explored scientists' opinions on reproducibility in their own and other fields (Baker, 2016). Minor adaptations included the addition of questions relating to replication alongside those already focused on reproducibility. Questions were adapted to be specific to sports and exercise science (e.g., "In the field of sports and exercise science…"). The survey included 20 short sections and 45 questions with a focus on: familiarity with the terminology; perception of the reproducibility/replication crisis; the proportion of published results that are reproducible or replicable; funder and publisher efforts to improve reproducibility and replicability; established procedures for reproducibility and replicability, and the impact of these on the laboratory; barriers to reproducibility and replicability; contributory factors in a failure to reproduce or replicate results; and factors that would improve reproducibility and replicability. The following definitions were provided in the survey: reproducibility is retesting a claim using the same analyses and same data, whereas replication is retesting a claim using the same analyses and new data (Nosek & Errington, 2020).
Both multiple choice answers and free text boxes were used in the survey. We included open text boxes to capture opinions on reproducibility and replication that multiple choice questions potentially missed. Question skip logic was applied so participants did not have to respond to a question where the answer to the previous question made it irrelevant. The survey is available in full online along with the data, R code and supplementary materials (https://doi.org/10.17605/OSF.IO/64R8M). The preregistration is also available online (https://doi.org/10.17605/OSF.IO/EXK6N). Ethical approval was granted by Technological University Dublin (REC-PGRI-202021).
2.5 Quantitative Data Management and Statistical Analysis
The final analysis included survey responses that were fully completed and for which digital consent was received. Data were collected via encrypted, password-protected online survey software, Microsoft Forms (version 16.63.1; Microsoft Office, Mountain View, CA, USA). There were 10 sections with free-text data consisting of brief sentences in response to the open-ended questions. These responses were transferred to a Microsoft Excel spreadsheet (version 16.63.1; Microsoft Office, Mountain View, CA, USA). Descriptive statistics were computed for the categorical data (e.g., proportion of responses) using R (version 4.2.1; R Core Team, 2022).
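As an illustration only (the authors' actual analysis code is available on the OSF project page), a summary of this kind can be produced in a few lines of base R; the file and column names below are hypothetical:

```r
# Hypothetical sketch of the descriptive summary; the published R code on the
# OSF project page is the authoritative version. File and column names here
# are invented for illustration.
responses <- read.csv("survey_responses.csv")  # assumed export of the survey data

# Counts and percentages for one categorical item, e.g. perceived crisis
crisis_counts  <- table(responses$crisis_perception)
crisis_percent <- round(100 * prop.table(crisis_counts), 1)

data.frame(n = as.vector(crisis_counts),
           percent = as.vector(crisis_percent),
           row.names = names(crisis_counts))
```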
2.6 Thematic Analysis Approach
The research question for this study was addressed using a reflexive thematic analysis approach. This approach involves the researcher's "reflective and thoughtful engagement with their data and their reflexive and thoughtful engagement with the analytic process" (Braun & Clarke, 2019, p. 594). As we analysed the data with our aim in mind, the themes are strongly related to the research question and were driven by the researchers' theoretical interest. This is indicative of a deductive analysis; however, inductive analysis was also employed to ensure full interpretation of the data content. Using this type of analysis, responses were open coded to best represent meaning from the participants, and a pre-specified coding book was not used.
Semantic coding was initially used to identify themes through engagement with the surface meaning of the data; key words and phrases were highlighted on hard copies of the transcripts. However, our coding approach was not exclusively semantic, as we also interpreted the meaning underpinning participants' responses in subsequent readings of the data (i.e., latent coding) (Braun & Clarke, 2019). The codes and their corresponding data extracts were then organised into "theme piles" (Braun & Clarke, 2006) and subsequently revised and developed. Where codes were organised around recurring patterns, sub-themes were formed. These sub-themes were then linked to one another and grouped to form major themes. Our last step was to collate the data extracts in tables with their corresponding sub-themes and themes. Data extracts presented in the results were selected for clarity of theme representation, but the full dataset is available on the OSF project page.
3 Results
Of the 511 respondents, 47% (n = 239) were "very familiar" and 39% (n = 200) were "fairly familiar" with the term reproducibility, while 30% (n = 152) were "very familiar" and 35% (n = 181) were "fairly familiar" with the term replicability. Over three-quarters (78.1%) of respondents believe there is a replication and reproducibility crisis in sports and exercise science (Figure 1).
When responding to a question asking whether they had encountered barriers to implementing changes that would improve reproducibility and replicability in the laboratory, 37% of respondents (n = 189) identified barriers, 42% (n = 217) did not, and 20% (n = 102) were unsure. Furthermore, when answering a question on the factors that contribute to a study failing to replicate, respondents believed poor experimental design, insufficient mentoring, publishing pressure, and selective reporting were among the highest contributing factors (Figure 2). In Figure 2, the number to the left of each bar indicates the percentage of participants who responded "always contributes", "very often contributes", or "sometimes contributes", while the number on the right indicates the percentage who responded "rarely contributes" or "never contributes". The grey center of each bar indicates those who responded "I don't know". Statements are ordered according to the total percentage of agreement.
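To make the bar construction concrete, the sketch below shows one way the three percentages per statement could be derived from raw Likert counts; the data frame, statements, and counts are invented for illustration and are not the survey data:

```r
# Illustrative aggregation only: collapse the Likert options into the three
# bar segments described above. Counts are invented, not the survey data;
# 511 is the total number of respondents.
library(dplyr)

likert <- data.frame(
  statement = rep(c("Poor experimental design", "Publishing pressure"), each = 6),
  option    = rep(c("Always", "Very often", "Sometimes",
                    "Rarely", "Never", "Don't know"), times = 2),
  n         = c(120, 180, 130, 40, 21, 20,
                100, 200, 120, 50, 21, 20)
)

likert %>%
  mutate(segment = case_when(
    option %in% c("Always", "Very often", "Sometimes") ~ "contributes",
    option %in% c("Rarely", "Never")                   ~ "does not contribute",
    TRUE                                               ~ "don't know"
  )) %>%
  group_by(statement, segment) %>%
  summarise(percent = round(100 * sum(n) / 511, 1), .groups = "drop")
```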
3.1 Thematic Analysis Results
Four key themes were generated from the data after the thematic analysis was applied (Supplementary Tables 1-4). They were the following: the research and publishing culture, educational barriers to research integrity, research responsibility to ensure reproducibility and replicability, and current practices facilitating reproducibility and replicability in the field. A summary of the results is presented below, and the tables include selected quotes and information directly from the respondents for clarity.
3.1.1 Key Theme 1: The Research and Publishing Culture
Under the main theme of the research and publishing culture (Supplementary Table 1), three recurring sub-themes were identified as barriers to replication: incentives for undertaking replication research, the priority of novel research, and the business model of publishing. Survey respondents believe that engaging in open science, or conducting replication studies, will be detrimental to career progression due to a lack of incentives. Sports and exercise science researchers feel pressured to produce a high quantity of research studies due to a high level of competition for career and funding opportunities. Furthermore, according to respondents, novel research is prioritized over studies that are methodologically sound, and this is exacerbated by journal bias.
Journals were described as a barrier to reproducible research by actively promoting the file drawer issue, as they often reject research that is not considered novel or is non-significant. Researchers also expect to be "criticized" for publishing replication studies and feel there is no value placed on them, especially in higher-ranked or prestigious journals (i.e., quartile 1 journals). Additionally, researchers feel that journals are a barrier to reproducible research as scientific publishing is now a billion-dollar business. Lastly, they believe publishers are often profit focused and that publication fees further exacerbate the file drawer problem, as unfunded researchers will simply not publish.
3.1.2 Key Theme 2: Educational Barriers to Research Integrity
Under the main theme of educational barriers to research integrity (Supplementary Table 2), there were two recurring sub-themes: the quality of peer review, and the statistical expertise and knowledge of researchers. There were mixed views on the role of peer review in upholding the values of research integrity, yet there was agreement on the importance of statistical knowledge for peer reviewers. Respondents identified that greater scrutiny of study design is needed from peer reviewers. However, the lack of a formalized education process or screening for peer reviewers has left some reviewers unable to assess poor analyses, identify a lack of controls, or recognize bias. Statistical expertise was a clear recurring theme throughout many responses, specifically researchers' statistical education. Many researchers feel that a better understanding of study design, and of the different statistical techniques used to analyse data, would improve reproducibility and replicability within the field. Errors in data management and in the application of statistical techniques were discussed as common factors affecting the reproducibility and replicability of this field.
3.1.3 Key Theme 3: Research Responsibility to Ensure Reproducibility and Replicability
Under the main theme of research responsibility to ensure reproducibility and replicability (Supplementary Table 3), there were three recurring sub-themes: journal responsibility, researcher responsibility, and senior researcher/supervisor responsibility. The ownership of responsibility to ensure reproducibility and replicability in the research process was heavily debated in the responses.
Some believe journals are responsible for promoting transparency; there should be basic criteria for sample size justification, reporting, and analysis, and flexibility with journal article length would be helpful. As we move into a more digital era, researchers appear frustrated with the lack of a corresponding increase in page limits, which would decrease the selective reporting of results. Journals can facilitate and encourage open science practices via author guidelines and the types of publications they request (i.e., replication studies), and they can enforce reporting criteria for readers and authors. Essentially, they have an opportunity to be leaders in implementing policies; they should be fostering change rather than just policing. On the other hand, some researchers feel that journals have too much research power; they should have a smaller role rather than act as gatekeepers in science.
Some respondents believe that the responsibility for ensuring reproducibility and replicability should be with the researchers. Publication is the last stage of the research process, so it is the researcher’s responsibility to maximise transparency in their reporting practices and appropriately design their studies. Finally, supervisors were specifically identified as having a responsibility to promote open science practices for reproducible and replicable research with early career researchers and students. The promotion of these practices by the supervisors appears to determine the engagement of other researchers within the laboratory or research group according to respondents.
Researchers in the field also believe that individuals overestimate the level of statistical expertise they have. Some theorize that this applies to both early career researchers and supervisors. Supervisors also have an important role as mentors and should educate themselves, and their students, on the importance of reproducibility and replicability. Respondents believe more collaboration with statisticians and data analysts would be helpful to improve their own knowledge and account for any shortfalls in their knowledge that could affect research transparency and quality.
3.1.4 Key Theme 4: Current Practices Facilitating Reproducibility and Replicability in the Field
Under the main theme of current practices facilitating reproducibility and replicability in the field (Supplementary Table 4), there were two recurring sub-themes: data sharing and checklist usage. There appear to be mixed views on open data and data sharing among researchers in the field; journals are encouraging data sharing, which is deemed positive, but there is little enforcement or standardization of this. Many respondents have concerns about data sharing; there are potential career disadvantages to forcing all data and code to be shared, for example, some authors fear being "scooped". Secondly, for the author, open datasets are time consuming because they must be organised in a readable format.
Finally, respondents believe it is difficult to ascertain whether data badges and sharing are having a positive effect and are therefore unsure whether they are worthwhile. There is also a general sense of frustration with the use of checklists when submitting research for publication. Respondents feel they are currently too generic, applied inconsistently and without rationale, and frequently ignored during the peer review process. Some researchers feel they should be compulsory, and that a study should not be published if the checklists are not followed appropriately. In contrast, many respondents declared they should be banned altogether.
Lastly, and although not specifically linked to the themes identified in the thematic analysis, there were multiple comments regarding the attitude towards open science as a movement (Supplementary Table 5). Some respondents believe a few open science advocates are actively trying to discredit other researchers’ work or specifically targeting research groups. Others reported the negative perception around failed replication studies discourages them from attempting replication.
4 Discussion
The overall aim of this study was to determine the attitudes towards, and perception of, reproducibility and replicability among sports and exercise science researchers. Survey results showed more than three-quarters of respondents believe there is a crisis of reproducibility and replicability in the field, with 42.5% believing this crisis is significant. These concerns are lower than those reported by Baker (2016), where 90% of researchers across different scientific disciplines acknowledged the existence of a reproducibility crisis. We expect this lower rate of concern reflects the minimal discourse on replication in sports and exercise science. Additionally, a potential naive belief that science is functioning well in the field, despite the concerns identified by some researchers, could have contributed to this lower rate. Four key themes were generated in the thematic analysis: the research and publishing culture, educational barriers to research integrity, research responsibility to ensure reproducibility and replicability, and current practices facilitating reproducibility and replicability, which we have interpreted and grouped in the results. The remainder of this section discusses the context and implications of these thematic areas, as well as suggestions for future practices.
As identified in the theme of the research and publishing culture, researchers feel that sports and exercise science is currently under siege from competition, commercialization, and metrics. These forces create a research culture that is largely driven by career incentives and novel research (Chambers et al., 2014; Nosek et al., 2012; Smaldino & McElreath, 2016). The pressure to publish is exacerbated by competition within academia; more PhDs are being produced by the world's universities than there are permanent academic positions (Powell, 2015). Publications influence hiring, promotion, and grant decisions as markers of achievement (Fanelli et al., 2017); consequently, the publication process is negatively perceived by some researchers due to overwhelming academic pressure (de Vrieze, 2021).
Academic pressures are similarly apparent in sports and exercise science, as "pressure to publish" was identified as one of the highest contributing factors towards a failure to replicate or reproduce findings in our survey (Figure 2). This is unsurprising given that survey respondents feel pressure to produce a large quantity of research output, potentially without regard for its quality or transparency, to keep up with their peers. Furthermore, 62.8% of clinical cancer researchers admit publishing pressure influences their reporting, while 23% believe selective reporting or manipulating data was necessary to prove a hypothesis (Boulbes et al., 2018). One could argue that our field suffers from the same pressures, and we may have a crisis of incentives on our hands.
Sports and exercise science researchers reported they are disincentivized from undertaking replication studies due to the priority of novel research and the belief that replications lack creativity ("we know that reviewers are seeking novelty in the work, and I would expect to be criticized if I submitted a replication study"). This finding is similar to other fields (Nosek et al., 2012); therefore, these researchers are as much victims as they are facilitators of poor scientific behaviors. They are incentivized to engage in poor, or potentially dishonest, practices (John et al., 2012) simply because of the trade-off between quantity and quality in sports and exercise science, of which quantity is winning (Allen & Mehler, 2019) ("I believe that academia pushes for greater scientific output at the cost of its quality"). There needs to be a change in culture for individuals.
A healthy research culture, one that rewards quality rather than publication volume, would improve replicability and reproducibility within the field. These are not simple changes; they require structural change at the cultural, university, and publishing levels. However, achievable changes can be made in the short term that will set the foundations for an improved research culture in the future. Examples include organizing a journal club to discuss open science practices, preregistering studies, adopting preprints, and using a dedicated and transparent project workflow system. The adoption of open science can be overwhelming as it has many different facets, but Kathawalla et al. (2021) created a helpful guide to assist students and advisers with their journeys into open science. The currently accepted norms of pressure to publish will continue until the incentive structure changes within the field.
For researchers, there is a temptation to produce and prioritize work that is novel for career success (Chambers et al., 2014). Novel or impressive findings are a primary goal of the current academic culture (Bernards et al., 2017). This is evident in the 2500% increase in the frequency of words such as "innovative", "novel" and "ground-breaking" in abstracts of PubMed articles from 1974 to 2014 (Vinkers et al., 2015). The demand for novel research is also apparent in our field, and it instills a need for researchers to produce statistically significant findings. According to our survey respondents, selective reporting of novel or positive results was one of the highest contributing factors towards a failure to reproduce or replicate studies (Figure 2). This is supported by the implausibly high positive result rate of 81% across 300 studies in three flagship sports and exercise science journals (Twomey et al., 2021). Similarly, a positive result rate of 82% was reported for four high-impact sports medicine and physiotherapy journals (Büttner et al., 2020). Many clinical cancer researchers (47%) also felt pressured to produce a "positive" result by a collaborator (Boulbes et al., 2018), and based on our survey responses, this proportion could be higher in our field.
Non-significant or "less exciting" results are often shunned by journals due to lower citation rates (Fanelli et al., 2017). A consequence is that sports and exercise science researchers are possibly disinclined to submit these types of results for publication, and they are relegated to the "file drawer" (Rosenthal, 1979). Significant, novel findings are therefore "worthy" of publication, while null or less exciting results will not be observed by the scientific community ("I have had papers rejected on the basis that the results weren't positive or significant. We all have."). Journals perpetuate the problem by prioritizing novel findings. Publication bias can alarmingly distort the proportion of true effects in the body of literature, rendering many study findings non-replicable.
The crucial step of verification or replication is rarely taken in sports and exercise science, while journals continue to breed poor scientific behaviors (Chambers et al., 2014). However, changes are ongoing to prevent the selective reporting of results, as Registered Reports are now offered as a publishing format (Chambers et al., 2014). Registered Reports undergo two rounds of peer review, before and after data collection, so that a manuscript can receive in-principle acceptance before any results are obtained. Although this format is offered by many journals (see cos.io/rr), it is only beginning to be offered by sports and exercise science journals (Abt et al., 2021; Impellizzeri et al., 2019). Sports and exercise science must make a collective effort, where possible, to support journals that promote open practices and guidelines rather than focusing on profit or on the impact factor, a controversial metric (Heathers, 2022). This may be easier for those who have more career security (e.g., tenured researchers), and leadership on this issue from these more senior researchers would greatly improve the adoption of better publishing practices.
Statistical education was a key recurring theme throughout the thematic analysis and is supported by the quantitative results, as respondents selected poor experimental design, inadequate mentoring, low statistical power, and mistakes as contributing factors towards a failure to replicate. Statistical and methodological errors are prevalent in sports and exercise science (Borg, Lohse, et al., 2020; Knudson, 2017; Nielsen et al., 2017), and the use of controversial statistical methods has even resulted in mainstream media criticism (Aschwanden & Nguyen, 2018; Sainani et al., 2019). Consequently, some researchers advocate for increased collaboration with statisticians within the field, and we echo those calls (Sainani et al., 2020; Sainani & Chamari, 2022). This recommendation requires a shift in cultural norms, but perhaps larger structural changes are required for the long-term health of the sports and exercise science academic system. A redirection of attention to the impact of open science practices on students could be instrumental for the future of our field (Pownall et al., 2022). The introduction of preregistration was perceived as a helpful planning tool in the education of undergraduate psychology students and could promote best research practices, thereby reducing questionable research practices (Blincoe & Buchert, 2019). Similarly, replication studies could be encouraged as part of student projects; examples include the Hagen Cumulative Science Project (Jekel et al., 2019) and the Collaborative Replications and Education Project (Wagge et al., 2019).
When replication studies are integrated as part of academic training, students report an increased understanding of the research process and increased confidence with statistical methods, and find the overall experience quite positive (Smith et al., 2021; Stojmenovska et al., 2019). The incorporation of reproducible and replicable practices by early career researchers could improve the outlook of sports and exercise science by positively influencing the accuracy of reporting, which respondents identified as problematic for research quality ("I think many times researchers believe that they know more about research than they do, making serious errors in methodology, using the wrong statistical tests, or not having clear objectives that they know how to accomplish."). Prioritization of statistical education may also have a positive impact on peer review when early career researchers eventually assume the reviewer role. The sports and exercise science field will therefore reap the rewards of an investment in better statistical education in the future.
There were mixed views on the responsibility of sports and exercise science journals for ensuring reproducibility and replicability. Some respondents believe journals should promote reproducibility and replicability ("Journals can certainly facilitate good open science practices among academics"), while others believe researchers are responsible ("I think the researchers should own and drive it"). Reporting guidelines and checklists were introduced by journals over a decade ago (Atkinson et al., 2008), although they do not appear to be used frequently (Twomey et al., 2021), even though their use has been shown to increase the quality of reporting in medical journals (Turner et al., 2012). The Transparency and Openness Promotion (TOP) guidelines were created by the Center for Open Science to enhance journal transparency (Nosek et al., 2015). The mean TOP factor (https://osf.io/t2yu5/) for 38 sports and exercise science journals was 2.05 ± 1.99 out of 27 for engagement with openness and transparency (Hansford et al., 2022). This low score demonstrates an opportunity for these journals to review their open science policies and implement changes to increase transparency and move the sports and exercise science field forward. There was a clear consensus in the responses that journals act almost as sole gatekeepers in science, as they hold a large proportion of research responsibility yet frequently reject replication studies ("The journals wield a double-edged sword when it comes to replication and reproducibility").
We, as sports and exercise science researchers, need to assume responsibility for our study designs rather than expecting improvements to be suggested during the peer review process. Peer review is not designed to verify findings; that expectation is too much for a voluntary role (Mellor, 2021). Even if it were, verification is only possible if the data and code are shared. As this is not the norm in sports and exercise science (Borg, Lohse, et al., 2020), peer reviewers are limited to reviewing claims based on the limited information provided in the manuscript. We suggest spending more time and attention on our study designs (Mesquida et al., 2022; Scheel et al., 2020), undertaking preregistration and specifying our hypotheses[2], and collaborating with statisticians to improve our statistical inferences (Sainani et al., 2020). Essentially, we must assume responsibility for reproducibility and replicability ourselves, as opposed to offloading the responsibility elsewhere (i.e., onto peer reviewers).
Like reporting guidelines and checklists, data sharing guidelines are present in many sports and exercise science journals. Although data sharing would facilitate reproducibility and replicability, the guidelines are not often enforced according to survey respondents ("The implementation of and adherence to checklists and standards is very haphazard"). Of 300 sports and exercise science articles, only 2.33% had a data accessibility statement, while 0.67% reported open data or code (Twomey et al., 2021). In a similar analysis of 299 sports and exercise science studies, only 4.3% shared data, 1.7% stated data was available on request[3], and no study shared any code or syntax related to the statistical analysis (Borg, Bon, et al., 2020). There is some reluctance to share data due to concerns regarding "scooping", where another author or research group obtains the data and publishes first ("…having to give away work that they would otherwise be able to leverage to get a head start on future publications to bigger (and hence faster-moving) groups is a real problem"). This concern is shared by researchers in other fields, who view open data access as beneficial for the development of the scientific system of knowledge but not for an individual researcher and their prospective career (Ostaszewski, 2014).
Researchers are also fearful that open data might lead to misuse or misinterpretation of those data (Ostaszewski, 2014). Yet, as data and code availability are essential for future replications and meta-analyses, identifying errors during the scientific process must be normalized and communicated in a respectful but factual manner. We, as researchers, make mistakes (Nuijten et al., 2015), and a process of long-term self-correction is important for research validity. Furthermore, citation counts are higher for studies with open data (Piwowar & Vision, 2013). There are initiatives to encourage data sharing, such as open data badges and the Peer Reviewers' Openness Initiative (Morey et al., 2016). Although there can be legitimate obstacles to data sharing (e.g., ethical considerations, intellectual property, or data that form part of a longitudinal project), one can release a limited set of variables (excluding those that threaten privacy), embargo the dataset, or share a simulated dataset (Borg, Bon, et al., 2020). Sharing data increases its utility, whereas closed science decreases its usability over time (Vines et al., 2014). When data sharing is not possible, sharing code, instruments, and analysis materials is still valuable for replication and should be encouraged in sports and exercise science.
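As a minimal sketch of the last option, a structurally similar dataset can be simulated and shared in place of sensitive raw data; all variable names and distributions below are invented for illustration:

```r
# Sketch only: simulate a dataset that mirrors the structure of a private one
# so the analysis pipeline can still be shared; all names and values are invented.
set.seed(2023)  # make the simulated data itself reproducible

n <- 100
simulated <- data.frame(
  participant_id = seq_len(n),
  age            = round(rnorm(n, mean = 35, sd = 10)),    # match reported summary stats
  vo2max         = round(rnorm(n, mean = 45, sd = 8), 1),  # hypothetical outcome variable
  group          = sample(c("control", "intervention"), n, replace = TRUE)
)

# Share this file alongside the analysis code instead of the raw data
write.csv(simulated, "simulated_dataset.csv", row.names = FALSE)
```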
Finally, there were some comments from survey respondents about the open science movement in general. Some respondents reported a negative perception around failed replications. This indicates an increased need to educate researchers on the meaning of a non-replicable finding; it does not automatically undermine the original study results or mean they are false (Maxwell et al., 2015). There are a number of reasons a replication study may produce results dissimilar to the original study, including unanticipated differences between the studies, low statistical power, or large heterogeneity in effect size estimates (Klein et al., 2018). Perhaps the term "failed" should be removed from replication research altogether, as it implies negativity. Regardless of the replication outcome, there must be respectful communication with original authors (Janz & Freese, 2020) and consideration of the tone of scientific critique. The open science movement aims to improve the current biased and exclusive academic system (Kent et al., 2022) and must be inclusive of all types of researchers: students, early career researchers, and senior researchers. In other words, a shift away from the current closed research culture and gatekeeping should be a goal of future researchers in this field.
4.1 Limitations
There are several limitations of this survey. Firstly, there was a high level of familiarity with the terms reproducibility and replicability, which indicates that respondents biased towards open science were more likely to participate (i.e., survey bias). The survey was deliberately not advertised on social media to minimize this as much as possible, but it is highly likely that our respondents shared an interest in this topic. Secondly, the survey was adapted from Baker (2016), who used the terms reproducibility and replicability interchangeably. For this survey, definitions for reproducibility and replicability were given; however, in question 9[4] these constructs were ill-defined and used interchangeably. For example, question 9 states "the results of a given study could be replicated exactly or reproduced in multiple similar experimental systems with variations of experimental settings such as materials and experimental model". This could be misleading for participants, as the answer should reflect the union of two different constructs. Additionally, some of the Likert questions were incorrectly balanced; in Figure 2 there were more options for "negative" answers than "positive" ones. This is a limitation of the original study from which this survey was adapted and was not corrected here. Finally, participants had the option of not answering questions with an open text box response; therefore, respondents who had an opinion may have been more inclined to answer (i.e., response bias).
4.2 Conclusion
More than three-quarters of respondents believe there is a reproducibility and replicability crisis in sports and exercise science. In the thematic analysis, respondents believe novel research is prioritized over methodologically sound research, and publication quantity over quality. There was a consensus that journals currently have too much research power and the guidelines/policies they have in place for increasing transparency (reporting checklists and data sharing guidelines) are not enforced sufficiently. Statistical education was also highlighted as a contributing factor towards poor reproducibility and replicability in the field. We recommend assuming increased responsibility for ensuring the reproducibility and replicability of our own work by appropriately designing studies, preregistering hypotheses, collaborating with statisticians, and sharing data. We also recommend the inclusion of open science practices as part of early career researcher education, including replication studies as a potential replacement for the traditional thesis, as well as an open mind towards other replication attempts. The strategic implementation of small changes will ultimately benefit the reproducibility and replicability of the field in the future and seeing examples of open science practices should then increase uptake, particularly amongst early career researchers in the long term.
5 Additional Information
5.1 Data Accessibility
The survey data, R code and supplementary materials are available online at https://doi.org/10.17605/OSF.IO/64R8M while the preregistration is also available online at https://doi.org/10.17605/OSF.IO/EXK6N.
5.2 Conflict of Interest
The authors report there are no competing interests to declare.
5.3 Funding
This work was supported by the Irish Research Council’s Government of Ireland Postgraduate Scholarship Programme (GOIPG/2020/1155).
5.4 Acknowledgments
The authors would like to thank Aaron Caldwell for his helpful feedback on the manuscript and the R code for the graphs. We would also like to thank the survey respondents for their time, particularly those who contacted us with feedback about the survey.
5.5 Preprint
The pre-publication version of this manuscript can be found on SportRxiv (DOI: 10.51224/SRXIV.234).
Footnotes

[1] Reproducibility: retesting a claim using the same data and comparable analyses, as opposed to replication, which uses new data (Nosek & Errington, 2020).

[2] Preregistration can be done at the Open Science Framework, or the protocol can be posted as a preprint at SportRxiv.

[3] As Gabelica et al. (2022) state, this essentially means no data is available.

[4] All question information can be found in the data repository.