Cristin-resultat-ID: 1362513
Sist endret: 5. desember 2016, 13:56
NVI-rapporteringsår: 2016
Resultat
Vitenskapelig artikkel
2016

Incorrect results in software engineering experiments: How to improve research practices

Bidragsytere:
  • Magne Jørgensen
  • Tore Dybå
  • Knut Liestøl og
  • Dag Sjøberg

Tidsskrift

Journal of Systems and Software
ISSN 0164-1212
e-ISSN 1873-1228
NVI-nivå 2

Om resultatet

Vitenskapelig artikkel
Publiseringsår: 2016
Publisert online: 2015
Volum: 116
Sider: 133 - 145
Open Access

Importkilder

Scopus-ID: 2-s2.0-84927142981

Beskrivelse Beskrivelse

Tittel

Incorrect results in software engineering experiments: How to improve research practices

Sammendrag

Context The trustworthiness of research results is a growing concern in many empirical disciplines. Aim The goals of this paper are to assess how much the trustworthiness of results reported in software engineering experiments is affected by researcher and publication bias, given typical statistical power and significance levels, and to suggest improved research practices. Method First, we conducted a small-scale survey to document the presence of researcher and publication biases in software engineering experiments. Then, we built a model that estimates the proportion of correct results for different levels of researcher and publication bias. A review of 150 randomly selected software engineering experiments published in the period 2002–2013 was conducted to provide input to the model. Results The survey indicates that researcher and publication bias is quite common. This finding is supported by the observation that the actual proportion of statistically significant results reported in the reviewed papers was about twice as high as the one expected assuming no researcher and publication bias. Our models suggest a high proportion of incorrect results even with quite conservative assumptions. Conclusion Research practices must improve to increase the trustworthiness of software engineering experiments. A key to this improvement is to avoid conducting studies with unsatisfactory low statistical power.

Bidragsytere

Magne Jørgensen

  • Tilknyttet:
    Forfatter
    ved Simula Research Laboratory
  • Tilknyttet:
    Forfatter
    ved Forskningsgruppen for programmering og software engineering ved Universitetet i Oslo
Aktiv cristin-person

Tore Dybå

  • Tilknyttet:
    Forfatter
    ved Forskningsgruppen for programmering og software engineering ved Universitetet i Oslo
  • Tilknyttet:
    Forfatter
    ved Software Engineering, Safety and Security ved SINTEF AS

Knut Liestøl

  • Tilknyttet:
    Forfatter
    ved Forskningsgruppen for biomedisinsk informatikk ved Universitetet i Oslo
Aktiv cristin-person

Dag Sjøberg

  • Tilknyttet:
    Forfatter
    ved Forskningsgruppen for programmering og software engineering ved Universitetet i Oslo
1 - 4 av 4