Planned maintenance
A system upgrade is planned for 24/9-2024, at 12:00-14:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automated Bug Assignment: Ensemble-based Machine Learning in Large Scale Industrial Contexts
Linköping University, Sweden; Ericsson AB, Sweden.
RISE, Swedish ICT, SICS, Security Lab. Lund University, Sweden.ORCID iD: 0000-0001-7879-4371
KTH Royal Institute of Technology, Sweden; UC Berkeley, USA.
Linköping University, Sweden.
Show others and affiliations
2016 (English)In: Empirical Software Engineering, ISSN 1382-3256, E-ISSN 1573-7616, Vol. 21, no 4, p. 1533-1578Article in journal (Refereed) Published
Abstract [en]

Bug report assignment is an important part of software maintenance. In particular, incorrect assignments of bug reports to development teams can be very expensive in large software development projects. Several studies propose automating bug assignment techniques using machine learning in open source software contexts, but no study exists for large-scale proprietary projects in industry. The goal of this study is to evaluate automated bug assignment techniques that are based on machine learning classification. In particular, we study the state-of-the-art ensemble learner Stacked Generalization (SG) that combines several classifiers. We collect more than 50,000 bug reports from five development projects from two companies in different domains. We implement automated bug assignment and evaluate the performance in a set of controlled experiments. We show that SG scales to large scale industrial application and that it outperforms the use of individual classifiers for bug assignment, reaching prediction accuracies from 50 % to 89 % when large training sets are used. In addition, we show how old training data can decrease the prediction accuracy of bug assignment. We advice industry to use SG for bug assignment in proprietary contexts, using at least 2,000 bug reports for training. Finally, we highlight the importance of not solely relying on results from cross-validation when evaluating automated bug assignment.

Place, publisher, year, edition, pages
Springer US , 2016, 12. Vol. 21, no 4, p. 1533-1578
Keywords [en]
software engineering, machine learning, mining software repositories, issue management
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:ri:diva-24448DOI: 10.1007/s10664-015-9401-9Scopus ID: 2-s2.0-84941356343OAI: oai:DiVA.org:ri-24448DiVA, id: diva2:1043529
Available from: 2016-10-31 Created: 2016-10-31 Last updated: 2022-09-15Bibliographically approved

Open Access in DiVA

fulltext(1344 kB)1145 downloads
File information
File name FULLTEXT01.pdfFile size 1344 kBChecksum SHA-512
f0e20921126ff52b93cca79e1baf6b75eed755ff38dcf43833ca9641e769808359008d36a20263c168e3ab8554cbf78bbc9760d7ddfad1d4ec0e1abfa98b063a
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopushttp

Authority records

Borg, Markus

Search in DiVA

By author/editor
Jonsson, LeifBorg, MarkusRuneson, Per
By organisation
Security Lab
In the same journal
Empirical Software Engineering
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 1145 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 1439 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf