A multivariate statistical framework for the analysis of software effort phase distribution
2015 (English). In: Information and Software Technology, ISSN 0950-5849, E-ISSN 1873-6025, Vol. 59, p. 149-169. Article in journal (Refereed). Published
Abstract [en]
Context
In software project management, the distribution of resources to various project activities is one of the most challenging problems since it affects team productivity, product quality and project constraints related to budget and scheduling.
Objective
The study aims to (a) reveal the high complexity of modelling the effort usage proportion in different phases as well as the divergence from various rules-of-thumb in related literature, and (b) present a systematic data analysis framework, able to offer better interpretations and visualisation of the effort distributed in specific phases.
Method
The basis for the proposed multivariate statistical framework is Compositional Data Analysis, a methodology appropriate for proportions, combined with other methods such as deviation from rules-of-thumb, cluster analysis and analysis of variance. The effort allocations to phases, as reported in around 1500 software projects of the ISBSG R11 repository, were transformed to vectors of proportions of the total effort and were analysed with respect to prime project attributes.
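As an illustration of the transformation step described above, the following is a minimal sketch of how phase efforts could be turned into a vector of proportions and compared with a rule-of-thumb split. It assumes the standard compositional-data tools of closure, the centred log-ratio (clr) transform and the Aitchison distance; the abstract does not state which transforms or distance the authors used. The function names, the five-phase breakdown and all numbers are hypothetical and are not taken from the ISBSG data. The sketch also assumes every phase has strictly positive effort, since log-ratios are undefined for zeros.

```python
import math


def closure(efforts):
    """Rescale phase efforts so that they sum to 1 (vector of proportions)."""
    total = sum(efforts)
    if total <= 0:
        raise ValueError("total effort must be positive")
    return [e / total for e in efforts]


def clr(composition):
    """Centred log-ratio transform: log of each part minus the mean log."""
    if any(p <= 0 for p in composition):
        # Assumption: zero-effort phases are handled before this step.
        raise ValueError("clr requires strictly positive parts")
    logs = [math.log(p) for p in composition]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]


def aitchison_distance(x, y):
    """Euclidean distance between the clr coordinates of two compositions."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(clr(x), clr(y))))


# Hypothetical person-hours per phase for one project (illustrative only).
project_hours = [120.0, 200.0, 480.0, 160.0, 40.0]
proportions = closure(project_hours)

# Hypothetical rule-of-thumb split over the same five phases.
rule_of_thumb = closure([15, 20, 40, 20, 5])

print("proportions:", proportions)
print("distance to rule-of-thumb:", aitchison_distance(proportions, rule_of_thumb))
```

Because the distance is computed on log-ratios, it depends only on the relative sizes of the phases and not on the total effort of the project, which is the property that makes this family of methods suitable for proportions.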
Results
The proposed statistical framework was able to detect high dispersion among data, distribution inequality and various interesting correlations and trends, groupings and outliers, especially with respect to other categorical and continuous project attributes. Only a very small number of projects were found close to the rules-of-thumb from the related literature. Significant differences in the proportion of effort spent in different phases for different types of projects were found.
Conclusion
There is no simple model for the effort allocated to phases of software projects. The data from previous projects can provide valuable information regarding the distribution of the effort for various types of projects, through analysis with multivariate statistical methodologies. The proposed statistical framework is generic and can be easily applied in a similar way to any dataset containing effort allocation to phases.
Keywords [en]
Software effort distribution, Phased effort analysis, Compositional data analysis, Biplot, Cluster analysis
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:ri:diva-33186
DOI: 10.1016/j.infsof.2014.11.004
Scopus ID: 2-s2.0-84921022835
OAI: oai:DiVA.org:ri-33186
DiVA, id: diva2:1177603
2018-01-25; 2018-01-25; 2023-05-16. Bibliographically approved