Czasopismo
Tytuł artykułu
Autorzy
Warianty tytułu
Języki publikacji
Abstrakty
In this article we present an application of Artificial Intelligence in software project management by estimation of software metrics. Due to increasing significance of Open Source, we have selected project being hosted on the leading hosting platform of Open Source projects Source- forge.net. In the first part of the article, we describe steps of data extraction which was a large scale task because of the data source size and complexity. Moreover, we used data that were originally gathered to be used by project collaboration web-based system not to predict any features. Therefore extraction of meaningful data required an analysis of databases structure arid transformation of sets of records into the meaningful datasets. These datasets were constructed to predict four factors important to project management, i.e. skills, time, costs and effectiveness. In the later part of the article, we present the results of prediction experiments, that were performed using C4.5, RandomTree, CART, Neural Networks and Bayesian Belief Networks algorithms. Finally, we analyse influence of several methods' parameters on estimation accuracy and size of developed knowledge base. (original abstract)
Rocznik
Strony
96--105
Opis fizyczny
Twórcy
autor
- Zachodniopomorska Szkoła Biznesu w Szczecinie
autor
- Zachodniopomorski Uniwersytet Technologiczny w Szczecinie
Bibliografia
- Breiman L. (2001), Random forests, Machine Learning, Vol. 45, No. 1, pp. 5-32.
- Breiman L., Friedman J.PI., Olshen R.A., Stone C.J. (1984), Classification and Regression Trees, Wad- sworth International Group, Belmont.
- Eilhard J., Meniere Y. (2009), A look inside the forge: Developer productivity and spillovers in open source project, SSRN Working Paper Series, http://ssrn.com/abstract=13l6772.
- English R., Sehweik C.M. (2007), Identifying success and abandonment of floss commons: A classification ofsourceforge.net projects, Upgrade: The European Journal for the Informatics Professional, Vol. 8, No. 6, pp. 54-59.
- Gao Y., Huang Y., Madey G. (2004), Data mining project history in open source software communities, [in:] North American Association for Computational Social and Organization Sciences 2004, Pittsburgh.
- Madey G. (2008), The Sourceforge research data archive (srda), http://zerlot.cse.nd.edu.
- McColm G.L. (2003), An introduction to random trees, Research on Language & Computation, Vol. 1, No. 3-4, pp. 203-227.
- Mitchell T.M. (1997), Machine Learning, McGraw Hill, New York.
- Munson J.C. (2003), Software Engineering Measurement, CRC Press, Boca Raton.
- Quinlan R. (1993), C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers, San Mateo.
- Raja U., Tretter M.J. (2006), Investigating Open Source Project success A data mining approach to model formulation, testing and validation, [in:] Proceedings of the Thirty first Annual SAS Users Group International Conference 2006, San Francisco, pp. 71-78.
- Russell S.J., Norvig P. (1995), Artificial Intelligence: A Modern Approach, Prentice Hall, Englewood Cliffs.
- Weiss D. (2005), Measuring success of open source projects using web search engines, [in:] The First International Conference on Open Source Systems (OSS 2005), Genova, pp. 93-99.
Typ dokumentu
Bibliografia
Identyfikatory
Identyfikator YADDA
bwmeta1.element.ekon-element-000169330064