Investigation of the Efficiency of a Novel Algorithm for the Choice of Variables in Cluster Analysis on Real World Data Sets

Korzeniewski, Jerzy

doi:11089/347

Artykuł - szczegóły

Czasopismo

Acta Universitatis Lodziensis. Folia Oeconomica

2010 | 235 Multivariate Statistical Analysis | 149--156

Tytuł artykułu

Investigation of the Efficiency of a Novel Algorithm for the Choice of Variables in Cluster Analysis on Real World Data Sets

Autorzy

Jerzy Korzeniewski

Treść / Zawartość

Pełne teksty:

http://dspace.uni.lodz.pl:8080/xmlui/bitstream/handle/11089/347/149-156.pdf?sequence=1 [zdalny]

Warianty tytułu

Badanie efektywności nowego algorytmu do wyboru zmiennych w analizie skupień na zbiorach dawnych ze świata realnego

Języki publikacji

Abstrakty

In the paper, we investigate the efficiency of an Author-proposed algorithm for the choice of variables in cluster analysis on real world data sets. The assessment of this algorithm on synthetic data sets in the form of the mixtures of normal distributions was the subject of other paper - the algorithm turned out to be well working. The idea of the algorithm is to pick up as true these variables whose variance does not get so small as the variance of masking variables in the one-step mean shift procedure of data set observations. (original abstract)

W artykule badana jest efektywność algorytmu do wyboru zmiennych w analizie skupień zaproponowanego przez Autora na zbiorach danych ze świata realnego. Ocena tego algorytmu na syntetycznych zbiorach danych w postaci mieszanin rozkładów normalnych była przedmiotem innego badania - algorytm spisał się dobrze. Ideą algorytmu jest wybieranie jako istotnych tych zmiennych, których wariancja nie zmniejsza się tak bardzo jak wariancja zmiennych maskujących po zastosowaniu jednego kroku procedury średniego przesunięcia do wszystkich obserwacji zbioru danych. (abstrakt oryginalny)

Słowa kluczowe

Research efficiency Algorithms Algorithmic effectiveness Cluster analysis Statistical methods

Badanie efektywności Algorytmy Efektywność algorytmów Analiza skupień Metody statystyczne

Czasopismo

Acta Universitatis Lodziensis. Folia Oeconomica

Rocznik

2010

Tom

235 Multivariate Statistical Analysis

Strony

149--156

Opis fizyczny

Twórcy

autor

Jerzy Korzeniewski

University of Lodz, Poland

Bibliografia

Atkinson A., Riani M. (2004), Cerioli A., Exploring Multivariate Data with the Forward Search, Springer-Verlag.
Carmone F. J. Jr., Kara Ali , Maxwell S. (1999), HINoV: A New Model to Improve Market Segment Definition by Identifying Noisy Variables, Journal of Marketing Research, Vol. 36, No. 4.
Dash M, Liu H., (2000) Feature selection for clustering, Proceedings of Fourth Pacific-Asia Conference on Knowledge Discovery and Data Mining, (PAKDD).
Kaufman L., Rousseeuw P. J., Finding Groups in Data: an Introduction to Cluster Analysis, John Wiley&Sons, 1990.
Korzeniewski J. (2009), A novel technique of variable choice and weighting in cluster analysis, paper at IFCS Drezno 2009.
Steinley D., Brusco M. (2008), Selection of variables in cluster analysis: an empirical comparison of eight procedures, Psychometrika Vol. 73.

Typ dokumentu

Bibliografia

Identyfikatory

DOI

11089/347

Identyfikator YADDA

bwmeta1.element.ekon-element-000169657946

Komentarze

Musisz być zalogowany aby pisać komentarze.

Acta Universitatis Lodziensis. Folia Oeconomica

Investigation of the Efficiency of a Novel Algorithm for the Choice of Variables in Cluster Analysis on Real World Data Sets

Zgłoszenie zostało wysłane

Zgłoszenie zostało wysłane