Ocena porównawcza kilku metod wyznaczania liczby skupień w zbiorze danych

Korzeniowski, Jerzy

Artykuł - szczegóły

Czasopismo

Prace Naukowe Akademii Ekonomicznej we Wrocławiu. Taksonomia

2006 | 13 | nr 1126 Klasyfikacja i analiza danych - teoria i zastosowania | 168--174

Tytuł artykułu

Ocena porównawcza kilku metod wyznaczania liczby skupień w zbiorze danych

Autorzy

Jerzy Korzeniowski

Warianty tytułu

Performance Comparison of Some Methods of Determining the Number of Clusters in a Data Set

Języki publikacji

Abstrakty

W niniejszej pracy zaproponowana została nowa wersja algorytmu, w której liczba skupień jest ustalana za pomocą miernika liczbowego opartego na statystyce ϰ² przeprowadzony został też eksperyment oceniający algorytm w porównaniu z innymi wybranymi metodami. (fragment tekstu)

This paper is an attempt to compare the performance of an algorithm for determining the number of clusters in a data set proposed by the author with other methods of determining the number о clusters. The other methods include the Rousseau silhouette index, the Krzanowski-Lai index, the Caliński-Harabasz index and the Hartigan index. The comparison was carried out on the Euclidean space sets generated by the Milligan's CLUSTGEN program in 4,6 and 8 dimensions. The algorithm proposed does not depend on the grouping of set's point therefore the other methods were applied to two different in nature grouping methods i.e. the k-means method and the average linkage agglomeration method. The results seem to prove the new algorithm's usefulness. (original abstract)

Słowa kluczowe

Analiza skupień Algorytmy Eksperyment badawczy

Cluster analysis Algorithms Scientific experiment

Czasopismo

Prace Naukowe Akademii Ekonomicznej we Wrocławiu. Taksonomia

Rocznik

2006

Tom

Numer

nr 1126 Klasyfikacja i analiza danych - teoria i zastosowania

Strony

168--174

Opis fizyczny

Twórcy

autor

Jerzy Korzeniowski

Uniwersytet Łódzki

Bibliografia

Comaniciu D., Meer P. (1999), Mean Shift Analysis and Applications, IEЕЕ Int. Conf. Computer Vision (ICCV99), Kerkyra, Greece, s. 1197-1203.
Gordon A.D. (1999), Classification, Chapman & Hall.
Korzeniewski J. (2005), Propozycja nowego algorytmu wyznaczającego liczbą skupień, Klasyfikacja i analiza danych - teoria i zastosowani, Taksonomia 12, AE, Wrocław.
Milligan G.W. (1985), An Algorithm for Generating Artificial Test Clusters, ,Psychometrika" vol. 50, nr 1, s. 123-127.
Sugar C.A., James G.M. (2003), Finding the Number of Clusters in a Dataset: An Information-Theoretic Approach, "JASA" vol. 98.

Typ dokumentu

Bibliografia

Identyfikatory

Identyfikator YADDA

bwmeta1.element.ekon-element-000171558352

Komentarze

Musisz być zalogowany aby pisać komentarze.

Prace Naukowe Akademii Ekonomicznej we Wrocławiu. Taksonomia

Ocena porównawcza kilku metod wyznaczania liczby skupień w zbiorze danych

Zgłoszenie zostało wysłane

Zgłoszenie zostało wysłane