Scientific and Technical Journal


ISSN Print 2221-3937
ISSN Online 2221-3805

Сlustering methods on large databases are discussed. As one of the possible algorithms for processing very large data bases (VLDB) a hierarchical agglomerative method BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies) is considered. We introduce its matrix analog that allows not only processing huge data amounts, but also multidimensional data large database, and therefore allows you to work with multimedia content such as images or video information. We can also note that the BIRCH matrix modification easy to implement, fast enough, robust to different kinds of noises, allows consistent processing.



1. Han J., Kamber M., (2006) Data Mining: Concepts and Techniques. 2-nd ed., San Francisco: Morgan Kaufmann, 800 p.(In English)

2. Bodyanskiy Ye.V., Vynokurova E.A., and Dolotov A.I., (2013), Self-Learning Cascade Spiking Neural Network for Fuzzy Clustering Based on Group Method of Data Handling, Journal of Automation and Information Sciences, 45, No. 3, pp.23 – 33 (In English).

3. KolchyginB.V., and BodyanskiyYe.V., (2013), Adaptive Fuzzy Clustering with a Variable Fuzzifier, Cybernetics and Systems Analysis, 49, Issue 3, pp. 366 – 374 (In English).

4. BodyanskiyYe.V.,KolchyginB.V., VolkovaV.V., andPlissI.P., Adaptivnaya nechetkaya klasterizatsiya dannykh na osnove metoda Gustafsona–Kesselya [AdaptiveFuzzyClusteringviaGustafson-KesselMethods], (2013), UpravliaushieSistemiiMashini,No. 2, pp. 40 – 46 (InRussian).

5. Ng R.T., and Han J., (1994), Efficient and Effective Clustering Methods for Spatial Data Mining, Proc. 20th Int. Conf. on Very Large Data Bases. Santiago, Chile, pp. 144 – 145 (In English).

6. KaufmanL., and RousseeuwP.J., (1990),FindingGroups in Data: An Introduction to Cluster Analysis. N.Y.: John Wiley&Sons, 342 p.(In English).

7. Zhang T., Ramakrishnan R., and Livny M., (1996), BIRCH: An Efficient Data Clustering Method for Very Large Databases, Proc. Of the ACM SIGMOD Conf. On management of Data, Montreal: ACM Press, pp. 103 – 114(In English).

8. Guha S., Rastogi R., and Shim K., (2001), CURE: am Efficient Clustering Algorithm for Large Databases, Information Systems, 26, No. 1, pp. 35 – 58(In English).

9. Jain A.K., and Dubes R.C., (1988), Algorithms for Clustering Data, Englewood Cliffs, N.J.: Prentice Hall, 318 p. (In English).

Last download:
9 Dec 2019

[ © KarelWintersky ] [ All articles ] [ All authors ]
[ © Odessa National Polytechnic University, 2014-2018. Any use of information from the site is possible only under the condition that the source link! ]