Authors
Teuvo Kohonen, Samuel Kaski, Krista Lagus, Jarkko Salojarvi, Jukka Honkela, Vesa Paatero, Antti Saarela
Publication date
2000/5
Journal
IEEE transactions on neural networks
Volume
11
Issue
3
Pages
574-585
Publisher
IEEE
Description
Describes the implementation of a system that is able to organize vast document collections according to textual similarities. It is based on the self-organizing map (SOM) algorithm. As the feature vectors for the documents statistical representations of their vocabularies are used. The main goal in our work has been to scale up the SOM algorithm to be able to deal with large amounts of high-dimensional data. In a practical experiment we mapped 6840568 patent abstracts onto a 1002240-node SOM. As the feature vectors we used 500-dimensional vectors of stochastic figures obtained as random projections of weighted word histograms.
Total citations
2000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024134991117110110100709177696566543855294042282220201810
Scholar articles
T Kohonen, S Kaski, K Lagus, J Salojarvi, J Honkela… - IEEE transactions on neural networks, 2000