Home | Browse | Search | Credits | About
Register | User Area | DL-Harvest | Help
DLIST

Verifying the proximity and size hypothesis for self-organizing maps

Lin, Chienting and Chen, Hsinchun and Nunamaker Jr., Jay F. (2000) Verifying the proximity and size hypothesis for self-organizing maps . Journal of Management Information Systems 16(3):pp. 57-70.

Full text available as:
HTML

Abstract

The Kohonen Self-Organizing Map (SOM) is an unsupervised learning technique for summarizing high-dimensional data so that similar inputs are, in general, mapped close to one another. When applied to textual data, SOM has been shown to be able to group together related concepts in a data collection and to present major topics within the collection with larger regions. Research in which properties of SOM were validated, called the Proximity and Size Hypotheses,is presented through a user evaluation study. Building upon the previous research in automatic concept generation and classification, it is demonstrated that the Kohonen SOM was able to perform concept clustering effectively, based on its concept precision and recall7 scores as judged by human experts. A positive relationship between the size of an SOM region and the number of documents contained in the region is also demonstrated.

EPrint Type:Journal Article (Paginated)
Keywords:National Science Digital Library, NSDL, Artificial Intelligence Lab, AI Lab, Document Management, Decision Support Systems, Algorithms
Subjects:Management Information Systems
Knowledge Management
Information Systems
ID Code:457
Deposited On:04 September 2004
Alternative Locations:http://ai.bpa.arizona.edu/go/papers.html
Eprint Statistics:View statistics for this eprint
Tell A Colleague:Tell a colleague about it.

1. Chen, H. Machine learning for information retrieval: neural networks, symbolic learning, and genetic algorithms. Journal of the American Society for Information Science, 46 (1995), 194-216.

2. Chen, H.; Schuffels, C.; and Orwig, R. Internet categorization and search: a self-organizing approach. Journal of Visual Communications and Image Representation, 7, 1 (March 1996), 88102.

3. Doszkocs, T ; Reggia J.; and Lin X. Connectionist models and information retrieval. Annual Review of Information Science and Technology 25 (1990), 209-260.

4. Kaski, S.; Honkela, T.; Lagus, K.; and Kohonen, T. Creating an order in digital libraries with self organizing maps. Proceedings of World Congress on Neural Networks, San Diego, 1996.

5. Kohonen, T. Self Organizing Maps. Berlin: Springer Verlag, 1995.

6. Lin X.,; Soergel D.; and Marchionini G. A self-organizing semantic map for information retrieval. Proceedings of the Fifteenth Annual lnternational ACM/SIGIR Conference on R&D in Information Retrieval, Copenhagen, 1992, pp. 37-50.

7. Miikkulainen, R. Symbolic Natural Language Processing: An Integrated Model of Scripts, Lexicons, and Memory. Cambridge, MA: MIT Press, 1993.

8. Orwig R.; Chen, H.; and Nunamaker, J.ft A graphical, self-organizing approach to classifying electronic meeting output. Journal of the American Society for Information Science, 48, 2 ( 1997), 157-170.

9. Ritter, H., and Kohonen, T. Self-organizing semantic maps. Biological Cybernetics, 61 (1989), 241-254.

10. Roussinov D., and Chen, H. A scalable self-organizing map algorithm for textual classification: a neural network approach to thesaurus generation. Communication Cognition and Artificial Intelligence (Spring 1998).

11. Salton, G. Automatic Text Processing. Reading, MA: Addison Wiley, 1989.

12. Stepp, R. Concepts in conceptual clustering. Proceedings of the Tenth International Joint Conference on Artificial Intelligence, Milan; 1987.

EPrints dLIST, an open access archive for the Information Sciences, is supported by the School of Information Resources and Library Science and Learning Technologies Center, University of Arizona. Established in 2002, dLIST has a global Advisory Board and is a part of the Information Technology & Society Research Lab. Open Archives
Contact: Admin | Donate