Home | Browse | Search | Credits | About
Register | User Area | DL-Harvest | Help
DLIST

Semantic Retrieval for the NCSA Mosaic

Chen, Hsinchun and Schatz, Bruce R. (1994) Semantic Retrieval for the NCSA Mosaic. In Proceedings International World Wide Web Conferences, Chicago, IL.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

In this paper we report an automatic and scalable concept space approach to enhancing the deep searching capability of the NCSA Mosaic. The research, which is based on the findings from a previous NSF National Collaboratory project and which will be expanded in a new Illinois NSF/ARPA/NASA Digital Library project, centers around semantic retrieval and user customization. Semantic retrieval supports a higher level of abstraction in user search, which can overcome the vocabulary problem for information retrieval. Rather than searching for words within the object space, the search is for terms within a concept space (graph of terms occurring within objects linked to each other by the frequency with which they occur together). Co-occurrence graphs seem to provide good suggestive power in specialized domains, such as biology. By providing a more understandable, system-generated, semantics-rich concept space as an abstraction of the enormously complex object space plus algorithms and interface to assist in object/concept spaces traversal, we believe we can greatly alleviate both information overload and the vocabulary problem of internet services. These techniques will also be used to provide a form of customized retrieval and automatic information routing. Results from past research, the specific algorithms and techniques, and the research plan for enhancing the NCSA Mosaic's search capability in the NSF/ARPA/NASA Digital Library project will be discussed.

EPrint Type:Conference Paper
Keywords:National Science Digital Library, NSDL, Artificial Intelligence Lab, AI Lab, Information Retrieval
Subjects:Digital Libraries
Information Extraction
ID Code:523
Deposited On:01 October 2004
Alternative Locations:http://ai.bpa.arizona.edu/go/papers.html
Eprint Statistics:View statistics for this eprint
Tell A Colleague:Tell a colleague about it.

1 M. J. Bates. Subject access in online catalogs: a design model. Journal of the American Society for Information Science, 37(6):357-376, November 1986.

2 H. Chen. Collaborative systems: solving the vocabulary problem. IEEE COMPUTER, 27(5):58-66, Special Issue on Computer-Supported Cooperative Work (CSCW), May 1994.

3 H. Chen, P. Hsu, R. Orwig, L. Hoopes, and J. F. Nunamaker. Automatic concept classification of text from electronic meetings. Communications of the ACM, 37(10), October 1994.

4 H. Chen and K. J. Lynch. Automatic construction of networks of concepts characterizing document databases. IEEE Transactions on Systems, Man and Cybernetics, 2 (5):885-902, September/October 1992.

5 H. Chen, K. J. Lynch, K. Basu, and T. Ng. Generating, integrating, and activating thesauri for concept-based document retrieval. IEEE EXPERT, Special Series on Artificial Intelligence in Text-Based Information Systems, 8(2):25-34, April 1993.

6 H. Chen and T. Ng. An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation): symbolic branch-and-bound vs. connectionist Hopfield net activation. In Journal of the American Society for Information Science, 1994, in press.

7 H. Chen, B. Schatz, T. Yim, and D. Fye. Automatic thesaurus generation for an electronic community system. In Journal of the American Society for Information Science, 1994, forthcoming.

8 G. W. Furnas, T. K. Landauer, L. M. Gomez, and S. T. Dumais. The vocabulary problem in human-system communication. Communications of the ACM, 30(11):964-971, November 1987.

EPrints dLIST, an open access archive for the Information Sciences, is supported by the School of Information Resources and Library Science and Learning Technologies Center, University of Arizona. Established in 2002, dLIST has a global Advisory Board and is a part of the Information Technology & Society Research Lab. Open Archives
Contact: Admin | Donate