Home | Browse | Search | Credits | About
Register | User Area | DL-Harvest | Help
DLIST

Comparison of Two Approaches to Building a Vertical Search Tool: A Case Study in the Nanotechnology Domain

Chau, Michael and Chen, Hsinchun and Qin, Jailun and Zhou, Yilu and Qin, Yi and Sung, Wai-Ki and McDonald, Daniel M. (2002) Comparison of Two Approaches to Building a Vertical Search Tool: A Case Study in the Nanotechnology Domain. In Proceedings Joint Conference on Digital Libraries, Portland, OR.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

As the Web has been growing exponentially, it has become increasingly difficult to search for desired information. In recent years, many domain-specific (vertical) search tools have been developed to serve the information needs of specific fields. This paper describes two approaches to building a domain-specific search tool. We report our experience in building two different tools in the nanotechnology domain  (1) a server-side search engine, and (2) a client-side search agent. The designs of the two search systems are presented and discussed, and their strengths and weaknesses are compared. Some future research directions are also discussed.

EPrint Type:Conference Paper
Keywords:National Science Digital Library, NSDL, Artificial Intelligence Lab, AI Lab, Information retrieval, Web search engine, vertical search engine, Internet spider, Internet searching and browsing, post-retrieval analysis, indexing, noun-phrasing, selforganizing map, personalization, summarization.
Subjects:Internet
Digital Libraries
Information Extraction
ID Code:474
Deposited On:09 September 2004
Alternative Locations:http://ai.bpa.arizona.edu/go/papers.html
Eprint Statistics:View statistics for this eprint
Tell A Colleague:Tell a colleague about it.

[1] Bowman, C. M., Danzig, P. B., Manber, U., and Schwartz F. Scalable Internet Resource Discovery: Research Problems and Approaches, Communications of the ACM, 37(8)(1994), 98-107.

[2] Brin, S. and Page, L. The Anatomy of a Large-Scale Hypertextual Web Search Engine. In Proceedings of the 7th International World Wide Web Conference (WWW7), Brisbane, Australia, Apr 1998.

[3] Chakrabarti, S., van den Berg, M., and Dom B. Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery. In Proceedings of the 8th International World Wide Web Conference (WWW8), Toronto, Canada, May 1999.

[4] Chau, M., Chen, H., Qin, J., Zhou, Y., Sung, W. K., Chen, Y., Qin, Y., McDonald, D., Lally, A., and Landon, M. NanoPort: A Web Portal for Nanoscale Science and Technology. In Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL’02), Portland, OR, USA, July 2002.

[5] Chau, M., Zeng, D., and Chen, H. Personalized Spiders for Web Search and Analysis. In Proceedings of the First ACM/IEEE-CS Joint Conference on Digital Librarie(JCDL’01), Roanoke, VA, USA, June 2001.

[6] Chen, H. “Collaborative Systems: Solving the Vocabulary Problem,” IEEE Computer, Special Issue on Computer-Supported Cooperative Work (CSCW), 27(5) (1994), 58-66.

[7] Chen, H., Fan, H., Chau, M., and Zeng, D. MetaSpider: Meta-Searching and Categorization on the Web, Journal of the American Society for Information Science and Technology, 52(13), 1134-1147 (2001).

[8] Chen, H., Schufels, C., and Orwig, R. Internet Categorization and Search: A Self-Organizing Approach, Journal of Visual Communication and Image Representation, 7(1), 88-102 (1996).

[9] Courteau , J. “Genome Databases,” Science, 254, (1991), 201-207.

[10] DeBra, P. and Post, R. Information retrieval in the World-Wide Web: Making Client-based Searching Feasible. In Proceedings of the First International World Wide Web Conference, Geneva, Switzerland, 1994.

[11] Fox, E., Hix, D., Nowell, L. T., Brueni, D. J., Wake, W. C., Lenwood, S. H., and Rao, D. Users, User Interfaces, and Objects: Envision, A Digital Library. Journal of the American Society for Information Science, 44(8) (1993), 480-491.

[12] Furnas, G. W., Landauer, T. K., Gomez, L. M., and Dumais, S. T. “The Vocabulary Problem in Human-System Communication” Communications of the ACM, 30(11),(1987), 964-971.

[13] Hearst, M. A. TextTiling: Segmenting Text into Multiparagraph Subtopics Passages. Computational Linguistics, 23(1) (1997), 33-64.

[14] Hearst, M. A. and Pedersen, J. Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results, in Proceedings of the 19th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’96), 76-84 (1996).

[15] Hovy, E. and Lin, C. Y. Automated Text Summarization in SUMMARIST. Advances in Automatic Text Summarization, 81-94, MIT Press 1999.

[16] Kohonen, T. Self-Organizing Maps. Springer-Verlag, Berlin, 1995.

[17] Lawrence, S. and Giles, C. L., Inquirus, the NECI Meta Search Engine. In Proceedings of the 7th International World Wide Web Conference, Brisbane, Australia, Apr 1998.

[18] Lawrence, S. and Giles, C. L. Accessibility of Information on the Web, Nature, 400 (1999), 107-109.

[19] Lin, C., Chen, H., and Nunamaker, J. Verifying the Proximity and Size Hypothesis for Self-Organizing Maps. Journal of Management Information Systems, 16(3) (1999-2000), 61-73.

[20] Lin, X., Soergel, D., and Marchionini, G. A Self organizing Semantic Map for Information Retrieval, in Proceedings of the 14th International ACM SIGIR Conference on Research and Development in Information Retrieval (1991), 262-269.

[21] Luhn, H. P. The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development 2 (2), 159-165(1959).

[22] Mani, I. and Maybury, M. T. Advances in Automatic Text Summarization. MIT Press, 1999, ix-xv.

[23] Mauldin, M. L. Lycos: Design Choices in an Internet Search Service. IEEE Expert, 12(1) (1997), 8-11.

[24] McBryan, O. A. GENVL and WWWW: Tools for Taming the Web. In Proceedings of the 1st International World Wide Web Conference, Geneva, Switzerland, 1994.

[25] Pinkerton, B. Finding What People Want: Experiences with the WebCrawler. In Proceedings of the 2nd International World Wide Web Conference, Chicago, IL, USA, 1994.

[26] Shneiderman, B., Feldman, D., Rose, A. and Grau, X. F. Visualizing Digital Library Search Results with Categorical and Hierarchical Axes, in Proceedings of 5th ACM Conference on ACM 2000 Digital Libraries, San Antonio, TX, USA, 2000.

[27] Stix, G. (ed.). Nanotechnology. Scientific America, September 2001 (entire issue).

[28] Tolle, K. M. and Chen, H. Comparing Noun Phrasing Techniques for Use with Medical Digital Library Tools. Journal of the American Society for Information Science, 51(4) (2000), 352-370.

[29] Veerasamy, A. and Belkin, N. J., Evaluation of a Tool for Visualization of Information Retrieval Results. In Proceedings of the 19th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’96), 85-92, 1996.

EPrints dLIST, an open access archive for the Information Sciences, is supported by the School of Information Resources and Library Science and Learning Technologies Center, University of Arizona. Established in 2002, dLIST has a global Advisory Board and is a part of the Information Technology & Society Research Lab. Open Archives
Contact: Admin | Donate