DLIST

Testing a Cancer Meta Spider

Chen, Hsinchun and Fan, Haiyan and Chau, Michael and Zeng, Daniel (2003) Testing a Cancer Meta Spider. International Journal of Human-Computer Studies 59(1), pp. 755-776.

Abstract

As in many other applications, the rapid proliferation and unrestricted Web-based publishing of health-related content have made finding pertinent and useful healthcare information increasingly difficult. Although healthcare information retrieval systems such as medical search engines and peer-reviewed medical Web directories have helped alleviate this information and cognitive overload, their effectiveness has been limited by low search precision, poor presentation of search results, and the search effort required of users. To address these challenges, we have developed a domain-specific meta-search tool called Cancer Spider. By leveraging post-retrieval document clustering techniques, the system helps users query multiple medical data sources, gain an overview of the retrieved documents, and locate high-quality answers to a wide spectrum of health questions. The system presents the retrieved documents to users in two different views: (1) Web pages organized by a list of key phrases, and (2) Web pages clustered into regions discussing different topics on a two-dimensional map (self-organizing map). In this paper, we present the major components of the Cancer Spider system and a user study designed to evaluate the effectiveness and efficiency of our approach. Initial results comparing Cancer Spider with NLM Gateway, a premium medical search site, show that the two systems achieved comparable performance as measured by precision, recall, and F-measure, while Cancer Spider required less user searching time, fewer documents to browse, and less user effort.
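The three effectiveness measures named in the abstract can be illustrated with a minimal sketch. It assumes the standard set-based definitions and the balanced F-measure (harmonic mean of precision and recall); the abstract does not state how the study itself computed them, and the function name and document IDs below are hypothetical.

```python
def precision_recall_f(retrieved, relevant):
    """Return (precision, recall, F-measure) for a single query.

    Assumption: standard set-based definitions with the balanced
    F-measure; the study's exact formulation is not given here.
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)  # relevant items that were retrieved

    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Hypothetical example: 4 items retrieved, 3 relevant, 2 in common.
p, r, f = precision_recall_f(["d1", "d2", "d3", "d4"], ["d2", "d4", "d5"])
print(f"precision={p:.3f} recall={r:.3f} F={f:.3f}")
```

Because the balanced F-measure is a harmonic mean, it stays low unless both precision and recall are reasonably high, which makes it a convenient single figure for comparing two search systems.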

EPrint Type: Journal Article (Paginated)
Keywords: National Science Digital Library, NSDL, Artificial Intelligence Lab, AI Lab, Cancer Spider
Subjects: Human Computer Interaction
Database Searching Instructions
ID Code: 414
Deposited On: 16 August 2004
Alternative Locations: http://ai.bpa.arizona.edu/go/papers.html

