Home | Browse | Search | Credits | About
Register | User Area | DL-Harvest | Help
DLIST

Expertise classification: Collaborative classification vs. automatic extraction

Bogers, Toine and Thoonen, Willem and van den Bosch, Antal (2006) Expertise classification: Collaborative classification vs. automatic extraction. In Furner, Jonathan and Tennis, Joseph T., Eds. Proceedings 17th Workshop of the American Society for Information Science and Technology Special Interest Group in Classification Research 17, Austin, Texas.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

Social classification is the process in which a community of users categorizes the resources in that community for their own use. Given enough users and categorization, this will lead to any given resource being represented by a set of labels or descriptors shared throughout the community (Mathes, 2004). Social classification has become an extremely popular way of structuring online communities in recent years. Well-known examples of such communities are the bookmarking websites Furl (http://www.furl.net/) and del.icio.us (http://del.icio.us/), and Flickr (http://www.flickr.com/) where users can post their own photos and tag them. Social classification, however, is not limited to tagging resources: another possibility is to tag people, examples of which are Consumating (http://www.consumating.com/), a collaborative tag-based personals website, and Kevo (http://www.kevo.com/), a website that lets users tag and contribute media and information on celebrities. Another application of people tagging is expertise classification, an emerging subfield of social classification. Here, members of a group or community are classified and ranked based on the expertise they possess on a particular topic. Expertise classification is essentially comprised of two different components: expertise tagging and expert ranking. Expertise tagging focuses on describing one person at a time by assigning tags that capture that person's topical expertise, such as ‘speech recognition' or ‘small-world networks'. information request, such as, for instance, a query submitted to a search engine. Methods are developed to combine the information about individual members' expertise (tags), to provide on-the-fly query-driven rankings of community members. Expertise classification can be done in two principal ways. The simplest option follows the principle of social bookmarking websites: members are asked to supply tags that describe their own expertise and to rank the other community members with regard to a specific request for information. Alternatively, automatic expertise classification ideally extracts expertise terms automatically from a user's documents and e-mails by looking for terms that are representative for that user. These terms are then matched on the information request to produce an expert ranking of all community members. In this paper we describe such an automatic method of expertise classification and evaluate it using human expertise classification judgments. In the next section we will describe some of the related work on expertise classification, after which we will describe our automatic method of expertise classification and our evaluation of them in sections 3 and 4. Sections 5.1 and 5.1 describe our findings on expertise tagging and expert rankings, followed by discussion and our conclusions in section 6 and recommendations for future work in section 7.

EPrint Type:Conference Paper
Keywords:expert classification, social tagging
Subjects:Classification
Computer Science
Knowledge Organization
ID Code:1792
Deposited On:27 March 2007
Eprint Statistics:View statistics for this eprint
Tell A Colleague:Tell a colleague about it.

Adar, E., Kargar, D., and Stein, L. (1999). Haystack: Per–user Information Environments. CIKM ’99: Proceedings of the Eighth International Conference on Information and Knowledge Management: 413–422.

Bogers, T., and Van den Bosch, A. (2006). Authoritative re-ranking in fusing authorship-based subcollection search results. In F. de Jong and W. Kraaij (eds.), Proceedings of the Sixth Belgian-Dutch Information Retrieval Workshop (DIR-2006): 49–55. Enschede: Neslia Paniculata.

Bogers, T., and Van den Bosch, A. (2006). Authoritative re-ranking of search results. Proceedings of the 28th European Conference on Information Retrieval (ECIR 2006), vol. 3936 of Lecture Notes on Computer Science: 519–522, Berlin: Springer Verlag.

Campbell, C., et al. (2003). Expertise identification using email communications. CIKM ’03: Proceedings of the Twelfth International Conference on Information and Knowledge Management: 528–531.

Chisholm, E., and Kolga, T. (1999). New term weighting formulas for the vector space method in information retrieval. Technical report ORNL/TM-13756, Computer Science and Mathematics Division, Oak Ridge National Laboratory.

Hertzum, M., and Pejtersen A. M. (2006) The information-seeking practices of engineers: Searching for documents as well as for people. Information Processing and Management, 36(5): 761–778.

Keller, P. (2006). Delicious statistics. Retrieved August 30, 2006, from http://www.pui.ch/phred/archives/2005/12/delicious-statistics.html

Mathes, A. (2004). Folksonomies: Cooperative classification and communication through shared metadata. Technical report, University of Illinois Urbana-Champaign.

McDonald, D. W. (2001). Evaluating expertise recommendations. Proceedings of the ACM 2001 International Conference on Supporting Group Work(GROUP’01): 214–223.

Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review 63: 81-97.

Procter, R., et al. (1998). Genres in support of collaborative information retrieval in the virtual library. Interacting with Computers 10(2): 157–172.

Ross, P. E. (2006). The expert mind. Scientific American, August 2006.

Salthouse, T. (1991). Expertise as the circumvention of human processing limitations. In Toward a general theory of expertise (pp. 286-300). Cambridge: Cambridge University Press.

Sternberg, R. (1994). Cognitive conceptions of expertise. In Expertise in context (pp. 149-162). Cambridge, MA: MIT Press.

Thoonen, W. (2006). Expertise in Werkgroepen. Master’s thesis, Tilburg University, Tilburg.

TREC (2005). TREC Enterprise Track. Retrieved October 2005 from http://www.ins.cwi.nl/projects/trec-ent/wiki/index.php/Main Page.

Van Rijsbergen, C. (1979). Information retrieval, 2nd ed. Retrieved from http://www.dcs.gla.ac.uk/Keith/Preface.html.

Zheng, Z., and Srihari, R. (2003). Optimally combining positive and negative features for text categorization. ICML 2003 Workshop for Learning from Imbalanced Datasets II.

EPrints dLIST, an open access archive for the Information Sciences, is supported by the School of Information Resources and Library Science and Learning Technologies Center, University of Arizona. Established in 2002, dLIST has a global Advisory Board and is a part of the Information Technology & Society Research Lab. Open Archives
Contact: Admin | Donate