Home | Browse | Search | Credits | About
Register | User Area | DL-Harvest | Help
DLIST

Extracting Meaningful Entities from Police Narrative Reports

Chau, Michael and Xu, Jennifer J. and Chen, Hsinchun (2002) Extracting Meaningful Entities from Police Narrative Reports. In Proceedings National Conference for Digital Government Research, Los Angeles, CA.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

Valuable criminal-justice data in free texts such as police narrative reports are currently difficult to be accessed and used by intelligence investigators in crime analyses. It would be desirable to automatically identify from text reports meaningful entities, such as person names, addresses, narcotic drugs, or vehicle names to facilitate crime investigation. In this paper, we report our work on a neural network-based entity extractor, which applies named-entity extraction techniques to identify useful entities from police narrative reports. Preliminary evaluation results demonstrated that our approach is feasible and has some potential values for real-life applications. Our system achieved encouraging precision and recall rates for person names and narcotic drugs, but did not perform well for addresses and personal properties. Our future work includes conducting larger-scale evaluation studies and enhancing the system to capture human knowledge interactively.

EPrint Type:Conference Paper
Keywords:National Science Digital Library, NSDL, Artificial Intelligence Lab, AI Lab, Extraction
Subjects:Knowledge Management
Data Mining
Information Seeking Behaviors
ID Code:423
Deposited On:16 August 2004
Alternative Locations:http://ai.bpa.arizona.edu/go/papers.html
Eprint Statistics:View statistics for this eprint
Tell A Colleague:Tell a colleague about it.

S. Baluja, V. Mittal, and R. Sukthankar (1999). Applying machine learning for high performance namedentity

extraction, in Proceedings of the Conference of the Pacific Association for Computational

Linguistics, 1999.

A. Borthwick, J. Sterling, E. Agichtein, and R. Grishman (1998). NYU: Description of the MENE named

entity system as used in MUC-7, in Proceedings of the Seventh Message Understanding Conference

(MUC-7), April 1998.

H. Chen, J. Schroeder, R. V. Hauck, L. Ridgeway, H. Atabakhsh, H. Gupta, C. Boarman, K. Rasmussen,

and A. W. Clements (2002). COPLINK Connect: Information and knowledge management for law

enforcement, Decision Support Systems, Special Issue on Digital Government, forthcoming.

N. A. Chinchor (1998). Overview of MUC-7/MET-2, in Proceedings of the Seventh Message

Understanding Conference (MUC-7), April 1998.

R. V. Hauck, H. Atabakhsh, P. Ongvasith, H. Gupta, and H. Chen (2002). Using Coplink to analyze

criminal justice data, IEEE Computer, 35(3), pp. 30-37.

G. R. Krupka and K. Hausman (1998). IsoQuest Inc.: Description of the NetOwlTM extractor system as

used for MUC-7, in Proceedings of the Seventh Message Understanding Conference (MUC-7), April

1998.

R. P. Lippmann (1987). An introduction to computing with neural networks, IEEE Acoustics Speech and

Signal Processing Magazine, 4(2), pp. 4-22.

E. Marsh and D. Perzanowski (1998). MUC-7 evaluation of IE technology: Overview of results, in

Proceedings of the Seventh Message Understanding Conference (MUC-7), April 1998.

S. Miller, M. Crystal, H. Fox, L. Ramshaw, R. Schwartz, R. Stone, R. Weischedel, and the Annotation

Group (1998). BBN: Description of the SIFT system as used for MUC-7, in Proceedings of the Seventh

Message Understanding Conference (MUC-7), April 1998.

K. M. Tolle and H. Chen (2000). Comparing noun phrasing techniques for use with medical digital

library tools, Journal of the American Society for Information Science, 51(4), pp. 352-370.

I. H. Witten, Z. Bray, M. Mahoui, and W. J. Teahan (1999). Using language models for generic entity

extraction, in Proceedings of the ICML Workshop on Text Mining, 1999.

EPrints dLIST, an open access archive for the Information Sciences, is supported by the School of Information Resources and Library Science and Learning Technologies Center, University of Arizona. Established in 2002, dLIST has a global Advisory Board and is a part of the Information Technology & Society Research Lab. Open Archives
Contact: Admin | Donate