This text covers the emerging technologies of document retrieval, information extraction, and text categorization in a way that highlights commonalities in terms of both general principles and practical issues. It seeks to satisfy a need on the part of technology practitioners in the Internet space, who face difficult decisions about what research has been done and what the best practices are. It is not intended as a vendor guide (such things are quickly out of date), or as a recipe for building applications (such recipes are very context-dependent). But it does identify the key technologies, the issues involved, and their strengths and weaknesses, and it addresses evaluation in every chapter, both in terms of methodology (how to evaluate) and what controlled experimentation and industrial experience have to tell us.
- ISBN13 9781588112507
- Publish Date 20 June 2002
- Publish Status Inactive
- Out of Print 8 April 2013
- Publish Country NL
- Imprint John Benjamins Publishing Co
- Format Paperback
- Pages 226
- Language English
- URL https://benjamins.com/catalog/nlp.5.1st