Techniques applied in automatic indexing of text material
Abstract
Automatic indexing of text material can be very basic, or it can involve advanced techniques. It normally begins with lexical analysis and may go on to use stop word lists, stemming, the extraction of meaningful word combinations, or statistical term weighting. Sometimes word combinations are linked to controlled vocabularies or classifications. For two decades now, the Text REtrieval Conferences (TREC) have served as the laboratory for specialists in this field.
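To make the steps named in the abstract concrete, the following is a minimal sketch in Python of lexical analysis, stop word removal, stemming, and statistical term weighting. It is an illustration under stated assumptions, not a method taken from this chapter: the stop word list is a tiny invented sample, `crude_stem` is a toy suffix stripper standing in for a real stemmer such as Porter's, and TF-IDF is used as one common weighting scheme among several. All function names are hypothetical.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop word list (an assumption; real lists are much longer).
STOP_WORDS = {"the", "a", "an", "of", "and", "or", "in", "is", "are", "to", "for"}


def tokenize(text):
    """Lexical analysis: lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def crude_stem(word):
    """Strip a few common English suffixes (a toy stand-in, not the Porter algorithm)."""
    for suffix in ("ing", "es", "s"):
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def index_terms(text):
    """Tokenize, drop stop words, then stem what remains."""
    return [crude_stem(t) for t in tokenize(text) if t not in STOP_WORDS]


def tf_idf(documents):
    """Return one {term: weight} dict per document, using raw tf * log(N / df)."""
    term_lists = [index_terms(d) for d in documents]
    n_docs = len(term_lists)

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for terms in term_lists:
        doc_freq.update(set(terms))

    weights = []
    for terms in term_lists:
        counts = Counter(terms)
        weights.append(
            {t: c * math.log(n_docs / doc_freq[t]) for t, c in counts.items()}
        )
    return weights


if __name__ == "__main__":
    docs = ["Indexing of texts", "Stemming the texts", "Search engines"]
    for doc, w in zip(docs, tf_idf(docs)):
        print(doc, "->", w)
```

With this weighting, a term that occurs in every document receives a weight of zero, while a term confined to a single document receives the highest weight for a given frequency. That is the intuition behind statistical term weighting: rare terms discriminate between documents better than common ones.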
Key words
automatic text indexing
stemming
TREC conferences
Introduction
We are all familiar with automatic indexing of texts because web search engines offer us the possibility to search for ...