3

Techniques applied in automatic indexing of text material

Abstract:

Automatic indexing of text material can be very basic, or it can involve some advanced techniques. It normally begins with lexical analysis and it can imply the use of stop word lists, stemming techniques, the extraction of meaningful word combinations or statistical term weighting. Sometimes word combinations are linked to controlled vocabularies or classifications. For two decades now the Text REtrieval Conferences (TREC) have been the laboratory for specialists in this field.

Key words

automatic text indexing

stemming

TREC conferences

Introduction

We are all familiar with automatic indexing of texts because web search engines offer us the possibility to search for ...

Get Indexing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.