François Morlane-Hondère, Cécile Fabre, Nabil Hathout, and Ludovic Tanguy

Disambiguating Distributional Neighbors Using a Lexical Substitution Dataset

Abstract: This paper addresses the issue of polysemy in a distributional thesaurus. In such resources, the distributional neighbors of a word may relate to any of its senses, with no indication of which one. We propose a method to cluster the neighbors of a target word with respect to its senses, i.e. to attribute one sense to each neighbor. This is made possible by the use of a lexical substitution dataset, against which the distributions of the neighbors are compared.
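
As a rough illustration of the idea stated in the abstract, the sketch below attributes one sense to each neighbor by comparing the neighbor's context counts with a context profile for each sense. It is a minimal sketch under stated assumptions, not the authors' procedure: this excerpt does not describe how the comparison is actually done, so the use of cosine similarity, the function names `cosine` and `assign_senses`, the sense labels, and all the toy counts are invented for illustration. In a real setting the sense profiles would presumably be derived from the substitutes listed in the lexical substitution dataset.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse context-count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def assign_senses(neighbor_contexts, sense_contexts):
    """Attribute to each neighbor the sense whose context profile is most similar.

    Hypothetical helper: both arguments map a label to a Counter of context words.
    """
    assignment = {}
    for neighbor, contexts in neighbor_contexts.items():
        assignment[neighbor] = max(
            sense_contexts,
            key=lambda sense: cosine(contexts, sense_contexts[sense]),
        )
    return assignment


# Toy data, invented for illustration: two senses of the target word "bank".
sense_contexts = {
    "bank/finance": Counter({"money": 3, "loan": 2, "account": 2}),
    "bank/river": Counter({"river": 3, "water": 2, "shore": 1}),
}
# Context counts of two distributional neighbors of "bank" (also invented).
neighbor_contexts = {
    "lender": Counter({"loan": 4, "money": 1}),
    "riverside": Counter({"river": 2, "water": 1}),
}

clusters = assign_senses(neighbor_contexts, sense_contexts)
for neighbor, sense in clusters.items():
    print(neighbor, "->", sense)
```

Each neighbor receives exactly one sense here, which mirrors the "one sense per neighbor" formulation of the abstract; a neighbor that shares no context with any sense profile simply falls back to the first sense listed, a simplification that a fuller implementation would need to handle explicitly.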

1 Introduction

Many NLP applications need to know whether a word A is semantically more related to B than to C. Unsupervised corpus-based approaches to similarity ...
