Distributional memory explainable word embeddings in continuous space

IRIS

Natural Language Processing (NLP) is a key processing step in fusion systems that need to process unstructured -and possibly human generated- text in natural language. Recent developments in Deep Learning have greatly increased the performance of NLP tasks. In particular, learned word representations have the form of high dimensional real valued vectors, called word embeddings, that have a number of amenable properties such as representing similar words with similar values of their vectorial representation, and capturing semantic regularities that correspond to geometric properties in the continuous high dimensional space. However, word embeddings have the drawback of being non interpretable. That is, their dimensions cannot be clearly associated to linguistic features. In this work, we propose real valued explicit linguistic word vectors that enjoy the properties of learned word embeddings while being human understandable.

Distributional memory explainable word embeddings in continuous space

L. Snidaro^Primo;G. Ferrin^Secondo;G. L. Foresti^Ultimo

2019-01-01

Abstract

Natural Language Processing (NLP) is a key processing step in fusion systems that need to process unstructured -and possibly human generated- text in natural language. Recent developments in Deep Learning have greatly increased the performance of NLP tasks. In particular, learned word representations have the form of high dimensional real valued vectors, called word embeddings, that have a number of amenable properties such as representing similar words with similar values of their vectorial representation, and capturing semantic regularities that correspond to geometric properties in the continuous high dimensional space. However, word embeddings have the drawback of being non interpretable. That is, their dimensions cannot be clearly associated to linguistic features. In this work, we propose real valued explicit linguistic word vectors that enjoy the properties of learned word embeddings while being human understandable.

Scheda breve

Scheda completa

Scheda completa (DC)

Anno

2019

Appare nelle tipologie:

4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Distributional memory explainable word embeddings.pdf non disponibili Tipologia: Versione Editoriale (PDF) Licenza: Non pubblico Dimensione 226.57 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	226.57 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11390/1242204

Citazioni

ND

6

4

social impact