Natural Language Processing (NLP) is a key processing step in fusion systems that need to process unstructured -and possibly human generated- text in natural language. Recent developments in Deep Learning have greatly increased the performance of NLP tasks. In particular, learned word representations have the form of high dimensional real valued vectors, called word embeddings, that have a number of amenable properties such as representing similar words with similar values of their vectorial representation, and capturing semantic regularities that correspond to geometric properties in the continuous high dimensional space. However, word embeddings have the drawback of being non interpretable. That is, their dimensions cannot be clearly associated to linguistic features. In this work, we propose real valued explicit linguistic word vectors that enjoy the properties of learned word embeddings while being human understandable.

Distributional memory explainable word embeddings in continuous space

L. Snidaro
Primo
;
G. Ferrin
Secondo
;
G. L. Foresti
Ultimo
2019-01-01

Abstract

Natural Language Processing (NLP) is a key processing step in fusion systems that need to process unstructured -and possibly human generated- text in natural language. Recent developments in Deep Learning have greatly increased the performance of NLP tasks. In particular, learned word representations have the form of high dimensional real valued vectors, called word embeddings, that have a number of amenable properties such as representing similar words with similar values of their vectorial representation, and capturing semantic regularities that correspond to geometric properties in the continuous high dimensional space. However, word embeddings have the drawback of being non interpretable. That is, their dimensions cannot be clearly associated to linguistic features. In this work, we propose real valued explicit linguistic word vectors that enjoy the properties of learned word embeddings while being human understandable.
File in questo prodotto:
File Dimensione Formato  
Distributional memory explainable word embeddings.pdf

non disponibili

Tipologia: Versione Editoriale (PDF)
Licenza: Non pubblico
Dimensione 226.57 kB
Formato Adobe PDF
226.57 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11390/1242204
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 4
social impact