Publications

Conference (International) On Approximately Searching for Similar Word Embeddings

Kohei Sugawara, Hayato Kobayashi and Masajiro Iwasaki

The Annual Meeting of the Association for Computational Linguistics (ACL2016)

2016.8.7

We discuss an approximate similarity search for word embeddings, which is an operation to approximately find embeddings close to a given vector. We compared several metric-based search algorithms with hash-, tree-, and graph-based indexing from different aspects. Our experimental results showed that a graph-based indexing exhibits robust performance and additionally provided useful information, e.g., vector normalization achieves an efficient search with cosine similarity.
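The remark about vector normalization can be illustrated with a small sketch. It is not the paper's code and does not use any of the hash-, tree-, or graph-based indexes compared there. It is a brute-force exact search on random vectors, and all function names are hypothetical. It relies only on the standard identity that for unit-length vectors a and b, ||a - b||^2 = 2 - 2 cos(a, b), so ranking by Euclidean distance after normalization gives the same neighbors as ranking by cosine similarity. This is one plausible reason a metric-based index can serve cosine queries once the vectors are normalized.

```python
import math
import random


def normalize(v):
    """Scale v to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cosine(a, b):
    """Cosine similarity between two raw (unnormalized) vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)


def euclidean(a, b):
    """Euclidean distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def top_k_by_cosine(query, vectors, k):
    """Indices of the k vectors with the highest cosine similarity to query."""
    order = sorted(range(len(vectors)), key=lambda i: -cosine(query, vectors[i]))
    return order[:k]


def top_k_by_euclidean_normalized(query, vectors, k):
    """Indices of the k nearest vectors by Euclidean distance after normalization."""
    q = normalize(query)
    unit = [normalize(v) for v in vectors]
    order = sorted(range(len(unit)), key=lambda i: euclidean(q, unit[i]))
    return order[:k]


# Toy data (an assumption for illustration): 200 random 50-dimensional vectors.
rng = random.Random(0)
vectors = [[rng.gauss(0, 1) for _ in range(50)] for _ in range(200)]
query = [rng.gauss(0, 1) for _ in range(50)]

cos_top = top_k_by_cosine(query, vectors, 10)
euc_top = top_k_by_euclidean_normalized(query, vectors, 10)
print(cos_top == euc_top)
```

In practice the normalization would be done once when the index is built, so each query only pays for a single Euclidean search.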

Paper : On Approximately Searching for Similar Word Embeddings

PDF : On Approximately Searching for Similar Word Embeddings