You shall know a word by the company it keeps
From Algolit
Type: | Algoliterary exploration |
Datasets: | Frankenstein, AstroBlackness, WikiHarass, Learning from Deep Learning, nearbySaussure |
Technique: | word-embeddings |
Developed by: | Google Tensorflow's word2vec, Algolit |
You shall know a word by the company it keeps is a series of 5 landscapes that are based on different datasets. Each landscape includes the words 'collective', 'human', 'social' in company of different semantic clusters. The belief that distances in the graph are connected to semantic similarity of words, was one of the basic ideas behind word2vec.
The graphs are the result of a code study based on an existing word-embedding tutorial script word2vec_basic.py. In a machine learning practise, graphs like these function as one of the validation tools to see if a model starts to make sense. It is interesting how this validation process is fuelled by individual semantic understanding of the clusters and the words.
How can we use these semantic landscapes as reading tools?
graph 1: Frankenstein
Includes the book Frankenstein or, The Modern Prometheus by Mary Shelly.
graph 2: AstroBlackness
A selection of texts from an afrofuturist perspective.
graph 3: nearbySaussure
Includes three secondary books about Saussure's work in structuralist linguistics.
graph 4: Learning from Deep Learning
Includes seven text books on the topic of deep learning.
graph 5: WikiHarass
Includes examples of harassment on Talk page comments from Wikipedia.