Actions

Reverse Algebra: Difference between revisions

From Algolit

(Created page with "Category:algoriterary-encounter")
 
 
(10 intermediate revisions by 3 users not shown)
Line 1: Line 1:
[[Category:algoriterary-encounter]]
+
{|
 +
|-
 +
| Type: || Algoliterary exploration
 +
|-
 +
| Technique: || Word embeddings
 +
|-
 +
| Developed by: || Radim Rehurek and Petr Sojka & Algolit
 +
|}
 +
 
 +
Word embeddings are language modelling techniques that through multiple mathematical operations of counting and ordering, plot words into a multi-dimensional vector space. When embedding words, they transform from being distinct symbols into mathematical objects that can be operated on.
 +
 
 +
Algebra is generally defined as a generalisation of arithmetic in which letters representing numbers are combined according to the rules of arithmetic. In natural language processing, the opposite happens: words are represented by numbers which are generated by the circumstances of the text and of the algorithm which processes them.
 +
 
 +
This exploration is using [https://radimrehurek.com/gensim/index.html gensim], an open source vector space and topic modelling toolkit implemented in Python, to manipulate text according to the mathematic relationships which emerge between the words, once they have been plotted in a vector space.
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
[[Category:Algoliterary-Encounters]][[Category:Data_Workers]]

Latest revision as of 08:56, 28 February 2019

Type: Algoliterary exploration
Technique: Word embeddings
Developed by: Radim Rehurek and Petr Sojka & Algolit

Word embeddings are language modelling techniques that through multiple mathematical operations of counting and ordering, plot words into a multi-dimensional vector space. When embedding words, they transform from being distinct symbols into mathematical objects that can be operated on.

Algebra is generally defined as a generalisation of arithmetic in which letters representing numbers are combined according to the rules of arithmetic. In natural language processing, the opposite happens: words are represented by numbers which are generated by the circumstances of the text and of the algorithm which processes them.

This exploration is using gensim, an open source vector space and topic modelling toolkit implemented in Python, to manipulate text according to the mathematic relationships which emerge between the words, once they have been plotted in a vector space.