Reverse Algebra: Difference between revisions
From Algolit
(8 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
{| | {| | ||
|- | |- | ||
− | | Type: || | + | | Type: || Algoliterary exploration |
|- | |- | ||
| Technique: || Word embeddings | | Technique: || Word embeddings | ||
|- | |- | ||
− | | Developed by: || Radim Rehurek and Petr Sojka | + | | Developed by: || Radim Rehurek and Petr Sojka & Algolit |
|} | |} | ||
+ | Word embeddings are language modelling techniques that through multiple mathematical operations of counting and ordering, plot words into a multi-dimensional vector space. When embedding words, they transform from being distinct symbols into mathematical objects that can be operated on. | ||
− | Algebra is generally defined as a generalisation of arithmetic in which letters representing numbers are combined according to the rules of arithmetic. In natural language processing, words are represented by numbers which are generated by the circumstances of the text and of the algorithm which processes them. | + | Algebra is generally defined as a generalisation of arithmetic in which letters representing numbers are combined according to the rules of arithmetic. In natural language processing, the opposite happens: words are represented by numbers which are generated by the circumstances of the text and of the algorithm which processes them. |
− | This | + | This exploration is using [https://radimrehurek.com/gensim/index.html gensim], an open source vector space and topic modelling toolkit implemented in Python, to manipulate text according to the mathematic relationships which emerge between the words, once they have been plotted in a vector space. |
Line 26: | Line 27: | ||
− | [[Category: | + | [[Category:Algoliterary-Encounters]][[Category:Data_Workers]] |
Latest revision as of 08:56, 28 February 2019
Type: | Algoliterary exploration |
Technique: | Word embeddings |
Developed by: | Radim Rehurek and Petr Sojka & Algolit |
Word embeddings are language modelling techniques that through multiple mathematical operations of counting and ordering, plot words into a multi-dimensional vector space. When embedding words, they transform from being distinct symbols into mathematical objects that can be operated on.
Algebra is generally defined as a generalisation of arithmetic in which letters representing numbers are combined according to the rules of arithmetic. In natural language processing, the opposite happens: words are represented by numbers which are generated by the circumstances of the text and of the algorithm which processes them.
This exploration is using gensim, an open source vector space and topic modelling toolkit implemented in Python, to manipulate text according to the mathematic relationships which emerge between the words, once they have been plotted in a vector space.