Actions

Difference between revisions of "An Ethnography of Datasets"

From Algolit

Line 2: Line 2:
 
creation of datasets that form the basis on which computer models function, conflict and ambiguity are neglected in favour of making reality computable. Data collection is political, but its politics are rendered invisible in the way it is presented and visualised. Datasets are not a distilled version of reality, nor simply a technology in itself. But as any technology, datasets encode their goal, their purpose and the world view of the makers.
 
creation of datasets that form the basis on which computer models function, conflict and ambiguity are neglected in favour of making reality computable. Data collection is political, but its politics are rendered invisible in the way it is presented and visualised. Datasets are not a distilled version of reality, nor simply a technology in itself. But as any technology, datasets encode their goal, their purpose and the world view of the makers.
  
With this work, we look into the most commonly used datasets for training machine learning and data scientists.
+
With this work, we look into the most commonly used datasets for training machine learning and data scientists. What material do they consist of? Who collected them? When? For what reason?
  
 
[[Category:Data_Workers]][[Category:Data_Workers_EN]]
 
[[Category:Data_Workers]][[Category:Data_Workers_EN]]

Revision as of 14:12, 28 February 2019

What seems to be overlooked in the transfer of bias from a societal level to the machine level is the dataset as an intermediate stage in decision making: the parameters by which a social environment is boxed into are determined by various factors. In the creation of datasets that form the basis on which computer models function, conflict and ambiguity are neglected in favour of making reality computable. Data collection is political, but its politics are rendered invisible in the way it is presented and visualised. Datasets are not a distilled version of reality, nor simply a technology in itself. But as any technology, datasets encode their goal, their purpose and the world view of the makers.

With this work, we look into the most commonly used datasets for training machine learning and data scientists. What material do they consist of? Who collected them? When? For what reason?