Main Page: Difference between revisions
From Algolit
(→Sessions)
Line 34:
  - '''Friday 19 January 24''': '''CLEANERS''': we will explore and try-out OCR-tools, like Tesseract, post-OCR correction tools, ex. OCR-D. Notes: https://pad.constantvzw.org/p/algolit_240119
− - '''Friday 16 February 24''': '''CLEANERS''': we will apply OCR tools to the different pilot corpus
+ - '''Friday 16 February 24''': '''CLEANERS''': we will apply OCR tools to the different pilot corpus. Notes: https://pad.constantvzw.org/p/algolit_240216
  - '''Friday 22 March 24''': '''READERS''': text analysis based on pilot corpus: string queries, frequency analysis, context windows, ... looking for patterns, interesting quotes, sentences, types of articles, contextualizing sources (left wing/right wing...)
Revision as of 09:37, 16 February 2024