Norwegian version of this page


Thomas Hegghammer has published his first r-package, diaR on CRAN.

Arabic text as picture

R interface for the Google Cloud Services 'Document AI API' <> with additional tools for output file parsing and text reconstruction. 'Document AI' is a powerful server-based OCR processor that extracts text and tables from images and pdf files with high accuracy. 'daiR' gives R users programmatic access to this processor and additional tools to handle and visualize the output. See the package website <> for more information and examples.

Tags: R, text processing, data science, Political Science
Published June 16, 2021 7:58 AM - Last modified June 16, 2021 7:58 AM