Text Extraction, Rendering and Converting of PDF Documents

Utilities based on 'libpoppler' for extracting text, fonts, attachments, and metadata from a PDF file. Also supports high-quality rendering of PDF documents to PNG, JPEG, or TIFF format, or to raw bitmap vectors for further processing in R.
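The capabilities described above can be sketched in a few lines of R. This is a minimal illustration, not taken from this page: it assumes the package's exported functions pdf_info, pdf_fonts, pdf_attachments, pdf_text, pdf_render_page and pdf_convert, and it generates a throwaway one-page PDF with the base graphics device so that no external file is needed. The variable names are hypothetical.

```r
library(pdftools)

# Create a small one-page PDF with the base R graphics device,
# so the example does not depend on any external file.
pdf_file <- tempfile(fileext = ".pdf")
pdf(pdf_file)
plot.new()
text(0.5, 0.5, "Hello pdftools")
dev.off()

# Metadata, fonts and attachments
info  <- pdf_info(pdf_file)         # list with page count, PDF version, etc.
fonts <- pdf_fonts(pdf_file)        # data frame describing the fonts used
files <- pdf_attachments(pdf_file)  # list of embedded files (none expected here)

# Text extraction: a character vector with one string per page
txt <- pdf_text(pdf_file)

# Render page 1 into a raw bitmap array for further processing in R
bitmap <- pdf_render_page(pdf_file, page = 1, dpi = 72)

# Convert page 1 to a PNG file on disk
png_file <- tempfile(fileext = ".png")
pdf_convert(pdf_file, format = "png", pages = 1,
            filenames = png_file, dpi = 72)
```

The bitmap returned by pdf_render_page is a three-dimensional array that can be written out with the suggested png or webp packages; for scanned documents without a text layer, the suggested tesseract package is the usual route to OCR.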

Available Snapshots

This version of pdftools can be found in the following snapshots:


Imports/Depends/LinkingTo/Enhances (3)
  • Rcpp >= 0.12.12
  • qpdf
  • Rcpp

Suggests (4)
  • png
  • webp
  • tesseract
  • testthat

Version History