docxtractr

Extract Data Tables and Comments from 'Microsoft' 'Word' Documents

'Microsoft Word' 'docx' files provide an 'XML' structure that is fairly straightforward to navigate, especially when it applies to 'Word' tables and comments. Tools are provided to determine table count/structure, comment count and also to extract/clean tables and comments from 'Microsoft Word' 'docx' documents. There is also nascent support for '.doc' files.

Tests Vignettes

Dependencies

Imports/Depends/LinkingTo/Enhances (6)
  • R
  • xml2
  • purrr
  • dplyr
  • httr
  • magrittr
  • Suggests (2)
  • covr
  • tinytest
  • Version History