Provides comprehensive tools for extracting and analyzing scientific
content from PDF documents, including citation extraction, reference matching,
text analysis, and bibliometric indicators. Supports multi-column PDF layouts,
'CrossRef' API <https://www.crossref.org/documentation/retrieve-metadata/rest-api/> integration, and advanced citation parsing.
| Version: |
0.2.0 |
| Depends: |
R (≥ 4.1.0) |
| Imports: |
base64enc (≥ 0.1-3), dplyr (≥ 1.1.0), httr2 (≥ 0.2.0), igraph, jsonlite (≥ 2.0.0), magrittr (≥ 2.0.4), openalexR (≥
2.0.2), pdftools (≥ 3.6.0), purrr (≥ 1.1.0), stringr (≥
1.5.2), tibble (≥ 3.3.0), tidyr (≥ 1.3.0), tidytext (≥
0.4.3), visNetwork (≥ 2.1.4) |
| Suggests: |
knitr, plotly, RColorBrewer, rmarkdown, scales, stringdist, testthat (≥ 3.0.0), mockery |
| Published: |
2025-10-30 |
| DOI: |
10.32614/CRAN.package.contentanalysis |
| Author: |
Massimo Aria
[cre, aut, cph],
Corrado Cuccurullo
[aut] |
| Maintainer: |
Massimo Aria <aria at unina.it> |
| BugReports: |
https://github.com/massimoaria/contentanalysis/issues |
| License: |
GPL (≥ 3) |
| URL: |
https://github.com/massimoaria/contentanalysis, |
| NeedsCompilation: |
no |
| Materials: |
README, NEWS |
| CRAN checks: |
contentanalysis results |