Package: cleanNLP Type: Package Title: A Tidy Data Model for Natural Language Processing Version: 3.1.0 Author: Taylor B. Arnold [aut, cre] Maintainer: Taylor B. Arnold Description: Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may make use of the 'udpipe' back end with no external dependencies, or a Python back ends with 'spaCy' . Exposed annotation tasks include tokenization, part of speech tagging, named entity recognition, and dependency parsing. Depends: R (>= 3.5.0) Imports: Matrix (>= 1.2), udpipe, reticulate, stringi, stats, methods Suggests: knitr (>= 1.15), rmarkdown (>= 1.4), testthat (>= 1.0.1), covr (>= 2.2.2) SystemRequirements: Python (>= 3.7.0) License: LGPL-2 URL: https://statsmaths.github.io/cleanNLP/ BugReports: https://github.com/statsmaths/cleanNLP/issues LazyData: true Encoding: UTF-8 VignetteBuilder: knitr RoxygenNote: 7.3.1 Config/pak/sysreqs: libicu-dev libpng-dev python3 Repository: https://taylor-arnold.r-universe.dev Date/Publication: 2025-06-08 14:20:56 UTC RemoteUrl: https://github.com/taylor-arnold/rpkg RemoteRef: HEAD RemoteSha: fef7c1b376e204f78d8b21e59fa70171e30e7e48 RemoteSubdir: cleanNLP NeedsCompilation: no Packaged: 2026-07-04 08:23:39 UTC; root