boilerpipeR
Interface to the Boilerpipe Java Library
v1.3.2
·
May 19, 2021
·
Apache License (== 2.0)
Description
Generic Extraction of main text content from HTML files; removal of ads, sidebars and headers using the boilerpipe <https://github.com/kohlschutter/boilerpipe> Java library. The extraction heuristics from boilerpipe show a robust performance for a wide range of web site templates.
Downloads
294
Last 30 days
10691st
546
Last 90 days
546
Last year
Trend: +16.7% (30d vs prior 30d)
CRAN Check Status
1
ERROR
13
NOTE
Show all 14 flavors
| Flavor | Status |
|---|---|
| r-devel-linux-x86_64-debian-clang | ERROR |
| r-devel-linux-x86_64-debian-gcc | NOTE |
| r-devel-linux-x86_64-fedora-clang | NOTE |
| r-devel-linux-x86_64-fedora-gcc | NOTE |
| r-devel-macos-arm64 | NOTE |
| r-devel-windows-x86_64 | NOTE |
| r-oldrel-macos-arm64 | NOTE |
| r-oldrel-macos-x86_64 | NOTE |
| r-oldrel-windows-x86_64 | NOTE |
| r-patched-linux-x86_64 | NOTE |
| r-release-linux-x86_64 | NOTE |
| r-release-macos-arm64 | NOTE |
| r-release-macos-x86_64 | NOTE |
| r-release-windows-x86_64 | NOTE |
Check details (16 non-OK)
NOTE
r-devel-linux-x86_64-debian-clang
CRAN incoming feasibility
Maintainer: ‘Mario Annau <mario.annau@gmail.com>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("See", "AUTHORS"),
family = "file.",
role = "aut"),
person(given = "Mario",
family = "Annau",
role = "cre",
email = "mario.annau@gmail.com"))
as necessary.
ERROR
r-devel-linux-x86_64-debian-clang
package dependencies
Package required but not available: ‘rJava’ See section ‘The DESCRIPTION file’ in the ‘Writing R Extensions’ manual.
NOTE
r-devel-linux-x86_64-debian-gcc
CRAN incoming feasibility
Maintainer: ‘Mario Annau <mario.annau@gmail.com>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("See", "AUTHORS"),
family = "file.",
role = "aut"),
person(given = "Mario",
family = "Annau",
role = "cre",
email = "mario.annau@gmail.com"))
as necessary.
NOTE
r-devel-linux-x86_64-debian-gcc
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-devel-linux-x86_64-fedora-clang
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-devel-linux-x86_64-fedora-gcc
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-devel-macos-arm64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-devel-windows-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-oldrel-macos-arm64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-oldrel-macos-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-oldrel-windows-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-patched-linux-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-release-linux-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-release-macos-arm64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-release-macos-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
NOTE
r-release-windows-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize; meant \describe ?
Check History
ERROR 0 OK · 13 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Mar 9, 2026
ERROR
r-devel-linux-x86_64-debian-clang
CRAN incoming feasibility
Maintainer: ‘Mario Annau <mario.annau@gmail.com>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("See", "AUTHORS"),
family = "file.",
role = "aut"),
person(given = "Mario",
family = "Annau",
role = "cre",
email = "mario.annau@gmail.com"))
as necessary.
NOTE
r-devel-linux-x86_64-debian-gcc
CRAN incoming feasibility
Maintainer: ‘Mario Annau <mario.annau@gmail.com>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("See", "AUTHORS"),
family = "file.",
role = "aut"),
person(given = "Mario",
family = "Annau",
role = "cre",
email = "mario.annau@gmail.com"))
as necessary.
NOTE
r-devel-linux-x86_64-fedora-clang
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-devel-linux-x86_64-fedora-gcc
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-devel-macos-arm64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-devel-windows-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-patched-linux-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-release-linux-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-release-macos-arm64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-release-macos-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-release-windows-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-oldrel-macos-arm64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-oldrel-macos-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
NOTE
r-oldrel-windows-x86_64
Rd files
checkRd: (-1) Extractor.Rd:13: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:14: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:15: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:16: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:17: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:18: Lost braces in \itemize; meant \describe ? checkRd: (-1) Extractor.Rd:19: Lost braces in \itemize;
Dependency Network
Version History
new
1.3.2
Mar 9, 2026