llmclean

'LLM'-Assisted Data Cleaning with Multi-Provider Support

v0.1.0 · Apr 22, 2026 · GPL-3

Description

Detects and suggests fixes for semantic inconsistencies in data frames by calling large language models (LLMs) through a unified, provider-agnostic interface. Supported providers include 'OpenAI' ('GPT-4o', 'GPT-4o-mini'), 'Anthropic' ('Claude'), 'Google' ('Gemini'), 'Groq' (free-tier 'LLaMA' and 'Mixtral'), and local 'Ollama' models. The package identifies issues that rule-based tools cannot detect: abbreviation variants, typographic errors, case inconsistencies, and malformed values. Results are returned as tidy data frames with column, row index, detected value, issue type, suggested fix, and confidence score. An offline fallback using statistical and fuzzy-matching methods is provided for use without any API key. Interactive fix application with human review is supported via 'apply_fixes()'. Methods follow de Jonge and van der Loo (2013) <https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf> and Chaudhuri et al. (2003) <doi:10.1145/872757.872796>.

Downloads

408

Last 30 days

9001st

733

Last 90 days

733

Last year

Trend: +25.5% (30d vs prior 30d)

CRAN Check Status

13 OK

Show all 13 flavors

Flavor	Status	Time
r-devel-linux-x86_64-debian-clang	OK	57.9s
r-devel-linux-x86_64-debian-gcc	OK	42.2s
r-devel-linux-x86_64-fedora-clang	OK	97.3s
r-devel-linux-x86_64-fedora-gcc	OK	96.9s
r-devel-windows-x86_64	OK	76s
r-oldrel-macos-arm64	OK	19s
r-oldrel-macos-x86_64	OK	87s
r-oldrel-windows-x86_64	OK	89s
r-patched-linux-x86_64	OK	51.9s
r-release-linux-x86_64	OK	51.7s
r-release-macos-arm64	OK	20s
r-release-macos-x86_64	OK	80s
r-release-windows-x86_64	OK	76s

Check History

OK 6 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 23, 2026

Dependency Network

Version History

new 0.1.0 Apr 22, 2026

Maintainer

Sadikul Islam

Dependencies

Depends

R (>= 4.1.0)

Imports

stats utils dplyr (>= 1.0.0) rlang (>= 1.0.0)

Suggests

knitr rmarkdown testthat (>= 3.0.0) httr2 (>= 1.0.0) jsonlite (>= 1.8.0)

Compilation

No compilation needed

First Published

Apr 22, 2026

RSS Feed

CRAN Checks

View on CRAN →