rsdv

Synthetic Tabular Data Generation with Gaussian Copulas

v0.2.0 · Jun 9, 2026 · MIT + file LICENSE

Description

Generates synthetic tabular data from real datasets using Gaussian copula models, with parametric marginal selection for numerical columns and a cumulative-frequency embedding that brings categorical and boolean columns into the same joint copula. Includes a metadata system with column types and primary keys, declarative constraints enforced via rejection sampling, conditional sampling, and quality, validity and privacy reports modeled on those of the 'SDMetrics' library. Inspired by the Python 'SDV' (Synthetic Data Vault) library by 'DataCebo'; see Patki, Wedge and Veeramachaneni (2016) "The Synthetic Data Vault" <doi:10.1109/DSAA.2016.49>.
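The cumulative-frequency embedding mentioned above can be illustrated with a small sketch. It is written in Python using only the standard library, and it is not rsdv's API or implementation: the function names (fit_intervals, embed, decode), the ordering of categories by descending frequency, and the use of interval midpoints are all assumptions chosen for illustration. The idea shown is that each category receives a sub-interval of [0, 1] proportional to its relative frequency, a point in that sub-interval is pushed through the inverse standard normal CDF so the column can sit alongside numerical columns in a Gaussian copula, and a sampled normal score is mapped back to a category by interval lookup.

```python
import bisect
from collections import Counter
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def fit_intervals(values):
    """Assign each category a sub-interval of [0, 1] sized by its relative frequency.

    Assumption: categories are ordered by descending frequency, ties by name.
    """
    counts = Counter(values)
    total = len(values)
    intervals = {}
    lower = 0.0
    for category, count in sorted(counts.items(), key=lambda kv: (-kv[1], str(kv[0]))):
        upper = lower + count / total
        intervals[category] = (lower, upper)
        lower = upper
    return intervals


def embed(category, intervals, rng=None):
    """Map a category to a normal score via a point inside its interval.

    With rng=None the interval midpoint is used (deterministic); otherwise a
    uniform draw from the interval is used.
    """
    lo, hi = intervals[category]
    u = (lo + hi) / 2 if rng is None else rng.uniform(lo, hi)
    # Keep u strictly inside (0, 1) so the inverse CDF is defined.
    u = min(max(u, 1e-9), 1 - 1e-9)
    return _STD_NORMAL.inv_cdf(u)


def decode(z, intervals):
    """Map a normal score back to the category whose interval contains cdf(z)."""
    u = _STD_NORMAL.cdf(z)
    categories = list(intervals)
    uppers = [intervals[c][1] for c in categories]
    idx = bisect.bisect_right(uppers, u)
    # Guard against floating-point drift in the last upper bound.
    return categories[min(idx, len(categories) - 1)]
```

For example, with a column holding six "a", three "b" and one "c", fit_intervals gives "a" roughly the interval from 0 to 0.6, "b" from 0.6 to 0.9, and "c" the remainder, and decode(embed(x, intervals), intervals) returns x for each category. In a full copula model the normal scores would come from a correlated multivariate normal draw rather than from embed directly.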

CRAN Check Status

7 OK
Flavor Status
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

Jun 9, 2026: OK (7 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE)

Dependency Network

Dependency graph for rsdv (legend: Dependencies, Reverse dependencies). Packages shown: copula, generics, jsonlite, ggplot2, tibble, FNN, rpart, scales.

Version History

0.2.0 · Jun 9, 2026 · updated from 0.1.0
0.1.0 · Jun 8, 2026 · new