The Synthetic Data Vault
What can you use synthetic data for?
Use a synthetic data in place of real data for added protection, or use it in addition to your real data as an enhancement.
The SDV Ecosystem
Public, Source-Available Libraries
The SDV is an overall ecosystem for synthetic data models, benchmarks, and metrics. Explore publicly available libraries supporting the SDV. Each can be used as standalone packages for particular needs.
QUICKSTART
Try it out now!
Quickly discover SDV with just a few lines of code!
from sdv.datasets.demo import download_demo
from sdv.single_table import GaussianCopulaSynthesizer
real_data, metadata = download_demo(
'single_table', 'fake_hotel_guests')
synthesizer = GaussianCopulaSynthesizer(metadata)
synthesizer.fit(real_data)
synthetic_data = synthesizer.sample(num_rows=10)
Join Us
Join Our Community
Chat with developers across the world. Stay up-to-date with the latest features, blogs, and news.