Differentially Private Synthetic Data Generation

This project demonstrates how inference-only Large Language Models (LLMs) can generate synthetic datasets from sensitive data, while ensuring Differential Privacy (DP).

🚀 Hosted on Streamlit Cloud for free and open public access.

✨ Features

Generate realistic synthetic tabular data
Apply differential privacy (Laplace noise) during generation
Download synthetic datasets for safe sharing
Runs as a Streamlit web app that you use from the browser
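
The Laplace noise step listed above can be sketched as follows. This is a minimal illustration, not the code in app.py: the function names `laplace_mechanism` and `private_mean`, the clipping bounds, and the choice of a mean query are all assumptions made for the example. It relies on the standard result that adding Laplace noise with scale `sensitivity / epsilon` to a query answer gives epsilon-DP for that query.

```python
import numpy as np


def laplace_mechanism(values, sensitivity, epsilon, rng=None):
    """Add Laplace noise with scale sensitivity / epsilon to each value."""
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if sensitivity <= 0:
        raise ValueError("sensitivity must be positive")
    rng = rng if rng is not None else np.random.default_rng()
    values = np.asarray(values, dtype=float)
    scale = sensitivity / epsilon  # smaller epsilon -> more noise
    return values + rng.laplace(loc=0.0, scale=scale, size=values.shape)


def private_mean(values, lower, upper, epsilon, rng=None):
    """Hypothetical helper: a noisy mean over values clipped to [lower, upper]."""
    clipped = np.clip(np.asarray(values, dtype=float), lower, upper)
    # Replacing one record moves the clipped mean by at most (upper - lower) / n.
    sensitivity = (upper - lower) / len(clipped)
    return float(laplace_mechanism(clipped.mean(), sensitivity, epsilon, rng))


if __name__ == "__main__":
    ages = [23, 35, 41, 58, 62]
    print(private_mean(ages, lower=18, upper=90, epsilon=1.0))
```

Clipping comes first because the sensitivity bound only holds when every value is known to lie inside `[lower, upper]`.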

🛠️ Tech Stack

Streamlit for UI
Faker for mock data
numpy for DP noise injection
Python 3.9+
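
One way these pieces could fit together is sketched below, under stated assumptions. The standard library `random` module stands in for Faker so the example has no extra dependency, and the column names, bounds, and helper names (`make_mock_rows`, `add_noise`, `to_csv`) are hypothetical rather than taken from app.py. Each numeric value is clipped and perturbed independently, so the noise scale uses the full range `upper - lower` as the sensitivity.

```python
import csv
import io
import random

import numpy as np

# Hypothetical stand-in for Faker-generated names.
FIRST_NAMES = ["Ada", "Grace", "Alan", "Edsger"]


def make_mock_rows(n, seed=0):
    """Build n mock records with a name, an age, and a salary."""
    py_rng = random.Random(seed)
    return [
        {
            "name": py_rng.choice(FIRST_NAMES),
            "age": py_rng.randint(18, 90),
            "salary": round(py_rng.uniform(30000, 120000), 2),
        }
        for _ in range(n)
    ]


def add_noise(rows, column, lower, upper, epsilon, seed=0):
    """Clip one numeric column to [lower, upper] and add per-value Laplace noise."""
    rng = np.random.default_rng(seed)
    scale = (upper - lower) / epsilon  # per-value sensitivity is the full range
    noisy = []
    for row in rows:
        clipped = min(max(row[column], lower), upper)
        value = round(float(clipped + rng.laplace(0.0, scale)), 2)
        noisy.append({**row, column: value})
    return noisy


def to_csv(rows):
    """Serialize the records to CSV text suitable for a download."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    rows = add_noise(make_mock_rows(5), "salary", 30000, 120000, epsilon=1.0)
    print(to_csv(rows))
```

In a Streamlit app, the string returned by `to_csv` could be passed to `st.download_button` to offer the file for download. Only the `salary` column is perturbed here; the other columns are passed through unchanged.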

▶️ Run Locally

git clone https://github.com/yourusername/dp-synth-data.git
cd dp-synth-data
pip install -r requirements.txt
streamlit run app.py

sharikalog7/Differentially-Private-Synthetic-Data-Generation

Folders and files

Latest commit

History

Repository files navigation

Differentially Private Synthetic Data Generation

✨ Features

🛠️ Tech Stack

▶️ Run Locally

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages