You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+15-9Lines changed: 15 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,25 +12,30 @@ Join us on [ to Gaussian Mixtures.
34
+
The repo includes a full ecosystem for synthetic data generation, that includes different models for the generation of synthetic structure data and time-series.
35
+
All the Deep Learning models are implemented leveraging Tensorflow 2.0.
36
+
Several example Jupyter Notebooks and Python scripts are included, to show how to use the different architectures.
37
+
38
+
Are you ready to learn more about synthetic data and the bext-practices for synthetic data generation?
34
39
35
40
## Quickstart
36
41
The source code is currently hosted on GitHub at: https://github.com/ydataai/ydata-synthetic
@@ -78,8 +83,8 @@ The below models are supported:
78
83
79
84
### Examples
80
85
Here you can find usage examples of the package and models to synthesize tabular data.
81
-
82
-
- Tabular synthetic data generation with CTGAN on adult census income dataset [](https://colab.research.google.com/github/Data-Centric-AI-Community/awesome-python-for-data-science/blob/main/workshop-ds/Workshop%20-%20Data-Centric%20AI%20pipelines%20-%20How%20and%20why.ipynb)
86
+
- Fast tabular data synthesis on adult census income dataset [](https://colab.research.google.com/github/ydataai/ydata-synthetic/blob/master/examples/regular/models/Fast_Adult_Census_Income_Data.ipynb)
87
+
- Tabular synthetic data generation with CTGAN on adult census income dataset [](https://colab.research.google.com/github/ydataai/ydata-synthetic/blob/master/examples/regular/models/CTGAN_Adult_Census_Income_Data.ipynb)
83
88
- Time Series synthetic data generation with TimeGAN on stock dataset [](https://colab.research.google.com/github/ydataai/ydata-synthetic/blob/master/examples/timeseries/TimeGAN_Synthetic_stock_data.ipynb)
84
89
- More examples are continuously added and can be found in `/examples` directory.
85
90
@@ -106,6 +111,7 @@ In this repository you can find the several GAN architectures that are used to c
106
111
-[Cramer GAN (The Cramer Distance as a Solution to Biased Wasserstein Gradients)](https://arxiv.org/abs/1705.10743)
107
112
-[CWGAN-GP (Conditional Wassertein GAN with Gradient Penalty)](https://cameronfabbri.github.io/papers/conditionalWGAN.pdf)
"# Tabular Synthetic Data Generation with Gaussian Mixture\n",
38
+
"- This notebook is an example of how to use a synthetic data generation methods based on [GMM](https://scikit-learn.org/stable/modules/generated/sklearn.mixture.GaussianMixture.html) to generate synthetic tabular data with numeric and categorical features.\n",
39
+
"\n",
40
+
"## Dataset\n",
41
+
"- The data used is the [Adult Census Income](https://www.kaggle.com/datasets/uciml/adult-census-income) which we will fecth by importing the `pmlb` library (a wrapper for the Penn Machine Learning Benchmark data repository).\n"
0 commit comments