You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**Using *CGAN* to generate tabular synthetic data:**
4
+
5
+
Real-world domains are often described by **tabular data** i.e., data that can be structured and organized in a table-like format, where **features/variables** are represented in **columns**, whereas **observations** correspond to the **rows**.
6
+
7
+
CGAN is a deep learning model that combines GANs with conditional models to generate data samples based on specific conditions:
Here’s an example of how to synthetize tabular data with CGAN using the [Credit Card](https://www.openml.org/search?type=data&sort=runs&id=1597&status=active) dataset:
**Using *CRAMER GAN* to generate tabular synthetic data:**
4
+
5
+
Real-world domains are often described by **tabular data** i.e., data that can be structured and organized in a table-like format, where **features/variables** are represented in **columns**, whereas **observations** correspond to the **rows**.
6
+
7
+
CRAMER GAN is a variant of GAN that employs the Cramer distance as a measure of similarity between real and generated data distributions to improve training stability and enhance sample quality:
8
+
9
+
- 📑 **Paper:**[The Cramer Distance as a Solution to Biased Wasserstein Gradients](https://arxiv.org/abs/1705.10743)
10
+
11
+
Here’s an example of how to synthetize tabular data with CRAMER GAN using the [Credit Card](https://www.openml.org/search?type=data&sort=runs&id=1597&status=active) dataset:
**Using *CTGAN* to generate tabular synthetic data:**
4
+
5
+
Real-world domains are often described by **tabular data** i.e., data that can be structured and organized in a table-like format, where **features/variables** are represented in **columns**, whereas **observations** correspond to the **rows**.
6
+
7
+
Additionally, real-world data usually comprises both **numeric** and **categorical** features. Numeric features are those that encode quantitative values, whereas categorical represent qualitative measurements.
8
+
9
+
CTGAN was specifically designed to deal with the challenges posed by tabular datasets, handling mixed (numeric and categorical) data:
10
+
11
+
- 📑 **Paper:**[Modeling Tabular Data using Conditional GAN](https://arxiv.org/pdf/1907.00503.pdf)
12
+
13
+
Here’s an example of how to synthetize tabular data with CTGAN using the [Adult Census Income](https://www.kaggle.com/datasets/uciml/adult-census-income?resource=download) dataset:
**Using *CWGAN-GP* to generate tabular synthetic data:**
4
+
5
+
Real-world domains are often described by **tabular data** i.e., data that can be structured and organized in a table-like format, where **features/variables** are represented in **columns**, whereas **observations** correspond to the **rows**.
6
+
7
+
CWGAN GP is a variant of GAN that incorporates conditional information to generate data samples, while leveraging the Wasserstein distance to improve training stability and sample quality:
Here’s an example of how to synthetize tabular data with CWGAN-GP using the [Credit Card](https://www.openml.org/search?type=data&sort=runs&id=1597&status=active) dataset:
**Using *DRAGAN* to generate tabular synthetic data:**
4
+
5
+
Real-world domains are often described by **tabular data** i.e., data that can be structured and organized in a table-like format, where **features/variables** are represented in **columns**, whereas **observations** correspond to the **rows**.
6
+
7
+
DRAGAN is a GAN variant that uses a gradient penalty to improve training stability and mitigate mode collapse:
8
+
9
+
- 📑 **Paper:**[On Convergence and Stability of GANs](https://arxiv.org/pdf/1705.07215.pdf)
10
+
11
+
Here’s an example of how to synthetize tabular data with DRAGAN using the [Adult Census Income](https://www.kaggle.com/datasets/uciml/adult-census-income?resource=download) dataset:
**Using *TimeGAN* to generate synthetic time-series data:**
4
+
5
+
Although tabular data may be the most frequently discussed type of data, a great number of real-world domains — from traffic and daily trajectories to stock prices and energy consumption patterns — produce **time-series data** which introduces several aspects of complexity to synthetic data generation.
6
+
7
+
Time-series data is structured sequentially, with observations **ordered chronologically** based on their associated timestamps or time intervals. It explicitly incorporates the temporal aspect, allowing for the analysis of trends, seasonality, and other dependencies over time.
8
+
9
+
TimeGAN is a model that uses a Generative Adversarial Network (GAN) framework to generate synthetic time series data by learning the underlying temporal dependencies and characteristics of the original data:
Here’s an example of how to synthetize time-series data with TimeGAN using the [Yahoo Stock Price](https://www.kaggle.com/datasets/arashnic/time-series-forecasting-with-yahoo-stock-price) dataset:
**Using *WGAN* to generate tabular synthetic data:**
4
+
5
+
Real-world domains are often described by **tabular data** i.e., data that can be structured and organized in a table-like format, where **features/variables** are represented in **columns**, whereas **observations** correspond to the **rows**.
6
+
7
+
WGAN is a variant of GAN that utilizes the Wasserstein distance to improve training stability and generate higher quality samples:
Here’s an example of how to synthetize tabular data with WGAN using the [Credit Card](https://www.openml.org/search?type=data&sort=runs&id=1597&status=active) dataset:
**Using *WGAN-GP* to generate tabular synthetic data:**
4
+
5
+
Real-world domains are often described by **tabular data** i.e., data that can be structured and organized in a table-like format, where **features/variables** are represented in **columns**, whereas **observations** correspond to the **rows**.
6
+
7
+
WGANGP is a variant of GAN that incorporates a gradient penalty term to enhance training stability and improve the diversity of generated samples:
8
+
9
+
- 📑 **Paper:**[Improved Training of Wasserstein GANs](https://arxiv.org/abs/1704.00028)
10
+
11
+
Here’s an example of how to synthetize tabular data with WGAN-GP using the [Adult Census Income](https://www.kaggle.com/datasets/uciml/adult-census-income?resource=download) dataset:
0 commit comments