PyTorch Project to Build a GAN Model on MNIST Dataset
This project compares Vanilla GAN and WGAN regarding generating real images of MNIST. It evaluates these methods using qualitative metrics like FID and Inception Score. The study also attempts to understand the ability of each of these GAN architectures to generate high-quality images.
Project Outcomes
Requirements:
- →Good knowledge about GANs.
- →Python experience and expertise in other libraries, especially PyTorch.
- →Understanding neural networks, specifically the generators and discriminators utilized in GAN.
- →Some level of prior understanding of various transforms, which is simply resizing an image and normalization.
- →Familiarity with performance metrics like FID (Fréchet Inception Distance) and Inception Score.
- →Working with data visualization libraries, for example, matplotlib and torchvision.
Project Description
With this project, we will enter the world of GANs through comparative experimentation between two of its models - Vanilla GAN and WGAN - to produce real-life images using the MNIST dataset and assess which model can achieve this better!
We begin by feeding all real images into both GANs, through which some fake images will be generated. Then, we shall do some fun visualization by combining the real and fake images to assess their closeness.
After that, it's on to computing the FID, or Fréchet Inception Distance, to compare how similar the generated images are to the real deal. It calculates an Inception Score to measure image diversity and quality.
Hopefully, by this, one will have understood comparison-wise how the Vanilla GAN and WGAN perform to generate realistic and diverse images. Quite exciting, right?

Analyze Vanilla GAN vs. WGAN for MNIST image generation, using FID and Inception Score to evaluate and compare the quality of generated images.