0% found this document useful (0 votes)

25 views9 pages

DCGAN for Realistic Image Generation

This technical report discusses the use of Deep Convolutional Generative Adversarial Networks (DCGAN) for generating realistic images from datasets like MNIST and CelebA. It outlines the methodology, including the architecture of the GAN model, implementation details, and challenges faced during training. The results show that while the model can generate images, it struggled with quality due to limitations in training and GPU power.

Uploaded by

Omaima Younes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views9 pages

DCGAN for Realistic Image Generation

Uploaded by

Omaima Younes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

See discussions, stats, and author profiles for this publication at: [Link]

net/publication/330983916

DCGAN--Image Generation

Technical Report · February 2019

DOI: 10.13140/RG.2.2.23087.79523

CITATIONS READS
3 3,272

1 author:

Ashutosh Chapagain
Kathmandu University
2 PUBLICATIONS 4 CITATIONS

SEE PROFILE

All content following this page was uploaded by Ashutosh Chapagain on 09 February 2019.

The user has requested enhancement of the downloaded file.

Abstract

The potential of artificial intelligence to emulate human thought processes goes beyond passive
tasks and it extends well into creative activities. In this paper, we’ll explore the potential of deep
learning to generating real like images. We will use Deep Convolutional Generative Adversarial
Network (DCGAN) which has proven to be a great success in generating images. We have
discussed the theoretical aspect of GAN and also discussed about our methodology to create a
DCGAN Model for MNIST Datasets and CelebA Datasets.

Introduction

Learning features of huge unlabelled data and preserving those features to create new set of data
has a great scope in fashion, art and machine learning("Understanding Generative Adversarial
Networks (GANs)", 2019). Here we present a machine learning model which generates images
based on the feature provided by the training images. For our objective adversarial networks can
learn good representations of images for supervised learning and generative modeling (Radford,
Metz & Chintala, 2016).

Generative Adversarial Networks(GAN) belong to the set of generative models(Goodfellow,et

al.,2014). The GAN model consists of two network
● A generative network G(.) that takes in random input z and returns x_g=G(z) that should
follow the targeted probability distribution.
● A discriminator network D(.) that takes image vector x_image and classifies whether the
generated image is real or generated.

The generator needs to learn how to create data in a way that discriminator isn’t able to
distinguish as fake. The discriminator network has the task to determine if the image is real or
fake. An intuitive way to understand GAN is to imagine a forger trying to create a fake Picasso
painting (Chollet, n.d.). At first, the forger(generator) is pretty bad at this task. As times goes on,
the forger becomes increasingly competent at imitating the style of Picasso, and the art dealer
becomes increasingly expert at spotting [Link] the end, they have on their hands some excellent
fake Picassos. That’s what a GAN is: a forger network and an expert network, each being trained
to best the other.

1
Figure 1: Architecture of GAN Model(Chollet, n.d.)

We will use a Deep Convolutional GAN (DCGAN) which is very similar to GAN, but
specifically focuses on using Deep Convolutional networks in place of those fully-connected
networks. Convolutional networks in general find areas of correlation within an image.

Related Work

Variational Encoders: They are a kind of generative model that’s appropriate for the task of
image editing via concept vectors(Rezende,et al.,2014). Variational Encoders turns the image
into parameters of statistical distribution: a mean and a variance. The VAE then uses the mean
and variance parameters to randomly sample one element of distribution, and decodes that
element back to the original input (Kingma,et al.,2013). The stochasticity of this process
improves the robustness and forces the latent space to encode meaningful representations
everywhere: every point sampled in the latent space is decoded to valid output.
VAEs result in highly structured, continuous latent representations. VAE has a tendency to
approximate roughly which is over simplified compared to the true complex distribution of the
images. GANs consider the complexity of the distribution. Once training is over, the GANs are
capable of turning any point in its input space into a believable image[Chollet, n.d.].

2
Methodology

Datasets

1. MNIST dataset is a curated list of all handwritten digits. We have used dataset for quick
validation of our model. All images are scaled to 28X28.
2. The CelebA dataset consists of over 10k identities and 200k total images. All images
are originally of size 160X160 pixels. They are rescaled to 28X28 pixels.

Figure 2: Datasets used MNIST (left) and CelebA (right)

Discriminative Model Implementation

For feature extraction 64 filters of size 3X3 were applied on the original image. Average pooling
(2X2) and batch normalization was performed on the layers to reduce noise and to generalize the
features.

Again the resulting layers were stacked on top of each other and 128 filters and 256 filters of size
3X3 each were applied. Each convolution layer was followed by average pooling and batch
normalization.

The layer was flattened and dropout with probability 0.4 was applied. A Dense network was
stacked on top of the convolutional network with an output of 1 which determined whether the
image fed into discriminator was real or fake. The discriminator model was the classification
model which classified the images as real or fake.

3
Generative Model Implementation

A generator network maps vectors of shape (latent_dim,) to images of shape (32, 32, 3) . The
features of generative models are same as the discriminator except that it applies convolution
with a fractional stride (convolution transpose) (Chollet, n.d.).

Optimizing the Model

Weights are updated as to maximize the probability that any real data input x is classified as
belonging to the real dataset, while minimizing the probability that any fake image is classified
as belonging to the real dataset. In more technical terms, the loss/error function used maximizes
the function D(x), and it also minimizes D(G(z)).

Furthermore, the generator function maximizes D(G(z)).

Since during training both the Discriminator and Generator are trying to optimize opposite loss
functions, they can be thought of two agents playing a minimax game with value function
V(G,D).

Models were trained for 100 epoches with a batch size of 32 for CelebA dataset.
For MNIST Dataset, the model was trained with an iteration of 50,000 and batch size of 32.

System Specification

Programming Language: Python3

Framework Used : Tensorflow
Development Platform : Google Collaboratory
Training Time : 4 hours for MNIST Dataset and 11 hours for CelebA dataset.

4
Results

The digits produced image for MNIST dataset were:

A B
Figure3: Image generated from our GAN Model. (A) generated 8. (B) generated 7( inverted 7).

Faces generated from CelebA dataset were:

Figure4 : Image Generated from our GAN Model

These were the image generated from our GAN Model. Due to GPU constraints, we were able
to train only for 3 epoches out of 100. Hence, we used a pre-trained model from
[Link]

5
The output when we fine tuned the last networks from the pre-trained network with our model
were:

Figure: Image generated from our model on CelebA Dataset

Loss in Training

Figure: Training Loss for two models MNIST Dataset

6
Figure: Training Loss for two models CelebA Dataset

As shown in the figure, in our model, the discriminator model overpowers the generator
[Link] respective crests and troughs of both the model are inverse to each other, i.e. one
model overpowers the other. In ideal case both the models converges to 0 loss after training for a
long time.

Difficulties And Shortcomings

1. When training, the generator loss begin to increase considerably, while the discriminative
loss tends to zero, hence the discriminator overpowers the generator model. We had to be
very careful to tune in the hyper-parameters.
2. Due to the constraints in GPU power, we are not able to generate a perfect image based
on previous image.

7
View publication stats

Conclusion

Sampling from a latent space of images to create entirely new images or edit existing ones is
currently the most popular and successful application of creative AI . In this paper, we
demonstrated a way to generate images from training the existing similar images. GAN is a
dynamic system where the optimization process is seeking not a minimum, but an equilibrium
between two forces. It was difficult to train the generated image and they were not as good as the
real image. Hence we had to use a pre-trained model for CelebA Dataset.

References

Chollet, F. Deep learning with Python.

Radford, A., Metz, L., & Chintala, S. (2016). Unsupervised Representation Learning with Deep
Convolutional Generative Adversarial Networks. ICLR 2016.

Danilo Jimenez Rezende, Shakir Mohamed, and Daan Wierstra, “Stochastic Backpropagation
and Approxi-
mate Inference in Deep Generative Models,” arXiv (2014), [Link]

Diederik P. Kingma and Max Welling, “Auto-Encoding Variational Bayes, arXiv (2013),
[Link]
abs/1312.6114.
Understanding Generative Adversarial Networks (GANs). (2019). Retrieved from
[Link]
29

ProGAN: How NVIDIA Generated Images of Unprecedented Quality. (2019). Retrieved from
[Link]
51c98ec2cbd2

Realistic Face Image Generation Based On Generative Adversarial Network
No ratings yet
Realistic Face Image Generation Based On Generative Adversarial Network
4 pages
A Survey On Generative Adversarial Networks (GANs)
No ratings yet
A Survey On Generative Adversarial Networks (GANs)
5 pages
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
No ratings yet
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
17 pages
Image To Image Translation Using Generative Adversarial Network
No ratings yet
Image To Image Translation Using Generative Adversarial Network
5 pages
Applsci 13 10637 v2
No ratings yet
Applsci 13 10637 v2
29 pages
Deep Generative Image Models with GANs
No ratings yet
Deep Generative Image Models with GANs
10 pages
GAN Variants: A Comprehensive Survey
No ratings yet
GAN Variants: A Comprehensive Survey
8 pages
Deep Generative Adversarial Networks For Image-To
No ratings yet
Deep Generative Adversarial Networks For Image-To
26 pages
Optimizing Generative Adversarial Networks
No ratings yet
Optimizing Generative Adversarial Networks
5 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
52 pages
Generative Adversarial Networks
No ratings yet
Generative Adversarial Networks
4 pages
Beginner's Guide to GAN Architectures
No ratings yet
Beginner's Guide to GAN Architectures
9 pages
Expanding MNIST with DCGAN
No ratings yet
Expanding MNIST with DCGAN
4 pages
Understanding Generative AI Techniques
No ratings yet
Understanding Generative AI Techniques
69 pages
Advancements in Generative Adversarial Networks
No ratings yet
Advancements in Generative Adversarial Networks
3 pages
Text-to-Image GAN Implementation
No ratings yet
Text-to-Image GAN Implementation
7 pages
GANs: Applications and Challenges
No ratings yet
GANs: Applications and Challenges
24 pages
CVAE-GAN for Fine-Grained Image Synthesis
No ratings yet
CVAE-GAN for Fine-Grained Image Synthesis
10 pages
DCGANs for Realistic Face Generation
No ratings yet
DCGANs for Realistic Face Generation
9 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
4 pages
Comparative Analysis of Image Generators
No ratings yet
Comparative Analysis of Image Generators
4 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
11 pages
Unsupervised Learning with DCGANs
No ratings yet
Unsupervised Learning with DCGANs
11 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
24 pages
Generating Pokémon Sprites with GANs
No ratings yet
Generating Pokémon Sprites with GANs
10 pages
AdvGAN: Enhancing GANs with Adversarial Training
No ratings yet
AdvGAN: Enhancing GANs with Adversarial Training
12 pages
Six Fronts of Generative Adversarial Networks
No ratings yet
Six Fronts of Generative Adversarial Networks
11 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
5 pages
EWG Model for Deepfake Detection
No ratings yet
EWG Model for Deepfake Detection
30 pages
Data Augmentation with GANs
No ratings yet
Data Augmentation with GANs
16 pages
Image-to-Image Translation with GANs
No ratings yet
Image-to-Image Translation with GANs
4 pages
Trends in Generative Models Review
No ratings yet
Trends in Generative Models Review
10 pages
Understanding DCGAN Architecture and Training
No ratings yet
Understanding DCGAN Architecture and Training
27 pages
Simplified Generative Model Using Gradient Descent
No ratings yet
Simplified Generative Model Using Gradient Descent
8 pages
Advanced GANs for High-Quality Image Synthesis
No ratings yet
Advanced GANs for High-Quality Image Synthesis
5 pages
Overview of Generative Adversarial Networks
No ratings yet
Overview of Generative Adversarial Networks
31 pages
Probability Distributions in Generative Models
No ratings yet
Probability Distributions in Generative Models
11 pages
Overview of Generative Adversarial Networks
No ratings yet
Overview of Generative Adversarial Networks
9 pages
Biometric Fingerprint Generation with GANs
No ratings yet
Biometric Fingerprint Generation with GANs
36 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
51 pages
GANs: Overview and Applications
No ratings yet
GANs: Overview and Applications
12 pages
Implementing GANs for MNIST Data Generation
No ratings yet
Implementing GANs for MNIST Data Generation
23 pages
High-Resolution GANs for Face Generation
No ratings yet
High-Resolution GANs for Face Generation
9 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
15 pages
Autoregressive Models in Deep Learning
No ratings yet
Autoregressive Models in Deep Learning
14 pages
Lecture16 GAN Cont
No ratings yet
Lecture16 GAN Cont
35 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
16 pages
Machine Learning Models for Image Generation
No ratings yet
Machine Learning Models for Image Generation
15 pages
Overview of GAN Types and Challenges
No ratings yet
Overview of GAN Types and Challenges
8 pages
Deep Learning for Skin Disease Images
No ratings yet
Deep Learning for Skin Disease Images
6 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
24 pages
Criminal Face Recognition via GAN
No ratings yet
Criminal Face Recognition via GAN
3 pages
GANs for Image Data Augmentation
No ratings yet
GANs for Image Data Augmentation
6 pages
Comparing DC-GAN and LS-GAN Models
No ratings yet
Comparing DC-GAN and LS-GAN Models
6 pages
Deep Fakes with Cycle-GAN Techniques
No ratings yet
Deep Fakes with Cycle-GAN Techniques
9 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
8 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
20 pages
Maharashtra CAP 2024-25 Candidate List
No ratings yet
Maharashtra CAP 2024-25 Candidate List
577 pages
Doctor List for Akola Hospitals
No ratings yet
Doctor List for Akola Hospitals
1 page
Scott B. Jones - Soil Physics CV
No ratings yet
Scott B. Jones - Soil Physics CV
19 pages
Leading Quantum Consultancy Overview
No ratings yet
Leading Quantum Consultancy Overview
24 pages
Class 12 Maths Sample Paper Set 4
No ratings yet
Class 12 Maths Sample Paper Set 4
10 pages
History of Medical Technology in the Philippines
No ratings yet
History of Medical Technology in the Philippines
2 pages
Keezhadi Museum Project Report
No ratings yet
Keezhadi Museum Project Report
11 pages
Introduction to Machine Learning Course
No ratings yet
Introduction to Machine Learning Course
3 pages
Comprehensive Language Lesson Plan
No ratings yet
Comprehensive Language Lesson Plan
6 pages
Advanced Reading Techniques Overview
100% (1)
Advanced Reading Techniques Overview
147 pages
TM Intro To World Religions
100% (5)
TM Intro To World Religions
94 pages
Illinois Nurse Professional Licensing Guide
No ratings yet
Illinois Nurse Professional Licensing Guide
12 pages
Spring 2012-Winter 2013 Catalog
No ratings yet
Spring 2012-Winter 2013 Catalog
39 pages
Early Numeracy Assessment Guidelines
No ratings yet
Early Numeracy Assessment Guidelines
4 pages
INF1339H Computational Thinking Syllabus
No ratings yet
INF1339H Computational Thinking Syllabus
13 pages
Curriculum Relationships in Education
No ratings yet
Curriculum Relationships in Education
6 pages
Hooksondemand Ebook
No ratings yet
Hooksondemand Ebook
20 pages
Holistic Portfolio Assessment Rubric
No ratings yet
Holistic Portfolio Assessment Rubric
1 page
Analyzing Nikki Giovanni's Poetry
No ratings yet
Analyzing Nikki Giovanni's Poetry
1 page
PW Vidyapeeth Class Timetable 2025
No ratings yet
PW Vidyapeeth Class Timetable 2025
1 page
Enhancing Grade 3 Reading Skills
No ratings yet
Enhancing Grade 3 Reading Skills
13 pages
Admission for Textile Technology Course
No ratings yet
Admission for Textile Technology Course
1 page
TLE Teachers' Impact on Student Success
No ratings yet
TLE Teachers' Impact on Student Success
93 pages
Kangwon National University Admissions Guide
No ratings yet
Kangwon National University Admissions Guide
43 pages
Teaching Strategies for Multi-Grade Classes
No ratings yet
Teaching Strategies for Multi-Grade Classes
38 pages
100 Benefits of Meditation Explained
No ratings yet
100 Benefits of Meditation Explained
5 pages
Impact of IBI on Autism Treatment
No ratings yet
Impact of IBI on Autism Treatment
22 pages
Self-Awareness in Leadership Effectiveness
No ratings yet
Self-Awareness in Leadership Effectiveness
5 pages
B2 Unit 2 Test Answer Key
46% (13)
B2 Unit 2 Test Answer Key
2 pages
Xaverian Movement Uniform Guidelines
No ratings yet
Xaverian Movement Uniform Guidelines
10 pages