Neural Network Example Using Fashion MNIST dataset

About Fashion MNIST dataset

5 min readApr 5, 2019

Fashion-MNIST is a dataset of Zalando’s article images — consisting of a training set of 60,000 examples and a test set of 10,000 examples.

The ten fashion class labels include:

T-shirt/top
Trouser/pants
Pullover shirt
Dress
Coat
Sandal
Shirt
Sneaker
Bag
Ankle boot

Throughout this tutorial, you will learn how to train a simple Convolutional Neural Network (CNN) with Keras on the Fashion MNIST dataset.

Starting working on CNN(Convolutional Neural Network) design

First, we import some useful libraries for this project.

Then you can retrieve data set using Keras library as bellow.

Line 15 we load data into an array of size 4. Here train_images and train_labels use to train the network. For that, we use 60,000 images. And test_images, test_labels includes 10,000 images that are used to find how accurately the network.

Labels are included numerical values between 0–9 and those are mapped as bellow.

Those class labels are saved into an array because using names easy than use numbers.

Figure 6

All those images are 28*28 pixel size images and those pixel values fall in the range of 0 to 255.

Feeding 0–255 values into the neural network generate complex model so that we convert pixel values into binary as bellow.

Now we plot some sample images from a training set with labels to clarify that the data is in, correct format.

Build the model

Setup the layers

Layers are the basic building blocks of a neural network. Layers extract representations from the data fed into them.

Flatten (Line 52)

This layer only reformats the data. That's mean to transform the format of input image 28*28 pixels 2d array to a 1d — array of 28*28 pixels.

Dense (Line 53, Line 54)

These are densely connected or fully connected, neural layers. The 1st dense layer has 128 nodes(neurons) and takes relu as the activation function. The second layer is a 10 node softmax layer this returns an array of 10 probability scores that sum to 1.

Compile the model

Configures the model for training.

Optimizer — This is how the model is updated based on the data it sees and its loss function.
Loss function — If the model has multiple outputs, you can use a different loss on each output by passing a dictionary or a list of losses. The loss value that will be minimized by the model will then be the sum of all individual losses.
Metrics — Used to monitor the training and testing steps.