Classifying Images of Clothing Using TensorFlow

Train a Deep Learning model to classify images of clothing using Convolutional Neural Networks in TensorFlow.

# importing the libraries from tensorflow import keras import matplotlib.pyplot as plt import numpy as np import pandas as pd from sklearn.metrics import classification_report

Import the Fashion MNIST dataset

the MNIST fasion dataset conatins 70,000 grayscale image of 10 classes. Which reprsent individual clothing items with 28*28 pixels of resolution. Each value is in the range [0,255] which defines the color and intensity of each pixel. We'll be using 60,000 fir training and 10,000 for testing in order to classify images

fashion_mnist=keras.datasets.fashion_mnist (train_images,train_labels),(test_images,test_labels)= fashion_mnist.load_data()

The labels are an array of integers, ranging from 0 to 9. These correspond to the class of clothing the image represents:

class_names = ['T-shirt/top', 'Trouser', 'Pullover', 'Dress', 'Coat', 'Sandal', 'Shirt', 'Sneaker', 'Bag', 'Ankle boot']

Explore the data

print(train_images.shape) print(train_labels.shape) print(test_images.shape) print(test_labels.shape)

plt.figure() plt.imshow(train_images[5],cmap=plt.cm.binary) plt.colorbar() plt.grid(False) plt.show()

plt.figure(figsize=(10,10)) for i in range(25): plt.subplot(5,5,i+1) plt.xticks([]) plt.yticks([]) plt.grid(False) plt.imshow(train_images[i], cmap=plt.cm.binary) plt.xlabel(class_names[train_labels[i]]) plt.show()

Preprocess the data

train_images = train_images / 255.0 test_images = test_images / 255.0

# redimensionar as imagens train_images = train_images.reshape((train_images.shape[0], train_images.shape[1], train_images.shape[2], 1)) test_images = test_images.reshape((test_images.shape[0], test_images.shape[1], test_images.shape[2], 1)) print("train_images: ", train_images.shape) print("test_images: ", test_images.shape)

train_labels = keras.utils.to_categorical(train_labels,10) test_labels = keras.utils.to_categorical(test_labels,10)

print("First Label Before One-Hot Encoding: ", train_labels[0])

Building the model

For this project, we are going to use a typical CNN architecture represented in the image below.

As we have in the image, we will include a convolutional and a pooling layers, then another convolutional and pooling layers. Then, we are going to add a flatten layer to transform our 2d-array image in a 1d-array and add some dense layers. We can add some dropout layers to reduce overfitting. For the last layer, we add a dense layer with the number of classes from our problem (10) and a softmax activation, which creates the probability distribution for each class.

model= keras.Sequential([ keras.layers.Conv2D(filters=64, kernel_size=3, activation='relu', padding='same', input_shape=[28,28,1]), keras.layers.MaxPool2D(pool_size=2), keras.layers.Conv2D(filters=128, kernel_size=3, activation='relu', padding='same'), keras.layers.MaxPool2D(pool_size=2), keras.layers.Flatten(), keras.layers.Dense(128, activation='relu'), keras.layers.Dropout(0.25), keras.layers.Dense(64, activation='relu'), keras.layers.Dropout(0.25), keras.layers.Dense(10, activation='softmax'), ])

Compile the model

model.compile(optimizer='adam',loss='categorical_crossentropy',metrics=['accuracy'])

Train the model

model_history = model.fit(train_images, train_labels, batch_size=50, epochs=10, validation_split=0.3)

This model reaches an accuracy of about 0.95 (or 95%) on the training data.

Evaluating the model

pd.DataFrame(model_history.history).plot() plt.show()

test_loss, test_acc = model.evaluate(test_images, test_labels, verbose=2) print('\nTest accuracy:', test_acc)

It turns out that the accuracy on the test dataset is a little less than the accuracy on the training dataset. however this still represents a good result.

Make predictions

predictions = model.predict(test_images)

When the model predicts right, the text will be displayed in blue, if the prediction is wrong, it will be displayed in red. Also, it will be displayed the calculated probability for the predicted class.

def plot_img_label(img, pred_class, pred_percentage, true_class): plt.imshow(img,cmap=plt.cm.binary) if pred_class == true_class: color = 'blue' else: color = 'red' plt.title(label= f"Predicted: {pred_class} - {pred_percentage:2.1f}%\nActual: {true_class}", fontdict={'color': color})

plt.figure(figsize=(14,10)) for i in range(20): plt.subplot(5,5,i+1) plt.xticks([]) plt.yticks([]) plt.grid(False) i = i * 5 img = test_images[i].reshape(28,28) pred_class = class_names[np.argmax(predictions[i])] pred_percentage = np.max(predictions[i])*100 true_class = class_names[np.argmax(test_labels[i])] plot_img_label(img, pred_class, pred_percentage, true_class) plt.tight_layout() plt.show()

Results

predicted_label = np.argmax(predictions,axis = 1) true_label = np.argmax(test_labels, axis = 1)

crosstab = pd.crosstab(true_label, predicted_label, rownames=["True"], colnames=["Predicted"], margins=True)

classes = {} for item in zip(range(10), class_names): classes[item[0]] = item[1]

crosstab.rename(columns=classes, index=classes, inplace=True) crosstab

print(classification_report(true_label, predicted_label, target_names=class_names))

Conclusion

In this project, it was presented how to train a Convolutional Neural Network to classify images of clothing from the Fashion MNIST dataset using TensorFlow and Keras. Using this model, we got an overall accuracy of 91,22% in our test dataset, which is a good result. However, specifically for our Shirt class we got an accuracy of only 73%. We could try to improve the accuracy of this class using some data augmentation techniques. Furthermore, in case you want to get a model with higher accuracy, you could try changing some hyperparameters or using different network architectures.

.css-hdxizt{color:var(--chakra-colors-fg-neutral-primary);font-weight:var(--chakra-fontWeights-bold);letter-spacing:-0.09px;}Classifying Images of Clothing Using TensorFlow