104 Flower Image Classification

Download the dataset archives and then unzip them
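
The archives first have to be on the Colab machine. One way is to copy them over from Google Drive, as in this sketch; the 'Dataset' source folder below is a placeholder, so adjust it to wherever your zips are stored.

In [ ]:
# Mount Google Drive and copy the archives into /content before unzipping.
# The 'Dataset' folder is an assumption -- change it to your own Drive path.
from google.colab import drive
drive.mount('/content/drive')
!cp '/content/drive/My Drive/Dataset/flowers_google.zip' '/content/'
!cp '/content/drive/My Drive/Dataset/test_flower.zip' '/content/'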

In [ ]:
!unzip '/content/flowers_google.zip' -d '/content/flower/train/'
In [ ]:
!unzip '/content/test_flower.zip' -d '/content/flower/test/'
In [1]:
#  !unzip 'source_location'  -d 'destination_location'

Import the fastai library and other dependencies

In [ ]:
from fastai.vision import *
from fastai import *
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

Load the CSV files: one contains the image IDs with their corresponding flower classes, the other maps label indices to class names

In [ ]:
df = pd.read_csv('/content/flowers_idx.csv')
label = pd.read_csv('/content/flowers_label.csv')
In [14]:
df.head()
Out[14]:
id flower_cls
0 7486 cape flower
1 9454 cape flower
2 6974 cape flower
3 4739 cape flower
4 8783 cape flower
In [12]:
label.head()
Out[12]:
label flower_class
0 0 pink primrose
1 1 hard-leaved pocket orchid
2 2 canterbury bells
3 3 sweet pea
4 4 wild geranium

Set up Path objects for the training and test data (the training path also serves as the model's working directory)

In [ ]:
path = Path('/content/flower/train/')  # Path to the folder of training images
path1 = Path('/content/flower/test/')   # Path to the folder of test images

Form a DataBunch from the train and test sets

In [ ]:
tfms = get_transforms(do_flip=True,max_rotate=0.1,max_lighting=0.15)  # Random augmentations: each time an image is fetched it may be flipped, slightly rotated, or have its lighting adjusted
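
To get a feel for what these transforms do, we can apply them a few times to a single training image. A sketch, assuming the unzipped images sit under a `flowers_google/` subfolder with a `.jpeg` suffix (the same layout used when building the DataBunch below):

In [ ]:
# Show four random augmentations of the first image listed in df.
img_path = path/'flowers_google'/f"{df['id'].iloc[0]}.jpeg"
fig, axes = plt.subplots(1, 4, figsize=(12, 3))
for ax in axes:
    open_image(img_path).apply_tfms(tfms[0], size=224).show(ax=ax)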
In [ ]:
test = (ImageList.from_folder(path1,extensions='.jpeg'))   # creating a test set
In [ ]:
data = (ImageList.from_df(df,path,folder='flowers_google',suffix='.jpeg',cols='id')
        # Create an ImageList from the dataframe: folder = subfolder name, suffix = image extension, cols = column containing the image ids
                .split_by_rand_pct(0.15)
        # Hold out 15% of the images as a validation set
                .label_from_df(cols='flower_cls')
        # Label from the dataframe: cols = column containing the category
                .transform(tfms)
        # Add the transforms defined above
                .add_test(test)
        # Add the test set
                .databunch(bs=128)
        # Create a DataBunch for the model with batch size 128
                .normalize(imagenet_stats))
        # Normalize with ImageNet stats because we use a ResNet pretrained on ImageNet

A look at the DataBunch we formed

In [26]:
data.show_batch(rows=3)   # rows = no. of rows

Number of classes and the sizes of the train, validation, and test sets

In [36]:
len(data.classes),len(data.train_ds),len(data.valid_ds),len(data.test_ds)
Out[36]:
(104, 13996, 2469, 7382)

Using a pretrained ResNet50 model

with metric = F-beta

average = macro

  • because there is class imbalance in the dataset, and a macro average weights every class equally (a quick check of the class counts follows the next cell)
In [ ]:
fb = FBeta()
fb.average='macro'
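
To see the imbalance the macro average is meant to handle, we can count images per class in the training dataframe. A minimal sketch using the `df` loaded above (the exact counts depend on your copy of the data):

In [ ]:
# Count images per class; the gap between the most and least frequent
# classes is why a macro-averaged metric is the fairer summary here.
counts = df['flower_cls'].value_counts()
print(counts.head())                  # most frequent classes
print(counts.tail())                  # least frequent classes
print(counts.max() / counts.min())    # rough imbalance ratio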

Now a learner is created to train the model

In [39]:
learn = cnn_learner(data,models.resnet50,metrics=[accuracy,fb]).to_fp16()   # .to_fp16() enables mixed precision training: most operations are computed in 16-bit floating point
Downloading: "https://download.pytorch.org/models/resnet50-19c8e357.pth" to /root/.cache/torch/checkpoints/resnet50-19c8e357.pth

  • Here I will use fastai's lr_find() function to find a suitable learning rate
  • Try to select a value a bit before the loss minimum
In [40]:
learn.lr_find()
learn.recorder.plot()
LR Finder is complete, type {learner_name}.recorder.plot() to see the graph.
  • The learning rate at the minimum of the curve is already a bit too high: we are at the edge between improving and diverging.
  • We want to go roughly one order of magnitude lower, a value that is still aggressive so we train quickly, but safely away from an explosion (a sketch for picking it automatically follows).
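
fastai v1 can also suggest a starting point from the steepest part of the loss curve. A sketch, assuming a version where `plot(suggestion=True)` is available:

In [ ]:
# Ask the recorder to mark the point of steepest descent on the LR curve;
# min_grad_lr is populated by plot(suggestion=True).
learn.recorder.plot(suggestion=True)
print(learn.recorder.min_grad_lr)
# We still round this to a comfortable value (here 1e-2) rather than trusting it blindly.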
In [42]:
gc.collect()
Out[42]:
7286
In [ ]:
lr = 1e-2
In [43]:
learn.fit_one_cycle(10,lr,moms=(0.8,0.7))  # train for 10 epochs with the one-cycle policy; moms sets the momentum range
epoch train_loss valid_loss accuracy f_beta time
0 1.733854 0.939811 0.759012 0.720295 01:44
1 1.153993 1.420853 0.676387 0.595894 01:41
2 1.138062 1.278481 0.701904 0.628009 01:40
3 0.892396 1.067045 0.737141 0.702936 01:39
4 0.725074 0.950229 0.760632 0.740352 01:39
5 0.564668 0.753347 0.809640 0.785900 01:38
6 0.434394 0.496903 0.875658 0.865769 01:38
7 0.295898 0.418402 0.895099 0.889186 01:37
8 0.191464 0.384876 0.906440 0.903880 01:37
9 0.153314 0.384284 0.909275 0.909659 01:37

In just 10 epochs we have reached an F-beta of 0.909 and an accuracy of about 91%

In [ ]:
learn.save('model1') # let's save the model
learn.load('model1') # let's load it again and unfreeze it for further training
learn.unfreeze()
In [ ]:
#learn.export()
In [65]:
learn.lr_find() # run the learning rate finder again before training further
learn.recorder.plot()
LR Finder is complete, type {learner_name}.recorder.plot() to see the graph.
In [68]:
gc.collect()
Out[68]:
5965
In [69]:
learn.fit_one_cycle(3,1e-5,wd=0.2)  # wd = weight decay, a regularization method that discourages overfitting
epoch train_loss valid_loss accuracy f_beta time
0 0.147254 0.378988 0.912920 0.911895 01:46
1 0.135803 0.372026 0.912110 0.912383 01:48
2 0.119626 0.371653 0.912110 0.912316 01:49
In [ ]:
learn.save('model2')
learn.load('model2')
learn.unfreeze()
In [ ]:
#learn.to_fp32().export('/content/flower91.pkl')
In [ ]:
learn.to_fp32().export()
In [ ]:
#!cp '/content/flower91.pkl' '/content/drive/My Drive/Dataset'
In [72]:
learn.lr_find()
learn.recorder.plot()
LR Finder is complete, type {learner_name}.recorder.plot() to see the graph.
In [79]:
gc.collect()
Out[79]:
0
In [83]:
learn.freeze_to(-2)  # freeze everything except the last two layer groups
learn.fit_one_cycle(2,1e-6,wd=0.3)
epoch train_loss valid_loss accuracy f_beta time
0 0.276241 0.409569 0.896719 0.886403 01:42
1 0.277178 0.409115 0.896314 0.880471 01:41

Final F-beta of 0.91 and accuracy of 91.2% (from the model2 checkpoint exported above; the partial-freeze run did not improve on it)

In [ ]:
learn32 = load_learner(path)
In [91]:
img = open_image('/content/flower/test/d9cb87ad0.jpeg')  # open the image with fastai's open_image function
print(learn32.predict(img)[0]) # and make a prediction
img
californian poppy
Out[91]:
In [ ]:
#!cp '/content/flower.pkl' '/content/drive/My Drive/Dataset/'

Let's interpret our model's losses

In [92]:
interp = ClassificationInterpretation.from_learner(learn)  
In [93]:
interp.plot_top_losses(12,figsize=(20,8))
In [96]:
interp.most_confused(min_val=3) # shows the class pairs the model confused most often (at least min_val times)
Out[96]:
[('common tulip', 'rose', 7),
 ('camellia', 'rose', 5),
 ('sunflower', 'common dandelion', 4),
 ('magnolia', 'lotus', 3),
 ('morning glory', 'iris', 3),
 ('rose', 'common tulip', 3)]
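
Beyond the top confusions, the full confusion matrix gives a broader view; with 104 classes it needs a large figure. A quick sketch with the same interpretation object:

In [ ]:
# Plot the full 104x104 confusion matrix; the large figsize keeps the labels readable.
interp.plot_confusion_matrix(figsize=(20, 20), dpi=60)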
In [98]:
pred,output=learn.get_preds(DatasetType.Test) # get predictions for the test set
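
The result is a tensor of class probabilities per test image. A sketch of decoding it into readable predictions (the filename handling and column names here are only illustrative):

In [ ]:
# Map each test image to its highest-probability class name.
pred_idx = pred.argmax(dim=1)                              # index of the top class per image
ids = [p.stem for p in data.test_ds.x.items]               # image ids taken from the filenames
pred_labels = [data.classes[int(i)] for i in pred_idx]     # class names
submission = pd.DataFrame({'id': ids, 'flower_cls': pred_labels})
submission.head()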

We can export the model for production

In [ ]:
# Convert the model back to 32-bit floating point numbers before exporting
learn.to_fp32().export('flower.pkl') # save the learner as flower.pkl for inference
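
Once exported, the .pkl can be loaded anywhere fastai is installed. A minimal usage sketch, assuming the file was written to the default location next to the training data:

In [ ]:
# Load the exported learner and run a single prediction, as a serving script would.
infer = load_learner(path, 'flower.pkl')   # path = /content/flower/train/ from above
img = open_image('/content/flower/test/d9cb87ad0.jpeg')
print(infer.predict(img)[0])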