Tensorflow小練習（一）：Kaggle貓狗分類

Kaggle 題目 Dogs vs。 Cats Redux： Kernels Edition

框架：Tensorflow（上層框架使用TF-slim）

問題描述：輸入圖片，判斷圖片中為貓還是狗。

實現方法：Fine-tune TF-slim中提供的VGG-19神經網路。

預訓練模型引數：

https：//

github。com/tensorflow/m

odels/tree/master/research/slim

資料集：Dogs vs。 Cats Redux： Kernels Edition

程式碼：2012013382/Cat_or_dog-kaggle-vgg16-tensorflow

資料預處理

讀入資料將圖片整理為矩陣形式，將標籤整理為one_hot形式。注意VGG_19的輸入必須為［Batch_size， 224， 224， 3］。

from

__future__

import

absolute_import

from

__future__

import

division

from

__future__

import

print_function

import

numpy

from

scipy。misc

import

imread

，

imresize

from

import

walk

from

os。path

import

join

import

tensorflow

def

read_images

（

path

，

classes

，

img_height

224

，

img_width

224

，

img_channels

）：

filenames

（

walk

（

path

））

。

（）［

］

num_files

len

（

filenames

）

images

。

zeros

（（

num_files

，

img_height

，

img_width

，

img_channels

），

dtype

。

float32

）

labels

。

zeros

（（

num_files

，

），

dtype

。

int32

）

for

，

filename

enumerate

（

filenames

）：

img

imread

（

join

（

path

，

filename

））

#read train data

img

imresize

（

img

，

（

img_height

，

img_width

））

#resize image to 224x224

img

。

astype

（

。

float32

）

images

［

，

：，

：］

img

labels

［

］

classes

。

index

（

filename

［

：

］）

# Luckily both ‘cat’ and ‘dog’ have 3 characters

# Convert from ［0， 255］ -> ［-0。5， 0。5］ floats。

#images［i，：，：，：］ = images［i，：，：，：］ * （1。 / 255） - 0。5

one_hot_labels

。

zeros

（（

num_files

，

len

（

classes

）））

1000

：

（

‘Load the

image of 25000。’

（

））

for

range

（

num_files

）：

one_hot_labels

［

，

labels

［

］］

return

images

，

one_hot_labels

#Read image function for kaggle test data

def

read_images_kaggle_result

（

path

，

img_height

224

，

img_width

224

，

img_channels

）：

filenames

（

walk

（

path

））

。

（）［

］

num_files

len

（

filenames

）

images

。

zeros

（（

num_files

，

img_height

，

img_width

，

img_channels

），

dtype

。

float32

）

for

，

filename

enumerate

（

filenames

）：

img

imread

（

join

（

path

，

filename

））

#read train data

img

imresize

（

img

，

（

img_height

，

img_width

））

#resize image to 224x224

img

。

astype

（

。

float32

）

images

［

int

（

filename

［：

］）

，

：，

：］

img

#images［int（filename［：-4］） - 1，：，：，：］ = images［int（filename［：-4］） - 1，：，：，：］ * （1。 / 255） - 0。5

1000

：

（

‘Load the

image of 12500。’

（

））

return

images

Kaggle提供的訓練集中共有25000張圖片，取其中30%作為驗證集，用於調參；其他作為訓練集，用於訓練模型。TensorFlow有利用佇列讀取資料的方式，能夠高效從磁碟中讀取資料，具體可以參考其官方教程。這裡為了方便，我使用簡單的一次性讀取的方法將資料弄成一batch的形式，用於之後的輸入。

from

__future__

import

absolute_import

from

__future__

import

division

from

__future__

import

print_function

import

manage_images

import

numpy

IMG_CLASSES

［

‘cat’

，

‘dog’

］

DATA_DIR

‘data/’

TRAIN_DATA_PATH

‘data/train/’

TEST_DATA_PATH

‘data/test/’

IMG_HEIGHT

int

（

224

）

#image shape ［224，224，3］ to fit VGG_16 input shape。

IMG_WIDTH

int

（

224

）

IMG_CHANNELS

NUM_FILES_DATASET

25000

#data size 25000； train data size 17500； test data size 7500

VALIDATION_SET_FRACTION

0。3

#validation size properbility

NUM_TRAIN_EXAMPLES

int

（（

VALIDATION_SET_FRACTION

）

NUM_FILES_DATASET

）

NUM_VALIDATION_EXAMPLES

int

（（

VALIDATION_SET_FRACTION

）

NUM_FILES_DATASET

）

NUM_KAGGLE_TEST

12500

def

pre_processing

（

data_set

‘train’

，

batch_size

）：

data_set

‘train’

：

images

，

labels

manage_images

。

read_images

（

TRAIN_DATA_PATH

，

IMG_CLASSES

，

IMG_HEIGHT

，

IMG_WIDTH

，

IMG_CHANNELS

）

#Substract mean value

train_mean

。

mean

（

images

，

axis

）

# Random sample

validation_images

［］

validation_labels

［］

train_images

［］

train_labels

［］

validation_size

int

（

VALIDATION_SET_FRACTION

len

（

images

））

idx

。

random

。

permutation

（

len

（

images

））

for

idx

：

validation_size

：

validation_images

。

append

（

images

［

］

train_mean

）

validation_labels

。

append

（

labels

［

］）

else

：

train_images

。

append

（

images

［

］

train_mean

）

train_labels

。

append

（

labels

［

］）

train_batch_num

NUM_TRAIN_EXAMPLES

batch_size

validation_batch_num

NUM_VALIDATION_EXAMPLES

batch_size

pointer

batch_train_images

［］

batch_train_labels

［］

batch_validation_images

［］

batch_validation_labels

［］

for

range

（

train_batch_num

）：

batch_train_images

。

append

（

train_images

［

pointer

：

pointer

batch_size

］）

batch_train_labels

。

append

（

train_labels

［

pointer

：

pointer

batch_size

］）

pointer

batch_size

pointer

for

range

（

validation_batch_num

）：

batch_validation_images

。

append

（

validation_images

［

pointer

：

pointer

batch_size

］）

batch_validation_labels

。

append

（

validation_labels

［

pointer

：

pointer

batch_size

］）

pointer

batch_size

batch_validation_set

{

‘images’

：

batch_validation_images

，

‘labels’

：

batch_validation_labels

}

batch_train_set

{

‘images’

：

batch_train_images

，

‘labels’

：

batch_train_labels

}

image_num

{

‘train’

：

NUM_TRAIN_EXAMPLES

，

‘validation’

：

NUM_VALIDATION_EXAMPLES

}

return

batch_train_set

，

batch_validation_set

，

image_num

elif

data_set

‘test’

：

images

manage_images

。

read_images_kaggle_result

（

TEST_DATA_PATH

，

IMG_HEIGHT

，

IMG_WIDTH

，

IMG_CHANNELS

）

#Substract mean

mean

。

mean

（

images

，

axis

）

images

mean

batch_test_images

［］

batch_num

NUM_KAGGLE_TEST

batch_size

pointer

for

range

（

batch_num

）：

batch_test_images

。

append

（

images

［

pointer

：

pointer

batch_size

］）

pointer

batch_size

batch_test_set

{

‘images’

：

batch_test_images

}

return

batch_test_set

訓練

from

__future__

import

absolute_import

from

__future__

import

division

from

__future__

import

print_function

import

numpy

import

time

import

data_processing

import

tensorflow

import

tensorflow。contrib。slim

slim

import

tensorflow。contrib。slim。nets

nets

import

os。path

import

time

TRAIN_LOG_DIR

。

path

。

join

（

‘Log/train/’

，

time

。

strftime

（

‘%Y-%m-

%H：%M：%S’

，

time

。

localtime

（

time

。

time

（））））

TRAIN_CHECK_POINT

‘check_point/train_model。ckpt’

VALIDATION_LOG_DIR

‘Log/validation/’

VGG_19_MODEL_DIR

‘check_point/vgg_19。ckpt’

BATCH_SIZE

EPOCH

not

。

gfile

。

Exists

（

TRAIN_LOG_DIR

）：

。

gfile

。

MakeDirs

（

TRAIN_LOG_DIR

）

not

。

gfile

。

Exists

（

VALIDATION_LOG_DIR

）：

。

gfile

。

MakeDirs

（

VALIDATION_LOG_DIR

）

batch_train_set

，

batch_validation_set

，

images_num

data_processing

。

pre_processing

（

data_set

‘train’

，

batch_size

BATCH_SIZE

）

def

get_accuracy

（

logits

，

labels

）：

correct_prediction

。

equal

（

。

argmax

（

logits

，

），

。

argmax

（

labels

，

））

accuracy

。

reduce_mean

（

。

cast

（

correct_prediction

，

。

float32

））

return

accuracy

with

。

Graph

（）

。

as_default

（）：

images

。

placeholder

（

。

float32

，

［

BATCH_SIZE

，

224

，

224

，

］）

labels

。

placeholder

（

。

float32

，

［

BATCH_SIZE

，

len

（

data_processing

。

IMG_CLASSES

）］）

keep_prob

。

placeholder

（

。

float32

）

with

slim

。

arg_scope

（

nets

。

vgg

。

vgg_arg_scope

（））：

logits

，

nets

。

vgg

。

vgg_19

（

inputs

images

，

num_classes

，

dropout_keep_prob

keep_prob

，

is_training

True

）

variables_to_restore

slim

。

get_variables_to_restore

（

exclude

［

‘vgg_19/fc8’

］）

restorer

。

train

。

Saver

（

variables_to_restore

）

with

。

name_scope

（

‘cross_entropy’

）：

loss

。

reduce_mean

（

。

softmax_cross_entropy_with_logits

（

logits

，

labels

））

。

summary

。

scalar

（

‘cross_entropy’

，

loss

）

learning_rate

1e-4

optimizer

。

train

。

AdamOptimizer

（

learning_rate

）

。

minimize

（

loss

）

with

。

name_scope

（

‘accuracy’

）：

accuracy

get_accuracy

（

logits

，

labels

）

。

summary

。

scalar

（

‘accuracy’

，

accuracy

）

merged

。

summary

。

merge_all

（）

saver

。

train

。

Saver

（）

config

。

ConfigProto

（）

config

。

gpu_options

。

allow_growth

True

with

。

Session

（

config

）

sess

：

train_writer

。

summary

。

FileWriter

（

TRAIN_LOG_DIR

）

#train_writer。add_summary（sess。graph）

sess

。

run

（

。

global_variables_initializer

（））

sess

。

run

（

。

local_variables_initializer

（））

restorer

。

restore

（

sess

，

VGG_19_MODEL_DIR

）

step

for

range

（

EPOCH

）：

all_accuracy

all_loss

for

range

（

images_num

［

‘train’

］

BATCH_SIZE

）：

，

accuracy_out

，

loss_out

，

summary

sess

。

run

（［

optimizer

，

accuracy

，

loss

，

merged

］，

feed_dict

{

images

：

batch_train_set

［

‘images’

］［

］，

labels

：

batch_train_set

［

‘labels’

］［

］，

keep_prob

：

0。5

}）

train_writer

。

add_summary

（

summary

，

step

）

step

all_accuracy

accuracy_out

all_loss

loss_out

：

（

“Epoch

： Batch

accuracy is

%。2f

； Batch loss is

%。5f

”

（

，

accuracy_out

，

loss_out

））

（

“Epoch

： Train accuracy is

%。2f

； Train loss is

%。5f

”

（

，

all_accuracy

（

images_num

［

‘train’

］

BATCH_SIZE

），

all_loss

（

images_num

［

‘train’

］

BATCH_SIZE

）））

all_accuracy

all_loss

for

range

（

images_num

［

‘validation’

］

BATCH_SIZE

）：

accuracy_out

，

loss_out

sess

。

run

（［

accuracy

，

loss

］，

feed_dict

{

images

：

batch_validation_set

［

‘images’

］［

］，

labels

：

batch_validation_set

［

‘labels’

］［

］，

keep_prob

：

1。0

}）

all_accuracy

accuracy_out

all_loss

loss_out

（

“Epoch

： Validation accuracy is

%。2f

； Validation loss is

%。5f

”

（

，

all_accuracy

（

images_num

［

‘validation’

］

BATCH_SIZE

），

all_loss

（

images_num

［

‘validation’

］

BATCH_SIZE

）））

saver

。

save

（

sess

，

TRAIN_CHECK_POINT

，

global_step

）

在我的實驗中驗證集上的準去率為97%。

測試

from

__future__

import

absolute_import

from

__future__

import

division

from

__future__

import

print_function

import

data_processing

import

tensorflow

import

tensorflow。contrib。slim

slim

import

tensorflow。contrib。slim。nets

nets

import

csv

CHECK_POINT_PATH

‘check_point/train_model。ckpt-2’

NUM_KAGGLE_TEST

12500

BATCH_SIZE

batch_test_set

data_processing

。

pre_processing

（

data_set

‘test’

，

batch_size

BATCH_SIZE

）

prediction_file

open

（

‘kaggle_result_file。csv’

，

‘wb’

）

prediction_file_object

csv

。

writer

（

prediction_file

）

prediction_file_object

。

writerow

（［

‘id’

，

‘label’

］）

with

。

Graph

（）

。

as_default

（）：

images

。

placeholder

（

。

float32

，

［

BATCH_SIZE

，

224

，

224

，

］）

keep_prob

。

placeholder

（

。

float32

）

logits

，

nets

。

vgg

。

vgg_19

（

inputs

images

，

num_classes

，

dropout_keep_prob

keep_prob

，

is_training

False

）

variables_to_restore

slim

。

get_variables_to_restore

（）

restorer

。

train

。

Saver

（

variables_to_restore

）

pros

。

softmax

（

logits

）

config

。

ConfigProto

（）

config

。

gpu_options

。

allow_growth

True

with

。

Session

（

config

）

sess

：

sess

。

run

（

。

global_variables_initializer

（））

sess

。

run

（

。

local_variables_initializer

（））

restorer

。

restore

（

sess

，

CHECK_POINT_PATH

）

for

range

（

NUM_KAGGLE_TEST

BATCH_SIZE

）：

batch_pros

sess

。

run

（

pros

，

feed_dict

{

images

：

batch_test_set

［

‘images’

］［

］，

keep_prob

：

1。0

}）

（

‘Batch

’

（

，

NUM_KAGGLE_TEST

BATCH_SIZE

））

for

range

（

BATCH_SIZE

）：

temp

0。0

batch_pros

［

，

］

0。5

：

temp

0。995

else

：

temp

0。005

prediction_file_object

。

writerow

（［

BATCH_SIZE

，

temp

］）

prediction_file

。

（）

Tensorflow小練習（一）：Kaggle貓狗分類

班主任寄語：學習態度和方法決定了效率

大門兩側種什麼樹好？

隨便看看

唐玄宗往前五百年是什麼朝代？

女童大擺裙的裁剪方法？

分家分家協議書怎樣才有法律效力？

4k水粉紙和素描紙的區別？

Tensorflow小練習（一）：Kaggle貓狗分類

班主任寄語：學習態度和方法決定了效率

大門兩側種什麼樹好？

猜你喜歡

工科備研用u盤至少多大？要求質優讀寫速度快的，求學長推薦？

train英文怎麼念？

《權力的遊戲》好看，取景地更好看

隨便看看

唐玄宗往前五百年是什麼朝代？

女童大擺裙的裁剪方法？

分家分家協議書怎樣才有法律效力？

4k水粉紙和素描紙的區別？