首页 文章

如何在keras中进行多类图像分类?

提问于
浏览
0

这就是我做的 . 我得到了狗/猫图像分类的代码,我编译并运行并获得80%的准确率 . 我在火车和验证文件夹中添加了一个类(飞机)文件夹 . 对以下代码进行了更改

model.compile(loss='categorical_crossentropy',
              optimizer='rmsprop',
              metrics=['accuracy'])

train_generator = train_datagen.flow_from_directory(
    train_data_dir,
    target_size=(img_width, img_height),
    batch_size=batch_size,
    class_mode='categorical')

validation_generator = test_datagen.flow_from_directory(
    validation_data_dir,
    target_size=(img_width, img_height),
    batch_size=batch_size,
    class_mode='categorical')

binary class_mode 更改为 categorical 并且还丢失为 categorical_crossentropy . 还将输出布局 sigmoid 更改为 softmax . 收到以下错误 .

ValueError: Error when checking target: expected activation_10 to have shape (None, 1) but got array with shape (16, 3)

我是否需要明确地将培训标签更改为如下所述的分类标签? (我从网站multilabel classification using keras读到这个)

train_labels = to_categorical(train_labels, num_classes=num_classes)

我不确定这里发生了什么 . 请帮忙 . 我对深度学习相对较新 .

模型

model = Sequential()

model.add(Conv2D(32, (3, 3), input_shape=input_shape))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))

model.add(Conv2D(32, (3, 3)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))

model.add(Conv2D(64, (3, 3)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))

model.add(Flatten())
model.add(Dense(64))
model.add(Activation('relu'))
model.add(Dropout(0.5))
model.add(Dense(1))
model.add(Activation('softmax'))

model.compile(loss='categorical_crossentropy',
              optimizer='rmsprop',
              metrics=['accuracy'])

# this is the augmentation configuration we will use for training
train_datagen = ImageDataGenerator(
    rescale=1. / 255,
    shear_range=0.2,
    zoom_range=0.2,
    horizontal_flip=True)
# this is the augmentation configuration we will use for testing:
# only rescaling
test_datagen = ImageDataGenerator(rescale=1. / 255)

train_generator = train_datagen.flow_from_directory(
    train_data_dir,
    target_size=(img_width, img_height),
    batch_size=batch_size,
    class_mode='categorical')


validation_generator = test_datagen.flow_from_directory(
    validation_data_dir,
    target_size=(img_width, img_height),
    batch_size=batch_size,
    class_mode='categorical')
model.fit_generator(
    train_generator,
    steps_per_epoch=nb_train_samples // batch_size,
    epochs=epochs,
    validation_data=validation_generator,
    validation_steps=nb_validation_samples // batch_size)

2 回答

  • 1

    对于多类分类,最后一个密集层必须具有等于类数的节点数,然后是 softmax 激活,即模型的最后两层应该是:

    model.add(Dense(num_classes))
    model.add(Activation('softmax'))
    

    此外,您的标签(火车和测试)必须是一个热门编码;因此,假设您的初始猫和狗被标记为整数(0/1),并且您的新类别(飞机)最初类似地标记为“2”,您应该按如下方式转换它们:

    train_labels = keras.utils.to_categorical(train_labels, num_classes)
    test_labels = keras.utils.to_categorical(test_labels, num_classes)
    

    最后,在术语层面,你所做的是多类,而不是多标签分类(我编辑了你的帖子的 Headers ) - 最后一个术语用于样本可能属于多个类别的问题 .

  • 1

    对于多标签分类,NN的最后一层的大小必须等于类的数量 .

    F.i.对于您的问题(3个类),代码应如下所示:

    model = Sequential()
    
    model.add(Conv2D(32, (3, 3), input_shape=input_shape))
    model.add(Activation('relu'))
    model.add(MaxPooling2D(pool_size=(2, 2)))
    
    model.add(Conv2D(32, (3, 3)))
    model.add(Activation('relu'))
    model.add(MaxPooling2D(pool_size=(2, 2)))
    
    model.add(Conv2D(64, (3, 3)))
    model.add(Activation('relu'))
    model.add(MaxPooling2D(pool_size=(2, 2)))
    
    model.add(Flatten())
    model.add(Dense(64))
    model.add(Activation('relu'))
    model.add(Dropout(0.5))
    model.add(Dense(3))
    model.add(Activation('softmax'))
    

相关问题