使用Keras建立模型并训练等一系列操作方式
时间:2022-07-27
本文章向大家介绍使用Keras建立模型并训练等一系列操作方式,主要内容包括其使用实例、应用技巧、基本知识点总结和需要注意事项,具有一定的参考价值,需要的朋友可以参考一下。
由于Keras是一种建立在已有深度学习框架上的二次框架,其使用起来非常方便,其后端实现有两种方法,theano和tensorflow。由于自己平时用tensorflow,所以选择后端用tensorflow的Keras,代码写起来更加方便。
1、建立模型
Keras分为两种不同的建模方式,
Sequential models:这种方法用于实现一些简单的模型。你只需要向一些存在的模型中添加层就行了。
Functional API:Keras的API是非常强大的,你可以利用这些API来构造更加复杂的模型,比如多输出模型,有向无环图等等。
这里采用sequential models方法。
构建序列模型。
def define_model():
model = Sequential()
# setup first conv layer
model.add(Conv2D(32, (3, 3), activation="relu",
input_shape=(120, 120, 3), padding='same')) # [10, 120, 120, 32]
# setup first maxpooling layer
model.add(MaxPooling2D(pool_size=(2, 2))) # [10, 60, 60, 32]
# setup second conv layer
model.add(Conv2D(8, kernel_size=(3, 3), activation="relu",
padding='same')) # [10, 60, 60, 8]
# setup second maxpooling layer
model.add(MaxPooling2D(pool_size=(3, 3))) # [10, 20, 20, 8]
# add bianping layer, 3200 = 20 * 20 * 8
model.add(Flatten()) # [10, 3200]
# add first full connection layer
model.add(Dense(512, activation='sigmoid')) # [10, 512]
# add dropout layer
model.add(Dropout(0.5))
# add second full connection layer
model.add(Dense(4, activation='softmax')) # [10, 4]
return model
可以看到定义模型时输出的网络结构。
2、准备数据
def load_data(resultpath):
datapath = os.path.join(resultpath, "data10_4.npz")
if os.path.exists(datapath):
data = np.load(datapath)
X, Y = data["X"], data["Y"]
else:
X = np.array(np.arange(432000)).reshape(10, 120, 120, 3)
Y = [0, 0, 1, 1, 2, 2, 3, 3, 2, 0]
X = X.astype('float32')
Y = np_utils.to_categorical(Y, 4)
np.savez(datapath, X=X, Y=Y)
print('Saved dataset to dataset.npz.')
print('X_shape:{}nY_shape:{}'.format(X.shape, Y.shape))
return X, Y
3、训练模型
def train_model(resultpath):
model = define_model()
# if want to use SGD, first define sgd, then set optimizer=sgd
sgd = SGD(lr=0.001, decay=1e-6, momentum=0, nesterov=True)
# select lossoptimizer
model.compile(loss=categorical_crossentropy,
optimizer=Adam(), metrics=['accuracy'])
model.summary()
# draw the model structure
plot_model(model, show_shapes=True,
to_file=os.path.join(resultpath, 'model.png'))
# load data
X, Y = load_data(resultpath)
# split train and test data
X_train, X_test, Y_train, Y_test = train_test_split(
X, Y, test_size=0.2, random_state=2)
# input data to model and train
history = model.fit(X_train, Y_train, batch_size=2, epochs=10,
validation_data=(X_test, Y_test), verbose=1, shuffle=True)
# evaluate the model
loss, acc = model.evaluate(X_test, Y_test, verbose=0)
print('Test loss:', loss)
print('Test accuracy:', acc)
可以看到训练时输出的日志。因为是随机数据,没有意义,这里训练的结果不必计较,只是练习而已。
保存下来的模型结构:
4、保存与加载模型并测试
有两种保存方式
4.1 直接保存模型h5
保存:
def my_save_model(resultpath):
model = train_model(resultpath)
# the first way to save model
model.save(os.path.join(resultpath, 'my_model.h5'))
加载:
def my_load_model(resultpath):
# test data
X = np.array(np.arange(86400)).reshape(2, 120, 120, 3)
Y = [0, 1]
X = X.astype('float32')
Y = np_utils.to_categorical(Y, 4)
# the first way of load model
model2 = load_model(os.path.join(resultpath, 'my_model.h5'))
model2.compile(loss=categorical_crossentropy,
optimizer=Adam(), metrics=['accuracy'])
test_loss, test_acc = model2.evaluate(X, Y, verbose=0)
print('Test loss:', test_loss)
print('Test accuracy:', test_acc)
y = model2.predict_classes(X)
print("predicct is: ", y)
4.2 分别保存网络结构和权重
保存:
def my_save_model(resultpath):
model = train_model(resultpath)
# the secon way : save trained network structure and weights
model_json = model.to_json()
open(os.path.join(resultpath, 'my_model_structure.json'), 'w').write(model_json)
model.save_weights(os.path.join(resultpath, 'my_model_weights.hd5'))
加载:
def my_load_model(resultpath):
# test data
X = np.array(np.arange(86400)).reshape(2, 120, 120, 3)
Y = [0, 1]
X = X.astype('float32')
Y = np_utils.to_categorical(Y, 4)
# the second way : load model structure and weights
model = model_from_json(open(os.path.join(resultpath, 'my_model_structure.json')).read())
model.load_weights(os.path.join(resultpath, 'my_model_weights.hd5'))
model.compile(loss=categorical_crossentropy,
optimizer=Adam(), metrics=['accuracy'])
test_loss, test_acc = model.evaluate(X, Y, verbose=0)
print('Test loss:', test_loss)
print('Test accuracy:', test_acc)
y = model.predict_classes(X)
print("predicct is: ", y)
可以看到,两次的结果是一样的。
5、完整代码
from keras.models import Sequential
from keras.layers import Dense, Conv2D, MaxPooling2D, Flatten, Dropout
from keras.losses import categorical_crossentropy
from keras.optimizers import Adam
from keras.utils.vis_utils import plot_model
from keras.optimizers import SGD
from keras.models import model_from_json
from keras.models import load_model
from keras.utils import np_utils
import numpy as np
import os
from sklearn.model_selection import train_test_split
def load_data(resultpath):
datapath = os.path.join(resultpath, "data10_4.npz")
if os.path.exists(datapath):
data = np.load(datapath)
X, Y = data["X"], data["Y"]
else:
X = np.array(np.arange(432000)).reshape(10, 120, 120, 3)
Y = [0, 0, 1, 1, 2, 2, 3, 3, 2, 0]
X = X.astype('float32')
Y = np_utils.to_categorical(Y, 4)
np.savez(datapath, X=X, Y=Y)
print('Saved dataset to dataset.npz.')
print('X_shape:{}nY_shape:{}'.format(X.shape, Y.shape))
return X, Y
def define_model():
model = Sequential()
# setup first conv layer
model.add(Conv2D(32, (3, 3), activation="relu",
input_shape=(120, 120, 3), padding='same')) # [10, 120, 120, 32]
# setup first maxpooling layer
model.add(MaxPooling2D(pool_size=(2, 2))) # [10, 60, 60, 32]
# setup second conv layer
model.add(Conv2D(8, kernel_size=(3, 3), activation="relu",
padding='same')) # [10, 60, 60, 8]
# setup second maxpooling layer
model.add(MaxPooling2D(pool_size=(3, 3))) # [10, 20, 20, 8]
# add bianping layer, 3200 = 20 * 20 * 8
model.add(Flatten()) # [10, 3200]
# add first full connection layer
model.add(Dense(512, activation='sigmoid')) # [10, 512]
# add dropout layer
model.add(Dropout(0.5))
# add second full connection layer
model.add(Dense(4, activation='softmax')) # [10, 4]
return model
def train_model(resultpath):
model = define_model()
# if want to use SGD, first define sgd, then set optimizer=sgd
sgd = SGD(lr=0.001, decay=1e-6, momentum=0, nesterov=True)
# select lossoptimizer
model.compile(loss=categorical_crossentropy,
optimizer=Adam(), metrics=['accuracy'])
model.summary()
# draw the model structure
plot_model(model, show_shapes=True,
to_file=os.path.join(resultpath, 'model.png'))
# load data
X, Y = load_data(resultpath)
# split train and test data
X_train, X_test, Y_train, Y_test = train_test_split(
X, Y, test_size=0.2, random_state=2)
# input data to model and train
history = model.fit(X_train, Y_train, batch_size=2, epochs=10,
validation_data=(X_test, Y_test), verbose=1, shuffle=True)
# evaluate the model
loss, acc = model.evaluate(X_test, Y_test, verbose=0)
print('Test loss:', loss)
print('Test accuracy:', acc)
return model
def my_save_model(resultpath):
model = train_model(resultpath)
# the first way to save model
model.save(os.path.join(resultpath, 'my_model.h5'))
# the secon way : save trained network structure and weights
model_json = model.to_json()
open(os.path.join(resultpath, 'my_model_structure.json'), 'w').write(model_json)
model.save_weights(os.path.join(resultpath, 'my_model_weights.hd5'))
def my_load_model(resultpath):
# test data
X = np.array(np.arange(86400)).reshape(2, 120, 120, 3)
Y = [0, 1]
X = X.astype('float32')
Y = np_utils.to_categorical(Y, 4)
# the first way of load model
model2 = load_model(os.path.join(resultpath, 'my_model.h5'))
model2.compile(loss=categorical_crossentropy,
optimizer=Adam(), metrics=['accuracy'])
test_loss, test_acc = model2.evaluate(X, Y, verbose=0)
print('Test loss:', test_loss)
print('Test accuracy:', test_acc)
y = model2.predict_classes(X)
print("predicct is: ", y)
# the second way : load model structure and weights
model = model_from_json(open(os.path.join(resultpath, 'my_model_structure.json')).read())
model.load_weights(os.path.join(resultpath, 'my_model_weights.hd5'))
model.compile(loss=categorical_crossentropy,
optimizer=Adam(), metrics=['accuracy'])
test_loss, test_acc = model.evaluate(X, Y, verbose=0)
print('Test loss:', test_loss)
print('Test accuracy:', test_acc)
y = model.predict_classes(X)
print("predicct is: ", y)
def main():
resultpath = "result"
#train_model(resultpath)
#my_save_model(resultpath)
my_load_model(resultpath)
if __name__ == "__main__":
main()
以上这篇使用Keras建立模型并训练等一系列操作方式就是小编分享给大家的全部内容了,希望能给大家一个参考。
- 腾讯人工智能实验室AI Lab主任张潼博士前沿对话:AI如何助力营销?
- 当前所有源码链接
- Python入门基础连载(4)控制流
- 济西站构建大数据运营网络,打造智能化列车加工厂
- 五位数终端收购的域名dongxiao.cn已启用
- 全球互联网发展进入“拐点”——展望下一代互联网
- 2 分钟论文:语音生成表情包背后的技术原理
- 享学课堂谈-Python程序员的常见错误
- 区块链技术,如何提升网络安全?
- 趣店推“大白汽车”业务 启用域名dabaiqiche.com
- 糖果吃了那么多,你真的知道比特币分叉是咋回事吗?
- 静息态网络拓扑传输认知任务信息
- MYSQL字符串截取总结:LEFT、RIGHT、SUBSTRING、SUBSTRING
- 如何用Python提取中文关键词?
- JavaScript 教程
- JavaScript 编辑工具
- JavaScript 与HTML
- JavaScript 与Java
- JavaScript 数据结构
- JavaScript 基本数据类型
- JavaScript 特殊数据类型
- JavaScript 运算符
- JavaScript typeof 运算符
- JavaScript 表达式
- JavaScript 类型转换
- JavaScript 基本语法
- JavaScript 注释
- Javascript 基本处理流程
- Javascript 选择结构
- Javascript if 语句
- Javascript if 语句的嵌套
- Javascript switch 语句
- Javascript 循环结构
- Javascript 循环结构实例
- Javascript 跳转语句
- Javascript 控制语句总结
- Javascript 函数介绍
- Javascript 函数的定义
- Javascript 函数调用
- Javascript 几种特殊的函数
- JavaScript 内置函数简介
- Javascript eval() 函数
- Javascript isFinite() 函数
- Javascript isNaN() 函数
- parseInt() 与 parseFloat()
- escape() 与 unescape()
- Javascript 字符串介绍
- Javascript length属性
- javascript 字符串函数
- Javascript 日期对象简介
- Javascript 日期对象用途
- Date 对象属性和方法
- Javascript 数组是什么
- Javascript 创建数组
- Javascript 数组赋值与取值
- Javascript 数组属性和方法
- 利用Vscode进行Python开发环境配置的步骤
- python + selenium 刷B站播放量的实例代码
- 巧用x-cos-traffic-limit header来限制上传下载速率
- centos7上编译安装php7以php-fpm方式连接apache
- 基于Python实现视频的人脸融合功能
- 如何理解python对象
- Virtualenv 搭建 Py项目运行环境的教程详解
- python字符串的index和find的区别详解
- 浅谈Python 参数与变量
- 宝塔面板成功部署Django项目流程(图文)
- Python celery原理及运行流程解析
- Python Scrapy图片爬取原理及代码实例
- Python-for循环的内部机制
- 解决Tensorflow2.0 tf.keras.Model.load_weights() 报错处理问题
- 基于python实现模拟数据结构模型