深度学习001-从hello world开始

最新推荐文章于 2023-11-28 18:12:09 发布

Time Rolls On By

最新推荐文章于 2023-11-28 18:12:09 发布

阅读量429

点赞数 1

CC 4.0 BY-SA版权

分类专栏：深度学习图像处理文章标签：深度学习 tensorflow 神经网络

本文链接：https://blog.csdn.net/hanfei410/article/details/106146023

本文从深度学习的"hello world"开始，介绍如何使用TensorFlow实现一个简单的卷积神经网络。首先搭建了开发环境，然后通过3个步骤：准备数据、构建并训练模型、恢复模型进行推理验证。在训练中遇到学习率设置问题，调整后模型准确率显著提升。文章提供了部分核心代码，并提供完整版本的下载链接。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

从"hello world"开始

程序员思维，一切新事物的学习都有一个hello world，深度学习也不例外，本文就从机器学习届的“hello world”开始，实现一个简单的卷积神经网络，实现本文的前提是开发环境已经搭建完成，目前我所使用的开发环境如下：
硬件：1080ti x2
操作系统：Ubuntu16.04
软件：python3.5
tensoflow-gpu 1.4.0
pycharm 2020
在这里插入图片描述

基本包含以下三个步骤：
1、准备数据，编写数据读写功能函数
2、构建并训练保存模型
3、恢复模型并进行推理验证
最终的效果如下：
训练过程：
在这里插入图片描述在训练过程中出现了一点小问题，模型准确度始终在0.010左右徘徊，后来发现是学习率设置的过大（learning_rate = 0.1），改为0.001后，准确率瞬间飙升到0.9左右，由此可见相关超参对模型训练的影响之深。
推理结果：
在这里插入图片描述这里对测试图像进行了显示，并将推理结果及真实值进行了显示
部分核心代码如下：

from tensorflow.examples.tutorials.mnist import input_data
import tensorflow as tf
import matplotlib.pyplot as plt
import os

mnist = input_data.read_data_sets('../MNIST_data', one_hot=True)
# Parameters
learning_rate = 0.001
training_epochs = 500
batch_size = 100
display_step = 1
is_train = 1
# Network Parameters
n_input = 784
n_classes = 10
# tf Graph input
x = tf.placeholder("float", [None, n_input])
y = tf.placeholder("float", [None, n_classes])
# pre-define
def conv2d(x, W):
    return tf.nn.conv2d(x, W,strides=[1, 1, 1, 1],padding='SAME', name='conv_2d')
def max_pool_2x2(x):
    return tf.nn.max_pool(x, ksize=[1, 2, 2, 1],strides=[1, 2, 2, 1],padding='SAME', name='max_pool')

# Create model
def multilayer_preceptron(x, weights, biases):
    # now,we want to change this to a CNN network
    # first,reshape the data to 4_D ,
    x_image = tf.reshape(x, [-1, 28<