【TensorFlow】LSTM（使用TFLearn预测正弦sin函数）

shaoheshaohe · 发表于 2019-1-3 11:56:26

数据生成

因为标准的循环神经网络模型预测的是离散的数值，所以需要将连续的 sin 函数曲线离散化

所谓离散化就是在一个给定的区间 [0,MAX] 内，通过有限个采样点模拟一个连续的曲线，即间隔相同距离取点

采样用的是 numpy.linspace() 函数，它可以创建一个等差序列，常用的参数有三个

start：起始值
stop：终止值，不包含在内
num：数列长度，默认为 50

然后使用一个 generate_data() 函数生成输入和输出，序列的第 i 项和后面的 TIMESTEPS-1 项合在一起作为输入，第 i + TIMESTEPS 项作为输出

整个代码的结构如下

lstm_model() 类用于创建 LSTM 网络并返回一些结果
LstmCell() 函数用于创建单层 LSTM 结构，防止 LSTM 参数名称一样
generate_data() 函数用于创建数据集

由于原书中的代码是基于 1.0，而我用的是 1.5，所以出现了很多错误，我将所遇到的错误的解决方法都记录在了文末

import numpy as np
import tensorflow as tf
import matplotlib as mpl
from matplotlib import pyplot as plt
from tensorflow.contrib.learn.python.learn.estimators.estimator import SKCompat

# TensorFlow的高层封装TFLearn
learn = tf.contrib.learn

# 神经网络参数
HIDDEN_SIZE = 30  # LSTM隐藏节点个数
NUM_LAYERS = 2  # LSTM层数
TIMESTEPS = 10  # 循环神经网络截断长度
BATCH_SIZE = 32  # batch大小

# 数据参数
TRAINING_STEPS = 3000  # 训练轮数
TRAINING_EXAMPLES = 10000  # 训练数据个数
TESTING_EXAMPLES = 1000  # 测试数据个数
SAMPLE_GAP = 0.01  # 采样间隔

def generate_data(seq):
# 序列的第i项和后面的TIMESTEPS-1项合在一起作为输入，第i+TIMESTEPS项作为输出
X = []
y = []
for i in range(len(seq) - TIMESTEPS - 1):
      X.append([seq[i:i + TIMESTEPS]])
      y.append([seq[i + TIMESTEPS]])
return np.array(X, dtype=np.float32), np.array(y, dtype=np.float32)

# LSTM结构单元
def LstmCell():
lstm_cell = tf.contrib.rnn.BasicLSTMCell(HIDDEN_SIZE)
return lstm_cell

def lstm_model(X, y):
# 使用多层LSTM，不能用lstm_cell*NUM_LAYERS的方法，会导致LSTM的tensor名字都一样
cell = tf.contrib.rnn.MultiRNNCell([LstmCell() for _ in range(NUM_LAYERS)])

# 将多层LSTM结构连接成RNN网络并计算前向传播结果
output, _ = tf.nn.dynamic_rnn(cell, X, dtype=tf.float32)
output = tf.reshape(output, [-1, HIDDEN_SIZE])

# 通过无激活函数的全联接层计算线性回归，并将数据压缩成一维数组的结构
predictions = tf.contrib.layers.fully_connected(output, 1, None)

# 将predictions和labels调整为统一的shape
y = tf.reshape(y, [-1])
predictions = tf.reshape(predictions, [-1])

# 计算损失值
loss = tf.losses.mean_squared_error(predictions, y)

# 创建模型优化器并得到优化步骤
train_op = tf.contrib.layers.optimize_loss(
      loss,
      tf.train.get_global_step(),
      optimizer='Adagrad',
      learning_rate=0.1)

return predictions, loss, train_op

# 用sin生成训练和测试数据集
test_start = TRAINING_EXAMPLES * SAMPLE_GAP
test_end = (TRAINING_EXAMPLES + TESTING_EXAMPLES) * SAMPLE_GAP
train_X, train_y = generate_data(
np.sin(np.linspace(0, test_start, TRAINING_EXAMPLES, dtype=np.float32)))
test_X, test_y = generate_data(
np.sin(
      np.linspace(test_start, test_end, TESTING_EXAMPLES, dtype=np.float32)))

# 建立深层循环网络模型
regressor = SKCompat(learn.Estimator(model_fn=lstm_model, model_dir='model/'))

# 调用fit函数训练模型
regressor.fit(train_X, train_y, batch_size=BATCH_SIZE, steps=TRAINING_STEPS)

# 使用训练好的模型对测试集进行预测
predicted = [[pred] for pred in regressor.predict(test_X)]

# 计算rmse作为评价指标
rmse = np.sqrt(((predicted - test_y)**2).mean(axis=0))
print('Mean Square Error is: %f' % (rmse[0]))

# 对预测曲线绘图，并存储到sin.jpg
fig = plt.figure()
plot_predicted, = plt.plot(predicted, label='predicted')
plot_test, = plt.plot(test_y, label='real_sin')
plt.legend([plot_predicted, plot_test], ['predicted', 'real_sin'])
plt.show()

---------------------
作者：widiot8023
来源：CSDN
原文：https://blog.csdn.net/white_idiot/article/details/78882856
版权声明：本文为博主原创文章，转载请附上博文链接！

Happy清子 · 发表于 2019-1-4 16:03:47

这篇文章的亮点
是rnn的使用
这个建立多层rnn的代码
可以运行
换个方式就不行了
别问为啥，tensorflow就这么任性

Happy清子 · 发表于 2019-1-31 17:20:17

版本的问题
确实是
tf的一大难题
无法向后兼容，真是醉了