
Deep Learning with PyTorch, Concise Notes, Chapter 6: Using a neural network to fit the data

July 26, 2020 | 移动技术网 IT编程


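PyTorch, a relative newcomer among deep learning frameworks, has attracted a growing following thanks to its simple API and clear documentation. These notes summarize Chapter 6, "Using a neural network to fit the data", of the book Deep Learning with PyTorch, with brief explanations. They serve as my own study record and as a reference for others.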

Contents

Deep Learning with PyTorch, Concise Notes, Chapter 6: Using a neural network to fit the data
Main topics
1. Artificial neurons
2. PyTorch's nn module
3. The final neural network

Main topics

Nonlinear activation functions
Using PyTorch's nn module
Solving the linear fitting problem with a neural network

1. Artificial neurons

The most basic building block of a complex function is the neuron.


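A complex function is a chain of connected neurons. Written out, it takes the form of functions nested inside one another, one level per layer.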
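The nesting can be made concrete with a minimal sketch in plain Python. The helper names `neuron` and `two_layer` and the numeric values are illustrative assumptions, not taken from the book; tanh is used as the activation only because it is the one these notes use later.

```python
import math

def neuron(x, w, b):
    """One artificial neuron: a linear step (w * x + b) followed by a nonlinearity."""
    return math.tanh(w * x + b)

def two_layer(x, w0, b0, w1, b1):
    """Two neurons chained: the output of the first is the input of the second."""
    return neuron(neuron(x, w0, b0), w1, b1)

x = 0.5
inner = neuron(x, w=2.0, b=0.0)            # tanh(2.0 * 0.5 + 0.0)
outer = two_layer(x, 2.0, 0.0, 3.0, 0.0)   # tanh(3.0 * tanh(1.0) + 0.0)
print(inner, outer)
```

Each extra layer adds one more level of nesting, which is all that "deep" means at this scale.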


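The simplest unit in deep learning is a linear operation followed by a nonlinear activation function. The activation function serves two main purposes:

In the inner parts of the model, it allows the output function to have different slopes at different values.
At the last layer of the model, it concentrates the output of the preceding linear operation into a given range.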
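Both points can be checked with a small sketch, assuming a toy 1 → 4 → 1 network that is not in the original notes. Without an activation between them, two linear layers collapse into a single linear map with one slope everywhere; a final tanh keeps the output within [-1, 1].

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
x = torch.linspace(-3.0, 3.0, steps=7).unsqueeze(1)  # shape (7, 1)

# Two linear layers with no activation in between.
stacked = nn.Sequential(nn.Linear(1, 4), nn.Linear(4, 1))

# Collapse them by hand into one linear map: W = W2 @ W1, b = W2 @ b1 + b2.
w1, b1 = stacked[0].weight, stacked[0].bias
w2, b2 = stacked[1].weight, stacked[1].bias
w = w2 @ w1        # shape (1, 1)
b = w2 @ b1 + b2   # shape (1,)

with torch.no_grad():
    # The stacked model and the single collapsed line give the same outputs.
    same = torch.allclose(stacked(x), x @ w.T + b, atol=1e-6)

    # A tanh at the end confines even large pre-activations to [-1, 1].
    bounded = torch.tanh(10.0 * x)

print(same, bounded.min().item(), bounded.max().item())
```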

There are many types of activation function; common ones include Tanh, Sigmoid, and ReLU.

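In general, activation functions are nonlinear and differentiable (isolated kinks, as in ReLU, are acceptable). Each has a sensitive range, where the output changes rapidly with the input, and an insensitive range, where the output changes slowly or barely at all.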
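As a rough illustration for tanh, the slope can be evaluated at a few inputs using the identity d/dx tanh(x) = 1 - tanh(x)^2. The sample points are arbitrary choices.

```python
import math

def tanh_slope(x):
    """Derivative of tanh at x: 1 - tanh(x)^2."""
    return 1.0 - math.tanh(x) ** 2

# Near 0 the slope is largest (sensitive range); far from 0 it fades (insensitive range).
for x in (0.0, 1.0, 3.0, 6.0):
    print(f"x = {x:4.1f}   tanh = {math.tanh(x):.4f}   slope = {tanh_slope(x):.6f}")
```

The slope is exactly 1 at x = 0 and shrinks toward 0 as |x| grows, which is why inputs far outside the sensitive range contribute almost no gradient.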

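Learning the parameters can then be understood as adjusting the offset and scale of the nonlinear activation functions so that the model fits the data better.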
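A small sketch of what offset and scale mean for a single tanh unit. The helper names `unit` and `midpoint` and the (w, b) pairs are hypothetical, chosen only for illustration.

```python
import math

def unit(x, w, b):
    """A single tanh unit; w acts as the scale and b as the offset."""
    return math.tanh(w * x + b)

def midpoint(w, b):
    """Input where w * x + b = 0, the centre of the unit's sensitive range."""
    return -b / w

# Changing b moves the transition; changing w makes it sharper or gentler.
for w, b in [(1.0, 1.0), (1.0, -2.0), (4.0, -8.0)]:
    x0 = midpoint(w, b)
    rise = unit(x0 + 0.5, w, b) - unit(x0 - 0.5, w, b)
    print(f"w={w}, b={b}: centred at x={x0}, rise across one unit of x = {rise:.3f}")
```

Training adjusts w and b for every unit, so that the sensitive ranges end up where the data needs them.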


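2. PyTorch's nn module

PyTorch has a submodule dedicated to neural networks, torch.nn. It contains the basic building blocks needed to construct a neural network. When a model needs submodules held in a list or a dictionary, PyTorch provides nn.ModuleList and nn.ModuleDict.

nn.Module defines a __call__() method, so an instance of any of its subclasses can be called like a function.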
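The reason these containers exist can be shown with a short sketch: layers kept in a plain Python list are not registered as submodules, so their parameters never reach the optimizer. The class names `PlainListNet` and `ModuleListNet` and the "celsius" key are hypothetical.

```python
import torch
import torch.nn as nn

class PlainListNet(nn.Module):
    def __init__(self):
        super().__init__()
        # A plain Python list: these layers are NOT registered as submodules.
        self.layers = [nn.Linear(1, 4), nn.Linear(4, 1)]

class ModuleListNet(nn.Module):
    def __init__(self):
        super().__init__()
        # nn.ModuleList / nn.ModuleDict register every contained module.
        self.layers = nn.ModuleList([nn.Linear(1, 4), nn.Linear(4, 1)])
        self.heads = nn.ModuleDict({"celsius": nn.Linear(1, 1)})

    def forward(self, x, head="celsius"):
        for layer in self.layers:
            x = torch.tanh(layer(x))
        return self.heads[head](x)

n_plain = len(list(PlainListNet().parameters()))        # nothing registered
n_registered = len(list(ModuleListNet().parameters()))  # weight + bias per layer
out = ModuleListNet()(torch.ones(5, 1))
print(n_plain, n_registered, out.shape)
```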

# In[5]: 
import torch
import torch.nn as nn
linear_model = nn.Linear(1, 1)
linear_model(t_un_val)

# Out[5]: 
tensor([[0.6018], [0.2877]], grad_fn=<AddmmBackward>)

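Calling the model with arguments (the data) invokes the model's forward() with those same arguments, so there is no need to call forward() by hand.

y = model(x) # correct
y = model.forward(x) # avoid: skips the hooks that __call__ runs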
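The difference is visible once a hook is attached. In this minimal sketch the hook function `record` and the list `calls` are hypothetical names; `register_forward_hook` is the standard nn.Module method.

```python
import torch
import torch.nn as nn

model = nn.Linear(1, 1)
calls = []

def record(module, inputs, output):
    """Forward hook: remember the shape of every output produced via __call__."""
    calls.append(tuple(output.shape))

handle = model.register_forward_hook(record)

x = torch.ones(3, 1)
y1 = model(x)          # goes through __call__, so the hook runs
y2 = model.forward(x)  # bypasses __call__, so the hook is silently skipped
handle.remove()

print(calls)
```

Both calls return the same numbers, which is exactly why the mistake is easy to miss: only one of them was recorded.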

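The linear model used above, nn.Linear, takes three arguments: the number of input features, the number of output features, and whether to include a bias term (True by default). The weight and bias attributes expose the model's parameters.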

# In[6]: 
linear_model.weight

# Out[6]: 
Parameter containing: 
tensor([[-0.0674]], requires_grad=True)

# In[7]: 
linear_model.bias

# Out[7]:
Parameter containing: 
tensor([0.7488], requires_grad=True)
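To see how the three arguments shape the parameters, here is a sketch with sizes chosen only for illustration (3 inputs, 2 outputs). It also checks that the layer computes x @ W^T + b.

```python
import torch
import torch.nn as nn

layer = nn.Linear(3, 2)                 # 3 input features, 2 output features
no_bias = nn.Linear(3, 2, bias=False)   # same layer without the bias term

print(layer.weight.shape)   # weight is stored as (out_features, in_features)
print(layer.bias.shape)
print(no_bias.bias)         # no bias parameter is created

x = torch.ones(4, 3)        # a batch of 4 samples
with torch.no_grad():
    manual = x @ layer.weight.T + layer.bias
    matches = torch.allclose(layer(x), manual, atol=1e-6)
print(matches)
```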

Passing an input the same way as before gives the network's output.

# In[8]: 
x = torch.ones(1)
linear_model(x)

# Out[8]: 
tensor([0.6814], grad_fn=<AddBackward0>)

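To pass in a batch of data, arrange it as a column vector with one sample per row, giving shape (batch size, 1), and feed that to the network.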

# In[9]: 
x = torch.ones(10, 1) 
linear_model(x)
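Data often starts out as a 1-D tensor, so it needs an extra dimension before it can be used as a batch. The sketch below uses made-up sample values and `unsqueeze(1)` to build the column.

```python
import torch
import torch.nn as nn

linear_model = nn.Linear(1, 1)

# Made-up measurements as a 1-D tensor of shape (4,).
t_u = torch.tensor([35.7, 55.9, 58.2, 81.9])

# unsqueeze(1) adds a feature dimension: (4,) -> (4, 1), one sample per row.
batch = t_u.unsqueeze(1)
out = linear_model(batch)

print(batch.shape, out.shape)
```

The first dimension is the batch size and the last must match the layer's number of input features. The output keeps the batch dimension, with one prediction per row.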

Next we define the model and the optimizer in this way, using nn.Module.parameters() to hand the model's parameters to the optimizer.

# In[10]: 
import torch.optim as optim

linear_model = nn.Linear(1, 1)
optimizer = optim.SGD(linear_model.parameters(), lr=1e-2)

# In[11]: 
linear_model.parameters()

# Out[11]: 
<generator object Module.parameters at 0x7f94b4a8a750>

# In[12]: 
list(linear_model.parameters())

# Out[12]: 
[Parameter containing: 
tensor([[0.7398]], requires_grad=True), Parameter containing: 
tensor([0.7974], requires_grad=True)]

Then define the training loop.

# In[13]: 
def training_loop(n_epochs, optimizer, model, loss_fn,
                  t_u_train, t_u_val, t_c_train, t_c_val):
    for epoch in range(1, n_epochs + 1):
        # Forward pass and loss on the training set
        t_p_train = model(t_u_train)
        loss_train = loss_fn(t_p_train, t_c_train)

        # Forward pass and loss on the validation set (for monitoring only)
        t_p_val = model(t_u_val)
        loss_val = loss_fn(t_p_val, t_c_val)

        # Clear old gradients, backpropagate the training loss, update parameters
        optimizer.zero_grad()
        loss_train.backward()
        optimizer.step()

        if epoch == 1 or epoch % 1000 == 0:
            print(f"Epoch {epoch}, Training loss {loss_train.item():.4f},"
                  f" Validation loss {loss_val.item():.4f}")
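For readers who want to run a loop of this kind end to end, here is a self-contained sketch on synthetic data. Everything in it is an assumption made for illustration: the data (y = 2x + 1 plus small noise), the hand-written `mse` loss, the learning rate, and the epoch count. As a small variation on the loop above, the validation pass is wrapped in `torch.no_grad()` and the losses are collected in a list instead of printed.

```python
import torch
import torch.nn as nn
import torch.optim as optim

torch.manual_seed(0)

# Synthetic data (assumed for this sketch): y = 2x + 1 plus a little noise.
x = torch.linspace(-1.0, 1.0, 20).unsqueeze(1)   # shape (20, 1)
y = 2.0 * x + 1.0 + 0.01 * torch.randn_like(x)
x_train, y_train = x[::2], y[::2]                # even rows for training
x_val, y_val = x[1::2], y[1::2]                  # odd rows for validation

def mse(pred, target):
    """Hand-written mean squared error."""
    return ((pred - target) ** 2).mean()

def training_loop(n_epochs, optimizer, model, loss_fn,
                  x_train, x_val, y_train, y_val):
    history = []
    for epoch in range(1, n_epochs + 1):
        loss_train = loss_fn(model(x_train), y_train)
        with torch.no_grad():                    # validation needs no gradients
            loss_val = loss_fn(model(x_val), y_val)
        optimizer.zero_grad()
        loss_train.backward()
        optimizer.step()
        history.append((loss_train.item(), loss_val.item()))
    return history

model = nn.Linear(1, 1)
optimizer = optim.SGD(model.parameters(), lr=1e-1)
history = training_loop(500, optimizer, model, mse,
                        x_train, x_val, y_train, y_val)

print(f"first epoch: {history[0]}, last epoch: {history[-1]}")
print(f"learned weight {model.weight.item():.3f}, bias {model.bias.item():.3f}")
```

The learned weight and bias should land close to the 2 and 1 used to generate the data.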

For the loss function, PyTorch also provides nn.MSELoss, so there is no need to write the loss by hand.

# In[15]: 
linear_model = nn.Linear(1, 1)
optimizer = optim.SGD(linear_model.parameters(), lr=1e-2)

# The t_* tensors are the training/validation splits prepared beforehand
training_loop(
    n_epochs=3000,
    optimizer=optimizer,
    model=linear_model,
    loss_fn=nn.MSELoss(),
    t_u_train=t_un_train,
    t_u_val=t_un_val,
    t_c_train=t_c_train,
    t_c_val=t_c_val)
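As a quick sanity check, nn.MSELoss with its default mean reduction should agree with a hand-written mean of squared differences. The tensors below are made-up values.

```python
import torch
import torch.nn as nn

# Made-up predictions and targets, shaped (batch, 1) like the model's output.
pred = torch.tensor([[1.0], [2.0], [4.0]])
target = torch.tensor([[1.5], [2.0], [2.0]])

by_hand = ((pred - target) ** 2).mean()   # (0.25 + 0.0 + 4.0) / 3
built_in = nn.MSELoss()(pred, target)     # default reduction is 'mean'

print(by_hand.item(), built_in.item())
```

Note that nn.MSELoss is a module: it is instantiated first and then called on (prediction, target), which is why the training call above passes `nn.MSELoss()` with parentheses.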

3. The final neural network

Starting from the linear network above, we add an activation function.


# In[16]: 
seq_model = nn.Sequential(
    nn.Linear(1, 13),
    nn.Tanh(),
    nn.Linear(13, 1))
seq_model

# Out[16]: 
Sequential(
  (0): Linear(in_features=1, out_features=13, bias=True)
  (1): Tanh()
  (2): Linear(in_features=13, out_features=1, bias=True)
)
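To follow what happens to a batch inside this model, the layers can be applied one at a time, since an nn.Sequential can be iterated over. The input values are arbitrary and the list name `shapes` is illustrative.

```python
import torch
import torch.nn as nn

seq_model = nn.Sequential(
    nn.Linear(1, 13),
    nn.Tanh(),
    nn.Linear(13, 1))

x = torch.linspace(-2.0, 2.0, steps=5).unsqueeze(1)   # batch of 5, shape (5, 1)

# Apply the layers one by one and record the shape after each step.
shapes = []
h = x
for layer in seq_model:
    h = layer(h)
    shapes.append(tuple(h.shape))

print(shapes)
print(torch.allclose(h, seq_model(x)))   # same result as calling the model directly
```

The single input feature is expanded to 13 hidden features, passed through Tanh element-wise, and reduced back to one output per sample.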

nn.Sequential() wraps the layers so that they run in order. The parameters can again be inspected with nn.Module.parameters().

# In[17]: 
[param.shape for param in seq_model.parameters()]

# Out[17]: 
[torch.Size([13, 1]), torch.Size([13]), torch.Size([1, 13]), torch.Size([1])]
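From these shapes the total number of trainable values follows directly: 13 + 13 + 13 + 1. A short sketch that counts them with `numel()`:

```python
import torch.nn as nn

seq_model = nn.Sequential(
    nn.Linear(1, 13),
    nn.Tanh(),
    nn.Linear(13, 1))

# numel() gives the number of elements in each parameter tensor.
counts = [p.numel() for p in seq_model.parameters()]
n_params = sum(counts)
print(counts, n_params)
```

nn.Tanh contributes nothing to the count because it has no parameters.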

To tell the weights and biases apart, use nn.Module.named_parameters().

# In[18]: 
for name, param in seq_model.named_parameters(): 
	print(name, param.shape)

# Out[18]: 
0.weight torch.Size([13, 1])
0.bias torch.Size([13]) 
2.weight torch.Size([1, 13]) 
2.bias torch.Size([1])

To give the layers of the network names, pass an OrderedDict.

# In[19]: 
from collections import OrderedDict
seq_model = nn.Sequential(OrderedDict([
    ('hidden_linear', nn.Linear(1, 8)),
    ('hidden_activation', nn.Tanh()),
    ('output_linear', nn.Linear(8, 1))
]))
seq_model

# Out[19]: 
Sequential(
  (hidden_linear): Linear(in_features=1, out_features=8, bias=True)
  (hidden_activation): Tanh()
  (output_linear): Linear(in_features=8, out_features=1, bias=True)
)

# In[20]: 
for name, param in seq_model.named_parameters(): 
	print(name, param.shape)

# Out[20]:
hidden_linear.weight torch.Size([8, 1]) 
hidden_linear.bias torch.Size([8]) 
output_linear.weight torch.Size([1, 8]) 
output_linear.bias torch.Size([1])

With an OrderedDict, a layer's parameters can be accessed directly through the layer's name.

# In[21]: 
seq_model.output_linear.bias

# Out[21]: 
Parameter containing: 
tensor([-0.0173], requires_grad=True)
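Name-based access is also a convenient way to inspect gradients. In the sketch below the input and the toy loss (a plain sum of the outputs) are assumptions for illustration. It also shows that positional indexing still works on a named Sequential.

```python
import torch
import torch.nn as nn
from collections import OrderedDict

seq_model = nn.Sequential(OrderedDict([
    ('hidden_linear', nn.Linear(1, 8)),
    ('hidden_activation', nn.Tanh()),
    ('output_linear', nn.Linear(8, 1))
]))

# Index access and name access return the very same layer object.
same_layer = seq_model[2] is seq_model.output_linear

# Before any backward pass, no gradient has been stored.
grad_before = seq_model.hidden_linear.weight.grad

# Toy loss: sum of the outputs for a batch of 4 ones.
loss = seq_model(torch.ones(4, 1)).sum()
loss.backward()

grad_after = seq_model.hidden_linear.weight.grad
print(same_layer, grad_before, grad_after.shape)
print(seq_model.output_linear.bias.grad)
```

With this toy loss each of the 4 outputs depends on the output bias with slope 1, so the bias gradient is 4.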

Original article: https://blog.csdn.net/pengwill97/article/details/107580063
