【读书1】【2017】MATLAB与深度学习——正视过度拟合(1)

原作者: [db:作者] 来自: [db:来源] 收藏邀请

我们将在第三章的“代价函数和学习规则”部分进一步详述正则化的相关内容。

We will revisit regularization with furtherdetails in Chapter Three’s “Cost Function and Learning Rule” section.

在前面的数据分组例子中，由于训练数据简单，而且模型易于可视化，因此我们可以看出分组模型已经过度拟合。

We are able to tell that the grouping modelis overfitted because the training data is simple, and the model can be easilyvisualized.

然而，对于大多数情况，情况并非如此，因为被处理的数据具有更高的维度。

However, this is not the case for mostsituations, as the data has higher dimensions.

对于高维度的数据，我们无法绘制模型并直观地评估过度拟合的影响。

We cannot draw the model and intuitivelyevaluate the effects of overfitting for such data.

因此，我们需要另一种方法来确定训练过的模型是否被过度拟合。

Therefore, we need another method todetermine whether the trained model is overfitted or not.

这就是验证方法发挥作用的地方。

This is where validation comes into play.

验证是保留训练数据的一部分并使用它来监视模型性能的过程。

The validation is a process that reserves apart of the training data and uses it to monitor the performance.

验证数据不用于训练过程。

The validation set is not used for thetraining process.

因为训练数据的建模误差不能用于表明数据的过度拟合，所以我们使用训练数据中的一部分来检查模型是否过度拟合。

Because the modeling error of the trainingdata fails to indicate overfitting, we use some of the training data to checkif the model is overfitted.

我们可以说，当训练模型对保留的数据输入产生低性能时，模型被过度拟合。

We can say that the model is overfittedwhen the trained model yields a low level of performance to the reserved datainput.

在这种情况下，我们将修改模型，以防止过度拟合。

In this case, we will modify the model toprevent the overfitting.

图1-10示出了验证过程中训练数据的划分。

Figure 1-10 illustrates the division of thetraining data for the validation process.

图1-10 为验证过程划分训练数据集Dividing the trainingdata for the validation process

当涉及到验证时，机器学习的训练过程通过以下步骤进行：

When validation is involved, the trainingprocess of Machine Learning proceeds by the following steps:

将训练数据分成两组：一组用于训练，另一组用于验证。
Divide thetraining data into two groups: one for training and the other for validation.

作为应用上的经验法则，训练集与验证集的比率是8:2。

As a rule of thumb, the ratio of thetraining set to the validation set is 8:2.

用训练集训练模型。
Train the model with the training set.
使用验证集来评估模型的性能。
Evaluate the performance of the modelusing the validation set.

a. 如果模型得到满意的性能，则完成训练。

a. If the model yields satisfactoryperformance, finish the training.

b. 如果性能没有得到满意的结果，则修改模型，从步骤2重复以上过程。

b. If theperformance does not produce sufficient results, modify the model and repeatthe process from Step 2.

交叉验证是一种轻微变化的验证过程。

Cross-validation is a slight variation ofthe validation process.

它仍然将训练数据分成两组分别进行训练和验证，但是不断改变数据集。

It still divides the training data intogroups for the training and validation, but keeps changing the datasets.

交叉验证不保留最初划分的集合，而是重复数据的划分。

Instead of retaining the initially dividedsets, cross-validation repeats the division of the data.

这样做的原因是，即使在验证数据集被固定时，模型也可以能被过度拟合。

The reason for doing this is that the modelcan be overfitted even to the validation set when it is fixed.

——本文译自Phil Kim所著的《Matlab Deep Learning》

更多精彩文章请关注微信号：

鲜花

握手

雷人

路过

鸡蛋

该文章已有0人参与评论

请发表评论

全部评论

专题导读

More+

10-27 六六分期app的软件客服如何联系？(六六分期

11-06 可心卡盟:win10系统火狐flash插件崩溃怎么

11-06 亲亲特价:怎么删除回收站图标

11-06 济南大学虚拟社区:鲁大师节能降温的具体办

11-06 xlueops.exe:无线网络安装向导

11-06 女斗合众国:win7系统cf与主机连接不稳定怎

11-06 0xc000022-[cf烟雾头]cf怎么调烟雾头

11-06 qizideyouhuo:应用程序无法正常启动0xc0000

11-06 ipz-185:win7系统vcf文件怎么打开

11-06 傻哥蹦迪:win10系统s4怎么打开usb调试

11-06 八神浩树gtaste:回收站清空了怎么恢复

11-06 妖尾之黑色守护:win10系统电脑没有1440x900

11-06 校园至尊魔王小说:win7系统浏览网页时字体

11-06 女斗合众国:win10系统访问共享文件夹提示请

11-06 tokyo hot n0654:恢复win7系统默认字体一招

11-06 雨酷仙境:设置win7系统转移临时文件夹腾出

11-06 阿穆纳伊之杖:win7系统开始菜单在右边还原

11-06 tunespotting:win10系统火狐flash插件总是

11-06 甘尔葛分析师：计谋网站seo关键词暴涨有什

11-06 蔡贵霖: 计谋网站seo关键词暴涨有什么秘密

11-06 博益网首页:ao3网页版进入不了解决方法

11-06 漏斗子专栏: 网站数据分析小白易懂精华篇

11-06 见证双虹怎么做:win7系统开启telnet命令的

11-06 颾狐蝶蜋:系统资源不足无法完成请求的服务

11-06 国光中学校歌:提交网站到alexa查询详细步骤

11-06 西安有情天:静态网页和动态网页的区别

11-06 红木雅尚斋:外部链接构造对网站的好处

11-06 前官礼遇：防止域名劫持–增强域安全性的10

11-06 密传二转答案: 中文分词算法有哪些

11-06 金泉家园邮编:百度快照劫持的表现及应对方

delphiGDI图片压缩代码据说是位图缩放保持原图视觉效果最好的算法 ...发布时间：2022-07-18

[原创]delphiKeyUp、KeyPress、Keydown区别和用法，如何不按键盘调用事件 ...发布时间：2022-07-18

剪的笔顺,诠释剪的笔画,认识剪的部首

1 六六分期app的软件客服如何联系？(六六分期

六六分期app的软件客服如何联系？不知道吗？加qq群【895510560】即可！标题：六六分期

阅读：17973|2023-10-27

2 可心卡盟:win10系统火狐flash插件崩溃怎么

今天小编告诉大家如何处理win10系统火狐flash插件总是崩溃的问题，可能很多用户都不知

阅读：9569|2022-11-06

3 亲亲特价:怎么删除回收站图标

今天小编告诉大家如何对win10系统删除桌面回收站图标进行设置，可能很多用户都不知道

阅读：8129|2022-11-06

4 济南大学虚拟社区:鲁大师节能降温的具体办

今天小编告诉大家如何对win10系统电脑设置节能降温的设置方法，想必大家都遇到过需要

阅读：8511|2022-11-06

5 xlueops.exe:无线网络安装向导

我们在使用xp系统的过程中,经常需要对xp系统无线网络安装向导设置进行设置，可能很多

阅读：8415|2022-11-06

6 女斗合众国:win7系统cf与主机连接不稳定怎

今天小编告诉大家如何处理win7系统玩cf老是与主机连接不稳定的问题，可能很多用户都不

阅读：9309|2022-11-06

7 0xc000022-[cf烟雾头]cf怎么调烟雾头

电脑对日常生活的重要性小编就不多说了，可是一旦碰到win7系统设置cf烟雾头的问题，很

阅读：8378|2022-11-06

8 qizideyouhuo:应用程序无法正常启动0xc0000

我们在日常使用电脑的时候，有的小伙伴们可能在打开应用的时候会遇见提示应用程序无法

阅读：7810|2022-11-06

9 ipz-185:win7系统vcf文件怎么打开

今天小编告诉大家如何对win7系统打开vcf文件进行设置，可能很多用户都不知道怎么对win

阅读：8364|2022-11-06

10 傻哥蹦迪:win10系统s4怎么打开usb调试

今天小编告诉大家如何对win10系统s4开启USB调试模式进行设置，可能很多用户都不知道怎

阅读：7362|2022-11-06

客服电话

电子邮件

【读书1】【2017】MATLAB与深度学习——正视过度拟合(1)

请发表评论

全部评论

上一篇：

下一篇：

librespeed/speedtest: Self-hosted Speedt

avehtari/BDA_m_demos: Bayesian Data Anal

四维彩超怎么看性别？四维看男孩女孩诀窍

CVE-2022-24659

medfreeman/markdown-it-toc-and-anchor: m

剪的笔顺,诠释剪的笔画,认识剪的部首

六六分期app的软件客服如何联系？(六六分期

florent37/ViewAnimator: A fluent Android

florent37/Shrine-MaterialDesign2: implem

CVE-2020-36276

SimpleSoftwareIO/simple-sms: Send and re

关于我们

产品与服务

解决方案

139-2527-9053