王茂霖:数据挖掘提分三板斧!(附PPT下载)
作者:王茂霖,华中科技大学,Datawhale成员 来源:Datawhale 本文多图,建议阅读10+分钟 本文作者与你分享数据挖掘的三把利器。
内容概括
公众号(DatapiTHU)后台回复“20210420”获取完整PPT下载
Part 1 数据清洗和特征工程
![](https://filescdn.proginn.com/fe5cbff55545709ea76c43513b9c4d9b/d357d2eba5862cdce27533418e732652.webp)
![](https://filescdn.proginn.com/082398a20e3d612489edf2afe3518988/01b87679724c46004158900d96295f7c.webp)
![](https://filescdn.proginn.com/981e7bf55ae9ca4a551f7802f500b3b7/c7289fc327f13d27a1fa98ea6f40224a.webp)
![](https://filescdn.proginn.com/f447d5df645cf7293ab59849742d7a79/2b8b857e6ef3cbd2dd361fc42d9665b1.webp)
![](https://filescdn.proginn.com/c903ee449899768a4dcae2e18fe5add9/ead0f606f3ef7ab9f4994b79f11b75aa.webp)
在回归预测中,标准化是为了让特征值有均等的权重;
在训练神经网络的过程中,通过将数据标准化,能够加速权重参数的收敛;
主成分分析中,需要对数据进行标准化处理;默认指标间权重相等,不考虑指标间差异和相互影响。
![](https://filescdn.proginn.com/525466700916fc4a1aa05fd0e0793103/890f4cbd597e3ed78d0963505d648cff.webp)
![](https://filescdn.proginn.com/574784fe357dd710faaf35c193557320/94455a0cc86f1bc571035ac7129e0fc6.webp)
![](https://filescdn.proginn.com/4e1e2b01ec4483420a2025020dc2b64a/84e879508af5eb7be71667840818a53c.webp)
![](https://filescdn.proginn.com/1bda042f04d114cf2881329d6a27478c/8c6d11881435c024b2d4af02fefbfc1f.webp)
![](https://filescdn.proginn.com/0045c7a54e9af4761117c63a2f752dbe/e5807185b2a9dfd290bfa9340831af35.webp)
Part 2 模型参数调节
![](https://filescdn.proginn.com/79a78b0b0ecd313145258ecc5e98ecdb/95cbfb0b27eecd476593adaa92dce2ba.webp)
![](https://filescdn.proginn.com/6b67875d8ea75b1c4f9225bfd2f08416/8ac8963a48dd96cf90d7269859077475.webp)
![](https://filescdn.proginn.com/8916271020880a9798fbbb2e83aef7ac/74abe23ec54f29489545b38397ff1b5a.webp)
![](https://filescdn.proginn.com/f1e6d2194bc223e4db1a82d177df4b36/266fce164fcbd31b5fb0452a5c11f0e5.webp)
![](https://filescdn.proginn.com/0549f2d9707932349b6ebd96d43a4c48/e84bbf67d1b230c0aacfb307f9854364.webp)
Part 3 模型集成
![](https://filescdn.proginn.com/b1bec315223cfb2d07a8d80b9911fd51/9b823e87b095919a209e149ecff205ea.webp)
![](https://filescdn.proginn.com/966657e7f30264ba6431590abe4c5b84/259a42d0475d81932839883e83147776.webp)
![](https://filescdn.proginn.com/3c6ad1fa4567fa0abed12f3a068794a7/e964fdfd3446813c4bc9951467aaccf4.webp)
![](https://filescdn.proginn.com/0265d7e995a442a93edbfc0cdf609f26/c27d2bce4a7d85fbc4e5898e204538f0.webp)
![](https://filescdn.proginn.com/ebfd45a897e05673d0056e80a813786e/f5f04ddc908e937d33e52ded11bfd7a2.webp)
![](https://filescdn.proginn.com/e13c7797fc5f4f612e7475ea7d07f625/0f3c8e77d996ba88ad7fa7ff9184c70e.webp)
本文作者
访问下方地址或点击"阅读原文"查看分享:
学习地址:https://tianchi.aliyun.com/course
![](https://filescdn.proginn.com/13849c2a8d8f337d9099bd4026bbb799/3e0cace0ac4d54a632a627bb9ad71ef7.webp)
评论