An Illustrated Guide to Pandas: 18 Moves!
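Pandas is one of the most common tools in data mining, and knowing its core functions well matters. This article explains a range of Pandas operations by visualizing what each one does to the data.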
sort_values
(dogs[dogs['size'] == 'medium']
.sort_values('type')
.groupby('type').median(numeric_only=True)
)
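Steps:
1. Filter the rows on the size column, keeping only the medium dogs.
2. Sort the remaining rows by the type column.
3. Group by the type column and compute the median of each group.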
![](https://filescdn.proginn.com/c4ebd1c7ff7a6a56d8cf05260a7ca02e/f95d684224bda3979decbfef4d7df2b9.webp)
![](https://filescdn.proginn.com/da206afd537a7841e0faa1a5501cc3af/990c0dc9a3d8688d95ee96f61de5111b.webp)
![](https://filescdn.proginn.com/bea524caa8a50e262d281c64e97dd58a/cbb7cff449b591bd519ab5cde789542d.webp)
![](https://filescdn.proginn.com/91f32f6afa9b568cffcfe2518243570e/fd4f678df9db7e09ff7973965aaee0dd.webp)
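The three steps above can be traced one at a time. This is a minimal sketch: the rows in `dogs` are made-up sample data (only the column names come from the article), and the median is restricted to the numeric `longevity` column.

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "breed": ["Labrador", "Brittany", "Bulldog", "Boxer", "Beagle"],
    "type": ["sporting", "sporting", "non-sporting", "working", "hound"],
    "size": ["medium", "medium", "medium", "medium", "small"],
    "longevity": [12.0, 13.0, 6.0, 9.0, 12.5],
})

medium = dogs[dogs["size"] == "medium"]                  # step 1: filter rows
ordered = medium.sort_values("type")                     # step 2: sort by type
result = ordered.groupby("type")["longevity"].median()   # step 3: group + median

print(result)
```

The small Beagle is dropped in step 1, so no `hound` group appears in the result.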
selecting a column
dogs['longevity']
![](https://filescdn.proginn.com/224a4752b3a6c98b61832fa5426376ca/aec2e0e1ecffa6b578a201fed843a79b.webp)
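One detail worth checking by hand is the return type: single brackets give a Series, while a list of names gives a DataFrame. A small sketch with made-up rows:

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "breed": ["Labrador", "Beagle", "Bulldog"],
    "longevity": [12.0, 12.5, 6.0],
})

as_series = dogs["longevity"]    # single brackets -> pandas Series
as_frame = dogs[["longevity"]]   # list of names -> one-column DataFrame

print(type(as_series).__name__, type(as_frame).__name__)
```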
groupby + mean
dogs.groupby('size').mean(numeric_only=True)
Steps:
1. Split the rows into groups by size.
2. Aggregate within each group, here by taking the mean.
![](https://filescdn.proginn.com/159fbed88556eb725bebb1a717a4a2e8/c252d68d0dfd0907960a50bdcd7f9128.webp)
![](https://filescdn.proginn.com/fa8676062b51536b60cacea4938f23f9/c7d44afeaf0dc189808b394cd5b91553.webp)
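A sketch of the split-then-aggregate idea on made-up rows. `numeric_only=True` is passed so that text columns such as `breed` are skipped rather than averaged; recent pandas versions raise an error on them otherwise.

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "breed": ["Labrador", "Bulldog", "Beagle", "Chihuahua"],
    "size": ["medium", "medium", "small", "small"],
    "height": [60.0, 40.0, 38.0, 20.0],
    "longevity": [12.0, 6.0, 12.5, 16.5],
})

# One output row per distinct size, one column per numeric input column.
means = dogs.groupby("size").mean(numeric_only=True)
print(means)
```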
grouping multiple columns
dogs.groupby(['type', 'size'])
![](https://filescdn.proginn.com/f0da84191d417e630e904072b9081c5c/6a72545112d421f75347d6e4b8affa6c.webp)
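On its own, `groupby` with a list of columns only builds a lazy grouping object; nothing is computed until an aggregation is applied. The sketch below (made-up rows) shows the grouping and then one possible aggregation, whose result is indexed by both keys.

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "type": ["sporting", "sporting", "hound", "hound"],
    "size": ["medium", "medium", "small", "medium"],
    "height": [60.0, 50.0, 38.0, 64.0],
})

grouped = dogs.groupby(["type", "size"])   # lazy: no result yet
print(grouped.ngroups)                     # number of distinct (type, size) pairs

heights = grouped["height"].mean()         # Series with a two-level index
print(heights)
```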
groupby + multi aggregation
(dogs
.sort_values('size')
.groupby('size')['height']
.agg(['sum', 'mean', 'std'])
)
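Steps:
1. Sort the data by the size column.
2. Group the rows by size.
3. Compute the sum, mean, and standard deviation of height within each group.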
![](https://filescdn.proginn.com/e26fcd38360e8cd691481962cb4247b0/4757f6cbe339863883a494a24a1936d5.webp)
![](https://filescdn.proginn.com/416a68142a361e96764a0f8b26896851/b262ced0de4fd065c31839bfb9080a67.webp)
![](https://filescdn.proginn.com/e88e9a25780eb9dab7604c835bbd54dc/65518dab6bfd8a0352fc2e9a149185d6.webp)
![](https://filescdn.proginn.com/6f167e2723687e8b294c1dca2c4be41d/c87d309e6161e429dabd8da046e2dd46.webp)
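Passing a list of function names to `agg` yields one output column per function. A minimal sketch on made-up heights, chosen so the numbers are easy to verify by hand:

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "size": ["small", "medium", "small", "medium"],
    "height": [10.0, 50.0, 14.0, 60.0],
})

stats = (dogs
         .sort_values("size")
         .groupby("size")["height"]
         .agg(["sum", "mean", "std"]))   # columns: sum, mean, std
print(stats)
```

`groupby` sorts the group keys by default, so the preliminary `sort_values` mainly changes the order of rows inside each group.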
filtering for columns
df.loc[:, df.loc['two'] <= 20]
![](https://filescdn.proginn.com/b00d97b9b474f5ea2a7551d2c53f0d5b/a9a34eb6cf3ce9c40f25cce590a26ebb.webp)
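Here the condition is evaluated on a row, which produces a boolean mask over the columns. A sketch with a hypothetical `df` whose row labels include `'two'`:

```python
import pandas as pd

# Hypothetical frame; the row label 'two' matches the article's snippet.
df = pd.DataFrame(
    {"A": [1, 5, 9], "B": [10, 20, 30], "C": [25, 35, 45], "D": [3, 15, 60]},
    index=["one", "two", "three"],
)

mask = df.loc["two"] <= 20    # boolean Series indexed by column label
result = df.loc[:, mask]      # keep all rows, only columns where mask is True
print(result)
```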
filtering for rows
dogs.loc[(dogs['size'] == 'medium') & (dogs['longevity'] > 12), 'breed']
![](https://filescdn.proginn.com/84935020efdc26d540080ecf9aef7e16/31ca9e13ffdc7018dbf639f6f3bb64ad.webp)
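Each condition needs its own parentheses because `&` binds more tightly than `==` and `>`. A sketch on made-up rows, building the mask separately so it can be inspected:

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "breed": ["Labrador", "Bulldog", "Border Collie", "Chihuahua"],
    "size": ["medium", "medium", "medium", "small"],
    "longevity": [12.0, 6.0, 12.5, 16.5],
})

# Row-wise AND of two boolean Series.
mask = (dogs["size"] == "medium") & (dogs["longevity"] > 12)
breeds = dogs.loc[mask, "breed"]   # matching rows, 'breed' column only
print(breeds)
```

The Labrador is excluded because its longevity is exactly 12, not greater than 12.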
dropping columns
dogs.drop(columns=['type'])
![](https://filescdn.proginn.com/8827cfb4b5c1c253c4c5da3253ef495d/4bfd34036ac2483ccb4a40e6359672a5.webp)
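`drop` returns a new DataFrame and leaves the original untouched, which is easy to confirm on a small made-up frame:

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "breed": ["Labrador", "Beagle"],
    "type": ["sporting", "hound"],
    "size": ["medium", "small"],
})

trimmed = dogs.drop(columns=["type"])   # new frame without 'type'
print(list(trimmed.columns))
print(list(dogs.columns))               # original still has 'type'
```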
joining
ppl.join(dogs)
![](https://filescdn.proginn.com/2e8ed6d15496457579b23a7e82293a74/031fcd3ab1e35f51e78ddadcfd9aca4f.webp)
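By default, `join` lines rows up by index label, not by any column's values, and keeps every row of the left frame. The sketch below uses made-up `ppl` and `dogs` frames with deliberately mismatched ordering to make that visible.

```python
import pandas as pd

# Made-up frames; names follow the article, the contents are assumptions.
ppl = pd.DataFrame({
    "name": ["Ann", "Bob", "Cat"],
    "likes": ["Beagle", "Boxer", "Pug"],
})
dogs = pd.DataFrame({
    "breed": ["Boxer", "Beagle"],
    "size": ["medium", "small"],
})

joined = ppl.join(dogs)   # left join on the index labels 0, 1, 2
print(joined)
```

Ann likes Beagles but is paired with the Boxer row, because both sit at index 0; Cat has no partner at index 2 and gets missing values. If the two frames shared a column name, `join` would need `lsuffix` or `rsuffix` to disambiguate.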
merging
ppl.merge(dogs, left_on='likes', right_on='breed', how='left')
![](https://filescdn.proginn.com/2d1cdea8c1f47d97ad0cc9ca954a53cc/4bef47ad5d7b90626028e4b4f7832292.webp)
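Unlike the index-based `join`, this `merge` matches rows by value: `likes` on the left against `breed` on the right. A sketch on made-up frames, where `how='left'` keeps people whose favourite breed is absent from `dogs`:

```python
import pandas as pd

# Made-up frames; names follow the article, the contents are assumptions.
ppl = pd.DataFrame({
    "name": ["Ann", "Bob", "Cat"],
    "likes": ["Beagle", "Boxer", "Pug"],
})
dogs = pd.DataFrame({
    "breed": ["Boxer", "Beagle"],
    "size": ["medium", "small"],
})

merged = ppl.merge(dogs, left_on="likes", right_on="breed", how="left")
print(merged)
```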
pivot table
dogs.pivot_table(index='size', columns='kids', values='price')
![](https://filescdn.proginn.com/e1432bcced8d1e297aa57e31ba70e521/aa7c1470513532e288ae5707cc3c6cd8.webp)
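`pivot_table` aggregates whenever several rows land in the same cell, using the mean by default. A sketch on made-up prices, with two medium/high rows to show the averaging:

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "size": ["medium", "medium", "small", "small"],
    "kids": ["high", "high", "high", "low"],
    "price": [800.0, 1000.0, 500.0, 600.0],
})

# Rows: size, columns: kids, cell: mean price of the matching rows.
table = dogs.pivot_table(index="size", columns="kids", values="price")
print(table)
```

The medium/low cell is missing because no row has that combination.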
melting
dogs.melt()
![](https://filescdn.proginn.com/32a71a3b01eb98556b7b23bb277e24a5/b017217b602b639cfa545b0d636c8d67.webp)
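Called with no arguments, `melt` unpivots every column into `variable`/`value` pairs. In practice an identifier column is often kept with `id_vars`; that variant is an addition here, not something the article shows. Both on made-up rows:

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "breed": ["Beagle", "Boxer"],
    "height": [38.0, 60.0],
    "longevity": [12.5, 9.0],
})

long_all = dogs.melt()                      # every column is unpivoted
long_by_breed = dogs.melt(id_vars="breed")  # 'breed' stays as an identifier

print(long_all.shape, long_by_breed.shape)
```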
pivoting
dogs.pivot(index='size', columns='kids')
![](https://filescdn.proginn.com/e2b9e8828205358b86b66b7731c5d4f9/53f9be61ffc1e2039d3b7628a0026c5a.webp)
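`pivot` only reshapes; it does not aggregate, so each index/column pair must occur at most once. The sketch below (made-up rows, with `values='price'` added for a flat result) shows a successful pivot and the error raised when a pair is duplicated.

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "size": ["medium", "medium", "small", "small"],
    "kids": ["high", "low", "high", "low"],
    "price": [800, 900, 500, 600],
})

wide = dogs.pivot(index="size", columns="kids", values="price")
print(wide)

# Repeat the first row so that (medium, high) appears twice.
dup = pd.concat([dogs, dogs.iloc[[0]]], ignore_index=True)
try:
    dup.pivot(index="size", columns="kids", values="price")
    raised = False
except ValueError:
    raised = True   # duplicates call for pivot_table instead
print(raised)
```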
stacking column index
dogs.stack()
![](https://filescdn.proginn.com/0ca07b3d33e4c367917f4d5f7c9e8ae4/ae9265625008a5fd0129dc557f6d56a1.webp)
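`stack` moves the column labels into a new innermost level of the row index, turning a wide frame into a long Series. A sketch on a tiny hypothetical frame:

```python
import pandas as pd

# Hypothetical frame, small enough to follow each value.
df = pd.DataFrame({"x": [1, 2], "y": [3, 4]}, index=["a", "b"])

stacked = df.stack()   # index becomes (row label, column label)
print(stacked)
```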
unstacking row index
dogs.unstack()
![](https://filescdn.proginn.com/944f6d70bd03b459571a9bd4e1d38b15/ca08f4e0c30c6cdfa08b3730ff1bb7fe.webp)
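`unstack` goes the other way: it moves the innermost level of the row index into the columns. The sketch assumes a frame indexed by both `size` and `kids`, which is a made-up setup for illustration.

```python
import pandas as pd

# Made-up rows, indexed by two levels so there is something to unstack.
dogs = pd.DataFrame({
    "size": ["small", "small", "medium", "medium"],
    "kids": ["high", "low", "high", "low"],
    "price": [500, 600, 800, 900],
}).set_index(["size", "kids"])

wide = dogs.unstack()   # 'kids' moves from the row index to the columns
print(wide)
```

The resulting columns are two-level, pairing `price` with each `kids` value.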
resetting index
dogs.reset_index()
![](https://filescdn.proginn.com/ce0579a8d5e8e8dbe6bda39b457d29fc/cd7a0c319ec33e8c6a5c3896d45f2534.webp)
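`reset_index` turns the current index back into an ordinary column and installs a default 0..n-1 index. A sketch on a made-up frame indexed by `breed`; `drop=True` discards the old index instead of keeping it.

```python
import pandas as pd

# Made-up frame whose index is named 'breed'.
dogs = pd.DataFrame(
    {"longevity": [12.5, 9.0]},
    index=pd.Index(["Beagle", "Boxer"], name="breed"),
)

flat = dogs.reset_index()               # 'breed' becomes a regular column
dropped = dogs.reset_index(drop=True)   # old index is thrown away

print(list(flat.columns), list(dropped.columns))
```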
setting index
dogs.set_index('breed')
![](https://filescdn.proginn.com/3e84c6796a2bab560df07cb99746d1be/221d4062b3c66b6f0afc9ad2fe98f6d7.webp)
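`set_index` promotes a column to the row index, which allows label-based lookups with `.loc`. A sketch on made-up rows:

```python
import pandas as pd

# Made-up sample rows; only the column names follow the article.
dogs = pd.DataFrame({
    "breed": ["Beagle", "Boxer"],
    "longevity": [12.5, 9.0],
})

indexed = dogs.set_index("breed")             # returns a new frame
boxer_age = indexed.loc["Boxer", "longevity"] # lookup by breed label
print(boxer_age)
```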
Source: https://pandastutor.com/index.html
If anything is unclear, try running the examples yourself.