2018北京积分落户数据,用pyspark、pyecharts大数据可视化分析,按用户分数分析

2018北京积分落户数据,用pyspark、pyecharts大数据可视化分析,按用户分数分析。

#导入积分落户人员名单数据
df = spark.read.csv('jifenluohu.csv', header='true', inferSchema='true')
df.cache()
df.createOrReplaceTempView("jflh")
#df.show()
spCount = agecount = spark.sql("select int(score) as name,count(*) as ct from jflh group by int(score) order by int(score) asc").collect()
name = [row.name for row in spCount]
count = [row.ct for row in spCount]

#图表展示
from pyecharts import Bar
bar = Bar("2018北京积分落户用户数据分析", "按用户分数汇总统计用户数量")
bar.add("用户数量", name, count)
bar

2018北京积分落户数据,用pyspark、pyecharts大数据可视化分析,按用户分数分析