I. Installing Spark
2. Downloading Spark
II. Python programming exercise: word frequency count for English text
1. Prepare the text (f1.txt)
Please send this message to those people who mean something to you,to those who have touched your life in one way or another,to those who make you smile when you really need it,to those that make you see the brighter side of things when you are really down,to those who you want to let them know that you appreciate their friendship.And if you don’t, don’t worry,nothing bad will happen to you,you will just miss out on the opportunity to brighten someone’s day with this message.
2. Insert the code
path = '/home/hadoop/sb/f1.txt'
with open(path) as f:
    text = f.read()

# Split on whitespace and count each token
words = text.split()
sb = {}
for word in words:
    sb[word] = sb.get(word, 0) + 1

# Sort (word, count) pairs by count, highest first
sblist = list(sb.items())
sblist.sort(key=lambda x: x[1], reverse=True)
print(sblist)
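Because the code above splits only on whitespace, a token such as "you,to" or "friendship.And" in f1.txt is counted as a single word, and "Please" and "please" would be counted separately. The following is a minimal sketch of one possible variant, not part of the original exercise, that lowercases the text and extracts runs of letters before counting. The function name count_words and the choice to treat both straight and curly apostrophes as part of a word are assumptions made for illustration.

```python
import re
from collections import Counter


def count_words(text):
    """Return (word, count) pairs sorted by descending count.

    count_words is a hypothetical helper, not from the original post.
    """
    # Lowercase first, then pull out runs of letters and apostrophes,
    # so "you,to" yields "you" and "to" instead of one token.
    words = re.findall(r"[a-z'’]+", text.lower())
    # Counter.most_common() sorts by count, highest first.
    return Counter(words).most_common()


if __name__ == "__main__":
    # Short sample in the style of f1.txt; replace with the file
    # contents read via open(path) as in the code above.
    sample = "to those who make you smile.And if you don’t, don’t worry"
    print(count_words(sample))
```

To run this on the real file, the text read from /home/hadoop/sb/f1.txt can be passed to count_words in place of the sample string.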
3. Output