Runtime environment: spark-shell
val p=spark.read.json("file:///root/spark-2.1.1-bin-hadoop2.7/examples/src/main/resources/people.json")
p.show
Variance and standard deviation
1. Compute the average of age
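// Register the DataFrame as a temporary view so SQL can refer to it as "people"
p.createOrReplaceTempView("people")
import spark.sql
// avg() skips null ages and returns a Double
val avgvule = sql("select avg(age) from people").first().getDouble(0)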
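To make the goal of this section concrete: the population variance is the mean of the squared deviations from the average, and the standard deviation is its square root. Below is a minimal plain-Scala sketch of that arithmetic, independent of Spark. The names `AgeStats` and `sampleAges` are hypothetical, and the sample values assume the two non-null ages (30 and 19) in the people.json file shipped with Spark.

```scala
// Plain-Scala sketch of population variance and standard deviation.
// AgeStats and sampleAges are illustrative names, not part of Spark.
object AgeStats {
  // Assumed sample: the two non-null ages in Spark's example people.json
  val sampleAges: Seq[Double] = Seq(30.0, 19.0)

  /** Arithmetic mean; assumes xs is non-empty. */
  def mean(xs: Seq[Double]): Double = xs.sum / xs.size

  /** Population variance: mean of squared deviations from the mean. */
  def variance(xs: Seq[Double]): Double = {
    val m = mean(xs)
    xs.map(x => (x - m) * (x - m)).sum / xs.size
  }

  /** Population standard deviation: square root of the variance. */
  def stddev(xs: Seq[Double]): Double = math.sqrt(variance(xs))
}
```

For the assumed sample, `AgeStats.mean(AgeStats.sampleAges)` is 24.5, the variance is 30.25, and the standard deviation is 5.5. This divides by n (population form); a sample variance would divide by n - 1 instead.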
2. The UDF function
def sub