1 Indirect Performance Enhancements 间接
1 Scala Versus Java Versus Python Versus R
2 DataFrame Versus SQL versus Datasets versus RDDS
2 Object Serialization in RDDs RDD对象的序列化
1 Cluster/Application sizing and sharing 资源共享
1 File-based long-term data storage 基于文件存储的长期数据
2 Splittable file types and compression 可分割文件类型和压缩
7 Memory Pressure and Garbage Collection
2 Measuring the impact of garbage collection
2 Direct Performance Enhancements 直接
3 Repartitioning and Coalescing
5 Temporary Data Storage Caching
<