sparkApache Spark is a unified engine for large-scale data processing, with an interface for programming clusters with implicit data parallelism and fault tolerance. It supports variousSpark SQL JSON TPC-DS 1TB . TPC-DS 8 . . Spark