Baquara / sparkStudy

sparkStudy

  1. Apache Spark is an open-source framework for processing and analyzing large amounts of data across a cluster. It is not a storage system; it typically reads from and writes to external stores such as HDFS or Amazon S3.

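  2. Spark consists of Spark Core and a set of libraries built on top of it, including Spark SQL, Structured Streaming, MLlib, and GraphX.

  3. Spark Core provides task scheduling, memory management, and the RDD API for general-purpose distributed processing, while Spark SQL adds DataFrames and SQL queries for working with structured data.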

  4. Spark scales horizontally across clusters of commodity hardware, which makes it cost-effective for processing big data.

  5. Spark is used by many companies and organizations, such as Yahoo, Netflix, and Uber.
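
To make the difference between Spark Core and Spark SQL in the notes above more concrete, here is a minimal sketch that counts words twice: once with the RDD API and once with a SQL query. It assumes PySpark is installed (`pip install pyspark`) and runs in local mode. The function names (`tokenize`, `word_count_core`, `word_count_sql`, `run_demo`), the view name `lines`, and the sample sentences are made up for illustration and are not part of this repository.

```python
from operator import add


def tokenize(line):
    """Split a line into lowercase words (plain Python, no Spark involved)."""
    return line.lower().split()


def word_count_core(spark, lines):
    """Word count with the Spark Core RDD API (flatMap / map / reduceByKey)."""
    rdd = spark.sparkContext.parallelize(lines)
    counts = rdd.flatMap(tokenize).map(lambda word: (word, 1)).reduceByKey(add)
    return dict(counts.collect())


def word_count_sql(spark, lines):
    """The same word count expressed as a Spark SQL query over a temp view."""
    df = spark.createDataFrame([(line,) for line in lines], ["line"])
    df.createOrReplaceTempView("lines")  # hypothetical view name
    rows = spark.sql(
        """
        SELECT word, COUNT(*) AS n
        FROM (SELECT explode(split(lower(line), ' ')) AS word FROM lines) t
        WHERE word <> ''
        GROUP BY word
        """
    ).collect()
    return {row["word"]: row["n"] for row in rows}


def run_demo():
    """Start a local SparkSession and run both variants on sample data."""
    # Imported here so the helpers above can be loaded without Spark installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder.master("local[*]")
        .appName("sparkStudy-sketch")
        .getOrCreate()
    )
    try:
        sample = ["Spark Core processes data", "Spark SQL queries data"]
        print(word_count_core(spark, sample))
        print(word_count_sql(spark, sample))
    finally:
        spark.stop()
```

Calling `run_demo()` should print two dictionaries of word counts, one from each API. Both functions describe the same computation; the RDD version spells out each transformation, while the SQL version leaves the execution plan to Spark's optimizer.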
