Description
- Module 1: Understand the fundamentals of querying datasets in Spark / Write the results back into HDFS using Spark
- Module 2: Write queries that calculate aggregate statistics / Load data from HDFS for use in Spark applications
- Module 3: Use metastore tables as an input source or an output sink for Spark applications / Filter data using Spark
- Module 4: Generate reports by using queries against loaded data / Produce ranked or sorted data
- Module 5: Perform standard extract, transform, load (ETL) processes on data using the Spark API / Join disparate datasets using Spark
- Module 6: Use Spark SQL to interact with the metastore programmatically in your applications / Read and write files in a variety of file formats