The Apache Spark Certification Training Course is designed to give you the knowledge and skills to work as a Big Data and Spark developer.
Understanding Spark basics - Overview of Big Data and Spark, Installing Spark, Distributed data processing system, Spark shell.
Writing Spark applications, Spark algorithms, Spark's core APIs in Scala/Java or in Python, Spark's architecture and developer API, Predictive analytics based on MLlib, Clustering with KMeans, Building classifiers, Modeling, Visualization techniques (matplotlib, ggplot2, D3, etc.).
Streaming architecture: How DStreams break down into RDD batches, Receivers running inside Executor task slots, Kafka, Multiple receivers, Union transformation, Sliding window operations on DStreams, Stateless transformations, Stateful transformations, Window transformations, Output operations, Persistence.
Resilient Distributed Datasets (RDDs) - Narrow vs. Wide dependencies, Types of RDDs (HadoopRDD, MappedRDD, FilteredRDD, CassandraRDD, SchemaRDD, etc.), The preservesPartitioning parameter, Broadcast variables, Accumulators, RDD operations - Transformations on RDDs, Actions on RDDs, Loading data into RDDs, Key-value pairs.
Spark SQL - Combining SQL, machine learning, and streaming for unified pipelines; Data transformation techniques, Loading data, Hive queries through Spark, Spark applications, SQL library, Support for JSON and Parquet file formats.
Define and explain Spark Streaming
Understand RDDs and their operations, along with the implementation of Spark algorithms
Understand the difference between Apache Spark and Hadoop
Learn the concept of Scala classes and apply pattern matching
Who Should Attend?
Engineering and IT students
Graduates with a programming background
Senior Database Engineer
Data Intelligence – Spark
Lead Solution Advisor - Apache Spark
After completing this course and successfully passing the certification examination, the student will be awarded the “Big data with Spark” certification.
If a learner chooses not to take the examination, they will still receive a 'Participation Certificate'.
Course Features:
Mode Of Delivery:
Valid for 6 months post activation