Purpose of training
The main goal of the training is to acquire practical skills in Apache Spark technology and the Scala language. Participants will learn how to create and debug Spark jobs that process large amounts of data. The issue of performance and optimization will also be discussed. Workshops are an important part of the training.
Training plan:
Day 1
- Review of Big Data solutions with the Apache family
- Scala for Big Data
- Scala vs Java
- var vs val
- Case Class, Traits, Abstract Class, Tuple
- Lazy evaluation
- String interpolation
- Pattern Matching (with covers)
- Accompanying object
- Collections and transformations
- For comprehension, mapping
- Error handling (Try / Either / Option)
- Option
- Implicits
- Practical workshops - Scala