The Apache Spark Bundle with Scala

Everything you need to become an expert Apache Spark developer. Master Spark with Scala and big data through clear lessons, practical exercises, and a smooth learning curve. Ideal for those with some programming experience, this bundle will take you from zero to hero.

Goal

Crunch that data.

You probably already know: Spark is one of the most powerful big data crunching tools, one of the most extensible and best maintained, one of the most loved, and it is used by thousands of developers, teams, companies and organizations. It can run up to 100x faster than the older MapReduce paradigm, and it just works.

Spark has seen huge demand in recent years, has some of the best-paid engineering positions, and is just plain fun. This series is for Scala programmers who need to crunch big data with Spark, and need a clear path to mastering it.

This bundle contains

  • 100+ live runnable examples

  • 100+ exercises with solutions

  • 3 real-life projects crunching either one of the largest public datasets available, or live data in real time

  • 4000+ lines of code written from scratch, more than most engineers write in months

  • (I have exactly 4109 lines on camera, which I’ll share with you 100%)

  • Access to a growing private community on Slack

  • ~25 hours of HD content (soon downloadable)

  • (Soon) takeaway slides

  • All updates and additional material for free, for life

“Show me the path.”

If you came here thinking, “I want to learn Spark, just show me a step-by-step way to do it, and I’ll follow”, this is it. There’s little else for me to say. More than 2000 people have taken this journey, and here is what they say:

This is the connecting dots video series for get over all concepts of Spark process which is extremely useful for the peoples just started learning spark. Thank you Daniel (Anoop C., March 11, 2020)

There are hundreds of reviews like that, and that’s on Udemy. The version you’re looking at is even better and with exclusive new material. You will crunch BILLIONS of records on an Amazon cluster, do sentiment analysis on live Twitter data, integrate Apache Kafka, PostgreSQL, Cassandra, Akka Streams, and whatever your heart desires.

This path is literally what you need to follow to be good at Spark.

Skills You'll Learn

After completing this Spark masterclass, you’ll be equipped to:

  • Work with Apache Spark in projects of any size and complexity
  • Process data of any size
  • Deploy Spark applications on clusters of any size
  • Optimize Spark jobs for ultimate performance
  • Optimize Spark clusters for minimal resource usage
  • Lead other data engineers on Spark best practices
  • Understand Spark at a deep level
  • Work with batch and streaming jobs equally well

Many of the ideas from these courses - especially the Spark optimization courses - have saved millions of dollars in cloud costs for many companies. You can have these skills for a fraction of the cost of a corporate training package.

Apache Spark features we cover:

  • Spark basics
    • how Spark parallelizes data processing
    • transformations and actions
  • DataFrames
    • working with data sources
    • columns and expressions
    • aggregations
    • joins
    • grouping
  • Spark types
    • schemas
    • handling nulls
    • complex and structured types
    • type-safe Datasets in Scala
  • Spark SQL
  • RDDs
  • Operational Spark
    • how Spark runs on a cluster
    • understanding jobs
    • navigating the Spark UI
    • logging and debugging
  • Spark Structured Streaming
    • micro-batching
    • streaming DataFrames
    • transformation restrictions
    • stream-native processing
    • stateful stream processing
    • time-based transformations
  • Spark integrations with data sources and sinks
  • Spark optimization
    • understanding narrow/wide transformations
    • tradeoffs between DataFrames and RDDs
    • predicting performance
    • query plans
    • reading and understanding DAGs
  • Performance optimization techniques
    • optimizing joins
    • broadcast joins
    • bucketing and partitioning
    • data pruning
    • fixing data skews
  • Adaptive Query Execution (AQE)
  • Low-level Spark optimization
    • co-grouping
    • RDD optimizations
    • iterator-to-iterator transformations
  • Spark performance tuning
    • understanding and configuring Spark internals
    • caching
    • checkpointing
    • memory and CPU tuning
    • partition tuning
    • cluster resource allocation
    • tuning serialization

In short, everything you need to become an Apache Spark master.

Additional perks:

  • Full access to all the code written during the courses
  • Full access to the data we use during the projects
  • Invitation to join a private Slack community, where you’ll get exclusive updates, opportunities, and discussions
  • Access to all slides for easy reference
  • Free updates and expansions, no expiration date

Meet Rock the JVM

Daniel Ciocîrlan

I'm a software engineer and the founder of Rock the JVM. I started the Rock the JVM project out of love for Scala and the technologies it powers - they are all amazing tools and I want to share as much of my experience with them as I can.

As of February 2024, I've taught Java, Scala, Kotlin and related tech (e.g. Cats, ZIO, Spark) to 100000+ students at various levels, and I've held live training sessions for some of the best companies in the industry, including Adobe and Apple. I've also taught university students who now work at Google and Facebook (among others), held Hour of Code for 7-year-olds, and taught more than 35000 kids to code.

I have a Master's Degree in Computer Science and I wrote my Bachelor's and Master's theses on Quantum Computation. Before starting to learn programming, I won medals at international Physics competitions.

Included Courses

  • Apache Spark Essentials with Scala

    Become an Apache Spark developer by mastering the essentials of Apache Spark with Scala and big data with our comprehensive, hands-on course

  • Apache Spark Streaming with Scala

    Master Apache Spark Streaming with Scala: process massive data as it arrives, integrate with Kafka, JDBC, Cassandra, and more – handle live data streams effortlessly

  • Apache Spark Optimization with Scala

    Write performant code: master Apache Spark with Scala's tools and techniques to make your applications run blazing fast and learn the strategies used by top developers

  • Apache Spark Performance Tuning with Scala

    Optimize Apache Spark with Scala for peak performance: master Spark internals and configurations to achieve maximum speed and memory efficiency for your cluster

Take this course now!

The Apache Spark Bundle with Scala - Lifetime License

All of the courses in this bundle with a one-time payment

  • 38 hours of 4K content
  • 6000 lines of code written
  • All PDF slides
  • Access to the private Rock the JVM community
  • Free updates
  • Lifetime access

All-Access Membership

Monthly subscription

All of the Rock the JVM courses

  • 320 hours of 4K content
  • 60660 lines of code written
  • All Scala courses
  • All Kotlin courses
  • All ZIO courses
  • All Typelevel courses
  • All Apache Flink courses
  • All Apache Spark courses
  • All Akka/Pekko courses

100% Money Back Guarantee

If you're not happy with this course, I want you to have your money back. If that happens, contact me with a copy of your welcome email and I will refund the course.

Fewer than 0.05% of students across the entire site have requested a refund, and every payment was returned in less than 72 hours.

FAQ