Apache Spark Essentials with Scala
Become an Apache Spark developer with our essentials course. Master the fundamentals of Apache Spark with Scala and big data through clear lessons, practical exercises, and a smooth learning curve. Ideal for those with some programming experience, this course will quickly equip you with essential skills to effectively tackle real-world big data challenges.
- Duration
- 10h of 4K content
- Lessons
- 31 lessons
By Daniel Ciocîrlan
Money-back guarantee · Unlimited access · Free updates
Course Roadmap
Skills You'll Learn
- Work with DataFrames for Spark jobs of any complexity
- Integrate any data source with Spark
- Perform DataFrame transformations, aggregations, joins, and unions
- Process absent values and perform data cleanup
- Deconstruct complex types like structures and arrays
- Manage type safety with DataFrames and Datasets
- Use RDDs for arbitrary Scala transformations
- Understand how Spark runs on a cluster
- Deploy Spark applications on Amazon
- Pick the right API level for any Spark job
Goal
Become a professional.
You probably know by now: Spark is the most popular computing engine for big data, the most maintained, and with a proven track record of performance. It’s 100 times faster than the old MapReduce paradigm, and can easily be extended with machine learning and streaming capabilities, and much more.
If you’re dealing with large amounts of data, learning Spark is a must.
The demand for Spark has skyrocketed, and companies are struggling to fill their Data Scientist positions. Scala and Spark are two of the best paying technologies in the field. Forget the reported 120k salaries on PayScale and StackOverflow. I regularly see engineers and data scientists working for 150k+ per year, or charging thousands a day for consulting.
However you take it, learning Spark will be a game changer for your career, if you choose. And this course will help you get those skills. Join this Spark Big Data online course and learn by writing code.
Work with real big data.
At the end of the course, we will dive into one of the biggest datasets publicly available and we’ll put everything that we’ve learned to the test. Unlike every other material on the web (free or paid), this Spark course is the only place where you can really practice big data.
Everyone else runs a Spark job on one million records in a 20MB dataset. Why would you need Spark for that?
We do 1.4 BILLION car trips in a 400GB dataset. You don’t fit that into most computers. That’s the definition of big data.
This will be your true exercise to practice Spark with Scala. At the end of the project, we gather data insights worth millions of dollars for the company you’re looking to help, and tens of thousands for you as a consultant and data scientist.
Take the proven path
As with the other Rock the JVM courses, this Spark and Big Data Essentials course will take you through a battle-tested path to Spark proficiency as a data scientist and engineer.
As always, I’ve:
- Deconstructed the complexity of Spark in bite-sized chunks that you can practice in isolation
- Selected the essential concepts and exercises with the appropriate complexity
- Sequenced the topics in increasing order of difficulty so that they “click” along the way
- Applied everything in live code
What Our Students Say
-
My team is expanding the use of Akka in our products so I needed a quick introduction on this topic. I have tried a couple of courses but the introduction to Akka was always too abrupt, too hard to comprehend. I blamed Akka for this as being too hard to explain. This was until I was exposed to the Rock The JVM courses which were an absolute delight when it comes to presenting such complex topics in such an easy to understand way. And Daniel has not stopped at Akka but has added to his portfolio amazing courses on Scala and Spark too. It seems like he is quite enjoying taking such challenges like complex technologies and making them so simple for everyone. I have instantly recommended Daniel’s work to my team, which helped them immensely with taking their skills to a new level, and I do recommend these courses to anyone who wants to have the fastest ramp-up in these tough but popular technologies.
Mihai FecioruAdobe · California
-
From Scala, to Akka, to Spark, Daniel delivers exceptional material in each and every one of these technologies. I’ve been using them for a long time and there is always something new I will discover from him. The level of detail he gets into as well as the way he delivers material is mindblowing. I personally find his latest course Spark Optimization pure gold and one of a kind. I’ve been using Spark for a year now and I haven’t even thought how much you can leverage query plans to make such optimizations. I can’t stop thinking every time, how he manages to go so deep - because using a technology is one thing, but knowing its internals so well and how everything works behind the scenes is another story when it comes to distributed systems. Long story short Daniel is definitely the best instructor I’ve come across and each one of his courses is the best resource you can find online. Kudos for all your work and knowledge sharing.
Giannis PolyzosVerverica · Greece
-
Daniel’s courses on Scala and Big Data are the best in class. I’ve been in touch with Daniel’s teaching and courses since early 2018. The first course that I took from him was Scala & Functional Programming; I was skeptical about it because over the internet there are many courses you can find, but few really worthy. I remember the very first day when Daniel started to speak and shared his examples - I started to love Scala, and then more as we went on. I am with Scala for the last 5 years now, but never ever has anyone explained to me or gave me comparable resources to Rock the JVM. Daniel gave me a shift in life and helped me crack top tech company interviews. His courses on big data are a must for any aspiring big data developer or data enthusiast. I highly recommend Daniel as an educator both online and on campus.
Anirban GoswamiApple · California
What's Included
Meet Rock the JVM
Daniel Ciocîrlan
Founder, Rock the JVM
I'm a software engineer and the founder of Rock the JVM.
I started Rock the JVM out of love for Scala and the technologies it powers. They are amazing tools, and I want to share as much of my experience with them as I can.
I've taught Java, Scala, Kotlin and related technologies such as Cats, ZIO and Spark to 100,000+ students at various levels. I've held live training sessions for companies including Adobe and Apple, taught university students who now work at Google and Facebook, run Hour of Code for 7-year-olds, and taught more than 50,000+ kids to code.
I have a Master's Degree in Computer Science and I wrote my Bachelor and Master thesis on Quantum Computation. Before learning programming, I won medals at international Physics competitions.
Enroll now!
All-Access Membership
Full (and growing) catalog
$195 billed yearly —Save 54%
Unlimited access to every Rock the JVM course
- 348 hours of 4K content
- All Scala courses
- All Kotlin courses
- All Typelevel courses
- All ZIO courses
- All Apache Spark courses
- All Apache Flink courses
- All Akka/Pekko courses
- Access to the private Rock the JVM community
- New courses included automatically
The Apache Spark Bundle with Scala
4 courses, one price
$180All courses in this bundle with a one-time payment
- 4 courses included
- 38 hours of 4K content
- All PDF slides
- Free updates
- Lifetime access
- Access to the private Rock the JVM community
Apache Spark Essentials with Scala
Lifetime license
$75Just this course with a one-time payment
- 10 hours of 4K content
- All PDF slides
- Free updates
- Lifetime access
- Access to the private Rock the JVM community
100% Money Back Guarantee
If you're not happy with this course, I want you to have your money back. Contact me with a copy of your welcome email and I will refund you.
Less than 0.05% of students have ever asked for a refund — and every payment was returned in under 72 hours.