Spark Essentials with Scala
The only Big Data online course on the web with actual big data.
Become a professional.
Your probably know by now: Spark is the most popular computing engine for big data, the most maintained, and with a proven track record of performance. It's 100 times faster than the old MapReduce paradigm, and can easily be extended with machine learning and streaming capabilities, and much more.
If you're dealing with large amounts of data, learning Spark is a must.
The demand for Spark has skyrocketed, and companies are struggling to fill their Data Scientist positions. Scala and Spark are two of the best paying technologies in the field. Forget the reported 120k salaries on PayScale and StackOverflow. I regularly see engineers and data scientists working for 150k+ per year, or charging thousands a day for consulting.
However you take it, learning Spark will be a game changer for your career, if you choose. And this course will help you get those skills.
Work with real big data.
This is our project at the end of the course.
We will dive into one of the biggest datasets publicly available and we'll put everything that we've learned to the test. Unlike every other material on the web (free or paid), this is the only place where you can really practice big data.
Everyone else runs a Spark job on one million records in a 20MB dataset. Why would you need Spark for that?
We do 1.4 BILLION car trips in a 400GB dataset. You don't fit that into most computers. That's the definition of big data.
At the end of the project, we gather data insights worth millions of dollars for the company you're looking to help, and tens of thousands for you as a consultant and data scientist.
(picture courtesy of Ravi Shekhar)
Take the proven path.
As with the other Rock the JVM courses, Spark & Big Data Essentials will take you through a battle-tested path to Spark proficiency as a data scientist and engineer.
As always, I've
- deconstructed the complexity of Spark in bite-sized chunks that you can practice in isolation
- selected the essential concepts and exercises with the appropriate complexity
- sequenced the topics in increasing order of difficulty so that they "click" along the way
- applied everything in live code
StartDataFrames Basics (23:09)
StartDataFrames Basics: Exercises (8:16)
StartHow DataFrames Work (6:22)
StartData Sources (13:24)
StartData Sources, Part 2 + Exercises (27:43)
StartDataFrame Columns and Expressions (24:33)
StartColumns and Expressions: Exercises (17:12)
StartDataFrame Aggregations (28:13)
StartDataFrame Joins (20:23)
StartJoins: Exercises (17:27)
I'm a software engineer and the founder of Rock the JVM. I started the Rock the JVM project out of love for Scala and the technologies it powers - they are all amazing tools and I want to share as much of my experience with them as I can.
For the last 7 years, I've taught a variety of Computer Science topics to 30000+ students at various levels and I've held live trainings for some of the best companies in the industry, including Adobe and Apple. I've also taught university students who now work at Google and Facebook (among others), I've held Hour of Code for 7-year-olds and I've taught 11000 kids to code.
I have a Master's Degree in Computer Science and I wrote my Bachelor and Master theses on Quantum Computation. Before starting to learn programming, I won medals at international Physics competitions.
Get started now!
Risk-free: 100% money back guarantee.
If you're not happy with this course, I want you to have your money back. If that happens, email me at [email protected] with a copy of your welcome email and I will refund you the course.
Less than 1.4 percent of students refunded the course, and every payment was returned in less than 72 hours.