Introduction
Welcome
()
What you should know
()
Using the exercise files
()
1. Introduction to Scala
The advantages of Scala for data science
()
Installing Scala
()
Scala data types
()
Scala collections
()
Scala sets Scala arrays, vectors, and ranges
()
Scala maps
()
Scala expressions
()
Scala functions
()
Scala objects
()
2. Parallel Processing in Scala
Advantages of parallel collections
()
Creating parallel collections
()
Mapping functions over parallel collections
()
Filtering parallel collections
()
When and when not to use parallel collections
()
3. Using SQL in Scala
Installing PostgreSQL
()
Loading data into PostgreSQL
()
Connecting to PostgreSQL
()
Querying with SQL strings
()
Querying with prepared statements
()
Summary of SQL in Scala
()
4. Scala and Spark RDDs
Introduction to Spark
()
Installing Spark
()
Getting Started with Spark RDDs
()
Mapping Functions over RDDs
()
Statistics over RDDs
()
Summary of Scala and Spark RDDs
()
5. Scala and Spark DataFrames
Creating DataFrames
()
Grouping and filtering on DataFrames
()
Joining DataFrames
()
Working with JSON files
()
Summary of Scala and Spark DataFrames
()
Conclusion
Review of Scala for data science
()
Ex_Files_Scala_EssT_Data_Science.zip
(625 KB)