Introduction
What data science tools must you know?
()
Course organization
()
1. Introduction to Data Science
Introduction
()
Data science
()
Fundamental skills
()
Tools of trade
()
Enabling technologies
()
2. Cloud Computing
Cloud computing and virtualization
()
Cloud fundamentals
()
Types of cloud
()
Solution providers
()
Private cloud hands-on with Proxmox
()
Proxmox: Bootable installation disk
()
Proxmox: Installation
()
Proxmox: Managing virtual machines
()
Proxmox: Creating and configuring virtual machines
()
3. Distributed File Systems
Distributed file systems
()
Fundamentals
()
Distributed systems and distributed processing
()
Hadoop hands-on
()
Hadoop: Preparation
()
Hadoop: Installation
()
Hadoop: MapReduce hands-on
()
4. Distributed Processing
Distributed processing with MapReduce
()
Distributed processing with Spark
()
Spark architecture and features
()
Spark: Installation
()
Spark: Spark shell
()
Spark: pyspark
()
Spark: Application
()
5. Machine Learning
Machine learning
()
Fundamentals
()
Types of machine learning
()
Weka: Installation
()
Weka: GUI
()
Weka: Training vs. testing
()
Weka: Clustering
()
6. Case Study
Putting it all together
()
Hadoop cluster: Installation
()
Hadoop cluster: Operation
()
Spark, YARN, and Hadoop
()
Weka and Spark
()