Welcome to Big Data Integration and Processing-Why Big Data Integration and Processing?
What is in this Course?
()
Summary of Big Data Modeling and Management
()
Why is Big Data Processing Different?
()
Slides: Summary & Why Is Big Data Processing Different
Welcome to Big Data Integration and Processing-Hands On: Setting Up Your Software Environment
Downloading and Installing the Cloudera VM Instructions (Windows)
Downloading and Installing the Cloudera VM Instructions (Mac)
Software Installation Frequently Asked Questions (FAQ)
Instructions for Downloading Hands On Datasets
Instructions for Starting Jupyter
Retrieving Big Data (Part 1)-Querying Data Part 1
What is Data Retrieval? Part 1
()
What is Data Retrieval? Part 2
()
Querying Two Relations
()
Subqueries
()
Slides: What is Data Retrieval?
Retrieving Big Data (Part 1)-Hands On
Querying Relational Data with Postgres
Querying Relational Data with Postgres
()
Retrieving Big Data (Part 2)-Querying Data Part 2
Querying JSON Data with MongoDB
()
Aggregation Functions
()
Querying Aerospike
()
Slides: Querying Data Part 2
Retrieving Big Data (Part 2)-Hands On
Querying Documents in MongoDB
Querying Documents in MongoDB
()
Exploring Pandas DataFrames
Exploring Pandas DataFrames
()
Big Data Integration-Information Integration
Overview of Information Integration
()
A Data Integration Scenario
()
Integration for Multichannel Customer Analytics
()
Slides: Information Integration
Big Data Integration-Industry Examples for Big Data Integration and Processing
Big Data Management and Processing Using Splunk and Datameer
()
Why Splunk?
()
Connected Cars with Ford's OpenXC and Splunk
()
Big Data Management and Processing using Datameer
()
Big Data Integration-Hands-On: Big Data Management and Processing Using Splunk
Downloading Splunk Enterprise
Installing Splunk Enterprise on Windows
()
Installing Splunk Enterprise on Linux
()
Exploring Splunk Queries
Exploring Splunk Queries
()
Optional: Instructions for Splunk Pivot Tutorial
Optional: Creating Pivot Reports in Splunk
()
Processing Big Data-Big Data Pipelines and High-level Operations for Big Data Processing
Big Data Processing Pipelines
()
Some High-Level Processing Operations in Big Data Pipelines
()
Aggregation Operations in Big Data Pipelines
()
Typical Analytical Operations in Big Data Pipelines
()
Big Data Processing Pipelines Slides
Processing Big Data-Big Data Processing Tools and Systems
Overview of Big Data Processing Systems
()
Big Data Workflow Management
The Integration and Processing Layer
()
Introduction to Apache Spark
()
Getting Started with Spark
()
Slides for Big Data Processing Tools and Systems
Processing Big Data-Hands-On: Let's Try Spark
WordCount in Spark
WordCount in Spark
()
Big Data Analytics using Spark-Programming in Spark
Spark Core: Programming In Spark using RDDs in Pipelines
()
Spark Core: Transformations
()
Spark Core: Actions
()
Slides for Module 5 Lesson 1
Big Data Analytics using Spark-Main Modules in the Spark Ecosystem
Spark SQL
()
Spark Streaming
()
Spark MLLib
()
Spark GraphX
()
Slides for Module 5 Lesson 2
Big Data Analytics using Spark-Hands-on: Data Processing in Spark
Exploring SparkSQL and Spark DataFrames
Exploring SparkSQL and Spark DataFrames
()
Instructions for Configuring VirtualBox for Spark Streaming
Analyzing Sensor Data with Spark Streaming
Analyzing Sensor Data with Spark Streaming
()
Learn By Doing: Putting MongoDB and Spark to Work-Assignment: Querying and Exporting from MongoDB
Let's Analyze Soccer Tweets!
Expressing Analytical Questions as MongoDB Queries
Exporting Data from MongoDB to a CSV File
Learn By Doing: Putting MongoDB and Spark to Work-Assignment: Analysis using Spark
Analyzing Tweets About Countries