Overview
- What started the big data era
- Three main big data sources
- How to get value from big data
- Big data's characteristics
- 5 steps to process to gain value from big data
- The main elements of the Hadoop stack
What started the big data era
Data Torrent + Computing(Anytime and Anywhere)
Three main big data sources
- Machines
- People
- Origanization
How to get value from big data
Value come from integrating different types of data sources
Data intergation
- Reduce data complexity
- Increase data availability
- Unify your data system
Big data's characteristics
- Volume (Size)
- Varity (Complexity)
- Valence (Connectedness)
- Veracity (Quality)
- Velocity (Speed)
5 steps to process to gain value from big data
- Acquire
- Indentify data sets
- Retrieve data
- Query data
- Prepare Explore data
- Understand the nature of data
- Preliminary analysis Pre-process Data
- clean
- Integrate
- Package
- Analyze
- Select analytical techiques
- Build models
- Report
- Communicate results
- Act
- Apply results
The main elements of the Hadoop stack
- Enable Scalability
- Handle Fault Tolerance
- Optimized for a Variety Data Types
- Facilitate a Shared Environnment
- Provide Value