Contenuti dettagliati del Corso
Giorno 1
- Course introduction
- Fundamentals of Big Data
- What is Big Data?
- The V’s of Big Data
- Data at Rest and Data in Transit (Batch vs Stream)
- The Big Data Pipeline
- Why a pipeline
- Big Data and Machine Learning
- Big Data Architecture
- Decoupling the Architecture
- Collecting Data
- Data and Datastore
- Storage solutions for huge amount of data
- DEMO: scalable storage solutions
- Databases, SQL and NoSQL
- Introduction to Graph DB
- Database as a Service, benefits
- DEMO: Database as a service
- Big data Processing and Analytics
- Introduction to big data Processing and Analytics
- How to perform simple querying
- Ad hoc analytics
- DEMO: ad hoc analytics
Giorno 2
- Data Warehouse
- On premises vs Managed data warehouse
- Data Lake
- Introduction to Data Lake
- Single source of truth
- Data Lake solutions
- Hadoop & Map Reduce Fundamentals
- Hadoop EcoSystem
- Map Function and Reduce Function
- MapReduce VS RMDBS
- Hadoop Frameworks
- Hadoop in the cloud
- DEMO: Hadoop in the cloud (EMR & Dataproc)
- Serverless Pipeline
- What serverless means
- Why going serverless
- DEMO: creting a serverless pipeline
- Data Visualization
- Business Intelligence tools
- Elastic Search, Logstash and Kibana
- DEMO: how to visualize data
- Big Data Solutions in Real World
- Typical Business Use Case