Serverless Data Processing with Dataflow - Advanced Streaming Analytics Pipeline with Cloud Dataflow (Java)
Google via Google Skills
2,000+ Free Courses with Certificates: Coding, AI, SQL, and More
AI, Data Science & Cloud Certificates from Google, IBM & Meta
Overview
Build a Learning Habit
Download Class Central's free printable study calendar
Download for Free
In this lab you read deal with late and malformed streaming data using advanced Apache Beam concepts.
Syllabus
- Overview
- Setup and requirements
- Lab part 1. Dealing with late data
- Task 1. Prepare the environment
- Task 2. Set allowed lateness
- Task 3. Set a trigger
- Lab part 2. Dealing with malformed data
- Task 1. Collect malformed data
- Task 2. Make code more modular with a composite transform
- Task 3. Write malformed data for later analysis
- Task 4. Run your pipeline
- Task 5. Test your pipeline
- End your lab