Brief Introduction
Learn by doing! The world is going real time. Batch processing, popularized by Hadoop, has latency too high for the real-time demands of modern mobile, connected, always-on users. Stream processing with response times measured in seconds is needed to meet this demand. Twitter is a world leader in real-time processing at scale. Learn the future from the company defining it.
Course Summary
Learn how to build real-time data processing pipelines using Apache Storm with this comprehensive course. Gain hands-on experience in building, deploying, and scaling real-time analytics applications.
Key Learning Points
- Understand the core concepts of real-time data processing and how Apache Storm works
- Build and deploy real-time data processing pipelines using Apache Storm
- Optimize and scale Storm topologies for efficient and reliable processing
Prerequisites or recommended knowledge before taking this course
- Basic knowledge of programming concepts
- Familiarity with the Java programming language
Course Difficulty Level
Intermediate
Course Format
- Self-paced
- Online
- Video lectures
- Hands-on projects
Similar Courses
- Real-time Stream Processing
- Apache Kafka Series - Learn Apache Kafka for Beginners
Notable People in This Field
- Jay Kreps
- Nathan Marz
Description
The world is trending in real time! Learn Apache Storm, taught by Twitter, to scalably analyze real-time tweets and drive d3 visualizations. Storm is free, open and fun!
Requirements
- Programming language required: Java. To be successful, you'll need intermediate knowledge of Java. Specifically, this means experience and comfort with Java syntax, diagnosing and debugging compile-time and run-time errors, using javadocs as needed, and intermediate data structures including Arrays, HashMaps, and LinkedLists. If you need to build these skills, a good starting point is Udacity’s Introduction to Java, followed by additional practice identifying and debugging compile-time and run-time errors. No prior experience is assumed in Ubuntu, git, Maven, Redis, Flask (Python) or d3 (JavaScript). Python is useful, but optional. A basic course such as CS101 or OO in Python would be helpful. See the Technology Requirements for using Udacity.
Knowledge
- Instructor videos
- Learn by doing exercises
- Taught by industry professionals
Outline
- Lesson 1: Basic Storm Topologies. Link to a real-time d3 Word Cloud Visualization using Redis, Flask, and d3.
- Lesson 2: Storm Basics. Program Bolts, link Spouts, and connect to the live Twitter API to process real-time tweets. Explore open source components by connecting a Rolling Count Bolt to your topology to visualize Rolling Top Tweeted Words.
- Lesson 3: Beyond Storm Basics. Explore multi-language capabilities to download and parse real-time Tweeted URLs in Python using Beautiful Soup. Integrate complex open source bolts to calculate Top-N words to visualize real-time Top-N Hashtags. Use stream grouping concepts to easily create a streaming join to connect and dynamically process multiple streams.
- Lesson 4: Final Project. Work on your final project while we cover additional questions and topics brought up by Hackathon participants. Explore Vagrant, VirtualBox, Redis, Flask, and d3 further if you are interested!
- Lesson 5: Final Project: Construct a Storm Topology. Design a Storm Topology and a new bolt that uses streaming joins to dynamically calculate Top-N Hashtags and display real-time tweets that contain trending Top Hashtags. Post your visualization to the forum and tweet it to your Twitter followers.
- Lesson 6: Project Extensions. Use additional features of the real-time Twitter sample stream, or use any data source, to drive your real-time d3 visualization.
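
To give a feel for the rolling count and Top-N ideas named in the outline, here is a minimal sketch in plain Python. It does not use the Storm API and is not the course's bolt implementation; the class name `RollingTopN` and its `window` and `n` parameters are hypothetical, and the window is assumed to be a fixed number of most recent words rather than a time interval.

```python
from collections import Counter, deque


class RollingTopN:
    """Count words over the last `window` items and report the `n` most frequent.

    Hypothetical illustration only: a real Storm topology would spread this
    work across bolts and typically use time-based windows.
    """

    def __init__(self, window, n):
        self.window = window      # how many recent words to keep
        self.n = n                # how many top words to report
        self.recent = deque()     # words currently inside the window
        self.counts = Counter()   # word -> occurrences inside the window

    def add(self, word):
        """Add a new word and evict the oldest one if the window is full."""
        self.recent.append(word)
        self.counts[word] += 1
        if len(self.recent) > self.window:
            old = self.recent.popleft()
            self.counts[old] -= 1
            if self.counts[old] == 0:
                del self.counts[old]

    def top(self):
        """Return up to `n` (word, count) pairs, highest count first.

        Ties are broken alphabetically so the result is deterministic.
        """
        ranked = sorted(self.counts.items(), key=lambda kv: (-kv[1], kv[0]))
        return ranked[: self.n]


if __name__ == "__main__":
    tracker = RollingTopN(window=4, n=2)
    for w in ["storm", "bolt", "storm", "spout", "bolt"]:
        tracker.add(w)
    print(tracker.top())
```

In a topology, the counting step and the ranking step would usually live in separate bolts so that each can be scaled independently; the sketch keeps them in one class only for brevity.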
Summary of User Reviews
Real-time analytics with Apache Storm is a highly rated course that teaches students how to use the popular real-time processing system. Users praise the course for its comprehensive curriculum and hands-on approach to learning.
Key Aspect Users Liked About This Course
- Hands-on approach to learning
Pros from User Reviews
- Comprehensive curriculum
- In-depth explanations
- Great for beginners and experienced users
- Real-world examples
- Interactive quizzes and exercises
Cons from User Reviews
- Some users found the course too basic
- Lack of advanced topics
- Limited support from instructors
- Outdated materials
- Not suitable for those with no programming experience