Real-Time Analytics with Apache Storm

Online Course

Udacity
Real-Time Analytics with Apache Storm

What is the course about?

Real-Time Analytics with Apache Storm
The course Real-Time Analytics with Apache Storm is an online class provided by Udacity. The skill level of the course is Intermediate. It may be possible to receive a verified certification or use the course to prepare for a degree.

The world is trending in real time! Learn from Twitter to scalably process tweets, or any big data stream, in real-time to drive d3 visualizations using Apache Storm, the “Hadoop of Real Time.” Storm is free, open source, and fun to use! Learn from Karthik Ramasamy, Technical Lead of Storm@Twitter, about the distributed, fault-tolerant, and flexible technology used to power Twitter’s real-time data flow pipeline. Twitter open sourced Storm in 2011, and it graduated to a top-level Apache project in September, 2014.

Course description
  • Real-Time Analytics with Apache Storm
  • 2 weeks
  • The “Hadoop of Real-Time”
  • Starting from basic distributed concepts presented during our first Udacity-Twitter Storm Hackathon, link Storm concepts to Storm syntax to scalably drive Word Cloud visualizations with Vagrant, Ubuntu, Maven, Flask, Redis, and d3. Link to the public Twitter gardenhose stream to process live tweets, parse embedded URLs, and calculate Top worldwide hashtags. Extend beyond Storm basics by exploring multi-language capabilities in Python, integrate open source components, and implement real-time streaming joins.
  • In your final project, follow real-time trending topics by implementing the data pipeline to visualize only tweets that contain Top worldwide hashtags. Extend your project by exploring the Twitter API, or any data source, alongside Hackathon participants as they design their own ideas, receive feedback from Karthik, and open source a final project calculating real-time tweet sentiment and geolocation to drive a U.S. Map.
  • Basic Storm Topologies
  • Storm Basics
  • Beyond Storm Basics
  • Final Project
  • Final Project: Construct a Storm Topology
  • Project Extensions
  • Link to a real-time d3 Word Cloud Visualization using Redis, Flask, and d3
  • Program Bolts, link Spouts, and connect to the live Twitter API to process real-time tweets
  • Explore open source components by connecting a Rolling Count Bolt to your topology to visualize Rolling Top Tweeted Words
  • Explore multi-language capabilities to download and parse real-time Tweeted URLs in Python using Beautiful Soup
  • Integrate complex open source bolts to calculate Top-N words to visualize real-time Top-N Hashtags
  • Use stream grouping concepts to easily create streaming join to connect and dynamically process multiple streams
  • Work on your final project and we cover additional questions and topics brought up by Hackathon participants
  • Explore Vagrant, VirtualBox, Redis, Flask, and d3 further if you are interested!
  • Design a Storm Topology and new bolt that uses streaming joins to dynamically calculate Top-N Hashtags and display real-time tweets that contain trending Top Hashtags
  • Post your visualization to the forum and tweet them to your Twitter followers
  • Use additional features of the real-time Twitter sample stream or use any data source to drive your real-time d3 visualization
  • Programming language required: Java
  • To be successful, you’ll need intermediate knowledge of Java. Specifically, this is defined by experience and comfort with Java syntax, compile & run-time error diagnostics and debugging, ability to use javadocs as needed, and intermediate data structures including Arrays, HashMaps, and LinkedLists. If you need to build these skills, a good starting point is Udacity’s Introduction to Java with additional comfortability needed identifying and debugging compile & run-time errors.
  • No prior experience is assumed in Ubuntu, git, Maven, Redis, Flask (Python) or d3 (Javascript). Python is useful, but optional. A basic course such as CS101 or OO in Python would be helpful.
  • See the Technology Requirements for using Udacity.
  • Learn by doing! The world is going real time. Batch processing, popularized by Hadoop, has latency exceeding required real-time demands of modern mobile, connected, always-on users. Stream processing with seconds-required response time is necessary to meet this demand. Twitter is a world leader in real-time processing at scale. Learn the future from the company defining it.

Prerequisites & Facts

Real-Time Analytics with Apache Storm

Course Topic

Computer Science, Programming

University, College, Institution

Udacity

Course Skill Level

Intermediate

Course Language

English

Place of class

Online, self-paced (see curriculum for more information)

Degree

Certificate

Degree & Cost

Real-Time Analytics with Apache Storm

To obtain a verified certificate from Udacity you have to finish this course or the latest version of it, if there is a new edition. The class may be free of charge, but there could be some cost to receive a verified certificate or to access the learning materials. The specifics of the course may have been changed, please consult the provider to get the latest quotes and news.
Udacity
Real-Time Analytics with Apache Storm
provided by Udacity

Reviews

Share your experience

Real-Time Analytics with Apache Storm
Udacity Udacity
Rate the course

Do you recommend the course? *
Here you can find information, reviews and user experiences for the course “Real-Time Analytics with Apache Storm“. The provider of the course – “Udacity” – will be glad to answer any questions you may have about the class, click here to use the offical support channels. It would be great if you could share your experience of participating in the course – Your honest review will surely help others to choose the right class!
School: Udacity
Topic: Computer Science, Programming