Invalid Login Attempt

Lab: Realtime Stream Processing with Cosmos DB and Azure Databricks

In this lab you will learn to use Cosmos DB and Azure Databricks to build real-time stream processing solution. You will use a pre-built application to read the Twitter data stream into Cosmos DB. You will then configure Azure Databricks to be able to read the change feed of your Cosmos DB collection and you will use Scala to process the data and visualize the data stream in a Databricks Notebook.

  • Estimated time required to complete: 1 hours, 0 minutes
  • You will have access to this environment for 3 hours, 0 minutes
  • Learning Credits Required: 15


Exercise 1: Environment Setup
In this exercise, you will connect to the environment that was setup for the lab.
Exercise 2: Deploy Cosmos DB and Databricks

In this exercise, you will setup your Azure Cosmos DB and Azure Databricks accounts. You will then use the Azure Portal to configure a database and a collection in Cosmos DB that you will stream data into. You will then create your Azure Databricks cluster and configure it to be able to connect to Cosmos DB.

Exercise 3: Buildig the Realtime Stream Processing Pipeline
In this exercise, you will setup your stream processing pipeline. You will start the TwitterServer application which will pull the Twitter feed and load it into Cosmos DB. You will then create a Databricks Notebook using Scala to access the Cosmos DB Change Feed allowing you to generate realtime metrics about the Twitter feed.

Login to Start Lab

Not Registered? Already Registered?
Real Time Labs allow you to learn technology in an isolated environment without the hassle or cost of setting up a dedicated learning environment.

How it works