Lab: Introduction to Azure Machine Learning Studio

In this lab, you will set up an Azure Machine Learning Studio account. You will then walk through the various features and capabilities of Azure Machine Learning Studio. You will load data from local and external sources. You will clean, manipulate and transform the data to make it usable for machine learning. Finally, you will create a binary classification model using two-class boosted decision trees to build a targeted mailing list.

  • Estimated time required to complete: 2 hours, 40 minutes
  • You will have access to this environment for 4 hours, 0 minutes
  • Learning Credits Required: 5
Who this lab is designed for
  • Data Scientists
  • Data Analysts
  • Data Professionals

Learning Objectives

  • Set up and configure Azure Machine Learning Studio.
  • Navigate Machine Learning Studio and understand basic functionality such as managing projects, experiments and datasets.
  • Import data into Machine Learning Studio from local and external data sources.
  • Summarize data and view basic statistics about datasets in Azure Machine Learning Studio.
  • Clean and prepare data for training machine learning algorithms in Azure Machine Learning Studio.
  • Classify data using decision trees in Azure Machine Learning Studio.
  • Leverage existing R scripts in your machine learning experiments in Azure Machine Learning Studio.
  • Operationalize your machine learning experiments with Azure Machine Learning Studio.


Exercise 1: Create a Machine Learning Studio Workspace

An Azure Machine Learning Studio Workspace allows you to use Machine Learning Studio to create and manage machine learning experiments and predictive web services. You can create multiple Workspaces, each one containing a set of your experiments, datasets, trained predictive models, web services, and notebooks. As the owner of a Workspace, you can invite other users to share the Workspace so you can collaborate with them on predictive analytics solutions.

In this exercise, you will create an Azure Machine Learning Studio Workspace.

Exercise 2: Working with Machine Learning Studio Projects and Datasets

Azure Machine Learning Studio is a powerful, browser-based, visual drag-and-drop authoring environment for machine learning in Azure that requires no code. It allows you to build, deploy and share predictive analytics solutions in a fully managed cloud service with minimal overhead and fast time to insight.

In this exercise, you will walk through the Azure Machine Learning Studio interface, where you will create and configure machine learning projects with imported datasets and other assets.

Exercise 3: Create an AzureML Experiment
In this exercise, you will create a simple AzureML experiment that reads and summarizes a dataset using the Summarize Data task.
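
As a rough illustration of the kind of basic statistics a data summary reports, here is a minimal Python sketch. It is not the Summarize Data task itself; the `summarize` function and the `ages` column are hypothetical, and `None` is assumed to mark a missing value.

```python
import statistics

def summarize(values):
    """Return basic statistics for one numeric column (None = missing value)."""
    present = [v for v in values if v is not None]
    return {
        "count": len(present),
        "missing": len(values) - len(present),
        "min": min(present),
        "max": max(present),
        "mean": statistics.mean(present),
        "median": statistics.median(present),
    }

# Hypothetical column with one missing entry
ages = [34, 45, None, 29, 45, 61]
summary = summarize(ages)
print(summary)
```
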
Exercise 4: Accessing External Data Sources
In this exercise, you will access an online data source using the Import Data task in Azure Machine Learning Studio.
Exercise 5: Data Preparation
In this exercise, you will clean, manipulate and transform data using Azure Machine Learning Studio. You will implement a data cleansing process in your experiment to remove duplicate rows, remove outliers, and remove rows missing key data points. You will further verify that your data cleansing and transformations work by integrating the Summarize Data task into key points in the data preparation pipeline.
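
The three cleansing steps described above (duplicates, outliers, rows missing key data) can be sketched in plain Python. This is only an illustration under stated assumptions: the `clean_rows` function, the field names, and the fixed outlier bounds are hypothetical, and the lab itself performs these steps with Studio's visual tools rather than code.

```python
def clean_rows(rows, key_fields, outlier_field, low, high):
    """Drop duplicate rows, rows missing a key field, and out-of-range rows."""
    seen = set()
    cleaned = []
    for row in rows:
        # Step 1: skip exact duplicates of a row already seen
        fingerprint = tuple(sorted(row.items()))
        if fingerprint in seen:
            continue
        seen.add(fingerprint)
        # Step 2: skip rows missing any key data point
        if any(row.get(field) is None for field in key_fields):
            continue
        # Step 3: skip outliers (assumed rule: value outside [low, high])
        value = row.get(outlier_field)
        if value is None or not (low <= value <= high):
            continue
        cleaned.append(row)
    return cleaned

# Hypothetical input rows
rows = [
    {"id": 1, "age": 34, "income": 52000},
    {"id": 1, "age": 34, "income": 52000},    # duplicate
    {"id": 2, "age": None, "income": 61000},  # missing key field
    {"id": 3, "age": 45, "income": 9900000},  # outlier
    {"id": 4, "age": 29, "income": 48000},
]
cleaned = clean_rows(rows, ["age", "income"], "income", 0, 500000)
print(len(rows), "rows in,", len(cleaned), "rows out")
```

Comparing row counts before and after each step plays the same verification role that the Summarize Data task plays in the pipeline.
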
Exercise 6: Using a Decision Tree to Build a Targeted Mailing List

In this exercise, you will create an AzureML experiment to help you create a targeted mailing list using a classification algorithm in Azure Machine Learning Studio.

The type of algorithm we will use is called a binary classifier. A binary classifier is a type of algorithm that classifies elements into one of two groups. In our case, the two groups are whether or not we should send an advertisement to an individual (in other words, marketing wants to know whether reaching a potential customer is worth the cost of the stamp). Other example use cases include whether an email is junk or legitimate, whether a patient’s lab value is positive or negative, or whether sentiment is positive or negative.

The specific algorithm we will be using is the Two-Class Boosted Decision Tree. Decision trees are a great entry point into machine learning because they are intuitive and easy to understand. The Two-Class Boosted Decision Tree is one of the easiest methods for getting good performance. However, it operates in memory, so it is limited by available memory and may not be well suited to larger datasets.
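
To make the idea of boosting concrete, here is a small Python sketch of AdaBoost-style boosting over one-split decision stumps. This is a simpler, swapped-in variant for illustration and should not be read as the algorithm behind the Studio module. The function names and the toy data are hypothetical; labels are +1 (send mail) and -1 (skip).

```python
import math

def fit_stump(X, y, w):
    """Find the one-feature threshold rule with the lowest weighted error."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for sign in (1, -1):
                preds = [sign if row[j] <= t else -sign for row in X]
                err = sum(wi for wi, p, yi in zip(w, preds, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, j, t, sign)
    return best

def stump_predict(stump, row):
    _, j, t, sign = stump
    return sign if row[j] <= t else -sign

def fit_boosted(X, y, rounds=3):
    """Train stumps in sequence, reweighting the rows each one misclassifies."""
    n = len(X)
    w = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        stump = fit_stump(X, y, w)
        err = min(max(stump[0], 1e-10), 1 - 1e-10)  # avoid log(0)
        alpha = 0.5 * math.log((1 - err) / err)     # vote weight of this stump
        ensemble.append((alpha, stump))
        # Increase the weight of misclassified rows, decrease the rest
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, row))
             for wi, yi, row in zip(w, y, X)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, row):
    score = sum(alpha * stump_predict(stump, row) for alpha, stump in ensemble)
    return 1 if score >= 0 else -1

# Toy data: one hypothetical feature, labels no single split can separate
X = [[1], [2], [3], [4], [5], [6]]
y = [1, 1, -1, -1, 1, 1]
ensemble = fit_boosted(X, y, rounds=3)
predictions = [predict(ensemble, row) for row in X]
print(predictions)
```

No single stump can classify more than four of these six rows correctly, but the weighted vote of three stumps classifies all six, which is the core idea behind boosting weak trees into a stronger classifier.
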

Exercise 7: Embedding an R Script in AzureML

In Azure ML Studio, you can use the Execute R Script module to embed R code in your experiments and run it as part of the experiment. This means you can use custom R functions and packages that are not immediately available in Azure ML Studio.

In this exercise, we are going to use an R script to sample our dataset. You might want to do this if you have a large dataset and want to use an algorithm, such as Two-Class Boosted Decision Trees, that operates in memory and therefore needs a smaller dataset. We will run the R script by using the Execute R Script module in Azure ML Studio.
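
The lab performs the sampling in R. Purely to illustrate the idea of reproducible random sampling, here is a minimal Python sketch; the `sample_rows` function, the 10% fraction, and the seed are assumptions made for the example and are not taken from the lab's script.

```python
import random

def sample_rows(rows, fraction, seed=42):
    """Return a random subset of rows, drawn without replacement."""
    rng = random.Random(seed)  # fixed seed makes the sample repeatable
    k = min(len(rows), max(1, round(len(rows) * fraction)))
    return rng.sample(rows, k)

# Hypothetical dataset of 1,000 row identifiers
rows = list(range(1000))
subset = sample_rows(rows, 0.1)
print(len(subset))
```

Fixing the seed means that rerunning the experiment trains on the same subset, which makes results easier to compare between runs.
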

Exercise 8: Taking your AzureML Models to Production
So far, we have looked at different tasks for importing and preparing data, building experiments, and training models. Now, we are going to convert our training experiment to a predictive experiment. The predictive experiment takes a single input and produces a prediction as its result. It will be deployed as a web service on Azure so that external applications can use it.
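
The single-input, single-result shape of a predictive service can be sketched as follows. Everything here is hypothetical: the field names, the JSON layout, and the placeholder rule inside `score_record` stand in for a trained model and do not reflect the request format of the deployed Azure web service.

```python
import json

def score_record(record):
    """Hypothetical stand-in for a trained model: one record in, one result out."""
    # Placeholder rule, NOT a trained model
    send = record["income"] >= 50000 and record["age"] >= 30
    return {"id": record["id"], "send_mail": bool(send)}

def handle_request(body):
    """Parse one JSON input, score it, and return the result as JSON."""
    record = json.loads(body)
    return json.dumps(score_record(record))

response = handle_request('{"id": 7, "age": 41, "income": 72000}')
print(response)
```

An external application would send one record per request in this manner and read the prediction from the response.
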
