Analyzing Big Data with Microsoft R

Online Course

edX
Analyzing Big Data with Microsoft R

What is the course about?

Analyzing Big Data with Microsoft R
The course Analyzing Big Data with Microsoft R is an online class provided by Microsoft through edX. The skill level of the course is Intermediate. It may be possible to receive a verified certification or use the course to prepare for a degree.

Learn how to use Microsoft R Server to analyze large datasets using R, one of the most powerful programming languages.

Course description

This course is part of the Microsoft Professional Program Certificate in Data Science and the Microsoft Professional Program Certificate in Big Data..The open-source programming language R has for a long time been popular (particularly in academia) for data processing and statistical analysis. Among R’s strengths are that it’s a succinct programming language and has an extensive repository of third party libraries for performing all kinds of analyses. Together, these two features make it possible for a data scientist to very quickly go from raw data to summaries, charts, and even full-blown reports. However, one deficiency with R is that traditionally it uses a lot of memory, both because it needs to load a copy of the data in its entirety as a data.frame object, and also because processing the data often involves making further copies (sometimes referred to as copy-on-modify). This is one of the reasons R has been more reluctantly received by industry compared to academia. The main component of Microsoft R Server (MRS) is the RevoScaleR package, which is an R library that offers a set of functionalities for processing large datasets without having to load them all at once in the memory. RevoScaleR offers a rich set of distributed statistical and machine learning algorithms, which get added to over time. Finally, RevoScaleR also offers a mechanism by which we can take code that we developed on our laptop and deploy it on a remote server such as SQL Server or Spark (where the infrastructure is very different under the hood), with minimal effort. In this course, we will show you how to use MRS to run an analysis on a large dataset and provide some examples of how to deploy it on a Spark cluster or a SQL Server database. Upon completion, you will know how to use R for big-data problems. Since RevoScaleR is an R package, we assume that the course participants are familiar with R. A solid understanding of R data structures (vectors, matrices, lists, data frames, environments) is required. Familiarity with 3rd party packages such as dplyr is also helpful.edX offers financial assistance for learners who want to earn Verified Certificates but who may not be able to pay the fee. To apply for financial assistance, enroll in the course, then follow this link to complete an application for assistance.

Prerequisites & Facts

Analyzing Big Data with Microsoft R

Course Topic

Data Analysis and Statistics

University, College, Institution

Microsoft

Course Skill Level

Intermediate

Course Language

English

Place of class

Online, self-paced (see curriculum for more information)

Degree

Certificate

Degree & Cost

Analyzing Big Data with Microsoft R

To obtain a verified certificate from edX / Microsoft you have to finish this course or the latest version of it, if there is a new edition. The class may be free of charge, but there could be some cost to receive a verified certificate (99.00 USD) or to access the learning materials. The specifics of the course may have been changed, please consult the provider to get the latest quotes and news.
Microsoft
Analyzing Big Data with Microsoft R
provided by edX

Reviews

Share your experience

Analyzing Big Data with Microsoft R
Microsoft edX
Rate the course

Do you recommend the course? *
Here you can find information, reviews and user experiences for the course “Analyzing Big Data with Microsoft R“. The provider of the course – “Microsoft” – will be glad to answer any questions you may have about the class, click here to use the offical support channels. It would be great if you could share your experience of participating in the course – Your honest review will surely help others to choose the right class!
School: Microsoft
Topic: Data Analysis and Statistics