The goal of this course is to give you solid foundations for developing, analyzing, and implementing parallel and locality-efficient algorithms. This course focuses on theoretical underpinnings. To give a practical feeling for how algorithms map to and behave on real systems, we will supplement algorithmic theory with hands-on exercises on modern HPC systems, such as Cilk Plus or OpenMP on shared memory nodes, CUDA for graphics co-processors (GPUs), and MPI and PGAS models for distributed memory systems.
Master key concepts and practical skills through structured learning modules. By completing this curriculum, you'll gain valuable expertise applicable to real-world scenarios.
Offered at Georgia Tech as CS 6220
This course is a graduate-level introduction to scalable parallel algorithms. "Scale" really refers to two things: efficient as the problem size grows, and efficient as the system size (measured in numbers of cores or compute nodes) grows. To really scale your algorithm in both of these senses, you need to be smart about reducing asymptotic complexity the way you’ve done for sequential algorithms since CS 101; but you also need to think about reducing communication and data movement. This course is about the basic algorithmic techniques you’ll need to do so.
The techniques you’ll encounter covers the main algorithm design and analysis ideas for three major classes of machines: for multicore and many core shared memory machines, via the work-span model; for distributed memory machines like clusters and supercomputers, via network models; and for sequential or parallel machines with deep memory hierarchies (e.g., caches). You will see these techniques applied to fundamental problems, like sorting, search on trees and graphs, and linear algebra, among others. The practical aspect of this course is implementing the algorithms and techniques you’ll learn to run on real parallel and distributed systems, so you can check whether what appears to work well in theory also translates into practice. (Programming models you’ll use include Cilk Plus, OpenMP, and MPI, and possibly others.)
This comprehensive High Performance Computing curriculum is designed to take you from foundational concepts to advanced implementation. Each module builds upon the previous, ensuring a structured learning path that maximizes knowledge retention and practical application.
Understand what you need to succeed in this course and determine if it's the right fit for your learning goals
What you need before starting this High Performance Computing course:
This course is perfect for:
Everything you need to know about this online course, from duration to certification
Difficulty Level
Advanced
Prior experience recommended
Subject Category
Flying Car, Autonomous Flight
Part of our Flying Car, Autonomous Flight curriculum
Course Language
English
All materials in English
This online course offers comprehensive training with expert instruction, practical exercises, and a certificate of completion. Join thousands of students advancing their careers through quality online education.
View Current Pricing
Check provider for latest offers
Pricing may vary. Check the course provider for current promotions and exact pricing.
High Performance Computing isn't for you? Don't worry, explore these courses and advance your skills or learn something totally new.