Learn how to accelerate your applications with OpenACC and CUDA and how to train and deploy a neural network to solve real-world problems.
The online workshop combines lectures about Accelerated Computing with OpenACC and CUDA with lectures about Fundamentals of Deep Learning.
The lectures are interleaved with many hands-on sessions using Jupyter Notebooks. The exercises will be done on a fully configured GPU-accelerated workstation in the cloud.
The workshop is co-organised by LRZ and NVIDIA Deep Learning Institute (DLI). NVIDIA DLI offers hands-on training for developers, data scientists, and researchers looking to solve challenging problems with deep learning.
All instructors are NVIDIA certified University Ambassadors.
Agenda
1st day: Fundamentals of Accelerated Computing with OpenACC (10:00-16:00 CET)
On the first day you learn the basics of OpenACC, a high-level programming language for programming on GPUs. Discover how to accelerate the performance of your applications beyond the limits of CPU-only programming with simple pragmas. You’ll learn:
	- 
	How to profile and optimize your CPU-only applications to identify hot spots for acceleration 
- 
	How to use OpenACC directives to GPU accelerate your codebase 
- 
	How to optimize data movement between the CPU and GPU accelerator 
Upon completion, you'll be ready to use OpenACC to GPU accelerate CPU-only applications.
2nd day: Fundamentals of Accelerated Computing with CUDA C/C++ (10:00-16:00 CET)
The CUDA computing platform enables the acceleration of CPU-only applications to run on the world’s fastest massively parallel GPUs. On the 2nd day you experience C/C++ application acceleration by:
	- 
	Accelerating CPU-only applications to run their latent parallelism on GPUs 
- 
	Utilizing essential CUDA memory management techniques to optimize accelerated applications 
- 
	Exposing accelerated application potential for concurrency and exploiting it with CUDA streams 
- 
	Leveraging command line and visual profiling to guide and check your work 
Upon completion, you’ll be able to accelerate and optimize existing C/C++ CPU-only applications using the most essential CUDA tools and techniques. You’ll understand an iterative style of CUDA development that will allow you to ship accelerated applications fast.
3rd day: Fundamentals of Deep Learning (10:00-16:00 CET)
Explore the fundamentals of deep learning by training neural networks and using results to improve performance and capabilities.
During this day, you’ll learn the basics of deep learning by training and deploying neural networks. You’ll learn how to:
	- Implement common deep learning workflows, such as image classification and object detection
- Experiment with data, training parameters, network structure, and other strategies to increase performance and capability
- Deploy your neural networks to start solving real-world problems
Upon completion, you’ll be able to start solving problems on your own with deep learning.