Course/Event Essentials
Training Content and Scope
Other Information
This HLRS course addresses students, data scientists, and researchers who would like to have an introduction to Machine and Deep Learning methods to solve challenging and future-oriented problems. Both Machine and Deep Learning methods and examples will be presented, together with their implementation on HLRS systems. The first part will be an introduction to basic methods in Machine Learning, including pre-processing and supervised learning using Apache Spark. The course will then move on to elements of supervised Deep Learning on real data to classify annotated images of waste in the wild. Given the deluge of information needed to power machine and deep learning methods, it is imperative to think about effective data processing strategies. Therefore, the course will conclude with an introduction to data compression using the BigWhoop library (part of the EXCELLERAT Data Exchange and Workflow Portal). As an efficient data reduction tool, BigWhoop can be applied to generic numerical datasets to minimize I/O bottlenecks and optimize data storage. The lectures are interleaved with many hands-on sessions using Jupyter Notebooks and scripts on HLRS systems.