Course/Event Essentials
Training Content and Scope
Other Information
The R part of the course will be focused on presenting the basics of exploratory data analysis in R, as well as presentation of the findings through visualization, and basics of statistical/machine learning modelling. The course will cover the basic workflow of exploratory analysis using packages from the 'tidyverse' universe. These includes packages for the loading of data, preprocessing data, basic data exploration, and visualization. In the second part, we will work on the basics of modelling in R starting with data preparation (missing data handling, one-hot enconding, etc.), model training, and model evaluation. In this part the main tools will be packages 'caret' and 'xgboost'.
The Python oriented part will introduce essential data-scientific packages that will demonstrate their usage with real world data analytic problems, and showing how to tackle such problems.