This course covers data science (DS), a science that combines statistics, data processing, machine learning, scientific inquiry, visualization, business analytics, big data, and big models. The purpose of data science is to address or gain insight into a problem in the real world by the application of computational and statistical techniques.
The course will cover the following subjects: Advanced Data Visualization, Feature engineering, Hypothesis Testing, Feature Selection, outlier detection, Unsupervised Learning, Algorithmic Bias, Virtualization and Distributed learning, Deep Learning (MPLs, CNNs), Computer Vision, Natural language processing, tokenization and transformers, Large Language Model Fine-tuning (SFT), Reinforcement learning (RL, RLHF, RLVR), Vision Language Models (VLMs).