Dec 15, 2025  
2022-2023 Graduate Catalog 
    
2022-2023 Graduate Catalog [ARCHIVED CATALOG]

Add to Catalog (opens a new window)

DS 7347 - High Performance Computing


Credits: 3

Provides a practical introduction to HPC systems, job scheduling, and parallelization of data analysis and machine learning tasks using tools such as the Rapids framework, Dask, and Horovod. Students learn to effectively manage large datasets with efficient file formats, define reproducible software environments using containers and version control systems, and execute parallelized workflows by simultaneously using multiple HPC compute nodes, CPUs, and GPUs. The course is taught using Unix shell scripting, Python, and C++ where appropriate to demonstrate various aspects of high-performance data analysis and machine learning workflows.



Add to Catalog (opens a new window)