The goal main of this ebook is to introduce you with the HPC systems and its software stack. This course has been specially designed to enable you to utilize parallel & distributed programming and computing resources to accelerate the solution of a complex problem with the help of HPC systems and Supercomputers. You can then use your knowledge in Machine learning, Deep learning, Data Sciences, Big data and so on.
From this book you will learn overviews of:
- Supercomputing basics
- Components of a HPC system
- PBS – Portable Batch System
- SLURM -Workload Manager
- Parallel programming – OpenMP and MPI
- Parallel programming – GPU and CUDA
What is a HPC system?
HPC clusters typically have a large number of computers (often called ‘nodes’) and, in general, most of these nodes would be configured identically. Though from the out side the cluster may look like a single system, the internal workings to make this happen can be quite complex. This idea should not be confused with a more general client-server model of computing as the idea behind clusters is quite unique.
Benefits of using HPC
A cluster of computers joins computational powers of the compute nodes to provide a more combined computational power. Therefore, as in the client-server model, rather than a simple client making requests of one or more servers, cluster computing utilize multiple machines to provide a more powerful computing environment perhaps through a single operating system.
Also available at
About the author
Ahmed Arefin, PhD is an enthusiastic computer programmer with more than a decade of well-rounded computational experience. Following his PhD (Computer Science) and Postdoc (Parallel Data Mining) research he’s moved forward to become a Scientific Computing professional keeping his research interests on, in the area of parallel, distributed and accelerated computing. He loves to code, research, write and teach.