Understand the different storage and data access patterns of scientific and Big Data applications.
Explore storage interfaces, libraries and abstractions used by applications.
Understand the organization and management of storage resources in advanced computing infrastructures.
Learn the technologies, interfaces and architectures of local and distributed storage systems currently used in practice (e.g., industry, production environments), and understand how they evolved and what purposes they serve.
Design optimizations for how data-centric applications access data (e.g., for performance, reliability, resource management, security).
Configure and deploy resources and storage systems in advanced computing environments.
Diagnose and evaluate applications and storage systems in advanced computing environments.
Program
Data-centric scientific and Big Data applications and their storage access patterns (e.g., sequential vs. random accesses, large vs. small files).
Interfaces, file formats and different I/O libraries used in advanced computing (e.g., POSIX, HDF5, MPI-IO).
Infrastructures and storage architectures adopted by advanced computing centers.
Systems and hardware for local data storage.
Systems and hardware for distributed data storage.
Storage optimizations in terms of efficiency, scalability, reliability, resource management and data security (e.g., caching, tiering, compression, deduplication, replication, encryption).
Diagnosis of application I/O requests and evaluation of storage systems.
Bibliography
Alexander Thomasian. Storage Systems, O’Reilly, 2021.
Anjus George, Rick Mohr, James Simmons. Understanding Lustre Internals, Oak Ridge National Laboratory, 2021.
Jeffrey S. Vetter. Contemporary High Performance Computing, O’Reilly, 2017.
Rajkumar Buyya, Toni Cortes, Hai Jin. High Performance Mass Storage and Parallel I/O: Technologies and Applications, Wiley-IEEE Press, 2001.
Ricardo Macedo, João Paulo, José Pereira, Alysson Bessani. A Survey and Classification of Software-Defined Storage Systems, ACM Computing Surveys, 2020.
Jean Luca Bez, Suren Byna, Shadi Ibrahim. I/O Access Patterns in HPC Applications: A 360-Degree Survey, ACM Computing Surveys, 2023.