Efficient Storage Systems

Objectives

  • Understand the different storage and data access patterns of scientific and Big Data applications.
  • Explore storage interfaces, libraries and abstractions used by applications.
  • Understand the organization and management of storage resources in advanced computing infrastructures.
  • Acquire knowledge about technologies, interfaces and architectures of local and distributed storage systems currently used in the state-of-practice (e.g., industry, production environments), while understanding their evolution and different purposes.
  • Design optimizations for accessing digital information on behalf of data-centric applications (e.g., for performance, reliability, resource management, security).
  • Perform the configuration and deployment of resources and storage systems in advanced computing environments.
  • Implement the diagnosis and evaluation of applications and storage systems in advanced computing environments.

Program

  • Scientific and Big Data data-centric applications and respective storage patterns (e.g., sequential vs. random accesses, large vs. small files).
  • Interfaces, file formats and different I/O libraries used in advanced computing (e.g., POSIX, HDF5, MPI-IO).
  • Infrastructures and storage architectures adopted by advanced computing centers.
  • Systems and hardware for local data storage.
  • Systems and hardware for distributed data storage.
  • Storage optimizations in terms of efficiency, scalability, reliability, resource management and data security (e.g., caching, tiering, compression, deduplication, replication, encryption).
  • Diagnosis of application I/O requests and evaluation of storage systems.

Bibliography

  • Alexander Thomasian. Storage Systems, O’Reilly, 2021.
  • Anjus George, Rick Mohr, James Sim. Understanding Lustre Internals, Oak Ridge National Laboratory, 2021.
  • Jeffrey S. Vetter. Contemporary High Performance Computing, O’Reilly, 2017.
  • Rajkumar Buyya, Toni Cortes, Hai Jin. High Performance Mass Storage and Parallel I/O: Technologies and Applications, Wiley-IEEE Press, 2001.
  • Ricardo Macedo, João Paulo, José Pereira, Alysson Bessani. A Survey and Classification of Software-Defined Storage Systems, ACM Computing Surveys, 2020.
  • Jean Luca Bez, Suren Byna, Shadi Ibrahim. I/O Access Patterns in HPC Applications: A 360-Degree Survey, ACM Computing Surveys, 2023.

Updated: