Managing data efficiently on a supercomputer is very important from both the users' and system's perspectives. Users need to be aware of various data storage and management mechanisms, in addition to their impact on application and system performance. In this webinar, we will cover a few basic data management techniques and I/O best practices in the context of the Expanse system at SDSC. We will learn about the file systems that are part of Expanse and discuss the pros and cons of using these file systems in terms of I/O performance, storage capacity, shared access, and backup. The webinar will also introduce the use of Globus for file transfers, file sharing, and more. Globus is software-as-a-service for research data management developed and operated by the University of Chicago and is widely used for moving and sharing data between and among a wide variety of storage systems at research labs, campus computing resources, and national facilities like SDSC.
---
Instructors
Mahidhar Tatineni
Director of User Services, SDSC
Mahidhar Tatineni received his M.S. & Ph.D. in Aerospace Engineering from UCLA. He currently leads the User Services group at SDSC and has done many optimization and parallelization projects on the supercomputing resources including Gordon, Comet, and Expanse.
Rick Wagner
Principal Research Systems Integration Engineer, UC San Diego
Rick began his career using cyberinfrastructure as a tool for research in astrophysics, working on problems in cosmology and supersonic turbulence. His research was largely done on campus, NSF, and DOE computing resources, the same kinds of systems he later managed for SDSC. Rick took a break from UCSD to work for Globus at the University of Chicago, helping researchers with data management solutions. Now Rick is part of the Research IT team, helping to design solution for projects that cut across the campus and beyond it. He is also trying to smooth the boundary between cybersecurity and research and was a 2021 Trusted CI Fellow.