In early March, before conferences of all types across the country shut down and social distancing was mandated, I was sitting in a conference room in Albuquerque, New Mexico, attending the Storage Technology Showcase 2020 and it struck me, I was surrounded by power users. Many of the participants were storage users who push the capabilities of data storage technology to the limit. Every industry has power users; in the automotive industry, it is those that demand the fastest and highest performing cars. Think of NASCAR or the 24 hours of Le Mans. The drivers push race cars to their limits. For data storage, the power users for storage are the organizations that perform high-performance computing.
Several of the presentations were from storage managers at supercomputing centers that performed research in energy and weather systems. They highlighted the demanding storage environments for high-performance computing. In practice, this means a lot of flash, a lot of disk, and massive quantities of tape connected via high-speed networks. For example, to support the existing Cray XC-40 and coming Perlmutter supercomputers, the National Energy Research Scientific Computing Center (NERSC) 2020 storage architecture includes more than 30 PB of flash, 340 PB of disk, and .5 EB of tape storage.
One of the themes of the conference was the importance of tape storage in cost-effectively managing massive amounts of archive data. For example, the European Centre for medium-range weather forecasts (ECMWF) has over 450 PB of archive data stored on tape and adds another 260 TB/Day. At these capacity points, tape storage is essential not only for financial reasons but also for other features: volumetric efficiency, a long media life and energy savings.
A surprising presentation was Western Digital Corporation’s “Tape to the future presentation." While most of us think of WDC as an HDD and Flash company, the presentation highlighted the potential of dramatic growth in the amount of data stored on tape in the coming decade. This growth will be driven by the growth of archive data in hyperscale data centers. To meet these needs, several cost, packaging, performance, and storage architecture requirements were suggested.
The final theme that several presentations touched on was the challenge of migrating storage infrastructure to new technology while maintaining service levels. Migrations present a host of challenges, and different approaches were taken due to widely differing environments and history. Given the substantial amounts of data stored and the value of the information, a great deal of planning was done, and the migration process spanned months, or even years.
Overall, it was a very informative conference with a wealth of real-world user case studies and information on storage technology from important suppliers. The conference agenda and presentations can be found here: STS2020. While the digital data explosion continues, technology is rapidly evolving to help organizations manage the challenge effectively.