Dr. John Dennis

Size: px

Start display at page:

Download "Dr. John Dennis"

Warren Black
5 years ago
Views:

1 Dr. John Dennis June 23,

2 High-resolution climate generates a large amount of data! June 23,

3 PIO update and Lustre optimizations How do we analyze high-resolution climate data faster? Wavelet compression for climate data June 23,

4 June 23,

5 Goals: Reduce memory usage Improve performance Writing a single file from parallel application Flexibility in I/O libraries MPI-IO,NetCDF3, NetCDF4, pnetcdf June 23,

6 Computational decomposition I/O decomposition Rearrangement Match I/O decomposition to Lustre stripe size June 23,

7 Read/write 3D POP sized variable [3600x2400x40] 10 files, 10 variables per file, [max bandwidth] Using Kraken (Cray XT5) + Lustre filesystem Used 16 of 336 OST Impact of PIO features Flow-control Vary number of IO-tasks Different general I/O backends Did we achieve our design goals? June 23,

8 June 23,

9 June 23,

10 June 23,

11 June 23,

12 John Dennis, Matthew Woitaszek (NCAR) Taleena Sines* (Frostburg State) Parallel high-resolution climate data analysis using Swift, Linux Clusters [under review] * SIParCS Intern June 23,

13 Used Swift a workflow language (UC/ANL) Parallel scripting language Data dependency driven Examine performance on several dataintensive architectures Flash memory: Dash (SDSC) SGI UV: Nautilus (NICS) Large memory node: Polynya (NCAR) 32 cores 1 TB ram [ 512 GB memory/ 512 GB ramdisk] 120 TB GPFS file-system (old hardware) June 23,

14 4 mins June 23,

15 John Clyne, Yanick Polius, John Dennis (NCAR) June 23,

16 Compression algorithm: Apply wavelet transform to model outputs Sort resulting wavelet coefficients by absolute magnitude Discard smallest coefficients Reconstruct an approximation of original data from remaining coefficients Compare original and reconstructed data using CESM AMWG Diagnostic Package June 23,

17 Preliminary experiment 10 years of 0.5 deg CAM Compression 2D variables 2:1 3D variables 8:1 Evaluate using student T-test Outcomes/Issues Looks great! Limited temporal variability Does not preserve zeros June 23,

18 June 23,

19 June 23,

20 June 23,

21 NCAR: NICS: D. Bailey F. Bryan M. Fahey T. Craig P. Kovatch B. Eaton J. Edwards [IBM] ANL: N. Hearn R. Jacob K. Lindsay R. Loy N. Norton M. Vertenstein LANL: COLA: E. Hunke J. Kinter P. Jones C. Stan M. Maltrud U. Miami LLNL B. Kirtman D. Bader U.C. Berkeley W. Collins D. Ivanova K. Yelick (NERSC) J. McClean (Scripps) U. Washington A. Mirin C. Bitz ORNL: P. Worley and many more Grant Support: DOE DE-FC03-97ER62402 [SciDAC] DE-PS02-07ER07-06 [SciDAC] NSF Cooperative Grant NSF01 OCI [PetaApps] CNS CNS CNS Computer Allocations: TeraGrid NICS DOE NERSC LLNL Grand Challenge Thanks for Assistance: Cray, NICS, and NERSC June 23,

22 June 23,

23 Computational decomposition + Simple + Most versions of MPI-IO will do aggregation - Computational decomposition may not be optimal for disk access - pnetcdf requires block cyclic decompositions June 23,

to disk - Non-scalable user space buffering - Very

24 Computational decomposition I/O decomposition Rearrangement + Maximize size of individual io-op s to disk - Non-scalable user space buffering - Very large fan-in large MPI buffer allocations June 23,

25 Computational decomposition I/O decomposition Rearrangement + Scalable user space memory + Relatively large individual io-op s to disk - Very large fan-in large MPI buffer allocations June 23,

26 Computational decomposition I/O decomposition Rearrangement + Scalable user space memory + Smaller fan-in -> modest MPI buffer allocations - Smaller individual io-op s to disk June 23,

27 June 23,

28 Supported parallel I/O library in CCSM4 & CESM1 release Addition of Flow-control algorithms (Worley) Initial documentation using Doxygen Small but growing user base ESMF VAPOR + wavelet compression June 23,

Parallel File Systems Compared

Parallel File Systems Compared Computing Centre (SSCK) University of Karlsruhe, Germany Laifer@rz.uni-karlsruhe.de page 1 Outline» Parallel file systems (PFS) Design and typical usage Important features