- See NEWS on the GitHub wiki, which includes information on the bi-weekly user telecons, including notes.
- We are working on a lightweight, secure application monitoring approach.
- Sandia-UIUC collaboration on AI for Supercomputer Diagnostics
- 2017: ISC High Performance (ISC) Gauss Award Winner: Diagnosing Performance Variations in HPC Applications Using Machine Learning, which uses LDMS monitoring data as the basis for machine-learning-based performance diagnosis
- LDMS wins 2015 R&D 100 award! LDMS Video
- 2015: ASCR awarded the Resilience project "Holistic Measurement Driven Resilience: Combining Operational Fault and Failure Measurements and Fault Injection for Quantifying Fault Detection and Impact"
OVIS/LDMS can be obtained from github.com/ovis-hpc
- LDMS v4 is available at the GitHub site!
- The current distribution includes only the OVIS/LDMS monitoring, transport, and storage components.
Upcoming HPC Monitoring and Analysis Conference Events
- Workshop on Monitoring and Analysis for HPC Systems Plus Applications (HPCMASPA), held in conjunction with IEEE Cluster 2020 in September 2020 in Kobe, Japan.
- LDMSCON -- 2020 announcement coming soon!