ML-driven method for detecting and localizing failures nominated for Best Paper and Best Student Paper at Supercomputing 2020
Saurabh Jha, Shengkun Cui, Subho Banerjee, Tianyin Xu, Jeremy Enos, Mike Showerman, Zbigniew T. Kalbarczyk, Ravishankar K. Iyer (2020). Live Forensics for HPC Systems: A Case Study on Distributed Storage Systems. Proceedings of the International Conference for High-Performance Computing, Networking, Storage and Analysis (SC 2020).