I am a Postdoctoral Researcher at Center for Applied Scientific Computing (CASC) in Lawrence Livermore National Laboratory (LLNL). I work on uncovering and solving the portability and performance bottlenecks in I/O for multi-stage workflows in large-scale supercomputer. I got my PhD in Computer Science Researcher at Illinois Tech (Formerly known as Illinois Institute of Technology). My research interests are in scalable scientific data management. More specifically, I am interested in parallel I/O, data management systems for managing scientific data, and heterogeneous computing. I am also interested in convergence of Big Data and HPC storage systems.
I have built several HPC software such as Hermes Container Library, Hierarchical Prefetching and Compression Software, Intelligent Compression framework, DLIO Benchmark, etc.
Goal of this project is to improve I/O performance by systematically understanding and optimizing large scale workflows executing on supercomputers.
The goal of this project is accelerate I/O of scientific AI applications using DL frameworks such as LBANN, TensorFlow, and PyTorch on supercomputers.
The goal of the project is to make storage systems workload-aware by extending I/O Interfaces to express user Intent for complex workflows in HPC.
Goal of this project is to improve I/O performance by systematically understanding and optimizing large scale workflows executing on supercomputers.
- Hariharan Devarajan and Kathryn Mohror. "Extracting and characterizing I/O behavior of HPC workloads". The 2022 IEEE International Conference on Cluster Computing (CLUSTER'22), September 6-9, 2022, Heidelberg, Germany.
- Hariharan Devarajan and Kathryn Mohror. “Mimir: Extending I/O Interfaces to Express User Intent for Complex Workloads in HPC.” 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS'23) St. Petersburg, Florida USA: iEEE, May 2023.
a user-space distributed library for enables efficient I/O pipeline for Deep Learning Applications. It enables a decoupled and asynchronous data pipeline paradigm.
- Hariharan Devarajan, Huihuo Zheng, Anthony Kougkas, Xian-He Sun, and Venkatram Vishwanath. "DLIO: A Data-Centric Benchmark for Scientific Deep Learning Applications". In 2021 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID). IEEE. Best Paper Award
- Hariharan Devarajan, Anthony Kougkas, Huihuo Zheng, Venkatram Vishwanath, and Xian-He Sun, "Stimulus: Accelerate Data Management for Scientific AI applications in HPC," In the proceedings of the 2022 IEEE/ACM International Symposium in Cluster, Cloud, and Internet Computing (CCGrid'22), Taormina, Italy, May 16-19, 2022.
a new, heterogeneous-aware, dynamic, and distributed I/O buffering system. Hermes enables, manages, supervises and extends I/O buffering to fully integrate into the DMSH.
- Jaime Cernuda, Hariharan Devarajan, Luke Logan, Neeraj Rajesh, Jie Ye, Anthony Kougkas, X.-H. Sun,
“HFlow: A Dynamic and Elastic Multi-Layered Data Forwarder”, The 2021 IEEE International Conference on Cluster Computing (CLUSTER'2021), September 7-10, 2021, Virtual Meeting, pp. 114-124, DOI: 10.1109/Cluster48925.2021.00064.
- Neeraj Rajesh, Hariharan Devarajan, Jaime Cernuda Garcia, Keith Bateman, Luke Logan, Jie Ye, Anthony Kougkas, and Xian-He Sun. 2021. "Apollo: An ML-assisted Real-Time Storage Resource Observer". In Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing (HPDC '21). Association for Computing Machinery, New York, NY, USA, 147–159. DOI:https://doi.org/10.1145/3431379.3460640
- Hariharan Devarajan, Anthony Kougkas, and Xian-He Sun. "HReplica: A Dynamic Data Replication Engine with Adaptive Compression for Multi-Tiered Storage." 2020 IEEE International Conference on Big Data (Big Data), Atlanta, Georgia, USA, 2020.
- Hariharan Devarajan, Anthony Kougkas, Keith Bateman, and Xian-He Sun. "HCL: Distributing Parallel Data Structures in Extreme Scales." In 2020 IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 2020.
- Hariharan Devarajan, Anthony Kougkas, Luke Logan, and Xian-He Sun. "HFetch: Hierarchical Data Prefetching for Scientific Workflows in Multi-Tiered Storage Environments," 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, Louisiana, USA, 2020.
- Hariharan Devarajan, Anthony Kougkas, Luke Logan, and Xian-He Sun. "HCompress: Hierarchical Data Compression for Multi-Tiered Storage Environments," 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, Louisiana, USA, 2020.
- Hariharan Devarajan, Anthony Kougkas, and Xian-He Sun. "An Intelligent, Adaptive, and Flexible Data Compression Framework", In Proceedings of the IEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing (CCGrid'19)
- Anthony Kougkas, Hariharan Devarajan, and Xian-He Sun. "Hermes: A Heterogeneous-Aware Multi-Tiered Distributed I/O Buffering System", In Proceedings of the ACM 27th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'18)
a new, distributed, Label- based I/O system utilizing asynchronous I/O, supports heterogeneous storage resources, with elasticity, and in-situ analytics.
- Anthony Kougkas, Hariharan Devarajan, Jay Lofstead, and Xian-He Sun. "LABIOS: A Distributed Label-Based I/O System", In Proceedings of the ACM 28th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'19) Best Paper Award
Address:
Suite 1014, B315,
Lawrence Livermore National laboratory
Livermore CA 94550
E-mail:
hariharandev1@llnl.gov