ElasticBroker: Combining HPC with Cloud to Provide Realtime Insights into Simulations

dc.contributor.authorLi, Feng
dc.contributor.authorWang, Dali
dc.contributor.authorYan, Feng
dc.contributor.authorSong, Fengguang
dc.contributor.departmentComputer and Information Science, School of Scienceen_US
dc.date.accessioned2022-04-01T20:40:20Z
dc.date.available2022-04-01T20:40:20Z
dc.date.issued2020
dc.description.abstractFor large-scale scientific simulations, it is expensive to store raw simulation results to perform post-analysis. To minimize expensive I/O, "in-situ" analysis is often used, where analysis applications are tightly coupled with scientific simulations and can access and process the simulation results in memory. Increasingly, scientific domains employ Big Data approaches to analyze simulations for scientific discoveries. However, it remains a challenge to organize, transform, and transport data at scale between the two semantically different ecosystems (HPC and Cloud systems). In an effort to address these challenges, we design and implement the ElasticBroker software framework, which bridges HPC and Cloud applications to form an "in-situ" scientific workflow. Instead of writing simulation results to parallel file systems, ElasticBroker performs data filtering, aggregation, and format conversions to close the gap between an HPC ecosystem and a distinct Cloud ecosystem. To achieve this goal, ElasticBroker reorganizes simulation snapshots into continuous data streams and send them to the Cloud. In the Cloud, we deploy a distributed stream processing service to perform online data analysis. In our experiments, we use ElasticBroker to setup and execute a cross-ecosystem scientific workflow, which consists of a parallel computational fluid dynamics (CFD) simulation running on a supercomputer, and a parallel dynamic mode decomposition (DMD) analysis application running in a Cloud computing platform. Our results show that running scientific workflows consisting of decoupled HPC and Big Data jobs in their native environments with ElasticBroker, can achieve high quality of service, good scalability, and provide high-quality analytics for ongoing simulations.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationLi, F., Wang, D., Yan, F., & Song, F. (2020). ElasticBroker: Combining HPC with Cloud to Provide Realtime Insights into Simulations. ArXiv:2010.04828 [Cs]. http://arxiv.org/abs/2010.04828en_US
dc.identifier.urihttps://hdl.handle.net/1805/28380
dc.language.isoenen_US
dc.publisherarXiven_US
dc.relation.journalarXiven_US
dc.rightsPublisher Policyen_US
dc.sourceArXiven_US
dc.subjectHPCen_US
dc.subjectcloud computingen_US
dc.subjectin-situ data analysisen_US
dc.titleElasticBroker: Combining HPC with Cloud to Provide Realtime Insights into Simulationsen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LiElasticBroker-AAM.pdf
Size:
767.42 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: