A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids

dc.contributor.authorZheng, Xiangtian
dc.contributor.authorXu, Nan
dc.contributor.authorTrinh, Loc
dc.contributor.authorWu, Dongqi
dc.contributor.authorHuang, Tong
dc.contributor.authorSivaranjani, S.
dc.contributor.authorLiu, Yan
dc.contributor.authorXie, Le
dc.contributor.departmentEngineering Technology, School of Engineering and Technologyen_US
dc.date.accessioned2023-07-10T12:42:55Z
dc.date.available2023-07-10T12:42:55Z
dc.date.issued2022-06-22
dc.description.abstractThe electric grid is a key enabling infrastructure for the ambitious transition towards carbon neutrality as we grapple with climate change. With deepening penetration of renewable resources, the reliable operation of the electric grid becomes increasingly challenging. In this paper, we present PSML, a first-of-its-kind open-access multi-scale time-series dataset, to aid in the development of data-driven machine learning (ML)-based approaches towards reliable operation of future electric grids. The dataset is synthesized from a joint transmission and distribution electric grid to capture the increasingly important interactions and uncertainties of the grid dynamics, containing power, voltage and current measurements over multiple spatio-temporal scales. Using PSML, we provide state-of-the-art ML benchmarks on three challenging use cases of critical importance to achieve: (i) early detection, accurate classification and localization of dynamic disturbances; (ii) robust hierarchical forecasting of load and renewable energy; and (iii) realistic synthetic generation of physical-law-constrained measurements. We envision that this dataset will provide use-inspired ML research in safety-critical systems, while simultaneously enabling ML researchers to contribute towards decarbonization of energy sectors.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationZheng X, Xu N, Trinh L, et al. A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids. Sci Data. 2022;9(1):359. Published 2022 Jun 22. doi:10.1038/s41597-022-01455-7en_US
dc.identifier.urihttps://hdl.handle.net/1805/34262
dc.language.isoen_USen_US
dc.publisherSpringer Natureen_US
dc.relation.isversionof10.1038/s41597-022-01455-7en_US
dc.relation.journalScientific Dataen_US
dc.rightsAttribution 4.0 International*
dc.rights.urihttps://creativecommons.org/licenses/by/4.0*
dc.sourcePMCen_US
dc.subjectEnergy gridsen_US
dc.subjectEnergy networksen_US
dc.subjectComputer scienceen_US
dc.subjectElectrical engineeringen_US
dc.subjectElectronic engineeringen_US
dc.subjectEnergy modellingen_US
dc.titleA multi-scale time-series dataset with benchmark for machine learning in decarbonized energy gridsen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
41597_2022_Article_1455.pdf
Size:
11.24 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: