TY - JOUR
T1 - Integrated workflows and interfaces for data-driven semi-empirical electronic structure calculations
AU - Stishenko, Pavel
AU - McSloy, Adam
AU - Onat, Berk
AU - Hourahine, Ben
AU - Maurer, Reinhard
AU - Kermode, James
AU - Logsdail, Andrew
PY - 2024/7/3
Y1 - 2024/7/3
N2 - Modern software engineering of electronic structure codes has seen a paradigm shift from monolithic workflows toward object-based modularity. Software objectivity allows for greater flexibility in the application of electronic structure calculations, with particular benefits when integrated with approaches for data-driven analysis. Here, we discuss different approaches to create deep modular interfaces that connect big-data workflows and electronic structure codes and explore the diversity of use cases that they can enable. We present two such interface approaches for the semi-empirical electronic structure package, DFTB+. In one case, DFTB+ is applied as a library and provides data to an external workflow; in another, DFTB+ receives data via external bindings and processes the information subsequently within an internal workflow. We provide a general framework to enable data exchange workflows for embedding new machine-learning-based Hamiltonians within DFTB+ or enabling deep integration of DFTB+ in multiscale embedding workflows. These modular interfaces demonstrate opportunities in emergent software and workflows to accelerate scientific discovery by harnessing existing software capabilities.
AB - Modern software engineering of electronic structure codes has seen a paradigm shift from monolithic workflows toward object-based modularity. Software objectivity allows for greater flexibility in the application of electronic structure calculations, with particular benefits when integrated with approaches for data-driven analysis. Here, we discuss different approaches to create deep modular interfaces that connect big-data workflows and electronic structure codes and explore the diversity of use cases that they can enable. We present two such interface approaches for the semi-empirical electronic structure package, DFTB+. In one case, DFTB+ is applied as a library and provides data to an external workflow; in another, DFTB+ receives data via external bindings and processes the information subsequently within an internal workflow. We provide a general framework to enable data exchange workflows for embedding new machine-learning-based Hamiltonians within DFTB+ or enabling deep integration of DFTB+ in multiscale embedding workflows. These modular interfaces demonstrate opportunities in emergent software and workflows to accelerate scientific discovery by harnessing existing software capabilities.
KW - density-functional tight-binding
KW - density functional theory
KW - electronic structure methods
KW - electronic band structure
KW - electrostatics
KW - software engineering
KW - high performance computing
KW - machine learning
KW - application programming interface
KW - atomic and molecular clusters
UR - https://github.com/dftbplus/dftbplus
UR - https://github.com/ACEsuit/ACEhamiltonians.jl
UR - https://gitlab.com/pvst/asi
UR - https://pvst.gitlab.io/asi.
U2 - 10.1063/5.0209742
DO - 10.1063/5.0209742
M3 - Article
SN - 0021-9606
VL - 161
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 1
M1 - 012502
ER -