Video Article Open Access
Combining the Power of High-Throughput ab initio Calculations and Machine Learning towards Materials Informatics
Gian-Marco Rignanese1,2,*
1Institute of Condensed Matter and Nanosciences (IMCN), UCLouvain, Chemin des Étoiles 8, 1348 Louvain-la-Neuve, Belgium
2School of Materials Science and Engineering, Northwestern Polytechnical University, Xi’an, Shaanxi 710072, People’s Republic of China
Vid. Proc. Adv. Mater., Volume 3, Article ID 2208315 (2022)
DOI: 10.5185/vpoam.2022.08315
Publication Date (Web): 13 Dec 2022
Copyright © IAAM
The progress in first-principles simulation codes and supercomputing capabilities have given birth to the so-called high-throughput (HT) ab initio approach, thus allowing for the identification of many new compounds for a variety of applications (e.g. lithium battery and photovoltaic). As a result, a number of databases have also become available online, providing access to various properties of materials, mainly ground‑state though. Indeed, for more complex properties (e.g., linear or higher‑order responses), the HT approach is still out of reach because of the required CPU time. To overcome this limitation, machine learning approaches have recently attracted much attention in the framework of materials design.
In this talk, I will review recent progress in the emerging field of materials informatics. I will briefly introduce the OPTIMADE API that was developed for searching the leading materials databases (such as AFLOW, the Materials Cloud, the Materials Project, NOMAD, OQMD, ...) using the same queries. I will introduce the MODNet framework and its recent developments for predicting materials properties. This approach, which is particularly well suited for limited datasets, relies on a feedforward neural network and the selection of physically meaningful features. Finally, I will illustrate the power of materials informatics, combining high‑throughput ab initio calculations and machine learning, through a few recent examples.