Toward an Interoperable Perovskite Description
Hybrid perovskites are interesting optoelectronic materials. The perovskite ABX3 structure offers a vast compositional space, with over 300 perovskite ions identified. This flexibility enables tunable properties and has significantly contributed to the success of perovskite optoelectronics. However, this diversity also leads to confusion, ambiguity, and inconsistencies causing challenges for data mining and machine learning applications.
To address this issue, guidelines and a JSON schema are proposed to standardize the reporting of perovskite compositions. The schema adheres to IUPAC recommendations and is designed to make data both human- and machine-readable. It captures key descriptors such as perovskite composition, molecular formula, SMILES representation, IUPAC name, and CAS number for each ion. To facilitate adoption, utilities have been developed to automatically generate comprehensive and standardized perovskite descriptions from standard ion abbreviations and stoichiometric coefficients. Additionally, a curated database has been provided of all identified perovskite ions with associated descriptive data.