Download PDF
Research Article  |  Open Access  |  19 Feb 2025

PINK: physical-informed machine learning for lattice thermal conductivity

Views: 206 |  Downloads: 32 |  Cited:  0
J. Mater. Inf. 2025, 5, 12.
10.20517/jmi.2024.86 |  © The Author(s) 2025.
Author Information
Article Notes
Cite This Article

Abstract

Lattice thermal conductivity (κL) is crucial for efficient thermal management in electronics and energy conversion technologies. Traditional methods for predicting κL are often computationally expensive, limiting their scalability for large-scale material screening. Empirical models, such as the Slack model, offer faster alternatives but require time-consuming calculations for key parameters such as sound velocity and the Grüneisen parameter. This work presents a high-throughput framework, physical-informed kappa (PINK), which combines the predictive power of crystal graph convolutional neural networks (CGCNNs) with the physical interpretability of the Slack model to predict κL directly from crystallographic information files (CIFs). Unlike previous approaches, PINK enables rapid, batch predictions by extracting material properties such as bulk and shear modulus from CIFs using a well-trained CGCNN model. These properties are then used to compute the necessary parameters for κL calculation through a simplified physical formula. PINK was applied to a dataset of 377,221 stable materials, enabling the efficient identification of promising candidates with ultralow κL values, such as Ag3Te4W and Ag3Te4Ta. The platform, accessible via a user-friendly interface, offers an unprecedented combination of speed, accuracy, and scalability, significantly accelerating material discovery for thermal management and energy conversion applications.

Keywords

Physical-informed machine learning, thermoelectrics, lattice thermal conductivity, phonon engineering

INTRODUCTION

Understanding the temperature dependence of lattice thermal conductivity (κL) is essential for assessing the thermal transport capabilities of a material. This property plays a crucial role in both scientific research and industrial applications, including thermal management in microelectronics[1,2], energy conversion[3], and temperature regulation[4]. For example, materials exhibiting high κL, such as boron arsenide (BAs), are particularly suitable for heat dissipation in gallium nitride devices[5]. Conversely, materials with low κL can improve thermoelectric conversion efficiency by enabling the effective transformation of waste heat into electrical energy[6].

In recent years, significant theoretical advancements have been made in the theoretical prediction of κL in solid materials[7-10]. A widely used approach for predicting κL involves solving the phonon Boltzmann transport equation (PBTE) within the framework of density functional theory (DFT)[9], while classical molecular dynamics (MD) simulations are particularly useful for systems with complex crystal structures[11]. However, identifying materials with exceptionally low or high κL remains a significant challenge, mainly due to the high computational costs and time-consuming synthesis processes[12]. Moreover, calculations required to obtain interatomic force constants (IFCs) are especially demanding for large, low-symmetry primitive cells[13]. Furthermore, the reliability of MD simulations is strongly dependent on the selection of interatomic potentials, limiting their broader applicability[14]. Besides these challenges, significant progress has been made in accelerating material discovery and improving performance. Luo et al. reviewed the application of machine learning (ML) for predicting κL, emphasizing the potential of high-throughput predictions and ML potentials (MLPs) to overcome the limitations of traditional approaches[13]. Liu et al. focused on active and reversible techniques for regulating κL, such as the use of ferroelectric, ferromagnetic, and nanomaterials, enabling dynamic control of thermal conductivity for efficient thermal management[15]. Additionally, Shi et al. examined advancements in thermoelectric materials for multifunctional energy conversion and storage technologies, highlighting ongoing challenges related to scalability, material stability, and efficiency that must be addressed to fully realize their potential in practical applications[16]. Consequently, rapid determination of κL is crucial for advancing these materials.

Alternatively, empirical models such as the Debye-Callaway model[17,18] and the Slack model[19,20] provide faster and more cost-effective approaches for estimating κL. The Slack model, in particular, has been widely applied to predict κL in a variety of materials[21-26]. For instance, Qin et al. successfully employed the model to quickly predict thermal conductivity, offering valuable insights into thermal transport behavior[27]. Cao et al. explored the n-type thermoelectric properties of ABO3 cubic chalcogenides using a high-throughput method combined with Slack modeling[28]. They screened 46 stable materials, identified four conduction band minima structures, investigated the influence of chemical bonding on transport properties, and shortlisted 13 candidates with high thermoelectric figure of merit (ZT) values. However, the model’s reliance on experimental data or first-principles calculations for several parameters limits its scalability for large-scale, high-throughput screenings. Obtaining critical parameters, such as average sound velocity, acoustic Debye temperature, and the Grüneisen parameter, often requires considerable time and resources, posing a significant barrier.

In our previous work[29], we proposed a refined formula based on the Slack model, which enables highly accurate predictions of κL with an 8.97% mean relative error. The formula utilizes only the shear modulus, average sound velocity, and Grüneisen parameter, all of which are relatively easy to obtain. For example, the bulk modulus (B) and shear modulus (G) can be used to derive the average sound velocity in a material[30]. Additionally, significant research has been conducted to simplify the estimation of the Grüneisen parameter. Belomestnykh[31] developed a method that links Poisson’s ratio with sound velocity and elastic properties, yielding results consistent with quasiharmonic lattice dynamics calculations. This work underscores the strong relationship between elastic modulus and κL[29]. The crystallographic information file (CIF) provides comprehensive data on crystal structures, including lattice constants, crystal systems, density, and other key parameters. However, existing approximation methods have not fully exploited this information to predict κL. Only a limited number of studies have directly connected CIF data with κL for fast, high-throughput predictions. For example, Ju et al. used a neural network that leveraged descriptors from a pre-trained model to establish a relationship between crystal information and thermal conductivity[32]. Xie et al. introduced the crystal graph convolutional neural network (CGCNN) method[33], which converts crystal structure data into graph representations, enabling convolutional neural networks to predict the relationship between crystal features and thermoelectric properties[34,35]. Recently, Omee et al. reviewed the performance of five out-of-distribution (OOD) test sets across eight graph neural network (GNN) models using elasticity datasets[36]. Notably, the CGCNN model achieved the best mean absolute error (MAE) for both the Leave-One-Cluster-Out (LOCO[37]) test [0.0585 log10 (GPa)] and the SparseXsingle test [0.0499 log10 (GPa)], targeting structures with the lowest density and surpassing the other seven GNN models. These findings highlight the excellent generalization capabilities of CGCNN models.

In this work, to enable rapid and high-throughput κL predictions, we integrate the physical interpretability of our derived formula with the predictive power of the CGCNN model. This study introduces a high-throughput framework that combines a trained modulus model with our formula, facilitating the fast estimation of κL directly from CIF files. Encapsulated in a custom-developed web application, physical-informed kappa (PINK), this process enables batch predictions of κL within seconds of uploading CIF files. Users can also customize inputs such as bulk modulus, shear modulus, and Grüneisen parameter.

Our framework begins by extracting crystallographic information from the CIF files and utilizing the trained CGCNN model to predict the bulk and shear modulus. Subsequently, physical models are applied to calculate the average sound velocity and Grüneisen parameters, which are then incorporated into a formula to calculate the κL. Using this approach, we predict κL for 377,221 stable materials identified by Merchant et al. through graph networks[38]. Building on these high-throughput predictions, we develop an efficient method to accelerate the screening of materials with ultralow κL, applicable to any inorganic crystal structure with one or more CIF files. This method enabled the identification of thousands of promising materials with low κL from over 370,000 inorganic crystalline samples, with minimal computational cost. To validate our results, we confirm the ultralow κL values for Ag3Te4X (X = W, Ta) through first-principles calculations. The PINK application, powered by the CGCNN model, serves as a powerful tool for rapid material pre-screening. It provides researchers with an efficient, user-friendly platform for estimating κL, accelerating the discovery of materials with optimal thermal properties.

MATERIALS AND METHODS

CGCNN algorithms

Before presenting the framework of PINK, it is essential to clarify the method by which the CGCNN model predicts material properties based on crystal structures. CGCNN, an advanced ML algorithm, uses trained models to predict material properties with high efficiency[33]. The crystal structure is converted into a graph representation, where nodes correspond to atoms, and edges represent the bonds between them. This format allows the model to capture the local chemical environment.

Through convolutional and pooling layers, CGCNN autonomously identifies critical features necessary for predicting various material properties, such as bulk modulus and shear modulus. These predictions are both accurate and interpretable, providing valuable insights for the rational design of new materials. Moreover, the robust generalization capabilities of this model enable it to handle diverse crystal structures and compositions, significantly accelerating the material discovery process[33,34,39]. In this study, the elastic modulus dataset was split into training, validation, and test sets with a ratio of 80%, 10%, and 10%, respectively. The model consisted of three convolutional layers and two hidden layers, and was trained for 30 epochs with a learning rate of 0.01.

The PINK framework

As illustrated in Figure 1, we present a comprehensive workflow for calculating κL using our automated property prediction system. This workflow is designed to be user-friendly, requiring only CIF files as input. To ensure accurate calculations, the system automatically converts the uploaded crystal structure into its primitive cell format, which is essential for both CGCNN predictions and the parameters used in Equation (2). The process begins by extracting fundamental crystallographic data from the CIF file, including the primitive cell volume, number of atoms, and density. Next, our embedded CGCNN model, trained on extensive material data, predicts the bulk modulus and shear modulus. Using these predicted values, custom Python scripts calculate additional physical parameters crucial for estimating κL, including the longitudinal and transverse sound velocities, the average speed of sound, and the Grüneisen parameters. Finally, all of these calculated quantities are systematically incorporated into Equation (2) to compute κL. This automated workflow significantly streamlines the process of κL prediction, making it accessible to researchers without requiring in-depth expertise in each individual computational step.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 1. The workflow for calculating κL using PINK begins with the input of CIF files representing crystal structures. Starting with these CIF files, the framework utilizes CGCNN to predict the bulk and shear modulus, while also extracting crystal information such as volume, number of atoms, and density. These parameters are subsequently used to calculate both longitudinal and transverse sound velocities, which are essential for determining the Grüneisen parameter and the average speed of sound. All of these parameters are incorporated into Equation (2), which includes the Grüneisen parameter (γ), volume (V), temperature (T), and other variables necessary for predicting κL. PINK: Physical-informed kappa; CIF: crystallographic information file; CGCNN: crystal graph convolutional neural network.

The application provides comprehensive physical property data for 377,221 new materials, including 11,869 materials screened in this study. The modified open-source CGCNN code used for predicting bulk and shear modulus, as well as the Python scripts for CIF file processing and calculation execution (e.g., “app.py”), is also available. All of these data and codes are accessible via the following link: https://github.com/Jack-Liu0227/AI4Kappa.

Surrogate an interpretable formula for κL

Recently, Wang et al. proposed a simple and universal empirical formula that exhibits strong generalization ability and provides clear physical insights for κL of crystals, which is given as[29]:

$$ \mathit{κ}_L=\frac{G\upsilon _sV^{\frac{1}{3}}}{nT^{\delta}}\cdot e^{-\gamma}, $$

where G is the shear modulus, υs represents the average sound velocity, V is the volume of the primitive cell, n is the number of atoms in the primitive cell, δ lies between 1 and 2 (with δ = 1 for three-phonon scattering), T is the temperature in Kelvin, and γ denotes the Grüneisen parameter. It is important to note that κL and υ2 in Equation (1) do not exhibit a conventional proportional correlation, as both G and γ are functionally dependent on υ (see Supplementary Materials for details)[29].

The theoretical basis of the power law is complex, involving competition between scattering processes driven by cubic and quartic anharmonic terms[40,41]. For simplicity, we focus only on three-phonon scattering, assuming δ = 1:

$$ \mathit{κ}_L=\frac{G\upsilon _sV^{\frac{1}{3}}}{nT}\cdot e^{-\gamma}, $$

Which, derived from Slack’s approach[23] is useful for evaluating κL across various materials. A key aspect in evaluating κL involves determining the average speed of sound (υs) and the Grüneisen parameter (γ). Jia et al. proposed that υs can be accurately estimated from elastic properties [bulk modulus (B) and shear modulus (G)][42]. This approach is computationally more efficient than experimental methods or costly lattice dynamics simulations. The bulk modulus (B) and shear modulus (G) can both be extracted from our trained CGCNN model, providing an alternative means of estimating elastic properties and sound velocities, as demonstrated in[27,42].

$$ \upsilon_l=\sqrt{\frac{B+\frac{4}{3}G}{\rho }}, $$

$$ \upsilon_t=\sqrt{\frac{G}{\rho }}, $$

$$ \upsilon_s=\left \{ \frac{1}{3}\left [ \frac{1}{\upsilon_l^3}+\frac{2}{\upsilon_t^3} \right ] \right \}^{-\frac{1}{3}}, $$

where υs, υl, and υt are the average sound velocity, longitudinal sound velocity, and transverse sound velocity, respectively, and ρ is the material density.

After estimating υs from the bulk modulus (B) and shear modulus (G), the next step is to determine the Grüneisen parameter, which quantifies the anharmonicity of the material[43]. The speed of sound serves as an indicator of the strength of atomic interactions, with weaker interactions generally leading to lower sound velocities. It has been shown that the relationship between Poisson’s ratio (v) and γ is as follows[31,44]:

$$ v=\frac{x^2-2}{2x^2-2},\ \gamma =\frac{3}{2}\left ( \frac{1+v}{2+3v} \right ). $$

where x represents the ratio of longitudinal to transverse sound velocity, x = υl/υt.

Using the above method, both υs and γ can be estimated quickly from elastic properties, particularly the shear and bulk modulus. Previous studies indicate that this approach aligns well with experimental results for cubic, isotropic, and quasi-isotropic structures[42,45,46].

Deployment of PINK for κL

Streamlit is an open-source Python library that simplifies the creation of custom web apps for data-driven applications. It facilitates rapid development of interactive apps by converting Python scripts into shareable web applications in just a few minutes. Figure 2 illustrates the process of deploying an app: first, upload the project code to GitHub, and then create the application on the Streamlit platform by selecting the relevant project branch and Python file (e.g., app.py) to run. Setting up the application environment can be challenging, as web applications typically require multiple Python packages with specific versions. Fortunately, our code streamlines this process by including a requirements.txt file to ensure all dependencies are installed correctly. This allows the application to be deployed entirely in Python without requiring front-end experience. By leveraging Equation (2) and the CGCNN model, we developed the κL calculation application based on this framework.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 2. PINK code deployment process. To deploy and run the web application, one first uploads the code - along with the “app.py”, “requirements.txt”, and any other necessary files to GitHub. Then, we use the Streamlit platform to deploy the application online. PINK: Physical-informed kappa.

After deploying the application, users can quickly calculate a material’s elastic properties, κL, and other relevant outputs by uploading CIF files. The results are displayed on the website in a DataFrame format, and users can download them as CSV files. An illustration of the program’s interface is shown in Figure 3. Importantly, the app supports uploading single or multiple CIF files simultaneously, running the entire framework in parallel to provide results for all materials at once.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 3. The web page for our PINK app is divided into two panels. The left panel allows users to upload files, while the right panel displays the results. The output includes a DataFrame that lists various properties such as the number of atoms, density (g/cm-3), volume (Å3), atomic mass (amu), bulk modulus (GPa), shear modulus (GPa), transverse and longitudinal wave sound velocities (m/s), speed of sound (m/s), Poisson’s ratio (v), Grüneisen parameter (γ), acoustic Debye temperature (θa, K), and lattice thermal conductivity (W·m-1·K-1). For detailed instructions on using PINK, please refer to the PINK_tutorial.mp4. Additionally, the app supports custom functions for calculating bulk modulus (GPa), shear modulus (GPa), and Grüneisen parameter, with a separate tutorial available in PINK_Custom_Parameters_tutorial.mp4. PINK: Physical-informed kappa.

Our application, PINK, is easily accessible via the following link: https://kappap-ai.streamlit.app, which can be used both on your phone and on your local computer. If you prefer to deploy it locally or on Streamlit’s server, please refer to the README.md for detailed instructions on setting up the software.

Calculating κL using ab initio study

We implemented ab initio study through the Vienna Ab-initio Simulation Package (VASP)[47]. The calculations incorporated the projector augmented-wave (PAW) approach combined with the Perdew-Burke-Ernzerhof (PBE) functional for exchange-correlation[48-50]. To achieve high computational precision, we selected a 520 eV planewave cutoff energy alongside a Monkhorst-Pack sampling with a 4 × 4 × 4 k-mesh. The computational parameters were optimized with convergence thresholds of 10-8 eV for total energy and 10-4 eV/Å for atomic forces. In determining the second-order IFCs, our calculations utilized a supercell configuration of 2 × 2 × 2, employing the finite displacement methodology with a 4 × 4 × 4 k-point mesh. The third-order software package[40] was subsequently used to extract the third-order IFCs. The κL calculations, which account for three-phonon scattering processes, were performed using ShengBTE with a dense 20 × 20 × 20 q-point sampling[51].

RESULTS AND DISCUSSION

To streamline the time-intensive process of learning CGCNN for predicting material properties and handling file processing, we developed a high-throughput framework encapsulated in a user-friendly application. The interface allows researchers to input single or multiple CIF files simultaneously, generating instantaneous κL predictions for the specified compounds. The efficiency of this framework arises from its integration of pre-trained CGCNN models with Equation (2), providing rapid assessments of thermal transport properties.

Given that thermoelectric performance is strongly influenced by materials with low κL, we conducted a high-throughput screening across material dataset. This systematic evaluation successfully identified 11,869 potential candidates with promising thermal transport characteristics. To validate our screening approach, we selected Ag3Te4X (X = W, Ta) from a specific ternary system and performed detailed ab initio calculations to verify their properties.

Data collection

For evaluating ML performance in materials science applications, we utilized the Matbench V0.1[52]. Our analysis focused on two specific datasets within this collection that address elastic properties: “matbench_log_gvrh” and “matbench_log_kvrh”. These elastic modulus datasets contain identical material entries (10,987 in total) and are specifically designed to predict the logarithmic values of shear modulus (G) and bulk modulus (B) using the Voigt-Reuss-Hill (VRH) averaging methodology. The comprehensive nature of these standardized collections makes them ideal for training our ML models to predict key elastic characteristics. A detailed analysis of the dataset is presented in Figure 4, which illustrates the statistical distribution across three key aspects: the classification of crystal systems, the number of atoms in the primitive cell, and the distribution of chemical elements. The dataset exhibits remarkable diversity, incorporating materials from all seven fundamental crystal systems and 84 different elements in various structural arrangements.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 4. Statistical analysis of the training dataset. (A) The distribution of seven crystal systems, with cubic being the most common (3,847 structures), followed by tetragonal (2,055 structures), while triclinic is the least one (199 structures); (B) Distribution of range of number of atoms in the primitive cell (1-160 atoms) across the dataset; (C) Elemental distribution that illustrates the frequency of 84 distinct elements. The dataset encompasses transition metals, main group elements, and rare earth elements, with oxygen showing the highest frequency.

For predicting κL, we utilized datasets obtained from the AFLOW database[53] and relevant publications[20,54-57], which include both experimentally measured values and computationally derived properties. The AFLOW database provides comprehensive information on crystal structures and thermal characteristics, with κL values calculated using the methodologies outlined in[29,58]. To construct a test dataset for evaluating our application, we collected crystal structures, Grüneisen parameters (γ), and their corresponding κL values from AFLOW and other literature.

Obtaining high-throughput datasets for computational materials science can be challenging. However, recent advancements in ML have significantly enhanced the discovery of stable materials. Merchant et al. employed deep learning and GNNs to scale materials discovery, particularly for inorganic crystals[38]. Their work expanded the known set of stable materials by adding 381,000 new entries to the convex hull, resulting in a total of 377,221 stable crystal structures - a tenfold increase over previous datasets.

We accessed their extensive dataset through GitHub: https://github.com/google-deepmind/materials_discovery. The repository includes 377,221 valid CIF files in the “by_composition” folder, compatible with CGCNN, and a summary CSV file containing bandgap, crystal symmetry, and decomposition energy data. These materials encompass compositions ranging from two to six elements, with atomic numbers spanning from 2 to 106. Additional data from the Materials Project further complements these datasets, enabling the targeted retrieval of material properties through its open-source API. Together, these resources provide a robust foundation for high-throughput computations and analyses.

Model evaluation of CGCNN

The interpretable formula, detailed in the methods section, elucidates the correlation between elastic modulus and κL. In ML, MAE and R2 (R-squared) are standard metrics for evaluating regression models. MAE quantifies the average magnitude of prediction errors and is defined as:

$$ \mathrm{MAE}=\frac{1}{n}\sum_{i=1}^n|y_i-\hat{y}_i|, $$

where yi denotes the actual values and $$ \hat{y}_i $$ the predicted values. Lower MAE values denote higher prediction accuracy. R2 assesses the proportion of variance in the dependent variable explained by:

$$ R^2=1-\frac{\sum_{i=1}^n(y_i-\hat{y}_i)^2}{\sum_{i=1}^n(y_i-\bar{y})^2}. $$

where $$ \bar{y} $$ represents the mean of the actual values. R2 values approaching 1 indicate a better model fit.

The performance of the embedded CGCNN model on the test dataset is illustrated in Figure 5. The MAE for both the shear and bulk moduli is below 13, with R2 values approaching 1, indicating a strong correlation between the predicted and DFT-calculated elastic modulus. These results demonstrate the model’s reliability and predictive accuracy.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 5. The comparison of predicted vs. DFT-calculated values for (A) bulk modulus and (B) shear modulus across the test dataset of 10,987 structures. DFT: Density functional theory.

Model evaluation of κL

After predicting the shear and bulk moduli using the trained CGCNN model, the approximate average speed of sound was estimated. Utilizing known crystal structure information, we applied Equation (2) to approximate the material’s κL. To validate the PINK application, we compared its predictions with κL values calculated via DFT for 2,535 materials from the AFLOW database[53] and 46 experimentally measured values from the literature[20,54-57]. The Grüneisen parameters were obtained from the AFLOW database and experimental data, respectively.

Figure 6 presents scatter plots comparing κL predictions by PINK with calculated and experimentally measured values. In Figure 6A, each point represents a material, with the solid diagonal line indicating perfect agreement between predicted and calculated values. The dashed lines denote an acceptable range of deviation. Within the dataset, 2,415 points (95.27%) fall within this range, highlighting the model’s high accuracy. The clustering of points near the diagonal line further confirms a strong correlation between PINK predictions and DFT calculations. Deviations are likely attributable to the inapplicability of certain materials to the Slack model or inaccuracies in the elastic modulus predictions[43].

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 6. The comparison between κL values predicted by PINK using (A) AFLOW[59] and (B) experimental[20,54-57] Grüneisen parameters κL values at 300 K. Dashed lines indicate deviations within half an order of magnitude from reference values. PINK: Physical-informed kappa.

In Figure 6B, κL predictions from PINK are compared with experimental values, with each point labeled by the corresponding material. Similar to Figure 6A, the solid diagonal line represents perfect agreement, and the dashed lines denote acceptable deviation boundaries. The model achieves a MAE of 0.526 and an R2 value of 0.881, indicating a close correspondence between PINK predictions and experimental results. The clustering of data points near the diagonal line demonstrates that PINK effectively predicts κL across diverse materials and crystal symmetries.

For additional validation, Table 1 presents the predicted κL values alongside their experimental counterparts for 46 materials. These results further substantiate the reliability and effectiveness of PINK in predicting κL, showing close alignment with both DFT-calculated and experimentally measured values. Notably, the accuracy of κL predictions could be significantly enhanced with more precise Grüneisen parameters and improved predictions of shear and bulk moduli[29].

Table 1

Predicted and experimental room-temperature κL values for compounds from the literature[20,54-57] are presented

MaterialsID-numbernρ (g·cm3)V3)G (GPa)υs (m·s-1)γκexp (W·m-1·K-1)κPINK (W·m-1·K-1)
AgCl[54]mp-2292225.58342.6318.8011,423.5971.90011.091
AlAs[20]mp-217223.59147.12640.5103,733.0870.6609847.054
AlSb[20]mp-262424.07860.56128.8082,959.0470.6005630.620
BN[20]mp-163923.45811.919350.20310,995.5290.700760727.981
BP[20]mp-147922.95323.500201.2929,011.0730.923 [by Equation (6)]350344.207
C[20]mp-6623.49611.410547.43613,613.3980.7503,0001,320.871
CaO[20]mp-260523.28728.33263.3164,863.6221.5702732.552
CdTe[20]mp-40625.47372.82714.3491,818.4340.5207.510.797
GaAs[20]mp-253425.05347.53245.1523,291.3630.7504542.381
GaP[20]mp-349024.00641.73748.2963,846.0050.75010050.725
GaSb[20]mp-115625.28860.13328.0332,557.7480.7504022.115
Ge[20]mp-3225.04247.84747.1343,362.1091.0606533.219
InAs[20]mp-2030525.33659.05023.6622,355.1220.5703020.454
InP[20]mp-2035124.58252.84027.5542,742.7620.6009325.940
InSb[20]mp-2001225.38472.96517.6682,026.6680.5602014.244
KBr[20]mp-2325122.62475.2946.1561,709.4461.4503.41.737
KBr[20]mp-57089122.98966.1118.3911,885.9431.4503.42.502
KCl[20]mp-2319321.90465.0336.3872,050.8031.4507.12.059
KI[20]mp-2289822.97292.7435.7901,546.9141.4502.61.585
LiF[20]mp-100900922.56916.76858.1745,236.9331.50017.628.999
LiH[20]mp-2370320.82516.00239.3467,537.2341.2801534.630
MgO[20]mp-126523.47119.279123.8116,564.6891.4406086.060
NaBr[20]mp-2291623.12154.74914.4952,392.5211.5002.84.897
NaCl[20]mp-2286222.10546.09616.6503,123.7061.5607.16.531
NaF[20]mp-68222.69325.89421.9293,171.3921.50018.47.651
Nal[20]mp-2326823.57269.6756.8231,547.0001.5601.81.521
PbS[20]mp-2127627.33454.17426.4852,111.1022.0002.94.772
PbSe[20]mp-220127.88660.25422.6181,876.7431.50026.189
RbBr[20]mp-2286723.16486.7816.9051,651.5091.4503.81.974
RbCl[20]mp-2329522.67275.1487.3371,854.2221.4502.82.244
Rbl[20]mp-2290323.360104.9576.2281,517.8621.4102.31.814
Si[20]mp-14922.28140.88855.8285,480.8521.06016660.869
SiC[20]mp-806223.22720.635222.8799107.2120.750490438.312
SrO[20]mp-247224.87835.27747.8643,468.1521.5201219.845
TePb[20]mp-1971727.85770.75817.5111,674.9962.009 [by Equation (6)]2.52.711
ZnS[20]mp-1069523.99940.47632.7683,191.5040.7502728.268
ZnSe[20]mp-119025.06447.33831.0482,763.0630.7501924.432
ZnTe[20]mp-217625.41959.14625.3832,416.1260.9701815.097
AlN[20]mp-66143.20142.527135.5497,197.8950.700350140.931
BeO[20]mp-254242.96727.992123.3097,116.6820.750370104.885
CdS[20]mp-67244.576104.86316.8852,166.4550.750166.790
GaN[20]mp-80445.92446.943110.8434,794.5070.70021079.333
ZnO[20]mp-213345.43849.71933.3762,790.1960.7506013.479
Bi2Te3[55]mp-3420257.315181.76710.4891,339.6851.4901.61.196
Al2O3[56]mp-1143103.87387.420132.7986,501.4261.3403033.445
ZnSb[57]mp-753166.347391.75932.3872,518.3321.681 [by Equation (6)]3.52.315

Comparison of calculation time for κL

To demonstrate κL the efficiency of prediction application, PINK, we compared its computational time against other commonly used methods. Traditional approaches, such as solving the PBTE with second- and third-order force constants[60] or employing the equilibrium MD Green-Kubo, typically require several hours for simple systems and days to weeks for complex ones.

Even semi-empirical models, such as the Slack model, require time-consuming calculations or experimental data to determine the necessary input parameters, often taking several hours to complete. In contrast, PINK offers a significant advantage by predicting κL and related physical properties directly from a CIF file in just a few seconds, regardless of the material’s complexity.

While traditional methods such as PBTE and Green-Kubo can perform efficiently for single-element systems, their computational cost increases exponentially with the number of atoms in the primitive cell, especially when calculating force constants[61,62]. PINK, which leverages the CGCNN model in combination with the Slack approximation, provides a highly efficient solution. This allows for rapid pre-screening of complex binary, ternary, and quaternary systems. As a result, PINK is an invaluable tool for identifying materials with high or low κL.

High-throughput screening

The detailed workflow for high-throughput screening using empirical calculations is illustrated in Figure 7. The screening process commenced with 377,221 compounds sourced from the Materials Discovery Database. Initial predictions of κL were made for these compounds, along with bulk modulus (GPa), shear modulus (GPa), transverse and longitudinal wave sound velocities (m/s), speed of sound (m/s), Poisson’s ratio (v), Grüneisen parameter (γ), acoustic Debye temperature (θa, K), and κL (W/m·K) can be downloaded at the link: https://github.com/Jack-Liu0227/AI4Kappa/tree/master/JMI_Supporting_Information.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 7. Flowchart of the high-throughput screening process, illustrating steps from data acquisition to filtering and empirical calculations for κL prediction.

To refine the dataset, preliminary screening criteria were applied. Since thermoelectric materials are semiconductors, band gaps were restricted to the range of 0.1-3.0 eV. To ensure stability, the energy above the convex hull was limited to zero or less[38]. This initial filtering reduced the dataset to 30,199 materials. Further exclusion of materials containing radioactive elements resulted in 26,305 candidates.

Subsequently, the CGCNN model was utilized to predict shear and bulk moduli, which were then employed to estimate κL using the Slack model at 300 K. Materials with κL values below 1 W·m-1·K-1 were identified as promising candidates for thermoelectric applications. This filtering yielded 11,869 materials, documented in Nature-filtered-low-Kappa.csv.

Additionally, using the Materials Project API, 54,359 structures with band gaps between 0.1-3.0 eV and no radioactive elements were extracted. PINK was employed to predict κL, resulting in a refined dataset of 21,001 low κL materials, detailed in MP-semiconductor-low-kappa.csv.

Statistical analysis of screening results

The process of screening 11,869 materials with low κL values (κL ≤ 1 W·m-1·K-1) has been detailed in a separate CSV file (Nature-filtered-low-kappa.csv), which also includes the associated material data. Statistical results for materials that have passed this screening are shown in Figure 8. The histogram in Figure 8A shows the distribution of κL values, with the majority between 0.1 and 0.5, highlighting a promising subset of high-performance thermoelectric materials. Meanwhile, Figure 8B represents the distribution of crystal structures among these screened materials, specifying the number of space groups for cubic systems. The analysis reveals an inverse trend between symmetry and material count - fewer materials are found with higher symmetry, as in cubic systems (262 materials), while lower symmetry, such as in triclinic systems, is associated with a larger count. For cubic structures, the relevant space group numbers include 198, 205, 214, 215, 216, 217, 225, 227, 229, and 230. Low κL values are widely acknowledged as critical for improving the efficiency of thermoelectric materials in converting waste heat to electrical energy.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 8. Statistical results of 11,869 screened candidates. (A) Distribution of κL and corresponding count; (B) Distribution of crystal symmetry, with space group details for cubic symmetry shown in the inset; (C) Histogram of elemental distribution in 11,869 compounds, with electronegativity values indicated at the top of each column. The electronegativity results are as follows: Cs: 0.79, Br: 2.96, Rb: 0.82, I: 2.66, O: 3.44, Se: 2.55, F: 3.98, K: 0.82, Cl: 3.16, S: 2.58, Tb: 1.1, Na: 0.93, P: 2.19, As: 2.18, Y: 1.22, Co: 1.88, Sb: 2.05, Pr: 1.13, Te: 2.1, Dy: 1.22. The inset in the top-right corner is the counting number excluding lanthanide-containing materials.

To analyze the compositional distribution of promising thermoelectric materials, a histogram was generated to display the elemental distribution within the screened materials, as shown in Figure 8C. This diagram underscores the importance of the 20 most common elements in compounds with κL values less than 1. Cesium bromine, rubidium, and adenosine oxide emerge as the elements most frequently encountered. Additionally, elements such as oxygen (O) and selenium (Se) are also prevalent in materials with low κL values. The corresponding electronegativity values of these elements are provided at the top of each column in Figure 7C. A thorough examination of the electronegativity data reveals that elements with higher electronegativity are more likely to form stronger ionic bonds. Among the ten elements that occur most frequently, the majority exhibit electronegativity values greater than 2.5. Interestingly, elements often associated with low thermal conductivity, such as cesium and selenium, are part of this group. Moreover, fluorine, characterized by its high electronegativity, readily forms ionic compounds with alkali metals, including cesium, rubidium, potassium, and sodium.

First-principle validation

On the basis of the results from our high-throughput screening and prior experience, we observed that compounds containing heavy elements and Group VIA elements generally exhibit lower thermal conductivity. Given the structural feasibility and computational efficiency, we selected the cubic structure for our study. Consequently, Ag3Te4W and Ag3Te4Ta were chosen as validation targets. To calculate κL for a given material with a specific structure, a series of DFT calculations are performed within the volume of the primitive cell. To validate the materials screened by the PINK, including those with low κL, we have selected Ag3Te4W and Ag3Te4Ta as case studies for detailed analysis. Both crystals belong to space group 215. As illustrated in Figure 9A, X (W, Ta) atoms are tetrahedrally coordinated by four Te atoms, while Ag atoms occupy interstitial sites between neighboring tetrahedra. Phonon spectra calculations for these materials [Figure 9B and C] reveal no imaginary frequencies, confirming their dynamic stability and theoretical viability.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 9. (A) The primitive crystal structures of Ag3Te4X (X = W, Ta). Phonon dispersions for (B) Ag3Te4W and (C) Ag4Se4Ta, respectively, along the high-symmetry points which are defined as Γ (0, 0, 0), X (0, 0.5, 0), M (0.5, 0.5, 0.0), Z (0, 0, 0.5), R (0, 0.5, 0.5), A (0.5, 0.5, 0.5).

The κL values of these compounds were calculated using the 3-phonon (3ph) method. Notably, Ag3Te4X (X = W, Ta) exhibits ultralow κL, comparable to benchmark thermoelectric materials such as PbQ (Q = Te, Se) and SnSe[46,63,64]. Figure 10A compares the temperature-dependent κL of Ag3Te4X with state-of-the-art systems including SnSe, Tl9SbTe6[62], and PbQ. At 300 K, Ag3Te4W and Ag3Te4Ta demonstrate κL values of 0.267 and 0.478 W·m-1·K-1 , respectively. By comparison, PbTe, PbSe, SnSe, and Tl9SbTe6 exhibit κL values of 2.3, 2.64, 0.62, and 0.143 W·m-1·K-1 at the same temperature, respectively [Table 2]. The exceptionally low κL of Ag3Te4X positions these materials as promising candidates for thermoelectric applications.

PINK: physical-informed machine learning for lattice thermal conductivity

Figure 10. (A) κL as a function of temperature for Ag3Te4X (X = W, Ta), Tl9SbTe6[62], SnSe[46], and PbQ (Q = Te, Se)[46]. Comparison of microscopy heat transport parameters for Ag3Te4X and Tl9SbTe6[62] at 300 K; (B) Cumulative κL using 3ph methods; (C) Specific heat capacity (CV) at constant volume; (D) Squared phonon group velocities (υ2) in the harmonic approximation; (E) Weighted phonon scattering phase space of 3ph; (F) Phonon scattering rates of 3ph.

Table 2

The predicted κL values from PlNK were compared with those computed using DFT at 300 K

MaterialsnV3)G (GPa)B (GPa)υs (m·s-1)γκPINK (W·m-1·K-1)κDFT (W·m-1·K-1)
PbTe[46]269.989124381,927.3182.180[63]3.59142.300 (Experiment)
PbSe[46]259.054427242,035.7932.6602.4932.640 (Experiment)
SnSe[64]8226.25715241,783.1332.3000.6810.680 (Experiment)
Ag3Te4Ta8250.95113.77645.1333,074.9502.2310.6280.478 (DFT)
Ag3Te4W8248.11112.40642.3162,939.4002.2760.5070.267 (DFT)

Acoustic phonons typically serve as the dominant contributors to thermal transport in materials. As illustrated in Figures 9B and C and Figure 10B, acoustic phonon branches predominantly occupy low-frequency regimes, with low-frequency acoustic modes dominating the contribution to κL. To unravel the microscopic mechanisms underlying the ultralow κL, we systematically examined key parameters governing thermal conductivity - including heat capacity, phonon group velocities, weighted phase space, and scattering rates - for Ag3Te4X (X = W, Ta) and Tl9SbTe6[62]. These analyses, presented in Figure 10B-F, provide critical insights into the interplay of phonon dynamics and thermal transport.

The specific heat capacity (CV) of solids at elevated temperatures approximates the Dulong–Petit limit, defined as 3NkB (NA/M), where N denotes the number of atoms per formula unit, kB is the Boltzmann constant, NA the Avogadro constant, and M the molar mass. DFT calculations yield CV values of 0.196, 0.197, and 0.146 J·g-1·K-1 for Ag3Te4W, Ag3Te4Ta, and Tl9SbTe6, respectively, closely aligning with Dulong–Petit predictions. Notably, the CV values for Ag3Te4X (X = W, Ta) exceed those of Tl9SbTe6 and marginally surpass the 0.156 J·g-1·K-1 reported for PbTe[65].

Given the proportionality κLυ2, we analyzed the frequency-dependent squared phonon group velocities (υ2) of Ag3Te4X (X = W, Ta) and Tl9SbTe6, as illustrated in Figure 10D. Within the frequency range dominant for κL, Ag3Te4Ta displays the highest υ2 (2 km2·s-2), an order of magnitude lower than PbTe’s reported υ2 of 14 km2·s-2 (κL = 2 W·m-1·K-1 at 300 K)[66]. Ag3Te4W exhibits intermediate values, while Tl9SbTe6 shows the lowest υ2.

To elucidate phonon scattering mechanisms, we calculated the weighted phonon scattering phase space (W3), which quantifies available phonon-phonon interaction pathways. As shown in Figure 10E, Ag3Te4Ta has the smallest W3, contrasting sharply with the largest W3 observed in Tl9SbTe6. Similarly, phonon scattering rates [Figure 10F] are significantly higher for Tl9SbTe6 than for Ag3Te4Ta. These results collectively underpin the ultralow κL of Ag3Te4X (X = W, Ta).

Discussions

In this work, we present PINK, a high-throughput computational framework designed to enhance the prediction of κL across diverse materials. Building on this platform, several strategic directions emerge for future refinement and application. First, integrating PINK with experimental databases and materials informatics platforms could accelerate the discovery of novel materials for thermoelectrics, thermal management in microelectronics, and energy conversion systems. Coupling high-throughput predictions[67] with experimental validation would enable rapid identification of high-performance materials, narrowing the gap between computational insights and functional material synthesis. Additionally, synergizing PINK with tools such as BoltzTraP[68] and TransOpt[69] could enable concurrent optimization of thermal and electrical transport properties in semiconductors.

A second critical opportunity lies in the precise engineering of multifunctional materials. By investigating the interaction between κL, mechanical properties (e.g., strength, elasticity), and environmental stability, researchers could design materials that simultaneously achieve thermal, mechanical, and operational demands in sectors such as aerospace, renewable energy, and advanced electronics. Such integration of properties would advance applications requiring both efficient heat regulation and structural resilience.

CONCLUSIONS

We have developed a high-throughput framework, packaged as an application named PINK, designed to rapidly predict the κL of materials based on the CIF files. The material space for κL was expanded significantly, increasing by an order of magnitude, through predictions for 377,221 newly reported materials[38]. Through high-throughput screening, several materials with ultralow κL were identified, and their predictions were validated using first-principles calculations.

Although first-principle calculations of κL require significant computational resources, especially for phonon spectra and third-order force constant matrices, there are existing databases related to phonons and κL. For instance, Togo developed an automated workflow interfaced with Phonopy, which calculated phonon spectra, density of states, entropy, and heat capacity for over 11,000 materials, creating a phonon database available at https://github.com/atztogo/phonondb/blob/main/mdr/phonondb/README.md. AFLOW[53], a comprehensive database, includes thermal property data for 5,664 materials, though this represents only a small fraction of the total material space. PINK addresses this gap by extending κL predictions to hundreds of thousands of materials, with accuracy contingent on the performance of its embedded CGCNN for elastic modulus prediction.

To enhance prediction accuracy, future advancements could integrate advanced crystal graph convolutional networks. Examples include the orbital graph convolutional neural network (OGCNN), which considers orbital roles[70]; the materials graph network (MEGNet), incorporating outfield information[71]; the geometric-information-enhanced crystal graph neural network (GeoCGNN), which integrates topological and geometric structure data[72]; and the atomistic line graph neural network (ALIGNN), which includes bond angle details[73]. Other notable approaches include the graph-attention graph neural network (GATGNN), utilizing attention mechanisms[74], and the scalable global graph attention neural network model DeeperGATGNN, featuring differentiable group normalization (DGN) and skip connections[75]. Moreover, powerful descriptors such as SOAP[76] and Voronoi tessellations[77] could be employed to further elucidate the link between crystal structures and material properties. Ruff et al. introduced a connection-optimized crystal graph network (coGN/coNGN), leveraging message passing and line graph templates[78]. Their model demonstrated exceptional performance on the MatBench benchmark dataset[52], outperforming other models and establishing itself as the leading general-purpose model in the benchmark.

DECLARATIONS

Authors’ contributions

Conceptualization, methodology, software, data curation, visualization, writing-original draft preparation: Liu, Y.

Writing-review and editing, supervision, project administration, funding acquisition: Gao, Z.

Performed data analysis and interpretation: Wang, X.; Hao, Y.; Li, X.

Investigation, discussion: Ding, X.; Lookman, T.; Sun, J.

Availability of data and materials

All data are available at https://github.com/JackLiu0227/AI4Kappa/tree/master/JMI_Supporting_Information.

Financial support and sponsorship

We acknowledge the support from the National Natural Science Foundation of China (No.12104356 and No.52250191). This work is sponsored by the Key Research and Development Program of the Ministry of Science and Technology (No.2023YFB4604100). We also acknowledge the support of the HPC Platform, Xi’an Jiaotong University.

Conflicts of interest

Ding, X. is an Associate Editor of Journal of Materials Informatics. He was not involved in any steps of the editorial process, including reviewer selection, manuscript handling, or decision-making. The other authors declare that there are no conflicts of interest.

Ethical approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Copyright

© The Author(s) 2025.

Supplementary Materials

REFERENCES

1. Li, N.; Ren, J.; Wang, L.; Zhang, G.; Hänggi, P.; Li, B. Colloquium: Phononics: manipulating heat flow with electronic analogs and beyond. Rev. Mod. Phys. 2012, 84, 1045.

2. van Erp R, Soleimanzadeh R, Nela L, Kampitsis G, Matioli E. Co-designing electronics with microfluidics for more sustainable cooling. Nature 2020, 585, 211-6.

3. Kim, H. S.; Liu, W.; Chen, G.; Chu, C. W.; Ren, Z. Relationship between thermoelectric figure of merit and energy conversion efficiency. Proc. Natl. Acad. Sci. U. S. A. 2015, 112, 8205-10.

4. Ouyang, H.; Gu, Y.; Gao, Z.; et al. Kirigami-inspired thermal regulator. Phys. Rev. Appl. 2023, 19, L011001.

5. Kang, J. S.; Li, M.; Wu, H.; Nguyen, H.; Aoki, T.; Hu, Y. Integration of boron arsenide cooling substrates into gallium nitride devices. Nat. Electron. 2021, 4, 416-23.

6. Xu, L.; Wang, X.; Wang, Y.; Gao, Z.; Ding, X.; Xiao, Y. Enhanced average power factor and ZT value in PbSe thermoelectric material with dual interstitial doping. Energy. Environ. Sci. 2024, 17, 2018-27.

7. Broido, D. A.; Malorny, M.; Birner, G.; Mingo, N.; Stewart, D. A. Intrinsic lattice thermal conductivity of semiconductors from first principles. Appl. Phys. Lett. 2007, 91, 231922.

8. Stackhouse, S.; Stixrude, L. Theoretical methods for calculating the lattice thermal conductivity of minerals. Rev. Mineral. Geochem. 2010, 71, 253-69.

9. Tadano, T.; Tsuneyuki, S. Self-consistent phonon calculations of lattice dynamical properties in cubic SrTiO3 with first-principles anharmonic force constants. Phys. Rev. B. 2015, 92, 054301.

10. Carbogno, C.; Ramprasad, R.; Scheffler, M. Ab initio green-kubo approach for the thermal conductivity of solids. Phys. Rev. Lett. 2017, 118, 175901.

11. Fan, Z.; Pereira, L. F. C.; Wang, H.; Zheng, J.; Donadio, D.; Harju, A. Force and heat current formulas for many-body potentials in molecular dynamics simulations with applications to thermal conductivity calculations. Phys. Rev. B. 2015, 92, 094301.

12. Yusuf, A.; Ballikaya, S. Electrical, thermomechanical and cost analyses of a low-cost thermoelectric generator. Energy 2022, 241, 122934.

13. Luo, Y.; Li, M.; Yuan, H.; Liu, H.; Fang, Y. Predicting lattice thermal conductivity via machine learning: a mini review. npj. Comput. Mater. 2023, 9, 964.

14. Mishin, Y. Machine-learning interatomic potentials for materials science. Acta. Mater. 2021, 214, 116980.

15. Liu, C.; Wu, C.; Zhao, Y.; et al. Actively and reversibly controlling thermal conductivity in solid materials. Phys. Rep. 2024, 1058, 1-32.

16. Shi, X. L.; Wang, L.; Lyu, W.; et al. Advancing flexible thermoelectrics for integrated electronics. Chem. Soc. Rev. 2024, 53, 9254-305.

17. Callaway, J. Model for lattice thermal conductivity at low temperatures. Phys. Rev. 1959, 113, 1046-51.

18. Zhang, Y. First-principles Debye–Callaway approach to lattice thermal conductivity. J. Materiomics. 2016, 2, 237-47.

19. Slack, G. Nonmetallic crystals with high thermal conductivity. J. Phys. Chem. Solids. 1973, 34, 321-35.

20. Morelli, D. T.; Slack, G. A. High lattice thermal conductivity solids. In: Shindé SL, Goela JS, editors. High thermal conductivity materials. New York: Springer-Verlag; 2006. pp. 37-68.

21. Nielsen, M. D.; Ozolins, V.; Heremans, J. P. Lone pair electrons minimize lattice thermal conductivity. Energy. Environ. Sci. 2013, 6, 570-8.

22. Chang, C.; Zhao, L. Anharmoncity and low thermal conductivity in thermoelectrics. Mater. Today. Phys. 2018, 4, 50-7.

23. Virkar, A. V.; Cutler, R. A. Fabrication of high-thermal-conductivity polycrystalline aluminum nitride: thermodynamic and kinetic aspects of oxygen removal. In: Shindé SL, Goela JS, editors. High thermal conductivity materials. New York: Springer-Verlag; 2006. pp. 143-66.

24. Chen, S.; Chen, Y.; Yan, W.; Gao, T. First-principles investigation of elastic anisotropy and thermal transport property of transition metal monosilicides CrSi, TiSi, and ZrSi under pressure. Mater. Today. Commun. 2024, 39, 108958.

25. Cao, Y.; Dai, S.; Wang, X.; et al. High-throughput screening of potentially ductile and low thermal conductivity ABX3 (X = S, Se, Te) thermoelectric perovskites. Appl. Phys. Lett. 2024, 124, 092101.

26. Li, R.; Li, X.; Xi, L.; Yang, J.; Singh, D. J.; Zhang, W. High-throughput screening for advanced thermoelectric materials: diamond-like ABX2 compounds. ACS. Appl. Mater. Interfaces. 2019, 11, 24859-66.

27. Qin, G.; Huang, A.; Liu, Y.; et al. High-throughput computational evaluation of lattice thermal conductivity using an optimized Slack model. Mater. Adv. 2022, 3, 6826-30.

28. Cao, Y.; Zhu, Z.; Li, X.; et al. Unraveling the relationships between chemical bonding and thermoelectric properties: n-type ABO3 perovskites. J. Mater. Chem. A. 2022, 10, 11039-45.

29. Wang, X.; Shu, G.; Zhu, G.; et al. An interpretable formula for lattice thermal conductivity of crystals. Mater. Today. Phys. 2024, 48, 101549.

30. Anderson, O. L. A simplified method for calculating the debye temperature from elastic constants. J. Phys. Chem. Solids. 1963, 24, 909-17.

31. Belomestnykh, V. N. The acoustical Grüneisen constants of solids. Tech. Phys. Lett. 2004, 30, 91-3.

32. Ju, S.; Yoshida, R.; Liu, C.; et al. Exploring diamondlike lattice thermal conductivity crystals via feature-based transfer learning. Phys. Rev. Mater. 2021, 5, 053801.

33. Xie, T.; Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 2018, 120, 145301.

34. Zhu, T.; He, R.; Gong, S.; et al. Charting lattice thermal conductivity for inorganic crystals and discovering rare earth chalcogenides for thermoelectrics. Energy. Environ. Sci. 2021, 14, 3559-66.

35. Vaitesswar, U. S.; Bash, D.; Huang, T.; et al. Machine learning based feature engineering for thermoelectric materials by design. Digit. Discov. 2024, 3, 210-20.

36. Omee, S. S.; Fu, N.; Dong, R.; Hu, M.; Hu, J. Structure-based out-of-distribution (OOD) materials property prediction: a benchmark study. npj. Comput. Mater. 2024, 10, 1316.

37. Meredig, B.; Antono, E.; Church, C.; et al. Can machine learning identify the next high-temperature superconductor? Examining extrapolation performance for materials discovery. Mol. Syst. Des. Eng. 2018, 3, 819-25.

38. Merchant, A.; Batzner, S.; Schoenholz, S. S.; Aykol, M.; Cheon, G.; Cubuk, E. D. Scaling deep learning for materials discovery. Nature 2023, 624, 80-5.

39. Griesemer, S. D.; Xia, Y.; Wolverton, C. Accelerating the prediction of stable materials with machine learning. Nat. Comput. Sci. 2023, 3, 934-45.

40. Wang, X.; Gao, Z.; Zhu, G.; et al. Role of high-order anharmonicity and off-diagonal terms in thermal conductivity: a case study of multiphase CsPbBr3. Phys. Rev. B. 2023, 107, 214308.

41. Herring, C. Role of low-energy phonons in thermal conduction. Phys. Rev. 1954, 95, 954-65.

42. Jia, T.; Chen, G.; Zhang, Y. Lattice thermal conductivity evaluated using elastic properties. Phys. Rev. B. 2017, 95, 155206.

43. Knoop, F.; Purcell, T. A. R.; Scheffler, M.; Carbogno, C. Anharmonicity measure for materials. Phys. Rev. Mater. 2020, 4, 083809.

44. Belomestnykh, V. N.; Tesleva, E. P. Interrelation between anharmonicity and lateral strain in quasi-isotropic polycrystalline solids. Tech. Phys. 2004, 49, 1098-100.

45. Chung, D. H.; Buessem, W. R. The Voigt-Reuss-Hill approximation and elastic moduli of polycrystalline MgO, CaF2, β-ZnS, ZnSe, and CdTe. J. Appl. Phys. 1967, 38, 2535-40.

46. Xiao, Y.; Chang, C.; Pei, Y.; et al. Origin of low thermal conductivity in SnSe. Phys. Rev. B. 2016, 94, 125203.

47. Kresse, G.; Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B. Condens. Matter. 1996, 54, 11169-86.

48. Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B. Condens. Matter. 1994, 50, 17953-79.

49. Kresse, G.; Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B. 1999, 59, 1758-75.

50. Perdew, J. P.; Burke, K.; Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 1996, 77, 3865-8.

51. Li, W.; Carrete, J.; A. Katcho N, Mingo N. ShengBTE: A solver of the Boltzmann transport equation for phonons. Comput. Phys. Commun. 2014, 185, 1747-58.

52. Dunn, A.; Wang, Q.; Ganose, A.; Dopp, D.; Jain, A. Benchmarking materials property prediction methods: the Matbench test set and Automatminer reference algorithm. npj. Comput. Mater. 2020, 6, 406.

53. AFLOW. https://aflowlib.org/. (accessed 2025-02-17).

54. Beasley, J. D. Thermal conductivities of some novel nonlinear optical materials. Appl. Opt. 1994, 33, 1000-3.

55. Toberer, E. S.; Zevalkink, A.; Snyder, G. J. Phonon engineering through crystal chemistry. J. Mater. Chem. 2011, 21, 15843.

56. Slack, G. A. Thermal conductivity of MgO, Al2O3, MgAl2O4, and Fe3O4 crystals from 3° to 300° K. Phys. Rev. 1962, 126, 427-41.

57. Bjerg, L.; Iversen, B. B.; Madsen, G. K. H. Modeling the thermal conductivities of the zinc antimonides ZnSb and Zn4Sb3. Phys. Rev. B. 2014, 89, 024304.

58. Toher, C.; Oses, C.; Plata, J. J.; et al. Combining the AFLOW GIBBS and elastic libraries to efficiently and robustly screen thermomechanical properties of solids. Phys. Rev. Mater. 2017, 1, 015401.

59. Curtarolo, S.; Setyawan, W.; Hart, G. L.; et al. AFLOW: an automatic framework for high-throughput materials discovery. Comput. Mater. Sci. 2012, 58, 218-26.

60. Wang, Y.; Gao, Z.; Wang, X.; et al. Anomalous thermal conductivity in 2D silica nanocages of immobilizing noble gas atom. Appl. Phys. Lett. 2024, 124, 122205.

61. Zhou, F.; Nielson, W.; Xia, Y.; Ozoliņš, V. Lattice anharmonicity and thermal conductivity from compressive sensing of first-principles calculations. Phys. Rev. Lett. 2014, 113, 185501.

62. Hao, Y.; Zuo, Y.; Zheng, J.; et al. Machine learning for predicting ultralow thermal conductivity and high ZT in complex thermoelectric materials. ACS. Appl. Mater. Interfaces. 2024, 16, 47866-78.

63. Zhang, Y.; Ke, X.; Chen, C.; Yang, J.; Kent, P. R. C. Thermodynamic properties of PbTe, PbSe, and PbS: first-principles study. Phys. Rev. B. 2009, 80, 024304.

64. Zhao, L. D.; Lo, S. H.; Zhang, Y.; et al. Ultralow thermal conductivity and high thermoelectric figure of merit in SnSe crystals. Nature 2014, 508, 373-7.

65. Pashinkin, A. S.; Mikhailova, M. S.; Malkova, A. S.; Fedorov, V. A. Heat capacity and thermodynamic properties of lead selenide and lead telluride. Inorg. Mater. 2009, 45, 1226-9.

66. Tian, Z.; Garg, J.; Esfarjani, K.; Shiga, T.; Shiomi, J.; Chen, G. Phonon conduction in PbSe, PbTe, and PbTe1-xSex from first-principles calculations. Phys. Rev. B. 2012, 85, 184303.

67. Zhou, Z.; Cao, G.; Liu, J.; Liu, H. High-throughput prediction of the carrier relaxation time via data-driven descriptor. npj. Comput. Mater. 2020, 6, 417.

68. Madsen, G. K.; Singh, D. J. BoltzTraP. A code for calculating band-structure dependent quantities. Comput. Phys. Commun. 2006, 175, 67-71.

69. Li, X.; Zhang, Z.; Xi, J.; et al. TransOpt. A code to solve electrical transport properties of semiconductors in constant electron–phonon coupling approximation. Comput. Mater. Sci. 2021, 186, 110074.

70. Karamad, M.; Magar, R.; Shi, Y.; Siahrostami, S.; Gates, I. D.; Barati, F. A. Orbital graph convolutional neural network for material property prediction. Phys. Rev. Mater. 2020, 4, 093801.

71. Chen, C.; Ye, W.; Zuo, Y.; Zheng, C.; Ong, S. P. Graph networks as a universal machine learning framework for molecules and crystals. Chem. Mater. 2019, 31, 3564-72.

72. Cheng, J.; Zhang, C.; Dong, L. A geometric-information-enhanced crystal graph network for predicting properties of materials. Commun. Mater. 2021, 2, 194.

73. Choudhary, K.; Decost, B. Atomistic line graph neural network for improved materials property predictions. npj. Comput. Mater. 2021, 7, 650.

74. Louis, S. Y.; Zhao, Y.; Nasiri, A.; et al. Graph convolutional neural networks with global attention for improved materials property prediction. Phys. Chem. Chem. Phys. 2020, 22, 18141-8.

75. Omee, S. S.; Louis, S. Y.; Fu, N.; et al. Scalable deeper graph neural networks for high-performance materials property prediction. Patterns 2022, 3, 100491.

76. Isayev, O.; Oses, C.; Toher, C.; Gossett, E.; Curtarolo, S.; Tropsha, A. Universal fragment descriptors for predicting properties of inorganic crystals. Nat. Commun. 2017, 8, 15679.

77. Ward, L.; Liu, R.; Krishna, A.; et al. Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations. Phys. Rev. B. 2017, 96, 024104.

78. Ruff, R.; Reiser, P.; Stühmer, J.; Friederich, P. Connectivity optimized nested line graph networks for crystal structures. Digit. Discov. 2024, 3, 594-601.

Cite This Article

Research Article
Open Access
PINK: physical-informed machine learning for lattice thermal conductivity
Yujie Liu, ... Zhibin GaoZhibin Gao

How to Cite

Liu, Y.; Wang, X.; Hao, Y.; Li, X.; Sun, J.; Lookman, T.; Ding, X.; Gao, Z. PINK: physical-informed machine learning for lattice thermal conductivity. J. Mater. Inf. 2025, 5, 12. http://dx.doi.org/10.20517/jmi.2024.86

Download Citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click on download.

Export Citation File:

Type of Import

Tips on Downloading Citation

This feature enables you to download the bibliographic information (also called citation data, header data, or metadata) for the articles on our site.

Citation Manager File Format

Use the radio buttons to choose how to format the bibliographic data you're harvesting. Several citation manager formats are available, including EndNote and BibTex.

Type of Import

If you have citation management software installed on your computer your Web browser should be able to import metadata directly into your reference database.

Direct Import: When the Direct Import option is selected (the default state), a dialogue box will give you the option to Save or Open the downloaded citation data. Choosing Open will either launch your citation manager or give you a choice of applications with which to use the metadata. The Save option saves the file locally for later use.

Indirect Import: When the Indirect Import option is selected, the metadata is displayed and may be copied and pasted as needed.

About This Article

Special Issue

This article belongs to the Special Issue Machine Learning for Thermoelectric Materials
© The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, sharing, adaptation, distribution and reproduction in any medium or format, for any purpose, even commercially, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Data & Comments

Data

Views
206
Downloads
32
Citations
0
Comments
0
1

Comments

Comments must be written in English. Spam, offensive content, impersonation, and private information will not be permitted. If any comment is reported and identified as inappropriate content by OAE staff, the comment will be removed without notice. If you have any queries or need any help, please contact us at [email protected].

0
Download PDF
Share This Article
Scan the QR code for reading!
See Updates
Contents
Figures
Related
Journal of Materials Informatics
ISSN 2770-372X (Online)
Follow Us

Portico

All published articles are preserved here permanently:

https://www.portico.org/publishers/oae/

Portico

All published articles are preserved here permanently:

https://www.portico.org/publishers/oae/