The accreditors of this session require that you periodically check in to verify that you are still attentive.
Please click the button below to indicate that you are.
3990839
Integrating generative AI with computational chemistry for catalyst design in biofuel/bioproduct applications
Date
March 18, 2024
Explore related products in the following collection:
Catalysts play an ubiquitous role in producing renewable fuels and chemicals to achieve world-wide NetZero goals. However, effective catalyst design requires significant effort on literature review and deep knowledge of catalysis fundamentals. With the advent of AI for scientific discovery, particularly large language models (LLMs), new possibilities exist for using learned language representations to augment decision making during the catalyst design process. However, issues dealing with uncertainty, such as LLM hallucinations and lack of explainability, can curtail the LLMs’ utility.
Our work focuses on integrating computational chemistry approaches, i.e., density functional theory (DFT), in the generative LLM to identify the suitable catalytic descriptor along with the corresponding catalysts. Identification of these catalytic descriptors provides a way to develop models of surface reactivity and helps to connect the most important atomistic level properties of the catalyst that govern the macroscopic catalytic activity. ChemReasoner, our proposed system, provides an ability to intelligently search the scientific literature through LLM knowledge representation for the optimal set of descriptors via feedback obtained from DFT-simulation guided ML models.
In this effort, we developed a heuristic tree search to augment catalyst discovery by prompting the LLM with various properties and descriptors. Compared with the baseline single-prompt LLM output, which mostly predicted monometallic catalysts, our LLM model provides access to various novel catalyst structures (e.g. metal alloys) with favorable adsorption energy to facilitate the target reaction. The model also provides more specific and scientifically viable explanations for the catalyst choice (i.e. less LLM hallucinations). We enable these improvements via integration of a surrogate graph neural network (GNN) trained on density functional theory (DFT) calculations (Open Catalyst Project) to verify the quality of LLM predicted catalysts using calculated adsorption energies, which boosts the validity of our methodology.
C11H8 is a prototypical elemental composition for organic chemists as it encompasses a diverge set of structural isomers. These isomers are considered to be potential intermediates for the formation of Polycyclic Aromatic Hydrocarbons (PAHs) in the interstellar medium (ISM)…