3990839

Integrating generative AI with computational chemistry for catalyst design in biofuel/bioproduct applications

Date
March 18, 2024
Explore related products in the following collection:

Catalysts play an ubiquitous role in producing renewable fuels and chemicals to achieve world-wide NetZero goals. However, effective catalyst design requires significant effort on literature review and deep knowledge of catalysis fundamentals. With the advent of AI for scientific discovery, particularly large language models (LLMs), new possibilities exist for using learned language representations to augment decision making during the catalyst design process. However, issues dealing with uncertainty, such as LLM hallucinations and lack of explainability, can curtail the LLMs’ utility.

Our work focuses on integrating computational chemistry approaches, i.e., density functional theory (DFT), in the generative LLM to identify the suitable catalytic descriptor along with the corresponding catalysts. Identification of these catalytic descriptors provides a way to develop models of surface reactivity and helps to connect the most important atomistic level properties of the catalyst that govern the macroscopic catalytic activity. ChemReasoner, our proposed system, provides an ability to intelligently search the scientific literature through LLM knowledge representation for the optimal set of descriptors via feedback obtained from DFT-simulation guided ML models.

In this effort, we developed a heuristic tree search to augment catalyst discovery by prompting the LLM with various properties and descriptors. Compared with the baseline single-prompt LLM output, which mostly predicted monometallic catalysts, our LLM model provides access to various novel catalyst structures (e.g. metal alloys) with favorable adsorption energy to facilitate the target reaction. The model also provides more specific and scientifically viable explanations for the catalyst choice (i.e. less LLM hallucinations). We enable these improvements via integration of a surrogate graph neural network (GNN) trained on density functional theory (DFT) calculations (Open Catalyst Project) to verify the quality of LLM predicted catalysts using calculated adsorption energies, which boosts the validity of our methodology.

Presenter

Speaker Image for Mariefel Olarte
Pacific Northwest National Laboratory

Related Products

Thumbnail for Theoretical study of C11H8 isomers
Theoretical study of C11H8 isomers
C11H8 is a prototypical elemental composition for organic chemists as it encompasses a diverge set of structural isomers. These isomers are considered to be potential intermediates for the formation of Polycyclic Aromatic Hydrocarbons (PAHs) in the interstellar medium (ISM)…
Thumbnail for Unveiling the mechanistic pathways of the Chan-Lam C-N arylation reaction: A computational exploration
Unveiling the mechanistic pathways of the Chan-Lam C-N arylation reaction: A computational exploration
The copper-catalyzed Chan-Lam (CL) reaction has emerged as a powerful tool for the synthesis of diverse and valuable heteroatom-containing compounds…
Thumbnail for Power-to-X: Sustainable aviation fuels via methanol intermediate
Power-to-X: Sustainable aviation fuels via methanol intermediate
Leveraging Power-to-X (P2X) technologies presents a promising pathway for decarbonizing the aviation sector…