Related papers: MatChat: A Large Language Model and Application Service Platform for Materials Science

MatChat: A Large Language Model and Application Service Platform for Materials Science

URL: http://arxiv.org/abs/2310.07197v1
Date: Wed, 11 Oct 2023 05:11:46 GMT
Title: MatChat: A Large Language Model and Application Service Platform for Materials Science
Authors: Ziyi Chen, Fankai Xie, Meng Wan, Yang Yuan, Miao Liu, Zongguo Wang, Sheng Meng, Yangang Wang
Abstract summary: We harness the power of the LLaMA2-7B model and enhance it through a learning process that incorporates 13,878 pieces of structured material knowledge data. This specialized AI model, named MatChat, focuses on predicting inorganic material synthesis pathways. MatChat is now accessible online and open for use, with both the model and its application framework available as open source.
Score: 18.55541324347915
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The prediction of chemical synthesis pathways plays a pivotal role in materials science research. Challenges, such as the complexity of synthesis pathways and the lack of comprehensive datasets, currently hinder our ability to predict these chemical processes accurately. However, recent advancements in generative artificial intelligence (GAI), including automated text generation and question-answering systems, coupled with fine-tuning techniques, have facilitated the deployment of large-scale AI models tailored to specific domains. In this study, we harness the power of the LLaMA2-7B model and enhance it through a learning process that incorporates 13,878 pieces of structured material knowledge data. This specialized AI model, named MatChat, focuses on predicting inorganic material synthesis pathways. MatChat exhibits remarkable proficiency in generating and reasoning with knowledge in materials science. Although MatChat requires further refinement to meet the diverse material design needs, this research undeniably highlights its impressive reasoning capabilities and innovative potential in the field of materials science. MatChat is now accessible online and open for use, with both the model and its application framework available as open source. This study establishes a robust foundation for collaborative innovation in the integration of generative AI in materials science.

Related papers

Artificial Intelligence and Generative Models for Materials Discovery -- A Review [0.0]
Review aims to discuss different principles of AI-driven generative models that are applicable for materials discovery.<n>We will also highlight specific applications of generative models in designing new catalysts, semiconductors, polymers, or crystals.
arXiv Detail & Related papers (2025-08-05T09:56:27Z)
Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization [47.97016882216093]
Large language models (LLMs) leverage chain-of-thought (CoT) techniques to tackle complex problems.<n>We introduce ChatBattery, a novel agentic framework that integrates domain knowledge to steer LLMs toward more effective reasoning in materials design.<n>We successfully identify, synthesize, and characterize three novel lithium-ion battery cathode materials, which achieve practical capacity improvements of 28.8%, 25.2%, and 18.5%, respectively.
arXiv Detail & Related papers (2025-07-21T23:46:11Z)
Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey [54.40267149907223]
Materials are the foundation of modern society, underpinning advancements in energy, electronics, healthcare, transportation, and infrastructure.<n>The ability to discover and design new materials with tailored properties is critical to solving some of the most pressing global challenges.<n>Data-driven generative models provide a powerful tool for materials design by directly create novel materials that satisfy predefined property requirements.
arXiv Detail & Related papers (2025-05-22T08:33:21Z)
Emerging Microelectronic Materials by Design: Navigating Combinatorial Design Space with Scarce and Dispersed Data [42.45821602529994]
Computational modeling and machine learning methods are employed for the design of materials. Physical mechanisms, cost of first-principles calculations, and the dispersity of data pose challenges to both physics-based and data-driven materials modeling. We propose a framework that integrates data-driven and physics-based methods to address these challenges and accelerate materials design.
arXiv Detail & Related papers (2024-12-23T05:06:19Z)
Polymetis:Large Language Modeling for Multiple Material Domains [11.396295878658924]
This paper proposes a large language model Polymetis model for a variety of materials fields. The model uses a dataset of about 2 million material knowledge instructions, and in the process of building the dataset, we developed the Intelligent Extraction Large Model. We inject this data into the GLM4-9B model for learning to enhance its inference capabilities in a variety of material domains.
arXiv Detail & Related papers (2024-11-13T16:10:14Z)
TopoChat: Enhancing Topological Materials Retrieval With Large Language Model and Multi-Source Knowledge [4.654635844923322]
Large language models (LLMs) have demonstrated impressive performance in the text generation task. We develop a specialized dialogue system for topological materials called TopoChat. TopoChat exhibits superior performance in structural and property querying, material recommendation, and complex relational reasoning.
arXiv Detail & Related papers (2024-09-10T06:01:16Z)
An Autonomous Large Language Model Agent for Chemical Literature Data Mining [60.85177362167166]
We introduce an end-to-end AI agent framework capable of high-fidelity extraction from extensive chemical literature. Our framework's efficacy is evaluated using accuracy, recall, and F1 score of reaction condition data.
arXiv Detail & Related papers (2024-02-20T13:21:46Z)
Agent-based Learning of Materials Datasets from Scientific Literature [0.0]
We develop a chemist AI agent, powered by large language models (LLMs), to create structured datasets from natural language text. Our chemist AI agent, Eunomia, can plan and execute actions by leveraging the existing knowledge from decades of scientific research articles.
arXiv Detail & Related papers (2023-12-18T20:29:58Z)
Large Language Models for Scientific Synthesis, Inference and Explanation [56.41963802804953]
We show how large language models can perform scientific synthesis, inference, and explanation. We show that the large language model can augment this "knowledge" by synthesizing from the scientific literature. This approach has the further advantage that the large language model can explain the machine learning system's predictions.
arXiv Detail & Related papers (2023-10-12T02:17:59Z)
AI-Generated Images as Data Source: The Dawn of Synthetic Era [61.879821573066216]
generative AI has unlocked the potential to create synthetic images that closely resemble real-world photographs. This paper explores the innovative concept of harnessing these AI-generated images as new data sources. In contrast to real data, AI-generated data exhibit remarkable advantages, including unmatched abundance and scalability.
arXiv Detail & Related papers (2023-10-03T06:55:19Z)
Artificial Intelligence in Concrete Materials: A Scientometric View [77.34726150561087]
This chapter aims to uncover the main research interests and knowledge structure of the existing literature on AI for concrete materials. To begin with, a total of 389 journal articles published from 1990 to 2020 were retrieved from the Web of Science. Scientometric tools such as keyword co-occurrence analysis and documentation co-citation analysis were adopted to quantify features and characteristics of the research field.
arXiv Detail & Related papers (2022-09-17T18:24:56Z)
Artificial Intelligence in Material Engineering: A review on applications of AI in Material Engineering [0.0]
High-performance computing has made it possible to test deep learning (DL) models with significant parameters. generative adversarial networks (GANs) have facilitated the generation of chemical compositions of inorganic materials. The use of AI to analyze the results from existing analytical instruments is also discussed.
arXiv Detail & Related papers (2022-09-15T04:21:07Z)
Graph neural networks for materials science and chemistry [2.2479652717640657]
Graph neural networks (GNNs) are one of the fastest growing classes of machine learning models. GNNs directly work on a graph or structural representation of molecules and materials. This review article provides an overview of the basic principles of GNNs, widely used datasets, and state-of-the-art architectures.
arXiv Detail & Related papers (2022-08-05T13:38:34Z)
Simulating Quantum Materials with Digital Quantum Computers [55.41644538483948]
Digital quantum computers (DQCs) can efficiently perform quantum simulations that are otherwise intractable on classical computers. The aim of this review is to provide a summary of progress made towards achieving physical quantum advantage.
arXiv Detail & Related papers (2021-01-21T20:10:38Z)
Machine Learning in Nano-Scale Biomedical Engineering [77.75587007080894]
We review the existing research regarding the use of machine learning in nano-scale biomedical engineering. The main challenges that can be formulated as ML problems are classified into the three main categories. For each of the presented methodologies, special emphasis is given to its principles, applications, and limitations.
arXiv Detail & Related papers (2020-08-05T15:45:54Z)

This list is automatically generated from the titles and abstracts of the papers in this site.