MatChat: A Large Language Model and Application Service Platform for
Materials Science
- URL: http://arxiv.org/abs/2310.07197v1
- Date: Wed, 11 Oct 2023 05:11:46 GMT
- Title: MatChat: A Large Language Model and Application Service Platform for
Materials Science
- Authors: Ziyi Chen, Fankai Xie, Meng Wan, Yang Yuan, Miao Liu, Zongguo Wang,
Sheng Meng, Yangang Wang
- Abstract summary: We harness the power of the LLaMA2-7B model and enhance it through a learning process that incorporates 13,878 pieces of structured material knowledge data.
This specialized AI model, named MatChat, focuses on predicting inorganic material synthesis pathways.
MatChat is now accessible online and open for use, with both the model and its application framework available as open source.
- Score: 18.55541324347915
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The prediction of chemical synthesis pathways plays a pivotal role in
materials science research. Challenges, such as the complexity of synthesis
pathways and the lack of comprehensive datasets, currently hinder our ability
to predict these chemical processes accurately. However, recent advancements in
generative artificial intelligence (GAI), including automated text generation
and question-answering systems, coupled with fine-tuning techniques, have
facilitated the deployment of large-scale AI models tailored to specific
domains. In this study, we harness the power of the LLaMA2-7B model and enhance
it through a learning process that incorporates 13,878 pieces of structured
material knowledge data. This specialized AI model, named MatChat, focuses on
predicting inorganic material synthesis pathways. MatChat exhibits remarkable
proficiency in generating and reasoning with knowledge in materials science.
Although MatChat requires further refinement to meet the diverse material
design needs, this research undeniably highlights its impressive reasoning
capabilities and innovative potential in the field of materials science.
MatChat is now accessible online and open for use, with both the model and its
application framework available as open source. This study establishes a robust
foundation for collaborative innovation in the integration of generative AI in
materials science.
Related papers
- Polymetis:Large Language Modeling for Multiple Material Domains [11.396295878658924]
This paper proposes a large language model Polymetis model for a variety of materials fields.
The model uses a dataset of about 2 million material knowledge instructions, and in the process of building the dataset, we developed the Intelligent Extraction Large Model.
We inject this data into the GLM4-9B model for learning to enhance its inference capabilities in a variety of material domains.
arXiv Detail & Related papers (2024-11-13T16:10:14Z) - TopoChat: Enhancing Topological Materials Retrieval With Large Language Model and Multi-Source Knowledge [4.654635844923322]
Large language models (LLMs) have demonstrated impressive performance in the text generation task.
We develop a specialized dialogue system for topological materials called TopoChat.
TopoChat exhibits superior performance in structural and property querying, material recommendation, and complex relational reasoning.
arXiv Detail & Related papers (2024-09-10T06:01:16Z) - An Autonomous Large Language Model Agent for Chemical Literature Data
Mining [60.85177362167166]
We introduce an end-to-end AI agent framework capable of high-fidelity extraction from extensive chemical literature.
Our framework's efficacy is evaluated using accuracy, recall, and F1 score of reaction condition data.
arXiv Detail & Related papers (2024-02-20T13:21:46Z) - Agent-based Learning of Materials Datasets from Scientific Literature [0.0]
We develop a chemist AI agent, powered by large language models (LLMs), to create structured datasets from natural language text.
Our chemist AI agent, Eunomia, can plan and execute actions by leveraging the existing knowledge from decades of scientific research articles.
arXiv Detail & Related papers (2023-12-18T20:29:58Z) - Large Language Models for Scientific Synthesis, Inference and
Explanation [56.41963802804953]
We show how large language models can perform scientific synthesis, inference, and explanation.
We show that the large language model can augment this "knowledge" by synthesizing from the scientific literature.
This approach has the further advantage that the large language model can explain the machine learning system's predictions.
arXiv Detail & Related papers (2023-10-12T02:17:59Z) - AI-Generated Images as Data Source: The Dawn of Synthetic Era [61.879821573066216]
generative AI has unlocked the potential to create synthetic images that closely resemble real-world photographs.
This paper explores the innovative concept of harnessing these AI-generated images as new data sources.
In contrast to real data, AI-generated data exhibit remarkable advantages, including unmatched abundance and scalability.
arXiv Detail & Related papers (2023-10-03T06:55:19Z) - Artificial Intelligence in Concrete Materials: A Scientometric View [77.34726150561087]
This chapter aims to uncover the main research interests and knowledge structure of the existing literature on AI for concrete materials.
To begin with, a total of 389 journal articles published from 1990 to 2020 were retrieved from the Web of Science.
Scientometric tools such as keyword co-occurrence analysis and documentation co-citation analysis were adopted to quantify features and characteristics of the research field.
arXiv Detail & Related papers (2022-09-17T18:24:56Z) - Artificial Intelligence in Material Engineering: A review on
applications of AI in Material Engineering [0.0]
High-performance computing has made it possible to test deep learning (DL) models with significant parameters.
generative adversarial networks (GANs) have facilitated the generation of chemical compositions of inorganic materials.
The use of AI to analyze the results from existing analytical instruments is also discussed.
arXiv Detail & Related papers (2022-09-15T04:21:07Z) - Graph neural networks for materials science and chemistry [2.2479652717640657]
Graph neural networks (GNNs) are one of the fastest growing classes of machine learning models.
GNNs directly work on a graph or structural representation of molecules and materials.
This review article provides an overview of the basic principles of GNNs, widely used datasets, and state-of-the-art architectures.
arXiv Detail & Related papers (2022-08-05T13:38:34Z) - Simulating Quantum Materials with Digital Quantum Computers [55.41644538483948]
Digital quantum computers (DQCs) can efficiently perform quantum simulations that are otherwise intractable on classical computers.
The aim of this review is to provide a summary of progress made towards achieving physical quantum advantage.
arXiv Detail & Related papers (2021-01-21T20:10:38Z) - Machine Learning in Nano-Scale Biomedical Engineering [77.75587007080894]
We review the existing research regarding the use of machine learning in nano-scale biomedical engineering.
The main challenges that can be formulated as ML problems are classified into the three main categories.
For each of the presented methodologies, special emphasis is given to its principles, applications, and limitations.
arXiv Detail & Related papers (2020-08-05T15:45:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.