PRIME: Physics-Related Intelligent Mixture of Experts for Transistor Characteristics Prediction
- URL: http://arxiv.org/abs/2505.11523v1
- Date: Sat, 10 May 2025 15:53:12 GMT
- Title: PRIME: Physics-Related Intelligent Mixture of Experts for Transistor Characteristics Prediction
- Authors: Zhenxing Dou, Yijiao Wang, Tao Zou, Zhiwei Chen, Fei Liu, Peng Wang, Weisheng Zhao,
- Abstract summary: PRIME (Physics-Related Intelligent Mixture of Experts) is proposed to capture and integrate complex regional characteristics.<n>In essence, our framework incorporates physics-based knowledge with data-driven intelligence.<n>Extensive evaluations are conducted on various gate-all-around (GAA) structures to examine the effectiveness of PRIME.
- Score: 11.03749258060248
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In recent years, machine learning has been extensively applied to data prediction during process ramp-up, with a particular focus on transistor characteristics for circuit design and manufacture. However, capturing the nonlinear current response across multiple operating regions remains a challenge for neural networks. To address such challenge, a novel machine learning framework, PRIME (Physics-Related Intelligent Mixture of Experts), is proposed to capture and integrate complex regional characteristics. In essence, our framework incorporates physics-based knowledge with data-driven intelligence. By leveraging a dynamic weighting mechanism in its gating network, PRIME adaptively activates the suitable expert model based on distinct input data features. Extensive evaluations are conducted on various gate-all-around (GAA) structures to examine the effectiveness of PRIME and considerable improvements (60\%-84\%) in prediction accuracy are shown over state-of-the-art models.
Related papers
- Detecting and Pruning Prominent but Detrimental Neurons in Large Language Models [68.57424628540907]
Large language models (LLMs) often develop learned mechanisms specialized to specific datasets.<n>We introduce a fine-tuning approach designed to enhance generalization by identifying and pruning neurons associated with dataset-specific mechanisms.<n>Our method employs Integrated Gradients to quantify each neuron's influence on high-confidence predictions, pinpointing those that disproportionately contribute to dataset-specific performance.
arXiv Detail & Related papers (2025-07-12T08:10:10Z) - Quantum Neural Networks for Wind Energy Forecasting: A Comparative Study of Performance and Scalability with Classical Models [0.0]
Quantum Neural Networks (QNNs) are emerging as a powerful alternative to classical machine learning methods.<n>This study provides an in-depth investigation of QNNs for predicting the power output of a wind turbine.<n>We experimentally demonstrate that QNNs can achieve predictive performance that is competitive with, and in some cases marginally better than, the benchmarked classical approaches.
arXiv Detail & Related papers (2025-06-28T10:51:27Z) - PMNO: A novel physics guided multi-step neural operator predictor for partial differential equations [23.04840527974364]
We propose a novel physics guided multi-step neural operator (PMNO) architecture to address challenges in long-horizon prediction of complex physical systems.<n>The PMNO framework replaces the single-step input with multi-step historical data in the forward pass and introduces an implicit time-stepping scheme during backpropagation.<n>We demonstrate the superior predictive performance of PMNO predictor across a diverse range of physical systems.
arXiv Detail & Related papers (2025-06-02T12:33:50Z) - SA-GAT-SR: Self-Adaptable Graph Attention Networks with Symbolic Regression for high-fidelity material property prediction [0.0]
We introduce a novel computational paradigm, Self-Adaptable Graph Attention Networks integrated with Symbolic Regression (SA-GAT-SR)<n>Our framework employs a self-adaptable encoding algorithm that automatically identifies and adjust attention weights so as to screen critical features from an expansive 180-dimensional feature space.<n>The integrated SR module subsequently distills these features into compact analytical expressions that explicitly reveal quantum-mechanically meaningful relationships.
arXiv Detail & Related papers (2025-05-01T16:05:10Z) - Spatio-temporal Attention-based Hidden Physics-informed Neural Network for Remaining Useful Life Prediction [1.8554335256160261]
We introduce a Spatio-temporal Attention-based Hidden Physics-informed Neural Network (STA-HPINN) for predicting Remaining Useful Life (RUL)
The hidden physics-informed neural network is utilized to capture the dimension physics mechanisms that govern the evolution of RUL.
The approach is validated on a benchmark dataset, demonstrating exceptional performance when compared to cutting-edge methods.
arXiv Detail & Related papers (2024-05-20T21:10:18Z) - AI-driven emergence of frequency information non-uniform distribution
via THz metasurface spectrum prediction [36.84046475101662]
We unveil previously unreported information characteristics pertaining to different frequencies emerged during our work on predicting the terahertz spectral modulation effects of metasurfaces based on AI-prediction.
This approach effectively optimize the utilization of existing datasets and paves the way for interdisciplinary research and applications in artificial intelligence, chemistry, composite material design, biomedicine, and other fields.
arXiv Detail & Related papers (2023-12-05T01:48:58Z) - Physics Inspired Hybrid Attention for SAR Target Recognition [61.01086031364307]
We propose a physics inspired hybrid attention (PIHA) mechanism and the once-for-all (OFA) evaluation protocol to address the issues.
PIHA leverages the high-level semantics of physical information to activate and guide the feature group aware of local semantics of target.
Our method outperforms other state-of-the-art approaches in 12 test scenarios with same ASC parameters.
arXiv Detail & Related papers (2023-09-27T14:39:41Z) - Physics-Inspired Temporal Learning of Quadrotor Dynamics for Accurate
Model Predictive Trajectory Tracking [76.27433308688592]
Accurately modeling quadrotor's system dynamics is critical for guaranteeing agile, safe, and stable navigation.
We present a novel Physics-Inspired Temporal Convolutional Network (PI-TCN) approach to learning quadrotor's system dynamics purely from robot experience.
Our approach combines the expressive power of sparse temporal convolutions and dense feed-forward connections to make accurate system predictions.
arXiv Detail & Related papers (2022-06-07T13:51:35Z) - Privacy-preserving machine learning with tensor networks [37.01494003138908]
We show that tensor network architectures have especially prospective properties for privacy-preserving machine learning.
First, we describe a new privacy vulnerability that is present in feedforward neural networks, illustrating it in synthetic and real-world datasets.
We rigorously prove that such conditions are satisfied by tensor-network architectures.
arXiv Detail & Related papers (2022-02-24T19:04:35Z) - Leveraging the structure of dynamical systems for data-driven modeling [111.45324708884813]
We consider the impact of the training set and its structure on the quality of the long-term prediction.
We show how an informed design of the training set, based on invariants of the system and the structure of the underlying attractor, significantly improves the resulting models.
arXiv Detail & Related papers (2021-12-15T20:09:20Z) - Physically Explainable CNN for SAR Image Classification [59.63879146724284]
In this paper, we propose a novel physics guided and injected neural network for SAR image classification.
The proposed framework comprises three parts: (1) generating physics guided signals using existing explainable models, (2) learning physics-aware features with physics guided network, and (3) injecting the physics-aware features adaptively to the conventional classification deep learning model for prediction.
The experimental results show that our proposed method substantially improve the classification performance compared with the counterpart data-driven CNN.
arXiv Detail & Related papers (2021-10-27T03:30:18Z) - Quantum-tailored machine-learning characterization of a superconducting
qubit [50.591267188664666]
We develop an approach to characterize the dynamics of a quantum device and learn device parameters.
This approach outperforms physics-agnostic recurrent neural networks trained on numerically generated and experimental data.
This demonstration shows how leveraging domain knowledge improves the accuracy and efficiency of this characterization task.
arXiv Detail & Related papers (2021-06-24T15:58:57Z) - Machine learning based digital twin for dynamical systems with multiple
time-scales [0.0]
Digital twin technology has a huge potential for widespread applications in different industrial sectors such as infrastructure, aerospace, and automotive.
Here we focus on a digital twin framework for linear single-degree-of-freedom structural dynamic systems evolving in two different operational time scales.
arXiv Detail & Related papers (2020-05-12T15:33:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.