Neural Network Surrogate Model for Junction Temperature and Hotspot Position in $3$D Multi-Layer High Bandwidth Memory (HBM) Chiplets under Varying Thermal Conditions
- URL: http://arxiv.org/abs/2503.04049v1
- Date: Thu, 06 Mar 2025 03:05:54 GMT
- Title: Neural Network Surrogate Model for Junction Temperature and Hotspot Position in $3$D Multi-Layer High Bandwidth Memory (HBM) Chiplets under Varying Thermal Conditions
- Authors: Chengxin Zhang, Yujie Liu, Quan Chen,
- Abstract summary: This work develops a data-driven neural network model for the fast prediction of junction temperature and hotspot position in 3D chiplets.<n>It shows good generalizability for other thermal conditions not considered in the parameter space.<n>It makes it a valuable tool for improving thermal management and performance in high-performance computing applications.
- Score: 6.277496992820668
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As the demand for computational power increases, high-bandwidth memory (HBM) has become a critical technology for next-generation computing systems. However, the widespread adoption of HBM presents significant thermal management challenges, particularly in multilayer through-silicon-via (TSV) stacked structures under varying thermal conditions, where accurate prediction of junction temperature and hotspot position is essential during the early design. This work develops a data-driven neural network model for the fast prediction of junction temperature and hotspot position in 3D HBM chiplets. The model, trained with a data set of $13,494$ different combinations of thermal condition parameters, sampled from a vast parameter space characterized by high-dimensional combination (up to $3^{27}$), can accurately and quickly infer the junction temperature and hotspot position for any thermal conditions in the parameter space. Moreover, it shows good generalizability for other thermal conditions not considered in the parameter space. The data set is constructed using accurate finite element solvers. This method not only minimizes the reliance on costly experimental tests and extensive computational resources for finite element analysis but also accelerates the design and optimization of complex HBM systems, making it a valuable tool for improving thermal management and performance in high-performance computing applications.
Related papers
- 2D-ThermAl: Physics-Informed Framework for Thermal Analysis of Circuits using Generative AI [9.414651358362388]
'ThermAl' is a physics-informed generative AI framework which effectively identifies heat sources and estimates full-chip transient and steady-state thermal distributions.<n>Our model is trained on an extensive dataset of heat dissipation maps, ranging from simple logic gates (e.g., inverters, NAND, XOR) to complex designs, generated via COMSOL.<n> Experimental results demonstrate that ThermAl delivers precise temperature mappings for large circuits, with a root mean squared error (RMSE) of only 0.71C, and outperforms conventional FEM tools by running up to 200 times faster.
arXiv Detail & Related papers (2025-12-01T00:45:26Z) - Fast 3D Surrogate Modeling for Data Center Thermal Management [15.644716872105002]
Traditional thermal CFD solvers are computationally expensive and require expert-crafted meshes and boundary conditions.<n>We develop a vision-based surrogate modeling framework that operates directly on a 3D voxelized representation of the data center.<n>Our results show that the surrogate models generalize across data center configurations and achieve up to 20,000x speedup.
arXiv Detail & Related papers (2025-11-13T02:12:24Z) - Data-Driven Temperature Modelling of Machine Tools by Neural Networks: A Benchmark [2.07811670193148]
We introduce a novel paradigm in which NNs are trained to predict high-fidelity temperature and heat flux fields within the machine tool.<n>The proposed framework enables subsequent computation and correction of a wide range of error types using modular, swappable downstream components.
arXiv Detail & Related papers (2025-09-26T15:43:08Z) - Data-Driven Optical To Thermal Inference in Pool Boiling Using Generative Adversarial Networks [1.0499611180329804]
We present a data-driven framework that infers temperature fields from geometric phase in a canonical pool boiling configuration.<n>Our results highlight the potential of deep generative models to bridge the gap between observable multiphase phenomena and underlying thermal transport.
arXiv Detail & Related papers (2025-05-01T19:26:01Z) - Optimizing Temperature for Language Models with Multi-Sample Inference [47.14991144052361]
This paper addresses the challenge of automatically identifying the (near)-optimal temperature for different large language models.<n>We provide a comprehensive analysis of temperature's role in performance optimization, considering variations in model architectures, datasets, task types, model sizes, and predictive accuracy.<n>We propose a novel entropy-based metric for automated temperature optimization, which consistently outperforms fixed-temperature baselines.
arXiv Detail & Related papers (2025-02-07T19:35:25Z) - RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model [59.37279559684668]
We introduce RS-vHeat, an efficient multi-modal remote sensing foundation model.
Specifically, RS-vHeat applies the Heat Conduction Operator (HCO) with a complexity of $O(N1.5)$ and a global receptive field.
Compared to attention-based remote sensing foundation models, we reduce memory usage by 84%, FLOPs by 24% and improves throughput by 2.7 times.
arXiv Detail & Related papers (2024-11-27T01:43:38Z) - Machine Learning-Assisted Thermoelectric Cooling for On-Demand Multi-Hotspot Thermal Management [12.515874333424929]
Thermoelectric coolers (TECs) offer a promising solution for direct cooling of local hotspots and active thermal management in advanced electronic systems.
TECs present significant trade-offs among spatial cooling, heating and power consumption.
We present a novel machine learning-assisted optimization algorithm for thermoelectric coolers that can achieve global optimal temperature.
arXiv Detail & Related papers (2024-04-20T18:35:45Z) - Deep encoder-decoder hierarchical convolutional neural networks for conjugate heat transfer surrogate modeling [0.0]
Conjugate heat transfer (CHT) analyses are vital for the design of many energy systems.<n>High-fidelity CHT numerical simulations are computationally intensive.<n>We develop a modular deep encoder-decoder hierarchical (DeepEDH) convolutional neural network for CHT analyses.
arXiv Detail & Related papers (2023-11-24T21:45:11Z) - DeepOHeat: Operator Learning-based Ultra-fast Thermal Simulation in
3D-IC Design [7.112313433801361]
DeepOHeat is a physics-aware operator learning framework to predict the temperature field of a family of heat equations.
We show that, for the unseen testing cases, a well-trained DeepOHeat can produce accurate results with $1000times$ to $300000times$ speedup.
arXiv Detail & Related papers (2023-02-25T01:18:48Z) - Environmental Sensor Placement with Convolutional Gaussian Neural
Processes [65.13973319334625]
It is challenging to place sensors in a way that maximises the informativeness of their measurements, particularly in remote regions like Antarctica.
Probabilistic machine learning models can suggest informative sensor placements by finding sites that maximally reduce prediction uncertainty.
This paper proposes using a convolutional Gaussian neural process (ConvGNP) to address these issues.
arXiv Detail & Related papers (2022-11-18T17:25:14Z) - Does Thermal Really Always Matter for RGB-T Salient Object Detection? [153.17156598262656]
This paper proposes a network named TNet to solve the RGB-T salient object detection (SOD) task.
In this paper, we introduce a global illumination estimation module to predict the global illuminance score of the image.
On the other hand, we introduce a two-stage localization and complementation module in the decoding phase to transfer object localization cue and internal integrity cue in thermal features to the RGB modality.
arXiv Detail & Related papers (2022-10-09T13:50:12Z) - Rapid Flow Behavior Modeling of Thermal Interface Materials Using Deep
Neural Networks [0.20999222360659608]
Thermal Interface Materials (TIMs) are widely used in electronic packaging.
We propose a lightweight rectangle to model the spreading behavior of TIM.
We further speed up the calculation by training an Artificial Neural Network (ANN) on data from this model.
arXiv Detail & Related papers (2022-08-08T10:44:17Z) - Collaborative Intelligent Reflecting Surface Networks with Multi-Agent
Reinforcement Learning [63.83425382922157]
Intelligent reflecting surface (IRS) is envisioned to be widely applied in future wireless networks.
In this paper, we investigate a multi-user communication system assisted by cooperative IRS devices with the capability of energy harvesting.
arXiv Detail & Related papers (2022-03-26T20:37:14Z) - Energy-Efficient Model Compression and Splitting for Collaborative
Inference Over Time-Varying Channels [52.60092598312894]
We propose a technique to reduce the total energy bill at the edge device by utilizing model compression and time-varying model split between the edge and remote nodes.
Our proposed solution results in minimal energy consumption and $CO$ emission compared to the considered baselines.
arXiv Detail & Related papers (2021-06-02T07:36:27Z) - Uhlmann Fidelity and Fidelity Susceptibility for Integrable Spin Chains
at Finite Temperature: Exact Results [68.8204255655161]
We show that the proper inclusion of the odd parity subspace leads to the enhancement of maximal fidelity susceptibility in the intermediate range of temperatures.
The correct low-temperature behavior is captured by an approximation involving the two lowest many-body energy eigenstates.
arXiv Detail & Related papers (2021-05-11T14:08:02Z) - Role of topology in determining the precision of a finite thermometer [58.720142291102135]
We find that low connectivity is a resource to build precise thermometers working at low temperatures.
We compare the precision achievable by position measurement to the optimal one, which itself corresponds to energy measurement.
arXiv Detail & Related papers (2021-04-21T17:19:42Z) - Thermal Neural Networks: Lumped-Parameter Thermal Modeling With
State-Space Machine Learning [0.0]
Thermal models for electric power systems are required to be both, real-time capable and of high estimation accuracy.
In this work, the thermal neural network (TNN) is introduced, which unifies both, consolidated knowledge in the form of heat-transfer-based lumped- parameter models.
A TNN has physically interpretable states through its state-space representation, is end-to-end trainable, and requires no material, geometry, nor expert knowledge for its design.
arXiv Detail & Related papers (2021-03-30T13:15:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.