Hybrid Supervised and Reinforcement Learning for the Design and
Optimization of Nanophotonic Structures
- URL: http://arxiv.org/abs/2209.04447v1
- Date: Thu, 8 Sep 2022 22:43:40 GMT
- Title: Hybrid Supervised and Reinforcement Learning for the Design and
Optimization of Nanophotonic Structures
- Authors: Christopher Yeung, Benjamin Pham, Zihan Zhang, Katherine T. Fountaine,
and Aaswath P. Raman
- Abstract summary: This paper presents a hybrid supervised and reinforcement learning approach to the inverse design of nanophotonic structures.
We show this approach can reduce training data dependence, improve the generalizability of model predictions, and shorten exploratory training times by orders of magnitude.
- Score: 8.677532138573984
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: From higher computational efficiency to enabling the discovery of novel and
complex structures, deep learning has emerged as a powerful framework for the
design and optimization of nanophotonic circuits and components. However, both
data-driven and exploration-based machine learning strategies have limitations
in their effectiveness for nanophotonic inverse design. Supervised machine
learning approaches require large quantities of training data to produce
high-performance models and have difficulty generalizing beyond training data
given the complexity of the design space. Unsupervised and reinforcement
learning-based approaches on the other hand can have very lengthy training or
optimization times associated with them. Here we demonstrate a hybrid
supervised learning and reinforcement learning approach to the inverse design
of nanophotonic structures and show this approach can reduce training data
dependence, improve the generalizability of model predictions, and shorten
exploratory training times by orders of magnitude. The presented strategy thus
addresses a number of contemporary deep learning-based challenges, while
opening the door for new design methodologies that leverage multiple classes of
machine learning algorithms to produce more effective and practical solutions
for photonic design.
Related papers
- Exploring the design space of deep-learning-based weather forecasting systems [56.129148006412855]
This paper systematically analyzes the impact of different design choices on deep-learning-based weather forecasting systems.
We study fixed-grid architectures such as UNet, fully convolutional architectures, and transformer-based models.
We propose a hybrid system that combines the strong performance of fixed-grid models with the flexibility of grid-invariant architectures.
arXiv Detail & Related papers (2024-10-09T22:25:50Z) - Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving [52.808273563372126]
This paper proposes a novel hierarchical BEV perception paradigm, aiming to provide a library of fundamental perception modules and user-friendly graphical interface.
We conduct the Pretrain-Finetune strategy to effectively utilize large scale public datasets and streamline development processes.
We also present a Multi-Module Learning (MML) approach, enhancing performance through synergistic and iterative training of multiple models.
arXiv Detail & Related papers (2024-07-17T11:17:20Z) - An Advanced Physics-Informed Neural Operator for Comprehensive Design Optimization of Highly-Nonlinear Systems: An Aerospace Composites Processing Case Study [0.0]
This paper introduces an advanced physics-informed DeepONet tailored for such complex systems with multiple input functions.
The proposed model handles high-dimensional design spaces with significantly improved accuracy, outperforming the vanilla physics-informed DeepONet by two orders of magnitude.
arXiv Detail & Related papers (2024-06-20T20:19:30Z) - Compositional Generative Inverse Design [69.22782875567547]
Inverse design, where we seek to design input variables in order to optimize an underlying objective function, is an important problem.
We show that by instead optimizing over the learned energy function captured by the diffusion model, we can avoid such adversarial examples.
In an N-body interaction task and a challenging 2D multi-airfoil design task, we demonstrate that by composing the learned diffusion model at test time, our method allows us to design initial states and boundary shapes.
arXiv Detail & Related papers (2024-01-24T01:33:39Z) - A Survey on Design Methodologies for Accelerating Deep Learning on
Heterogeneous Architectures [9.982620766142345]
The need for efficient hardware accelerators has become more pressing to design heterogeneous HPC platforms.
Several methodologies and tools have been proposed to design accelerators for Deep Learning.
This survey provides a holistic review of the most influential design methodologies and EDA tools proposed in recent years to implement Deep Learning accelerators.
arXiv Detail & Related papers (2023-11-29T17:10:16Z) - On Efficient Training of Large-Scale Deep Learning Models: A Literature
Review [90.87691246153612]
The field of deep learning has witnessed significant progress, particularly in computer vision (CV), natural language processing (NLP), and speech.
The use of large-scale models trained on vast amounts of data holds immense promise for practical applications.
With the increasing demands on computational capacity, a comprehensive summarization on acceleration techniques of training deep learning models is still much anticipated.
arXiv Detail & Related papers (2023-04-07T11:13:23Z) - Design of Convolutional Extreme Learning Machines for Vision-Based
Navigation Around Small Bodies [0.0]
Deep learning architectures such as convolutional neural networks are the standard in computer vision for image processing tasks.
Their accuracy however often comes at the cost of long and computationally expensive training.
A different method known as convolutional extreme learning machine has shown the potential to perform equally with a dramatic decrease in training time.
arXiv Detail & Related papers (2022-10-28T16:24:21Z) - Dynamically Grown Generative Adversarial Networks [111.43128389995341]
We propose a method to dynamically grow a GAN during training, optimizing the network architecture and its parameters together with automation.
The method embeds architecture search techniques as an interleaving step with gradient-based training to periodically seek the optimal architecture-growing strategy for the generator and discriminator.
arXiv Detail & Related papers (2021-06-16T01:25:51Z) - LENAS: Learning-based Neural Architecture Search and Ensemble for 3D Radiotherapy Dose Prediction [42.38793195337463]
We propose a novel learning-based ensemble approach named LENAS, which integrates neural architecture search with knowledge distillation for 3D radiotherapy dose prediction.
Our approach starts by exhaustively searching each block from an enormous architecture space to identify multiple architectures that exhibit promising performance.
To mitigate the complexity introduced by the model ensemble, we adopt the teacher-student paradigm, leveraging the diverse outputs from multiple learned networks as supervisory signals.
arXiv Detail & Related papers (2021-06-12T10:08:52Z) - Generative Design by Reinforcement Learning: Enhancing the Diversity of
Topology Optimization Designs [5.8010446129208155]
This study proposes a reinforcement learning based generative design process, with reward functions maximizing the diversity of topology designs.
We show that RL-based generative design produces a large number of diverse designs within a short inference time by exploiting GPU in a fully automated manner.
arXiv Detail & Related papers (2020-08-17T06:50:47Z) - Learning Deformable Image Registration from Optimization: Perspective,
Modules, Bilevel Training and Beyond [62.730497582218284]
We develop a new deep learning based framework to optimize a diffeomorphic model via multi-scale propagation.
We conduct two groups of image registration experiments on 3D volume datasets including image-to-atlas registration on brain MRI data and image-to-image registration on liver CT data.
arXiv Detail & Related papers (2020-04-30T03:23:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.