Evaluation of synthetic and experimental training data in supervised
machine learning applied to charge state detection of quantum dots
- URL: http://arxiv.org/abs/2005.08131v1
- Date: Sat, 16 May 2020 23:41:31 GMT
- Title: Evaluation of synthetic and experimental training data in supervised
machine learning applied to charge state detection of quantum dots
- Authors: Jana Darulova, Matthias Troyer, Maja C. Cassidy
- Abstract summary: We evaluate the prediction accuracy of a range of machine learning models trained on simulated and experimental data.
We find that classifiers perform best on either purely experimental or a combination of synthetic and experimental training data.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automated tuning of gate-defined quantum dots is a requirement for large
scale semiconductor based qubit initialisation. An essential step of these
tuning procedures is charge state detection based on charge stability diagrams.
Using supervised machine learning to perform this task requires a large dataset
for models to train on. In order to avoid hand labelling experimental data,
synthetic data has been explored as an alternative. While providing a
significant increase in the size of the training dataset compared to using
experimental data, using synthetic data means that classifiers are trained on
data sourced from a different distribution than the experimental data that is
part of the tuning process. Here we evaluate the prediction accuracy of a range
of machine learning models trained on simulated and experimental data and their
ability to generalise to experimental charge stability diagrams in two
dimensional electron gas and nanowire devices. We find that classifiers perform
best on either purely experimental or a combination of synthetic and
experimental training data, and that adding common experimental noise
signatures to the synthetic data does not dramatically improve the
classification accuracy. These results suggest that experimental training data
as well as realistic quantum dot simulations and noise models are essential in
charge state detection using supervised machine learning.
Related papers
- Learning with Noisy Foundation Models [95.50968225050012]
This paper is the first work to comprehensively understand and analyze the nature of noise in pre-training datasets.
We propose a tuning method (NMTune) to affine the feature space to mitigate the malignant effect of noise and improve generalization.
arXiv Detail & Related papers (2024-03-11T16:22:41Z) - Post-training Model Quantization Using GANs for Synthetic Data
Generation [57.40733249681334]
We investigate the use of synthetic data as a substitute for the calibration with real data for the quantization method.
We compare the performance of models quantized using data generated by StyleGAN2-ADA and our pre-trained DiStyleGAN, with quantization using real data and an alternative data generation method based on fractal images.
arXiv Detail & Related papers (2023-05-10T11:10:09Z) - Trustworthiness of Laser-Induced Breakdown Spectroscopy Predictions via
Simulation-based Synthetic Data Augmentation and Multitask Learning [4.633997895806144]
We consider quantitative analyses of spectral data using laser-induced breakdown spectroscopy.
We address the small size of training data available, and the validation of the predictions during inference on unknown data.
arXiv Detail & Related papers (2022-10-07T18:00:09Z) - SynBench: Task-Agnostic Benchmarking of Pretrained Representations using
Synthetic Data [78.21197488065177]
Recent success in fine-tuning large models, that are pretrained on broad data at scale, on downstream tasks has led to a significant paradigm shift in deep learning.
This paper proposes a new task-agnostic framework, textitSynBench, to measure the quality of pretrained representations using synthetic data.
arXiv Detail & Related papers (2022-10-06T15:25:00Z) - BeCAPTCHA-Type: Biometric Keystroke Data Generation for Improved Bot
Detection [63.447493500066045]
This work proposes a data driven learning model for the synthesis of keystroke biometric data.
The proposed method is compared with two statistical approaches based on Universal and User-dependent models.
Our experimental framework considers a dataset with 136 million keystroke events from 168 thousand subjects.
arXiv Detail & Related papers (2022-07-27T09:26:15Z) - CMW-Net: Learning a Class-Aware Sample Weighting Mapping for Robust Deep
Learning [55.733193075728096]
Modern deep neural networks can easily overfit to biased training data containing corrupted labels or class imbalance.
Sample re-weighting methods are popularly used to alleviate this data bias issue.
We propose a meta-model capable of adaptively learning an explicit weighting scheme directly from data.
arXiv Detail & Related papers (2022-02-11T13:49:51Z) - MLReal: Bridging the gap between training on synthetic data and real
data applications in machine learning [1.9852463786440129]
We describe a novel approach to enhance supervised training on synthetic data with real data features.
In the training stage, the input data are from the synthetic domain and the auto-correlated data are from the real domain.
In the inference/application stage, the input data are from the real subset domain and the mean of the autocorrelated sections are from the synthetic data subset domain.
arXiv Detail & Related papers (2021-09-11T14:43:34Z) - Bias-Aware Loss for Training Image and Speech Quality Prediction Models
from Multiple Datasets [13.132388683797503]
We propose a bias-aware loss function that estimates each dataset's biases during training with a linear function.
We prove the efficiency of the proposed method by training and validating quality prediction models on synthetic and subjective image and speech quality datasets.
arXiv Detail & Related papers (2021-04-20T19:20:11Z) - Cognitive simulation models for inertial confinement fusion: Combining
simulation and experimental data [0.0]
Researchers rely heavily on computer simulations to explore the design space in search of high-performing implosions.
For more effective design and investigation, simulations require input from past experimental data to better predict future performance.
We describe a cognitive simulation method for combining simulation and experimental data into a common, predictive model.
arXiv Detail & Related papers (2021-03-19T02:00:14Z) - Neural network quantum state tomography in a two-qubit experiment [52.77024349608834]
Machine learning inspired variational methods provide a promising route towards scalable state characterization for quantum simulators.
We benchmark and compare several such approaches by applying them to measured data from an experiment producing two-qubit entangled states.
We find that in the presence of experimental imperfections and noise, confining the variational manifold to physical states greatly improves the quality of the reconstructed states.
arXiv Detail & Related papers (2020-07-31T17:25:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.