Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings
- URL: http://arxiv.org/abs/2112.03208v1
- Date: Mon, 6 Dec 2021 18:05:49 GMT
- Title: Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings
- Authors: Siqi Wang, Manyuan Lu, Nikita Moshkov, Juan C. Caicedo, Bryan A.
Plummer
- Abstract summary: We propose an embedding learning approach that learns a set of experts that are specialized in capturing technical variations in our training set and then aggregates specialist's predictions at test time.
We evaluate our approach on three datasets for tasks like drug discovery, boosting performance on identifying the true mechanism of action of cell treatments by 5.5-11% over the state-of-the-art.
- Score: 15.964908790433855
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Analyzing the morphology of cells in microscopy images can provide insights
into the mechanism of compounds or the function of genes. Addressing this task
requires methods that can not only extract biological information from the
images, but also ignore technical variations, ie, changes in experimental
procedure or differences between equipments used to collect microscopy images.
We propose Treatment ExemplArs with Mixture-of-experts (TEAMs), an embedding
learning approach that learns a set of experts that are specialized in
capturing technical variations in our training set and then aggregates
specialist's predictions at test time. Thus, TEAMs can learn powerful
embeddings with less technical variation bias by minimizing the noise from
every expert. To train our model, we leverage Treatment Exemplars that enable
our approach to capture the distribution of the entire dataset in every
minibatch while still fitting into GPU memory. We evaluate our approach on
three datasets for tasks like drug discovery, boosting performance on
identifying the true mechanism of action of cell treatments by 5.5-11% over the
state-of-the-art.
Related papers
- Weakly Supervised Set-Consistency Learning Improves Morphological Profiling of Single-Cell Images [0.6491172192043603]
We propose a set-level consistency learning algorithm, Set-DINO, to improve learned representations of perturbation effects in single-cell images.
We conduct experiments on a large-scale Optical Pooled Screening dataset with more than 5000 genetic perturbations.
arXiv Detail & Related papers (2024-06-08T00:53:30Z) - Optimizations of Autoencoders for Analysis and Classification of
Microscopic In Situ Hybridization Images [68.8204255655161]
We propose a deep-learning framework to detect and classify areas of microscopic images with similar levels of gene expression.
The data we analyze requires an unsupervised learning model for which we employ a type of Artificial Neural Network - Deep Learning Autoencoders.
arXiv Detail & Related papers (2023-04-19T13:45:28Z) - Ambiguous Medical Image Segmentation using Diffusion Models [60.378180265885945]
We introduce a single diffusion model-based approach that produces multiple plausible outputs by learning a distribution over group insights.
Our proposed model generates a distribution of segmentation masks by leveraging the inherent sampling process of diffusion.
Comprehensive results show that our proposed approach outperforms existing state-of-the-art ambiguous segmentation networks.
arXiv Detail & Related papers (2023-04-10T17:58:22Z) - Domain adaptation using optimal transport for invariant learning using
histopathology datasets [13.133231212085988]
Histopathology is critical for the diagnosis of many diseases, including cancer.
computational techniques are limited by batch effects, where technical factors like differences in preparation protocol or scanners can alter the appearance of slides.
Here, we propose a domain adaptation method that improves the generalization of histological models to data from unseen institutions.
arXiv Detail & Related papers (2023-03-03T22:19:19Z) - CellMix: A General Instance Relationship based Method for Data
Augmentation Towards Pathology Image Classification [6.9596321268519326]
In pathology image analysis, obtaining and maintaining high-quality annotated samples is an extremely labor-intensive task.
We propose the CellMix framework, which employs a novel distribution-oriented in-place shuffle approach.
Our experiments in pathology image classification tasks demonstrate state-of-the-art (SOTA) performance on 7 distinct datasets.
arXiv Detail & Related papers (2023-01-27T03:17:35Z) - Stain-invariant self supervised learning for histopathology image
analysis [74.98663573628743]
We present a self-supervised algorithm for several classification tasks within hematoxylin and eosin stained images of breast cancer.
Our method achieves the state-of-the-art performance on several publicly available breast cancer datasets.
arXiv Detail & Related papers (2022-11-14T18:16:36Z) - Bayesian Active Learning for Scanning Probe Microscopy: from Gaussian
Processes to Hypothesis Learning [0.0]
We discuss the basic principles of Bayesian active learning and illustrate its applications for scanning probe microscopes (SPMs)
These frameworks allow for the use of prior data, the discovery of specific functionalities as encoded in spectral data, and exploration of physical laws manifesting during the experiment.
arXiv Detail & Related papers (2022-05-30T23:01:41Z) - Learning multi-scale functional representations of proteins from
single-cell microscopy data [77.34726150561087]
We show that simple convolutional networks trained on localization classification can learn protein representations that encapsulate diverse functional information.
We also propose a robust evaluation strategy to assess quality of protein representations across different scales of biological function.
arXiv Detail & Related papers (2022-05-24T00:00:07Z) - A parameter refinement method for Ptychography based on Deep Learning
concepts [55.41644538483948]
coarse parametrisation in propagation distance, position errors and partial coherence frequently menaces the experiment viability.
A modern Deep Learning framework is used to correct autonomously the setup incoherences, thus improving the quality of a ptychography reconstruction.
We tested our system on both synthetic datasets and also on real data acquired at the TwinMic beamline of the Elettra synchrotron facility.
arXiv Detail & Related papers (2021-05-18T10:15:17Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.