The Classification of Optical Galaxy Morphology Using Unsupervised
Learning Techniques
- URL: http://arxiv.org/abs/2206.06165v1
- Date: Mon, 13 Jun 2022 13:52:07 GMT
- Title: The Classification of Optical Galaxy Morphology Using Unsupervised
Learning Techniques
- Authors: Ezra Fielding, Clement N. Nyirenda, Mattia Vaccari
- Abstract summary: Unsupervised learning techniques do not require existing labels to classify data.
This paper aims to implement unsupervised learning algorithms to classify the Galaxy Zoo DECaLS dataset without human supervision.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The advent of large scale, data intensive astronomical surveys has caused the
viability of human-based galaxy morphology classification methods to come into
question. Put simply, too much astronomical data is being produced for
scientists to visually label. Attempts have been made to crowd-source this work
by recruiting volunteers from the general public. However, even these efforts
will soon fail to keep up with data produced by modern surveys. Unsupervised
learning techniques do not require existing labels to classify data and could
pave the way to unplanned discoveries. Therefore, this paper aims to implement
unsupervised learning algorithms to classify the Galaxy Zoo DECaLS dataset
without human supervision. First, a convolutional autoencoder was implemented
as a feature extractor. The extracted features were then clustered via k-means,
fuzzy c-means and agglomerative clustering to provide classifications. The
results were compared to the volunteer classifications of the Galaxy Zoo DECaLS
dataset. Agglomerative clustering generally produced the best results, however,
the performance gain over k-means clustering was not significant. With the
appropriate optimizations, this approach could be used to provide
classifications for the better performing Galaxy Zoo DECaLS decision tree
questions. Ultimately, this unsupervised learning approach provided valuable
insights and results that were useful to scientists.
Related papers
- Granularity Matters in Long-Tail Learning [62.30734737735273]
We offer a novel perspective on long-tail learning, inspired by an observation: datasets with finer granularity tend to be less affected by data imbalance.
We introduce open-set auxiliary classes that are visually similar to existing ones, aiming to enhance representation learning for both head and tail classes.
To prevent the overwhelming presence of auxiliary classes from disrupting training, we introduce a neighbor-silencing loss.
arXiv Detail & Related papers (2024-10-21T13:06:21Z) - A review of unsupervised learning in astronomy [0.0]
This review summarizes popular unsupervised learning methods, and gives an overview of their past, current, and future uses in astronomy.
Unsupervised learning aims to organise the information content of a dataset, in such a way that knowledge can be extracted.
arXiv Detail & Related papers (2024-06-25T06:57:47Z) - XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners [71.8257151788923]
We propose a novel Explainable Active Learning framework (XAL) for low-resource text classification.
XAL encourages classifiers to justify their inferences and delve into unlabeled data for which they cannot provide reasonable explanations.
Experiments on six datasets show that XAL achieves consistent improvement over 9 strong baselines.
arXiv Detail & Related papers (2023-10-09T08:07:04Z) - Influence of Swarm Intelligence in Data Clustering Mechanisms [0.0]
Nature inspired Swarm based algorithms are used for data clustering to cope with larger datasets with lack and inconsistency of data.
This paper reviews the performances of these new approaches and compares which is best for certain problematic situation.
arXiv Detail & Related papers (2023-05-07T08:40:50Z) - Parametric Classification for Generalized Category Discovery: A Baseline
Study [70.73212959385387]
Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples.
We investigate the failure of parametric classifiers, verify the effectiveness of previous design choices when high-quality supervision is available, and identify unreliable pseudo-labels as a key problem.
We propose a simple yet effective parametric classification method that benefits from entropy regularisation, achieves state-of-the-art performance on multiple GCD benchmarks and shows strong robustness to unknown class numbers.
arXiv Detail & Related papers (2022-11-21T18:47:11Z) - Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology
Classification and Anomaly Detection [57.85347204640585]
We develop a Universal Domain Adaptation method DeepAstroUDA.
It can be applied to datasets with different types of class overlap.
For the first time, we demonstrate the successful use of domain adaptation on two very different observational datasets.
arXiv Detail & Related papers (2022-11-01T18:07:21Z) - Detection of extragalactic Ultra-Compact Dwarfs and Globular Clusters
using Explainable AI techniques [1.3764085113103222]
Compact stellar systems such as Ultra-compact dwarfs (UCDs) and Globular Clusters (GCs) around galaxies are known to be the tracers of the merger events that have been forming these galaxies.
Here, we train a machine learning model to separate these objects from the foreground stars and background galaxies using the multi-wavelength imaging data of the Fornax galaxy cluster in 6 filters.
We are able to identify UCDs/GCs with a precision and a recall of >93 percent and provide relevances that reflect the importance of each feature dimension %(colors and angular sizes)
arXiv Detail & Related papers (2022-01-05T13:37:55Z) - CvS: Classification via Segmentation For Small Datasets [52.821178654631254]
This paper presents CvS, a cost-effective classifier for small datasets that derives the classification labels from predicting the segmentation maps.
We evaluate the effectiveness of our framework on diverse problems showing that CvS is able to achieve much higher classification results compared to previous methods when given only a handful of examples.
arXiv Detail & Related papers (2021-10-29T18:41:15Z) - DeepMCAT: Large-Scale Deep Clustering for Medical Image Categorization [24.100651548850895]
We propose an unsupervised approach for automatically clustering and categorizing large-scale medical image datasets.
We investigated the end-to-end training using both class-balanced and imbalanced large-scale datasets.
arXiv Detail & Related papers (2021-09-30T22:39:57Z) - Scalable Hierarchical Agglomerative Clustering [65.66407726145619]
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.