AstronomicAL: An interactive dashboard for visualisation, integration
and classification of data using Active Learning
- URL: http://arxiv.org/abs/2109.05207v1
- Date: Sat, 11 Sep 2021 07:32:26 GMT
- Title: AstronomicAL: An interactive dashboard for visualisation, integration
and classification of data using Active Learning
- Authors: Grant Stevens, Sotiria Fotopoulou, Malcolm N. Bremer, Oliver Ray
- Abstract summary: AstronomicAL is a human-in-the-loop interactive labelling and training dashboard.
It allows users to create reliable datasets and robust classifiers using active learning.
System allows users to visualise and integrate data from different sources.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: AstronomicAL is a human-in-the-loop interactive labelling and training
dashboard that allows users to create reliable datasets and robust classifiers
using active learning. This technique prioritises data that offer high
information gain, leading to improved performance using substantially less
data. The system allows users to visualise and integrate data from different
sources and deal with incorrect or missing labels and imbalanced class sizes.
AstronomicAL enables experts to visualise domain-specific plots and key
information relating both to broader context and details of a point of interest
drawn from a variety of data sources, ensuring reliable labels. In addition,
AstronomicAL provides functionality to explore all aspects of the training
process, including custom models and query strategies. This makes the software
a tool for experimenting with both domain-specific classifications and more
general-purpose machine learning strategies. We illustrate using the system
with an astronomical dataset due to the field's immediate need; however,
AstronomicAL has been designed for datasets from any discipline. Finally, by
exporting a simple configuration file, entire layouts, models, and assigned
labels can be shared with the community. This allows for complete transparency
and ensures that the process of reproducing results is effortless
Related papers
- A Self Supervised StyleGAN for Image Annotation and Classification with
Extremely Limited Labels [35.43549147657739]
We propose SS-StyleGAN, a self-supervised approach for image annotation and classification suitable for extremely small annotated datasets.
We show that the proposed method attains strong classification results using small labeled datasets of sizes 50 and even 10.
arXiv Detail & Related papers (2023-12-26T09:46:50Z) - infoVerse: A Universal Framework for Dataset Characterization with
Multidimensional Meta-information [68.76707843019886]
infoVerse is a universal framework for dataset characterization.
infoVerse captures multidimensional characteristics of datasets by incorporating various model-driven meta-information.
In three real-world applications (data pruning, active learning, and data annotation), the samples chosen on infoVerse space consistently outperform strong baselines.
arXiv Detail & Related papers (2023-05-30T18:12:48Z) - Learn to Explore: on Bootstrapping Interactive Data Exploration with
Meta-learning [8.92180350317399]
We propose a learning-to-explore framework, based on meta-learning, which learns how to learn a classifier with automatically generated meta-tasks.
Our proposal outperforms existing explore-by-example solutions in terms of accuracy and efficiency.
arXiv Detail & Related papers (2022-12-07T03:12:41Z) - Is margin all you need? An extensive empirical study of active learning
on tabular data [66.18464006872345]
We analyze the performance of a variety of active learning algorithms on 69 real-world datasets from the OpenML-CC18 benchmark.
Surprisingly, we find that the classical margin sampling technique matches or outperforms all others, including current state-of-art.
arXiv Detail & Related papers (2022-10-07T21:18:24Z) - AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning [69.47585818994959]
We evaluate a big data processing pipeline to auto-generate labels for remote sensing data.
We utilize the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas.
arXiv Detail & Related papers (2022-01-31T20:02:22Z) - TAGLETS: A System for Automatic Semi-Supervised Learning with Auxiliary
Data [8.653321928148545]
Machine learning practitioners often have access to a spectrum of data.
We describe TAGLETS, a system built to study techniques for automatically exploiting all three types of data.
arXiv Detail & Related papers (2021-11-08T20:08:45Z) - Deep Transfer Learning for Multi-source Entity Linkage via Domain
Adaptation [63.24594955429465]
Multi-source entity linkage is critical in high-impact applications such as data cleaning and user stitching.
AdaMEL is a deep transfer learning framework that learns generic high-level knowledge to perform multi-source entity linkage.
Our framework achieves state-of-the-art results with 8.21% improvement on average over methods based on supervised learning.
arXiv Detail & Related papers (2021-10-27T15:20:41Z) - Simple multi-dataset detection [83.9604523643406]
We present a simple method for training a unified detector on multiple large-scale datasets.
We show how to automatically integrate dataset-specific outputs into a common semantic taxonomy.
Our approach does not require manual taxonomy reconciliation.
arXiv Detail & Related papers (2021-02-25T18:55:58Z) - Adversarial Knowledge Transfer from Unlabeled Data [62.97253639100014]
We present a novel Adversarial Knowledge Transfer framework for transferring knowledge from internet-scale unlabeled data to improve the performance of a classifier.
An important novel aspect of our method is that the unlabeled source data can be of different classes from those of the labeled target data, and there is no need to define a separate pretext task.
arXiv Detail & Related papers (2020-08-13T08:04:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.