Detecting and Learning Out-of-Distribution Data in the Open world:
Algorithm and Theory
- URL: http://arxiv.org/abs/2310.06221v1
- Date: Tue, 10 Oct 2023 00:25:21 GMT
- Title: Detecting and Learning Out-of-Distribution Data in the Open world:
Algorithm and Theory
- Authors: Yiyou Sun
- Abstract summary: This thesis makes contributions to the realm of machine learning, specifically in the context of open-world scenarios.
Research investigates two intertwined steps essential for open-world machine learning: Out-of-distribution (OOD) Detection and Open-world Representation Learning (ORL)
- Score: 15.875140867859209
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This thesis makes considerable contributions to the realm of machine
learning, specifically in the context of open-world scenarios where systems
face previously unseen data and contexts. Traditional machine learning models
are usually trained and tested within a fixed and known set of classes, a
condition known as the closed-world setting. While this assumption works in
controlled environments, it falls short in real-world applications where new
classes or categories of data can emerge dynamically and unexpectedly. To
address this, our research investigates two intertwined steps essential for
open-world machine learning: Out-of-distribution (OOD) Detection and Open-world
Representation Learning (ORL). OOD detection focuses on identifying instances
from unknown classes that fall outside the model's training distribution. This
process reduces the risk of making overly confident, erroneous predictions
about unfamiliar inputs. Moving beyond OOD detection, ORL extends the
capabilities of the model to not only detect unknown instances but also learn
from and incorporate knowledge about these new classes. By delving into these
research problems of open-world learning, this thesis contributes both
algorithmic solutions and theoretical foundations, which pave the way for
building machine learning models that are not only performant but also reliable
in the face of the evolving complexities of the real world.
Related papers
- Open-world Machine Learning: A Review and New Outlooks [83.6401132743407]
This paper aims to provide a comprehensive introduction to the emerging open-world machine learning paradigm.
It aims to help researchers build more powerful AI systems in their respective fields, and to promote the development of artificial general intelligence.
arXiv Detail & Related papers (2024-03-04T06:25:26Z) - Machine Learning vs Deep Learning: The Generalization Problem [0.0]
This study investigates the comparative abilities of traditional machine learning (ML) models and deep learning (DL) algorithms in terms of extrapolation.
We present an empirical analysis where both ML and DL models are trained on an exponentially growing function and then tested on values outside the training domain.
Our findings suggest that deep learning models possess inherent capabilities to generalize beyond the training scope.
arXiv Detail & Related papers (2024-03-03T21:42:55Z) - Interpretable Machine Learning for Discovery: Statistical Challenges \&
Opportunities [1.2891210250935146]
We discuss and review the field of interpretable machine learning.
We outline the types of discoveries that can be made using Interpretable Machine Learning.
We focus on the grand challenge of how to validate these discoveries in a data-driven manner.
arXiv Detail & Related papers (2023-08-02T23:57:31Z) - Generalization Properties of Retrieval-based Models [50.35325326050263]
Retrieval-based machine learning methods have enjoyed success on a wide range of problems.
Despite growing literature showcasing the promise of these models, the theoretical underpinning for such models remains underexplored.
We present a formal treatment of retrieval-based models to characterize their generalization ability.
arXiv Detail & Related papers (2022-10-06T00:33:01Z) - Bridging the Gap to Real-World Object-Centric Learning [66.55867830853803]
We show that reconstructing features from models trained in a self-supervised manner is a sufficient training signal for object-centric representations to arise in a fully unsupervised way.
Our approach, DINOSAUR, significantly out-performs existing object-centric learning models on simulated data.
arXiv Detail & Related papers (2022-09-29T15:24:47Z) - Open Environment Machine Learning [84.90891046882213]
Conventional machine learning studies assume close world scenarios where important factors of the learning process hold invariant.
This article briefly introduces some advances in this line of research, focusing on techniques concerning emerging new classes, decremental/incremental features, changing data distributions, varied learning objectives, and discusses some theoretical issues.
arXiv Detail & Related papers (2022-06-01T11:57:56Z) - Bayesian Embeddings for Few-Shot Open World Recognition [60.39866770427436]
We extend embedding-based few-shot learning algorithms to the open-world recognition setting.
We benchmark our framework on open-world extensions of the common MiniImageNet and TieredImageNet few-shot learning datasets.
arXiv Detail & Related papers (2021-07-29T00:38:47Z) - Open-world Machine Learning: Applications, Challenges, and Opportunities [0.7734726150561086]
Open-world machine learning deals with arbitrary inputs (data with unseen classes) to machine learning systems.
Traditional machine learning is static learning which is not appropriate for an active environment.
This paper presents a systematic review of various techniques for open-world machine learning.
arXiv Detail & Related papers (2021-05-27T21:05:10Z) - Knowledge as Invariance -- History and Perspectives of
Knowledge-augmented Machine Learning [69.99522650448213]
Research in machine learning is at a turning point.
Research interests are shifting away from increasing the performance of highly parameterized models to exceedingly specific tasks.
This white paper provides an introduction and discussion of this emerging field in machine learning research.
arXiv Detail & Related papers (2020-12-21T15:07:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.