Why Machine Learning Cannot Ignore Maximum Likelihood Estimation
- URL: http://arxiv.org/abs/2110.12112v1
- Date: Sat, 23 Oct 2021 01:57:40 GMT
- Title: Why Machine Learning Cannot Ignore Maximum Likelihood Estimation
- Authors: Mark J. van der Laan and Sherri Rose
- Abstract summary: The growth of machine learning as a field has been accelerating with increasing interest and publications across fields.
Here, we assert that one essential idea is for machine learning to integrate maximum likelihood for estimation of functional parameters.
- Score: 1.7056768055368383
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The growth of machine learning as a field has been accelerating with
increasing interest and publications across fields, including statistics, but
predominantly in computer science. How can we parse this vast literature for
developments that exemplify the necessary rigor? How many of these manuscripts
incorporate foundational theory to allow for statistical inference? Which
advances have the greatest potential for impact in practice? One could posit
many answers to these queries. Here, we assert that one essential idea is for
machine learning to integrate maximum likelihood for estimation of functional
parameters, such as prediction functions and conditional densities.
Related papers
- Data Science Principles for Interpretable and Explainable AI [0.7581664835990121]
Interpretable and interactive machine learning aims to make complex models more transparent and controllable.
This review synthesizes key principles from the growing literature in this field.
arXiv Detail & Related papers (2024-05-17T05:32:27Z) - On the nonconvexity of some push-forward constraints and its
consequences in machine learning [0.0]
The push-forward operation enables one to redistribute a convex probability measure through a map.
It plays a key role in statistics and: many problems from optimal transport impact to push-forward.
This paper aims to help researchers better understand predictors or algorithmic learning problems.
arXiv Detail & Related papers (2024-03-12T10:06:48Z) - Machine learning and information theory concepts towards an AI
Mathematician [77.63761356203105]
The current state-of-the-art in artificial intelligence is impressive, especially in terms of mastery of language, but not so much in terms of mathematical reasoning.
This essay builds on the idea that current deep learning mostly succeeds at system 1 abilities.
It takes an information-theoretical posture to ask questions about what constitutes an interesting mathematical statement.
arXiv Detail & Related papers (2024-03-07T15:12:06Z) - Rethinking Explainable Machine Learning as Applied Statistics [9.03268085547399]
We argue that explainable machine learning needs to recognize its parallels with applied statistics.
The fact that this is scarcely being discussed in research papers is one of the main drawbacks of the current literature.
arXiv Detail & Related papers (2024-02-05T10:36:48Z) - A Survey on Brain-Inspired Deep Learning via Predictive Coding [85.93245078403875]
Predictive coding (PC) has shown promising performance in machine intelligence tasks.
PC can model information processing in different brain areas, can be used in cognitive control and robotics.
arXiv Detail & Related papers (2023-08-15T16:37:16Z) - Prediction-Powered Inference [68.97619568620709]
Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system.
The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients.
Prediction-powered inference could enable researchers to draw valid and more data-efficient conclusions using machine learning.
arXiv Detail & Related papers (2023-01-23T18:59:28Z) - Enhancing Generalizability of Predictive Models with Synergy of Data and
Physics [0.0]
This paper integrates the data mining with first-principle knowledge to increase generalizability of predictive models.
The proposed process is widely accepted by wind energy predictive maintenance practitioners because of its simplicity and efficiency.
arXiv Detail & Related papers (2021-05-04T11:34:44Z) - Patterns, predictions, and actions: A story about machine learning [59.32629659530159]
This graduate textbook on machine learning tells a story of how patterns in data support predictions and consequential actions.
Self-contained introductions to causality, the practice of causal inference, sequential decision making, and reinforcement learning equip the reader with concepts and tools to reason about actions and their consequences.
arXiv Detail & Related papers (2021-02-10T03:42:03Z) - Machine Learning and Computational Mathematics [8.160343645537106]
We discuss how machine learning has already impacted and will further impact computational mathematics, scientific computing and computational science.
We describe some of the most important progress that has been made on these issues.
Our hope is to put things into a perspective that will help to integrate machine learning with computational mathematics.
arXiv Detail & Related papers (2020-09-23T23:16:46Z) - Deducing neighborhoods of classes from a fitted model [68.8204255655161]
In this article a new kind of interpretable machine learning method is presented.
It can help to understand the partitioning of the feature space into predicted classes in a classification model using quantile shifts.
Basically, real data points (or specific points of interest) are used and the changes of the prediction after slightly raising or decreasing specific features are observed.
arXiv Detail & Related papers (2020-09-11T16:35:53Z) - Value-driven Hindsight Modelling [68.658900923595]
Value estimation is a critical component of the reinforcement learning (RL) paradigm.
Model learning can make use of the rich transition structure present in sequences of observations, but this approach is usually not sensitive to the reward function.
We develop an approach for representation learning in RL that sits in between these two extremes.
This provides tractable prediction targets that are directly relevant for a task, and can thus accelerate learning the value function.
arXiv Detail & Related papers (2020-02-19T18:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.