Test-time Computing: from System-1 Thinking to System-2 Thinking
- URL: http://arxiv.org/abs/2501.02497v1
- Date: Sun, 05 Jan 2025 10:24:20 GMT
- Title: Test-time Computing: from System-1 Thinking to System-2 Thinking
- Authors: Yixin Ji, Juntao Li, Hai Ye, Kaixin Wu, Jia Xu, Linjian Mo, Min Zhang,
- Abstract summary: We trace the concept of test-time computing back to System-1 models.
We highlight the key role of test-time computing in the transition from System-1 models to weak System-2 models.
- Score: 28.062866031945585
- License:
- Abstract: The remarkable performance of the o1 model in complex reasoning demonstrates that test-time computing scaling can further unlock the model's potential, enabling powerful System-2 thinking. However, there is still a lack of comprehensive surveys for test-time computing scaling. We trace the concept of test-time computing back to System-1 models. In System-1 models, test-time computing addresses distribution shifts and improves robustness and generalization through parameter updating, input modification, representation editing, and output calibration. In System-2 models, it enhances the model's reasoning ability to solve complex problems through repeated sampling, self-correction, and tree search. We organize this survey according to the trend of System-1 to System-2 thinking, highlighting the key role of test-time computing in the transition from System-1 models to weak System-2 models, and then to strong System-2 models. We also point out a few possible future directions.
Related papers
- Reasoning on a Spectrum: Aligning LLMs to System 1 and System 2 Thinking [0.9709444454602557]
Large Language Models (LLMs) exhibit impressive reasoning abilities, yet their reliance on structured step-by-step reasoning reveals a critical limitation.
This work challenges the assumption that step-by-step reasoning is always optimal and highlights the need for adapting reasoning strategies based on task demands.
arXiv Detail & Related papers (2025-02-18T02:58:37Z) - A General Bayesian Framework for Informative Input Design in System Identification [86.05414211113627]
We tackle the problem of informative input design for system identification.
We select inputs, observe the corresponding outputs from the true system, and optimize the parameters of our model to best fit the data.
Our method outperforms model-free baselines with various linear and nonlinear dynamics.
arXiv Detail & Related papers (2025-01-28T01:57:51Z) - System-1.x: Learning to Balance Fast and Slow Planning with Language Models [68.77277620915143]
Language models can be used to solve long-horizon planning problems in two distinct modes.
A fast 'System-1' mode, directly generating plans without any explicit search or backtracking, and a slow 'System-2' mode, planning step-by-step.
We propose the System-1.x Planner, a controllable planning framework with LLMs.
arXiv Detail & Related papers (2024-07-19T15:40:59Z) - Representing Timed Automata and Timing Anomalies of Cyber-Physical
Production Systems in Knowledge Graphs [51.98400002538092]
This paper aims to improve model-based anomaly detection in CPPS by combining the learned timed automaton with a formal knowledge graph about the system.
Both the model and the detected anomalies are described in the knowledge graph in order to allow operators an easier interpretation of the model and the detected anomalies.
arXiv Detail & Related papers (2023-08-25T15:25:57Z) - From system models to class models: An in-context learning paradigm [0.0]
We introduce a novel paradigm for system identification, addressing two primary tasks: one-step-ahead prediction and multi-step simulation.
We learn a meta model that represents a class of dynamical systems.
For one-step prediction, a GPT-like decoder-only architecture is utilized, whereas the simulation problem employs an encoder-decoder structure.
arXiv Detail & Related papers (2023-08-25T13:50:17Z) - Clarifying System 1 & 2 through the Common Model of Cognition [0.0]
We use the Common Model of Cognition to ground System-1 and System-2.
We aim to clarify their underlying mechanisms, persisting misconceptions, and implications for metacognition.
arXiv Detail & Related papers (2023-05-18T02:25:03Z) - AAAI 2022 Fall Symposium: System-1 and System-2 realized within the
Common Model of Cognition [0.0]
We situating System-1 and System-2 within the Common Model of Cognition.
Results show that what are thought to be distinctive characteristics of System-1 and 2 instead form a spectrum of cognitive properties.
arXiv Detail & Related papers (2023-05-16T01:28:06Z) - Probabilistic machine learning based predictive and interpretable
digital twin for dynamical systems [0.0]
Two approaches for updating the digital twin are proposed.
In both cases, the resulting expressions of updated digital twins are identical.
The proposed approaches provide an exact and explainable description of the perturbations in digital twin models.
arXiv Detail & Related papers (2022-12-19T04:25:59Z) - Improving Coherence and Consistency in Neural Sequence Models with
Dual-System, Neuro-Symbolic Reasoning [49.6928533575956]
We use neural inference to mediate between the neural System 1 and the logical System 2.
Results in robust story generation and grounded instruction-following show that this approach can increase the coherence and accuracy of neurally-based generations.
arXiv Detail & Related papers (2021-07-06T17:59:49Z) - A Novel Anomaly Detection Algorithm for Hybrid Production Systems based
on Deep Learning and Timed Automata [73.38551379469533]
DAD:DeepAnomalyDetection is a new approach for automatic model learning and anomaly detection in hybrid production systems.
It combines deep learning and timed automata for creating behavioral model from observations.
The algorithm has been applied to few data sets including two from real systems and has shown promising results.
arXiv Detail & Related papers (2020-10-29T08:27:43Z) - Superiority of Simplicity: A Lightweight Model for Network Device
Workload Prediction [58.98112070128482]
We propose a lightweight solution for series prediction based on historic observations.
It consists of a heterogeneous ensemble method composed of two models - a neural network and a mean predictor.
It achieves an overall $R2$ score of 0.10 on the available FedCSIS 2020 challenge dataset.
arXiv Detail & Related papers (2020-07-07T15:44:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.