MechaFormer: Sequence Learning for Kinematic Mechanism Design Automation
- URL: http://arxiv.org/abs/2508.09005v1
- Date: Tue, 12 Aug 2025 15:17:30 GMT
- Title: MechaFormer: Sequence Learning for Kinematic Mechanism Design Automation
- Authors: Diana Bolanos, Mohammadmehdi Ataei, Pradeep Kumar Jayaraman,
- Abstract summary: We introduce MechaFormer, a Transformer-based model that tackles mechanism design as a conditional sequence generation task.<n>Our model learns to translate a target curve into a domain-specific language () string, simultaneously determining the mechanism's topology and geometric parameters.<n>We demonstrate a suite of sampling strategies that can dramatically improve solution quality and offer designers valuable flexibility.
- Score: 3.5169782367209135
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Designing mechanical mechanisms to trace specific paths is a classic yet notoriously difficult engineering problem, characterized by a vast and complex search space of discrete topologies and continuous parameters. We introduce MechaFormer, a Transformer-based model that tackles this challenge by treating mechanism design as a conditional sequence generation task. Our model learns to translate a target curve into a domain-specific language (DSL) string, simultaneously determining the mechanism's topology and geometric parameters in a single, unified process. MechaFormer significantly outperforms existing baselines, achieving state-of-the-art path-matching accuracy and generating a wide diversity of novel and valid designs. We demonstrate a suite of sampling strategies that can dramatically improve solution quality and offer designers valuable flexibility. Furthermore, we show that the high-quality outputs from MechaFormer serve as excellent starting points for traditional optimizers, creating a hybrid approach that finds superior solutions with remarkable efficiency.
Related papers
- MetaEformer: Unveiling and Leveraging Meta-patterns for Complex and Dynamic Systems Load Forecasting [13.947900322863575]
Time series forecasting is a critical and practical problem in many real-world applications.<n>We introduce a novel scheme centered on fundamental waveform, a.k.a., meta-pattern.<n>We demonstrate superior performance across eight benchmarks under three system scenarios.
arXiv Detail & Related papers (2025-06-15T10:27:44Z) - Efficient Design of Compliant Mechanisms Using Multi-Objective Optimization [50.24983453990065]
We address the synthesis of a compliant cross-hinge mechanism capable of large angular strokes.<n>We formulate a multi-objective optimization problem based on kinetostatic performance measures.
arXiv Detail & Related papers (2025-04-23T06:29:10Z) - An Interpretable Automated Mechanism Design Framework with Large Language Models [26.89126917895188]
Mechanism has long been a cornerstone of economic theory, with traditional approaches relying on mathematical derivations.<n>Recent automated approaches, including differentiable economics with neural networks, have emerged for designing payments and allocations.<n>We introduce a novel framework that reformulates mechanism design as a code generation task.
arXiv Detail & Related papers (2025-02-16T12:33:03Z) - STAR: Synthesis of Tailored Architectures [61.080157488857516]
We propose a new approach for the synthesis of tailored architectures (STAR)<n>Our approach combines a novel search space based on the theory of linear input-varying systems, supporting a hierarchical numerical encoding into architecture genomes. STAR genomes are automatically refined and recombined with gradient-free, evolutionary algorithms to optimize for multiple model quality and efficiency metrics.<n>Using STAR, we optimize large populations of new architectures, leveraging diverse computational units and interconnection patterns, improving over highly-optimized Transformers and striped hybrid models on the frontier of quality, parameter size, and inference cache for autoregressive language modeling.
arXiv Detail & Related papers (2024-11-26T18:42:42Z) - Cliqueformer: Model-Based Optimization with Structured Transformers [102.55764949282906]
Large neural networks excel at prediction tasks, but their application to design problems, such as protein engineering or materials discovery, requires solving offline model-based optimization (MBO) problems.<n>We present Cliqueformer, a transformer-based architecture that learns the black-box function's structure through functional graphical models (FGM)<n>Across various domains, including chemical and genetic design tasks, Cliqueformer demonstrates superior performance compared to existing methods.
arXiv Detail & Related papers (2024-10-17T00:35:47Z) - LInK: Learning Joint Representations of Design and Performance Spaces through Contrastive Learning for Mechanism Synthesis [15.793704096341523]
In this paper, we introduce LInK, a novel framework that integrates contrastive learning of performance and design space with optimization techniques.
By leveraging a multimodal and transformation-invariant contrastive learning framework, LInK learns a joint representation that captures complex physics and design representations of mechanisms.
Our results demonstrate that LInK not only advances the field of mechanism design but also broadens the applicability of contrastive learning and optimization to other areas of engineering.
arXiv Detail & Related papers (2024-05-31T03:04:57Z) - Deep Generative Model-based Synthesis of Four-bar Linkage Mechanisms
with Target Conditions [22.164394511786874]
We propose a deep learning-based generative model for generating multiple crank-rocker four-bar linkage mechanisms.
The proposed model is based on a conditional generative adversarial network (cGAN) with modifications for mechanism synthesis.
The results demonstrate that the proposed model successfully generates multiple distinct mechanisms that satisfy specific kinematic and quasi-static requirements.
arXiv Detail & Related papers (2024-02-22T03:31:00Z) - Learning minimal representations of stochastic processes with
variational autoencoders [52.99137594502433]
We introduce an unsupervised machine learning approach to determine the minimal set of parameters required to describe a process.
Our approach enables for the autonomous discovery of unknown parameters describing processes.
arXiv Detail & Related papers (2023-07-21T14:25:06Z) - End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes [52.818579746354665]
This paper proposes the first end-to-end differentiable meta-BO framework that generalises neural processes to learn acquisition functions via transformer architectures.
We enable this end-to-end framework with reinforcement learning (RL) to tackle the lack of labelled acquisition data.
arXiv Detail & Related papers (2023-05-25T10:58:46Z) - Recommender Systems meet Mechanism Design [29.132299904090868]
We consider a multi-item mechanism design problem where the bidders' value distributions can be approximated by a topic model.
We provide an extension of the framework that allows us to exploit the expressive power of topic models to reduce the effective dimensionality of the problem.
arXiv Detail & Related papers (2021-10-25T00:03:30Z) - Machine Learning Framework for Quantum Sampling of Highly-Constrained,
Continuous Optimization Problems [101.18253437732933]
We develop a generic, machine learning-based framework for mapping continuous-space inverse design problems into surrogate unconstrained binary optimization problems.
We showcase the framework's performance on two inverse design problems by optimizing thermal emitter topologies for thermophotovoltaic applications and (ii) diffractive meta-gratings for highly efficient beam steering.
arXiv Detail & Related papers (2021-05-06T02:22:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.