Related papers: Reproducible Science with LaTeX

Related papers

LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination [46.53643691093418]
We introduce MTTrans, a collaborative multi-agent system designed to translate structured-formatted documents.<n>Trans ensures format preservation, structural fidelity, and consistency through six specialized agents.
arXiv Detail & Related papers (2025-08-26T08:17:26Z)
$A^2R^2$: Advancing Img2LaTeX Conversion via Visual Reasoning with Attention-Guided Refinement [53.14935624161711]
Vision-language models (VLMs) have achieved remarkable progress across a range of visual understanding tasks.<n>We propose $A2R2$: Advancing Img2La Conversion via Visual Reasoning with Attention-Guided Refinement.<n>For effective evaluation, we introduce a new dataset, Img2LaTex-Hard-1K, consisting of 1,100 carefully curated and challenging examples.
arXiv Detail & Related papers (2025-07-28T14:41:57Z)
TeXpert: A Multi-Level Benchmark for Evaluating LaTeX Code Generation by LLMs [0.0]
Large Language Models (LLMs) present a promising opportunity for researchers to produce publication-ready material.<n>Our benchmark dataset with natural language prompts for generating code focused on components of scientific documents.<n>Our evaluation across open and closed-source LLMs highlights multiple key findings.
arXiv Detail & Related papers (2025-06-20T13:39:16Z)
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning [57.09163579304332]
We introduce PaperCoder, a framework that transforms machine learning papers into functional code repositories.<n>PaperCoder operates in three stages: planning, designs the system architecture with diagrams, identifies file dependencies, and generates configuration files.<n>We then evaluate PaperCoder on generating code implementations from machine learning papers based on both model-based and human evaluations.
arXiv Detail & Related papers (2025-04-24T01:57:01Z)
NeuRaLaTeX: A machine learning library written in pure LaTeX [15.978130916451295]
We introduce NeuRaLa, which we believe to be the first deep learning library written entirely in rhyme. As part of your document you can specify the architecture of a neural network and its loss functions. When the document is compiled, the compiler will generate or load training data, train the network, run experiments, and generate figures. The paper took 48 hours to compile and the entire source code for NeuRaLa is contained within the source code of the paper.
arXiv Detail & Related papers (2025-03-31T15:05:19Z)
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation [1.7660225024861564]
We present a novel speech-to-La equations system specifically designed for the Greek language. We propose an end-to-end system that harnesses the power of Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) techniques.
arXiv Detail & Related papers (2024-12-11T22:29:44Z)
LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement [11.931911831112357]
LATTE improves the source extraction accuracy of both formulae and tables, outperforming existing techniques as well as GPT-4V. This paper proposes LATTE, the first iterative refinement framework for recognition.
arXiv Detail & Related papers (2024-09-21T17:18:49Z)
TeXBLEU: Automatic Metric for Evaluate LaTeX Format [4.337656290539519]
We propose BLEU, a metric for evaluating mathematical expressions in the format built on the n-gram-based BLEU metric. The proposed BLEU consists of a tokenizer trained on the arXiv paper dataset and a fine-tuned embedding model with positional encoding.
arXiv Detail & Related papers (2024-09-10T16:54:32Z)
Towards Semantic Markup of Mathematical Documents via User Interaction [0.0]
We present an approach to semantic markup of formulas by (semi-)automatically generating grammars from existing s macro definitions and parsing formulas with them. We also present a GUI-based tool for the disambiguation of parse results and showcase its potential using a grammar for parsing untyped $lambda$-terms.
arXiv Detail & Related papers (2024-08-05T12:36:40Z)
Visually Guided Generative Text-Layout Pre-training for Document Intelligence [51.09853181377696]
We propose visually guided generative text-pre-training, named ViTLP. Given a document image, the model optimize hierarchical language and layout modeling objectives to generate the interleaved text and layout sequence. ViTLP can function as a native OCR model to localize and recognize texts of document images.
arXiv Detail & Related papers (2024-03-25T08:00:43Z)
TopoX: A Suite of Python Packages for Machine Learning on Topological Domains [89.9320422266332]
TopoX is a Python software suite that provides reliable and user-friendly building blocks for computing and machine learning on topological domains. TopoX consists of three packages: TopoNetX, TopoEmbedX and TopoModelx.
arXiv Detail & Related papers (2024-02-04T10:41:40Z)
LILO: Learning Interpretable Libraries by Compressing and Documenting Code [71.55208585024198]
We introduce LILO, a neurosymbolic framework that iteratively synthesizes, compresses, and documents code. LILO combines LLM-guided program synthesis with recent algorithmic advances in automated from Stitch. We find that AutoDoc boosts performance by helping LILO's synthesizer to interpret and deploy learned abstractions.
arXiv Detail & Related papers (2023-10-30T17:55:02Z)
DocCoder: Generating Code by Retrieving and Reading Docs [87.88474546826913]
We introduce DocCoder, an approach that explicitly leverages code manuals and documentation. Our approach is general, can be applied to any programming language, and is agnostic to the underlying neural model.
arXiv Detail & Related papers (2022-07-13T06:47:51Z)
Machine Translation of Mathematical Text [0.0]
We have implemented a machine translation system, the PolyMath Translator, for documents containing mathematical text. The current implementation translates English to French, attaining a BLEU score of 53.5 on a held-out test corpus of mathematical sentences. It produces documents that can be compiled to PDF without further editing.
arXiv Detail & Related papers (2020-10-11T11:59:40Z)
N-LTP: An Open-source Neural Language Technology Platform for Chinese [68.58732970171747]
textttN- is an open-source neural language technology platform supporting six fundamental Chinese NLP tasks. textttN- adopts the multi-task framework by using a shared pre-trained model, which has the advantage of capturing the shared knowledge across relevant Chinese tasks.
arXiv Detail & Related papers (2020-09-24T11:45:39Z)
Synthesizing Tasks for Block-based Programming [72.45475843387183]
We propose a novel methodology to automatically generate a set $(rm Tout, rm Cout)$ of new tasks along with solution codes. Our algorithm operates by first mutating code $rm Cin$ to obtain a set of codes $rm Cout$.
arXiv Detail & Related papers (2020-06-17T15:04:37Z)
A Makefile for Developing Containerized LaTeX Technical Documents [0.0]
We propose a Makefile for developing containerized $La$ technical documents. The Makefile allows the author to execute the code that generates variables, tables and figures. We release an open source repository of a template that uses the Makefile and demonstrate its use by developing this paper.
arXiv Detail & Related papers (2020-05-26T12:31:22Z)

This list is automatically generated from the titles and abstracts of the papers in this site.