Reproducible Science with LaTeX
- URL: http://arxiv.org/abs/2010.01482v2
- Date: Tue, 6 Oct 2020 01:54:58 GMT
- Title: Reproducible Science with LaTeX
- Authors: Haim Bar and HaiYing Wang
- Abstract summary: This paper proposes a procedure to execute external source codes from a document.
It includes the calculation outputs in the resulting Portable Document Format (pdf) file automatically.
- Score: 4.09920839425892
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper proposes a procedure to execute external source codes from a LaTeX
document and include the calculation outputs in the resulting Portable Document
Format (pdf) file automatically. It integrates programming tools into the LaTeX
writing tool to facilitate the production of reproducible research. In our
proposed approach to a LaTeX-based scientific notebook the user can easily
invoke any programming language or a command-line program when compiling the
LaTeX document, while using their favorite LaTeX editor in the writing process.
The required LaTeX setup, a new Python package, and the defined preamble are
discussed in detail, and working examples using R, Julia, and MatLab to
reproduce existing research are provided to illustrate the proposed procedure.
We also demonstrate how to include system setting information in a paper by
invoking shell scripts when compiling the document.
Related papers
- LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement [11.931911831112357]
LATTE improves the source extraction accuracy of both formulae and tables, outperforming existing techniques as well as GPT-4V.
This paper proposes LATTE, the first iterative refinement framework for recognition.
arXiv Detail & Related papers (2024-09-21T17:18:49Z) - TeXBLEU: Automatic Metric for Evaluate LaTeX Format [4.337656290539519]
We propose BLEU, a metric for evaluating mathematical expressions in the format built on the n-gram-based BLEU metric.
The proposed BLEU consists of a tokenizer trained on the arXiv paper dataset and a fine-tuned embedding model with positional encoding.
arXiv Detail & Related papers (2024-09-10T16:54:32Z) - Towards Semantic Markup of Mathematical Documents via User Interaction [0.0]
We present an approach to semantic markup of formulas by (semi-)automatically generating grammars from existing s macro definitions and parsing formulas with them.
We also present a GUI-based tool for the disambiguation of parse results and showcase its potential using a grammar for parsing untyped $lambda$-terms.
arXiv Detail & Related papers (2024-08-05T12:36:40Z) - Visually Guided Generative Text-Layout Pre-training for Document Intelligence [51.09853181377696]
We propose visually guided generative text-pre-training, named ViTLP.
Given a document image, the model optimize hierarchical language and layout modeling objectives to generate the interleaved text and layout sequence.
ViTLP can function as a native OCR model to localize and recognize texts of document images.
arXiv Detail & Related papers (2024-03-25T08:00:43Z) - TopoX: A Suite of Python Packages for Machine Learning on Topological
Domains [89.9320422266332]
TopoX is a Python software suite that provides reliable and user-friendly building blocks for computing and machine learning on topological domains.
TopoX consists of three packages: TopoNetX, TopoEmbedX and TopoModelx.
arXiv Detail & Related papers (2024-02-04T10:41:40Z) - LILO: Learning Interpretable Libraries by Compressing and Documenting Code [71.55208585024198]
We introduce LILO, a neurosymbolic framework that iteratively synthesizes, compresses, and documents code.
LILO combines LLM-guided program synthesis with recent algorithmic advances in automated from Stitch.
We find that AutoDoc boosts performance by helping LILO's synthesizer to interpret and deploy learned abstractions.
arXiv Detail & Related papers (2023-10-30T17:55:02Z) - DocCoder: Generating Code by Retrieving and Reading Docs [87.88474546826913]
We introduce DocCoder, an approach that explicitly leverages code manuals and documentation.
Our approach is general, can be applied to any programming language, and is agnostic to the underlying neural model.
arXiv Detail & Related papers (2022-07-13T06:47:51Z) - Machine Translation of Mathematical Text [0.0]
We have implemented a machine translation system, the PolyMath Translator, for documents containing mathematical text.
The current implementation translates English to French, attaining a BLEU score of 53.5 on a held-out test corpus of mathematical sentences.
It produces documents that can be compiled to PDF without further editing.
arXiv Detail & Related papers (2020-10-11T11:59:40Z) - N-LTP: An Open-source Neural Language Technology Platform for Chinese [68.58732970171747]
textttN- is an open-source neural language technology platform supporting six fundamental Chinese NLP tasks.
textttN- adopts the multi-task framework by using a shared pre-trained model, which has the advantage of capturing the shared knowledge across relevant Chinese tasks.
arXiv Detail & Related papers (2020-09-24T11:45:39Z) - Synthesizing Tasks for Block-based Programming [72.45475843387183]
We propose a novel methodology to automatically generate a set $(rm Tout, rm Cout)$ of new tasks along with solution codes.
Our algorithm operates by first mutating code $rm Cin$ to obtain a set of codes $rm Cout$.
arXiv Detail & Related papers (2020-06-17T15:04:37Z) - A Makefile for Developing Containerized LaTeX Technical Documents [0.0]
We propose a Makefile for developing containerized $La$ technical documents.
The Makefile allows the author to execute the code that generates variables, tables and figures.
We release an open source repository of a template that uses the Makefile and demonstrate its use by developing this paper.
arXiv Detail & Related papers (2020-05-26T12:31:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.