A Makefile for Developing Containerized LaTeX Technical Documents
- URL: http://arxiv.org/abs/2005.12660v9
- Date: Mon, 13 Nov 2023 18:19:40 GMT
- Title: A Makefile for Developing Containerized LaTeX Technical Documents
- Authors: Paschalis Bizopoulos
- Abstract summary: We propose a Makefile for developing containerized $La$ technical documents.
The Makefile allows the author to execute the code that generates variables, tables and figures.
We release an open source repository of a template that uses the Makefile and demonstrate its use by developing this paper.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a Makefile for developing containerized $\LaTeX$ technical
documents. The Makefile allows the author to execute the code that generates
variables, tables and figures (results), which are then used during the
$\LaTeX$ compilation, to produce either the draft (fast) or full (slow) version
of the document. We also present various utilities that aid in automating the
results generation and improve the reproducibility of the document. We release
an open source repository of a template that uses the Makefile and demonstrate
its use by developing this paper.
Related papers
- Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion [20.44433450426808]
Docling is an easy-to-use, self-contained, MIT-licensed, open-source toolkit for document conversion.
It can parse several types of popular document formats into a unified, richly structured representation.
Docling is released as a Python package and can be used as a Python API or as a CLI tool.
arXiv Detail & Related papers (2025-01-27T19:40:00Z) - BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks [55.61185100263898]
We introduce BigDocs-7.5M, a high-quality, open-access dataset comprising 7.5 million multimodal documents across 30 tasks.
We also introduce BigDocs-Bench, a benchmark suite with 10 novel tasks.
Our experiments show that training with BigDocs-Bench improves average performance up to 25.8% over closed-source GPT-4o.
arXiv Detail & Related papers (2024-12-05T21:41:20Z) - LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement [11.931911831112357]
The source and rendered PDF images look drastically different, especially for formulae and tables.
Prior work generates sources in a single iteration and struggles with complex formulae.
This paper proposes LATTE, the first iterative refinement framework for recognition.
arXiv Detail & Related papers (2024-09-21T17:18:49Z) - Towards Semantic Markup of Mathematical Documents via User Interaction [0.0]
We present an approach to semantic markup of formulas by (semi-)automatically generating grammars from existing s macro definitions and parsing formulas with them.
We also present a GUI-based tool for the disambiguation of parse results and showcase its potential using a grammar for parsing untyped $lambda$-terms.
arXiv Detail & Related papers (2024-08-05T12:36:40Z) - RepoAgent: An LLM-Powered Open-Source Framework for Repository-level
Code Documentation Generation [79.83270415843857]
We introduce RepoAgent, a large language model powered open-source framework aimed at proactively generating, maintaining, and updating code documentation.
We have validated the effectiveness of our approach, showing that RepoAgent excels in generating high-quality repository-level documentation.
arXiv Detail & Related papers (2024-02-26T15:39:52Z) - DocXChain: A Powerful Open-Source Toolchain for Document Parsing and
Beyond [17.853066545805554]
DocXChain is a powerful open-source toolchain for document parsing.
It automatically converts the rich information embodied in unstructured documents, such as text, tables and charts, into structured representations.
arXiv Detail & Related papers (2023-10-19T02:49:09Z) - Augraphy: A Data Augmentation Library for Document Images [59.457999432618614]
Augraphy is a Python library for constructing data augmentation pipelines.
It provides strategies to produce augmented versions of clean document images that appear to have been altered by standard office operations.
arXiv Detail & Related papers (2022-08-30T22:36:19Z) - DocCoder: Generating Code by Retrieving and Reading Docs [87.88474546826913]
We introduce DocCoder, an approach that explicitly leverages code manuals and documentation.
Our approach is general, can be applied to any programming language, and is agnostic to the underlying neural model.
arXiv Detail & Related papers (2022-07-13T06:47:51Z) - Named Tensor Notation [117.30373263410507]
We propose a notation for tensors with named axes.
It relieves the author, reader, and future implementers from the burden of keeping track of the order of axes.
It also makes it easy to extend operations on low-order tensors to higher order ones.
arXiv Detail & Related papers (2021-02-25T22:21:30Z) - Reproducible Science with LaTeX [4.09920839425892]
This paper proposes a procedure to execute external source codes from a document.
It includes the calculation outputs in the resulting Portable Document Format (pdf) file automatically.
arXiv Detail & Related papers (2020-10-04T04:04:07Z) - DocBank: A Benchmark Dataset for Document Layout Analysis [114.81155155508083]
We present textbfDocBank, a benchmark dataset that contains 500K document pages with fine-grained token-level annotations for document layout analysis.
Experiment results show that models trained on DocBank accurately recognize the layout information for a variety of documents.
arXiv Detail & Related papers (2020-06-01T16:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.