Synthetic Fungi Datasets: A Time-Aligned Approach
- URL: http://arxiv.org/abs/2501.02855v1
- Date: Mon, 06 Jan 2025 09:00:17 GMT
- Title: Synthetic Fungi Datasets: A Time-Aligned Approach
- Authors: A. Rani, D. O. Arroyo, P. Durdevic,
- Abstract summary: This dataset captures phenomena such as spore size reduction, branching dynamics, and the emergence of complex mycelium networks.<n>The dataset is optimized for deep learning (DL) applications.<n>With applications spanning agriculture, medicine, and industrial mycology, this resource provides a robust foundation for automating fungal analysis.
- Score: 3.996773524410032
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Fungi undergo dynamic morphological transformations throughout their lifecycle, forming intricate networks as they transition from spores to mature mycelium structures. To support the study of these time-dependent processes, we present a synthetic, time-aligned image dataset that models key stages of fungal growth. This dataset systematically captures phenomena such as spore size reduction, branching dynamics, and the emergence of complex mycelium networks. The controlled generation process ensures temporal consistency, scalability, and structural alignment, addressing the limitations of real-world fungal datasets. Optimized for deep learning (DL) applications, this dataset facilitates the development of models for classifying growth stages, predicting fungal development, and analyzing morphological patterns over time. With applications spanning agriculture, medicine, and industrial mycology, this resource provides a robust foundation for automating fungal analysis, enhancing disease monitoring, and advancing fungal biology research through artificial intelligence.
Related papers
- Causal Representation Meets Stochastic Modeling under Generic Geometry [49.24293444627916]
We develop causal representation learning for continuous-time latent point processes.<n>We develop MUTATE, an identifiable variational autoencoder framework with a time-adaptive transition module.<n>Across simulated and empirical studies, we find that MUTATE can effectively answer scientific questions.
arXiv Detail & Related papers (2026-02-04T20:40:53Z) - Disentangling multispecific antibody function with graph neural networks [1.9732999783524041]
Multispecific antibodies offer transformative therapeutic potential by engaging multiples simultaneously.<n>Their efficacy is governed by complex molecular architectures.<n>We present a generative method for creating synthetic functional landscapes that capture non-linear interactions.<n>We demonstrate that this model, trained on synthetic landscapes, recapitulates complex functional properties and, via transfer learning, has the potential to achieve high predictive accuracy.
arXiv Detail & Related papers (2026-01-30T17:36:19Z) - Neural Ordinary Differential Equations for Simulating Metabolic Pathway Dynamics from Time-Series Multiomics Data [0.0]
We introduce Neural Ordinary Differential Equations (NODEs) as a dynamic framework for learning the complex interplay between the proteome and metabolome.<n>Our results show a greater than 90% improvement in root mean squared error over baselines across both Limonene and Isopentenol datasets.
arXiv Detail & Related papers (2025-12-09T15:44:03Z) - Image-based Morphological Characterization of Filamentous Biological Structures with Non-constant Curvature Shape Feature [2.099922236065961]
We propose an image-based method by which it is possible to analyze shape changes over time in tendrils when mechano-stimulated in different portions of their body.<n>We employ a geometric approach using a 3D Piece-Wise Clothoid-based model to reconstruct the configuration taken by a tendril after mechanical rubbing.<n>Our analysis reveals higher responsiveness in the apical segment of tendrils, which might correspond to higher sensitivity and tissue flexibility in that region of the organs.
arXiv Detail & Related papers (2025-11-09T04:54:16Z) - CellPainTR: Generalizable Representation Learning for Cross-Dataset Cell Painting Analysis [51.56484100374058]
We introduce CellPainTR, a Transformer-based architecture designed to learn foundational representations of cellular morphology.<n>Our work represents a significant step towards creating truly foundational models for image-based profiling, enabling more reliable and scalable cross-study biological analysis.
arXiv Detail & Related papers (2025-09-02T03:30:07Z) - Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences [12.6343176904061]
We introduce Lyra, a subquadratic architecture for sequence modeling, grounded in the biological framework of epistasis.
We demonstrate that Lyra is performant across over 100 wide-ranging biological tasks, achieving state-of-the-art (SOTA) performance in many key areas.
arXiv Detail & Related papers (2025-03-20T17:09:18Z) - GENERator: A Long-Context Generative Genomic Foundation Model [66.46537421135996]
We present GENERator, a generative genomic foundation model featuring a context length of 98k base pairs (bp) and 1.2B parameters.
Trained on an expansive dataset comprising 386B bp of DNA, the GENERator demonstrates state-of-the-art performance across both established and newly proposed benchmarks.
It also shows significant promise in sequence optimization, particularly through the prompt-responsive generation of enhancer sequences with specific activity profiles.
arXiv Detail & Related papers (2025-02-11T05:39:49Z) - Omics-driven hybrid dynamic modeling of bioprocesses with uncertainty estimation [0.0]
This work presents an omics-driven modeling pipeline that integrates machine-learning tools.<n> Random forests and permutation feature importance are proposed to mine omics datasets.<n> Continuous and differentiable machine-learning functions can be trained to link the reduced omics feature set to key components of the dynamic model.
arXiv Detail & Related papers (2024-10-24T15:50:35Z) - Exploring Fungal Morphology Simulation and Dynamic Light Containment from a Graphics Generation Perspective [7.750079771971273]
This study equates fungal morphology simulation to a two-dimensional graphic time-series generation problem.
We propose a zero-coding, neural network-driven cellular automaton. Fungal spread patterns are learned through an image segmentation model and a time-series prediction model, which then supervise the training of neural network cells.
arXiv Detail & Related papers (2024-09-08T17:50:24Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - Tertiary Lymphoid Structures Generation through Graph-based Diffusion [54.37503714313661]
In this work, we leverage state-of-the-art graph-based diffusion models to generate biologically meaningful cell-graphs.
We show that the adopted graph diffusion model is able to accurately learn the distribution of cells in terms of their tertiary lymphoid structures (TLS) content.
arXiv Detail & Related papers (2023-10-10T14:37:17Z) - Generating synthetic multi-dimensional molecular-mediator time series
data for artificial intelligence-based disease trajectory forecasting and
drug development digital twins: Considerations [0.0]
The use of synthetic data is recognized as a crucial step in the development of neural network-based Artificial Intelligence (AI) systems.
Insufficiency of statistical and data-centric machine learning means of generating this type of synthetic data is due to a combination of factors.
The generation of synthetic data that accounts for the identified factors of multi-dimensional time series data is an essential capability for the development of mediator-biomarker based AI forecasting systems.
arXiv Detail & Related papers (2023-03-16T03:13:53Z) - T-Phenotype: Discovering Phenotypes of Predictive Temporal Patterns in
Disease Progression [82.85825388788567]
We develop a novel temporal clustering method, T-Phenotype, to discover phenotypes of predictive temporal patterns from labeled time-series data.
We show that T-Phenotype achieves the best phenotype discovery performance over all the evaluated baselines.
arXiv Detail & Related papers (2023-02-24T13:30:35Z) - Semantic Image Segmentation with Deep Learning for Vine Leaf Phenotyping [59.0626764544669]
In this study, we use Deep Learning methods to semantically segment grapevine leaves images in order to develop an automated object detection system for leaf phenotyping.
Our work contributes to plant lifecycle monitoring through which dynamic traits such as growth and development can be captured and quantified.
arXiv Detail & Related papers (2022-10-24T14:37:09Z) - Enhancing Mechanical Metamodels with a Generative Model-Based Augmented
Training Dataset [0.7734726150561089]
Microstructural patterns, which play a major role in defining the mechanical behavior of tissues, are difficult to simulate.
In this work, we investigate the efficacy of machine learning-based generative models as a tool for augmenting limited input pattern datasets.
We have created an open access dataset of Finite Element Analysis simulations based on Cahn-Hilliard patterns.
arXiv Detail & Related papers (2022-03-08T16:15:54Z) - Leveraging the structure of dynamical systems for data-driven modeling [111.45324708884813]
We consider the impact of the training set and its structure on the quality of the long-term prediction.
We show how an informed design of the training set, based on invariants of the system and the structure of the underlying attractor, significantly improves the resulting models.
arXiv Detail & Related papers (2021-12-15T20:09:20Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.