Deep-learning-based pan-phenomic data reveals the explosive evolution of avian visual disparity
- URL: http://arxiv.org/abs/2602.03824v1
- Date: Tue, 03 Feb 2026 18:32:15 GMT
- Title: Deep-learning-based pan-phenomic data reveals the explosive evolution of avian visual disparity
- Authors: Jiao Sun,
- Abstract summary: This study uses deep learning techniques to explore avian morphological evolution.<n>We show that the high-dimensional embedding space encodes phenotypic convergence.<n>We also demonstrate that hierarchical semantic structures emerged in the high-dimensional embedding space despite being trained on flat labels.
- Score: 8.871465190908003
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The evolution of biological morphology is critical for understanding the diversity of the natural world, yet traditional analyses often involve subjective biases in the selection and coding of morphological traits. This study employs deep learning techniques, utilising a ResNet34 model capable of recognising over 10,000 bird species, to explore avian morphological evolution. We extract weights from the model's final fully connected (fc) layer and investigate the semantic alignment between the high-dimensional embedding space learned by the model and biological phenotypes. The results demonstrate that the high-dimensional embedding space encodes phenotypic convergence. Subsequently, we assess the morphological disparity among various taxa and evaluate the association between morphological disparity and species richness, demonstrating that species richness is the primary driver of morphospace expansion. Moreover, the disparity-through-time analysis reveals a visual "early burst" after the K-Pg extinction. While mainly aimed at evolutionary analysis, this study also provides insights into the interpretability of Deep Neural Networks. We demonstrate that hierarchical semantic structures (biological taxonomy) emerged in the high-dimensional embedding space despite being trained on flat labels. Furthermore, through adversarial examples, we provide evidence that our model in this task can overcome texture bias and learn holistic shape representations (body plans), challenging the prevailing view that CNNs rely primarily on local textures.
Related papers
- On the Approximation of Phylogenetic Distance Functions by Artificial Neural Networks [0.0]
In this work we describe minimal neural network architectures that can approximate classic phylogenetic distance functions.<n>The learned distance functions generalize well and, given an appropriate training dataset, achieve results comparable to state-of-the art inference methods.
arXiv Detail & Related papers (2025-12-01T21:42:01Z) - BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning [60.80381372245902]
We find emergent behaviors in biological vision models via large-scale contrastive vision-language training.<n>We train BioCLIP 2 on TreeOfLife-200M to distinguish different species.<n>We identify emergent properties in the learned embedding space of BioCLIP 2.
arXiv Detail & Related papers (2025-05-29T17:48:20Z) - G2PDiffusion: Cross-Species Genotype-to-Phenotype Prediction via Evolutionary Diffusion [108.94237816552024]
We propose the first genotype-to-phenotype diffusion model (G2PDiffusion) that generates morphological images from DNA.<n>The model contains three novel components: 1) a MSA retrieval engine that identifies conserved and co-evolutionary patterns; 2) an environment-aware MSA conditional encoder that effectively models complex genotype-environment interactions; and 3) an adaptive phenomic alignment module to improve genotype-phenotype consistency.
arXiv Detail & Related papers (2025-02-07T06:16:31Z) - Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution [19.899467048643363]
We introduce Phylo-Diffusion, a framework for conditioning diffusion models with phylogenetic knowledge represented by HIERarchical Embeddings (HIER-Embeds)
We also propose two new experiments for perturbing the embedding space of Phylo-Diffusion: trait masking and trait swapping.
Our work also opens a new chapter of research in evolutionary biology by using generative models to visualize evolutionary changes directly from images.
arXiv Detail & Related papers (2024-07-31T21:06:14Z) - Learning biologically relevant features in a pathology foundation model using sparse autoencoders [2.5919097694815365]
We trained a Sparse Autoencoder on the embeddings of a pathology pretrained foundation model.<n>We found that Sparse Autoencoder features represent interpretable and monosemantic biological concepts.
arXiv Detail & Related papers (2024-07-15T15:03:01Z) - Unveiling the Unseen: Identifiable Clusters in Trained Depthwise
Convolutional Kernels [56.69755544814834]
Recent advances in depthwise-separable convolutional neural networks (DS-CNNs) have led to novel architectures.
This paper reveals another striking property of DS-CNN architectures: discernible and explainable patterns emerge in their trained depthwise convolutional kernels in all layers.
arXiv Detail & Related papers (2024-01-25T19:05:53Z) - Neural Echos: Depthwise Convolutional Filters Replicate Biological
Receptive Fields [56.69755544814834]
We present evidence suggesting that depthwise convolutional kernels are effectively replicating the biological receptive fields observed in the mammalian retina.
We propose a scheme that draws inspiration from the biological receptive fields.
arXiv Detail & Related papers (2024-01-18T18:06:22Z) - A Neural Collapse Perspective on Feature Evolution in Graph Neural
Networks [44.31777384413466]
Graph neural networks (GNNs) have become increasingly popular for classification tasks on graph-structured data.
In this paper, we focus on node-wise classification and explore the feature evolution through the lens of the "Neural Collapse" phenomenon.
We show that even an "optimistic" mathematical model requires that the graphs obey a strict structural condition in order to possess a minimizer with exact collapse.
arXiv Detail & Related papers (2023-07-04T23:03:21Z) - Revisiting the Shape-Bias of Deep Learning for Dermoscopic Skin Lesion
Classification [4.414962444402826]
It is generally believed that the human visual system is biased towards the recognition of shapes rather than textures.
In this paper, we revisit the significance of shape-biases for the classification of skin lesion images.
arXiv Detail & Related papers (2022-06-13T20:59:06Z) - Data-driven emergence of convolutional structure in neural networks [83.4920717252233]
We show how fully-connected neural networks solving a discrimination task can learn a convolutional structure directly from their inputs.
By carefully designing data models, we show that the emergence of this pattern is triggered by the non-Gaussian, higher-order local structure of the inputs.
arXiv Detail & Related papers (2022-02-01T17:11:13Z) - Self-Supervised Graph Representation Learning for Neuronal Morphologies [75.38832711445421]
We present GraphDINO, a data-driven approach to learn low-dimensional representations of 3D neuronal morphologies from unlabeled datasets.
We show, in two different species and across multiple brain areas, that this method yields morphological cell type clusterings on par with manual feature-based classification by experts.
Our method could potentially enable data-driven discovery of novel morphological features and cell types in large-scale datasets.
arXiv Detail & Related papers (2021-12-23T12:17:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.