• Stochastic Modeling and Statistical Properties of Biological Systems Inferred from Omics Data
  • Sala, Claudia <1987>

Subject

  • FIS/07 Fisica applicata (a beni culturali, ambientali, biologia e medicina)

Description

  • In this thesis we aim to describe the dynamic processes that govern the evolution of two very different ecological systems. First, we consider the ensemble of bacteria that populate the intestine (Gut Microbiota, GM), which has been proven to have great impact on human health, being associated to several metabolic and immunological diseases. Then, we deal with the set of protein domains enclosed in the genome of living organisms. In general, the neutrality hypothesis, that was proposed by Hubbell as the Ockham’s razor for ecology, is a respectable approximation for both the GM and the protein domains ecosystems. In the first case, a birth-death model that takes into account demographic noise is able to describe the population dynamics if we relax the neutrality assumption and consider two non-interacting niches in which species equivalence holds. Interestingly, the biodiversity index derived from our modeling predicts healthy aging with better accuracy than common indices. When constructing the empirical Relative Species Abundances distribution (RSA) for GM, a fundamental step regards the clustering of particular DNA sequences (16S rRNA). This is a critical task that enables to redefine the concept of species according to the phylogenetic tree. Here we introduce LOC-kNN, that is a parameter-free clustering algorithm recently developed by d’Errico et al, and we adapt it for this purpose. LOC-kNN detects clusters as density peaks based on the dataset topography and, besides still having difficulties in detecting small clusters, shows promising performances. Finally, for what concerns the protein domains ecosystem, environmental noise should also be taken into account. This has a multiplicative effect and, together with the introduction of the Gompertzian death hypothesis, predicts a Poisson Log-Normal RSA. The model fits well the protein domain RSA and captures the dynamics of genome evolution, manifesting good agreement with the phylogenetic distances among bacteria.

Date

  • 2017-03-22

Type

  • Doctoral Thesis
  • PeerReviewed

Format

  • application/pdf

Identifier

urn:nbn:it:unibo-20772

Sala, Claudia (2017) Stochastic Modeling and Statistical Properties of Biological Systems Inferred from Omics Data, [Dissertation thesis], Alma Mater Studiorum Università di Bologna. Dottorato di ricerca in Fisica , 29 Ciclo. DOI 10.6092/unibo/amsdottorato/7810.

Relations