Date: Friday, January 20, 2023
Time: 3:30 – 4:30 PM (coffee starting at 3:00 p.m.)
Room: Herzberg building, room 4351 (MacPhail Room)
Title: Cluster analysis of microbiome data via mixtures of Dirichlet-multinomial regression models
Speaker:  Sanjeena Dang, Carleton University

Abstract:  The human gut microbiome is an essential component of our physiology and exploring the relationship between biological/environmental covariates and the resulting taxonomic composition of a given microbial community is an active area of research. Previously, a Dirichlet-multinomial regression framework has been suggested to model this relationship, but it did not account for any underlying latent group structure. An underlying group structure of guts (such as enterotypes) has been observed across gut microbiome samples in which guts in the same group share similar biota compositions. In this talk, a finite mixture of Dirichlet-multinomial regression models will be presented that accounts for this underlying group structure and to allow for a probabilistic investigation of the relationship between bacterial abundance and biological/environmental covariates within each inferred group.

Furthermore, finite mixtures of regression models, which incorporate the concomitant effect of the covariates on the resulting mixing proportions are also proposed and examined within the Dirichlet-multinomial framework.

We utilize the proposed mixture model to gain insight on underlying subgroups in a microbiome dataset comprising of tumor and healthy samples and the relationships between covariates and microbial abundance in those subgroups. The talk will conclude with some current and future research directions involving microbiome data.

For more information please contact Sanjeena Dang – sanjeena.dang@carleton.ca