>

Microba Research Discovery Report

Analysis: Aim 1 Longitudinal Treatment Effect

Taxonomic Profiles (Family)

The charts below show the taxonomic composition of the analysed samples using different quantitative visualization techniques. Only the top most abundant families are shown.

Areachart

Click here to open full-sized image in new window.

Barchart clustered within each group

Click here to open full-sized image in new window.

All features barchart clustered within each group

Click here to open full-sized image in new window.

Scaled heatmap of most abundant features ordered by study group

Features were filtered by mean abundance. Samples were first ordered by study group and then clustered within each study group. Abundances were scaled to a max value of 1. Color scale shows not-detected (white), and abundance ranging from low (blue) to high (yellow).

Click here to open full-sized image in new window.

Heatmap of most abundant features ordered by study group

Features were filtered by mean abundance. Samples were first ordered by study group and then clustered within each study group. Color scale shows not-detected (white), and abundance ranging from low (blue) to high (yellow).

Click here to open full-sized image in new window.

Scaled heatmap of all features

Profiles were clustered by hierarchical clustering. Abundances were scaled to a max value of 1. Color scale shows not-detected (white), and abundance ranging from low (blue) to high (yellow).

Click here to open full-sized image in new window.

Heatmap of all features

Profiles were clustered by hierarchical clustering. Color scale shows not-detected (white), and abundance ranging from low (blue) to high (yellow).

Click here to open full-sized image in new window.

Hierarchically clustered barchart

Profiles were clustered by hierarchical clustering.

Click here to open full-sized image in new window.

Interactive Barchart

Click here to open interactive barchart in new window.

Microbial alpha diversity (family)

This page provides an overview of the microbial alpha diversity of the analysed samples. Alpha diversity is measured by the Shannon index and species richnes. Richness simply quantifies the total number of families present in each sample. Shannon index additionally accounts for relative abundance and evenness of the families present and quantifies the entropy of microbial communties. Barcharts and boxplots present the mean diversity in each study group.

Methods:

Shannon diversity was compared using linear mixed effect regression, including Participant as a random effect (intercept), and Treatment as a fixed effect. Effect tested: Treatment. Richness was compared using linear mixed effect regression, including Participant as a random effect (intercept), and Treatment as a fixed effect. Effect tested: Treatment. Data was rarefied to 702268 reads.

Index: Richness

Index: Shannon index

Summary Table

Index rarefiedTo P lmer condition effect Mean Pos Mean Abundance Median Abundance Mean Treatmentpost_treatment Median Treatmentpost_treatment SD Treatmentpost_treatment Mean Treatmentpre_treatment Median Treatmentpre_treatment SD Treatmentpre_treatment Fold Change Log2(Treatmentpre_treatment/Treatmentpost_treatment) Positive samples Positive Treatmentpost_treatment Positive Treatmentpre_treatment Positive_Treatmentpost_treatment_percent Positive_Treatmentpre_treatment_percent
Shannon 702268 0.35 1.4 1.4 1.4 1.4 1.4 0.29 1.4 1.4 0.33 0 38 / 38 (100%) 19 / 19 (100%) 19 / 19 (100%) 1 1
Richness 702268 0.059 14 14 13 14 14 4.7 15 12 6 0.1 38 / 38 (100%) 19 / 19 (100%) 19 / 19 (100%) 1 1
Download diversity values in csv format . On some platforms (including Windows) you may need to change the suffix from .txt or .html to .csv before opening the file in a spreadsheet program, like Excel.

Clustering and ordination (family)

Taxonomic profiles were analyzed using supervised and unsupervised multivariate methods. Profiles were ordinated using the unsupervised methods Principal Coordinates Analysis (PCoA), Non-Metric Multidimensional Scaling (NMDS) and Principal Component Analysis (PCA). PCoA and NMDS are related to PCA, but take dissimlarity matrices as input. PCoA and NMDS both attempt to represent the pairwise dissimlarities between samples in low dimensional space as close as possible. NMDS is a rank-based approach and therefore less effected by outliers.

The supervised methods Adonis and Redundancy analysis (RDA) were used to assess if variance in microbial community composition can be attributed to the study condition. NMDS, PCoA and Adonis were run on Bray-Curtis dissimilarities. A short introduction of the used methods can be found at the GUide to STatistical Analysis in Microbial Ecology (GUSTA ME). Sparse Partial Least Square Discriminant Analysis (sPLS-DA) from the MixMc package was additionaly used to extract features associated with the study condition.

Unsupervised ordination




Interactive PCA (clr transformed)

Please click here to view an interactive 3D PCA.

Univariate analysis of family abundance

Differentially abundant families were identified by ANOVA or LMER (linear mixed effect regression) of clr transformed relative abudances, Fisher's exact test and/or ALDEx2 (on families read counts). Fisher's exact test is used to test for differences in the detection rate, i.e. number of samples in which each family has been detected.

LMER is used for repeated measures data, using random effects to control for correlation between samples from the same subject. Fixed effects are included for treatment groups, time, and treatment over time, where appropriate. The LMER P values correspond to a nested model test of the significance of including the corresponding fixed effect.

ALDEx2 uses subsampling (Bayesian sampling) to estimate the underlying technical variation. For each subsample instance, center log-ratio transformed data is statistically compared across study groups and computed P values are corrected for multiple testing using the Benjamini–Hochberg procedure. The expected P value (mean P value) is reported, which are those that would likely have been observed if the same samples had been run multiple times. The expected values are reported for both the distribution of P values and for the distribution of Benjamini–Hochberg corrected values.

Taxon GTDB taxonomy P lmer condition effect (sqrt) FDR lmer condition effect (sqrt) Pbonf lmer condition effect (sqrt) P lmer condition effect (clr) FDR lmer condition effect (clr) Pbonf lmer condition effect (clr) Mean Pos Mean Abundance Median Abundance Mean Treatmentpost_treatment Median Treatmentpost_treatment SD Treatmentpost_treatment Mean Treatmentpre_treatment Median Treatmentpre_treatment SD Treatmentpre_treatment Fold Change Log2(Treatmentpre_treatment/Treatmentpost_treatment) Positive samples Positive Treatmentpost_treatment Positive Treatmentpre_treatment Positive_Treatmentpost_treatment_percent Positive_Treatmentpre_treatment_percent
Acidaminococcaceae Acidaminococcaceae 0.015 0.23 0.63 0.16 0.51 1 0.76 0.2 0 0.15 0 0.49 0.25 0 0.61 0.74 10 / 38 (26%) 4 / 19 (21%) 6 / 19 (32%) 0.211 0.316
Acutalibacteraceae Acutalibacteraceae 0.16 0.44 1 0.19 0.56 1 5 4.5 3.7 4 3 3.7 4.9 4.7 3.6 0.29 34 / 38 (89%) 16 / 19 (84%) 18 / 19 (95%) 0.842 0.947
Akkermansiaceae Akkermansiaceae 0.33 0.5 1 0.85 0.98 1 1.4 0.37 0 0.24 0 0.59 0.5 0 0.97 1.1 10 / 38 (26%) 5 / 19 (26%) 5 / 19 (26%) 0.263 0.263
Anaerovoracaceae Anaerovoracaceae 0.6 0.7 1 0.59 0.77 1 0.54 0.086 0 0.099 0 0.28 0.072 0 0.21 -0.46 6 / 38 (16%) 3 / 19 (16%) 3 / 19 (16%) 0.158 0.158
Bacteroidaceae Bacteroidaceae 0.15 0.44 1 0.51 0.77 1 8.1 4.9 1.6 6.9 1.7 9.9 2.9 1.4 3.8 -1.3 23 / 38 (61%) 12 / 19 (63%) 11 / 19 (58%) 0.632 0.579
Barnesiellaceae Barnesiellaceae 0.25 0.49 1 0.47 0.77 1 0.31 0.025 0 0.0066 0 0.029 0.043 0 0.14 2.7 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
Bifidobacteriaceae Bifidobacteriaceae 0.33 0.5 1 0.059 0.35 1 9.4 8.2 4.6 7.4 4.6 7.6 8.9 4.7 15 0.27 33 / 38 (87%) 17 / 19 (89%) 16 / 19 (84%) 0.895 0.842
Burkholderiaceae Burkholderiaceae 0.3 0.49 1 0.31 0.71 1 0.28 0.052 0 0.08 0 0.2 0.024 0 0.063 -1.7 7 / 38 (18%) 4 / 19 (21%) 3 / 19 (16%) 0.211 0.158
Butyricicoccaceae Butyricicoccaceae 0.025 0.23 0.99 0.015 0.31 0.6 0.68 0.18 0 0.29 0 0.4 0.072 0 0.22 -2 10 / 38 (26%) 8 / 19 (42%) 2 / 19 (11%) 0.421 0.105
CAG-1000 CAG-1000 0.19 0.47 1 0.43 0.77 1 4.9 0.38 0 0.14 0 0.62 0.63 0 1.9 2.2 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
CAG-138 CAG-138 0.061 0.29 1 0.069 0.36 1 2.5 0.32 0 0.051 0 0.22 0.59 0 1.5 3.5 5 / 38 (13%) 1 / 19 (5.3%) 4 / 19 (21%) 0.0526 0.211
CAG-274 CAG-274 0.057 0.29 1 0.051 0.35 1 0.5 0.11 0 0.16 0 0.35 0.048 0 0.14 -1.7 8 / 38 (21%) 5 / 19 (26%) 3 / 19 (16%) 0.263 0.158
CAG-302 CAG-302 0.033 0.23 1 0.086 0.4 1 0.74 0.18 0 0.097 0 0.24 0.25 0 0.51 1.4 9 / 38 (24%) 3 / 19 (16%) 6 / 19 (32%) 0.158 0.316
CAG-313 CAG-313 0.17 0.44 1 0.51 0.77 1 0.46 0.037 0 0.023 0 0.1 0.05 0 0.18 1.1 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
CAG-314 CAG-314 0.22 0.49 1 0.43 0.77 1 0.32 0.025 0 0.0075 0 0.033 0.043 0 0.13 2.5 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
CAG-508 CAG-508 0.79 0.87 1 0.74 0.91 1 0.73 0.25 0 0.2 0 0.38 0.3 0 0.58 0.58 13 / 38 (34%) 7 / 19 (37%) 6 / 19 (32%) 0.368 0.316
CAG-611 CAG-611 0.29 0.49 1 0.56 0.77 1 1.2 0.16 0 0.12 0 0.37 0.2 0 0.62 0.74 5 / 38 (13%) 2 / 19 (11%) 3 / 19 (16%) 0.105 0.158
CAG-74 CAG-74 0.12 0.44 1 0.24 0.58 1 1 0.21 0 0.17 0 0.52 0.25 0 0.62 0.56 8 / 38 (21%) 3 / 19 (16%) 5 / 19 (26%) 0.158 0.263
CAG-917 CAG-917 0.25 0.49 1 0.39 0.77 1 1.3 0.1 0 0.017 0 0.074 0.19 0 0.67 3.5 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
Clostridiaceae Clostridiaceae 0.48 0.63 1 0.98 1 1 0.61 0.16 0 0.082 0 0.16 0.24 0 0.71 1.5 10 / 38 (26%) 5 / 19 (26%) 5 / 19 (26%) 0.263 0.263
Coriobacteriaceae Coriobacteriaceae 0.14 0.44 1 0.21 0.56 1 1.9 1.3 0.65 0.91 0.44 1 1.6 0.68 2.1 0.81 26 / 38 (68%) 12 / 19 (63%) 14 / 19 (74%) 0.632 0.737
Dialisteraceae Dialisteraceae 0.47 0.63 1 1 1 1 1.6 0.69 0 0.66 0 1.1 0.71 0 1 0.11 16 / 38 (42%) 8 / 19 (42%) 8 / 19 (42%) 0.421 0.421
Eggerthellaceae Eggerthellaceae 0.92 0.92 1 0.95 1 1 0.47 0.17 0 0.17 0 0.27 0.17 0 0.24 0 14 / 38 (37%) 7 / 19 (37%) 7 / 19 (37%) 0.368 0.368
Enterobacteriaceae Enterobacteriaceae 0.27 0.49 1 0.51 0.77 1 2.2 0.17 0 0.14 0 0.6 0.21 0 0.7 0.58 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
Erysipelatoclostridiaceae Erysipelatoclostridiaceae 0.14 0.44 1 0.049 0.35 1 0.97 0.82 0.51 0.73 0.43 0.91 0.91 0.59 1.1 0.32 32 / 38 (84%) 14 / 19 (74%) 18 / 19 (95%) 0.737 0.947
Erysipelotrichaceae Erysipelotrichaceae 0.41 0.58 1 0.56 0.77 1 1.8 0.47 0 0.44 0 1 0.5 0 1.1 0.18 10 / 38 (26%) 5 / 19 (26%) 5 / 19 (26%) 0.263 0.263
Lachnospiraceae Lachnospiraceae 0.77 0.87 1 0.15 0.51 1 41 41 43 42 43 12 41 43 11 -0.035 38 / 38 (100%) 19 / 19 (100%) 19 / 19 (100%) 1 1
Lactobacillaceae Lactobacillaceae 0.015 0.23 0.62 0.0036 0.15 0.15 1.2 0.27 0 0.52 0 1.2 0.03 0 0.1 -4.1 9 / 38 (24%) 7 / 19 (37%) 2 / 19 (11%) 0.368 0.105
Megasphaeraceae Megasphaeraceae 0.53 0.66 1 0.4 0.77 1 1.1 0.087 0 0.069 0 0.25 0.11 0 0.46 0.67 3 / 38 (7.9%) 2 / 19 (11%) 1 / 19 (5.3%) 0.105 0.0526
Methanobacteriaceae Methanobacteriaceae 0.52 0.66 1 0.96 1 1 0.91 0.095 0 0.051 0 0.15 0.14 0 0.42 1.5 4 / 38 (11%) 2 / 19 (11%) 2 / 19 (11%) 0.105 0.105
Monoglobaceae Monoglobaceae 0.062 0.29 1 0.1 0.41 1 0.2 0.038 0 0.0091 0 0.027 0.066 0 0.13 2.9 7 / 38 (18%) 2 / 19 (11%) 5 / 19 (26%) 0.105 0.263
Muribaculaceae Muribaculaceae 0.81 0.87 1 0.87 0.98 1 0.8 0.085 0 0.098 0 0.37 0.071 0 0.21 -0.46 4 / 38 (11%) 2 / 19 (11%) 2 / 19 (11%) 0.105 0.105
Oscillospiraceae Oscillospiraceae 0.022 0.23 0.88 0.11 0.41 1 2.1 1.4 0.39 0.96 0.27 1.5 1.9 0.57 2.7 0.98 25 / 38 (66%) 12 / 19 (63%) 13 / 19 (68%) 0.632 0.684
Peptostreptococcaceae Peptostreptococcaceae 0.9 0.92 1 0.86 0.98 1 0.69 0.29 0 0.26 0 0.4 0.32 0 0.55 0.3 16 / 38 (42%) 8 / 19 (42%) 8 / 19 (42%) 0.421 0.421
QAMH01 QAMH01 0.27 0.49 1 0.55 0.77 1 0.5 0.039 0 0.022 0 0.094 0.057 0 0.17 1.4 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
Rikenellaceae Rikenellaceae 0.28 0.49 1 0.21 0.56 1 1.7 0.54 0 0.53 0 1.6 0.55 0 1.1 0.053 12 / 38 (32%) 5 / 19 (26%) 7 / 19 (37%) 0.263 0.368
Ruminococcaceae Ruminococcaceae 0.15 0.44 1 0.037 0.35 1 10 10 10 11 12 4.1 9.5 8.9 4.4 -0.21 38 / 38 (100%) 19 / 19 (100%) 19 / 19 (100%) 1 1
Selenomonadaceae Selenomonadaceae 0.83 0.87 1 0.91 1 1 0.46 0.049 0 0.063 0 0.19 0.035 0 0.13 -0.85 4 / 38 (11%) 2 / 19 (11%) 2 / 19 (11%) 0.105 0.105
Streptococcaceae Streptococcaceae 0.58 0.7 1 0.32 0.71 1 1.1 0.52 0 0.52 0.31 0.71 0.51 0 1.1 -0.028 18 / 38 (47%) 10 / 19 (53%) 8 / 19 (42%) 0.526 0.421
Tannerellaceae Tannerellaceae 0.28 0.49 1 0.47 0.77 1 0.62 0.11 0 0.096 0 0.36 0.13 0 0.33 0.44 7 / 38 (18%) 3 / 19 (16%) 4 / 19 (21%) 0.158 0.211
Turicibacteraceae Turicibacteraceae 0.38 0.55 1 0.65 0.83 1 0.22 0.017 0 0.0079 0 0.034 0.027 0 0.096 1.8 3 / 38 (7.9%) 1 / 19 (5.3%) 2 / 19 (11%) 0.0526 0.105
UBA1381 UBA1381 0.033 0.23 1 0.041 0.35 1 0.79 0.37 0 0.47 0.37 0.7 0.28 0 0.5 -0.75 18 / 38 (47%) 11 / 19 (58%) 7 / 19 (37%) 0.579 0.368

Click here to open full-sized image in new window.

Click here to open full-sized image in new window.

Methods:

P lmer condition effect (sqrt): Differentially abundant species were identified by linear mixed effect regression without confounders. Random effects: Participant (intercept). Fixed effects: Treatment. Effect tested: Treatment.

P lmer condition effect (clr): Differentially abundant species were identified by linear mixed effect regression without confounders. Random effects: Participant (intercept). Fixed effects: Treatment. Effect tested: Treatment.

Glossary:

clr: Centered log-ratio transformation. P Welch's ANOVA (sqrt): Welch's ANOVA p-values (Anova was run on sqrt transformed abundances). It is an alternative to the classic ANOVA and can be used even if the samples have unequal variances and/or unequal sample sizes. P Welch's t-test (sqrt): Welch's T-test p-values (run on sqrt transformed abundances). It is an alternative to the classic Student's t-test and is more reliable if the data violates the assumption of homogeneity of variances and/or if the study conditions have unequal sample sizes. P Welch's ANOVA (clr): Welch's ANOVA p-values (Anova was run on clr transformed abundances). It is an alternative to the classic ANOVA and can be used even if the samples have unequal variances and/or unequal sample sizes. P Welch's t-test (clr): Welch's T-test p-values (run on clr transformed abundances). It is an alternative to the classic Student's t-test and is more reliable if the data violates the assumption of homogeneity of variances and/or if the study conditions have unequal sample sizes. P lmer XXX (clr): P-value of Linear Mixed-Effects Regression on clr transformed abundances, testing significance of XXX effect. Varying intercepts per subjects are used to control for repeated measures. Pbonf: Bonferroni corrected p-value. FDR: False Discovery Rate q-value. Mean Pos: Mean abundance in positive samples. Positive samples: The number and percentage of samples in which each family has been detected. Positive XXX: The number and percentage of positive samples in study group XXX. Positive_XXX_percent: Percentage of positive samples in study group XXX. P Welch's t-test (Aldex2): Expected P value of Welch’s t-test computed by Aldex2. FDR Welch's t-test (Aldex2): Expected Benjamini-Hochberg corrected P value for Welch’s t-test. P Wilcoxon rank test (Aldex2): Expected P value of Wilcoxon rank test computed by Aldex2. FDR Wilcoxon rank test (Aldex2): Expected Benjamini-Hochberg corrected P value of Wilcoxon test. P Kruskal-Wallace test (Aldex2): Expected P value of Kruskal-Wallace test. FDR Kruskal-Wallace test (Aldex2): Expected Benjamini-Hochberg corrected P value of Kruskal-Wallace test. P GLM test (Aldex2): Expected P value of generalized linear model. FDR GLM test (Aldex2): Expected Benjamini-Hochberg corrected P value of generalized linear model.

Differentially abundant taxa (family)

The following plots present the distribution of the top most differentially abundant families across all applied statistical analysis. Plots are ordered alphabetically.

Glossary:

"Rel. Abundance": Relative abundance data; "Rel. Abundance (sqrt)": sqrt (Hellinger) transformed relative abundances; "Rel. Abundance (clr)": centered log-ratio (clr) transformed read counts.
Acidaminococcaceae

Acutalibacteraceae

Bacteroidaceae

Bifidobacteriaceae

Butyricicoccaceae

CAG 74

CAG 138

CAG 274

CAG 302

CAG 313

CAG 1000

Coriobacteriaceae

Erysipelatoclostridiaceae

Lachnospiraceae

Lactobacillaceae

Monoglobaceae

Oscillospiraceae

Rikenellaceae

Ruminococcaceae

UBA1381