Ability to Predict Symptoms with 99.9% probability using Bacteria Incidence and Amount

This is a follow up to Ability to Predict Symptoms with 99.9% probability using Bacteria Incidence alone. We compute a threshold for each bacteria; then compute the odds ratio using this threshold and add it to the computations. Because of the larger number of samples, Biomesight results dominated and only those are shown.

Most have a cumulative value around 100 [i.e. log(OR)=100] and a few have very small values suggesting that the root association is not the microbiome for those conditions.

Biomesight

Symptom NameTax_rankCumulativeCnt
Autonomic Manifestations: irritable bowel syndromespecies103.4279
Autonomic Manifestations: light-headednessspecies106.0251
Autonomic Manifestations: nauseaspecies104.5230
Autonomic Manifestations: Neurally mediated hypotension (NMH)species1.51
Autonomic Manifestations: Orthostatic intolerancespecies105.2253
Autonomic Manifestations: Postural orthostatic tachycardia syndrome (POTS)species115.7264
General: Fatiguespecies94.8319
General: Headachesspecies100.8269
General: Myalgia (pain)species100.1237
Neurological: Confusionspecies83.7179
Neurological: Difficulty processing information (Understanding)species105.1265
Neurological: Disorientationspecies82.2165
Neurological: emotional overloadspecies95.9252
Neurological: fasciculationsspecies74.3162
Neurological: Impairment of concentrationspecies121.3276
Neurological: Short-term memory issuesspecies110.4280
Neurological: Spatial instability and disorientationspecies2.92
Neurological: Word-finding problemsspecies103.6286
Neurological-Audio: hypersensitivity to noisespecies113.4282
Neurological-Sleep: Chaotic diurnal sleep rhythms (Erratic Sleep)species80.2173
Neurological-Vision: inability to focus eye/visionspecies93.4204
Neurological-Vision: photophobia (Light Sensitivity)species115.4257
Post-exertional malaise: Inappropriate loss of physical and mental stamina,species114.8278
Sleep: Unrefreshed sleepspecies98.3306

Ability to Predict Symptoms with 99.9% probability using Bacteria Incidence alone

I am off from my usual day job until the new year. In the new year I know that I will be very busy because my firm just landed a contract for a major software product that I am the principal for. I decided to give myself a challenge to explore on these down days:

How accurately can you prediction symptoms from the presence or absence of bacteria reported ALONE with different 16s test.

For those familiar with various forms of Artificial Intelligence, that approach is often used. It reduces the problem to a collection of true/false. For most microbiologists, it is a road not even thought about, lest travelled.

To make the challenge harder, I required the data to have a P value of 0.001. The analysis demanded bacteria-symptom associations with a stringent P-value of 0.001 (Chi² > 10.83), exceeding typical microbiome study thresholds. 

I have four contributed and annotated datasets:

  • 16s
    • uBiome: 791 samples
    • Ombre (formerly Thryve): 1,319 samples
    • Biomesight: 4,436 samples
  • Shotgun
    • Thorne : 253 samples

Using these datasets, I explored the strength of relationships based on Odds Ratio. A subsequent post includes using odds ratio based on a threshold of bacteria which will get much higher values. A high cumulative value indicates a very strong microbiome basis of the symptom and thus remediation.

For details, see the methodology in : New Standards for Microbiome Analysis? Also, taking amount of each bacteria into consider is shown in Ability to Predict Symptoms with 99.9% probability using Bacteria Incidence and Amount

Relationships were quantified using Odds Ratios (OR) at consistent taxonomy levels to avoid dependence, with cumulative log(OR) indicating symptom-microbiome strength (higher values suggest robust basis for remediation). ​

The Tax Rank indicates what is likely the most effective level to use for investigation (i.e. highest discrimination ability).

Thorne

Symptom NameTax_rankCumulativeCnt
General: Fatiguespecies25.2637

Ombre / Thryve

SymptomNameTax_rankCumulativeCnt
Autonomic Manifestations: Orthostatic intolerancegenus21.3923
General: Fatiguespecies7.7232
General: Headachesgenus27.2637
General: Myalgia (pain)species8.4931
Neurological: Confusionspecies1.312
Neurological: Difficulty processing information (Understanding)species9.0519
Neurological: Disorientationspecies1.403
Neurological: emotional overloadspecies4.7411
Neurological: Impairment of concentrationgenus22.7132
Neurological: Word-finding problemsgenus15.2415
Neurological-Audio: hypersensitivity to noisegenus29.5843
Neurological-Sleep: Chaotic diurnal sleep rhythms (Erratic Sleep)genus35.0141
Neurological-Vision: inability to focus eye/visiongenus42.3453
Neurological-Vision: photophobia (Light Sensitivity)genus47.8764
Post-exertional malaise: Inappropriate loss of physical and mental stamina,species20.2045
Sleep: Unrefreshed sleepspecies21.1949

uBiome

While no longer in existence, sharing numbers may be interesting.

Symptom NameTax_rankCumulativeCnt
General: Fatiguespecies5.6915
General: Headachesspecies3.127
General: Myalgia (pain)species1.746
Neurological: Confusionspecies2.823
Neurological: Difficulty processing information (Understanding)species1.576
Neurological: emotional overloadspecies6.0817
Neurological: fasciculationsstrain1.253
Neurological: Impairment of concentrationspecies14.2918
Neurological: Short-term memory issuesspecies0.947
Neurological: Spatial instability and disorientationspecies1.821
Neurological: Word-finding problemsspecies6.3811
Neurological-Audio: hypersensitivity to noisespecies7.7111
Neurological-Sleep: Chaotic diurnal sleep rhythms (Erratic Sleep)species7.726
Neurological-Vision: inability to focus eye/visionspecies11.6314
Neurological-Vision: photophobia (Light Sensitivity)species10.5713
Sleep: Unrefreshed sleepspecies5.4711

BiomeSight

Symptom NameTax_rankCumulativeCnt
Autonomic Manifestations: irritable bowel syndromespecies3.8710
Autonomic Manifestations: light-headednessspecies8.0715
Autonomic Manifestations: nauseaspecies2.5511
Autonomic Manifestations: Neurally mediated hypotension (NMH)species1.461
Autonomic Manifestations: Postural orthostatic tachycardia syndrome (POTS)species1.8711
General: Fatiguespecies7.6520
General: Headachesspecies3.9415
General: Myalgia (pain)species1.6310
Neurological: Confusionspecies4.936
Neurological: Difficulty processing information (Understanding)species0.639
Neurological: emotional overloadspecies0.8110
Neurological: fasciculationsgenus3.619
Neurological: Impairment of concentrationspecies2.105
Neurological: Short-term memory issuesspecies2.415
Neurological: Spatial instability and disorientationspecies2.914
Neurological: Word-finding problemsspecies2.6821
Neurological-Audio: hypersensitivity to noisegenus2.536
Neurological-Vision: inability to focus eye/visionspecies2.104
Neurological-Vision: photophobia (Light Sensitivity)species9.4628
Post-exertional malaise: Inappropriate loss of physical and mental stamina,species2.4412
Sleep: Unrefreshed sleepspecies1.7517

Summary

Nota Bene: the above is the cumulative of the log values. It is assumed that for each bacteria, the highest odd ratio is used /hit. A value of 0.81 means exp(0.81) = 2.25 is the highest odds ratio possible if the sample hits every child highest odds ratio. A value of 8.07 becomes odds ratio of 3188.

All of the above are 16s tests which typically are viewed accurate to species at best. The difference of test processing is strongly exhibited in the table below. For background on the challenge on a lack of standardization in microbiome testing, see my post from 6 years ago: The taxonomy nightmare before Christmas…

Symptom NameOmbreuBiomeBiomeSight
General: Fatigue7.725.697.65
General: Headaches27.263.123.94
General: Myalgia (pain)8.491.741.63
Neurological: Confusion1.312.824.93
Neurological: Difficulty processing information (Understanding)9.051.570.63
Neurological: emotional overload4.746.080.81
Neurological: Impairment of concentration22.7114.292.10
Neurological: Word-finding problems15.246.382.68
Neurological-Audio: hypersensitivity to noise29.587.712.53
Neurological-Vision: inability to focus eye/vision42.3411.632.10
Neurological-Vision: photophobia (Light Sensitivity)47.8710.579.46
Sleep: Unrefreshed sleep21.195.471.75

Looking at the counts:

Symptom NameOmbreuBiomeBiomeSight
General: Fatigue321520
General: Headaches37715
General: Myalgia (pain)31610
Neurological: Confusion236
Neurological: Difficulty processing information (Understanding)1969
Neurological: emotional overload111710
Neurological: Impairment of concentration32185
Neurological: Word-finding problems151121
Neurological-Audio: hypersensitivity to noise43116
Neurological-Vision: inability to focus eye/vision53144
Neurological-Vision: photophobia (Light Sensitivity)641328
Sleep: Unrefreshed sleep491117

I view these stark differences due to the fragments of RNA that each test looks at to make the identification of bacteria. It is those RNA fragments that is important.

All of the data used above is available for download.

Odds Ratio Snapshot: Attention deficit hyperactivity disorder (ADHD)

This document presents the results of statistical analysis on symptoms from viable, self-annotated Biomesight microbiome samples. The methodology for data acquisition is outlined in New Standards for Microbiome Analysis?.

​Quick Best Probiotics

For details, see bottom

  • Bifidobacterium breve
  • Bifidobacterium longum
  • Bifidobacterium adolescentis

Lacticaseibacillus (one of the lactobacillus probiotics) is very excessive and Lactobacillus probiotics should generally be avoided. Check your yogurt labels!

Tables have been refined to display only genus- and species-level taxa, the 20 most prominent entries per group, and associations achieving statistical significance (P < 0.01).

The following sections provide the processed data, accompanied by guidance on interpretation and application. Counts of significant bacterial taxa are included, reflecting the application of non-standard but rigorously validated statistical approaches to extensive sample and reference populations, where statistical power derives from dataset scale.

SignificanceGenus
p < 0.01134
p < 0.001125
p < 0.0001119
p < 0.00001105

Averages and Medians

I prefer medians over averages. Medians are the values where half of the people have less and half has more. If the data was a bell-curve, then the values will almost be the same… with bacteria that happens rarely. Look at the bacterua below, we see that for some the average is above and the median below. Should one increase or decrease this bacteria?

If symptom median is higher than reference median, it means there is more of this bacteria. If lower, then less. This ignores how often the bacteria is seen (we average only over reports). IMHO using average value instead of median will often result in a worse situation for the patient

tax_nameRankSymptom AverageReference AverageSymptom MedianReference Median
Phocaeicola vulgatusspecies7.3725.7743.4275.031
Faecalibacteriumgenus12.48212.78412.07310.514
Phocaeicolagenus10.90810.8549.36910.392
Blautiagenus8.978.4487.1766.431
Lachnospiragenus1.8632.7461.8991.168
Roseburiagenus3.5742.8221.7782.222
Phocaeicola doreispecies1.7172.9350.430.038
Parabacteroidesgenus3.2522.6111.7242.116
Bacteroides uniformisspecies2.9382.7271.5711.917
Oscillospiragenus2.6562.3491.9522.285
Parabacteroides distasonisspecies1.9431.2280.6040.911
Clostridiumgenus1.9591.8571.3641.665
Sutterellagenus1.8341.641.2441.49
Sutterella wadsworthensisspecies0.7340.6570.050.262
Coprococcusgenus1.1121.4380.730.53
Lachnospira pectinoschizaspecies0.3690.670.340.162
Novispirillumgenus1.0360.8640.0950.259
Insolitispirillumgenus1.0350.8650.0950.259
Insolitispirillum peregrinumspecies1.0350.8650.0950.259
Bacteroides thetaiotaomicronspecies1.091.0720.4660.628

Bacteria Incidence – How often is it reported

The common sense belief is that if a bacteria is reported more often, then the amount should be higher. This is often not true. The microbiome is a complex thing. Excessive Lacticaseibacillus (one of the lactobacillus probiotics) is very excessive.

tax_nameRankIncidence Odds RatioChi2Symptoms %Reference %
Collinsella tanakaeispecies2.4319.937.415.4
Anaerofustis stercorihominisspecies2.0612.736.317.6
Anaerofustisgenus1.9811.436.318.3
Lacticaseibacillusgenus1.839.238.521

More or Less often based on Symptom Median All Incidence

This is a little more complex to understand. If we compute the mid point for people with the symptom, then if the bacteria was not involved then half of the reference should be above this value and half below this value. If not, it means that the symptom tends to over or under growth.

tax_nameRankSymptom MedianOdds RatioChi2BelowAbove
Moraxella caviaespecies0.0020.2122.68117
Moraxellagenus0.0020.2519.18321
Rickettsiellagenus0.0020.2617.47620
Treponema porcinumspecies0.0020.3214.38427
Clostridium hveragerdensespecies0.0020.439.410244
Streptococcus infantisspecies0.0030.557.8808442
Desulfotomaculum defluviispecies0.0030.567.41033576
Alkalibacteriumgenus0.0030.576.8914521
Hydrogenophilusgenus0.0030.586.71166671

More or Less often based on Reference Median All Incidence

This is like the above, but with a different line in the sand. Instead of the median of those with the condition, we use the median of the reference set.

tax_nameRankReference MedianOdds RatioChi2BelowAbove
Phocaeicola doreispecies0.0382.32335.211712721
Corynebacteriumgenus0.0120.32324.41279413
Odoribacter denticanisspecies0.0060.41294.31881771
Lachnospira pectinoschizaspecies0.1622.18293.412602744
Sporotomaculumgenus0.0040.38268.11329500
Oribacteriumgenus0.0352.14264.911452451
Slackiagenus0.04650.47256.223361094
Oribacterium sinusspecies0.0352.11255.911512432
Luteolibactergenus0.0170.39243.31238479
Luteolibacter algaespecies0.0170.39238.11227479
Collinsella intestinalisspecies0.0090.41234.51330542
Collinsellagenus0.1080.48233.821261011
Blautia obeumspecies0.109951.98223.412712520
Lachnobacteriumgenus0.031.9220212842463
Johnsonella ignavaspecies0.04290.53200.325711356
Johnsonellagenus0.04290.53199.525711358
Eggerthella sinensisspecies0.0060.44196.61296574
Adlercreutzia equolifaciensspecies0.0130.491911674814
Pontibactergenus0.0040.42190.91085456
Pontibacter niistensisspecies0.0040.42189.61082456

More or Less often based on Symptom Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

None were found

More or Less often based on Reference Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankReference Median FreqOdds RatioChi2BelowAbove
Phocaeicola doreispecies0.0382.32335.211712721
Corynebacteriumgenus0.0120.32324.41279413
Odoribacter denticanisspecies0.0060.41294.31881771
Lachnospira pectinoschizaspecies0.1622.18293.412602744
Sporotomaculumgenus0.0040.38268.11329500
Oribacteriumgenus0.0352.14264.911452451
Slackiagenus0.04650.47256.223361094
Oribacterium sinusspecies0.0352.11255.911512432
Luteolibactergenus0.0170.39243.31238479
Luteolibacter algaespecies0.0170.39238.11227479
Collinsella intestinalisspecies0.0090.41234.51330542
Collinsellagenus0.1080.48233.821261011
Blautia obeumspecies0.109951.98223.412712520
Lachnobacteriumgenus0.031.9220212842463
Johnsonella ignavaspecies0.04290.53200.325711356
Johnsonellagenus0.04290.53199.525711358
Eggerthella sinensisspecies0.0060.44196.61296574
Adlercreutzia equolifaciensspecies0.0130.491911674814
Pontibactergenus0.0040.42190.91085456
Pontibacter niistensisspecies0.0040.42189.61082456

Summary

A large number of bacterial taxa exhibit shifts with P < 0.01 in association with this condition. The subsequent challenge is determining how to modulate these taxa, since the volume of candidates exceeds what most individuals can practically consider. Moreover, for many of the taxa identified, there is no published evidence in the U.S. National Library of Medicine describing how to alter their abundance.

A deep optimization model, such as the one implemented on the Microbiome Taxa R2 site, can be used to inform probiotic selection. This model provides coverage for each identified taxon and infers which probiotics are most likely to shift their levels. Its output may then be integrated with more conventional recommendations derived from literature indexed in the U.S. National Library of Medicine where such evidence exists, with the two recommendation sets reconciled by giving priority to probiotic-based suggestions.

Development of a dedicated database based on Biomesight samples is in progress. The current model uses data contributed by PrecisionBiome, and datasets generated with differing laboratory processing pipelines cannot be safely combined, as discussed in The taxonomy nightmare before Christmas…. Once the Biomesight-specific database is complete, an option for generating (offline-only) personalized suggestions will be added to the Microbiome Prescription website.

Probiotics Suggestions

The following are based on a simplified algorithm using R2 data for Biomesight. These are tentative numbers subject to future refinements. Bacteria listed are only for probiotics detected with Biomesight tests. Probiotics include some that are available only in some countries and some that are pending approval for retail sale.

  • Good Count: Number of bacteria expected to shift in desired direction
  • Bad Count: Number of bacteria expected to shift in wrong direction
  • Impact: Estimator of impact based on Chi-2, Slope and R2 vectors
Probiotic SpeciesImpactGood CountBad Count
Faecalibacterium prausnitzii174.4865
Bifidobacterium breve35.77180
Bifidobacterium longum31.47170
Bifidobacterium adolescentis23.42160
Bifidobacterium bifidum7.27110
Blautia wexlerae7.0633
Bifidobacterium catenulatum6.1780
Bifidobacterium animalis3.2570
Bacillus subtilis2.422212
Escherichia coli2.3732
Clostridium butyricum0.85913
Veillonella atypica0.38125
Segatella copri0.3311
Heyndrickxia coagulans-0.0768
Bifidobacterium pseudocatenulatum-0.0985
Leuconostoc mesenteroides-0.1736
Limosilactobacillus reuteri-0.36917
Lactiplantibacillus plantarum-0.3603
Lacticaseibacillus rhamnosus-0.3715
Lactiplantibacillus pentosus-0.4403
Ligilactobacillus salivarius-0.4847
Lacticaseibacillus casei-0.5216
Akkermansia muciniphila-0.54410
Lactobacillus acidophilus-0.5989
Limosilactobacillus fermentum-0.611111
Odoribacter laneus-0.6903
Lactobacillus crispatus-0.76313
Lacticaseibacillus paracasei-1.1337
Lactococcus lactis-1.413
Enterococcus durans-2.911511
Lactobacillus jensenii-3.052528
Lactobacillus helveticus-3.96234
Limosilactobacillus vaginalis-4.081944
Enterococcus faecium-4.7917
Enterococcus faecalis-11.454539
Bacteroides thetaiotaomicron-12.3636
Parabacteroides goldsteinii-15.08414
Streptococcus thermophilus-15.7709
Bacteroides uniformis-18.4244
Lactobacillus johnsonii-23.913336
Pediococcus acidilactici-37.581834
Parabacteroides distasonis-74.4619
Blautia hansenii-77.3129

Odds Ratio Snapshot: Gluten Sensitivity (Non-Celiac)

A reader asked about gluten sensitivity profile in an email. Here are the results. The short form for probiotics:

  • Bifidobacterium breve
  • Bifidobacterium longum
  • Bifidobacterium adolescentis
  • AVOID LACTOBACILLUS

This document presents the results of statistical analysis on symptoms from viable, self-annotated Biomesight microbiome samples. The methodology for data acquisition is outlined in New Standards for Microbiome Analysis?.

Tables have been refined to display only genus- and species-level taxa, the 20 most prominent entries per group, and associations achieving statistical significance (P < 0.01).

The following sections provide the processed data, accompanied by guidance on interpretation and application. Counts of significant bacterial taxa are included, reflecting the application of non-standard but rigorously validated statistical approaches to extensive sample and reference populations, where statistical power derives from dataset scale.

SignificanceGenus
p < 0.01162
p < 0.001146
p < 0.0001131
p < 0.00001116

Averages and Medians

I prefer medians over averages. Medians are the values where half of the people have less and half has more. If the data was a bell-curve, then the values will almost be the same… with bacteria that happens rarely. Look at the bacterua below, we see that for some the average is above and the median below. Should one increase or decrease this bacteria?

tax_nameRankSymptom AverageReference AverageSymptom MedianReference Median
Bacteroidesgenus27.5482624.26926.905
Faecalibacterium prausnitziispecies12.69512.19611.32912.474
Roseburiagenus2.3242.8571.8091.382
Lachnospiragenus3.1732.7111.8852.302
Oscillospiragenus2.6682.3451.9472.323
Bacteroides uniformisspecies2.8392.7281.5651.903
Parabacteroidesgenus3.1382.6071.7192.022
Clostridiumgenus2.0871.8511.3631.531
Pedobactergenus1.3150.9880.5520.706
Coprococcusgenus1.131.4420.730.604
Bacteroides thetaiotaomicronspecies0.9431.0770.4640.59
Bifidobacteriumgenus0.3520.9550.1290.028
Hathewaya histolyticaspecies0.4420.2730.1540.251
Hathewayagenus0.4420.2730.1540.251
Ruminococcus bromiispecies1.0390.7830.1670.262
Bacteroides cellulosilyticusspecies1.2660.8390.0760.155
Bilophilagenus0.4150.3480.2090.285
Bilophila wadsworthiaspecies0.3930.340.1990.262
Doreagenus0.3290.4880.2950.242
Bacteroides rodentiumspecies0.3610.3930.1860.235

If symptom median is higher than reference median, it means there is more of this bacteria. If lower, then less. This ignores how often the bacteria is seen (we average only over reports). IMHO using average value instead of median will often result in a worse situation for the patient

Bacteria Incidence – How often is it reported

The common sense belief is that if a bacteria is reported more often, then the amount should be higher. This is often not true. The microbiome is a complex thing. In this case two specific probiotic species are seen rarely and thus, supplementation could be inferred.

tax_nameRankIncidence Odds RatioChi2Symptoms %Reference %
Bifidobacterium brevespecies0.578.823.641.3
Bifidobacterium catenulatumspecies0.66.721.635.9

More or Less often based on Symptom Median All Incidence

This is a little more complex to understand. If we compute the mid point for people with the symptom, then if the bacteria was not involved then half of the reference should be above this value and half below this value. If not, it means that the symptom tends to over or under growth.

tax_nameRankSymptom MedianOdds RatioChi2BelowAbove
Thiorhodococcus mannitoliphagusspecies0.0020.237.913227
Cystobactergenus0.0020.2137.413127
Psychrobacter glacialisspecies0.0020.3633.8675243
Rickettsia marmionii Stenos et al. 2005species0.0020.3630.3393140
Niabellagenus0.0020.3829585224
Viridibacillus neideispecies0.0020.3927470182
Thiorhodococcusgenus0.0020.4322.9579247
Thermodesulfovibrio thiophilusspecies0.0020.4421.5541236
Oenococcusgenus0.0020.4520.7614275
Thermodesulfovibriogenus0.0020.4520.1626284
Helicobacter suncusspecies0.0020.4619.6765355
Viridibacillusgenus0.0020.514.8488244
Desulfotomaculum defluviispecies0.0030.5611.61017569
Alkalibacteriumgenus0.0030.5710.6899514
Sporotomaculum syntrophicumspecies0.0030.5810.41127652
Pelagicoccusgenus0.0020.5810.1842487
Treponemagenus0.0030.589.7593342
Olivibacter solispecies0.0020.579.5457262
Hydrogenophilusgenus0.0030.599.51133671
Mycoplasma iguanaespecies0.0020.589.1458266

More or Less often based on Reference Median All Incidence

This is like the above, but with a different line in the sand. Instead of the median of those with the condition, we use the median of the reference set.

tax_nameRankReference MedianOdds RatioChi2BelowAbove
Bifidobacteriumgenus0.0282.37347.911542736
Tetragenococcusgenus0.0040.44234.41638719
Bifidobacterium adolescentisspecies0.0042.03215.710742176
Hathewaya histolyticaspecies0.25050.52202.825681345
Hathewayagenus0.25050.52202.225671346
Psychrobacter glacialisspecies0.0020.36168.1675243
Anaerotruncusgenus0.17850.57155.724391383
Caloramator uzoniensisspecies0.0060.51153.91408712
Bifidobacterium choerinumspecies0.00551.87151.99171718
Mogibacteriumgenus0.0220.57145.421151195
Methylonatrumgenus0.0040.541451627872
Methylonatrum kenyensespecies0.0040.541451627872
Anaerotruncus colihominisspecies0.17050.5814324151403
Hymenobacter xinjiangensisspecies0.0070.53137.41486795
Niabellagenus0.0020.38135.9585224
Streptococcus australisspecies0.00950.57127.517731010
Leptolyngbya laminosaspecies0.00450.44125.9698304
Leptolyngbyagenus0.00450.44125.8701306
Bifidobacterium longumspecies0.01951.7312410471814
Vagococcusgenus0.0030.48119.9841403

More or Less often based on Symptom Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankSymptom Median FreqOdds RatioChi2BelowAbove
Clostridium taeniosporumspecies0.0030.68.91354818

More or Less often based on Reference Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankReference Median FreqOdds RatioChi2BelowAbove
Bifidobacteriumgenus0.0282.37347.911542736
Tetragenococcusgenus0.0040.44234.41638719
Bifidobacterium adolescentisspecies0.0042.03215.710742176
Hathewaya histolyticaspecies0.25050.52202.825681345
Hathewayagenus0.25050.52202.225671346
Psychrobacter glacialisspecies0.0020.36168.1675243
Anaerotruncusgenus0.17850.57155.724391383
Caloramator uzoniensisspecies0.0060.51153.91408712
Bifidobacterium choerinumspecies0.00551.87151.99171718
Mogibacteriumgenus0.0220.57145.421151195
Methylonatrumgenus0.0040.541451627872
Methylonatrum kenyensespecies0.0040.541451627872
Anaerotruncus colihominisspecies0.17050.5814324151403
Hymenobacter xinjiangensisspecies0.0070.53137.41486795
Niabellagenus0.0020.38135.9585224
Streptococcus australisspecies0.00950.57127.517731010
Leptolyngbya laminosaspecies0.00450.44125.9698304
Leptolyngbyagenus0.00450.44125.8701306
Bifidobacterium longumspecies0.01951.7312410471814
Vagococcusgenus0.0030.48119.9841403

Summary

A large number of bacterial taxa exhibit shifts with P < 0.01 in association with this condition. The subsequent challenge is determining how to modulate these taxa, since the volume of candidates exceeds what most individuals can practically consider. Moreover, for many of the taxa identified, there is no published evidence in the U.S. National Library of Medicine describing how to alter their abundance.

A deep optimization model, such as the one implemented on the Microbiome Taxa R2 site, can be used to inform probiotic selection. This model provides coverage for each identified taxon and infers which probiotics are most likely to shift their levels. Its output may then be integrated with more conventional recommendations derived from literature indexed in the U.S. National Library of Medicine where such evidence exists, with the two recommendation sets reconciled by giving priority to probiotic-based suggestions.

Development of a dedicated database based on Biomesight samples is in progress. The current model uses data contributed by PrecisionBiome, and datasets generated with differing laboratory processing pipelines cannot be safely combined, as discussed in The taxonomy nightmare before Christmas…. Once the Biomesight-specific database is complete, an option for generating (offline-only) personalized suggestions will be added to the Microbiome Prescription website.

Probiotics Suggestions

The following are based on a simplified algorithm using R2 data for Biomesight. These are tentative numbers subject to future refinements. Bacteria listed are only for probiotics detected with Biomesight tests. Probiotics include some that are available only in some countries and some that are pending approval for retail sale.

  • Good Count: Number of bacteria expected to shift in desired direction
  • Bad Count: Number of bacteria expected to shift in wrong direction
  • Impact: Estimator of impact based on Chi-2, Slope and R2 vectors
Probiotic SpeciesImpactGood CountBad Count
Bifidobacterium breve103.8160
Bifidobacterium longum93.37172
Bifidobacterium adolescentis68.66142
Enterococcus faecalis42.165924
Lactobacillus johnsonii30.974620
Segatella copri24.1660
Bifidobacterium bifidum21.18123
Bifidobacterium catenulatum18.93100
Akkermansia muciniphila15.46819
Lactobacillus helveticus10.824644
Bifidobacterium animalis9.6770
Pediococcus acidilactici6.743120
Enterococcus faecium6.472126
Blautia wexlerae2.621
Streptococcus thermophilus2.2871
Escherichia coli2.2342
Clostridium butyricum2.07213
Enterococcus durans0.622716
Lactococcus lactis0.4782
Bacillus subtilis0.241427
Bifidobacterium pseudocatenulatum0.171812
Limosilactobacillus fermentum0.1188
Veillonella atypica0.07143
Heyndrickxia coagulans0.01912
Limosilactobacillus reuteri-0.011213
Lacticaseibacillus paracasei-0.03410
Lactobacillus crispatus-0.041313
Ligilactobacillus salivarius-0.0416
Leuconostoc mesenteroides-0.0988
Lacticaseibacillus casei-0.1329
Lactiplantibacillus plantarum-0.1306
Lactiplantibacillus pentosus-0.1406
Lacticaseibacillus rhamnosus-0.1718
Lactobacillus acidophilus-0.211622
Limosilactobacillus vaginalis-0.313229
Odoribacter laneus-0.4503
Lactobacillus jensenii-1.033026
Blautia hansenii-3.61014
Parabacteroides goldsteinii-38.45016
Parabacteroides distasonis-48.84011
Bacteroides uniformis-147.38210
Bacteroides thetaiotaomicron-157.0728
Faecalibacterium prausnitzii-278.227

Another Follow-up Test, ME/CFS

Prior posts for this person are linked below

I am going to review using the traditional analysis. My initial impression is that suggested retesting and plotting the next course correction was missed. As with sailing a boat, this can sail a person to an unintended spot. The last section is trying some work in progress on his sample(s). This is experimental work which I have high hope on yielding much finer identification of the critical bacteria that should be addressed.

Analysis

My usual starting point for multiple tests is compare forecasts of symptoms: New sample is 2025-11, old sample is 2025-03. Things have gotten worse.

I decide to compare 2025-11 to 2024-12 and see the latest sample is still worse, but not as bad. In other words, the gains made over the summer has been lost. This person is in a northern climate so seasonal variation could (theoretically) be significant.

As often, we have a high hit rate of projected symptoms against actual symptoms.

Current Back Story

The first line is reflected above.

I have not been feeling so well lately (since the last year).

I would say that my symptoms has become worse.

Earlier it has always felt as I have done some progress but the last 12 months it has been the opposite. 

Earlier I got rid of my muscle and joint pain but it has come back and I have much bigger issues with my red nose and my body feels very stressed.

Also feel very bloated.

A summary of my biggest issues:

  • Get the red nose (some form of rosacea). 
  • Feel fatigued (both physically and mentally). 
  • Feeling stressed. 
  • Brain fog.
  • Bloated.
  • Lots of gas – I fart and burps a lot. 
  • Issues with allergies
  • Muscle and joint pain

For the last 4 years I’ve been eating large amounts of rye and oats.

Around 150-200 gram of rye bread every day.

Around 70 gram of oats every day.

Been eating low fat, low protein and high carb (specially from rye, oats, apple juice and potatoes) because this diet seem to reduce my symptoms.

As soon as I start to eat high meat and high fat my symptoms get worse.

Traditional Analysis

First, I am doing the “traditional” analysis before exploring some work in progress to improve suggestions further. The process is simple, pick Beginner-Symptoms, mark symptoms and get suggestions. This is the process that seems to produce the best results. Other choices are intended to satisfy people with different assumptions. The site purpose is allow people to use the data according to their beliefs about the best way.

Despite having 42 symptoms entered, this boiled down to just 20 bacteria. Many related symptoms are connected to the same bacteria.

Investigate: “As soon as I start to eat high meat and high fat my symptoms”

Looking at the consensus report we see:

Which agrees with his reported response. On the other side, generic “fat” is a significant plus– so the type of fat seems to be critical.

Investigate Current Eating Habits: eating large amounts of rye and oats.

The suggestions are intended to be course corrections for the microbiome. Keeping on a course for too long may end up running aground on mudflats (instead of the original reefs that the course correction was intended for).

The question is why rye is ok and other grains are not? It may be due to some composition aspect or a side-effect of having sparse data. It looks like some change of diet is suggested.

The positive food items appear to be:

  • fruit: grapes, especially lingonberries, cranberries (and likely cloudberries!), citric fruits, bananas
  • fish: fish oil,
  • legumes: beets, nuts, Asparagus, Rhubarb but NOT Pulses, Beans
  • Quinoa – which unlike cereals above is gluten free

Vitamins

The list is pretty short.

  • To take: Vitamin B6,7 and 12 but no other B vitamins (i.e. no mixtures)
  • Iron and Vitamin D are fine

Probiotics

There are two approaches here.

  • Using PubMed Studies only (sparse data, negative impacts almost never published)
  • Using Inference from Microbiome Taxa R2 Site.
  • A third way is in the next section with is like the R2 site, but using a 16s Biomesight reference set instead of a shotgun.

PubMed Studies

The top ones, in descending order are (with strike thru on ones that are hard to obtain):

Microbiome Taxa R2 Site

We end up with just 2 “safe” (positive impact only)

This leaves only one with 100% consensus: E.Coli (Mutaflor, Symbioflor-2). Looking at PubMed with a net positive with R2, we also have

My Own Experience

While fighting ME/CFS, I retested about 6 weeks after getting the results of the last test. I noticed that suggestions swing back and worth a lot – but I kept following them. Often there can be a battle between “common sense beliefs” and what the algorithms find. Avoiding something during one cycle and then taking it the next cycle seemed “irrational”. I borrowed from physical processes the concept of “microbiome oscillations” and stopped worrying about the swings.

My personal advice is simple, get results and then do suggestions for 6-8 week and do another test. With test processing delays, it means about 10-12 weeks on each set of suggestions

Going Forward — and a new Algorithm

Recently I have been working on an Odds Ratio investigation. The reason is simple, the Odds ratio gives an objective measure of the importance of each bacteria for the symptoms. The new approach uses Odds Ratio to determine the odds of a bacteria causing a symptom. The odds tells me the importance. If you are interested in more technical data, see:

I will be trying it out on his data. The databases involved are about 160GB with processing often taking 20 minutes for each processing state, so they are on my “garage” high performance server (nerd talk: 64GB of memory, fast M.2 NVMe  2TB drive for disk) and strictly for research/exploration at the moment.

Key differences

  • we are going to estimate symptoms a different way than traditional (using odds)
  • we are likely to have 100+ bacteria to shift

Predicted Symptoms Rank Order

Using the Odds Ratio approach we get the following predictions that agrees with his reported symptoms/characteristics.

Symptom NameStrength
Age: 30-4011.2
Sleep: Unrefreshed sleep10.9
Comorbid: Small intestinal bacterial overgrowth (SIBO)9.1
Immune Manifestations: Inflammation (General)9
General: Fatigue9
DePaul University Fatigue Questionnaire : Tingling feeling8.8
Neuroendocrine: Cold limbs (e.g. arms, legs hands)8.5
Neuroendocrine Manifestations: cold extremities8.3
Neurocognitive: Brain Fog8.1
Post-exertional malaise: Worsening of symptoms after mild mental activity8
DePaul University Fatigue Questionnaire : Fatigue7.9
Gender: Male7.9
DePaul University Fatigue Questionnaire : Muscle Pain (i.e., sensations of pain or aching in your muscles. This does not include weakness or pain in other areas such as joints)7.8
Immune Manifestations: Bloating7.5
DePaul University Fatigue Questionnaire : Allergies7.4
DePaul University Fatigue Questionnaire : Muscle weakness6.9
DePaul University Fatigue Questionnaire : Post-exertional malaise, feeling worse after doing activities that require either physical or mental exertion6.7
DePaul University Fatigue Questionnaire : Rash6.5
Post-exertional malaise: Mentally tired after the slightest effort6.3
Comorbid: Histamine or Mast Cell issues6.3
Post-exertional malaise: Next-day soreness after everyday activities6
Post-exertional malaise: Muscle fatigue after mild physical activity6
Official Diagnosis: Mast Cell Dysfunction5.9
Neuroendocrine Manifestations: worsening of symptoms with stress.5.9
Post-exertional malaise: Worsening of symptoms after mild physical activity5.8
DePaul University Fatigue Questionnaire : Does physical activity make you feel worse5.7
DePaul University Fatigue Questionnaire : Unrefreshing Sleep, that is waking up feeling tired5.7
Immune: Flu-like symptoms5.7
DePaul University Fatigue Questionnaire : Pain in Multiple Joints without Swelling or Redness5.5
Neurological-Audio: hypersensitivity to noise5.2
Immune Manifestations: Inflammation of skin, eyes or joints5.1

Looking at the existing estimates, we see far greater separation in weight/estimates. I favor separation because that implies much better focus on bacteria.

  1.  DePaul University Fatigue Questionnaire : Pain in Multiple Joints without Swelling or Redness ✅ – [83.2%]
  2.  DePaul University Fatigue Questionnaire : Muscle weakness ✅ – [83.1%]
  3.  Neuroendocrine Manifestations: cold extremities ✅ – [83.1%]
  4.  Neurocognitive: Brain Fog ✅ – [82.7%]
  5.  DePaul University Fatigue Questionnaire : Muscle Pain (i.e., sensations of pain or aching in your muscles. This does not include weakness or pain in other areas such as joints) ✅ – [82.7%]
  6.  Post-exertional malaise: Worsening of symptoms after mild physical activity ✅ – [82.4%]
  7.  Post-exertional malaise: Next-day soreness after everyday activities ✅ – [82.3%]
  8.  General: Fatigue ✅ – [82.3%]
  9.  Sleep: Unrefreshed sleep ✅ – [82.2%]
  10.  Comorbid: Small intestinal bacterial overgrowth (SIBO) ✅ – [82.1%]
  11.  Immune Manifestations: Bloating ✅ – [81.9%]
  12.  DePaul University Fatigue Questionnaire : Fatigue ✅ – [81.7%]
  13.  Official Diagnosis: Chronic Fatigue Syndrome (CFS/ME) ✅ – [81.5%]
  14.  Neurological-Audio: hypersensitivity to noise ✅ – [81.5%]
  15.  DePaul University Fatigue Questionnaire : Unrefreshing Sleep, that is waking up feeling tired ✅ – [81.4%]
  16.  DePaul University Fatigue Questionnaire : Difficulty staying asleep ✅ – [81.3%]
  17.  Official Diagnosis: Autoimmune Disease ✅ – [81.2%]
  18.  Immune Manifestations: Inflammation (General) ✅ – [81.2%]
  19.  DePaul University Fatigue Questionnaire : Easily irritated – [81.2%]
  20.  Neuroendocrine Manifestations: worsening of symptoms with stress. ✅ – [81.2%]

Key Bacteria identified

The new approach identifies these bacteria to target, with their relative importance (Weight). I just did another post on a ME/CFSer, Microbiome Interpretation – Questions From A User. Megamonas also was her top one.

tax nametax rankWeightTarget
Megamonasgenus108.9Too High
Klebsiella oxytocaspecies95.8Too High
Ruminococcus bromiispecies-17.9Too Low
Eubacterialesorder-16.1Too Low
Clostridiaclass-15.1Too Low
Megamonas funiformisspecies15Too High
Bacillotaphylum-14.2Too Low
Ruminococcaceaefamily-14.2Too Low
Segatellagenus13.6Too High
Oscillospiraceaefamily-13.4Too Low
Ruminococcusgenus-12.4Too Low
Terrabacteria groupclade-12.4Too Low
Segatella coprispecies11.9Too High
Bacteroidiaclass11.4Too High
Bacteroidalesorder11.4Too High
Bacteroidota/Chlorobiota groupclade10.3Too High
Bacteroidotaphylum10.1Too High
FCB groupclade9.8Too High
Prevotellagenus9.7Too High
Lachnospiraceaefamily-9.3Too Low
Prevotellaceaefamily9Too High
Phocaeicola vulgatusspecies8.4Too High
Akkermansiaceaefamily6.7Too High
Bacteroides uniformisspecies6Too High
Pseudomonadotaphylum5.9Too High
Gammaproteobacteriaclass5.2Too High
Yersiniagenus4.8Too High
Verrucomicrobiotaphylum4.3Too High
Akkermansiagenus4.1Too High
Verrucomicrobialesorder4Too High

The traditional approach identifies the list below. There is relatively little overlap. My ‘gut’ reading is that those above are likely a better candidate set than those below.

BacteriaRankShift
ThiotrichalesorderHigh
SharpeagenusHigh
SelenomonasgenusHigh
RuminococcusgenusHigh
NegativicutesclassLow
JohnsonellagenusHigh
HoldemaniagenusHigh
ErysipelothrixgenusHigh
DoreagenusHigh
DesulfovibrioniaclassLow
delta/epsilon subdivisionscladeLow
CyanophyceaeclassLow
Cyanobacteriota/Melainabacteria groupcladeLow
CoprococcusgenusHigh
ChlorobiotaphylumHigh
ChlorobiiaclassHigh
ChlorobiaceaefamilyHigh
ChlorobaculumgenusHigh
ActinomycetotaphylumLow
ActinomycetesclassLow

Suggestions

Since two bacteria dominates, I ran the suggestion algorithm only on those two bacteria. The results are below and very similar to the results from the traditional approach. “All algorithms lead to the same suggestions”. Doing the full list cited above, produced very similar suggestions.

ModifierNet
fruit/legume fibre263
fruit241
Fiber, total dietary217
Chitosan217
Slow digestible carbohydrates. {Low Glycemic}210
oolong teas205
polyphenols203
resveratrol-pterostilbene x Quercetin  {quercetin x resveratrol}202
(2->1)-beta-D-fructofuranan {Inulin}201
3,3′,4′,5,7-pentahydroxyflavone {Quercetin}200
High-fibre diet {Whole food diet}199
Citrus limon  {Lemon}194
 5,6-dihydro-9,10-dimethoxybenzo[g]-1,3-benzodioxolo[5,6-a]quinolizinium {Berberine}193
Lonicera periclymenum {Epazote}184
Grape Polyphenols {Grape Flavonoids}177
tea177
dietary fiber169
red wine168
grapes163
Linum usitatissimum {Flaxseed}160
Lactobacillus plantarum {L. plantarum}158
Camellia Linnaeus {camellia}158
Coffee150
Hydrastis canadensis {Goldenseal}148
Ethanoic acid {Vinegar}148
Ulmus rubra {slippery elm}145
Ligilactobacillus salivarius {L. salivarius}144
Ginkgo biloba {Ginkgo}139
Cola aspartame {Diet Cola}138
pseudo-cereals  {amaranth,quinoa, taro,buckwheat}137
Agaricus bisporus {White button mushrooms}134
Theobroma cacao {Cacao}131
Caffeine127
Lacticaseibacillus rhamnosus {l. rhamnosus}124
fucoidan {Brown Algae Extract}122
Litchi chinensis {lychee fruit}121
coptis chinensis {Chinese goldthread }121

Probiotics using R2

First I used only the top two bacteria to see what is suggested with a very targeted set.

  • Bacillus subtilis 75
  • Lactobacillus jensenii 60

Below are pushing the full set of identified bacteria through BiomeSight R2 matrix, then filtered to positive impact with no risk. Escherichia coli (cited above) continues to be a take. Bacillus subtilis would be my fall back suggestion for a probiotic. It is marginally negative on the consensus report and not cited on other R2 suggestion list. For others candidates

  • Lactobacillus jensenii was one for and no comment
  • Lactococcus lactis is a one for and one against
  • Lactobacillus helveticus is one strong against and no comment
  • Lacticaseibacillus casei is one strong against and no comment
  • Akkermansia muciniphila is two strong avoid

I tend to do a variation of the traditional “Do no harm”, minimize the risk of adverse shifts.

ProbioticNet ImpactGood CountBad Count
Bacillus subtilis81.950
Lactobacillus jensenii68.520
Lactococcus lactis26.820
Lactobacillus helveticus21.440
Lacticaseibacillus casei21.420
Segatella copri21.230
Lactiplantibacillus pentosus2150
Akkermansia muciniphila18.740
Bacillus amyloliquefaciens group18.310
Enterococcus faecium16.310
Limosilactobacillus fermentum13.610
Heyndrickxia coagulans13.430
Enterococcus durans12.420
Pediococcus acidilactici1110
Enterococcus faecalis10.620
Leuconostoc mesenteroides10.510
Escherichia coli4.330

Bottom Line

The purpose of this post was to evaluate suggestions for a regular reader. The secondary goal was to see how well a new approach that I am developing is working. This new approach produces different targeted bacteria with very similar suggestions generated, the most significance difference is far more targeted probiotics for the symptoms based on the same lab data.

The one interesting aspect is that the key bacteria (just 2) were clearly identified. These two bacteria alone produced suggestions similar to the bigger bacteria selection. I do like this narrow bacteria selection of key bacteria and will likely do a few more samples to further explore things.

Follow Up

I decided to look at all of his samples with the new algorithm to look for patterns. Megamonas stands out as the one that most frequently appears and disappears. Klebsiella oxytoca and Morganellaceae are the next candidates.

Upload DateTop Bacteria
2021-09-24Megamonas genus 108.9 Too High
Lachnospiraceae family -56.2 Too Low
Eubacteriales order -47.3 Too Low
Bacillota phylum -46.6 Too Low
Clostridia class -46.4 Too Low
Terrabacteria group clade -44.8 Too Low
2021-09-24 Bacillota phylum -46.5 Too Low
Eubacteriales order -45.1 Too Low
Terrabacteria group clade -45 Too Low
Clostridia class -44.1 Too Low
Lachnospiraceae family -43.6 Too Low
2022-04-19 Klebsiella oxytoca species 95.8 Too High
Morganellaceae family 91 Too High
2022-09-04 Oscillospiraceae family 17.7 Too High
Ruminococcaceae family 17.4 Too High
Bacteroidaceae family -16 Too Low
Bacteroides genus -15.9 Too Low
2023-03-15 Megamonas genus 108.9 Too High
Lachnospiraceae family -26 Too Low
2023-09-26 Megamonas genus 108.9 Too High
2024-02-13 Bacteroidaceae family -27.3 Too Low
Bacteroides genus -27.3 Too Low
Phocaeicola dorei species -21.4 Too Low
2024-09-25 Segatella genus 13.6 Too High
Segatella copri species 11.9 Too High
2025-04-22Morganellaceae family 91.6 Too High
2025-12-08Megamonas genus 108.9 Too High
Klebsiella oxytoca species 95.8 Too High

Odds Ratio Snapshot: Restless Legs

This document presents the results of statistical analysis on symptoms from viable, self-annotated Biomesight microbiome samples. The methodology for data acquisition is outlined in New Standards for Microbiome Analysis?.

In my weekly review, I noticed this new study: “Analysis of gut microbiota in Restless Legs Syndrome: searching for a metagenomic signature” Dec 2025 That identifies “a statistically significant decrease in the abundance of Lachnoclostridium and Flavonifractor genera in RLS compared to CTRL“. Just two bacteria. I was surprised to see so few reported. Below you will see my results (all all of the source data is available for download for those that wish to verify the numbers). I was able to get 137 significant shifts

Tables have been refined to display only genus- and species-level taxa, the 20 most prominent entries per group, and associations achieving statistical significance (P < 0.01).

The following sections provide the processed data, accompanied by guidance on interpretation and application. Counts of significant bacterial taxa are included, reflecting the application of non-standard but rigorously validated statistical approaches to extensive sample and reference populations, where statistical power derives from dataset scale.

SignificanceGenus
p < 0.01137
p < 0.001126
p < 0.0001119
p < 0.00001103

Averages and Medians

I prefer medians over averages. Medians are the values where half of the people have less and half has more. If the data was a bell-curve, then the values will almost be the same… with bacteria that happens rarely. Look at the bacterua below, we see that for some the average is above and the median below. Should one increase or decrease this bacteria?

If symptom median is higher than reference median, it means there is more of this bacteria. If lower, then less. This ignores how often the bacteria is seen (we average only over reports). IMHO using average value instead of median will often result in a worse situation for the patient

tax_nameRankSymptom AvarageReference AverageSymptom MedianReference Median
Bacteroidesgenus26.79726.01724.33728.038
Phocaeicolagenus10.59310.8529.36510.564
Bacteroides uniformisspecies2.922.7241.5552.385
Ruminococcusgenus6.6755.5764.3774.823
Clostridiumgenus2.0421.8551.361.803
Oscillospiragenus2.8982.3451.9522.313
Parabacteroidesgenus2.7412.6221.7232.016
Bacteroides cellulosilyticusspecies1.3590.8430.0750.312
Parabacteroides merdaespecies0.8230.7450.2980.531
Pedobactergenus1.050.9990.5530.742
Coprococcusgenus1.1931.4380.7350.558
Sutterellagenus1.3691.6511.2511.095
Roseburia faecisspecies1.2821.2050.5730.709
Novispirillumgenus0.8450.8660.0910.225
Insolitispirillumgenus0.8450.8670.0930.222
Insolitispirillum peregrinumspecies0.8450.8670.0930.222
Ruminococcus bromiispecies1.0150.7840.1690.292
Blautia coccoidesspecies0.7260.9150.5930.47
Caloramatorgenus1.0130.9370.1030.211
Blautia hanseniispecies1.191.0350.7170.824

Bacteria Incidence – How often is it reported

The common sense belief is that if a bacteria is reported more often, then the amount should be higher. This is often not true. The microbiome is a complex thing.

Nothing significant was found

More or Less often based on Symptom Median All Incidence

This is a little more complex to understand. If we compute the mid point for people with the symptom, then if the bacteria was not involved then half of the reference should be above this value and half below this value. If not, it means that the symptom tends to over or under growth.

tax_nameRankSymptom MedianOdds RatioChi2BelowAbove
Streptococcus ursorisspecies0.0020.2621.69825
Actinopolymorphagenus0.0020.3517.415052
Actinopolymorpha rutilaspecies0.0020.3417.313847
Helicobacter suncusspecies0.0020.4713.1771363
Thermodesulfovibriogenus0.0020.4713.1627293
Desulfotomaculum defluviispecies0.0030.558.51037569
Bacteroides helcogenesspecies0.0020.448.58035
Bifidobacterium pullorumspecies0.0020.498.313968
Hydrogenophilusgenus0.0030.587.21162670
Pelagicoccusgenus0.0020.586.9850493
Sporotomaculum syntrophicumspecies0.0030.586.81134663

More or Less often based on Reference Median All Incidence

This is like the above, but with a different line in the sand. Instead of the median of those with the condition, we use the median of the reference set.

tax_nameRankReference MedianOdds RatioChi2BelowAbove
Caldicellulosiruptorgenus0.0270.46279.525721189
Bifidobacterium longumspecies0.0122.23257.18911990
Hymenobacter xinjiangensisspecies0.0080.43239.61608696
Hymenobactergenus0.0080.47223.61832854
Thermicanusgenus0.1890.5211.122541135
Bifidobacterium gallicumspecies0.00352.21204.66541448
Anaerotruncus colihominisspecies0.1780.5320125321331
Bacteroides cellulosilyticusspecies0.3120.53191.224631311
Candidatus Glomeribactergenus0.0040.46179.21282592
Anaerotruncusgenus0.180.55170.624881379
Segatellagenus0.0161.8167.313472426
Staphylococcusgenus0.0040.43167.3943402
Erysipelothrixgenus0.01650.55167.222311220
Porphyromonasgenus0.0130.55165.721721185
Erysipelothrix murisspecies0.0150.55158.921861211
Clostridiumgenus1.8030.57158.325511454
Emticicia oligotrophicaspecies0.0070.54156.719441057
Caloramator uzoniensisspecies0.00650.51155.51416716
Emticiciagenus0.0070.5515419421062
Bifidobacterium choerinumspecies0.0051.86151.49271728

More or Less often based on Symptom Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

Per above, nothing was found

More or Less often based on Reference Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankReference Median FreqOdds RatioChi2BelowAbove
Caldicellulosiruptorgenus0.0270.46279.525721189
Bifidobacterium longumspecies0.0122.23257.18911990
Hymenobacter xinjiangensisspecies0.0080.43239.61608696
Hymenobactergenus0.0080.47223.61832854
Thermicanusgenus0.1890.5211.122541135
Bifidobacterium gallicumspecies0.00352.21204.66541448
Anaerotruncus colihominisspecies0.1780.5320125321331
Bacteroides cellulosilyticusspecies0.3120.53191.224631311
Candidatus Glomeribactergenus0.0040.46179.21282592
Anaerotruncusgenus0.180.55170.624881379
Segatellagenus0.0161.8167.313472426
Staphylococcusgenus0.0040.43167.3943402
Erysipelothrixgenus0.01650.55167.222311220
Porphyromonasgenus0.0130.55165.721721185
Erysipelothrix murisspecies0.0150.55158.921861211
Clostridiumgenus1.8030.57158.325511454
Emticicia oligotrophicaspecies0.0070.54156.719441057
Caloramator uzoniensisspecies0.00650.51155.51416716
Emticiciagenus0.0070.5515419421062
Bifidobacterium choerinumspecies0.0051.86151.49271728

Summary

A large number of bacterial taxa exhibit shifts with P < 0.01 in association with this condition. The subsequent challenge is determining how to modulate these taxa, since the volume of candidates exceeds what most individuals can practically consider. Moreover, for many of the taxa identified, there is no published evidence in the U.S. National Library of Medicine describing how to alter their abundance.

A deep optimization model, such as the one implemented on the Microbiome Taxa R2 site, can be used to inform probiotic selection. This model provides coverage for each identified taxon and infers which probiotics are most likely to shift their levels. Its output may then be integrated with more conventional recommendations derived from literature indexed in the U.S. National Library of Medicine where such evidence exists, with the two recommendation sets reconciled by giving priority to probiotic-based suggestions.

Development of a dedicated database based on Biomesight samples is in progress. The current model uses data contributed by PrecisionBiome, and datasets generated with differing laboratory processing pipelines cannot be safely combined, as discussed in The taxonomy nightmare before Christmas…. Once the Biomesight-specific database is complete, an option for generating (offline-only) personalized suggestions will be added to the Microbiome Prescription website.

Probiotics Suggestions

The following are based on a simplified algorithm using R2 data for Biomesight. These are tentative numbers subject to future refinements. Bacteria listed are only for probiotics detected with Biomesight tests. Probiotics include some that are available only in some countries and some that are pending approval for retail sale.

  • Good Count: Number of bacteria expected to shift in desired direction
  • Bad Count: Number of bacteria expected to shift in wrong direction
  • Impact: Estimator of impact based on Chi-2, Slope and R2 vectors
Probiotic SpeciesImpactGood CountBad Count
Segatella copri52.2880
Bifidobacterium breve43.16170
Enterococcus faecalis37.793733
Bifidobacterium longum37.42180
Bifidobacterium adolescentis28.09170
Lactobacillus johnsonii20.642828
Lactobacillus helveticus14.22441
Blautia wexlerae13.6631
Bifidobacterium bifidum8.5110
Bifidobacterium catenulatum7.5990
Enterococcus faecium5.851218
Akkermansia muciniphila4.13821
Bifidobacterium animalis3.7670
Bacillus subtilis2.132021
Clostridium butyricum1.211011
Bifidobacterium pseudocatenulatum0.287
Ligilactobacillus salivarius0.0525
Limosilactobacillus fermentum0.0497
Limosilactobacillus reuteri0.021117
Enterococcus durans-0.011814
Heyndrickxia coagulans-0.0156
Lactococcus lactis-0.0352
Veillonella atypica-0.04211
Leuconostoc mesenteroides-0.0657
Lactiplantibacillus pentosus-0.0613
Lactiplantibacillus plantarum-0.0714
Lacticaseibacillus rhamnosus-0.0705
Lacticaseibacillus casei-0.0816
Lactobacillus crispatus-0.1310
Lactobacillus acidophilus-0.3511
Limosilactobacillus vaginalis-0.471634
Streptococcus thermophilus-0.9214
Lactobacillus jensenii-1.131824
Odoribacter laneus-2.203
Pediococcus acidilactici-5.982328
Escherichia coli-28.64012
Parabacteroides goldsteinii-37.48019
Blautia hansenii-39.47419
Parabacteroides distasonis-42.8209
Faecalibacterium prausnitzii-239.335
Bacteroides uniformis-252.6127
Bacteroides thetaiotaomicron-284.92210

Microbiome Interpretation – Questions From A User

A reader wrote the following to me. This post and associate video is an attempt to answer.

When I compare my first two analyses from Biomesight under “Multiple Samples,” the distribution of bacteria improves toward normal—assuming that the reduction in “Lab Read Quality” from 39.3 to 23.5 does not mean that the latest sample is a false positive. I have read your article “Lab Quality Versus Bacteria Reported,” but I am too brain fogged to really understand the content. 

In fact, “Symptom Pattern Matching” states that there was only one improvement and 150 deteriorations. Subjectively, I feel differently, not really better, but not much worse either. That’s why I don’t know how to interpret these results. (Better: nausea, migraine, constipation—Worse: brain fog, fatigue, sleep, muscle strength. So, speculation that: MCAS is improving, but ME-CFS continues to progressively worsen).

The suggestions under “To Avoid” have changed under “Food.” It is interesting to note that, before I was aware of these suggestions, they were already reflected in my appetite or aversion to certain foods; so it is reassuring that my body perception actually corresponds to the measured facts and the statistically supported conclusions. 

Unfortunately, my subjective perception of a change, but not an improvement, would then also correspond with the “outside ranges” and “symptom pattern matching.” (Objectively, I measure heart rate variability, pulse, and temperature with a simple GARMIN device and thermometer.)

I don’t know enough about statistics and am not fully mentally present, so I would be grateful if you could provide a few assessments of the measured changes, if you have the time. 

https://youtu.be/JXP9UWQVU_A

Sample Comparison

With two or more samples, the ability to compare samples using symptom forecasts is intended to give a good indicator of change. In most cases, there is significant improvement. In this case, things became worse.

There can be many causes:

  • Catching a virus, food poisoning, etc between samples
  • Ignoring the “Avoids” – items that feed the bad bacteria
  • Trying to follow two sets of advice that have not been reconcilled
    • A health consultant and microbiome prescription suggestions.

“Feeling the same” with these minor shifts is not unexpected. Remembering how you were tend to be unreliable; bad memories fail fast.

Getting Suggestions is Easy, Picking Bacteria is not

Microbiome Prescription is constantly update on what influence bacteria from new studies every week. At present if has a 7,432,372 Modifier-to-Bacteria relationships in its database. Given a set of bacteria, their shifts and the relative importance of each bacteria, the suggestions are a relatively simple computation.

Picking Bacteria

If you go to the typical alternative health practitioners, or just ask on line, you may not need to get a microbiome tests. From your symptoms, they will speculate on the issue and give suggestions that they will swear works. In reality, they may appear to work because the patients that they worked for, will come back to see them again. For those where it does not work, they will move on to the next “expert”.

Testing labs will often provide a reference range for some bacteria and thus identify if you have too much or too low. There are many technical issues using these. Personally, I avoid using them — but to make people happy, several are provided on the site (Old UI).

The Simple UI gives a few canned choices for selecting the bacteria and then does the easy part, computing suggestions. The numbers of bacteria vary greatly.

A new algorithm in development, identified 2 very critical bacteria (HIGH WEIGHT) and 20 minor bacteria. The goal of the new algorithm is to better pinpointing the key bacteria and their impact.to look

  • Both Samples had the same top item
    • Megamonas, genus, weight 109, too high on both tests
    • Morganellaceae, famiy, weight 92, too high on latest test only,

The addition of Morganellaceae in the latest sample may account for the worse report.

Going forward, I would look at what decreases these two bacteria. I ran your latest data through the suggestion algorithm and attach the full set of suggestions in excel.

The main take items from the new algorithm are below

  • Slow digestible carbohydrates. {Low Glycemic}
  • dietary fiber
  • Fiber, total dietary
  • fruit
  • fruit/legume fibre
  • High-fibre diet {Whole food diet}
  • Lactobacillus plantarum {L. plantarum}
  • (2->1)-beta-D-fructofuranan {Inulin}
  • oligosaccharides {oligosaccharides}
  • 5,6-dihydro-9,10-dimethoxybenzo[g]-1,3-benzodioxolo[5,6-a]quinolizinium {Berberine}
  • bacillus,lactobacillus,streptococcus,saccharomyces probiotic
  • 3,3′,4′,5,7-pentahydroxyflavone {Quercetin}
  • fucoidan {Brown Algae Extract}
  • Lacticaseibacillus rhamnosus {l. rhamnosus}
  • yogurt
  • grapes
  • polyphenols
  • Grape Polyphenols {Grape Flavonoids}
  • bacillus
  • bifidobacterium longum {B.Longum }
  • Linum usitatissimum {Flaxseed}
  • resveratrol-pterostilbene x Quercetin {quercetin x resveratrol}
  • ß-glucan {Beta-Glucan}
  • Saccharomyces cerevisiae var boulardii {S. boulardii}
  • Outer Layers of Triticum aestivum {Wheat Bran}
  • tea

Compared to the existing using symptoms. They are similar and both have top items of:

  • Lactobacillus plantarum
  • fruit
  • dietary fiber

Odds Ratio Snapshot: Photophobia (Light Sensitivity)

This document presents the results of statistical analysis on symptoms from viable, self-annotated Biomesight microbiome samples. The methodology for data acquisition is outlined in New Standards for Microbiome Analysis?.

Tables have been refined to display only genus- and species-level taxa, the 20 most prominent entries per group, and associations achieving statistical significance (P < 0.01).

The following sections provide the processed data, accompanied by guidance on interpretation and application. Counts of significant bacterial taxa are included, reflecting the application of non-standard but rigorously validated statistical approaches to extensive sample and reference populations, where statistical power derives from dataset scale.

SignificanceGenus
p < 0.01177
p < 0.001160
p < 0.0001138
p < 0.00001124

Averages and Medians

I prefer medians over averages. Medians are the values where half of the people have less and half has more. If the data was a bell-curve, then the values will almost be the same… with bacteria that happens rarely. Look at Faecalibacterium prausnitzii is below, we see that the average is above and the median below.

If symptom median is higher than reference median, it means there is more of this bacteria. If lower, then less. This ignores how often the bacteria is seen (we average only over reports).

tax_nameRankSymptom AvarageReference AverageSymptom MedianReference Median
Faecalibacterium prausnitziispecies10.36612.27711.4159.08
Faecalibacteriumgenus10.88112.84312.1319.826
Lachnospiragenus2.4012.7381.91.418
Coprococcusgenus1.0711.4430.7370.428
Phocaeicola doreispecies2.6992.9160.4180.128
Parabacteroidesgenus2.3852.6341.7241.989
Clostridiumgenus2.0051.8541.3591.6
Roseburia faecisspecies0.9511.2150.5760.457
Bacteroides caccaespecies1.590.8520.2860.402
Mediterraneibactergenus0.8050.7130.2770.381
Bacteroides thetaiotaomicronspecies1.1041.0710.4630.561
Lachnospira pectinoschizaspecies0.5470.6670.3360.249
Bifidobacteriumgenus0.7610.940.1270.042
Bacteroides cellulosilyticusspecies1.3960.8360.0760.151
Blautia wexleraespecies0.8690.5690.3140.386
Bilophilagenus0.3430.350.210.278
Anaerotruncusgenus0.2840.1840.1360.203
Akkermansia muciniphilaspecies2.3981.3250.050.117
Akkermansiagenus2.3981.3250.0510.117
Anaerotruncus colihominisspecies0.2590.1730.1330.198

Bacteria Incidence – How often is it reported

The common sense belief is that if a bacteria is reported more often, then the amount should be higher. This is often not true. The microbiome is a complex thing. Look at Bacteroides uniformis below, we see that the average is above and the median below

tax_nameRankIncidence Odds RatioChi2Symptoms %Reference %
Actinobacillus porcinusspecies0.616.924.540.2
Slackia faecicanisspecies1.537.844.829.2
Mogibacterium vescumspecies1.7911.532.218

More or Less often based on Symptom Median All Incidence

This is a little more complex to understand. If we compute the mid point for people with the symptom, then if the bacteria was not involved then half of the reference should be above this value and half below this value. If not, it means that the symptom tends to over or under growth.

Psychrobacter glacialisspecies0.0020.3730.5664247
Niabellagenus0.0020.3927.4583226
Thermoanaerobacteriumgenus0.0020.424.4485195
Chromatiumgenus0.0020.4124.2508206
Chromatium weisseispecies0.0020.4124.1507206
Thermoanaerobacterium islandicumspecies0.0020.4123.6478195
Syntrophomonas sapovoransspecies0.0020.4222.5536226
Sporosarcina pasteuriispecies0.0020.4221.9440184
Thermodesulfovibrio thiophilusspecies0.0020.4321543236
Sporosarcinagenus0.0020.4320.6444191
Oenococcusgenus0.0020.4520.1609272
Thermodesulfovibriogenus0.0020.4519.5629285
Helicobacter suncusspecies0.0020.4718.3768361
Desulfofundulusgenus0.0020.4618.2496227
Herbaspirillum magnetovibriospecies0.0020.5113.6447226
Streptococcus infantisspecies0.0030.5412804437
Sphingomonasgenus0.0020.5311.9457242
Desulfotomaculum defluviispecies0.0030.5611.31022570
Alkalibacteriumgenus0.0030.5710.5906514
Hydrogenophilusgenus0.0030.5810.21149662

More or Less often based on Reference Median All Incidence

This is like the above, but with a different line in the sand. Instead of the median of those with the condition, we use the median of the reference set.

tax_nameRankReference MedianOdds RatioChi2BelowAbove
Methylonatrumgenus0.0050.35374.21861655
Methylonatrum kenyensespecies0.0050.35374.21861655
Anaerotruncus colihominisspecies0.1980.41365.127121113
Anaerotruncusgenus0.2030.42340.326881141
Odoribacter denticanisspecies0.0060.412911856760
Luteolibactergenus0.0170.38245.61225468
Luteolibacter algaespecies0.0170.39240.41214468
Finegoldiagenus0.01150.41212.11210501
Anaerococcusgenus0.0120.4206.31099444
Eggerthella sinensisspecies0.0060.44197.21289568
Finegoldia magnaspecies0.0080.4195.41014408
Coprococcusgenus0.42851.87191.413792577
Desulfovibrio fairfieldensisspecies0.03950.4175.1868347
Mogibacteriumgenus0.0230.54170.421541159
Bifidobacteriumgenus0.042451.8169.213902505
Rubritaleagenus0.0040.43168.7969415
Bifidobacterium longumspecies0.0161.9167.69861876
Lysobactergenus0.0040.36164.9657236
Porphyromonasgenus0.0130.54164.721561174
Psychrobacter glacialisspecies0.0020.37158664247

More or Less often based on Symptom Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankSymptom Median FreqOdds RatioChi2BelowAbove
Clostridium taeniosporumspecies0.0030.68.61359821

More or Less often based on Reference Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankReference Median FreqOdds RatioChi2BelowAbove
Methylonatrumgenus0.0050.35374.21861655
Methylonatrum kenyensespecies0.0050.35374.21861655
Anaerotruncus colihominisspecies0.1980.41365.127121113
Anaerotruncusgenus0.2030.42340.326881141
Odoribacter denticanisspecies0.0060.412911856760
Luteolibactergenus0.0170.38245.61225468
Luteolibacter algaespecies0.0170.39240.41214468
Finegoldiagenus0.01150.41212.11210501
Anaerococcusgenus0.0120.4206.31099444
Eggerthella sinensisspecies0.0060.44197.21289568
Finegoldia magnaspecies0.0080.4195.41014408
Coprococcusgenus0.42851.87191.413792577
Desulfovibrio fairfieldensisspecies0.03950.4175.1868347
Mogibacteriumgenus0.0230.54170.421541159
Bifidobacteriumgenus0.042451.8169.213902505
Rubritaleagenus0.0040.43168.7969415
Bifidobacterium longumspecies0.0161.9167.69861876
Lysobactergenus0.0040.36164.9657236
Porphyromonasgenus0.0130.54164.721561174
Psychrobacter glacialisspecies0.0020.37158664247

Summary

A large number of bacterial taxa exhibit shifts with P < 0.01 in association with this condition. The subsequent challenge is determining how to modulate these taxa, since the volume of candidates exceeds what most individuals can practically consider. Moreover, for many of the taxa identified, there is no published evidence in the U.S. National Library of Medicine describing how to alter their abundance.

A deep optimization model, such as the one implemented on the Microbiome Taxa R2 site, can be used to inform probiotic selection. This model provides coverage for each identified taxon and infers which probiotics are most likely to shift their levels. Its output may then be integrated with more conventional recommendations derived from literature indexed in the U.S. National Library of Medicine where such evidence exists, with the two recommendation sets reconciled by giving priority to probiotic-based suggestions.

Development of a dedicated database based on Biomesight samples is in progress. The current model uses data contributed by PrecisionBiome, and datasets generated with differing laboratory processing pipelines cannot be safely combined, as discussed in The taxonomy nightmare before Christmas…. Once the Biomesight-specific database is complete, an option for generating (offline-only) personalized suggestions will be added to the Microbiome Prescription website.

Probiotics Suggestions

The following are based on a simplified algorithm using R2 data for Biomesight. These are tentative numbers subject to future refinements. Bacteria listed are only for probiotics detected with Biomesight tests. Probiotics include some that are available only in some countries and some that are pending approval for retail sale.

  • Good Count: Number of bacteria expected to shift in desired direction
  • Bad Count: Number of bacteria expected to shift in wrong direction
  • Impact: Estimator of impact based on Chi-2, Slope and R2 vectors
Probiotic SpeciesImpactGood CountBad Count
Faecalibacterium prausnitzii434.7180
Bifidobacterium breve64.25170
Bifidobacterium longum57.5200
Bifidobacterium adolescentis42.25170
Bifidobacterium bifidum13.35122
Bifidobacterium catenulatum11.39110
Bifidobacterium animalis6.1470
Enterococcus faecium2.642325
Escherichia coli1.6758
Enterococcus faecalis1.375144
Limosilactobacillus reuteri0.811617
Streptococcus thermophilus0.7340
Bacillus subtilis0.522832
Veillonella atypica0.592
Lactobacillus helveticus0.344644
Enterococcus durans0.32521
Lactococcus lactis0.2261
Heyndrickxia coagulans0.121011
Lacticaseibacillus paracasei-0.0236
Lactiplantibacillus pentosus-0.0222
Lactiplantibacillus plantarum-0.0635
Bifidobacterium pseudocatenulatum-0.081913
Lacticaseibacillus rhamnosus-0.0924
Lacticaseibacillus casei-0.1337
Leuconostoc mesenteroides-0.24411
Ligilactobacillus salivarius-0.3219
Clostridium butyricum-0.41520
Odoribacter laneus-0.5503
Lactobacillus crispatus-0.65724
Lactobacillus acidophilus-1.02929
Segatella copri-1.142
Limosilactobacillus vaginalis-1.192545
Bacteroides thetaiotaomicron-2.2105
Bacteroides uniformis-2.7706
Lactobacillus jensenii-3.62251
Pediococcus acidilactici-19.952440
Blautia wexlerae-19.9932
Parabacteroides goldsteinii-22.17115
Akkermansia muciniphila-27.03226
Parabacteroides distasonis-35.63010
Blautia hansenii-39.03410
Lactobacillus johnsonii-41.553141

Odds Ratio Snapshot: Tinnitus (ringing in ear)

This document presents the results of statistical analysis on symptoms from viable, self-annotated Biomesight microbiome samples. The methodology for data acquisition is outlined in New Standards for Microbiome Analysis?.

Tables have been refined to display only genus- and species-level taxa, the 20 most prominent entries per group, and associations achieving statistical significance (P < 0.01).

The following sections provide the processed data, accompanied by guidance on interpretation and application. Counts of significant bacterial taxa are included, reflecting the application of non-standard but rigorously validated statistical approaches to extensive sample and reference populations, where statistical power derives from dataset scale.

SignificanceGenus
p < 0.01226
p < 0.001199
p < 0.0001176
p < 0.00001157

Averages and Medians

I prefer medians over averages. Medians are the values where half of the people have less and half has more. If the data was a bell-curve, then the values will almost be the same… with bacteria that happens rarely. Look at Bacteroides below, we see that the average is above and the median below.

If symptom median is higher than reference median, it means there is more of this bacteria. If lower, then less. This ignores how often the bacteria is seen (we average only over reports).

tax_nameRankSymptom AvarageReference AverageSymptom MedianReference Median
Bacteroidesgenus28.63325.84224.11427.576
Phocaeicola vulgatusspecies6.5935.7443.3474.644
Phocaeicolagenus11.77710.789.29110.397
Bacteroides uniformisspecies3.0312.7071.522.104
Parabacteroidesgenus2.5522.6311.7142.058
Bacteroides thetaiotaomicronspecies1.271.0560.4550.678
Clostridiumgenus2.0041.8471.3521.569
Oscillospiragenus2.452.3481.9442.144
Bacteroides cellulosilyticusspecies1.0320.840.070.237
Parabacteroides merdaespecies0.8540.7370.2870.44
Coprococcusgenus1.0711.4610.7390.59
Pedobactergenus1.2320.980.5480.695
Novispirillumgenus0.8160.8720.0870.229
Insolitispirillumgenus0.8160.8730.0890.229
Insolitispirillum peregrinumspecies0.8160.8730.0890.229
Bacteroides caccaespecies1.1350.8540.2810.38
Bifidobacteriumgenus0.5130.9690.1340.048
Bilophilagenus0.3950.3470.2060.276
Parabacteroides goldsteiniispecies0.5650.5710.130.194
Bilophila wadsworthiaspecies0.3760.3390.1970.256

Bacteria Incidence – How often is it reported

The common sense belief is that if a bacteria is reported more often, then the amount should be higher. This is often not true. The microbiome is a complex thing. Look at Bacteroides uniformis below, we see that the average is above and the median below

tax_nameRankIncidence Odds RatioChi2Symptoms %Reference %
Bifidobacterium catenulatumspecies0.698.325.236.2
Thiomonas thermosulfataspecies1.41829.420.9
Aggregatibactergenus0.657.81624.4
Desulfurispirillumgenus1.458.425.517.5
Desulfurispirillum alkaliphilumspecies1.458.125.217.4
Actinobacillus pleuropneumoniaespecies0.68.41118.5
Bifidobacterium cuniculispecies0.627.81219.3
Desulfonatronovibriogenus1.487.519.913.5
Erysipelothrix inopinataspecies1.477.419.913.5
Paraprevotella xylaniphilaspecies1.69.216.610.4
Candidatus Phytoplasma phoeniciumspecies1.598.71610.1
Trichodesmiumgenus1.566.612.98.3

More or Less often based on Symptom Median All Incidence

This is a little more complex to understand. If we compute the mid point for people with the symptom, then if the bacteria was not involved then half of the reference should be above this value and half below this value. If not, it means that the symptom tends to over or under growth.

tax_nameRankSymptom MedianOdds RatioChi2BelowAbove
Alcanivoraxgenus0.0020.2675.237598
Isoalcanivoraxgenus0.0020.2674.536595
Isoalcanivorax indicusspecies0.0020.2674.536595
Nostoc flagelliformespecies0.0020.2767.331684
Salidesulfovibriogenus0.0020.363.2381115
Salidesulfovibrio brasiliensisspecies0.0020.363.2381115
Niabella aurantiacaspecies0.0020.3462.1524176
Mycoplasmopsis lipophilaspecies0.0020.2761.227775
Psychroflexusgenus0.0020.360.4348105
Psychroflexus gondwanensisspecies0.0020.360.4348105
Deferribacter autotrophicusspecies0.0020.3159.6374117
Pelagicoccus croceusspecies0.0020.3259.2380120
Deferribactergenus0.0020.3259377119
Psychrobacter glacialisspecies0.0020.3755.3636238
Thermodesulfatator atlanticusspecies0.0020.353.627683
Thermodesulfatatorgenus0.0020.353.627683
Segetibacter aerophilusspecies0.0020.3453.1364122
Bacillus ferrariarumspecies0.0020.3452.1356120
Niabellagenus0.0020.3851.8560212
Rickettsia marmionii Stenos et al. 2005species0.0020.3551.6379131

More or Less often based on Reference Median All Incidence

This is like the above, but with a different line in the sand. Instead of the median of those with the condition, we use the median of the reference set.

tax_nameRankReference MedianOdds RatioChi2BelowAbove
Psychrobacter glacialisspecies0.0020.37150636238
Niabella aurantiacaspecies0.0020.34148.2524176
Desulfotomaculumgenus0.0040.51147.51394711
Alcanivoraxgenus0.0020.26145.837598
Isoalcanivoraxgenus0.0020.26142.836595
Isoalcanivorax indicusspecies0.0020.26142.836595
Niabellagenus0.0020.38132.4560212
Salidesulfovibriogenus0.0020.3127.5381115
Salidesulfovibrio brasiliensisspecies0.0020.3127.5381115
Bacteroides cellulosilyticusspecies0.2370.59127.222431325
Actinopolysporagenus0.0020.38125.5524198
Bifidobacteriumgenus0.0481.68125.313922332
Nostoc flagelliformespecies0.0020.27122.831684
Geobacillusgenus0.0030.43122.4672291
Bacteroides heparinolyticusspecies0.0030.49122.1928454
Erysipelothrix murisspecies0.0140.59121.620291195
Pelagicoccus croceusspecies0.0020.32120.7380120
Deferribacter autotrophicusspecies0.0020.31120.3374117
Deferribactergenus0.0020.32119.9377119
Psychroflexusgenus0.0020.3117.5348105

More or Less often based on Symptom Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankSymptom Median FreqOdds RatioChi2BelowAbove
Clostridium taeniosporumspecies0.0030.62161295805
Tetragenococcus doogicusspecies0.0030.6711.51316881
Dethiosulfovibriogenus0.0040.6711.31457981
Hydrocarboniphaga daqingensisspecies0.0040.718.715491096
Mycoplasmopsisgenus0.0050.72817091230

More or Less often based on Reference Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankReference Median FreqOdds RatioChi2BelowAbove
Psychrobacter glacialisspecies0.0020.37150636238
Niabella aurantiacaspecies0.0020.34148.2524176
Desulfotomaculumgenus0.0040.51147.51394711
Alcanivoraxgenus0.0020.26145.837598
Isoalcanivoraxgenus0.0020.26142.836595
Isoalcanivorax indicusspecies0.0020.26142.836595
Niabellagenus0.0020.38132.4560212
Salidesulfovibriogenus0.0020.3127.5381115
Salidesulfovibrio brasiliensisspecies0.0020.3127.5381115
Bacteroides cellulosilyticusspecies0.2370.59127.222431325
Actinopolysporagenus0.0020.38125.5524198
Bifidobacteriumgenus0.0481.68125.313922332
Nostoc flagelliformespecies0.0020.27122.831684
Geobacillusgenus0.0030.43122.4672291
Bacteroides heparinolyticusspecies0.0030.49122.1928454
Erysipelothrix murisspecies0.0140.59121.620291195
Pelagicoccus croceusspecies0.0020.32120.7380120
Deferribacter autotrophicusspecies0.0020.31120.3374117
Deferribactergenus0.0020.32119.9377119
Psychroflexusgenus0.0020.3117.5348105

Summary

A large number of bacterial taxa exhibit shifts with P < 0.01 in association with this condition. The subsequent challenge is determining how to modulate these taxa, since the volume of candidates exceeds what most individuals can practically consider. Moreover, for many of the taxa identified, there is no published evidence in the U.S. National Library of Medicine describing how to alter their abundance.

A deep optimization model, such as the one implemented on the Microbiome Taxa R2 site, can be used to inform probiotic selection. This model provides coverage for each identified taxon and infers which probiotics are most likely to shift their levels. Its output may then be integrated with more conventional recommendations derived from literature indexed in the U.S. National Library of Medicine where such evidence exists, with the two recommendation sets reconciled by giving priority to probiotic-based suggestions.

Development of a dedicated database based on Biomesight samples is in progress. The current model uses data contributed by PrecisionBiome, and datasets generated with differing laboratory processing pipelines cannot be safely combined, as discussed in The taxonomy nightmare before Christmas…. Once the Biomesight-specific database is complete, an option for generating (offline-only) personalized suggestions will be added to the Microbiome Prescription website.

Probiotics Suggestions

The following are based on a simplified algorithm using R2 data for Biomesight. These are tentative numbers subject to future refinements. Bacteria listed are only for probiotics detected with Biomesight tests. Probiotics include some that are available only in some countries and some that are pending approval for retail sale.

  • Good Count: Number of bacteria expected to shift in desired direction
  • Bad Count: Number of bacteria expected to shift in wrong direction
  • Impact: Estimator of impact based on Chi-2, Slope and R2 vectors
Probiotic SpeciesImpactGood CountBad Count
Blautia hansenii92.34111
Bifidobacterium breve70.82173
Faecalibacterium prausnitzii67.96120
Bifidobacterium longum64.14190
Blautia wexlerae50.1360
Bifidobacterium adolescentis46.86150
Segatella copri39.4870
Enterococcus faecalis30.686031
Bifidobacterium bifidum14.43153
Lactobacillus helveticus13.524777
Bifidobacterium catenulatum13.5132
Escherichia coli12.84130
Enterococcus faecium7.352026
Bifidobacterium animalis6.5981
Lactobacillus johnsonii1.254046
Streptococcus thermophilus1.1730
Veillonella atypica0.9390
Bacillus subtilis0.853346
Clostridium butyricum0.742425
Enterococcus durans0.453623
Ligilactobacillus salivarius0.31211
Leuconostoc mesenteroides0.231712
Lactococcus lactis0.13511
Limosilactobacillus fermentum0.081012
Lacticaseibacillus paracasei-0.081228
Lacticaseibacillus casei-0.109
Heyndrickxia coagulans-0.222035
Lactiplantibacillus plantarum-0.2629
Bifidobacterium pseudocatenulatum-0.411835
Lactiplantibacillus pentosus-0.44822
Lactobacillus crispatus-0.46132
Lactobacillus acidophilus-0.521128
Lacticaseibacillus rhamnosus-0.6328
Lactobacillus jensenii-1.441941
Odoribacter laneus-1.5404
Limosilactobacillus vaginalis-1.793052
Limosilactobacillus reuteri-2.223146
Pediococcus acidilactici-6.82523
Akkermansia muciniphila-13.33232
Parabacteroides goldsteinii-41.6018
Parabacteroides distasonis-60.2507
Bacteroides uniformis-250.42011
Bacteroides thetaiotaomicron-261.55111

Odds Ration Snapshot: Bloating

This document presents the results of statistical analysis on symptoms from viable, self-annotated Biomesight microbiome samples. The methodology for data acquisition is outlined in New Standards for Microbiome Analysis?.

Tables have been refined to display only genus- and species-level taxa, the 20 most prominent entries per group, and associations achieving statistical significance (P < 0.01).

The following sections provide the processed data, accompanied by guidance on interpretation and application. Counts of significant bacterial taxa are included, reflecting the application of non-standard but rigorously validated statistical approaches to extensive sample and reference populations, where statistical power derives from dataset scale.

SignificanceGenus
p < 0.01202
p < 0.001173
p < 0.0001145
p < 0.00001129

Averages and Medians

I prefer medians over averages. Medians are the values where half of the people have less and half has more. If the data was a bell-curve, then the values will almost be the same… with bacteria that happens rarely. Look at Bacteroides uniformis below, we see that the average is above and the median below.

If symptom median is higher than reference median, it means there is more of this bacteria. If lower, then less. This ignores how often the bacteria is seen (we average only over reports).

tax_nameRankSymptom AvarageReference AverageSymptom MedianReference Median
Bacteroidesgenus27.63325.88424.12126.729
Bacteroides uniformisspecies2.9852.7051.5272.059
Novispirillumgenus0.7840.8760.0850.174
Insolitispirillumgenus0.7840.8770.0860.174
Insolitispirillum peregrinumspecies0.7840.8770.0860.174
Bacteroides cellulosilyticusspecies1.0240.8360.0730.151
Bilophilagenus0.3760.3480.2060.265
Bifidobacteriumgenus0.6070.9690.1310.072
Bilophila wadsworthiaspecies0.3650.3390.1960.255
Blautia obeumspecies0.6540.5630.2280.281
Hathewaya histolyticaspecies0.3180.2750.1540.188
Hathewayagenus0.3180.2750.1540.188
Lachnobacteriumgenus0.3270.320.0750.049
Anaerotruncusgenus0.2130.1840.1360.159
Bifidobacterium longumspecies0.2480.330.050.029
Oribacteriumgenus0.1230.1310.0740.053
Anaerotruncus colihominisspecies0.20.1730.1330.153
Oribacterium sinusspecies0.1160.1270.0720.053
Odoribactergenus0.1940.1970.1230.139
Megamonasgenus0.420.4390.0030.018

Bacteria Incidence – How often is it reported

The common sense belief is that if a bacteria is reported more often, then the amount should be higher. This is often not true. The microbiome is a complex thing. Look at Bacteroides uniformis below, we see that the average is above and the median below

tax_nameRankIncidence Odds RatioChi2Symptoms %Reference %
Prevotella biviaspecies1.429.526.919

More or Less often based on Symptom Median All Incidence

This is a little more complex to understand. If we compute the mid point for people with the symptom, then if the bacteria was not involved then half of the reference should be above this value and half below this value. If not, it means that the symptom tends to over or under growth.

tax_nameRankSymptom MedianOdds RatioChi2BelowAbove
Alcanivoraxgenus0.0020.2777.836099
Isoalcanivoraxgenus0.0020.2777.334995
Isoalcanivorax indicusspecies0.0020.2777.334995
Niabella aurantiacaspecies0.0020.3469.1505172
Psychroflexusgenus0.0020.366.9345105
Psychroflexus gondwanensisspecies0.0020.366.9345105
Salidesulfovibriogenus0.0020.3265.7376120
Salidesulfovibrio brasiliensisspecies0.0020.3265.7376120
Deferribacter autotrophicusspecies0.0020.3265.4366116
Deferribactergenus0.0020.3264.5368118
Psychrobacter glacialisspecies0.0020.3862.5623237
Pelagicoccus croceusspecies0.0020.3262.3357116
Rickettsia marmionii Stenos et al. 2005species0.0020.3460.6372125
Bacillus ferrariarumspecies0.0020.3458.4350118
Niabellagenus0.0020.3956.6542211
Actinopolysporagenus0.0020.3955.8505195
Chromatiumgenus0.0020.3954.7480185
Chromatium weisseispecies0.0020.3954.4479185
Viridibacillus neideispecies0.0020.3853.7445170
Segetibacter aerophilusspecies0.0020.3653.7353126

More or Less often based on Reference Median All Incidence

This is like the above, but with a different line in the sand. Instead of the median of those with the condition, we use the median of the reference set.

tax_nameRankReference MedianOdds RatioChi2BelowAbove
Methylobacillus glycogenesspecies0.0030.4217.91190477
Methylobacillusgenus0.0030.42203.21190496
Psychrobacter glacialisspecies0.0020.38143.2623237
Niabella aurantiacaspecies0.0020.34140.5505172
Alcanivoraxgenus0.0020.27133.436099
Isoalcanivoraxgenus0.0020.2713134995
Isoalcanivorax indicusspecies0.0020.2713134995
Niabellagenus0.0020.39122.8542211
Salidesulfovibriogenus0.0020.32117.8376120
Salidesulfovibrio brasiliensisspecies0.0020.32117.8376120
Actinopolysporagenus0.0020.39117.1505195
Deferribacter autotrophicusspecies0.0020.32115.9366116
Psychroflexusgenus0.0020.3115.2345105
Psychroflexus gondwanensisspecies0.0020.3115.2345105
Deferribactergenus0.0020.32114.9368118
Chromatiumgenus0.0020.39112.4480185
Chromatium weisseispecies0.0020.39111.9479185
Helicobacter suncusspecies0.0020.46111.6717333
Pelagicoccus croceusspecies0.0020.32110357116
Rickettsia marmionii Stenos et al. 2005species0.0020.34109.4372125

More or Less often based on Symptom Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

tax_nameRankSymptom Median FreqOdds RatioChi2BelowAbove
Clostridium taeniosporumspecies0.0030.61211269777
Dethiosulfovibriogenus0.0040.6714.61417946
Tetragenococcus doogicusspecies0.0030.6812.61279876
Hydrocarboniphaga daqingensisspecies0.0040.729.814991078
Mycoplasmopsisgenus0.0050.729.716611199
Pediococcusgenus0.0040.757.21217913
Propionispora hippeispecies0.0050.766.814491101
Propionisporagenus0.0050.766.714481102

More or Less often based on Reference Median High Incidence

Above we see that many of the top bacteria identified are sparse, that is not reported often. We then restrict them to those that occur above 50% or the time.

Methylobacillus glycogenesspecies0.0030.4217.91190477
Methylobacillusgenus0.0030.42203.21190496
Psychrobacter glacialisspecies0.0020.38143.2623237
Niabella aurantiacaspecies0.0020.34140.5505172
Alcanivoraxgenus0.0020.27133.436099
Isoalcanivoraxgenus0.0020.2713134995
Isoalcanivorax indicusspecies0.0020.2713134995
Niabellagenus0.0020.39122.8542211
Salidesulfovibriogenus0.0020.32117.8376120
Salidesulfovibrio brasiliensisspecies0.0020.32117.8376120
Actinopolysporagenus0.0020.39117.1505195
Deferribacter autotrophicusspecies0.0020.32115.9366116
Psychroflexusgenus0.0020.3115.2345105
Psychroflexus gondwanensisspecies0.0020.3115.2345105
Deferribactergenus0.0020.32114.9368118
Chromatiumgenus0.0020.39112.4480185
Chromatium weisseispecies0.0020.39111.9479185
Helicobacter suncusspecies0.0020.46111.6717333
Pelagicoccus croceusspecies0.0020.32110357116
Rickettsia marmionii Stenos et al. 2005species0.0020.34109.4372125

Summary

A large number of bacterial taxa exhibit shifts with P < 0.01 in association with this condition. The subsequent challenge is determining how to modulate these taxa, since the volume of candidates exceeds what most individuals can practically consider. Moreover, for many of the taxa identified, there is no published evidence in the U.S. National Library of Medicine describing how to alter their abundance.

A deep optimization model, such as the one implemented on the Microbiome Taxa R2 site, can be used to inform probiotic selection. This model provides coverage for each identified taxon and infers which probiotics are most likely to shift their levels. Its output may then be integrated with more conventional recommendations derived from literature indexed in the U.S. National Library of Medicine where such evidence exists, with the two recommendation sets reconciled by giving priority to probiotic-based suggestions.

Development of a dedicated database based on Biomesight samples is in progress. The current model uses data contributed by PrecisionBiome, and datasets generated with differing laboratory processing pipelines cannot be safely combined, as discussed in The taxonomy nightmare before Christmas…. Once the Biomesight-specific database is complete, an option for generating (offline-only) personalized suggestions will be added to the Microbiome Prescription website.

Probiotics Suggestions

The following are based on a simplified algorithm using R2 data for Biomesight. These are tentative numbers subject to future refinements. Bacteria listed are only for probiotics detected with Biomesight tests. Probiotics include some that are available only in some countries and some that are pending approval for retail sale.

  • Good Count: Number of bacteria expected to shift in desired direction
  • Bad Count: Number of bacteria expected to shift in wrong direction
  • Impact: Estimator of impact based on Chi-2, Slope and R2 vectors
Probiotic SpeciesImpactGood CountBad Count
Bifidobacterium breve38.64130
Bifidobacterium longum34.39120
Bifidobacterium adolescentis25.49110
Segatella copri21.7650
Akkermansia muciniphila16.19137
Lactobacillus helveticus10.857036
Bifidobacterium bifidum7.8162
Bifidobacterium catenulatum6.98140
Escherichia coli3.7980
Lactobacillus johnsonii3.745216
Bifidobacterium animalis3.5380
Pediococcus acidilactici3.333126
Enterococcus faecalis2.724726
Enterococcus durans2.315014
Enterococcus faecium1.832822
Streptococcus thermophilus1.2821
Clostridium butyricum0.92411
Limosilactobacillus reuteri0.822712
Lactococcus lactis0.2863
Leuconostoc mesenteroides0.261812
Lacticaseibacillus paracasei0.25208
Ligilactobacillus salivarius0.25165
Limosilactobacillus fermentum0.231914
Bacillus subtilis0.153729
Bifidobacterium pseudocatenulatum0.14299
Lactiplantibacillus plantarum0.1231
Limosilactobacillus vaginalis0.114336
Lactiplantibacillus pentosus0.172
Veillonella atypica0.0931
Lacticaseibacillus casei0.051110
Lactobacillus crispatus-0.021415
Lactobacillus acidophilus-0.081813
Lacticaseibacillus rhamnosus-0.128
Heyndrickxia coagulans-0.161421
Odoribacter laneus-0.402
Parabacteroides goldsteinii-0.4324
Lactobacillus jensenii-1.693630
Faecalibacterium prausnitzii-3.3914
Blautia hansenii-12.1222
Blautia wexlerae-12.4601
Bacteroides uniformis-136.4809
Bacteroides thetaiotaomicron-148.9709