Suggestions Cross Validation using PubMed

PrecisionBiome.EU, who is using the Microbiome Prescription suggestion engine (with minor customization), ask me to update the engine’s Cross Validation statistics. They are doing Shotgun analysis in conformity to all EU laws and regulations. It has been almost 2 years since I posted Cross Validation of AI Suggestions for Nonalcoholic Fatty Liver Disease which had 92% agreement. As you can see below, we are now at 97% for NAFLD.

Cross validation means computing suggestions from the typical shifts reported on PubMed. For simple validation, we do a new count of the highs and lows. Some studies report a bacteria being high, a different study reported it being low for the same condition — both claiming statistical significant. I term this data to be fuzzy.

We then run the resulting shifts through the suggestion engine and get two numbers:

  • Matches — suggestions says to take and studies report that it helps the condition (for some people in the study)
  • Not Matches — suggestions says to avoid and studies report that it helps the condition (for some people in the study)

Again, we see different results reported in the same study, we have more fuzzy data. Last item is that we need a sufficient number of substances known to help the condition to reduce randomness.

The results are below. My criteria starting Microbiome Prescription was to do better than random for suggestions. Random would be 50%. We exceed that.

ConditionNameMatchesNot MatchesPercentage Correct
Depression65790%
Irritable Bowel Syndrome46787%
Biofilm390100%
Chronic Fatigue Syndrome371867%
Crohn’s Disease35783%
High Histamine/low DAO35295%
hypertension (High Blood Pressure33587%
Nonalcoholic Fatty Liver Disease  (nafld) Nonalcoholic30197%
Obesity30391%
Autism28780%
Inflammatory Bowel Disease24196%
Type 2 Diabetes24389%
obsessive-compulsive disorder24389%
Allergic Rhinitis (Hay Fever)21195%
Ulcerative colitis20969%
Asthma18869%
Constipation19290%
Brain Trauma151060%
Metabolic Syndrome140100%
Stress / posttraumatic stress disorder14193%
Mast Cell Issues / mastitis130100%
Functional constipation / chronic idiopathic constipation13381%
Cancer (General)12571%
Overall60910386%

Let us look at the poor performers above:

  • Chronic Fatigue Syndrome has a complex set of causes: COVID, Lyme Disease, Food Poisoning etc.
  • Brain Trauma can be from many causes also, having a wide variation
  • Cancer – this is also a big collection of things

From the point of view of most Artificial Intelligence methodologies, 86% is very good.

Bottom Line

None of the above conditions are “clean”, in general there can be a multitude of subsets for each condition; each subset likely have different microbiome shifts. For every modifier, you have a percentage of responders, non-responders and adverse events. Furthermore, diet styles of the patients impacts bacteria shifts. In short, all of the data is fuzzy. This is not a concern for me because I have several decades dealing with fuzzy logic issues professionally.

P.S. IMHO, this is the best way of producing evidence that any suggestion engine are producing reasonable results. The usual clinical study methodology just does not work.