This collection of pages describes commercial offering of selected data as a service. Before embarking on starting a business in this area, please read Microbiome Startup for the business challenges.
See these pages for general information.
- Basic Informational Calls documentation
- Suggestions API
- Suggested implementation guidance
- User Agreement – terms and conditions
Simplest Implementation
- From the sample obtain the following information
- NCBI Taxon Number (no need for names) for the bacteria
- Percentage of the sample
- Percentile rank against a reference population (for those reporting some of this bacteria)
- Optional Data
- NCBI Taxon number and the percentage range deemed to be normal (needing no correction)
- Sent this data to the API in a REST call as JSON
- The response will consist of a list of modifiers number and the estimated weight (positive helps, negative hurts)
On the client side:
- The bacteria names shown are those picked by the client
- The modifier names shown are those picked by the client
- Modifiers are excluded or included based on client preferences
- Example: Only products that the client sells as a side business.
An overview of issues to be considered with roll-your-own.
Citation Extraction Process
This video demonstrates the process that is done at least once a month to keep the data current. It is not intended to discover 100% of possible citations, rather just a very high percentage.
Citations Overview
Information on citations are usually given as shown below for an item that modifies a bacteria. The basis of citations are always:
- id — our internal identifier, numeric
- name — this is the common name that we are using, there is an API that will give alternative names
- doi — Digital Object Identifier System
- url — the url to the source or a summary. Note that some url may be behind paywalls or have moved.
{
"id": 40,
"name": "berberine",
"citations": [
{
"id": 143,
"url": "https://pubmed.ncbi.nlm.nih.gov/26923892",
"doi": "0161903/AIM.008",
"logic": "Direct",
"impact": 1
},
{
"id": 283,
"url": "https://pubmed.ncbi.nlm.nih.gov/28447763",
"doi": "10.3892/mmr.2017.6321",
"logic": "Direct",
"impact": -1
}
]
},
Modifier Context
Citations in the context of a bacteria being specified too high or too low may include
- Impact:
- 1 – shifts towards medium
- -1 – shifts away from medium
- 0 – no impact (rarely reported)
- Logic: Indicates the logic connecting See Taxonomy Inference with the Microbiome
- Direct (C) – this bacteria is explicitly Cited in the source
- Children Impacted (D) – Descendents
- Parent Impacted (P) – the hierarchy layer above the bacteria is impacted
The above only applies to the lowest levels of the taxonomy hierarchy as shown in the table below. Note that some items impacts only some species of a genus – a study may report that the genus changes. we cannot be certain that every species will be impacted. In our internal algorithm we diminish the expected impact of items that are not Direct to reflect this uncertainty.
Strain | Species, Subspecies, species group, species subgroup | Genus, Subgenus | |
Impact Strain | C– Direct | P | |
Impact Species | D | C– Direct | P |
Impacts Genus | D | C – Direct |
In a few cases (where there is an absence of data), we may apply this to Family following the maxim “Poor data is better than no data”.
Logic | Facts |
C | 92,289 |
D | 1,738,971 |
P | 9,324 |
For PubMed items, there is C# core sample code on GitHub that will allow you to obtain a summary of each article. Please note that the information may be coming from any of the following:
- The Summary
- The full article
- Appendix to the full article (example CSV files, Excel files, word documents etc)
- Charts or tables (rendered as images) from the article.
Recent Comments