Raw measurement from a system of record.
"James's CBC from Quest Diagnostics"
The evidence substrate for living systems. Every biological fact — lab result, AI prediction, researcher hypothesis — structured, provenance-tagged, and cross-species queryable.
James, 58 — 847 evidence assertions visualized across body systems
The Atomic Unit
Not a row in a table. Not a PDF attachment. An EvidenceAssertion — typed, timestamped, traceable. It knows what kind of claim it is, where it came from, how confident you should be, and what it connects to.
James, 58 — Homo sapiens · TAX_9606
The Friction
“James's oncologist ordered 14 tests across 3 systems over 6 months. By treatment decision time, no one could reconstruct which EGFR result came from which instrument run — or which interpretation was from the AI classifier and which from the pathologist.”
76% of preclinical studies cannot be reproduced. The reason isn't fraud — it's missing provenance. OpenBio is the evidence substrate that fixes the bridge.
Lineage source_fact → 3 derived_features → 12 model_outputs
100%
Traceability across every digital asset generated within the OpenBio mesh.
Epistemic Boundaries
Raw measurement from a system of record.
"James's CBC from Quest Diagnostics"
Standardized and mapped to a shared ontology.
"EGFR mapped to HGNC:3236"
Derived from documents or images via parsing.
"Physician note, NLP-extracted"
Computed from one or more other assertions.
"Progression score from 3 lab values"
AI/ML prediction with attached confidence.
"Recurrence risk: 74% (AlphaFold3)"
Proposed interpretation, under active review.
"EGFR-TKI resistance via T790M"
How Data Moves Through the Levels
Raw sequence data cleaning and metadata normalization.
Produces source_fact · normalized_fact
Cryptographic hashing of biological records.
Validates source_fact integrity
Multi-node validation of experimental results.
Elevates to normalized_fact · extracted_annotation
Clinical-grade archiving of final biological truths.
Archives all levels with full lineage
evidence_vectors
From bacteria to primates — every subject is encoded into the same evidence dimensions. Different shapes, identical axes.
James
TAX_9606 · human
847 assert.
Luna
TAX_9615 · canine
234 assert.
M-4872
TAX_9544 · macaque
512 assert.
HeLa-S3
CELL_HELA · cell line
1,247 pass.
E. coli K-12
TAX_83333 · microbe
4,891 feat.
The Evidence Mesh
The same EvidenceAssertion model spans the entire tree of life. Find an EGFR variant in James. Trace it to macaque trial data. Cross-reference E. coli gene expression. One query. Five organisms.
Query: EGFR-related assertions across subjects
The same EGFR exon 19 deletion in James links to macaque trial results in M-4872 through the evidence mesh.
Built For
Query provenance-complete evidence across organisms. Know whether you're looking at a raw measurement or an AI inference. Compare findings cross-species. Never lose the chain of custody from instrument to publication.
847
assertions on a single human subject
Structured, typed biological data for training and inference pipelines. Every example knows its trust level — preventing source facts from being mixed with model outputs. The data substrate your agents deserve.
6
trust levels, queryable by filter
An evidence API that speaks FHIR, OMOP, GA4GH — and adds provenance none of them offer. Build on infrastructure instead of data engineering heroics. The layer above your existing systems, not a replacement.
5
organism types, one data model