Published in


An explanation of how Tag.bio is awesome in the most concise way I can think of at the moment

I’m the technical co-founder of Tag.bio, and the software platform we’ve built is my baby. So naturally, I’m biased—this 10-point overview is intended to clarify my declarations of technological awesomeness with more objectivity.

Disclaimer — this is intended to be as concise as possible, so it’s chock-full of technical jargon. If that suits you, please continue.

The 30,000 foot view

1) Tag.bio provides secure, advanced data research software to healthcare and life sciences customers deployed to cloud or on-premise — integrating one or more public/proprietary datasets.

2) Our flexible platform, forward-thinking User Experience, reproducibility, federated architecture, and agile collaboration process with customers make deployments of Tag.bio software far more rapid and useful than anything else out there.

3) With Tag.bio, biologists and clinicians within an organization are enabled to repeatedly ask and answer their own questions on high-value datasets - without waiting for bioinformatics, statistics, or IT support. As a result, our customers — healthcare providers and biomedical research groups — are currently seeing 10x to 1000x acceleration of data research.

The deeper technical dive

4) Tag.bio's analysis engine — our "dataset hypervisor" — is not designed to replace an existing data warehouse, but rather "sits on top of it", and delivers enhanced analysis functionality and connectivity via API. Tag.bio is therefore uniquely capable of leveraging an organization's prior and ongoing efforts to clean and assemble high-value datasets.

5) As soon as each Tag.bio dataset hypervisor is deployed into our web portal — typically within one to five days — end users can then rapidly iterate on important research questions via "protocol apps". Protocol apps give biologists and clinicians simple, tailored analysis workflows to conduct repeatable, reproducible, and collaborative data research like never before.

6) Tag.bio's federated architecture facilitates multiple dataset hypervisors to be deployed independently to our web portal by different research groups/departments. Different dataset hypervisors in the Tag.bio platform often have significantly different protocol apps, so expansion of data diversity and emergence of new analysis needs over time do not dilute utility or increase technical debt within the platform.

Return on investment

7) Our agile deployment process and federated architecture breaks down organizational data silos significantly faster than centralized "data lake" efforts, which require either authoritarian control or consensus building across all departmental stakeholders — both of which are incredibly inefficient, if not impossible.

8) In other words, Tag.bio makes data actionable to an organization in days, not years. At the same time, prior and ongoing efforts to clean and assemble data into warehouses can still be leveraged by Tag.bio to great effect.

9) When an organization does not have software that lets a biologist/clinician ask and answer their own complex data research questions, the alternative human-support process — bioinformatics/biostatistics services — is extremely slow and costly. We view this as a Last Mile Problem.

10) Tag.bio’s platform architecture and User Experience accelerate most data research workflows currently requiring human labor from weeks to seconds. As mentioned above, our customers are currently seeing 10x to 1000x acceleration of data research.

So yeah, it’s awesome

Thanks for reading. Stay tuned!



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Jesse Paquette

Jesse Paquette


Full-stack data scientist, computational biologist, and pick-up soccer junkie. Brussels and San Francisco. Opinions are mine alone.