How Matchpoint Therapeutics accelerated discovery by fusing siloed data into a unified pipeline

Highlights

Unified data pipeline

Integrating own assay results, public data and off-the-shelf tools in a flexible cloud

Speed and flexibility

Experiment-to-insight in realtime, tailored to Matchpoint’s unique approach

Data quality by design

Maximum automation with scientist input guided by intelligent assistants

Client

Matchpoint is a biotechnology company harnessing the power of covalency to discover precision covalent medicines to transform the treatment of immune diseases and other serious illnesses. The company's proprietary Advanced Covalent Exploration (ACE) platform integrates advanced chemoproteomics, machine learning and covalent chemistry library evolution. Matchpoint has an emerging pipeline of novel covalent medicines initially focused on immunology.

The company was founded in 2021 by Atlas Venture and Access Biotechnology together with leading scientists from the Dana Farber Cancer Institute and Stanford and has raised $100M to date.

Challenges

Speed, flexibility and data quality need to be balanced out

High volume of non-standard experiments

Cross-functional teams need to work independently of data support

Solution

Pipeline architecture

Success factor

Custom dashboards for all standardized analysis

Moving quickly from experiment to insight to decision is key to accelerating discovery.

Therefore, we created custom dashboards automating the most common types of analysis, reflecting Matchpoint’s unique way of looking at the data.

Moreover, experimental results are contextualized with further annotations integrated from external databases.

Success factor

Convenient user interfaces supporting manual data ingestion steps along the pipeline

When ingesting new data, expert input is crucial e.g. for quality checks and organizing it into meaningful logical units.

To make this as efficient as possible, we developed a web-based user interface which guides through the entire process.

Apart from speed and convenience, this also reduces the scope for errors and enforces Matchpoint’s internal standards by design.

Success factor

Close collaboration with the science and platform technology teams on all key design decisions

To optimally support discovery, data structure and pipeline setup need to be closely aligned with the R&D strategy.

Therefore, we co-designed the system to match current and anticipated future needs.

This created a high level of transparency and alignment, reflected in a visual map of the pipeline on a shared canvas, which is continuously updated with changes and feature requests.

What our client says

“It is a pleasure working with the idalab team on our data and machine learning pipeline. They are an outstanding strategic partner, collaborating seamlessly with our science team. Fast, clear communication, structured — yet always happy to adapt ad hoc, if necessary. We are looking forward to continuing the collaboration.“
Suresh Singh, PhD
Senior Vice President,
Computational Sciences

Get in touch

Let's talk data

If you are not getting what you want out of your data: Let’s talk.

Fully confidential (NDA), with no strings attached.

Schedule a call

Frequently asked questions

We supported Matchpoint along the entire process. Working closely with the science team, we conceptualised the pipeline in a series of workshops. Afterwards, we implemented it piece-by-piece in Matchpoint’s cloud environment while keeping the science team in the loop to ensure data quality and consistency.
Twelve weeks from the kick-off workshop to the delivery of the initial pipeline.
Anything goes, even custom tools written in Fortran can be brought into a 21st century cloud pipeline.
Definitely! We would love to team up with your data engineering team, and actively support phasing us out when the time is right.
- Programming language: Python
- Web-app development: Streamlit (a Python package)
- Data lake: Google Cloud Storage
- Data warehouse: Google BigQuery
- Web-app deployment: Google App Engine
- Securing web-app access: Google Identity-Aware Proxy
- Integration of external tools: Google Cloud Functions
For this project, the requirement was to implement everything in Matchpoint’s Google Cloud environment, but any other cloud provider (e.g., AWS, Azure) works as well.

Learn more

Insights

Data management for early-stage biotechs: how to get started on the right path

Data management at a biotech needs to work for everyone – from CEO’s courting investors to lab scientists designing the next experiment. Get it wrong and you’ll either need to put up with your system’s foibles or go through a painful migration to a new system.