👋 Welcome to BastionLab!
Where data owners and data scientists can securely collaborate without exposing data - opening the way to projects that were too risky to consider.
What is BastionLab?
BastionLab is a simple privacy framework for data science collaboration, covering data exploration and AI traning.
It acts like an access control solution, for data owners to protect the privacy of their datasets, and stands as a guard, to enforce that only privacy-friendly operations are allowed on the data and anonymized outputs are shown to the data scientist.
- Data scientists can remotely run queries on data frames and train their models without seeing the original data or intermediary results.
This wasn’t possible until now for highly regulated fields like health, finance, or advertising. When collaborating remotely, data owners had to open their whole dataset, often through a Jupyter notebook. This was dangerous because too many operations were allowed and the data scientist had numerous ways to extract information from the remote infrastructure (print the whole database, save the dataset in the weights, etc).
BastionLab solves this problem by ensuring that no information is ever accessible locally to the data scientist.
BastionLab is an open-source project. Our solution is coded in Rust 🦀, uses Polars 🐻, a pandas-like library for data exploration, and Torch 🔥, a popular library for AI training. We also have an option to set-up confidential computing 🔒, a hardware-based technology that ensures no one but the processor of the machine can see the data or the model.
We’ll update the documentation as new features come in, so dive in!
- Follow our “Quick tour” tutorial
- Read about the technologies we use to ensure privacy
- Find our benchmarks documenting BastionLab’s speed
- Go to our Discord #support channel
- Report bugs by opening an issue on our BastionLab Github
- Book a meeting with us
How do we organize the documentation?
How-to guides are recipes. They guide you through the steps involved in addressing key problems and use cases. They are more advanced than tutorials and assume some knowledge of how BastionLab works.
Concepts guides discuss key topics and concepts at a high level. They provide useful background information and explanations, especially on cybersecurity.
API Reference contains technical references for BastionLab’s API machinery. They describe how it works and how to use it but assume you have a good understanding of key concepts.
Security guides contain technical information for security engineers. They explain the threat models and other cybersecurity topics required to audit BastionLab's security standards.
Advanced guides are destined to developpers wanting to dive deep into BastionLab and eventually collaborate with us to the open-source code. We'll cover in the future exactly how to do so.
Who made BastionLab?
Mithril Security is a startup aiming to make privacy-friendly data science easy so data scientists and data owners can collaborate without friction. Our solutions apply Privacy Enhancing Technologies and security best practices, like Remote Data Science and Confidential Computing.