Pricing & Features

Open Source: Free
Teams: Contact us

Core Features
Unstructured Storages
Unstructured Data Types
No Data Duplication
Metadata Extraction
Structured Data in DBs
Data Versioning & Lineage
Semantic Search & Filters
Flexible Python Pipelines
Parallel Processing

High-Scale Datasets
Size of data: Terabytes scale (Open Source), Petabytes scale (Teams)
Dataset Cardinality: Up to 30M items (Open Source), Up to 1B+ items (Teams)
Metadata engine

Coding Copilot
Cursor & GitHub Copilot: Coming soon (Open Source), Early Access (Teams)
Lineage-Aware Autocomplete: Coming soon (Open Source), Early Access (Teams)
MCP: Coming soon (Open Source), Early Access (Teams)

High-Scale Processing
Distributed Processing
Cloud Support
Auto-scaling

Team Collaboration
Shared Dataset Registry
Web UI
SSO/SAML
RBAC for Data

Deployments
Local
SaaS
Bring Your Own Cloud
On-Premise
