🇪🇺 The open source and cloud independent data lake analytics platform
Unified data lake analytics for your data and AI ambitions
Accelerate your data ambitions with a fully integrated and unified data analytics and AI platform on any data lake (Azure, AWS, GCP, S3, Kubernetes). Use public cloud, private cloud or a hybrid setup. Unilake, the fully integrated, open source data lake analytics platform at scale, from megabytes to petabytes of data


Identify data lineage and trace back to its origins, supporting advanced lineage analysis
Explore Lineage

Accelerate your AI/ML initiatives and workflows from local development to petabyte-scale workloads
Explore AI/ML
Package your data and AI/ML models into production-ready, scalable, and secure solutions
Explore Data Products

Ensure compliance with industry regulations and standards with manual or automated access requests
Explore Access Requests
Manage user access and permissions for data security using AI, scanning your data and identify potential security risks
Explore Data Scanner
Monitor and analyse user activity and access logs, helping you identify and resolve security issues
Explore Audit Logs

Access, Combine, and Scale Your Data on Any Cloud


Open Source
Unilake is a fully open source data and AI platform licensed under the AGPL 3.0 and EUPL license. Giving full freedom to audit and adapt the platform to your needs. Both control plane and compute plane are open source, you can run Unilake fully isolated on an environment of your choice.

At Scale
Whether you have gigabytes or petabytes of data, Unilake is battle tested for both large and small datasets. Analyse vast amounts of data in subseconds securely using our scalable and unified security approach with dynamic ABAC policies. Create branches on your environments, allowing different workspaces and teams to work on data securely and instantly.

Fully Integrated
Unilake integrates with over 300+ source connectors, from databases to APIs. Latest technologies like Daft (dataframes), SqlMesh (data models), StarRocks (SQL Lakehouse), Gravitino (metalake) and Ray (distributed compute) create the foundation of Unilake