🇪🇺 The open source and cloud independent data lake analytics platform

Unified data lake analytics for your data and AI ambitions

Accelerate your data ambitions with a fully integrated and unified data analytics and AI platform on any data lake (Azure, AWS, GCP, S3, Kubernetes). Use public cloud, private cloud or a hybrid setup. Unilake, the fully integrated, open source data lake analytics platform at scale, from megabytes to petabytes of data

Application Query Result

Access, Combine, and Scale Your Data on Any Cloud

Cloud Unilake

Open Source

Unilake is a fully open source data and AI platform licensed under the AGPL 3.0 and EUPL license. Giving full freedom to audit and adapt the platform to your needs. Both control plane and compute plane are open source, you can run Unilake fully isolated on an environment of your choice.

At Scale

Whether you have gigabytes or petabytes of data, Unilake is battle tested for both large and small datasets. Analyse vast amounts of data in subseconds securely using our scalable and unified security approach with dynamic ABAC policies. Create branches on your environments, allowing different workspaces and teams to work on data securely and instantly.

Fully Integrated

Unilake integrates with over 300+ source connectors, from databases to APIs. Latest technologies like Daft (dataframes), SqlMesh (data models), StarRocks (SQL Lakehouse), Gravitino (metalake) and Ray (distributed compute) create the foundation of Unilake