About Lakehouse
Lakehouse provides a unified platform for storing diverse data (structured, semi-structured, unstructured) and performing advanced analytics. Key capabilities include:
-
Simplified data ingestion: Streamlines data movement from databases, APIs, and files.
-
Data transformation: Enables enrichment, aggregation, and transformation of data.
-
Data cataloging: Facilitates easy management and discovery of data assets.
-
Pipeline creation: Create complex pipelines from any database or service.
-
Manage transformations: Overwrite, append (full/incremental), and deduplicate data for optimal control.
-
Orchestrate data: Drag and drop pipelines - build complex pipelines visually. Easily orchestrate data ingestion, transformation, and data movement.
-
ER diagram generation: Automatically infers table/column relationships and creates visual ER diagrams.
-
Materialized views: Accelerates query performance through advanced materialized views.
-
Granular access control: Ensures data security and governance through user permission management.
-
Schema change detection: Proactively tracks and logs alterations to source table structures, including added/removed columns and data type modifications.
-
Data lineage: Traces your data's complete lifecycle, visualizing its origin, movement, and transformations across your systems.
Lakehouse comprises of the following: