About data sources

A data source is a system where your raw data is stored. Data sources range from traditional databases such as PostgreSQL and MySQL to cloud data warehouses such as Snowflake and object storage systems such as Amazon S3. The data sources module is designed for simple integration and connects to a wide variety of systems, including REST APIs, MongoDB, and other platforms. Connecting requires only minimal credentials, and the structure of your data is discovered automatically as soon as you connect. After you are connected, you can perform several key actions to get the most out of your data.

This approach proactively establishes metadata and data lineage for every new source. This simplifies future compliance audits and enhances the traceability of your data, giving you a clear picture of its origin and journey.
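
To make the idea of discovering structure on connect more concrete, here is a minimal sketch. It does not use the product's own API, which this page does not describe. Instead it uses Python's built-in sqlite3 module as a stand-in source, and the names discover_schema and demo are hypothetical helpers introduced only for illustration. The sketch connects to a database and reads table and column metadata from the system catalog, which is roughly the kind of information a connector would need to collect before it can record metadata for a new source.

```python
import sqlite3


def discover_schema(conn: sqlite3.Connection) -> dict:
    """Return {table_name: [(column_name, declared_type), ...]} for user tables.

    Illustrative only: a real connector would query the catalog of whatever
    system it is connected to (for example, information_schema in PostgreSQL).
    """
    schema = {}
    tables = conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
        "ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # Escape embedded double quotes so the table name is a safe identifier.
        quoted = table.replace('"', '""')
        # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
        columns = conn.execute(f'PRAGMA table_info("{quoted}")').fetchall()
        schema[table] = [(col[1], col[2]) for col in columns]
    return schema


def demo() -> dict:
    """Build a small in-memory database (sample data, assumed) and inspect it."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, email TEXT)")
        conn.execute(
            "CREATE TABLE orders "
            "(id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)"
        )
        return discover_schema(conn)
    finally:
        conn.close()


if __name__ == "__main__":
    for table, columns in demo().items():
        print(table, columns)
```

The catalog is queried once, right after connecting, so the caller needs no prior knowledge of the tables. How the actual module performs discovery for each supported source is not specified here.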

Connecting data sources

To integrate a new data source into the Lakehouse, follow the detailed steps for any of the data sources listed below.

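| Databases       | Data Files and storage services | Business Applications and SaaS |
|-----------------|---------------------------------|--------------------------------|
| Amazon Redshift | Apify Dataset                   | Amplitude                      |
| Databricks      | Azure File                      | ChargeBee                      |
| Google BigQuery | Folder                          | Cvent                          |
| IBM DB2         | S3                              | HubSpot                        |
| MongoDB Atlas   | SFTP Bulk                       | Jira                           |
| MS SQL          |                                 | Monday                         |
| MySQL           |                                 | NetSuite                       |
| PostgreSQL      |                                 | NetSuite JDBC                  |
| SAP HANA        |                                 | NetSuite Reporting             |
| Snowflake       |                                 | Pipedrive                      |
|                 |                                 | Posthog                        |
|                 |                                 | Quickbooks                     |
|                 |                                 | Salesforce                     |
|                 |                                 | Shopify                        |
|                 |                                 | Stripe                         |
|                 |                                 | SurveyMonkey                   |
|                 |                                 | Xero                           |
|                 |                                 | Zendesk Support                |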