Skip to content
/ dozer Public

Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.

License

Notifications You must be signed in to change notification settings

getdozer/dozer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

8b0b4cc · Apr 16, 2024
Feb 29, 2024
Feb 20, 2024
Apr 16, 2024
Feb 25, 2024
Aug 29, 2023
Nov 24, 2022
Apr 16, 2024
Apr 2, 2024
Mar 19, 2024
Apr 16, 2024
Apr 16, 2024
Apr 16, 2024
Apr 16, 2024
Apr 2, 2024
Apr 16, 2024
Mar 19, 2024
Sep 6, 2023
Apr 2, 2024
Apr 25, 2023
Sep 20, 2023
Jan 13, 2023
Apr 24, 2023
Apr 16, 2024
Apr 16, 2024
Sep 26, 2023
Jan 17, 2024
Apr 16, 2024
Mar 6, 2023
Mar 19, 2024

Overview

Dozer is a real time data movement tool leveraging CDC from various sources to multiple sinks.

Dozer is magnitudes of times faster than Debezium+Kafka and natively supports stateless transformations. Primarily used for moving data into warehouses. In our own application, we move data to Clickhouse and build data APIs and integration with LLMs.

How to use it

Dozer runs with a single configuration file like the following:

app_name: dozer-bench
version: 1
connections:
  - name: pg_1
    config: !Postgres
      user: user
      password: postgres
      host: localhost
      port: 5432
      database: customers
sinks:
  - name: customers
    config: !Dummy
      table_name: customers

Full documentation can be found here

Supported Sources

Connector Extraction Resuming Enterprise
Postgres
MySQL
Snowflake
Kafka 🚧
MongoDB 🎯
Amazon S3 🎯
Google Cloud Storage 🎯
**Oracle Enterprise Only
**Aerospike Enterprise Only

Supported Sinks

Database Connectivity Enterprise
Clickhouse
Postgres
MySQL
Big Query
Oracle Enterprise Only
Aerospike Enterprise Only