DataGen

Availability: Core Standard Plus Pro Enterprise Flex Self-Managed Enterprise PyAirbyte
Support Level: Marketplace
Connector Version: 0.1.6 (last updated 4 months ago)
Sync Success Rate
Usage Rate
Definition ID: f14d5125-dc0d-4f6c-abe5-acde821a2203

The DataGen source connector generates synthetic data for testing and development purposes. This connector is designed for end-to-end testing of data destinations and for testing Airbyte configurations in speed mode without requiring access to an external data source.

Prerequisites

No prerequisites are required to use this connector. DataGen generates data locally and does not connect to any external systems.

Setup guide

Log in to your Airbyte Cloud or Airbyte Open Source account.
Click Sources and then click + New source.
On the Set up the source page, select DataGen from the Source type dropdown.
Enter a name for your DataGen source.
Configure the data generation settings:
- Data Generation Type: Choose either Incremental or All Types.
- Max Record: Specify the total number of records to generate (minimum 1, maximum 100 billion). Default is 100.
- Max Concurrency (optional): Set the maximum number of concurrent data generators. Leave empty to let Airbyte optimize performance automatically.
Click Set up source.

Supported sync modes

The DataGen source connector supports the following sync mode:

Feature	Supported?
Full Refresh Sync	Yes
Incremental Sync	No

Supported data generation types

The connector supports two data generation patterns:

Incremental

Generates a stream named increment with a single column named id that contains monotonically increasing integers. This mode is useful for testing incremental data loading and verifying that data arrives in the expected order.

All types

Generates a stream named all types with columns for various Airbyte data types, including id, string, boolean, number, big integer, big decimal, date, time (with and without time zones), timestamp (with and without time zones), and JSON. This mode is useful for testing type handling and schema compatibility across different destinations.

Reference

Config fields reference

Field

Type

Property name

object

flavor

integer

max_records

integer

concurrency

Changelog

Expand to review

Version	Date	Pull Request	Subject
0.1.6	2025-10-23	68611	Update cdk version
0.1.5	2025-10-21	68581	Update dataChannel version
0.1.4	2025-10-16	68131	Increment naming fix
0.1.3	2025-10-16	68129	Increment encoding fix
0.1.2	2025-10-14	67720	Removal of Array type
0.1.1	2025-10-13	67110	Addition of proto types
0.1.0	2025-09-29	66331	Creation of initial DataGen Source

Prerequisites​

Setup guide​

Supported sync modes​

Supported data generation types​

Incremental​

All types​

Reference​