_airbyte_raw_*tables storing the JSON blob data first. Afterward, these are transformed and normalized into separate tables, potentially "exploding" nested streams into their own tables if basic normalization is configured.
Bigquery (Denormalized): Instead of splitting the final data into multiple tables, this destination leverages BigQuery capabilities with Structured and Repeated fields to produce a single "big" table per stream. This does not write the
_airbyte_raw_*tables in the destination and normalization from this connector is not supported at this time.
_airbyte_ab_id: a uuid assigned by Airbyte to each event that is processed. The column type in BigQuery is
_airbyte_emitted_at: a timestamp representing when the event was pulled from the data source. The column type in BigQuery is
_airbyte_data: a json blob representing with the event data. The column type in BigQuery is
_airbyte_emitted_atat a daily granularity. Partitions boundaries are based on UTC time. This is useful to limit the number of partitions scanned when querying these partitioned tables, by using a predicate filter (a WHERE clause). Filters on the partitioning column will be used to prune the partitions and reduce the query cost. (The parameter "Require partition filter" is not enabled by Airbyte, but you may toggle this by updating the produced tables if you wish so)