Bulk loading via GCS
Stages files in Google Cloud Storage, then runs BigQuery load jobs at warehouse speed — the high-throughput path, not streaming inserts.
Google BigQuery ETL
ETL, ELT, real-time CDC, and Reverse ETL — all into and out of Google BigQuery. GCS-staged load jobs, MERGE-based change data, partitioning and clustering, 270+ source connectors. Predictable monthly pricing instead of per-row consumption.
The problem
Most ways into BigQuery cost you twice. Consumption-priced ETL tools bill per row on top of your on-demand or slot spend, or you wire up Cloud Functions and scheduled queries that someone has to own. Either way, loading the warehouse becomes its own project — and naive loads inflate the bytes every downstream query scans.
Where budgets go to die
CDC pipelines, hourly SaaS syncs, large historical backfills — exactly the data you want in BigQuery — are the workloads consumption-priced tools cost the most for. Etlworks bills per platform tier, not per record, and loads into partitioned, clustered tables so downstream queries scan less. Predictable for your CFO, painless for your data team.
Capabilities
Stages files in Google Cloud Storage, then runs BigQuery load jobs at warehouse speed — the high-throughput path, not streaming inserts.
Log-based CDC from MySQL, Postgres, SQL Server, Oracle, Mongo, DB2 — sub-second latency, MERGE-based deduping into partitioned tables.
Push modeled BigQuery data to Salesforce, HubSpot, Marketo, NetSuite, and 200+ SaaS targets. Same platform, same subscription.
Load into partitioned, clustered tables — managed automatically — to cut scanned bytes and query cost. Schema evolution propagates without DDL drift.
SQL, JavaScript, Python — transform in flight or push down to run inside BigQuery. dbt-friendly, dbt-optional.
GCS-staged load jobs, batched MERGE, and partition-aware writes keep both load throughput high and downstream scan cost low.
Patterns
Every BigQuery data pipeline pattern, configured the same way. No separate tool for CDC, no separate tool for Reverse ETL, no Cloud Functions to maintain by hand.
Stage files in GCS, then run a BigQuery load job. The pattern Google recommends, automated end to end.
Log-based CDC streams change events into BigQuery via MERGE. Sub-second latency, no Kafka.
Push enriched data from BigQuery to Salesforce, HubSpot, Marketo, NetSuite — 200+ SaaS targets.
Pricing transparency
Same workload — Salesforce account changes, Postgres orders, hourly SaaS syncs into BigQuery — priced under three common ETL pricing models. Numbers are approximate, based on public pricing as of 2026, and exclude BigQuery compute itself.
Consumption (per-row)
~$8,000/mo
Scales linearly with row volume. Hidden surge pricing during busy months.
Credit-based
~$3,500/mo
Better, but credits expire, and peak-load tier upgrades add cost.
Etlworks (fixed tier)
$1,000/mo
Standard tier, all features, all rows. Predictable for budgets, painless for data teams.
Specifications
Every part of a BigQuery pipeline you'd actually run — loading, CDC, table design, and security — supported and documented.
Comparing BigQuery ETL tools? See Etlworks vs Fivetran, Matillion, and Airbyte
Proof
Staples Canada integrates data from SQL Server and file-based sources into Google BigQuery, and improved data integration performance 10x by using Etlworks's BigQuery connector with bulk load.
FAQ
MERGE so INSERT, UPDATE, and DELETE are preserved idempotently, with sub-second source latency.Start your trial
Spin up a free trial, point it at your BigQuery dataset, and load production data. See what predictable ETL pricing actually feels like.