Prymat

Prymat Automates High-Volume Data Integration Between SQL Server and BigQuery.

Introduction

Prymat, a leading food manufacturer in Europe, uses Etlworks to move and synchronize large volumes of data between on-premise SQL Server databases and Google BigQuery. By leveraging automatic partitioning and reverse ETL, Prymat has streamlined analytics and operational reporting across its data stack.

The Challenge

Prymat needed a solution to efficiently manage high-volume data movement between legacy and cloud systems:

Large Source Tables: Key operational tables in SQL Server contained millions of rows and grew continuously.
Slow Data Loads: Traditional ETL tools struggled to ingest and transform these tables within acceptable windows.
Reverse ETL Needs: Required bidirectional flows to push processed data from BigQuery back into operational systems.
Scalability and Maintenance: Needed a low-maintenance solution that could scale with growing data volumes.

Why Etlworks

Prymat selected Etlworks for its built-in support for parallel data extraction and tight integration with both SQL Server and BigQuery:

Automatic Partitioning: Etlworks detects physical or logical partitions and distributes processing across parallel threads.
Optimized SQL Server Integration: High-performance extraction using native drivers and custom queries.
Native BigQuery Support: Seamless loading and querying with full control over datasets, partitions, and formats.
Reverse ETL: Easily pushes curated data from BigQuery back to SQL Server and other operational systems.
Monitoring and Scheduling: Reliable job orchestration and alerting with no need for external schedulers or scripts.

The Solution

Etlworks implemented a scalable integration framework tailored to Prymat’s needs:

Partitioned Extraction: Large SQL Server tables are split into chunks based on date ranges, IDs, or partitions. Each chunk is processed in parallel for faster throughput.
BigQuery Loader: Transformed data is loaded into partitioned tables in BigQuery for downstream analytics.
Reverse ETL Flows: Selected BigQuery datasets are synced back to SQL Server to support operational reporting.
Fully Automated Pipelines: All workflows are scheduled, monitored, and logged with error handling and recovery built in.

Results

Faster Processing: Partitioning reduced ETL execution time by over 80% for large tables.
Scalable Architecture: System handles growing data volumes with no manual reconfiguration.
Improved Data Availability: Near real-time updates between SQL Server and BigQuery.
Operational Efficiency: Reduced reliance on manual batch jobs and scripting.

Key Takeaways

Automatic Partitioning: Etlworks distributes heavy ETL loads across partitions for faster, parallel processing.
BigQuery Integration: Native connectors enable fast, schema-aware data loading and querying.
Reverse ETL: Bi-directional flows keep operational and analytical systems in sync.
Enterprise-Grade Reliability: Scalable, monitored, and fully automated ETL pipelines.

Ready to tackle your most complex data challenges? Discover how Etlworks can transform your data integration workflows. Start your free trial today or request a demo.

Tackle your most complex data challenges with Etlworks.

Start free trial → Talk to us