
Prymat

Prymat Automates High-Volume Data Integration Between SQL Server and BigQuery

Introduction

Prymat, a leading food manufacturer in Europe, uses Etlworks to move and synchronize large volumes of data between on-premises SQL Server databases and Google BigQuery. By leveraging automatic partitioning and reverse ETL, Prymat has streamlined analytics and operational reporting across its data stack.

The Challenge

Prymat needed a solution to efficiently manage high-volume data movement between legacy and cloud systems:

  • Large Source Tables: Key operational tables in SQL Server contained millions of rows and grew continuously.

  • Slow Data Loads: Traditional ETL tools struggled to ingest and transform these tables within acceptable windows.

  • Reverse ETL Needs: Prymat required bidirectional flows to push processed data from BigQuery back into operational systems.

  • Scalability and Maintenance: Needed a low-maintenance solution that could scale with growing data volumes.

Why Etlworks

Prymat selected Etlworks for its built-in support for parallel data extraction and tight integration with both SQL Server and BigQuery:

  • Automatic Partitioning: Etlworks detects physical or logical partitions and distributes processing across parallel threads.

  • Optimized SQL Server Integration: High-performance extraction using native drivers and custom queries.

  • Native BigQuery Support: Seamless loading and querying with full control over datasets, partitions, and formats.

  • Reverse ETL: Easily pushes curated data from BigQuery back to SQL Server and other operational systems.

  • Monitoring and Scheduling: Reliable job orchestration and alerting with no need for external schedulers or scripts.

The Solution

Etlworks implemented a scalable integration framework tailored to Prymat’s needs:

  • Partitioned Extraction: Large SQL Server tables are split into chunks based on date ranges, IDs, or partitions. Each chunk is processed in parallel for faster throughput.

  • BigQuery Loader: Transformed data is loaded into partitioned tables in BigQuery for downstream analytics.

  • Reverse ETL Flows: Selected BigQuery datasets are synced back to SQL Server to support operational reporting.

  • Fully Automated Pipelines: All workflows are scheduled, monitored, and logged with error handling and recovery built in.
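
As an illustration of the partitioned extraction idea described above, here is a minimal Python sketch that splits an ID range into chunks and processes the chunks in parallel threads. It is not Etlworks' implementation: the function names (id_chunks, extract_chunk, extract_in_parallel), the chunk size, and the worker count are all hypothetical, and extract_chunk is a placeholder for what would in practice be a range-bounded query against SQL Server.

```python
from concurrent.futures import ThreadPoolExecutor


def id_chunks(min_id, max_id, chunk_size):
    """Split the inclusive range [min_id, max_id] into inclusive (lo, hi) chunks."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    chunks = []
    lo = min_id
    while lo <= max_id:
        hi = min(lo + chunk_size - 1, max_id)
        chunks.append((lo, hi))
        lo = hi + 1
    return chunks


def extract_chunk(bounds):
    """Hypothetical placeholder for extracting one chunk.

    A real pipeline would run a range-bounded query here (for example,
    selecting rows whose ID falls between lo and hi) and hand the rows
    to a loader. This stub only returns the number of IDs in the chunk.
    """
    lo, hi = bounds
    return hi - lo + 1


def extract_in_parallel(min_id, max_id, chunk_size, workers=4):
    """Process every chunk on a thread pool; results keep chunk order."""
    chunks = id_chunks(min_id, max_id, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        counts = list(pool.map(extract_chunk, chunks))
    return chunks, counts


if __name__ == "__main__":
    chunks, counts = extract_in_parallel(1, 10, 4)
    print(chunks)
    print(counts)
```

The same chunking approach would apply to date ranges: only the boundary arithmetic in id_chunks changes, while the parallel dispatch stays the same.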

Results

  • Faster Processing: Partitioning reduced ETL execution time by over 80% for large tables.

  • Scalable Architecture: System handles growing data volumes with no manual reconfiguration.

  • Improved Data Availability: Near real-time updates between SQL Server and BigQuery.

  • Operational Efficiency: Reduced reliance on manual batch jobs and scripting.

Key Takeaways

  • Automatic Partitioning: Etlworks distributes heavy ETL loads across partitions for faster, parallel processing.

  • BigQuery Integration: Native connectors enable fast, schema-aware data loading and querying.

  • Reverse ETL: Bidirectional flows keep operational and analytical systems in sync.

  • Enterprise-Grade Reliability: Scalable, monitored, and fully automated ETL pipelines.


Ready to tackle your most complex data challenges? Discover how Etlworks can transform your data integration workflows. Start your free trial today or request a demo.
