Blog

Welcome to The Scalable Way Blog — your source for insights, strategies, and best practices in data platform engineering, analytics, and data science.

Blog posts

Running dbt Rescue Rebuild in Production: Operational Playbooks, Failure Models, and Recovery Patterns

dbt data reliability pipeline recovery

Go beyond the setup and into real-world execution. Learn how we run dbt rescue rebuilds in production: scoping dependencies, managing warehouse contention, handling incremental models, and recovering from outages with precision, without introducing new risks to pipeline stability.

The Rescue dbt_rerun Deployment: Rebuilding Changed and Broken Models Without Disrupting Production

dbt data reliability pipeline recovery

Keeping production data correct after a dbt change is harder than it looks. Learn how we introduced a dedicated rescue deployment to rebuild exactly what’s needed and when it’s needed, bringing consistency back to production data without costly full reruns or pipeline disruptions.

Why Data Teams Struggle Without Separate Dev and Prod Environments

Data Engineering Dev vs Prod Data Infrastructure CI/CD

When development and production share the same data environment, even small changes can trigger costly outages. This article explains why separating dev and prod is foundational for reliable analytics, and how teams can do it without overengineering or blowing the budget.

Data Platform Cost Optimization: Practical Strategies for Query Performance, Storage, and Cloud Resource Management

Data Platform Optimization Cloud Cost Management Query Performance Data Engineering

Explore how you can dramatically reduce data platform costs without sacrificing performance. This guide breaks down actionable techniques across query tuning, incremental data loading, cloud resource management, and storage lifecycle design.

SAP Data Ingestion with Python: A Technical Breakdown of Using the SAP RFC Protocol

SAP Python Data Integration RFC Protocol Data Engineering

Streamline SAP data integration with Python by leveraging the RFC protocol. This interview with the lead engineer of a new SAP RFC Connector explores the challenges of large-scale data extraction and explains how a C++ integration improves stability, speed, and reliability for modern data workflows.

CI/CD for Data Workflows: Automating Prefect Deployments with GitHub Actions

prefect prefect worker github actions CI/CD data workflows data platform architecture productized data platform

The final part of the Data Platform Infrastructure on GCP series covers CI/CD for Prefect deployments using GitHub Actions and Docker. Automate flow builds, worker updates, and streamline orchestration across environments.