Tags
Deploying Prefect on any Cloud Using a Single Virtual Machine
A pattern to improve observability, monitoring and, ultimately, data operations with Prefect. We show how to find the right trade off between number of deployments and improved operations.
dlt and Prefect, a Great Combo for Streamlined Data Ingestion Pipelines
Streamline your data ingestion pipelines with dlt and Prefect. This article explores how combining these powerful open-source tools enables scalable, efficient, and production-ready data workflows. Learn best practices, key features, and real-world insights to simplify data engineering.
Breaking Down Prefect Deployments To Improve The Data Ops Efficiency
Discover how breaking down monolithic ETL flows into modular deployments enhances observability, streamlines troubleshooting, and boosts scalability. Learn to design data pipelines that evolve with your needs while maintaining performance and reliability.
What is a Modular Data Platform?
Learn why modularity is crucial for building scalable, efficient data architectures. This article covers the core components of modern data platforms, from ingestion to governance, and shares best practices for flexibility, interoperability, and security.
Organizing Networking for Data Platforms: Key Connectivity Options
Optimize your data platform by making informed networking decisions. This article explores how networking impacts ELT workflows, covering key connectivity options, security considerations, and best practices. Learn how to design a secure, scalable, and high-performing data platform architecture with the right networking.
How to Setup Data Platform Infrastructure on Google Cloud Platform with Terraform
Learn how to set up a secure, scalable data platform infrastructure on Google Cloud Platform (GCP) using Terraform. This step-by-step guide covers VPC configuration, Compute Engine setup, firewall rules, Identity-Aware Proxy (IAP), Cloud NAT, and more, ensuring a cost-effective, flexible, and secure foundation for your data platform.
Roles in the Context of the Analytics Workflow
Code-based analytics workflows offer a unique advantage - the ability to combine robust collaboration with strict governance. While technically straightforward to implement, this approach only reaches its full potential when aligned with well-adapted business and analytics processes.
Getting to Your First Flow Run: Prefect Worker & Deployment Setup
Run your first data ingestion workflow with Prefect, Docker, and Kubernetes. This guide walks through containerized flow execution, Prefect worker deployment, and clean deployment configs, laying the foundation for a scalable, maintainable orchestration layer.
Scaling Secure Data Access: A Systematic RBAC Approach Using Entra ID
Establish scalable, secure access controls for your data platform with a systematic RBAC strategy built on Microsoft Entra ID. This article outlines a five-phase implementation—from user persona mapping to automated auditing—designed to balance flexibility, compliance, and operational efficiency.
CI/CD for Data Workflows: Automating Prefect Deployments with GitHub Actions
The final part of the Data Platform Infrastructure on GCP series covers CI/CD for Prefect deployments using GitHub Actions and Docker. Automate flow builds, worker updates, and streamline orchestration across environments.
SAP Data Ingestion with Python: A Technical Breakdown of Using the SAP RFC Protocol
Streamline SAP data integration with Python by leveraging the RFC protocol. This interview with the lead engineer of a new SAP RFC Connector explores the challenges of large-scale data extraction and explains how a C++ integration improves stability, speed, and reliability for modern data workflows.
Data Platform Cost Optimization: Practical Strategies for Query Performance, Storage, and Cloud Resource Management
Explore how you can dramatically reduce data platform costs without sacrificing performance. This guide breaks down actionable techniques across query tuning, incremental data loading, cloud resource management, and storage lifecycle design.