- How to build a data project with step-by-step instructions
- What are the Key Parts of Data Engineering?
- Data Engineering Interview Series #1: Data Structures and Algorithms
- How to implement data quality checks with greatexpectations
- What are the types of data quality checks?
- SQL or Python for Data Transformations?
- Why use Apache Airflow (or any orchestrator)?
- Data Engineering Projects
- Data Engineering Project for Beginners - Batch edition
- Build Data Engineering Projects, with Free Template
- Python Essentials for Data Engineers
- dbt(Data Build Tool) Tutorial
- Building Cost Efficient Data Pipelines with Python & DuckDB
- Enable stakeholder data access with Text-to-SQL RAGs
- How to reduce your Snowflake cost
- How to test PySpark code with pytest
- Docker Fundamentals for Data Engineers
- Data Engineering Best Practices - #2. Metadata & Logging
- Uplevel your dbt workflow with these tools and techniques
- What is an Open Table Format? & Why to use one?
- 6 Steps to Avoid Messy Data in Your Warehouse
- Data Engineering Best Practices - #1. Data flow & Code
- What is a self-serve data platform & how to build one
- How to become a valuable data engineer
- Data Engineering Project: Stream Edition
- Change Data Capture, with Debezium
- Data Pipeline Design Patterns - #2. Coding patterns in Python
- Data Pipeline Design Patterns - #1. Data flow patterns
- How to gather requirements for your data project
- 5 Steps to land a high paying data engineering job
- Setting up a local development environment for python data projects using Docker
- What is the difference between a data lake and a data warehouse?
- End-to-end data engineering project - batch edition
- Automating data testing with CI pipelines, using Github Actions
- How to choose the right tools for your data pipeline
- Setting up end-to-end tests for cloud data pipelines
- How to improve at SQL as a data engineer
- 6 Responsibilities of a Data Engineer
- 6 Key Concepts, to Master Window Functions
- What are Common Table Expressions(CTEs) and when to use them?
- Whats the difference between ETL & ELT?
- How to add tests to your data pipelines
- 10 Skills to Ace Your Data Engineering Interviews
- What is a staging area?
- What is a Data Warehouse?
- How to Scale Your Data Pipelines
- Understand & Deliver on Your Data Engineering Task
- 4 Key Patterns to Load Data Into A Data Warehouse
- How to Validate Datatypes in Python
- Designing a Data Project to Impress Hiring Managers
- How to make data pipelines idempotent
- Writing memory efficient data pipelines in Python
- How to gather requirements to re-engineer a legacy data pipeline
- How to trigger a spark job from AWS Lambda
- How to set up a dbt data-ops workflow, using dbt cloud and Snowflake
- Apache Superset Tutorial
- How to Join a fact and a type 2 dimension (SCD2) table
- How to update millions of records in MySQL?
- How to unit test sql transforms in dbt
- How to Backfill a SQL query using Apache Airflow
- How to do Change Data Capture (CDC), using Singer
- How to Pull Data from an API, Using AWS Lambda
- How to submit Spark jobs to EMR cluster from Airflow
- Ensuring Data Quality, With Great Expectations
- Designing a "low-effort" ELT system, using stitch and dbt
- 3 Key techniques, to optimize your Apache Spark code
- What, why, when to use Apache Kafka, with an example
- A proven approach to land a Data Engineering job
- What Does It Mean for a Column to Be Indexed
- Advantages of Using dbt(Data Build Tool)
- Apache Airflow Review: the good, the bad
- Review: Building a Real Time Data Warehouse
- 3 Key Points to Help You Partition Late Arriving Events
- Scheduling a SQL script, using Apache Airflow, with an example
- 10 Key skills, to help you become a data engineer