Cover of Data Quality and Observability
DRM-free · Yours to keep forever
Data Engineering

Data Quality and Observability

Contracts, tests and lineage for pipelines you can trust

By Houssam Kodad

PDF 152 pages Intermediate English

One-time purchase

€22.95

VAT included
where applicable

Download sample
  • Instant download after purchase
  • Readable on any device
  • Free updates to this edition
  • Secure checkout

About this book

What's inside

The fastest way to lose a stakeholder's trust is a dashboard that's quietly wrong. This concise book lays out a practical system for data quality: tests that catch issues before users do, contracts that stop bad data at the source, and observability that tells you when a pipeline silently breaks. It's a playbook for making 'is this number right?' a question you can answer with confidence.

What you'll learn

Skills you'll walk away with

  • Define data quality dimensions that actually matter
  • Write tests for freshness, volume, schema and distribution
  • Set up data contracts between producers and consumers
  • Detect anomalies and silent pipeline failures
  • Track lineage to find blast radius fast
  • Design alerting that signals without crying wolf
  • Build a culture and SLA around trustworthy data

Table of contents

8 chapters
  1. 01

    What Trustworthy Data Means

    • · The six dimensions of quality
    • · Quality as a product feature
    • · The cost of a wrong number
  2. 02

    Tests That Catch Issues Early

    • · Schema and not-null tests
    • · Freshness and volume checks
    • · Distribution and referential tests
  3. 03

    Data Contracts at the Source

    • · Producer responsibilities
    • · Enforcing contracts in CI
    • · Handling breaking changes
  4. 04

    Anomaly Detection for Pipelines

    • · Static thresholds vs learned baselines
    • · Seasonality and drift
    • · Reducing false positives
  5. 05

    Lineage and Blast Radius

    • · Table and column lineage
    • · Tracing an incident upstream
    • · Impact analysis before changes
  6. 06

    Alerting Without Alert Fatigue

    • · Severity and ownership
    • · Routing and escalation
    • · Tuning noisy checks
  7. 07

    SLAs, SLOs and Data Reliability

    • · Setting freshness SLAs
    • · Measuring reliability
    • · Reporting to stakeholders
  8. 08

    Building a Quality Culture

    • · Ownership and on-call
    • · Post-incident reviews
    • · A rollout roadmap

This is the full chapter list — exactly what you'll receive in the PDF.