My Data Quality Tools List

    Discover the best 31 data quality, data testing, data observability, shift-left data quality, and unified data trust tools with data governance features. The most comprehensive, actionable, and up-to-date list you'll find. Trust me.

    By Ari Bajo - Data Engineer turned Writer.

    Updated on February 24, 2026

    My Data Quality Tools Landscape - Visual overview of data quality, testing, and observability tools
    Click to enlarge

    Great Expectations

    GX Cloud is a managed data quality platform with a library of expectations based on the open-source GX Core Python library, data profiling, scheduling, and anomaly detection.

    OSStestingobservability
    My Opinion

    Best for data teams looking for a mix of UI-managed tests and custom Python workflows to validate a mix of files, SQL databases, data warehouses, and in-memory DataFrames.

    Soda

    Soda Cloud is a flexible data quality platform based on the open-source Soda Core Python library with built-in metrics to write data tests, monitors, and contracts in YAML, a UI, or AI.

    OSStestingobservabilitycontracts
    My Opinion

    Best for data engineering teams looking to embed tests at every pipeline stage, quarantining failed records, and bi-directional integrations with data governance tools.

    AWS Glue Data Quality

    Managed data quality platform built on top of the open-source Deequ framework with data quality rulesets, scheduling, data quality dashboards, and anomaly detection.

    OSStestingobservability
    My Opinion

    Best for data teams using the AWS Glue Data Catalog and AWS Glue ETL jobs that want to monitor data quality at rest and in transit, with the possibility to quarantine data.

    Google CloudDQ

    Cloud-native data validation CLI with YAML-based data quality checks for BigQuery tables and GCS structured data.

    OSStesting
    My Opinion

    Best for data teams looking for a BigQuery-native solution to write reusable SQL checks and consume data quality outputs programmatically.

    DQX by Databricks

    Data quality framework for Apache Spark with data quality rule generation from profiling results, YAML and Python-based data validation checks, and a data quality dashboard.

    OSStestingobservability
    My Opinion

    Best for Databricks users looking to validate PySpark DataFrames and Tables across Spark Core, Spark Structured Streaming, and Lakeflow Pipelines / DLT.

    DataKitchen

    Open-source data testing and observability platform with automated test generation, data profiling, and anomaly detection.

    OSStestingobservability
    My Opinion

    Best for data teams looking for a cost-effective data testing and observability solution that also offers DataOps training services.

    Monte Carlo

    Leading data observability platform with data monitors, anomaly detection, customizable data quality dashboards, and column-level lineage.

    testingobservabilitylineage
    My Opinion

    Best for data teams with a big budget looking for a mature and customizable data observability platform with various integrations.

    DQLabs

    Unified data quality and observability platform with anomaly detection, data quality checks, end-to-end data lineage, and pipeline observability.

    testingobservabilitylineage
    My Opinion

    Best for enterprises looking for unified data quality and observability that integrates with modern data catalogs and issue management tools.

    DQOps

    Open-source data quality testing and observability platform with data quality checks, monitors, data lineage with Marquez, data quality dashboards, and incident management.

    OSStestingobservabilitylineage
    My Opinion

    Best for data teams looking to customize built-in data quality checks and data quality dashboards with Looker Studio to monitor data quality KPIs.

    Metaplane by Datadog

    End-to-end data observability platform with data monitors and column-level lineage from data sources to BI dashboards.

    observabilitylineage
    My Opinion

    Best for data analytics teams looking to quickly add anomaly detection monitors through the UI.

    Join my monthly newsletter

    Building or buying a data tool in 2026?

    I write the most comprehensive lists of tools and actionable market guides for data teams. Next, covering Data Governance, LLMOps, Data Orchestration, Data Integration, ...

    BigEye

    Lineage-enabled data observability platform with data quality metrics monitoring, anomaly detection, a data catalog, and end-to-end data lineage.

    observabilitylineagecatalog
    My Opinion

    Best for data teams looking to add code-based data observability for a mix of modern and legacy data warehouses and ETLs.

    SYNQ

    Data observability platform built around data products with UI-based data monitors, column-level lineage from sources to BI, and incident management.

    observabilitylineage
    My Opinion

    Best for data analytics teams using dbt / SQLMesh looking for advanced incident management workflows with ownership activation, AI-powered issue triaging, and resolution.

    Anomalo

    Automated data quality monitoring platform with UI-based anomaly detection tests for structured and unstructured data.

    observability
    My Opinion

    Best for data teams looking for a specialized data quality monitoring tool that integrates with data catalogs, notably Alation, Snowflake Horizon, and Databricks Unity Catalog.

    Lightup

    Data observability platform with data profiling, metrics, anomaly detection monitors, table-level lineage, and incident management.

    observability
    My Opinion

    Best for data teams looking for efficient and scalable window-based metrics for data warehouses monitoring with integrations with data catalogs and issue management tools.

    Telmai

    Real-time data observability platform for data lakes with anomaly detection, data health reports, and incident management.

    observability
    My Opinion

    Best for data teams looking for data observability for data lakes and data lakehouses with native support for Apache Iceberg, Hudi, and Delta Lake.

    Unravel

    Agentic data observability and FinOps platform for the cloud with integrations with external data quality checks, cost optimization, incident management, and resolution.

    observabilityFinOps
    My Opinion

    Best for data and infrastructure teams that want to combine data quality results with costs and performance insights into one platform with optimization recommendations.

    Pantomath

    Automated data operations platform with data observability, pipeline observability, end-to-end pipeline lineage, and incident management.

    observabilitylineageFinOps
    My Opinion

    Best for data operations teams looking for end-to-end data pipeline lineage with automated root-cause analysis and integrations with Jira or ServiceNow.

    IBM Databand

    Data pipeline and data warehouse monitoring platform with data pipeline job monitors, schema change monitors, custom SQL monitors, alerts, and task-based data lineage.

    observabilitylineage
    My Opinion

    Best for data teams looking for end-to-end ETL pipeline monitoring with tasks that span across dbt, Airlfow, Azure Data Factory, Spark, IBM DataStage, and IBM Watsonx Data.

    Validio

    Real-time data observability platform with window-based data validators, end-to-end data lineage, and incident management.

    observabilitylineage
    My Opinion

    Best for data teams looking for real-time anomaly detection in data streams, lakes, and warehouses.

    Datafold

    Proactive data quality platform with data diff tests, data impact reports, column-level lineage, and data monitors.

    lineagedata diff
    My Opinion

    Best for data teams looking for data impact reports in PRs to validate code changes and accelerate data migrations with SQL translation and data reconciliation tests.

    Join my monthly newsletter

    Building or buying a data tool in 2026?

    I write the most comprehensive lists of tools and actionable market guides for data teams. Next, covering Data Governance, LLMOps, Data Orchestration, Data Integration, ...

    Recce

    Recce Cloud is a specialized dbt data validation toolkit with column-level impact and data-diff reports.

    OSSlineagedata diff
    My Opinion

    Best for data teams using dbt looking for a cost-effective solution to streamline reviewing dbt code changes with data impact reports on PRs.

    Gable

    Shift left data platform with data contracts, static code analysis, and CI/CD integrations.

    contracts
    My Opinion

    Best for data operations teams looking to prevent bad data during CI/CD in database tables, files, Kafka topics, or Protobuf messages.

    Foundational

    Data management platform with source code analysis, data impact reports, column-level data lineage to BI, and data contracts.

    lineagecontracts
    My Opinion

    Best for data analytics teams looking to prevent data quality incidents with data impact reports integrated within their development lifecycle through Git and PRs.

    Entropy Data

    Data product platform to build data marketplaces with data contracts based on the Open Data Contract Standard (ODCS), the open-source Data Contract CLI, and data policy checks.

    OSScontracts
    My Opinion

    Best for organizations looking to build a data product marketplace with data policy checks.

    Elementary

    dbt-native data observability platform with anomaly detection monitors, column-level lineage, dbt performance monitoring, incident management, and a data catalog.

    OSSobservabilitylineagecatalog
    My Opinion

    Best for data analytics teams looking to add advanced anomaly detection monitors in their dbt codebase.

    Sifflet

    AI-augmented data observability platform with data monitors, column-level data lineage, incident management, and a data catalog.

    observabilitylineagecatalog
    My Opinion

    Best for data teams looking for full-featured data observability, data lineage, and a data catalog in the same tool with integrations for data warehouses and databases.

    Decube

    Unified data trust platform with data monitoring, pipeline monitoring, a data catalog, column-level lineage, and data access control.

    observabilitylineagecatalog
    My Opinion

    Best for data teams looking to combine data observability, a data catalog, and data governance in the same tool.

    Collate

    Unified metadata platform built upon the open-source OpenMetadata project with data discovery features, observability metrics, column-level lineage, and governance workflows.

    OSStestingobservabilitylineagecatalogcontracts
    My Opinion

    Best for data teams looking for a unified enterprise solution to data discovery, observability, and governance with a wide range of integrations.

    SelectZero

    Comprehensive data observability platform with data validation, data profiling, column-level data lineage, a data catalog, and a business glossary.

    testingobservabilitylineagecatalog
    My Opinion

    Best for enterprises looking for a data quality tool that can be easily self-hosted with a Docker deployment and also offers a one seat free version to get started.

    Ataccama ONE

    Data trust platform with data quality evaluation rules, anomaly detection, data lineage, a data catalog, and master data management.

    testingobservabilitylineagecatalogMDM
    My Opinion

    Best for organizations looking to scale data management initiatives with enterprise master data management, data quality, and data governance.

    Join my monthly newsletter

    Building or buying a data tool in 2026?

    I write the most comprehensive lists of tools and actionable market guides for data teams. Next, covering Data Governance, LLMOps, Data Orchestration, Data Integration, ...

    Acceldata

    Agentic data observability platform with AI agents for data monitoring, data lineage, and FinOps.

    observabilitylineage
    My Opinion

    Best for data teams looking for an enterprise data observability platform pivoting to a ChatGPT-like interface for all data management initiatives.

    Frequently Asked Questions

    What is a data quality tool?
    A data quality tool is a solution to data reliability problems that implements features such as data tests, data profiling, data observability monitors, data quality workflows, data quality dashboards, data lineage, and incident management. Data quality tools are classified into: data testing, data observability, shift-left data quality, and unified data trust tools with data governance features. Read more on my data quality tool market guide.
    Why create yet another tools list?
    I found no comprehensive, actionable, and up-to-date list of data quality tools. The MAD Landscape misclassifies 3 out of 19 data quality and observability tools. The Gartner Magic Quadrant for augmented data quality solutions lists 13 tools, half of which are enterprise data platforms, and I need to enter my professional email on a featured tool's website to get access to a reprint. To access the Forrester Landscape with an overview of 29 data quality solutions I need to pay $2995. Other lists by vendors contain a random sample of less than 10 tools, are written by AI, are highly biased, or are never updated. If I ask Google or ChatGPT for a list, the results are no good. How could they be, given all the conflicting and outdated sources?
    How can I edit this tools list?
    First, consider that this list focuses on specialized data quality tools for data teams. I intend to cover separately data governance tools (Collibra...), MLOps/LLMOps tools (Evidently AI, langfuse...) and enterprise data platforms (Qlik...) that also offer data quality features or products. Other specialized data quality tools on my radar that I haven't had the time to research yet include: CluedIn, Qualdo, Qualytics, iceDQ, RightData, Kensu, FirstEigen, Rakuten SixthSense, Eagle EYE, BIG EVAL. Bigger companies with data quality products I inted to cover separately include: dbt, Qlik, Informatica, IBM, Actian, Irion, Ab Initio, SAP, Oracle, SAS. Specialized data quality tools for business users that I don't intent to cover include: Experian, Precisely, Melissa. That being said, if you think a tool belongs here or you want to suggest an edit, I would love to hear from you. You can fill up the feedback form or contact me on LinkedIn.