DQX by Databricks
Data quality framework for Apache Spark with data quality rule generation from profiling results, and YAML and Python-based data validation checks.
Best for Databricks users looking to validate PySpark DataFrames and Tables across Spark Core, Spark Structured Streaming, and Lakeflow Pipelines / DLT.
Monte Carlo
Leading data observability platform with data monitors, anomaly detection, customizable data quality dashboards, and column-level lineage.
Best for data teams with a big budget looking for a mature and customizable data observability platform that also offers AI observability.
DQLabs
Unified data quality and observability platform with anomaly detection, data quality checks, end-to-end data lineage, and pipeline observability.
Best for enterprises looking for unified data quality and observability that integrates with modern data catalogs and issue management tools.
Qualytics
ML-powered data quality platform with auto-generated tests from profiling results, anomaly detection, and data quality context for humans and AI agents.
Best for enterprises in highly regulated industries looking for a scalable data quality platform with on-premise cloud deployments via Kubernetes.
DQOps
Open-source data quality testing and observability platform with data quality checks, monitors, data lineage with Marquez, and data quality dashboards.
Best for data teams looking to customize built-in data quality checks and data quality dashboards with Looker Studio to monitor data quality KPIs.
DataKitchen
Open-source data testing and observability platform with automated test generation, data profiling, and anomaly detection.
Best for data teams looking for a cost-effective data testing and observability solution that prices per database connection and user.
Elementary OSS
Open-source dbt package to add data observability to dbt projects with anomaly detection tests and a local data observability report generated via CLI.
Best for data analytics teams using dbt looking to add anomaly detection monitors to their existing dbt codebase without a cloud account.
Metaplane by Datadog
End-to-end data observability platform with data monitors and column-level lineage from data sources to BI dashboards.
Best for data analytics teams with a modern data stack looking to quickly add anomaly detection monitors through the UI.
Anomalo
Automated data quality monitoring platform with UI-based anomaly detection tests for structured and unstructured data.
Best for data teams looking for a specialized data quality monitoring tool that integrates with specialized and cloud-native data catalog tools.
Validio
Real-time data observability platform with window-based data validators, end-to-end data lineage, and incident management.
Best for data teams looking for real-time anomaly detection in data streams, lakes, and warehouses.
Building or buying a data tool in 2026?
One email a month — a new market guide and tool list, straight to your inbox. Next up: Data Governance, LLMOps, Data Orchestration.
By Ari Bajo - Data Engineer turned Writer.
Telmai
Real-time data observability platform for data lakes with anomaly detection, data health reports, and incident management.
Best for data teams looking for data observability for data lakes and data lakehouses with native support for Apache Iceberg, Hudi, and Delta Lake.
Lightup
Data observability platform with data profiling, metrics, anomaly detection monitors, and incident management.
Best for data teams looking for scalable window-based metrics for data warehouses with integrations with data catalogs and issue management tools.
Acceldata
Agentic data observability platform with AI agents for data monitoring, data lineage, and FinOps.
Best for data teams looking for an enterprise data observability platform pivoting to a ChatGPT-like interface for all data management initiatives.
Pantomath
Automated data operations platform with data observability, pipeline observability, end-to-end pipeline lineage, and incident management.
Best for data operations teams looking for end-to-end data pipeline lineage with automated root-cause analysis and integrations with Jira or ServiceNow.
Unravel
Agentic data observability and FinOps platform for the cloud with integrations with external data quality checks, cost optimization, and incident management.
Best for data teams that want to combine in one platform data quality results with costs and performance recommendations.
AWS Glue Data Quality
Managed data quality platform built on the open-source Deequ framework with data quality rulesets, scheduling, data quality dashboards, and anomaly detection.
Best for data teams using AWS Glue Data Catalog and ETL jobs that want to monitor data quality at rest and in transit, with the possibility to quarantine data.
IBM Databand
Data pipeline and data warehouse monitoring platform with job pipeline monitors, data monitors, and task-based data lineage.
Best for data teams looking for end-to-end ETL pipeline monitoring with tasks that span across dbt, Airlfow, Spark, IBM DataStage, and IBM Watsonx Data.
Soda Cloud
Managed data quality platform with built-in metrics to write data contracts (using YAML, UI, or AI), anomaly detection and AI agents to clean data.
Best for data teams looking to embed data contracts within data pipeline steps, collaborate with business users to fix bad data, and integrate with data catalogs.
Datafold
Proactive data quality platform with data diff tests, data impact reports, column-level lineage, and data monitors.
Best for data teams looking for data impact reports in PRs to validate code changes and automate data migrations with SQL translation and data reconciliation tests.
Sifflet
AI-augmented data observability platform with data monitors, column-level data lineage, incident management, and a data catalog.
Best for data teams looking to collaborate with business users through integrated data observability, data lineage, and a data catalog for cloud data warehouses.
Building or buying a data tool in 2026?
One email a month — a new market guide and tool list, straight to your inbox. Next up: Data Governance, LLMOps, Data Orchestration.
By Ari Bajo - Data Engineer turned Writer.
OpenMetadata
Open-source unified metadata platform with data discovery, data quality checks, observability metrics, column-level lineage, and governance workflows.
Best for data teams looking for a self-hosted open-source platform covering data discovery, observability, and governance with a wide range of integrations.
Collate
Managed enterprise data platform built on OpenMetadata with data discovery, observability metrics, column-level lineage, and governance workflows.
Best for data teams looking for a fully managed enterprise version of OpenMetadata with dedicated support, security features, and advanced governance worflows.
Elementary Cloud
Managed data observability platform with advanced anomaly detection monitors, column-level lineage, incident management, a data catalog, and AI agents.
Best for data analytics teams using dbt looking for a managed observability platform with team collaboration features and AI-powered issue resolution.
Bigeye
Lineage-enabled data observability platform with data quality metrics monitoring, anomaly detection, a data catalog, and end-to-end data lineage.
Best for data teams looking to add code-based data observability for a mix of modern and legacy data warehouses and ETLs.
Decube
Unified data trust platform with data monitoring, pipeline monitoring, a data catalog, column-level lineage, and data access control.
Best for data teams looking to combine data observability, a data catalog, and data governance in the same tool.
SelectZero
Comprehensive data observability platform with data validation, data profiling, column-level data lineage, a data catalog, and a business glossary.
Best for enterprises looking for a data quality tool that can be easily self-hosted with a Docker deployment.
Ataccama ONE
Data trust platform with data quality evaluation rules, anomaly detection, data lineage, a data catalog, and master data management.
Best for organizations looking to scale data management initiatives with enterprise master data management, data quality, and data governance.
Coalesce Data Quality
Data observability product by Coalesce after having acquired SYNQ with UI-based data monitors, column-level lineage, and incident management workflows.
Best for Coalesce users that want to unify data transformation, data catalog and data quality in one product.