← Integrations matrix

    My Amazon Athena Data Quality Tools List

    Discover the best 20 data quality tools that integrate with Amazon Athena.

    By Ari Bajo - Data Engineer turned Writer.

    See the full data quality tools list

    All 36 data quality, data testing, and data observability tools.

    View full list

    Collate

    Managed enterprise data platform built on OpenMetadata with data discovery, observability metrics, column-level lineage, governance workflows, and enterprise support.

    My Opinion

    Best for data teams looking for a fully managed enterprise version of OpenMetadata with dedicated support, advanced governance features, and a wide range of integrations.

    Ataccama ONE

    Data trust platform with data quality evaluation rules, anomaly detection, data lineage, a data catalog, and master data management.

    My Opinion

    Best for organizations looking to scale data management initiatives with enterprise master data management, data quality, and data governance.

    Monte Carlo

    Leading data observability platform with data monitors, anomaly detection, customizable data quality dashboards, and column-level lineage.

    My Opinion

    Best for data teams with a big budget looking for a mature and customizable data observability platform that also offers AI observability.

    Acceldata

    Agentic data observability platform with AI agents for data monitoring, data lineage, and FinOps.

    My Opinion

    Best for data teams looking for an enterprise data observability platform pivoting to a ChatGPT-like interface for all data management initiatives.

    Anomalo

    Automated data quality monitoring platform with UI-based anomaly detection tests for structured and unstructured data.

    My Opinion

    Best for data teams looking for a specialized data quality monitoring tool that integrates with specialized and cloud-native data catalog tools.

    Building or buying a data tool in 2026?

    One email a month — a new market guide and tool list, straight to your inbox. Next up: Data Governance, LLMOps, Data Orchestration.

    By Ari Bajo - Data Engineer turned Writer.

    Qualytics

    AI-powered data quality platform with auto-generated tests from profiling results, anomaly detection, and data quality context for humans and AI agents.

    My Opinion

    Best for enterprises in highly regulated industries looking for a scalable data quality platform with on-premise cloud deployments via Kubernetes.

    Bigeye

    Lineage-enabled data observability platform with data quality metrics monitoring, anomaly detection, a data catalog, and end-to-end data lineage.

    My Opinion

    Best for data teams looking to add code-based data observability for a mix of modern and legacy data warehouses and ETLs.

    Elementary Cloud

    Managed data observability platform built on the Elementary OSS dbt package with advanced anomaly detection monitors, column-level lineage, incident management, a data catalog, and AI agents.

    My Opinion

    Best for data analytics teams using dbt looking for a managed observability platform with advanced anomaly detection, team collaboration, and AI-powered issue resolution.

    Elementary OSS

    Open-source dbt package to add data observability to dbt projects with anomaly detection tests and a local data observability report generated via CLI.

    My Opinion

    Best for data analytics teams using dbt looking to add anomaly detection monitors to their existing dbt codebase without a cloud account.

    DQOps

    Open-source data quality platform with 150+ Jinja2 SQL-based checks configurable via YAML or a local web UI, data lineage, and incident management. DQOps Cloud adds cloud storage for quality results, hosted Looker Studio dashboards, and team collaboration.

    My Opinion

    Best for data teams looking for a highly customizable open-source data quality check library with a local web UI and Looker Studio dashboard integrations. DQOps Cloud adds shared dashboards and multi-user collaboration without managing your own infrastructure.

    Building or buying a data tool in 2026?

    One email a month — a new market guide and tool list, straight to your inbox. Next up: Data Governance, LLMOps, Data Orchestration.

    By Ari Bajo - Data Engineer turned Writer.

    Soda Cloud

    Managed data quality platform with built-in metrics to write data contracts (using YAML, UI, or AI), anomaly detection and AI agents to clean data.

    My Opinion

    Best for data teams looking to embed tests at every pipeline stage, collaborate with business users to quarantine and fix bad data, and integrate with data enterprise catalogs.

    Soda Core

    Open-source Python library and CLI to write and run data contracts in YAML using SodaCL with integrations for data warehouses, databases and query engines.

    My Opinion

    Best for data engineering teams looking for a YAML-based OSS data testing library that embeds directly in pipelines and CI/CD workflows.

    Sifflet

    AI-augmented data observability platform with data monitors, column-level data lineage, incident management, and a data catalog.

    My Opinion

    Best for data analytics teams looking for full-featured data observability, data lineage, and a data catalog in the same tool with integrations for cloud data warehouses.

    Datafold

    Proactive data quality platform with data diff tests, data impact reports, column-level lineage, and data monitors.

    My Opinion

    Best for data teams looking for data impact reports in PRs to validate code changes and automate data migrations with SQL translation and data reconciliation tests.

    Foundational

    Data management platform with source code analysis, data impact reports, column-level data lineage to BI, and data contracts.

    My Opinion

    Best for data teams looking to prevent data quality incidents with data impact reports integrated within their development lifecycle through Git and PRs.

    Building or buying a data tool in 2026?

    One email a month — a new market guide and tool list, straight to your inbox. Next up: Data Governance, LLMOps, Data Orchestration.

    By Ari Bajo - Data Engineer turned Writer.

    Validio

    Real-time data observability platform with window-based data validators, end-to-end data lineage, and incident management.

    My Opinion

    Best for data teams looking for real-time anomaly detection in data streams, lakes, and warehouses.

    Decube

    Unified data trust platform with data monitoring, pipeline monitoring, a data catalog, column-level lineage, and data access control.

    My Opinion

    Best for data teams looking to combine data observability, a data catalog, and data governance in the same tool.

    Great Expectations

    Open-source Python library with 50+ declarative expectations to validate data in files, SQL databases, data warehouses, and in-memory DataFrames.

    My Opinion

    Best for data engineering teams looking for a code-first OSS data testing library with a large built-in expectation library and Python extensibility.

    DQLabs

    Unified data quality and observability platform with anomaly detection, data quality checks, end-to-end data lineage, and pipeline observability.

    My Opinion

    Best for enterprises looking for unified data quality and observability that integrates with modern data catalogs and issue management tools.

    Lightup

    Data observability platform with data profiling, metrics, anomaly detection monitors, table-level lineage, and incident management.

    My Opinion

    Best for data teams looking for efficient and scalable window-based metrics for data warehouses monitoring with integrations with data catalogs and issue management tools.

    Building or buying a data tool in 2026?

    One email a month — a new market guide and tool list, straight to your inbox. Next up: Data Governance, LLMOps, Data Orchestration.

    By Ari Bajo - Data Engineer turned Writer.

    Telmai

    Real-time data observability platform for data lakes with anomaly detection, data health reports, and incident management.

    My Opinion

    Best for data teams looking for data observability for data lakes and data lakehouses with native support for Apache Iceberg, Hudi, and Delta Lake.

    AWS Glue Data Quality

    Managed data quality platform built on the open-source Deequ framework with data quality rulesets, scheduling, data quality dashboards, and anomaly detection.

    My Opinion

    Best for data teams using the AWS Glue Data Catalog and AWS Glue ETL jobs that want to monitor data quality at rest and in transit, with the possibility to quarantine data.

    Want to go deeper?

    Read the complete data quality tool market guide — features, pricing, and how to choose.

    Read the data quality guide