Reusable, pipeline-agnostic data quality framework built on PySpark. Plug it into any Databricks notebook, AWS Glue job, or dbt post-hook. All thresholds are driven by YAML config, with zero hardcoded values ...
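As a rough illustration of config-driven thresholds, here is a minimal sketch of a null-rate check. The YAML layout (`checks.max_null_rate`), the function names `load_thresholds`, `null_rates`, and `find_violations`, and the column names are all assumptions made for this example; the framework's actual schema and API are not shown in this excerpt.

```python
from typing import Dict, List

# Hypothetical config layout; the framework's real YAML schema may differ.
EXAMPLE_YAML = """
checks:
  max_null_rate:
    passenger_count: 0.05
    fare_amount: 0.01
"""


def load_thresholds(yaml_text: str) -> Dict[str, float]:
    """Parse YAML text and return {column: maximum allowed null rate}."""
    import yaml  # PyYAML, imported lazily so the pure logic runs without it

    config = yaml.safe_load(yaml_text)
    return dict(config["checks"]["max_null_rate"])


def null_rates(df, columns: List[str]) -> Dict[str, float]:
    """Compute the fraction of NULL values per column of a PySpark DataFrame."""
    from pyspark.sql import functions as F  # lazy import: needs a Spark environment

    total = df.count()
    if total == 0:
        return {c: 0.0 for c in columns}
    # One aggregation job that counts NULLs for every requested column.
    row = df.select(
        [F.sum(F.col(c).isNull().cast("int")).alias(c) for c in columns]
    ).first()
    return {c: row[c] / total for c in columns}


def find_violations(
    rates: Dict[str, float], thresholds: Dict[str, float]
) -> Dict[str, float]:
    """Return the columns whose null rate exceeds the configured threshold."""
    return {
        col: rate
        for col, rate in rates.items()
        if col in thresholds and rate > thresholds[col]
    }
```

In a notebook or job, one might call `find_violations(null_rates(df, list(t)), t)` with `t = load_thresholds(EXAMPLE_YAML)`, then fail the run or log a warning if the result is non-empty. Keeping the comparison separate from the Spark aggregation means the threshold logic can be unit-tested without a cluster.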
[Diagram] DATA SOURCES: NYC TLC Yellow Taxi CSV (ADLS Gen2 / S3 / DBFS ...