Feature/spark expectation enhancements#123 #125

sudeep7978 · 2024-12-19T18:00:00Z

Add Column for Column-Level Visibility in Data Quality Framework Result Table

Description

Schema Evolution with AutoMerge:
Enabled Delta Lake's spark.databricks.delta.schema.autoMerge.enabled configuration to allow schema evolution during write operations.
Modified the data quality framework to include the affected_column_name field dynamically if not already present.

ENHANCEMENT

Enhanced granularity in data quality reporting.
Improved ease of debugging and resolving data quality issues.
Better alignment with industry practices for data governance and observability.

Motivation and Context

Increased Transparency: Builds trust by providing clear visibility into how data quality rules are applied and which columns are impacted.
Operational Efficiency: Reduces manual intervention and effort required to diagnose data issues, optimizing resource utilization.

How Has This Been Tested?

Ensure backward compatibility is maintained for legacy workflows making sure that exusting pipelines donot break

Screenshots (if appropriate):

Types of changes

Checklist:

My code follows the code style of this project.
My change requires a change to the documentation.
I have updated the documentation accordingly.
I have read the CONTRIBUTING document.
I have added tests to cover my changes.
All new and existing tests passed.

Add Column for Column-Level Visibility in Data Quality Framework Result Table

…column-level visibility. Add Column for Column-Level Visibility in Data Quality Framework Result Table

Enabled Delta Lake's spark.databricks.delta.schema.autoMerge.enabled configuration to allow schema evolution during write operations. Modified the data quality framework to include the affected_column_name field dynamically if not already present.

…configuration to allow schema evolution during write operations.

sudeep7978 · 2025-01-13T15:30:57Z

@asingamaneni Can you please look into it.
Let me know any changes required.
THANK YOU.

…nts#123

…ements#123' into feature/spark_expectation_enhancements#123

sudeep7978 · 2025-01-20T03:45:52Z

@asingamaneni
closing this PR will raise a new PR taking the SMTP authentication changes after the other PR is merged
#54

sudeep7978 added 8 commits December 19, 2024 22:29

adding additional column in the detailed table.

a33c3aa

Add Column for Column-Level Visibility in Data Quality Framework Result Table

add a new column in the result table of a data quality framework for …

5ee0376

…column-level visibility. Add Column for Column-Level Visibility in Data Quality Framework Result Table

updated the test case for added column

cc9e5a0

Update test case for added column name

55fff86

Enabled Delta Lake's spark.databricks.delta.schema.autoMerge.enabled …

c857252

…configuration to allow schema evolution during write operations.

adding additional column in the detailed table.

86e2bec

Update CONTRIBUTORS.md

f2cf131

sudeep7978 requested review from asingamaneni and Umeshsp22 as code owners December 19, 2024 18:00

sudeep7978 and others added 3 commits January 20, 2025 09:06

Merge branch 'Nike-Inc:main' into feature/spark_expectation_enhanceme…

585cc62

…nts#123

changes

ca7ca7f

Merge remote-tracking branch 'origin/feature/spark_expectation_enhanc…

1f0c8a9

…ements#123' into feature/spark_expectation_enhancements#123

sudeep7978 closed this Jan 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/spark expectation enhancements#123 #125

Feature/spark expectation enhancements#123 #125

sudeep7978 commented Dec 19, 2024

sudeep7978 commented Jan 13, 2025

sudeep7978 commented Jan 20, 2025

Feature/spark expectation enhancements#123 #125

Feature/spark expectation enhancements#123 #125

Conversation

sudeep7978 commented Dec 19, 2024

Description

ENHANCEMENT

Motivation and Context

How Has This Been Tested?

Screenshots (if appropriate):

Types of changes

Checklist:

sudeep7978 commented Jan 13, 2025

sudeep7978 commented Jan 20, 2025