Contributor

@siying siying commented Dec 19, 2025

What changes were proposed in this pull request?

In the Python Stream Data Source, if the records returned by the user's code don't match the schema the user provided, a classified error is now thrown instead of an assertion failure.
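A minimal sketch of the idea in plain Python, not the actual Spark code: the exception type `ClassifiedError`, the helper `check_records`, and the error class name `SCHEMA_MISMATCH` are all hypothetical, and the real check in this PR may compare more than the field count.

```python
from typing import Iterable, Iterator, Sequence, Tuple


class ClassifiedError(Exception):
    """Hypothetical stand-in for an exception that carries an error class."""

    def __init__(self, error_class: str, message: str) -> None:
        super().__init__(f"[{error_class}] {message}")
        self.error_class = error_class


def check_records(
    records: Iterable[Tuple], field_names: Sequence[str]
) -> Iterator[Tuple]:
    """Yield records unchanged; raise a classified error on a mismatch."""
    expected = len(field_names)
    for record in records:
        # An assertion such as `assert len(record) == expected` would surface
        # as a bare AssertionError with no error class for the user to act on.
        if len(record) != expected:
            raise ClassifiedError(
                "SCHEMA_MISMATCH",  # illustrative name, not a real Spark error class
                f"expected {expected} fields {list(field_names)}, "
                f"got {len(record)}: {record!r}",
            )
        yield record
```

With this shape, a reader that yields `(1,)` against a two-field schema fails with a message naming the error class and the offending record, rather than with an unexplained assertion.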

Why are the changes needed?

This gives users a better experience: they get a correct error message and error code instead of an assertion failure.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Added a unit test.

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Cursor 2.2.20 with claude-4.5-sonnet

@siying siying changed the title from "WIP" to "[SPARK-54768][SS]Python Stream Data Source should classify error if data returned doesn't match configured schema" Dec 19, 2025
@siying siying closed this Dec 19, 2025
@siying siying reopened this Dec 19, 2025
@siying siying marked this pull request as draft December 19, 2025 20:51
Contributor Author

siying commented Dec 19, 2025

The unit tests I constructed all failed in other places. I haven't yet found a way to make one fail at this check.

@siying siying marked this pull request as ready for review December 19, 2025 23:15
Contributor Author

siying commented Dec 19, 2025

I hope it works now.

Contributor

@anishshri-db anishshri-db left a comment

lgtm, thanks

2 participants