You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to process the Snowplow enriched TSV events from Kafka in PySpark but getting |-- _corrupt_record: string (nullable = true) in PySpark while in Scala it is working fine.
I am trying to process the Snowplow enriched TSV events from Kafka in PySpark but getting
|-- _corrupt_record: string (nullable = true)
in PySpark while in Scala it is working fine.TSV Event:
JSON event after Python SDK Transform:
Pyspark code:
Output:
Why I am getting corrupted records through Python SDK which works fine with Scala SDK ?
The text was updated successfully, but these errors were encountered: