[FEA] Add Parquet and ORC unit tests based on Apache sample files #13627
Labels: 0 - Backlog, cuIO, feature request, libcudf, Spark, tests
During the 23.06 release, we encountered several important Parquet and ORC writer issues that risked data corruption. These issues included:
After discussion with the team we agreed on these additions to our testing suite to help prevent similar issues in the future:
Note: please also see #12739 (for reader benchmarks, verify that the roundtripped table matches the starting table).