Add Schema support for Tensorflow #19721

Open
damccorm opened this issue Jun 4, 2022 · 0 comments

damccorm commented Jun 4, 2022

Similar to BEAM-5807: TensorFlow's de facto storage format is TFRecord files with Example proto payloads, and it has its own schema.proto. We already have TFRecordIO support. Still need to implement:

  • Conversion between Beam and TF schema
  • Conversion between Beam Row and TF Example proto
  • TFRecordTableProvider
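
To make the second item concrete, here is a minimal sketch of a Row/Example conversion. It is not taken from the WIP branch. Plain Python dicts stand in for both Beam Row and the TF Example proto, and the `BEAM_TO_TF_KIND` table and the `row_to_example` / `example_to_row` helpers are hypothetical names used only for illustration. The one fact it relies on is that a TF Example feature holds exactly one of `bytes_list`, `float_list`, or `int64_list`.

```python
from typing import Any, Dict, List

# Hypothetical mapping from Beam schema field type names to the three value
# lists a tf.Example Feature can hold. Nested types (ROW, MAP, ...) are left out.
BEAM_TO_TF_KIND: Dict[str, str] = {
    "BYTE": "int64_list",
    "INT16": "int64_list",
    "INT32": "int64_list",
    "INT64": "int64_list",
    "BOOLEAN": "int64_list",
    "FLOAT": "float_list",
    "DOUBLE": "float_list",  # a real float_list is 32-bit, so DOUBLE would lose precision
    "STRING": "bytes_list",
    "BYTES": "bytes_list",
}


def row_to_example(schema: Dict[str, str], row: Dict[str, Any]) -> Dict[str, Dict[str, List[Any]]]:
    """Convert a dict standing in for a Beam Row into an Example-like dict."""
    features: Dict[str, Dict[str, List[Any]]] = {}
    for name, beam_type in schema.items():
        kind = BEAM_TO_TF_KIND[beam_type]  # raises KeyError for unsupported types
        value = row.get(name)
        if value is None:
            continue  # a null field is represented by omitting the feature
        values = value if isinstance(value, list) else [value]
        if kind == "bytes_list":
            values = [v.encode("utf-8") if isinstance(v, str) else bytes(v) for v in values]
        elif kind == "int64_list":
            values = [int(v) for v in values]
        else:
            values = [float(v) for v in values]
        features[name] = {kind: values}
    return features


def example_to_row(schema: Dict[str, str], features: Dict[str, Dict[str, List[Any]]]) -> Dict[str, Any]:
    """Convert an Example-like dict back into a row dict; present values stay lists."""
    row: Dict[str, Any] = {}
    for name, beam_type in schema.items():
        kind = BEAM_TO_TF_KIND[beam_type]
        if name not in features:
            row[name] = None  # missing feature becomes a null field
            continue
        values = features[name][kind]
        if beam_type == "STRING":
            values = [v.decode("utf-8") for v in values]
        elif beam_type == "BOOLEAN":
            values = [bool(v) for v in values]
        row[name] = values
    return row


schema = {"id": "INT64", "name": "STRING", "score": "DOUBLE", "active": "BOOLEAN"}
example = row_to_example(schema, {"id": 7, "name": "a", "score": 0.5, "active": True})
restored = example_to_row(schema, example)
```

Because every Example feature is a list, the round trip returns single-element lists rather than scalars; a real implementation would need the schema to decide when to unwrap them.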

https://github.com/tensorflow/metadata/blob/master/tensorflow_metadata/proto/v0/schema.proto

https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/example/example.proto

Also, it seems the metadata protos are not published as Java artifacts; see tensorflow/metadata#5.

My WIP branch: https://github.com/spotify/beam/tree/neville/tf

Imported from Jira BEAM-7921. Original Jira may contain additional context.
Reported by: sinisa_lyh.
