You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have a very large tfrecord directory, and need to filter it with some column to generate new tfrecord files.
Code likes that
When I run it in spark cluster, I find it will run with two steps.
I check the code in https://github.com/tensorflow/ecosystem/blob/master/spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/TensorFlowInferSchema.scala#L39, it have the aggregate steps !
Can I avoid it?
The text was updated successfully, but these errors were encountered:
I have a very large tfrecord directory, and need to filter it with some column to generate new tfrecord files.
Code likes that
When I run it in spark cluster, I find it will run with two steps.
I check the code in
https://github.com/tensorflow/ecosystem/blob/master/spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/TensorFlowInferSchema.scala#L39
, it have the aggregate steps !Can I avoid it?
The text was updated successfully, but these errors were encountered: