Replies: 1 comment
The filter needs to be enabled in order to take effect, either on its own or, if you want other filters active at the same time, together with them.
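As a rough sketch of what enabling the filter from the command line might look like: this assumes the filter is selected by name through a `spark.spline.postProcessingFilter` key, which is an assumption to verify against `spline.default.properties` for the agent version in use. The variable names (`RULES`, `FILTER_CONF`, `RULES_CONF`) are hypothetical, and the rules JSON is a shortened copy of the one in the question, with its `\,` escapes kept as written. The single quotes matter independently of the filter setting: inside double quotes the shell would treat `$js`, `$env` and `$jvm` as shell variables and most likely replace them with empty strings.

```shell
# Single quotes keep $js and $env literal; nothing inside them is expanded.
RULES='{"executionPlan":{"foo":{"$js":"executionPlan.name()"}\,"bar":{"$env":"JAVA_HOME"}\,"qux":42}}'

# Assumption: the filter is enabled by naming it under this key.
FILTER_CONF='spark.spline.postProcessingFilter=userExtraMeta'

# Double quotes are safe here because ${RULES} is expanded only once;
# the "$js" text inside its value is not re-expanded.
RULES_CONF="spark.spline.postProcessingFilter.userExtraMeta.rules=${RULES}"

# Print both settings to check the quoting before handing them to spark-submit.
printf '%s\n' "$FILTER_CONF" "$RULES_CONF"
```

The two values could then be passed as `--conf "$FILTER_CONF" --conf "$RULES_CONF"` alongside the other `spark-submit` options from the question.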
Version
spark: [email protected]
spline-agent: za.co.absa.spline.agent.spark:spark-3.1-spline-agent-bundle_2.12:0.6.1
Background
After following the steps in this blog post, https://www.capitalone.com/tech/software-engineering/spline-spark-data-lineage/, I've been able to extract data lineage from Spark and upload it to the Spline server. The next thing I want to do is inject extra metadata into the lineage using a code-less configuration. Based on the Spline agent's default properties, https://github.com/AbsaOSS/spline-spark-agent/blob/develop/core/src/main/resources/spline.default.properties, I tried to configure spark.spline.postProcessingFilter.userExtraMeta.rules, but the extra metadata does not appear in the output.
Question
What is the correct way to configure spark.spline.postProcessingFilter.userExtraMeta.rules using a code-less configuration?
Setup
spark-submit \
  --packages za.co.absa.spline.agent.spark:spark-3.1-spline-agent-bundle_2.12:0.6.1 \
  --conf spark.sql.queryExecutionListeners=za.co.absa.spline.harvester.listener.SplineQueryExecutionListener \
  --conf spark.spline.producer.url=http://localhost:8080/producer \
  --conf spark.spline.lineageDispatcher=console \
  --conf spark.spline.postProcessingFilter.userExtraMeta.rules"={\"executionPlan\":{\"foo\":{\"$js\":\"executionPlan.name()\"}\,\"bar\":{\"$env\":\"JAVA_HOME\"}\,\"baz\":{\"$jvm\":\"some.jvm.prop\"}\,\"qux\":42\,\"tags\":[\"aaa\"\,\"bbb\"\,\"ccc\"]}}" \
  my_spark_job.py
Output [Extra metadata has not been added]