@dongjoon-hyun
Member

@dongjoon-hyun dongjoon-hyun commented Aug 9, 2024

What changes were proposed in this pull request?

This PR aims to fix spark.ui.custom.executor.log.url configuration's documentation by adding K8s on top of YARN.

-    For now, only YARN mode supports this configuration
+    For now, only YARN and K8s cluster manager supports this configuration

In addition, a K8s example is added to the running-on-kubernetes.md docs to help users more easily refer to the existing environment variables, SPARK_APPLICATION_ID and SPARK_EXECUTOR_ID, when enabling this feature.
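
To make the idea concrete, here is a minimal Python sketch of how a custom executor log URL pattern could be filled in from executor environment variables. It is an illustration only, not Spark's implementation. It assumes that attributes are passed as environment variables with a `SPARK_EXECUTOR_ATTRIBUTE_` prefix and that the URL pattern uses `{{NAME}}` placeholders; the helper names, the `log-server` host, and the sample values are hypothetical.

```python
import re

# Assumed prefix for env vars that carry executor attributes.
ATTRIBUTE_PREFIX = "SPARK_EXECUTOR_ATTRIBUTE_"


def extract_attributes(env):
    """Collect attributes from env vars that start with the attribute prefix."""
    return {
        name[len(ATTRIBUTE_PREFIX):]: value
        for name, value in env.items()
        if name.startswith(ATTRIBUTE_PREFIX)
    }


def render_log_url(pattern, attributes):
    """Replace each {{NAME}} placeholder with the matching attribute value.

    Placeholders without a matching attribute are left untouched in this sketch.
    """
    return re.sub(
        r"\{\{(\w+)\}\}",
        lambda m: attributes.get(m.group(1), m.group(0)),
        pattern,
    )


# Hypothetical values; on K8s they would be derived from
# SPARK_APPLICATION_ID and SPARK_EXECUTOR_ID.
executor_env = {
    "SPARK_EXECUTOR_ATTRIBUTE_APP_ID": "spark-app-123",
    "SPARK_EXECUTOR_ATTRIBUTE_EXECUTOR_ID": "7",
    "PATH": "/usr/bin",
}
pattern = "https://log-server/log?appId={{APP_ID}}&execId={{EXECUTOR_ID}}"
url = render_log_url(pattern, extract_attributes(executor_env))
print(url)
```

With these sample values the printed URL carries the application ID and executor ID as query parameters, which is the kind of link an external log service would need.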

Why are the changes needed?

To fix a wrong description.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Manual review. This is a doc-only PR.

Was this patch authored or co-authored using generative AI tooling?

No.

@github-actions github-actions bot added the DOCS label Aug 9, 2024
@dongjoon-hyun
Member Author

cc @viirya

@dongjoon-hyun
Member Author

Oh, thank you for the review, @yaooqinn!

@dongjoon-hyun
Member Author

Thank you, @viirya .

@HyukjinKwon
Member

Merged to master.

@dongjoon-hyun
Member Author

Thank you, @HyukjinKwon !

@dongjoon-hyun
Member Author

Let me backport this documentation update to the live release branches because this has been supported for a long time already. Thank you all again.

dongjoon-hyun added a commit that referenced this pull request Aug 9, 2024
…by adding K8s

### What changes were proposed in this pull request?

This PR aims to fix `spark.ui.custom.executor.log.url` configuration's documentation by adding K8s on top of YARN.
```
-    For now, only YARN mode supports this configuration
+    For now, only YARN and K8s cluster manager supports this configuration
```

In addition, a K8s example is added to the `running-on-kubernetes.md` docs to help users more easily refer to the existing environment variables, `SPARK_APPLICATION_ID` and `SPARK_EXECUTOR_ID`, when enabling this feature.

- https://kubernetes.io/docs/tasks/inject-data-application/define-environment-variable-container/#using-environment-variables-inside-of-your-config

### Why are the changes needed?

To fix a wrong description.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Manual review. This is a doc-only PR.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #47681 from dongjoon-hyun/SPARK-49176.

Authored-by: Dongjoon Hyun <dhyun@apple.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
dongjoon-hyun added a commit that referenced this pull request Aug 10, 2024
…f `CUSTOM_EXECUTOR_LOG_URL` is defined

### What changes were proposed in this pull request?

This PR aims to add `SPARK_EXECUTOR_ATTRIBUTE_(APP|EXECUTOR)_ID` if `CUSTOM_EXECUTOR_LOG_URL` is defined.

### Why are the changes needed?

Apache Spark has supported `spark.ui.custom.executor.log.url` in K8s environments for some time.

- #47681

This PR aims to help users use it more easily by providing the required `SPARK_EXECUTOR_ATTRIBUTE_APP_ID` and `SPARK_EXECUTOR_ATTRIBUTE_EXECUTOR_ID` automatically if `spark.ui.custom.executor.log.url` is defined.

### Does this PR introduce _any_ user-facing change?

- No by default, because `spark.ui.custom.executor.log.url` is not set.
- When `spark.ui.custom.executor.log.url` is set,
    - For YARN users, there is no change because this is a K8s-only change.
    - For K8s users, this reduces the existing setup steps by providing the same environment variables automatically. In addition, user-defined variables always override these built-in environment variables by design.
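
The behavior described above can be sketched as a simple merge in which built-in defaults are applied first and user-supplied variables win. This is a hedged Python illustration, not the actual Spark code: the function name `build_executor_env` and the plain dictionaries are hypothetical, and the `$(VAR)` references assume Kubernetes-style dependent environment variables.

```python
def build_executor_env(conf, user_env):
    """Return executor env vars: built-in attribute defaults, then user values on top."""
    builtin = {}
    # Defaults are added only when the custom log URL is configured.
    if conf.get("spark.ui.custom.executor.log.url"):
        builtin["SPARK_EXECUTOR_ATTRIBUTE_APP_ID"] = "$(SPARK_APPLICATION_ID)"
        builtin["SPARK_EXECUTOR_ATTRIBUTE_EXECUTOR_ID"] = "$(SPARK_EXECUTOR_ID)"
    # Later keys win, so user-defined variables override the built-in ones.
    return {**builtin, **user_env}


# Case 1: the log URL is not set, so nothing is added.
no_url_env = build_executor_env({}, {"FOO": "bar"})

# Case 2: the log URL is set and the user overrides one attribute.
conf = {"spark.ui.custom.executor.log.url": "https://log-server/log?appId={{APP_ID}}"}
with_url_env = build_executor_env(conf, {"SPARK_EXECUTOR_ATTRIBUTE_APP_ID": "my-app"})
print(no_url_env)
print(with_url_env)
```

In the second case the user's `SPARK_EXECUTOR_ATTRIBUTE_APP_ID` value is kept, while the executor ID attribute falls back to the built-in default.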

### How was this patch tested?

No.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #47692 from dongjoon-hyun/SPARK-49190.

Authored-by: Dongjoon Hyun <dhyun@apple.com>
Signed-off-by: Dongjoon Hyun <dhyun@apple.com>