Anonymous View
Skip to content

feat(cli): add GPU count requests#1812

Open
elezar wants to merge 3 commits into
mainfrom
1444-gpu-cli-count/elezar
Open

feat(cli): add GPU count requests#1812
elezar wants to merge 3 commits into
mainfrom
1444-gpu-cli-count/elezar

Conversation

@elezar

@elezar elezar commented Jun 8, 2026

Copy link
Copy Markdown
Member

Summary

Adds structured GPU resource requirements for sandbox creation and updates the
CLI/API/runtime path so openshell sandbox create --gpu [COUNT] records GPU
intent in ResourceRequirements.gpu.

This is an intentional alpha API break: the public and compute-driver sandbox
specs now carry resource_requirements.gpu in place of the previous flat GPU
fields. Existing live or persisted legacy GPU intent is not migrated; callers
should use a matching OpenShell CLI/API version and recreate GPU sandboxes when
they need the new typed shape. RFC 0004 is updated to document that decision.

Related Issue

Part of #1444. Related to #1338, #1156, #1360, and #1492. Follow-up GPU support
preflight semantics are tracked in #1807.

Changes

  • Add ResourceRequirements.gpu.count to the public and compute-driver protos.
    A present GPU requirement with omitted count means one GPU; count = 0 is
    rejected.
  • Replace the older GPU CLI shape with --gpu for one GPU and --gpu COUNT
    for counted requests.
  • Pass GPU resource requirements through sandbox create, gateway-to-driver
    translation, provisioning timeout messages, and driver helper APIs.
  • Render Kubernetes nvidia.com/gpu limits from GPU requirements.
  • Keep exact device selection driver-owned through driver_config: Docker and
    Podman use cdi_devices, and VM uses gpu_device_ids.
  • Validate exact device requests consistently: device IDs are opaque, duplicate
    IDs are rejected, a single exact device works with default --gpu, and
    multi-device exact lists require --gpu COUNT matching the list length.
  • Add Docker and Podman default CDI selection for counted GPU requests. The
    selector refreshes CDI inventory before validation/create, picks from the
    normalized NVIDIA CDI inventory in round-robin order, fails when count exceeds
    selectable devices, and treats WSL2 nvidia.com/gpu=all fallback as one
    selectable device.
  • Keep VM GPU support limited to one GPU and reject VM counts above one.
  • Remove the legacy Python GPU E2E path and e2e:k3s:gpu; GPU E2E coverage now
    lives in the Rust Docker/Podman GPU device-selection suites.
  • Update GPU request docs, RFC 0004, architecture notes, and Docker/Podman/
    Kubernetes driver READMEs.

Testing

  • mise run pre-commit
  • /Users/elezar/.local/bin/mise exec -- cargo check -p openshell-core -p openshell-driver-docker -p openshell-driver-podman -p openshell-driver-vm -p openshell-driver-kubernetes
  • /Users/elezar/.local/bin/mise exec -- cargo test -p openshell-core -p openshell-driver-docker -p openshell-driver-podman -p openshell-driver-vm -p openshell-driver-kubernetes gpu --lib
  • /Users/elezar/.local/bin/mise exec -- cargo clippy -p openshell-core -p openshell-driver-docker -p openshell-driver-podman -p openshell-driver-vm -p openshell-driver-kubernetes --all-targets -- -D warnings

The PR has the test:e2e-gpu label so the required Docker GPU E2E gate runs in
CI for the updated head.

Checklist

  • Follows Conventional Commits
  • Commits are signed off (DCO)
  • Unit/integration tests updated
  • Architecture and user-facing docs updated

@elezar elezar requested a review from a team as a code owner June 8, 2026 13:10
@github-actions

github-actions Bot commented Jun 8, 2026

Copy link
Copy Markdown

@mrunalp

mrunalp commented Jun 8, 2026

Copy link
Copy Markdown
Collaborator

/ok to test abe5b79

@TaylorMutch

Copy link
Copy Markdown
Collaborator

/ok to test abe5b79

@elezar elezar marked this pull request as draft June 10, 2026 07:55
@copy-pr-bot

copy-pr-bot Bot commented Jun 10, 2026

Copy link
Copy Markdown

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@elezar

elezar commented Jun 10, 2026

Copy link
Copy Markdown
Member Author

Landing #1815 first should simplify the changes here.

@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch from abe5b79 to 06c69dd Compare June 12, 2026 07:12
@elezar elezar marked this pull request as ready for review June 12, 2026 07:29
@elezar elezar added gator:in-review Gator is reviewing or awaiting PR review feedback test:e2e-gpu Requires GPU end-to-end coverage labels Jun 12, 2026
@github-actions

Copy link
Copy Markdown

Label test:e2e-gpu applied for 06c69dd. Open the existing run and click Re-run all jobs to execute with the label set. The run will execute GPU E2E after building the required supervisor image once. The matching required CI gate status on this PR will flip green automatically once the run finishes.

@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

PR Review Status

Validation: this is maintainer-authored, project-valid GPU CLI/API/runtime work that aligns GPU sandbox intent with structured resource requirements and the related resource-requirements RFC direction.
Head SHA: 06c69dddf62dd74b3215bdc8e3dafc95ee2622a2

Review findings:

  • Blocking: crates/openshell-cli/src/run.rs moves resource_requirements into CreateSandboxRequest and then borrows it later for provisioning timeout messages. This is a Rust use-after-move compile failure; clone or otherwise retain the value before moving it into the request.
  • Blocking: proto/openshell.proto and proto/compute_driver.proto reuse field number 9 for resource_requirements, replacing the old bool gpu = 9 with a message. Old clients and persisted sandbox records encode field 9 as a varint, while the new schema expects length-delimited data, so prost can silently ignore the old GPU request. Reserve the old field and add resource_requirements on a new field, or keep a transitional deprecated field and map both safely.
  • Warning: --gpu now accepts an optional COUNT, so default GPU plus a trailing command requires the documented --gpu -- <command> form. The docs/tests cover that; keep the parse error crisp for accidental --gpu <command> usage.

Docs: Fern docs were updated under docs/; no navigation change appears necessary for this focused CLI/runtime behavior update.
E2E: test:e2e-gpu is applied because this changes GPU runtime behavior and user-visible GPU sandbox creation semantics.
Checks: OpenShell / Branch Checks is still pending for this head; OpenShell / Helm Lint and DCO are passing.

Next state: gator:in-review

@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After CI Update

I re-evaluated latest head 06c69dddf62dd74b3215bdc8e3dafc95ee2622a2 after OpenShell / Branch Checks completed successfully.

Disposition: partially resolved.

Remaining items:

  • Resolved: the earlier compile-failure concern in crates/openshell-cli/src/run.rs is cleared by the passing Rust checks and the current diff retaining resource_requirements for the provisioning timeout path.
  • Still blocking: proto/openshell.proto and proto/compute_driver.proto still replace the old bool gpu = 9 fields with length-delimited ResourceRequirements resource_requirements = 9. That is a protobuf wire-compatibility break for existing clients and stored sandbox records. Please reserve the old field number and add resource_requirements on a new field, or keep a deprecated transitional field and map both safely.
  • OpenShell / GPU E2E is still pending for this head with test:e2e-gpu applied.

Next state: gator:in-review

@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After CI Update

I re-evaluated latest head 06c69dddf62dd74b3215bdc8e3dafc95ee2622a2 after the required gates completed successfully.

Disposition: partially resolved.

Remaining items:

  • Resolved: OpenShell / Branch Checks, OpenShell / Helm Lint, DCO, and OpenShell / GPU E2E are now green for this head with test:e2e-gpu applied.
  • Still blocking: proto/openshell.proto and proto/compute_driver.proto still replace the old bool gpu = 9 fields with length-delimited ResourceRequirements resource_requirements = 9. That is a protobuf wire-compatibility break for existing clients and stored sandbox records. Please reserve the old field number and add resource_requirements on a new field, or keep a deprecated transitional field and map both safely.

Next state: gator:in-review

@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch from 06c69dd to 87c9a8c Compare June 12, 2026 10:26
@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After Author Update

I re-evaluated latest head 87c9a8c6015bc23d48b11823540b44d85fa24399 after the branch update following the prior protobuf compatibility feedback.

Disposition: not resolved.

Remaining items:

  • Still blocking: proto/openshell.proto and proto/compute_driver.proto still define resource_requirements = 9, replacing the previous bool gpu = 9 field with a length-delimited message. This remains a protobuf wire-compatibility break for old clients and stored sandbox records. Please keep a deprecated transitional gpu = 9 field and map it when the new field is absent, or reserve field 9 and add resource_requirements on a new field number.
  • Still blocking: crates/openshell-cli/src/run.rs still moves resource_requirements into CreateSandboxRequest.spec.resource_requirements and later reads resource_requirements.as_ref() for provisioning timeout messages. Please clone or otherwise retain that value before moving it into the request.
  • Additional warning from the independent re-review: Docker and Podman should also reject count-only GPU requests at the create/build sink, not only during validation, so a missed validation path cannot over-grant all CDI GPUs.
  • OpenShell / Branch Checks, OpenShell / Helm Lint, and DCO are passing for this head. OpenShell / GPU E2E is still pending with test:e2e-gpu applied.

Next state: gator:in-review

@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After CI Update

I re-evaluated latest head 87c9a8c6015bc23d48b11823540b44d85fa24399 after the required gates completed successfully and after a focused re-check of the current diff.

Disposition: partially resolved.

Remaining items:

  • Resolved: the prior crates/openshell-cli/src/run.rs move/borrow compile concern is no longer present in the current diff, and OpenShell / Branch Checks is green for this head.
  • Resolved: the prior Docker/Podman count-only GPU warning is addressed at the driver validation/build paths. Docker and Podman now reject count-only GPU requests without driver_config.cdi_devices, and they validate exact CDI device requests at the sink.
  • Still blocking: proto/openshell.proto and proto/compute_driver.proto still define resource_requirements = 9, replacing the previous bool gpu = 9 field with a length-delimited message. That remains a protobuf wire-compatibility break for old clients and stored sandbox records. Please keep a deprecated transitional gpu = 9 field and map it when the new field is absent, or reserve field 9 and add resource_requirements on a new field number.
  • Checks: OpenShell / Branch Checks, OpenShell / Helm Lint, DCO, and OpenShell / GPU E2E are green for this head with test:e2e-gpu applied.

Next state: gator:in-review

@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

Still blocking: proto/openshell.proto and proto/compute_driver.proto still define resource_requirements = 9, replacing the previous bool gpu = 9 field with a length-delimited message. That remains a protobuf wire-compatibility break for old clients and stored sandbox records. Please keep a deprecated transitional gpu = 9 field and map it when the new field is absent, or reserve field 9 and add resource_requirements on a new field number.

The breaking proto change is intentional. However, I will defer to @drew on whether we should rather reserve the previous gpu field and introduce this at a new number.

@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After Author Update

I re-evaluated latest head 87c9a8c6015bc23d48b11823540b44d85fa24399 after @elezar's 2026-06-12 11:33 UTC comment that the protobuf wire-compatibility break is intentional and should be decided by @drew.

Disposition: not resolved.

Remaining items:

  • The protobuf compatibility concern remains blocking for gator: proto/openshell.proto and proto/compute_driver.proto still define resource_requirements = 9, replacing the previous bool gpu = 9 field with a length-delimited message. Please either change the field numbering/transitional mapping, or have an appropriate maintainer explicitly accept that compatibility break on this PR.
  • Checks are green for this head with test:e2e-gpu applied.

Next state: gator:in-review

@elezar elezar enabled auto-merge (squash) June 12, 2026 12:43
@elezar elezar disabled auto-merge June 12, 2026 12:44
@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch from 87c9a8c to da6fbd8 Compare June 12, 2026 12:55
@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After Author Update

I re-evaluated latest head da6fbd8752372c86a834d59edb238d61d38dc9b7 after the branch update that followed the prior protobuf compatibility feedback.

Disposition: not resolved.

Remaining items:

  • Still blocking: crates/openshell-cli/src/run.rs constructs resource_requirements, moves it into CreateSandboxRequest.spec.resource_requirements, and later borrows resource_requirements.as_ref() for provisioning timeout messages. Please clone the value into the request or retain a separate GPU-request flag/value before moving it.
  • Still blocking: proto/openshell.proto and proto/compute_driver.proto still define resource_requirements = 9, replacing the previous bool gpu = 9 field with a length-delimited message. This remains a protobuf wire-compatibility break for old clients, old drivers, and persisted sandbox records. Please either change the field numbering/transitional mapping, or have an appropriate maintainer explicitly accept that compatibility break on this PR.
  • Resolved: the prior Docker/Podman count-only GPU warning remains addressed. Docker and Podman reject count-only GPU requests without driver_config.cdi_devices, and they validate exact CDI device requests at the sink.
  • Checks: OpenShell / Branch Checks and OpenShell / GPU E2E are still pending for this head; OpenShell / Helm Lint, DCO, and the docs preview are passing.

Next state: gator:in-review

@elezar elezar enabled auto-merge (squash) June 12, 2026 13:03
@elezar

elezar commented Jun 12, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After CI and Independent Review

I re-evaluated latest head da6fbd8752372c86a834d59edb238d61d38dc9b7 after the required gates completed successfully and after a bounded independent review of the current diff.

Disposition: not resolved.

Remaining items:

  • Still blocking: crates/openshell-cli/src/run.rs moves resource_requirements into CreateSandboxRequest.spec.resource_requirements and later borrows resource_requirements.as_ref() for provisioning timeout messages. Please clone the value into the request or retain a separate timeout value before the move.
  • Still blocking: proto/openshell.proto and proto/compute_driver.proto still define resource_requirements = 9, replacing the previous bool gpu = 9 field with a length-delimited message. This remains a protobuf wire-compatibility break for old clients, old drivers, and persisted sandbox records unless an appropriate maintainer explicitly accepts that break on this PR.
  • Warning: Docker and Podman reject count-only GPU requests during create validation, but the lower-level device builders can still translate gpu.count = Some(_) without exact devices into the all-GPUs CDI request if reached directly. Please mirror the count-only rejection at the builder sink and add builder-level tests, or explain why no alternate path can reach those builders without prior validation.
  • Checks are now green for this head: OpenShell / Branch Checks, OpenShell / Helm Lint, DCO, OpenShell / E2E, and OpenShell / GPU E2E are passing with test:e2e-gpu applied.

Next state: gator:in-review

@elezar

elezar commented Jun 13, 2026

Copy link
Copy Markdown
Member Author

Responding to the two blocking findings from gator:

  1. crates/openshell-cli/src/run.rs is not a use-after-move compile failure. The generated prost types for ResourceRequirements and GpuResourceRequirements derive Copy, so Option<ResourceRequirements> is also Copy. I verified this by running cargo check -p openshell-cli -j 1 successfully. I also tested the proposed clone fix, and cargo clippy --workspace --all-targets -- -D warnings rejects it with clippy::clone_on_copy, confirming the original code is the correct form for the generated type.

  2. The protobuf field-number change is an intentional alpha API break. The compute-driver proto is not currently treated as a public compatibility surface: gateway and driver lifetimes are tightly coupled, and local drivers are launched by the gateway at startup. For the public SandboxSpec, direct API use is currently limited to the matching OpenShell CLI. We are also intentionally not preserving live or persisted legacy GPU intent across this transition; GPU sandboxes should be recreated after upgrade if they need the new typed resource-requirements shape.

Given that the API is still alpha, we do not want to carry legacy GPU-specific reserved or transitional fields forward into the proto shape we intend to stabilize. I will update RFC 0004 in this branch to reflect that decision, since the current RFC text still describes reserving the old GPU fields.

@elezar

elezar commented Jun 15, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After Author Update

I re-evaluated latest head c7cf9d635895b4802aedee4e9563ce4db1020e47 after @elezar's 2026-06-13 comment responding to the prior protobuf compatibility and use-after-move findings, and after a bounded independent review of the current diff.

Disposition: partially resolved.

Remaining items:

  • Resolved: the prior crates/openshell-cli/src/run.rs use-after-move concern is not blocking. The generated ResourceRequirements type is Copy, and the required Rust checks are green for this head.
  • Resolved: the protobuf field-number compatibility concern is no longer blocking for gator after the RFC update. rfc/0004-sandbox-resource-requirements/README.md now explicitly records the alpha-era decision to replace field 9 rather than preserve transitional GPU fields.
  • Still blocking: e2e/python/conftest.py still constructs datamodel_pb2.SandboxSpec(gpu=True, ...). This PR replaces the public gpu field with resource_requirements, so the Python GPU E2E fixture should be updated to set resource_requirements.gpu and keep that path covering the new API shape.
  • Still blocking: Docker and Podman now reject count-only GPU requests during create validation, but the lower-level CDI/device build helpers can still translate gpu.count = Some(_) with no explicit CDI devices into nvidia.com/gpu=all if reached directly. Please reject count-only requests at that sink as defense in depth, or make the shared CDI resolution helper return an error for that shape.
  • Suggested coverage: add an integration or E2E assertion that Docker/Podman --gpu 2 is rejected, so the user-facing request path and build-device translation boundary stay covered.
  • Checks are green for this head: OpenShell / Branch Checks, OpenShell / Helm Lint, DCO, and OpenShell / GPU E2E are passing with test:e2e-gpu applied.

Next state: gator:in-review

@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch from c7cf9d6 to b681234 Compare June 16, 2026 20:09
@elezar elezar requested a review from maxamillion as a code owner June 16, 2026 20:09
@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch from b681234 to 60a7ccd Compare June 16, 2026 20:24
@elezar

elezar commented Jun 16, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After Author Update

I re-evaluated latest head 60a7ccd17cc6933759d5b134bf0277ae2e9f5acb after the branch updates following the prior GPU API/runtime feedback. I also re-checked the forced-update delta from b6812340 to 60a7ccd1; that delta only changes crates/openshell-driver-kubernetes/src/driver.rs and does not affect the items below.

Disposition: not resolved.

Remaining items:

  • Still blocking: the Python GPU E2E public API coverage was removed instead of migrated. e2e/python/test_sandbox_gpu.py is deleted and e2e/python/conftest.py no longer provides the GPU sandbox fixture, so this PR no longer proves that the Python/proto API can request a GPU sandbox with the new resource_requirements.gpu shape and execute nvidia-smi. Please restore that marked GPU E2E coverage using ResourceRequirements(gpu=GpuResourceRequirements(...)) rather than the old gpu=True field.
  • Still blocking: docs/reference/sandbox-compute-drivers.mdx still says Docker and Podman reject count-only GPU selection, but the current implementation resolves count-only requests through refreshed CDI inventory and round-robin selection. docs/sandboxes/manage-sandboxes.mdx documents the new behavior correctly; please update the compute-driver reference to match.
  • Resolved: the lower-level Docker/Podman CDI/device sinks no longer appear to translate count-only GPU requests directly into all-GPU/default injection without validation. The create/validate paths resolve concrete CDI device IDs before the container specs are built, and the builder sinks only inject provided concrete IDs.
  • Resolved: CLI parsing and request construction for --gpu, --gpu COUNT, zero rejection, and trailing command parsing are covered, and sandbox creation now sends SandboxSpec.resource_requirements.gpu.

Checks: the latest forced-push head is still early in CI; DCO and required status publication are pending at the time of this re-check. The review findings above must be addressed before gator can move this PR to pipeline watch.

Next state: gator:in-review

@elezar elezar added gator:in-review Gator is reviewing or awaiting PR review feedback and removed gator:in-review Gator is reviewing or awaiting PR review feedback labels Jun 16, 2026
@elezar

elezar commented Jun 16, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After CI Update

I re-evaluated latest head 60a7ccd17cc6933759d5b134bf0277ae2e9f5acb after OpenShell / Branch Checks completed.

Disposition: not resolved.

Remaining items:

  • Still blocking: OpenShell / Branch Checks failed in cargo fmt --all -- --check. The failed logs show crates/openshell-driver-kubernetes/src/driver.rs needs rustfmt at the GPU limit assertion around line 2694.
  • Still blocking: the Python GPU E2E public API coverage was removed instead of migrated. Please restore marked GPU E2E coverage using ResourceRequirements(gpu=GpuResourceRequirements(...)) rather than the old gpu=True field.
  • Still blocking: docs/reference/sandbox-compute-drivers.mdx still says Docker and Podman reject count-only GPU selection, while the current implementation resolves count-only requests through refreshed CDI inventory and round-robin selection. Please update the reference docs to match docs/sandboxes/manage-sandboxes.mdx.
  • OpenShell / GPU E2E is still pending for this head with test:e2e-gpu applied.

Next state: gator:in-review

@elezar

elezar commented Jun 16, 2026

Copy link
Copy Markdown
Member Author

gator-agent

Re-check After CI and Independent Review

I re-evaluated latest head 60a7ccd17cc6933759d5b134bf0277ae2e9f5acb after the required GPU E2E gate completed and after a bounded independent review of the current diff.

Disposition: not resolved.

Remaining items:

  • Still blocking: OpenShell / Branch Checks failed in cargo fmt --all -- --check. The failed logs show crates/openshell-driver-kubernetes/src/driver.rs needs rustfmt at the GPU limit assertion around line 2697.
  • Still blocking: the independent review re-raised the protobuf wire-compatibility concern. proto/openshell.proto and proto/compute_driver.proto reuse field 9, changing it from bool gpu to ResourceRequirements resource_requirements. Please either keep the legacy gpu = 9 field and put resource_requirements on a new tag with transitional mapping, or get explicit maintainer acceptance for the version-skew behavior on this PR.
  • Still blocking: Python GPU E2E public API coverage was removed instead of migrated. Please restore coverage using SandboxSpec(resource_requirements=...) so the Python generated-proto/API path for GPU sandbox creation remains exercised.
  • Still blocking: docs/reference/sandbox-compute-drivers.mdx still says Docker and Podman reject count-only GPU selection, while docs/sandboxes/manage-sandboxes.mdx and the implementation describe counted default CDI selection. Please make the reference doc match the intended behavior.
  • Resolved: OpenShell / GPU E2E is now green for this head with test:e2e-gpu applied. OpenShell / Helm Lint, DCO, and docs preview are also green.

Independent review summary: no additional code-level blocker was found in this bounded pass beyond the items above.

Next state: gator:in-review

@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch 3 times, most recently from 35c7ef2 to 8e108bb Compare June 17, 2026 08:22
@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch from 8e108bb to cad3745 Compare June 17, 2026 09:56
elezar added 3 commits June 17, 2026 15:14
Remove the Python GPU smoke test and its fixture. The e2e:k3s:gpu task only depended on e2e:python:gpu and did not have a separate k3s implementation, so remove that stale alias with the task it pointed at.

Signed-off-by: Evan Lezar <elezar@nvidia.com>
BREAKING CHANGE: SandboxSpec.gpu and DriverSandboxSpec.gpu were replaced with resource_requirements.gpu, changing protobuf field 9 from a bool to a message for both public and driver APIs.

Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
@elezar elezar force-pushed the 1444-gpu-cli-count/elezar branch from cad3745 to 82f88d8 Compare June 17, 2026 13:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

gator:in-review Gator is reviewing or awaiting PR review feedback test:e2e-gpu Requires GPU end-to-end coverage

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants