Fix NVML tests for machines with more than one device #1352

mdboom · 2025-12-10T00:50:04Z

Description

As @rwgk discovered, on machines with more than one device, all of the tests using the `for_all_devices` fixture fail with the error "fixture function has more than one 'yield'". I didn't realize that wasn't allowed. This PR changes the fixture to yield the whole list of devices once, so each test iterates over the devices itself.
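For illustration, here is a minimal sketch of the fixture change described above. It is not the code from this PR: `enumerate_devices` and the device names are hypothetical stand-ins for the real NVML device enumeration, and the "before" fixture is only a guess at the general shape of the failing pattern.

```python
import pytest


def enumerate_devices():
    """Hypothetical stand-in for NVML device enumeration (not the real API)."""
    return ["device-0", "device-1"]


@pytest.fixture
def for_all_devices_yield_per_device():
    # Assumed "before" pattern: one yield per device. pytest expects a fixture
    # generator to yield exactly once, so with two or more devices it reports
    # "fixture function has more than one 'yield'".
    for device in enumerate_devices():
        yield device


@pytest.fixture
def for_all_devices():
    # "After" pattern: yield once, handing the test the whole list.
    yield list(enumerate_devices())


def test_all_devices_have_names(for_all_devices):
    # The test now loops over the devices itself.
    for device in for_all_devices:
        assert device.startswith("device-")
```

With a single device the per-device loop happens to yield only once, which would explain why the problem only shows up on multi-device machines.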

copy-pr-bot · 2025-12-10T00:50:07Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

mdboom · 2025-12-10T00:50:19Z

/ok to test

github-actions · 2025-12-10T01:01:32Z

Doc Preview CI
🚀 View preview at https://nvidia.github.io/cuda-python/pr-preview/pr-1352/
https://nvidia.github.io/cuda-python/pr-preview/pr-1352/cuda-core/
https://nvidia.github.io/cuda-python/pr-preview/pr-1352/cuda-bindings/
https://nvidia.github.io/cuda-python/pr-preview/pr-1352/cuda-pathfinder/
Preview will be ready when the GitHub Pages deployment is complete.

rwgk

LGTM, although I'd try to preserve the skip information. I think it's worth a small effort, so that we can reason about what we're actually testing. E.g., where you have the `continue`, maybe build a `defaultdict(int)` with the current skip messages as keys; when the end of a subtest is reached, check whether the dict is empty and, if it isn't, call `pytest.skip` with the keys/counts in the message.

I'd try to give that as a prompt to Cursor. Good chance it'll "just do it".
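The suggestion above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code that was merged: `NotSupportedError`, `query_fan_speed`, `summarize_skips`, and the device names are all hypothetical.

```python
from collections import defaultdict

import pytest


class NotSupportedError(Exception):
    """Hypothetical stand-in for an NVML "not supported" error."""


def query_fan_speed(device):
    """Hypothetical query that is unsupported on one of the devices."""
    if device == "device-1":
        raise NotSupportedError("fan speed not supported")
    return 42


def summarize_skips(skip_reasons):
    # Turn {"reason": count} into a single message for pytest.skip.
    return "; ".join(
        f"{reason} (x{count})" for reason, count in sorted(skip_reasons.items())
    )


@pytest.fixture
def for_all_devices():
    # Hypothetical device list; yields the whole list exactly once.
    yield ["device-0", "device-1"]


def test_fan_speed(for_all_devices):
    skip_reasons = defaultdict(int)
    for device in for_all_devices:
        try:
            speed = query_fan_speed(device)
        except NotSupportedError as exc:
            # Record the reason instead of silently continuing.
            skip_reasons[str(exc)] += 1
            continue
        assert speed >= 0
    # After all devices are checked, report what was skipped (if anything).
    if skip_reasons:
        pytest.skip(summarize_skips(skip_reasons))
```

One consequence of this design is that the test is reported as skipped even when some devices passed their assertions; the skip message at least records which checks did not run and how often.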

mdboom · 2025-12-10T01:18:45Z

/ok to test

mdboom · 2025-12-10T01:19:37Z

Good idea.

> I'd try to give that as a prompt to Cursor. Good chance it'll "just do it".

I did one of these, and VSCode/Copilot was able to infer the rest.

rwgk

The TestIpcReexport.test_main segfault is interesting. @Andy-Jost for awareness:

https://github.com/NVIDIA/cuda-python/actions/runs/20083999706/job/57617779182?pr=1352

I expect a rerun will deflake that.

Removed preview folders for the following PRs: - PR #1352

* Add experimental NVML bindings to 12.9.x branch
* Add tests from main to 12.9.x branch
* Remove newer APIs
* Remove hand-written 13.0 bindings
* Queue up 'skip reasons' (#1352)
* Handle backport correctly
* Fix cimport
* More versioned struct fixes
* Fix test for 12.9
* Add 13.1 bindings

mdboom requested a review from rwgk December 10, 2025 00:50

rwgk approved these changes Dec 10, 2025

Queue up 'skip reasons'

634abf3

mdboom force-pushed the fix-nvml-tests-multiple-devices branch from 7580dfa to 634abf3 Compare December 10, 2025 01:18

mdboom enabled auto-merge (squash) December 10, 2025 01:24

rwgk approved these changes Dec 10, 2025

mdboom merged commit 4886636 into NVIDIA:main Dec 10, 2025
79 of 80 checks passed

github-actions bot pushed a commit that referenced this pull request Dec 10, 2025

Clean up PR preview folders for 1 closed/merged PRs

f9ccbf4


mdboom added a commit to mdboom/cuda-python that referenced this pull request Dec 10, 2025

Queue up 'skip reasons' (NVIDIA#1352)

2060af5

leofang added this to the cuda-python 13-next, 12-next milestone Dec 14, 2025

leofang assigned mdboom Dec 14, 2025

leofang added the bug, test, cuda.bindings, and P0 labels Dec 14, 2025
