[VSphere] 64bitMMIOSizeGB should be configurable #48269
Labels
bug
Something that is supposed to be working; but isn't
core
Issues that should be addressed in Ray Core
core-clusters
For launching and managing Ray clusters/jobs/kubernetes
P2
Important issue, but not time-critical
What happened + What you expected to happen
Hi all,
ray/python/ray/autoscaler/_private/vsphere/gpu_utils.py
Line 272 in fc34890
This line sets the 64bitMMIOSizeGB parameter to 64, but the required value depends on the GPU being passed through:
https://docs.nvidia.com/ai-enterprise/latest/release-notes/index.html#tesla-p40-large-memory-vms
I think we should add a configuration option for this in the vSphere gpu_config.
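To illustrate the idea, here is a minimal sketch of how the value could be read from gpu_config with a fallback to the current 64. This is not the existing Ray code: the helper name `build_mmio_extra_config` and the `mmio_size_gb` key are hypothetical, chosen only for this example. The two `pciPassthru.*` keys are the standard VMware advanced VM settings for 64-bit MMIO.

```python
from typing import Any, Dict, Optional

# The value the issue reports as currently hardcoded.
DEFAULT_MMIO_SIZE_GB = 64


def build_mmio_extra_config(gpu_config: Optional[Dict[str, Any]]) -> Dict[str, str]:
    """Build the VM advanced settings for 64-bit MMIO.

    `mmio_size_gb` is a hypothetical gpu_config key; when it is absent,
    the current default of 64 is kept so existing configs behave the same.
    """
    size_gb = (gpu_config or {}).get("mmio_size_gb", DEFAULT_MMIO_SIZE_GB)

    # Reject bools explicitly, since bool is a subclass of int in Python.
    if isinstance(size_gb, bool) or not isinstance(size_gb, int) or size_gb <= 0:
        raise ValueError(
            f"mmio_size_gb must be a positive integer, got {size_gb!r}"
        )

    # vSphere extra-config values are passed as strings.
    return {
        "pciPassthru.use64bitMMIO": "TRUE",
        "pciPassthru.64bitMMIOSizeGB": str(size_gb),
    }
```

With something like this, a user with a large-memory GPU could set `mmio_size_gb: 128` (or whatever the GPU vendor documentation recommends) under gpu_config, and everyone else would keep the current behaviour.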
Versions / Dependencies
Ray-2.38.0
Reproduction script
Launch a cluster on vSphere with a GPU that requires more than 64 GB of MMIO space.
Issue Severity
High: It blocks me from completing my task.