You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since CUDA 11.7 new cudaStreamOps_v2 introduced which doesn't require kernel module parameter to be set (https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__MEMOP.html#group__CUDA__MEMOP). This lower requirements on using these APIs.
As far as i understand this will allow lower number of kernels launched (i.e. executor_start or executor_shutdown_ack).
Also, as far as i understand from sources, STREAM_TASK_MODE=driver is not really fully supported, so it would be great to see those API used is some way or another.
Thanks,
The text was updated successfully, but these errors were encountered:
Since CUDA 11.7 new cudaStreamOps_v2 introduced which doesn't require kernel module parameter to be set (https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__MEMOP.html#group__CUDA__MEMOP). This lower requirements on using these APIs.
As far as i understand this will allow lower number of kernels launched (i.e.
executor_start
orexecutor_shutdown_ack
).Also, as far as i understand from sources,
STREAM_TASK_MODE=driver
is not really fully supported, so it would be great to see those API used is some way or another.Thanks,
The text was updated successfully, but these errors were encountered: