Class CudaTensorCoreManager
- Namespace
- DotCompute.Backends.CUDA.Advanced
- Assembly
- DotCompute.Backends.CUDA.dll
Manager for CUDA Tensor Core operations (RTX 2000 Ada specific)
public sealed class CudaTensorCoreManager : IDisposable
- Inheritance
-
CudaTensorCoreManager
- Implements
- Inherited Members
- Extension Methods
Constructors
CudaTensorCoreManager(CudaContext, CudaDeviceProperties, ILogger)
Initializes a new instance of the CudaTensorCoreManager class.
public CudaTensorCoreManager(CudaContext context, CudaDeviceProperties deviceProperties, ILogger logger)
Parameters
contextCudaContextThe context.
devicePropertiesCudaDevicePropertiesThe device properties.
loggerILoggerThe logger.
Properties
IsSupported
Gets whether Tensor Cores are supported on this device
public bool IsSupported { get; }
Property Value
Metrics
Gets performance metrics for Tensor Core usage
public CudaTensorCoreMetrics Metrics { get; }
Property Value
TensorCoreGeneration
Gets the Tensor Core generation
public int TensorCoreGeneration { get; }
Property Value
Methods
Dispose()
Performs dispose.
public void Dispose()
ExecuteOptimizedGEMMAsync(CudaTensorGEMMOperation, CancellationToken)
Creates an optimized GEMM operation using Tensor Cores
public Task<CudaTensorCoreExecutionResult> ExecuteOptimizedGEMMAsync(CudaTensorGEMMOperation gemmOp, CancellationToken cancellationToken = default)
Parameters
gemmOpCudaTensorGEMMOperationcancellationTokenCancellationToken
Returns
ExecuteTensorOperationAsync(CudaTensorOperation, CancellationToken)
Executes a Tensor Core optimized operation
public Task<CudaTensorCoreExecutionResult> ExecuteTensorOperationAsync(CudaTensorOperation operation, CancellationToken cancellationToken = default)
Parameters
operationCudaTensorOperationcancellationTokenCancellationToken
Returns
OptimizeKernelAsync(CudaCompiledKernel, KernelArgument[], CancellationToken)
Optimizes a kernel for Tensor Core acceleration
public Task<CudaOptimizationResult> OptimizeKernelAsync(CudaCompiledKernel kernel, KernelArgument[] arguments, CancellationToken cancellationToken = default)
Parameters
kernelCudaCompiledKernelargumentsKernelArgument[]cancellationTokenCancellationToken
Returns
OptimizeMemoryLayout(CudaTensorDescriptor, CudaTensorPrecision)
Optimizes memory layout for Tensor Core operations
public CudaTensorMemoryLayout OptimizeMemoryLayout(CudaTensorDescriptor descriptor, CudaTensorPrecision precision)
Parameters
descriptorCudaTensorDescriptorprecisionCudaTensorPrecision
Returns
PerformMaintenance()
Performs maintenance operations
public void PerformMaintenance()