CUDA Backend

  • Targets NVIDIA GPUs using the CUDA toolchain
  • Supports kernel generation, streams, and shared memory
  • Requires a compatible NVIDIA driver and CUDA toolkit
  • Best suited to massively parallel, throughput-bound workloads; account for host-device transfer costs
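
To make the points above concrete, here is a minimal hand-written sketch of the kind of code such a backend targets. It is not the output of this backend: the names `block_sum`, `sum_on_gpu`, and `kThreads` are hypothetical, and only standard CUDA runtime calls are used. It shows a kernel that reduces in shared memory, a stream that orders the work, and the two host-device copies that make up the transfer cost.

```cuda
#include <vector>
#include <cuda_runtime.h>

// Threads per block (illustrative choice). Must be a power of two
// for the tree reduction in block_sum to cover every element.
constexpr int kThreads = 256;

// Each block loads its slice of `in` into shared memory, reduces it,
// and writes one partial sum per block.
__global__ void block_sum(const float* in, float* partial, int n) {
    __shared__ float tile[kThreads];
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    tile[threadIdx.x] = (i < n) ? in[i] : 0.0f;
    __syncthreads();
    for (int stride = blockDim.x / 2; stride > 0; stride /= 2) {
        if (threadIdx.x < stride) {
            tile[threadIdx.x] += tile[threadIdx.x + stride];
        }
        __syncthreads();
    }
    if (threadIdx.x == 0) {
        partial[blockIdx.x] = tile[0];
    }
}

// Hypothetical helper: sums `n` floats from host memory on the GPU.
// Returns false if n <= 0 or any CUDA call fails (for example, when
// no compatible driver or device is present).
bool sum_on_gpu(const float* host_in, int n, float* out) {
    if (n <= 0) return false;
    const int blocks = (n + kThreads - 1) / kThreads;
    std::vector<float> partial(blocks);
    float* d_in = nullptr;
    float* d_partial = nullptr;
    cudaStream_t stream = nullptr;

    // Copy in, launch, and copy out are all queued on one stream, so
    // they run in order. With ordinary (pageable) host memory, as here,
    // the async copies may not overlap with host work.
    bool ok = cudaStreamCreate(&stream) == cudaSuccess
           && cudaMalloc(&d_in, n * sizeof(float)) == cudaSuccess
           && cudaMalloc(&d_partial, blocks * sizeof(float)) == cudaSuccess
           && cudaMemcpyAsync(d_in, host_in, n * sizeof(float),
                              cudaMemcpyHostToDevice, stream) == cudaSuccess;
    if (ok) {
        block_sum<<<blocks, kThreads, 0, stream>>>(d_in, d_partial, n);
        ok = cudaGetLastError() == cudaSuccess
          && cudaMemcpyAsync(partial.data(), d_partial,
                             blocks * sizeof(float),
                             cudaMemcpyDeviceToHost, stream) == cudaSuccess
          && cudaStreamSynchronize(stream) == cudaSuccess;
    }

    cudaFree(d_in);       // freeing a null pointer is a no-op
    cudaFree(d_partial);
    if (stream) cudaStreamDestroy(stream);
    if (!ok) return false;

    // Finish the reduction on the host: one value per block.
    float total = 0.0f;
    for (float p : partial) total += p;
    *out = total;
    return true;
}
```

Built with `nvcc`, a caller would pass a host buffer and check the return value before reading `*out`. For an input this simple, the two copies can easily cost more than the kernel itself, which is the transfer-cost trade-off noted in the last bullet.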

See also: CUDA Basics and Performance Metrics.