CUDA Backend
- Targets NVIDIA GPUs using CUDA toolchain
- Supports kernel generation, streams, shared memory usage
- Requires compatible driver + CUDA toolkit
- Best for massive parallel throughput; consider transfer costs
See also: CUDA Basics and Performance Metrics.