CUDA BasicsPerformance MetricsPerformance Metrics Issue efficiency, achieved occupancy, SM utilization Memory throughput, L2 hit rate, DRAM utilization Roofline perspective: compute vs bandwidth bound Practical tuning checklist for Hybridizer-generated kernels