Skip to main content

Performance Metrics

  • Issue efficiency, achieved occupancy, SM utilization
  • Memory throughput, L2 hit rate, DRAM utilization
  • Roofline perspective: compute vs bandwidth bound
  • Practical tuning checklist for Hybridizer-generated kernels