Categories
Misc

CUDA Pro Tip: Increase Performance with Vectorized Memory Access

GPU Pro TipMany CUDA kernels are bandwidth bound, and the increasing ratio of flops to bandwidth in new hardware results in more bandwidth bound kernels. This makes it…GPU Pro Tip

Source

Leave a Reply

Your email address will not be published. Required fields are marked *