cudaTest

A CUDA speed-up demo. It covers uploading a cv::Mat to the GPU and writing CUDA kernels that use __shfl_down, atomicAdd, __ballot, __popc...

Running environment: Visual Studio 2019, OpenCV, CUDA 11.x

Zhihu article: https://zhuanlan.zhihu.com/p/608592612
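
To give a feel for how the intrinsics listed above fit together, here is a minimal sketch of a warp-level sum combined with a non-zero count. It is not taken from this repository: the names `warpReduceSum`, `sumAndCountNonZero`, `SumCount`, and `sumAndCountOnDevice` are hypothetical, the input is a plain `int` array rather than a cv::Mat, and error checking is left out for brevity. It also assumes the `_sync` variants (`__shfl_down_sync`, `__ballot_sync`), since the older `__shfl_down` and `__ballot` forms are deprecated from CUDA 9 onward.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Sum the values held by the 32 lanes of a warp; lane 0 ends up with the total.
__inline__ __device__ int warpReduceSum(int val) {
    for (int offset = warpSize / 2; offset > 0; offset /= 2) {
        val += __shfl_down_sync(0xffffffffu, val, offset);
    }
    return val;
}

// Hypothetical kernel: accumulates the sum of data[0..n) into *sum and the
// number of non-zero elements into *nonZero. blockDim.x is assumed to be a
// multiple of 32 so that every warp is full when the _sync intrinsics run.
__global__ void sumAndCountNonZero(const int* data, int n, int* sum, int* nonZero) {
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    int val = (idx < n) ? data[idx] : 0;  // out-of-range threads contribute 0

    // One bit per lane, set where that lane's value is non-zero.
    unsigned mask = __ballot_sync(0xffffffffu, val != 0);
    int warpSum = warpReduceSum(val);

    // Only lane 0 of each warp touches global memory: one atomicAdd per
    // warp and per counter instead of one per thread.
    if ((threadIdx.x & (warpSize - 1)) == 0) {
        atomicAdd(sum, warpSum);
        atomicAdd(nonZero, __popc(mask));
    }
}

struct SumCount {
    int sum;
    int nonZero;
};

// Host wrapper (error checking omitted to keep the sketch short).
SumCount sumAndCountOnDevice(const std::vector<int>& host) {
    const int n = static_cast<int>(host.size());
    if (n == 0) return {0, 0};

    int* dData = nullptr;
    int* dOut = nullptr;  // dOut[0] = sum, dOut[1] = non-zero count
    cudaMalloc(&dData, n * sizeof(int));
    cudaMalloc(&dOut, 2 * sizeof(int));
    cudaMemcpy(dData, host.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemset(dOut, 0, 2 * sizeof(int));

    const int threads = 256;  // multiple of the warp size
    const int blocks = (n + threads - 1) / threads;
    sumAndCountNonZero<<<blocks, threads>>>(dData, n, dOut, dOut + 1);

    int out[2] = {0, 0};
    cudaMemcpy(out, dOut, sizeof(out), cudaMemcpyDeviceToHost);  // waits for the kernel

    cudaFree(dData);
    cudaFree(dOut);
    return {out[0], out[1]};
}
```

Calling `sumAndCountOnDevice` on a `std::vector<int>` returns both totals in one pass. The point of the design is that the shuffle and ballot steps do the per-warp work in registers, so the atomics only run once per warp.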