This repository was archived by the owner on Sep 25, 2023. It is now read-only.

Description
Hi, I'm using cuSignal in my real-time processing application, and the upfirdn kernel is the bottleneck. The kernel does not look highly optimized to me: it makes no use of shared memory (e.g. for the FIR taps) or Tensor Cores. Do you plan to improve the upfirdn kernels in the future?
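To make the point about the FIR taps concrete, here is a minimal pure-Python sketch of the upfirdn operation as I understand it (upsample by zero insertion, FIR filter, downsample). The name `upfirdn_reference` is made up for illustration, and this is not cuSignal's kernel code; it only shows the access pattern I am assuming, with one loop iteration standing in for one GPU thread.

```python
def upfirdn_reference(h, x, up=1, down=1):
    """Upsample x by `up` (zero insertion), filter with FIR taps h,
    then keep every `down`-th sample of the result."""
    if not h or not x:
        raise ValueError("h and x must be non-empty")
    if up < 1 or down < 1:
        raise ValueError("up and down must be >= 1")

    up_len = (len(x) - 1) * up + 1               # length of the zero-stuffed signal
    out_len = (up_len + len(h) - 2) // down + 1  # full convolution, decimated by `down`

    y = [0.0] * out_len
    for n in range(out_len):          # in a kernel: roughly one thread per output sample
        pos = n * down                # position in the full-rate convolution
        acc = 0.0
        for k, tap in enumerate(h):   # every output walks over the same taps
            j = pos - k               # index into the zero-stuffed signal
            if 0 <= j < up_len and j % up == 0:  # skip the inserted zeros
                acc += tap * x[j // up]
        y[n] = acc
    return y


print(upfirdn_reference([1, 1, 1], [1, 1, 1]))
print(upfirdn_reference([1], [1, 2, 3], up=3))
```

Every output sample reads the same `h`, so in a GPU kernel all threads of a block would fetch identical taps. That reuse is why I would expect staging the taps in shared memory to help.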
In any case, I like the cuSignal library and appreciate your work!
Greetings