Triton kernels for causal depthwise separable convolutions with per-token (dynamic) convolution weights. The following example shows how to add low-rank dynamic short convolutions to the queries, keys ...
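
As a point of reference for what such kernels compute, here is a minimal NumPy sketch of a causal depthwise convolution with per-token weights. It is not the Triton implementation. The function names, the `(T, D, K)` weight layout, the tap ordering (tap `k` looks `k` steps into the past), and the low-rank parameterization through hypothetical matrices `A` and `B` are all assumptions made for illustration.

```python
import numpy as np

def dynamic_causal_depthwise_conv(x, w):
    """Reference (non-fused) dynamic causal depthwise convolution.

    x: (T, D) input sequence, T tokens and D channels.
    w: (T, D, K) per-token, per-channel filter taps (assumed layout).
    Returns y: (T, D) with y[t, d] = sum_k w[t, d, k] * x[t - k, d],
    where x[t - k] is treated as zero for t - k < 0.
    """
    T, D = x.shape
    K = w.shape[-1]
    # Left-pad with K-1 zero rows so no output position sees future tokens.
    x_pad = np.concatenate([np.zeros((K - 1, D), dtype=x.dtype), x], axis=0)
    y = np.zeros((T, D), dtype=np.result_type(x, w))
    for k in range(K):
        # Rows [K-1-k, K-1-k+T) of x_pad hold x shifted k steps into the past.
        start = K - 1 - k
        y += w[:, :, k] * x_pad[start:start + T]
    return y

def low_rank_weights(x, A, B, K):
    """Hypothetical low-rank weight generator (an assumption, not the library's API).

    x: (T, D), A: (D, r), B: (r, D * K). Each token's filter is produced
    from that token's own features through a rank-r bottleneck.
    """
    T, D = x.shape
    return ((x @ A) @ B).reshape(T, D, K)
```

Because each channel is filtered independently and the weights depend on the token index, the cost is `O(T * D * K)` multiply-adds plus the low-rank projection. A fused kernel would typically avoid materializing the padded input, which this sketch does only for clarity.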