Depthwise 1D convolution needed for Mamba prefill #8571

esmalTT · 2024-05-16T19:46:58Z

We require a 1D depth-wise convolution to implement the Mamba prefill phase. There is currently no support for 1D convolution in tt-metal.

We require something that implements the same behaviour as the following PyTorch layer:

torch.nn.Conv1d(
    in_channels=5120,
    out_channels=5120,
    bias=True,
    kernel_size=4,
    groups=5120,
    padding=3,
)
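
For anyone implementing this without PyTorch on hand, the arithmetic of the layer above can be sketched in plain Python. This is only a minimal reference under stated assumptions, not a proposed tt-metal API: the function name `depthwise_conv1d` and the nested-list layout are hypothetical, and the example uses 2 channels instead of 5120 to stay readable. With `groups == in_channels == out_channels`, each channel is filtered by its own length-4 kernel, the input is zero-padded by 3 on both sides, and the output length is `L + 2 * padding - kernel_size + 1`.

```python
def depthwise_conv1d(x, weight, bias, padding):
    """Reference depthwise 1D convolution (cross-correlation, as in PyTorch).

    x:      list of C channels, each a list of L floats
    weight: list of C kernels, each a list of K floats (one kernel per channel)
    bias:   list of C floats
    Returns a list of C channels, each of length L + 2 * padding - K + 1.
    """
    out = []
    for channel, kernel, b in zip(x, weight, bias):
        k = len(kernel)
        # Zero-pad both ends, matching Conv1d's default padding_mode="zeros".
        padded = [0.0] * padding + list(channel) + [0.0] * padding
        out_len = len(padded) - k + 1
        out.append([
            b + sum(kernel[j] * padded[t + j] for j in range(k))
            for t in range(out_len)
        ])
    return out


# Tiny stand-in for the 5120-channel layer: 2 channels, kernel_size=4, padding=3.
x = [[1.0, 2.0, 3.0, 4.0, 5.0], [1.0, 0.0, 0.0, 0.0, 0.0]]
weight = [[0.0, 0.0, 0.0, 1.0], [1.0, 2.0, 3.0, 4.0]]
bias = [0.5, 0.0]
y = depthwise_conv1d(x, weight, bias, padding=3)
```

Each output channel here has 8 elements for an input of length 5. Output position `t` only reads inputs `t-3` through `t`, so keeping just the first `L` outputs would give a causal convolution; whether the prefill path truncates this way is an assumption, since the issue does not say.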

esmalTT added mamba P1_critical labels May 16, 2024

esmalTT changed the title ~~Support 1D depthwise convolution~~ Depth-wise 1D convolution needed for Mamba prefill May 16, 2024

esmalTT added LLM_feature prefill labels May 16, 2024

esmalTT changed the title ~~Depth-wise 1D convolution needed for Mamba prefill~~ Depthwise 1D convolution needed for Mamba prefill May 21, 2024

esmalTT closed this as completed May 30, 2024
