Depthwise 1D convolution needed for Mamba prefill #8571
Labels
LLM_feature
mamba
P1_critical
prefill
LLM models have prefill mode and it's optimization is usually separated from decode mode.
We require a 1D depth-wise convolution to implement the Mamba prefill phase. There is currently no support for 1D convolution in
tt-metal
.We require something that implements the same behaviour has the following PyTorch layer:
The text was updated successfully, but these errors were encountered: