Hello, thank you for your great work! The M2-BERT paper mentions that "Monarch Mixer is part of a new class of architectures called state-space models (SSMs), which include S4, Mamba, and BiGS".
Are Monarch Mixer and M2-BERT really part of the SSM family?
My understanding of M2-BERT is that it:
(1) replaces attention with bidirectional gated convolutions (plus a residual convolution), and sets the Monarch matrices to the DFT and inverse DFT matrices so the convolutions can be computed quickly via the FFT;
(2) in the dimension mixer, replaces the two dense matrices of the MLP with learned block-diagonal matrices to speed up the MLP computation.
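In case it helps clarify my reading, here is a minimal numpy sketch of how I understand (1) and (2). All function names and shapes here are my own illustration, not taken from the M2 codebase:

```python
import numpy as np

rng = np.random.default_rng(0)

# (1) Sequence mixer: a long (circular) convolution computed through the
# DFT / inverse DFT -- the special case where the Monarch matrices are
# set to FFT matrices, as I understand the paper.
def fft_conv(u, k):
    """Circular convolution of signal u with kernel k via the FFT."""
    n = u.shape[-1]
    return np.fft.ifft(np.fft.fft(u, n) * np.fft.fft(k, n)).real

# (2) Dimension mixer: the dense MLP weight is replaced by a learned
# block-diagonal matrix, so the matmul is done block by block.
def block_diag_matmul(x, blocks):
    """x: (nblocks*bsize,); blocks: (nblocks, bsize, bsize)."""
    nb, bs, _ = blocks.shape
    xb = x.reshape(nb, bs)
    # z[n*bs + c] = sum_b x[n*bs + b] * blocks[n, b, c]
    return np.einsum('nb,nbc->nc', xb, blocks).reshape(-1)

# tiny usage example
u = rng.standard_normal(16)
k = rng.standard_normal(16)
y = fft_conv(u, k)            # sequence mixing along length 16

W = rng.standard_normal((4, 8, 8))   # 4 blocks of size 8 -> a 32x32 mixer
x = rng.standard_normal(32)
z = block_diag_matmul(x, W)   # dimension mixing with O(d*bsize) cost
```

So my (possibly wrong) reading is that both pieces are structured-matrix tricks for efficiency, which is why I'm unsure where the state-space recurrence comes in.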
I wonder which part of this is related to SSMs? I would be very grateful if you could help me with the answer : )