Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Gu & Dao, 2024)
state-spaces/mamba, state-spaces/s4
Idea: to use state space representations in control theory to for sequences modeling.
Mamba
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Gu & Dao, 2024) Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
(Dao & Gu, 2024) Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
State-space duality (SSD): SSM + attentions layers (SMA, or structured masked attention)