MAMBA

Study/Paper
Mamba: Linear-Time Sequence Modeling with Selective State Spaces [arXiv]
Foundation models, now powering most of the exciting applications in deep learning, are almost universally based on the Transformer architecture and its core attention module. Many subquadratic-time architectures such as linear attention and gated convolution...
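The "linear-time" claim in the title comes from replacing attention with a state-space recurrence that is scanned once over the sequence. As a rough illustration only (not the paper's actual selective/discretized parameterization, which makes the state-space parameters input-dependent), a diagonal linear state-space scan looks like this; all names here are illustrative:

```python
import numpy as np

def ssm_scan(x, A, B, C):
    """Minimal diagonal state-space recurrence:
        h_t = A * h_{t-1} + B * x_t   (elementwise, diagonal A)
        y_t = C . h_t                 (readout)
    x: (T,) input sequence; A, B, C: (N,) per-state parameters.
    Runs in O(T * N), i.e. linear in sequence length T.
    """
    h = np.zeros_like(A, dtype=float)
    y = np.empty(x.shape[0])
    for t in range(x.shape[0]):
        h = A * h + B * x[t]  # state update
        y[t] = C @ h          # output projection
    return y

# With A = 1, B = C = 1 and one state, the scan reduces to a cumulative sum.
print(ssm_scan(np.array([1.0, 2.0, 3.0]),
               np.ones(1), np.ones(1), np.ones(1)))  # → [1. 3. 6.]
```

Mamba's contribution, per the abstract, is making such a recurrence *selective* (input-dependent parameters) while keeping the linear-time scan efficient on hardware.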
potato_pizza
List of posts tagged 'MAMBA'