Mamba-Related Papers at ECCV 2024

ECCV (European Conference on Computer Vision) is one of the top conferences in computer vision. ECCV 2024 was held in Milan, Italy, from September 29 to October 4.

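ECCV 2024 published its list of accepted papers: 2,395 papers in total, with an acceptance rate of 18%. Many researchers and teams proposed new algorithms or models in areas such as image editing and image generation, and had their work accepted by ECCV.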

Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion (Best Paper Award, ECCV 2024 Workshop on Computational Aspects of Deep Learning)

Abstract: Mamba and Vision Mamba (Vim) models have shown their potential as an alternative to methods based on the Transformer architecture. This work introduces Fast Mamba for Vision (Famba-V), a cross-layer token fusion technique to enhance the training efficiency of Vim models. The key idea of Famba-V is to identify and fuse similar tokens across different Vim layers based on a suite of cross-layer strategies instead of simply applying token fusion uniformly across all the layers that existing works propose. We evaluate the performance of Famba-V on CIFAR-100. Our results show that Famba-V is able to enhance the training efficiency of Vim models by reducing both training time and peak memory usage during training. Moreover, the proposed cross-layer strategies allow Famba-V to deliver superior accuracy-efficiency trade-offs. These results all together demonstrate Famba-V as a promising efficiency enhancement technique for Vim models.

Link: arXiv:2409.09808
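To make the idea in the abstract more concrete, here is a minimal pure-Python sketch of token fusion applied only at selected layers. It is not the authors' implementation: the pairing rule (even-index tokens matched to odd-index tokens by cosine similarity, then averaged), the function names `fuse_tokens` and `apply_cross_layer_fusion`, and the parameters `r` and `fusion_layers` are all assumptions made for illustration, and the Vim blocks themselves are left out.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def fuse_tokens(tokens, r):
    """Fuse up to r even-index tokens into their most similar odd-index token.

    Assumed pairing rule for this sketch; token order is preserved because
    a sequence model such as Mamba is order-sensitive.
    """
    src = tokens[0::2]  # candidates to be merged away
    dst = tokens[1::2]  # tokens that absorb the merged candidates
    if r <= 0 or not dst:
        return [list(t) for t in tokens]

    # For every source token, find its best-matching destination token.
    matches = []
    for i, s in enumerate(src):
        j, sim = max(((j, cosine(s, d)) for j, d in enumerate(dst)),
                     key=lambda p: p[1])
        matches.append((sim, i, j))
    matches.sort(key=lambda m: m[0], reverse=True)
    merged = {i: j for _, i, j in matches[:r]}  # source index -> destination index

    groups = {j: [dst[j]] for j in range(len(dst))}
    for i, j in merged.items():
        groups[j].append(src[i])

    out = []
    for pos, tok in enumerate(tokens):
        if pos % 2 == 0:
            if pos // 2 not in merged:  # unmerged source tokens are kept as-is
                out.append(list(tok))
        else:
            members = groups[pos // 2]  # average the destination with its sources
            out.append([sum(v) / len(members) for v in zip(*members)])
    return out


def apply_cross_layer_fusion(tokens, num_layers, fusion_layers, r):
    """Apply fusion only at the layers in fusion_layers (a hypothetical strategy)."""
    for layer in range(num_layers):
        # A real Vim block would transform the tokens here; omitted in this sketch.
        if layer in fusion_layers:
            tokens = fuse_tokens(tokens, r)
    return tokens


tokens = [[1, 0], [1, 0.1], [0, 1], [0.1, 1], [1, 1], [-1, 1]]
print(len(apply_cross_layer_fusion(tokens, num_layers=4, fusion_layers={2, 3}, r=1)))
```

Choosing `fusion_layers` is where a cross-layer strategy would come in: fusing at every layer corresponds to the uniform baseline the abstract mentions, while restricting fusion to a subset of layers trades fewer tokens (less compute and memory) against how much information is averaged away.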

VideoMamba: Spatio-Temporal Selective State Space Model

Abstract
