All notable updates to Towhee models will be documented in this file.
Added 4 SOTA models
- Vis4mer
- MCProp
- RepLKNet
- Shunted Transformer
Added 4 SOTA models
- ISC
- MetaFormer
- ConvNeXt
  - paper: A ConvNet for the 2020s
- HorNet
Add 3 SOTA models
- nnfp
- RepMLPNet
- Wave-ViT
Add 5 SOTA models
- CoCa
  - paper: CoCa
- CoFormer
  - paper: CoFormer
- TransRAC
  - paper: TransRAC
- CVNet
  - paper: CVNet
- MaxViT
  - paper: MaxViT
Add 1 vision transformer backbone, 1 text-image retrieval model, 2 video retrieval models
- MPViT
- LightningDOT
- BridgeFormer
- collaborative-experts
Add 6 video understanding/classification models
- Video Swin Transformer
- TSM
- Uniformer
- OMNIVORE
- TimeSformer
- MoViNets
Add 4 video retrieval models
- CLIP4Clip
- DRL
- Frozen in Time
- MDMMT
Add 3 text-image multimodal models
- CLIP
- BLIP
- LightningDOT
Add 6 video understanding/classification models
- I3D (from PyTorchVideo)
- C2D (from PyTorchVideo)
- Slow (from PyTorchVideo)
- SlowFast (from PyTorchVideo)
- X3D (from PyTorchVideo)
- MViT (from PyTorchVideo)