Coarse-to-Fine Grained Text-based Video-moment Retrieval pipeline utilizing T-MASS and MESM models for efficient multi-stage text-video alignment.
-
Updated
Aug 28, 2024 - Python
Coarse-to-Fine Grained Text-based Video-moment Retrieval pipeline utilizing T-MASS and MESM models for efficient multi-stage text-video alignment.
paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), Video Grounding (VG), or Temporal Sentence Grounding in Videos (TSGV)
ACM Multimedia 2023 - Temporal Sentence in Streaming Videos
Pytorch implementation of the paper 'Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Supervised Temporal Video Grounding' (AAAI2024).
Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"
source code of our MGPN in SIGIR 2022
paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), or Temporal Sentence Grounding in Videos (TSGV))
source code of our RaNet in EMNLP 2021
Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval的相关工作
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"
"Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022.
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Add a description, image, and links to the video-moment-retrieval topic page so that developers can more easily learn about it.
To associate your repository with the video-moment-retrieval topic, visit your repo's landing page and select "manage topics."