22 related articles for article (PubMed ID: 38662568)
1. Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data.
Wu Z; Weng Z; Peng W; Yang X; Li A; Davis LS; Jiang YG
IEEE Trans Pattern Anal Mach Intell; 2024 Jul; 46(7):4747-4762. PubMed ID: 38261478
[TBL] [Abstract][Full Text] [Related]
2. ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation.
Yang B; Liu F; Zou Y; Wu X; Wang Y; Clifton DA
IEEE Trans Pattern Anal Mach Intell; 2024 Aug; 46(8):5712-5724. PubMed ID: 38421845
[TBL] [Abstract][Full Text] [Related]
3. Towards Visual-Prompt Temporal Answer Grounding in Instructional Video.
Li S; Li B; Sun B; Weng Y
IEEE Trans Pattern Anal Mach Intell; 2024 Jun; PP():. PubMed ID: 38848233
[TBL] [Abstract][Full Text] [Related]
4. Dynamic Spatio-Temporal Graph Reasoning for VideoQA with Self-Supervised Event Recognition.
Nie J; Wang X; Hou R; Li G; Chen H; Zhu W
IEEE Trans Image Process; 2024 Jul; PP():. PubMed ID: 38954578
[TBL] [Abstract][Full Text] [Related]
5. DTCM: Joint Optimization of Dark Enhancement and Action Recognition in Videos.
Tu Z; Liu Y; Zhang Y; Mu Q; Yuan J
IEEE Trans Image Process; 2023; 32():3507-3520. PubMed ID: 37335800
[TBL] [Abstract][Full Text] [Related]
6. Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification.
Zhang Q; Lai J; Xie X; Jin X; Huang S
IEEE Trans Pattern Anal Mach Intell; 2024 Aug; 46(8):5791-5805. PubMed ID: 38393853
[TBL] [Abstract][Full Text] [Related]
7. Divert More Attention to Vision-Language Object Tracking.
Guo M; Zhang Z; Jing L; Ling H; Fan H
IEEE Trans Pattern Anal Mach Intell; 2024 Jun; PP():. PubMed ID: 38833398
[TBL] [Abstract][Full Text] [Related]
8. Data-driven fine-grained region discovery in the mouse brain with transformers.
Lee AJ; Yao S; Lusk N; Ng L; Kunst M; Zeng H; Tasic B; Abbasi-Asl R
bioRxiv; 2024 Jun; ():. PubMed ID: 38766132
[TBL] [Abstract][Full Text] [Related]
9. AMS-Net: Modeling Adaptive Multi-Granularity Spatio-Temporal Cues for Video Action Recognition.
Wang Q; Hu Q; Gao Z; Li P; Hu Q
IEEE Trans Neural Netw Learn Syst; 2023 Oct; PP():. PubMed ID: 37824318
[TBL] [Abstract][Full Text] [Related]
10. Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment.
Fei H; Wu S; Zhang M; Zhang M; Chua TS; Yan S
IEEE Trans Pattern Anal Mach Intell; 2024 Apr; PP():. PubMed ID: 38662568
[TBL] [Abstract][Full Text] [Related]
11. Adaptive Spatio-Temporal Graph Enhanced Vision-Language Representation for Video QA.
Jin W; Zhao Z; Cao X; Zhu J; He X; Zhuang Y
IEEE Trans Image Process; 2021; 30():5477-5489. PubMed ID: 33950840
[TBL] [Abstract][Full Text] [Related]
12. Video Captioning with Object-Aware Spatio-Temporal Correlation and Aggregation.
Zhang J; Peng Y
IEEE Trans Image Process; 2020 Apr; ():. PubMed ID: 32356746
[TBL] [Abstract][Full Text] [Related]
13. Fine-Grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection.
Long Y; Han J; Huang R; Xu H; Zhu Y; Xu C; Liang X
IEEE Trans Neural Netw Learn Syst; 2023 Jul; PP():. PubMed ID: 37506020
[TBL] [Abstract][Full Text] [Related]
14. Fine-Grained Video Captioning via Graph-based Multi-Granularity Interaction Learning.
Yan Y; Zhuang N; Ni B; Zhang J; Xu M; Zhang Q; Zhang Z; Cheng S; Tian Q; Xu Y; Yang X; Zhang W
IEEE Trans Pattern Anal Mach Intell; 2022 Feb; 44(2):666-683. PubMed ID: 31613750
[TBL] [Abstract][Full Text] [Related]
15.
; ; . PubMed ID:
[No Abstract] [Full Text] [Related]
16.
; ; . PubMed ID:
[No Abstract] [Full Text] [Related]
17.
; ; . PubMed ID:
[No Abstract] [Full Text] [Related]
18.
; ; . PubMed ID:
[No Abstract] [Full Text] [Related]
19.
; ; . PubMed ID:
[No Abstract] [Full Text] [Related]
20.
; ; . PubMed ID:
[No Abstract] [Full Text] [Related]
[Next] [New Search]