140 related articles for article (PubMed ID: 38052637)
1. A cross-modal conditional mechanism based on attention for text-video retrieval.
Du W; Jing X; Zhu Q; Wang X; Liu X
Math Biosci Eng; 2023 Nov; 20(11):20073-20092. PubMed ID: 38052637
[TBL] [Abstract][Full Text] [Related]
2. Deep Relation Embedding for Cross-Modal Retrieval.
Zhang Y; Zhou W; Wang M; Tian Q; Li H
IEEE Trans Image Process; 2021; 30():617-627. PubMed ID: 33232230
[TBL] [Abstract][Full Text] [Related]
3. Cross-Modal Attention With Semantic Consistence for Image-Text Matching.
Xu X; Wang T; Yang Y; Zuo L; Shen F; Shen HT
IEEE Trans Neural Netw Learn Syst; 2020 Dec; 31(12):5412-5425. PubMed ID: 32071004
[TBL] [Abstract][Full Text] [Related]
4. Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval.
Qi M; Qin J; Yang Y; Wang Y; Luo J
IEEE Trans Image Process; 2021; 30():2989-3004. PubMed ID: 33560984
[TBL] [Abstract][Full Text] [Related]
5. Learning Hierarchical Modular Networks for Video Captioning.
Li G; Ye H; Qi Y; Wang S; Qing L; Huang Q; Yang MH
IEEE Trans Pattern Anal Mach Intell; 2024 Feb; 46(2):1049-1064. PubMed ID: 37878438
[TBL] [Abstract][Full Text] [Related]
6. CAM-RNN: Co-Attention Model Based RNN for Video Captioning.
Zhao B; Li X; Lu X
IEEE Trans Image Process; 2019 Nov; 28(11):5552-5565. PubMed ID: 31107650
[TBL] [Abstract][Full Text] [Related]
7. Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation.
Zhao W; Wu X; Luo J
IEEE Trans Image Process; 2021; 30():1180-1192. PubMed ID: 33306468
[TBL] [Abstract][Full Text] [Related]
8. End-to-End Pre-Training With Hierarchical Matching and Momentum Contrast for Text-Video Retrieval.
Shen W; Song J; Zhu X; Li G; Shen HT
IEEE Trans Image Process; 2023; 32():5017-5030. PubMed ID: 37186535
[TBL] [Abstract][Full Text] [Related]
9. Fs-DSM: Few-Shot Diagram-Sentence Matching via Cross-Modal Attention Graph Model.
Hu X; Zhang L; Liu J; Zheng Q; Zhou J
IEEE Trans Image Process; 2021; 30():8102-8115. PubMed ID: 34554913
[TBL] [Abstract][Full Text] [Related]
10. A Short Video Classification Framework Based on Cross-Modal Fusion.
Pang N; Guo S; Yan M; Chan CA
Sensors (Basel); 2023 Oct; 23(20):. PubMed ID: 37896519
[TBL] [Abstract][Full Text] [Related]
11. Referring Segmentation in Images and Videos With Cross-Modal Self-Attention Network.
Ye L; Rochan M; Liu Z; Zhang X; Wang Y
IEEE Trans Pattern Anal Mach Intell; 2022 Jul; 44(7):3719-3732. PubMed ID: 33497325
[TBL] [Abstract][Full Text] [Related]
12. Video Captioning with Object-Aware Spatio-Temporal Correlation and Aggregation.
Zhang J; Peng Y
IEEE Trans Image Process; 2020 Apr; ():. PubMed ID: 32356746
[TBL] [Abstract][Full Text] [Related]
13. Hybrid Attention Network for Language-Based Person Search.
Li Y; Xu H; Xiao J
Sensors (Basel); 2020 Sep; 20(18):. PubMed ID: 32942720
[TBL] [Abstract][Full Text] [Related]
14. Query-Adaptive Late Fusion for Hierarchical Fine-Grained Video-Text Retrieval.
Ma W; Chen Q; Liu F; Zhou T; Cai Z
IEEE Trans Neural Netw Learn Syst; 2022 Oct; PP():. PubMed ID: 36279326
[TBL] [Abstract][Full Text] [Related]
15. Latent Space Semantic Supervision Based on Knowledge Distillation for Cross-Modal Retrieval.
Zhang L; Wu X
IEEE Trans Image Process; 2022; 31():7154-7164. PubMed ID: 36355734
[TBL] [Abstract][Full Text] [Related]
16. BCAN: Bidirectional Correct Attention Network for Cross-Modal Retrieval.
Liu Y; Liu H; Wang H; Meng F; Liu M
IEEE Trans Neural Netw Learn Syst; 2023 May; PP():. PubMed ID: 37256811
[TBL] [Abstract][Full Text] [Related]
17. Referring Segmentation via Encoder-Fused Cross-Modal Attention Network.
Feng G; Zhang L; Sun J; Hu Z; Lu H
IEEE Trans Pattern Anal Mach Intell; 2023 Jun; 45(6):7654-7667. PubMed ID: 36367919
[TBL] [Abstract][Full Text] [Related]
18. HAAN: Learning a Hierarchical Adaptive Alignment Network for Image-Text Retrieval.
Wang S; Liu Z; Pei X; Xu J
Sensors (Basel); 2023 Feb; 23(5):. PubMed ID: 36904776
[TBL] [Abstract][Full Text] [Related]
19. Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals.
Jin L; Li Z; Tang J
IEEE Trans Neural Netw Learn Syst; 2023 Apr; 34(4):1838-1851. PubMed ID: 32502968
[TBL] [Abstract][Full Text] [Related]
20. Actor and Action Modular Network for Text-Based Video Segmentation.
Yang J; Huang Y; Niu K; Huang L; Ma Z; Wang L
IEEE Trans Image Process; 2022; 31():4474-4489. PubMed ID: 35763476
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]