142 related articles for article (PubMed ID: 37878438)
1. Learning Hierarchical Modular Networks for Video Captioning.
Li G; Ye H; Qi Y; Wang S; Qing L; Huang Q; Yang MH
IEEE Trans Pattern Anal Mach Intell; 2024 Feb; 46(2):1049-1064. PubMed ID: 37878438
[TBL] [Abstract][Full Text] [Related]
2. Aligning Source Visual and Target Language Domains for Unpaired Video Captioning.
Liu F; Wu X; You C; Ge S; Zou Y; Sun X
IEEE Trans Pattern Anal Mach Intell; 2022 Dec; 44(12):9255-9268. PubMed ID: 34855588
[TBL] [Abstract][Full Text] [Related]
3. Semantic guidance network for video captioning.
Guo L; Zhao H; Chen Z; Han Z
Sci Rep; 2023 Sep; 13(1):16076. PubMed ID: 37752267
[TBL] [Abstract][Full Text] [Related]
4. A Semantics-Assisted Video Captioning Model Trained With Scheduled Sampling.
Chen H; Lin K; Maye A; Li J; Hu X
Front Robot AI; 2020; 7():475767. PubMed ID: 33501293
[TBL] [Abstract][Full Text] [Related]
5. Video Captioning Using Global-Local Representation.
Yan L; Ma S; Wang Q; Chen Y; Zhang X; Savakis A; Liu D
IEEE Trans Circuits Syst Video Technol; 2022 Oct; 32(10):6642-6656. PubMed ID: 37215187
[TBL] [Abstract][Full Text] [Related]
6. Visual Commonsense-Aware Representation Network for Video Captioning.
Zeng P; Zhang H; Gao L; Li X; Qian J; Shen HT
IEEE Trans Neural Netw Learn Syst; 2023 Dec; PP():. PubMed ID: 38127607
[TBL] [Abstract][Full Text] [Related]
7. Concept-Aware Video Captioning: Describing Videos With Effective Prior Information.
Yang B; Cao M; Zou Y
IEEE Trans Image Process; 2023; 32():5366-5378. PubMed ID: 37639408
[TBL] [Abstract][Full Text] [Related]
8. Hierarchical Representation Network With Auxiliary Tasks for Video Captioning and Video Question Answering.
Gao L; Lei Y; Zeng P; Song J; Wang M; Shen HT
IEEE Trans Image Process; 2022; 31():202-215. PubMed ID: 34710043
[TBL] [Abstract][Full Text] [Related]
9. Cross-Modal Graph With Meta Concepts for Video Captioning.
Wang H; Lin G; Hoi SCH; Miao C
IEEE Trans Image Process; 2022; 31():5150-5162. PubMed ID: 35901005
[TBL] [Abstract][Full Text] [Related]
10. Syntax Customized Video Captioning by Imitating Exemplar Sentences.
Yuan Y; Ma L; Zhu W
IEEE Trans Pattern Anal Mach Intell; 2022 Dec; 44(12):10209-10221. PubMed ID: 34847021
[TBL] [Abstract][Full Text] [Related]
11. SibNet: Sibling Convolutional Encoder for Video Captioning.
Liu S; Ren Z; Yuan J
IEEE Trans Pattern Anal Mach Intell; 2021 Sep; 43(9):3259-3272. PubMed ID: 32149622
[TBL] [Abstract][Full Text] [Related]
12. CAM-RNN: Co-Attention Model Based RNN for Video Captioning.
Zhao B; Li X; Lu X
IEEE Trans Image Process; 2019 Nov; 28(11):5552-5565. PubMed ID: 31107650
[TBL] [Abstract][Full Text] [Related]
13. Video Captioning with Object-Aware Spatio-Temporal Correlation and Aggregation.
Zhang J; Peng Y
IEEE Trans Image Process; 2020 Apr; ():. PubMed ID: 32356746
[TBL] [Abstract][Full Text] [Related]
14. Describing Video With Attention-Based Bidirectional LSTM.
Bin Y; Yang Y; Shen F; Xie N; Shen HT; Li X
IEEE Trans Cybern; 2019 Jul; 49(7):2631-2641. PubMed ID: 29993730
[TBL] [Abstract][Full Text] [Related]
15. Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation.
Zhao W; Wu X; Luo J
IEEE Trans Image Process; 2021; 30():1180-1192. PubMed ID: 33306468
[TBL] [Abstract][Full Text] [Related]
16. A cross-modal conditional mechanism based on attention for text-video retrieval.
Du W; Jing X; Zhu Q; Wang X; Liu X
Math Biosci Eng; 2023 Nov; 20(11):20073-20092. PubMed ID: 38052637
[TBL] [Abstract][Full Text] [Related]
17. Dense Relational Image Captioning via Multi-Task Triple-Stream Networks.
Kim DJ; Oh TH; Choi J; Kweon IS
IEEE Trans Pattern Anal Mach Intell; 2022 Nov; 44(11):7348-7362. PubMed ID: 34648432
[TBL] [Abstract][Full Text] [Related]
18. Video captioning based on vision transformer and reinforcement learning.
Zhao H; Chen Z; Guo L; Han Z
PeerJ Comput Sci; 2022; 8():e916. PubMed ID: 35494808
[TBL] [Abstract][Full Text] [Related]
19. Evaluation of automatic video captioning using direct assessment.
Graham Y; Awad G; Smeaton A
PLoS One; 2018; 13(9):e0202789. PubMed ID: 30180174
[TBL] [Abstract][Full Text] [Related]
20. AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description.
Prudviraj J; Reddy MI; Vishnu C; Mohan CK
IEEE Trans Image Process; 2022; 31():5559-5569. PubMed ID: 35994530
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]