217 related articles for article (PubMed ID: 35044914)
1. Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering.
Liu Y; Zhang X; Huang F; Zhang B; Li Z
IEEE Trans Image Process; 2022; 31:1684-1696. PubMed ID: 35044914
2. Dynamic Spatio-Temporal Graph Reasoning for VideoQA with Self-Supervised Event Recognition.
Nie J; Wang X; Hou R; Li G; Chen H; Zhu W
IEEE Trans Image Process; 2024 Jul; PP():. PubMed ID: 38954578
3. Compositional Attention Networks with Two-Stream Fusion for Video Question Answering.
Yu T; Yu J; Yu Z; Tao D
IEEE Trans Image Process; 2019 Sep; ():. PubMed ID: 31535995
4. A multi-scale self-supervised hypergraph contrastive learning framework for video question answering.
Wang Z; Wu B; Ota K; Dong M; Li H
Neural Netw; 2023 Nov; 168:272-286. PubMed ID: 37774513
5. Contrastive Video Question Answering via Video Graph Transformer.
Xiao J; Zhou P; Yao A; Li Y; Hong R; Yan S; Chua TS
IEEE Trans Pattern Anal Mach Intell; 2023 Nov; 45(11):13265-13280. PubMed ID: 37402185
6. Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks.
Zhao Z; Zhang Z; Jiang X; Cai D
IEEE Trans Image Process; 2019 Aug; 28(8):3860-3872. PubMed ID: 30835223
7. Graph-Based Multi-Interaction Network for Video Question Answering.
Gu M; Zhao Z; Jin W; Hong R; Wu F
IEEE Trans Image Process; 2021; 30:2758-2770. PubMed ID: 33476268
8. An effective spatial relational reasoning networks for visual question answering.
Shen X; Han D; Chen C; Luo G; Wu Z
PLoS One; 2022; 17(11):e0277693. PubMed ID: 36441742
9. Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering.
Bai Z; Wang R; Gao D; Chen X
IEEE Trans Image Process; 2024; 33:1109-1121. PubMed ID: 38294915
10. Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering.
Liu Y; Li G; Lin L
IEEE Trans Pattern Anal Mach Intell; 2023 Oct; 45(10):11624-11641. PubMed ID: 37289602
11. Video Captioning with Object-Aware Spatio-Temporal Correlation and Aggregation.
Zhang J; Peng Y
IEEE Trans Image Process; 2020 Apr; ():. PubMed ID: 32356746
12. Adaptive Spatio-Temporal Graph Enhanced Vision-Language Representation for Video QA.
Jin W; Zhao Z; Cao X; Zhu J; He X; Zhuang Y
IEEE Trans Image Process; 2021; 30:5477-5489. PubMed ID: 33950840
13. Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering.
Yu T; Fu K; Zhang J; Huang Q; Yu J
IEEE Trans Image Process; 2024; 33:3115-3129. PubMed ID: 38656836
14. Bridging the Cross-Modality Semantic Gap in Visual Question Answering.
Wang B; Ma Y; Li X; Gao J; Hu Y; Yin B
IEEE Trans Neural Netw Learn Syst; 2024 Mar; PP():. PubMed ID: 38446647
15. Transformer-Empowered Invariant Grounding for Video Question Answering.
Li Y; Wang X; Xiao J; Ji W; Chua TS
IEEE Trans Pattern Anal Mach Intell; 2023 Aug; PP():. PubMed ID: 37556333
16. HiAM: A Hierarchical Attention based Model for knowledge graph multi-hop reasoning.
Ma T; Lv S; Huang L; Hu S
Neural Netw; 2021 Nov; 143:261-270. PubMed ID: 34157650
17. Fine-Grained Video Captioning via Graph-based Multi-Granularity Interaction Learning.
Yan Y; Zhuang N; Ni B; Zhang J; Xu M; Zhang Q; Zhang Z; Cheng S; Tian Q; Xu Y; Yang X; Zhang W
IEEE Trans Pattern Anal Mach Intell; 2022 Feb; 44(2):666-683. PubMed ID: 31613750
18. MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network.
Peng L; Yang Y; Wang Z; Huang Z; Shen HT
IEEE Trans Pattern Anal Mach Intell; 2022 Jan; 44(1):318-329. PubMed ID: 32750794
19. Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.
Li J; Tang S; Zhu L; Zhang W; Yang Y; Chua TS; Wu F; Zhuang Y
IEEE Trans Pattern Anal Mach Intell; 2023 Oct; 45(10):12601-12617. PubMed ID: 37155378
20. Adversarial Learning With Multi-Modal Attention for Visual Question Answering.
Liu Y; Zhang X; Huang F; Cheng L; Li Z
IEEE Trans Neural Netw Learn Syst; 2021 Sep; 32(9):3894-3908. PubMed ID: 32833656