These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
205 related articles for article (PubMed ID: 36031991)
21. Dual modality prompt learning for visual question-grounded answering in robotic surgery. Zhang Y; Fan W; Peng P; Yang X; Zhou D; Wei X Vis Comput Ind Biomed Art; 2024 Apr; 7(1):9. PubMed ID: 38647624 [TBL] [Abstract][Full Text] [Related]
22. Learning to Reason on Tree Structures for Knowledge-Based Visual Question Answering. Li Q; Tang X; Jian Y Sensors (Basel); 2022 Feb; 22(4):. PubMed ID: 35214484 [TBL] [Abstract][Full Text] [Related]
23. Adversarial Learning with Bidirectional Attention for Visual Question Answering. Li Q; Tang X; Jian Y Sensors (Basel); 2021 Oct; 21(21):. PubMed ID: 34770471 [TBL] [Abstract][Full Text] [Related]
24. Depth and Video Segmentation Based Visual Attention for Embodied Question Answering. Luo H; Lin G; Yao Y; Liu F; Liu Z; Tang Z IEEE Trans Pattern Anal Mach Intell; 2023 Jun; 45(6):6807-6819. PubMed ID: 34982673 [TBL] [Abstract][Full Text] [Related]
25. Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering. Cao J; Qin X; Zhao S; Shen J IEEE Trans Neural Netw Learn Syst; 2022 Feb; PP():. PubMed ID: 35130171 [TBL] [Abstract][Full Text] [Related]
26. Collaborative Modality Fusion for Mitigating Language Bias in Visual Question Answering. Lu Q; Chen S; Zhu X J Imaging; 2024 Feb; 10(3):. PubMed ID: 38535137 [TBL] [Abstract][Full Text] [Related]
27. Structured Multimodal Attentions for TextVQA. Gao C; Zhu Q; Wang P; Li H; Liu Y; Hengel AVD; Wu Q IEEE Trans Pattern Anal Mach Intell; 2022 Dec; 44(12):9603-9614. PubMed ID: 34855584 [TBL] [Abstract][Full Text] [Related]
28. On solving textual ambiguities and semantic vagueness in MRC based question answering using generative pre-trained transformers. Ahmed M; Khan H; Iqbal T; Khaled Alarfaj F; Alomair A; Almusallam N PeerJ Comput Sci; 2023; 9():e1422. PubMed ID: 37547420 [TBL] [Abstract][Full Text] [Related]
29. MedFuseNet: An attention-based multimodal deep learning model for visual question answering in the medical domain. Sharma D; Purushotham S; Reddy CK Sci Rep; 2021 Oct; 11(1):19826. PubMed ID: 34615894 [TBL] [Abstract][Full Text] [Related]
30. Re-Attention for Visual Question Answering. Guo W; Zhang Y; Yang J; Yuan X IEEE Trans Image Process; 2021; 30():6730-6743. PubMed ID: 34283714 [TBL] [Abstract][Full Text] [Related]
31. Medical visual question answering: A survey. Lin Z; Zhang D; Tao Q; Shi D; Haffari G; Wu Q; He M; Ge Z Artif Intell Med; 2023 Sep; 143():102611. PubMed ID: 37673579 [TBL] [Abstract][Full Text] [Related]
32. Factorization machines and deep views-based co-training for improving answer quality prediction in online health expert question-answering services. Zhang Z; Hu Z; Yang H; Zhu R; Zuo D J Biomed Inform; 2018 Nov; 87():21-36. PubMed ID: 30240803 [TBL] [Abstract][Full Text] [Related]
33. Center-enhanced video captioning model with multimodal semantic alignment. Zhang B; Gao J; Yuan Y Neural Netw; 2024 Sep; 180():106744. PubMed ID: 39326191 [TBL] [Abstract][Full Text] [Related]
34. Vision-Language Model for Visual Question Answering in Medical Imagery. Bazi Y; Rahhal MMA; Bashmal L; Zuair M Bioengineering (Basel); 2023 Mar; 10(3):. PubMed ID: 36978771 [TBL] [Abstract][Full Text] [Related]
35. Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool. Liu F; Xiang T; Hospedales TM; Yang W; Sun C IEEE Trans Pattern Anal Mach Intell; 2020 Feb; 42(2):460-474. PubMed ID: 30418897 [TBL] [Abstract][Full Text] [Related]
36. Rich Visual Knowledge-Based Augmentation Network for Visual Question Answering. Zhang L; Liu S; Liu D; Zeng P; Li X; Song J; Gao L IEEE Trans Neural Netw Learn Syst; 2021 Oct; 32(10):4362-4373. PubMed ID: 32941156 [TBL] [Abstract][Full Text] [Related]
37. Multitask Learning for Visual Question Answering. Ma J; Liu J; Lin Q; Wu B; Wang Y; You Y IEEE Trans Neural Netw Learn Syst; 2023 Mar; 34(3):1380-1394. PubMed ID: 34460390 [TBL] [Abstract][Full Text] [Related]
38. Latent Attention Network With Position Perception for Visual Question Answering. Zhang J; Liu X; Wang Z IEEE Trans Neural Netw Learn Syst; 2024 Mar; PP():. PubMed ID: 38530725 [TBL] [Abstract][Full Text] [Related]
39. Memory Guided Transformer With Spatio-Semantic Visual Extractor for Medical Report Generation. Divya P; Sravani Y; Vishnu C; Mohan CK; Chen YW IEEE J Biomed Health Inform; 2024 May; 28(5):3079-3089. PubMed ID: 38421843 [TBL] [Abstract][Full Text] [Related]
40. Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering. Liu Y; Zhang X; Huang F; Zhang B; Li Z IEEE Trans Image Process; 2022; 31():1684-1696. PubMed ID: 35044914 [TBL] [Abstract][Full Text] [Related] [Previous] [Next] [New Search]