Multimodal feature fusion by relational reasoning and attention for visual question answering

Publication
Information Fusion, 2020 (Impact factor:12.975)