Cross-Modal Reasoning, Visual Question Answering, Visual Dialogue, Image/Video Captioning.
Cross Modal Retrieval, Video-text Grounding
Vision and Language Alignment
Scene Recognition
Visual Reasoning
Relation Extraction, Visual Question Answering, Visual Dialogue
Encrypted Traffic Analysis, Machine Learning
computer vision
Encrypted Traffic Analysis
Cross-modal Reasoning
Knowledge-based Vision Question Answering
Knowledge-based Vision Question Answering The Chinese University of Hong Kong
Visual Question Answering
Visual Dialog, Image Captioning Kuai Star, Kwai, Beijing
Cross-Modal Information Retrieval Baidu, Beijing
Cross-modal Information Retrieval, Multi-style Image Caption
Query Expansion, Question Answering Xiaomi
Cross-modal Information Retrieval Alibaba Group, Beijing, China
Large Graph Matching University of International Relations
encrypted traffic analysis, adversarial machine learning
Graph Visualization WeChat Group, Tencent
Large Graph Matching University of Minnesota, USA
Large Graph Management University of Southampton, UK
Graph Visualization HUAWEI, China
Graph Visualization Tobacco Company
Cross-modal Information Retrieval Columbia University, USA
Visual Question Answering Carnegie Mellon University, USA
Scene Graph Generation, Visual Relation Detection
Video Captioning, Cross-media Retrieval
Video Captioning, Video-text Grounding
Visual Dialog, Visual Reasoning