GCPR-2021 AudioCLIP: Extending CLIP to Image, Text and Audio
arXiv-2021 How Much Can CLIP Benefit Vision-and-Language Tasks?
arXiv-2021 ActionCLIP: A New Paradigm for Video Action Recognition
arXiv-2021 CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
SIGGRAPH-2022 CLIPasso: Semantically-Aware Object Sketching
ACL-2015 Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
arXiv-2022 A Survey of Deep Learning Models for Structural Code Understanding
arXiv-2022 GLIPv2: Unifying Localization and Vision-Language Understanding
ICLR-2022 Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
CVPR-2022 Grounded Language-Image Pre-training