arXiv-2022 GLIPv2: Unifying Localization and Vision-Language Understanding

2022-09-05 PaperNote CL, CV

Paper: GLIPv2: Unifying Localization and Vision-Language Understanding

Code: https://github.com/microsoft/GLIP

GLIPv2: GLIP extended with more tasks and datasets

Abstract

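The basic architecture is still GLIP. The difference is that more tasks and datasets are folded into it, such as segmentation, detection, VQA, and image captioning.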
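To make "still GLIP" concrete: as I understand GLIP's formulation, detection is treated as grounding, so category names are joined into a text prompt and each region is scored against the prompt's tokens instead of against a fixed classifier head. The sketch below is only an illustration under strong assumptions: the feature vectors are toy values, each category is assumed to map to exactly one token, and `build_prompt`, `alignment_scores`, and `classify_regions` are hypothetical names, not functions from the GLIP repository.

```python
def build_prompt(categories):
    """Join category names into one text prompt, e.g. 'person. dog'."""
    return ". ".join(categories)


def alignment_scores(region_feats, token_feats):
    """Dot-product score between every region feature and every token feature."""
    return [
        [sum(r * t for r, t in zip(region, token)) for token in token_feats]
        for region in region_feats
    ]


def classify_regions(region_feats, token_feats, categories):
    """Label each region with the category whose token scores highest.

    Assumption: token_feats[i] is the single token feature for categories[i].
    """
    scores = alignment_scores(region_feats, token_feats)
    labels = []
    for row in scores:
        best = max(range(len(row)), key=row.__getitem__)
        labels.append(categories[best])
    return labels


if __name__ == "__main__":
    categories = ["person", "dog"]
    token_feats = [[1.0, 0.0], [0.0, 1.0]]    # toy text features
    region_feats = [[0.9, 0.1], [0.2, 0.8]]   # toy region features
    print(build_prompt(categories))
    print(classify_regions(region_feats, token_feats, categories))
```

Because the "classifier" is just region-to-text similarity, adding a task or dataset mostly means changing the text side, which is presumably why more tasks can be folded into one model.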

Introduction

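The image side still uses a single encoder, while the text side takes on many more understanding tasks. The two are then combined through deep fusion.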
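A rough way to picture deep fusion is bidirectional cross-attention: image features attend to text features, text features attend to image features, and each result is added back as a residual. The code below is a minimal sketch under stated assumptions, not the authors' implementation: it uses a single head, no learned projections, and plain Python lists, and the names `cross_attend` and `fusion_layer` are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross_attend(queries, keys_values):
    """Each query becomes an attention-weighted average of keys_values."""
    dim = len(queries[0])
    out = []
    for q in queries:
        weights = softmax([dot(q, k) / math.sqrt(dim) for k in keys_values])
        out.append([
            sum(w * kv[d] for w, kv in zip(weights, keys_values))
            for d in range(dim)
        ])
    return out


def fusion_layer(image_feats, text_feats):
    """One fusion step: both modalities attend to each other, plus residuals."""
    text_to_image = cross_attend(image_feats, text_feats)   # text info for each region
    image_to_text = cross_attend(text_feats, image_feats)   # image info for each token
    fused_image = [[a + b for a, b in zip(o, u)]
                   for o, u in zip(image_feats, text_to_image)]
    fused_text = [[a + b for a, b in zip(p, u)]
                  for p, u in zip(text_feats, image_to_text)]
    return fused_image, fused_text


if __name__ == "__main__":
    image_feats = [[0.0, 2.0], [1.0, 1.0]]   # two toy region features
    text_feats = [[1.0, 0.0]]                # one toy token feature
    fused_image, fused_text = fusion_layer(image_feats, text_feats)
    print(fused_image)
    print(fused_text)
```

In a real model the fused features would pass through further encoder layers and the step would repeat several times, which is what makes the fusion "deep" rather than a single late merge.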

GLIPv2: Unifying Localization and VL Understanding

Experiment

Permalink: https://tyang816.github.io/2022/09/05/GLIPv2：Unifying Localization and Vision-Language Understanding/

Copyright notice: Unless otherwise stated, all posts on this blog are licensed under CC BY 4.0 CN. Please credit the source when reposting!

Yang Tan

Master Student @ECUST
