匿名用户2024年07月31日
15阅读
开发技术pvt_v2、pytorch
所属分类ai
开源地址https://modelscope.cn/models/OpenGVLab/pvt_v2_b4
授权协议apache-2.0

作品详情

PVTv2

This is the Hugging Face PyTorch implementation of the PVTv2 model.

Model Description

The Pyramid Vision Transformer v2 (PVTv2) is a powerful, lightweight hierarchical transformer backbone for vision tasks. PVTv2 infuses convolution operations into its transformer layers to infuse properties of CNNs that enable them to learn image data efficiently. This mix transformer architecture requires no added positional embeddings, and produces multi-scale feature maps which are known to be beneficial for dense and fine-grained prediction tasks.

Vision models using PVTv2 for a backbone:

  1. Segformer for Semantic Segmentation.
  2. GLPN for Monocular Depth.
  3. Deformable DETR for 2D Object Detection.
  4. Panoptic Segformer for Panoptic Segmentation.
声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论