Vision_Transformer

论文：AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE 方法首先结构上，我们采取的是原始Transformer模型，方便开箱即用。整体结构如图所示图像处理为了处理2D图像，我们将图像\(\mathbf{x} \in \mathbb{R}^{H \times W \t

2022-11-05

Deep Learning cv

yolov5源码详解

yolo.yaml 以yolov5s.yaml为例 12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849# YOLOv5 🚀 by Ultralytics, GPL-3.0 license# Parametersnc: 80 # number of clas

2022-10-21

Deep Learning cv

yolov1

2022-09-30

Deep Learning cv

SSD

2022-09-30

Deep Learning cv

FPN

2022-09-30

Deep Learning cv

Faster_R-CNN

2022-09-30

Deep Learning cv

Fast_R-CNN

2022-09-30

Deep Learning cv

RCNN

2022-09-29

Deep Learning cv

GoogleNet

模型介绍模型代码 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919

2022-09-29

Deep Learning cv

AlexNet

网络介绍使用了上下两层的结构，上下两层结构一样模型代码 123456789101112131415161718192021222324252627282930313233343536373839404142434445class AlexNet(nn.Module): def __init__(self, num_classes=1000, init

2022-09-29

Deep Learning cv