Vision_Transformer 论文:AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE 方法 首先结构上,我们采取的是原始Transformer模型,方便开箱即用。 整体结构如图所示 图像处理 为了处理2D图像,我们将图像\(\mathbf{x} \in \mathbb{R}^{H \times W \t 2022-11-05 Deep Learning cv
yolov5源码详解 yolo.yaml 以yolov5s.yaml为例 12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849# YOLOv5 🚀 by Ultralytics, GPL-3.0 license# Parametersnc: 80 # number of clas 2022-10-21 Deep Learning cv
GoogleNet 模型介绍 模型代码 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919 2022-09-29 Deep Learning cv
AlexNet 网络介绍 使用了上下两层的结构,上下两层结构一样 模型代码 123456789101112131415161718192021222324252627282930313233343536373839404142434445class AlexNet(nn.Module): def __init__(self, num_classes=1000, init 2022-09-29 Deep Learning cv