# vit-large-patch32-384_a14058242923032576570175
The Vision Transformer (ViT) model is pre-trained on ImageNet-21k (14 million images, 21,843 classes) at a resolution of 224x224, and fine-tuned on ImageNet 2012 (1 million images, 1,000 classes) at a resolution of 384x384.
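
The change in resolution between pre-training and fine-tuning changes how many patches the model sees per image. The following is a minimal sketch of that arithmetic, not code from this repository. It assumes a patch size of 32 pixels (inferred from `patch32` in the model name) and one extra class token per sequence, as in the standard ViT design. The helper `num_patches` is a hypothetical name introduced only for illustration.

```python
def num_patches(image_size: int, patch_size: int) -> int:
    """Count the non-overlapping square patches that tile a square image."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side * per_side


# Assumption: patch size taken from "patch32" in the model name.
PATCH_SIZE = 32

pretrain_patches = num_patches(224, PATCH_SIZE)  # 7 patches per side
finetune_patches = num_patches(384, PATCH_SIZE)  # 12 patches per side

# Assumption: one class token is prepended, as in the standard ViT design.
pretrain_seq_len = pretrain_patches + 1
finetune_seq_len = finetune_patches + 1

print(f"224x224: {pretrain_patches} patches, sequence length {pretrain_seq_len}")
print(f"384x384: {finetune_patches} patches, sequence length {finetune_seq_len}")
```

Under these assumptions the fine-tuning resolution yields 144 patches per image instead of 49, so inputs should be resized to 384x384 before inference.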