51 Star 364 Fork 116

PaddlePaddle / PaddleClas

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
data_en.md 1.54 KB
一键复制 编辑 原始数据 按行查看 历史
littletomatodonkey 提交于 2020-06-18 11:45 . fix title

Data


Introducation

This document introduces the preparation of ImageNet1k and flowers102

Dataset

Dataset train dataset size valid dataset size category
flowers102 1k 6k 102
ImageNet1k 1.2M 50k 1000
  • Data format

Please follow the steps mentioned below to organize data, include train_list.txt and val_list.txt

# delimiter: "space"

ILSVRC2012_val_00000001.JPEG 65
...

ImageNet1k

After downloading data, please organize the data dir as below

PaddleClas/dataset/imagenet/
|_ train/
|  |_ n01440764
|  |  |_ n01440764_10026.JPEG
|  |  |_ ...
|  |_ ...
|  |
|  |_ n15075141
|     |_ ...
|     |_ n15075141_9993.JPEG
|_ val/
|  |_ ILSVRC2012_val_00000001.JPEG
|  |_ ...
|  |_ ILSVRC2012_val_00050000.JPEG
|_ train_list.txt
|_ val_list.txt

Flowers102 Dataset

Download Data then decompress:

jpg/
setid.mat
imagelabels.mat

Please put all the files under PaddleClas/dataset/flowers102

generate generate_flowers102_list.py and train_list.txt和val_list.txt

python generate_flowers102_list.py jpg train > train_list.txt
python generate_flowers102_list.py jpg valid > val_list.txt

Please organize data dir as below

PaddleClas/dataset/flowers102/
|_ jpg/
|  |_ image_03601.jpg
|  |_ ...
|  |_ image_02355.jpg
|_ train_list.txt
|_ val_list.txt
Python
1
https://gitee.com/paddlepaddle/PaddleClas.git
git@gitee.com:paddlepaddle/PaddleClas.git
paddlepaddle
PaddleClas
PaddleClas
master

搜索帮助