Hi everyone,
I don’t understand how to download only the requested split. Here is what happens with this [EDIT: current version uses .zip] dataset:
>>> ds = load_dataset("ego-thales/cifar10", name="no_airplane", split="left_out_calibration")
train/automobile/25706.png: 100%|██████████████████████████████████████████████████████| 2.27k/2.27k [00:01<00:00, 2.07kB/s]
train/automobile/25700.png: 100%|██████████████████████████████████████████████████████| 2.39k/2.39k [00:00<00:00, 3.64kB/s]
train/automobile/25701.png: 100%|██████████████████████████████████████████████████████| 2.28k/2.28k [00:00<00:00, 3.19kB/s]
...
The download starts with train/automobile, presumably because of the train split (which I did not request!). Here is the relevant part of the dataset config:
- config_name: no_airplane
  data_files:
  - split: train
    path:
    - train/automobile/*.png
    - train/bird/*.png
    - train/cat/*.png
    - train/deer/*.png
    - train/dog/*.png
    - train/frog/*.png
    - train/horse/*.png
    - train/ship/*.png
    - train/truck/*.png
  - split: calibration
    path:
    - calibration/automobile/*.png
    - calibration/bird/*.png
    - calibration/cat/*.png
    - calibration/deer/*.png
    - calibration/dog/*.png
    - calibration/frog/*.png
    - calibration/horse/*.png
    - calibration/ship/*.png
    - calibration/truck/*.png
  - split: test
    path:
    - test/automobile/*.png
    - test/bird/*.png
    - test/cat/*.png
    - test/deer/*.png
    - test/dog/*.png
    - test/frog/*.png
    - test/horse/*.png
    - test/ship/*.png
    - test/truck/*.png
  - split: left_out_train
    path: train/airplane/*.png
  - split: left_out_calibration
    path: calibration/airplane/*.png
  - split: left_out_test
    path: test/airplane/*.png
  - split: left_out
    path:
    - train/airplane/*.png
    - calibration/airplane/*.png
    - test/airplane/*.png
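The only workaround I can think of is to bypass the config and pass the glob patterns myself through the data_files argument of load_dataset. Below is a minimal sketch of that idea. The names LEFT_OUT_PATTERNS, data_files_for and load_single_split are hypothetical helpers of mine, and I am assuming (not verified) that with explicit data_files only the matching files get resolved and downloaded. It also assumes the PNG layout shown above, not the later .zip version.

```python
# Split -> glob patterns, copied from the "no_airplane" config above
# (only the small "left_out_*" splits are listed here).
LEFT_OUT_PATTERNS = {
    "left_out_train": ["train/airplane/*.png"],
    "left_out_calibration": ["calibration/airplane/*.png"],
    "left_out_test": ["test/airplane/*.png"],
}


def data_files_for(split):
    """Return a data_files mapping restricted to a single split (hypothetical helper)."""
    return {split: LEFT_OUT_PATTERNS[split]}


def load_single_split(split, repo_id="ego-thales/cifar10"):
    """Load one split by passing its patterns explicitly instead of a config name."""
    # Imported lazily so data_files_for() works without `datasets` installed.
    from datasets import load_dataset

    # Assumption: with explicit data_files, only files matching these
    # patterns are resolved, so the other splits are not downloaded.
    return load_dataset(repo_id, data_files=data_files_for(split), split=split)
```

With this, load_single_split("left_out_calibration") would replace the call above, but it duplicates the patterns from the config, which is exactly what I would like to avoid.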
Is there an elegant solution? It’s for a tutorial, and users may only want to download very small splits…
Thanks in advance!
Élie
