class BDD100KDataset(FiftyOneDataset): (source)
Constructor: BDD100KDataset(source_dir, copy_files)
The Berkeley Deep Drive (BDD) dataset is one of the largest and most diverse video datasets for autonomous vehicles.
The BDD100K dataset contains 100,000 video clips collected from more than 50,000 rides covering New York, San Francisco Bay Area, and other regions. The dataset contains diverse scene types such as city streets, residential areas, and highways. Furthermore, the videos were recorded in diverse weather conditions at different times of the day.
The videos are split into training (70K), validation (10K) and testing (20K) sets. Each video is 40 seconds long with 720p resolution and a frame rate of 30fps. The frame at the 10th second of each video is annotated for image classification, detection, and segmentation tasks.
This version of the dataset contains only the 100K images extracted from the videos as described above, together with the image classification, detection, and segmentation labels.
In order to load the BDD100K dataset, you must download the source data manually. The directory should be organized in the following format:
source_dir/ labels/ bdd100k_labels_images_train.json bdd100k_labels_images_val.json images/ 100k/ train/ test/ val/
You can register at https://bdd-data.berkeley.edu in order to get links to download the data.
Example usage:
import fiftyone as fo import fiftyone.zoo as foz # The path to the source files that you manually downloaded source_dir = "/path/to/dir-with-bdd100k-files" dataset = foz.load_zoo_dataset( "bdd100k", split="validation", source_dir=source_dir, ) session = fo.launch_app(dataset)
- Dataset size
- 7.10 GB
- Source
- https://bdd-data.berkeley.edu
Parameters | |
source | the directory containing the manually downloaded BDD100K files |
copy | whether to move (False) or create copies (True) of the source files when populating the dataset directory |
Method | __init__ |
Undocumented |
Instance Variable | copy |
Undocumented |
Instance Variable | source |
Undocumented |
Property | name |
The name of the dataset. |
Property | requires |
Whether this dataset requires some files to be manually downloaded by the user before the dataset can be loaded. |
Property | supported |
A tuple of supported splits for the dataset, or None if the dataset does not have splits. |
Property | tags |
A tuple of tags for the dataset. |
Method | _download |
Internal implementation of downloading the dataset and preparing it for use in the given directory. |
Inherited from ZooDataset
(via FiftyOneDataset
):
Static Method | get |
Returns the path to the ZooDatasetInfo for the dataset. |
Static Method | has |
Determines whether the directory contains ZooDatasetInfo . |
Static Method | load |
Loads the ZooDatasetInfo from the given dataset directory. |
Method | download |
Downloads the dataset and prepares it for use. |
Method | get |
Returns the directory for the given split of the dataset. |
Method | has |
Whether the dataset has the given split. |
Method | has |
Whether the dataset has the given tag. |
Property | has |
Whether the dataset has patches that may need to be applied to already downloaded files. |
Property | has |
Whether the dataset has splits. |
Property | has |
Whether the dataset has tags. |
Property | importer |
A dict of default kwargs to pass to this dataset's fiftyone.utils.data.importers.DatasetImporter . |
Property | is |
Whether the dataset is remotely-sourced. |
Property | parameters |
An optional dict of parameters describing the configuration of the zoo dataset when it was downloaded. |
Property | supports |
Whether the dataset supports downloading partial subsets of its splits. |
Method | _get |
Undocumented |
Method | _is |
Undocumented |
Method | _is |
Undocumented |
Method | _patch |
Internal method called when an already downloaded dataset may need to be patched. |
Whether this dataset requires some files to be manually downloaded by the user before the dataset can be loaded.
Internal implementation of downloading the dataset and preparing it for use in the given directory.
Parameters | |
dataset | the directory in which to construct the dataset. If a split is provided, this is the directory for the split |
scratch | a scratch directory to use to download and prepare any required intermediate files |
split | the split to download, or None if the dataset does not have splits |
Returns | |
tuple of |
|