The platelet-em dataset contains two 3D scanning electron microscope (EM) images of human platelets, as well as instance and semantic segmentations of those two image volumes.

This data has been reviewed by NIBIB, contains no PII or PHI, and is cleared for public release. All files use a multipage uint16 TIF format. A 3D image with size [Z, X, Y] is saved as Z pages of size [X, Y]. Image voxels are approximately 40x10x10 nm3.

Download

Label files can be downloaded as RGBA TIF image volumes or as string-based JSON dictionaries.

TIF

Download images and TIF labels (95.3 MB)

JSON

Download images and JSON labels (98.2 MB)


Image files

50-images.tif, Z=0

The platelet image volumes were acquired by a Gatan 3View serial block-face scanning electron microscope (SBF-SEM).

Image files are:

  • 50-images.tif, a 50x800x800 SBF-SEM image saved as a grayscale TIF.

  • 24-images.tif, a 24x800x800 SBF-SEM image saved as a grayscale TIF.


Label files

Label data can be stored in image-based or text-based formats. Image files are better for visualization and image manipulation with GUIs, while text files are better for use with scripting. For its labels, platelet-em dataset uses TIF-formatted image files and JSON-formatted text files.


TIF format

Label TIF files assign a color to each voxel in a corresponding image file. The colors correspond to labels, either object classes for semantic labels or unique object ids for instance labels.


Semantic labels

50-semantic.tif, Z=0

Semantic label files classify each image voxel into one of seven classes, indexed from 0-6:

Index Color Class name
0 None Background
1 Dark Blue Cell
2 Cyan Mitochondria
3 Green Alpha granule
4 Yellow Canalicular vessel
5 Red Dense granule body
6 Purple Dense granule core

Semantic label files are:

  • 50-semantic.tif: A 50x800x800 semantic segmentation of 50-images.tif, saved as an RGB TIF.

  • 24-semantic.tif: A 24x800x800 semantic segmentation of 24-images.tif, saved as an RGB TIF.


Instance labels

50-instance-cell-2d.tif, Z=0

50-instance-organelle-2d.tif, Z=0

Instance labels are separated into two label files per image: cell and organelle. Cell instance labels assign a unique id to each cell in an image volume. Organelle instance labels assign a unique id to each organelle in an image volume.

IDs are assigned to distinct objects within the entire 3D image volume. Tags for cross-sections of the same 3D object are the same between different 2D image windows.

Instance label files are:

  • 50-instance-cell.tif: A 50x800x800 instance segmentation of cells in 50-images.tif, saved as an RGB TIF.

  • 50-instance-organelle.tif: A 50x800x800 instance segmentation of organelles in 50-images.tif, saved as an RGB TIF.

  • 24-instance-cell.tif: A 24x800x800 instance segmentation of cells in 24-images.tif, saved as an RGB TIF.

  • 24-instance-organelle.tif: A 24x800x800 instance segmentation of organelles in 24-images.tif, saved as an RGB TIF.


JSON format

For both semantic and instance labels, label data can be represented with a dictionary structure. Dictionary keys are the unique non-zero integer labels in the label data. For each key, the corresponding value is the binary mask of the region associated to the key. This mask is stored as a run-length encoded string. An additional '.info' key stores image shape and datatype information.

File names are the same as for the TIF format, with the .json file format instead of .tif. Functions for converting between TIF and JSON formats, as well as between python strings and numpy ndarrays, can be found in the bio3d_vision package.


Example

In python - a synthetic blob image with two nonzero values:

blob1 = skimage.data.binary_blobs(length=10, seed=1, volume_fraction=0.2)
blob2 = skimage.data.binary_blobs(length=10, seed=2, volume_fraction=0.1)
image = np.zeros((10, 10), dtype=int)
image[blob1] = 1
image[blob2] = 2

has the following JSON-compatible dictionary representation:

image_as_dict = {
    '.info': {'dtype': 'np.int', 'shape': (10, 10)},
    '1': '7 2 12 1 21 1 40 1 50 4 60 3 68 1 71 1 81 1',
    '2': '1 3 11 1 33 2 42 3 87 2'}