Cell Segmentation

Training

The data structure used to train cell segmentation networks is different than to train classification networks on WSI/Patient level. Cureently, due to the massive amount of cells inside a WSI, all famous cell segmentation datasets (such like PanNuke, https://doi.org/10.48550/arXiv.2003.10778) provide just patches with cell annotations. Therefore, we use the following dataset structure (with k folds):

dataset  
├── dataset_config.yaml  
├── fold0  
│   ├── images  
|   |   ├── 0_imgname0.png
|   |   ├── 0_imgname1.png
|   |   ├── 0_imgname2.png
...
|   |   └── 0_imgnameN.png  
│   ├── labels
|   |   ├── 0_imgname0.npy
|   |   ├── 0_imgname1.npy
|   |   ├── 0_imgname2.npy
...
|   |   └── 0_imgnameN.npy  
|   └── types.csv
├── fold1  
│   ├── images  
|   |   ├── 1_imgname0.png
|   |   ├── 1_imgname1.png
...
│   ├── labels
|   |   ├── 1_imgname0.npy
|   |   ├── 1_imgname1.npy
...
|   └── types.csv
...
└── foldk  
│   ├── images  
    |   ├── k_imgname0.png
    |   ├── k_imgname1.png
...
    ├── labels
    |   ├── k_imgname0.npy
    |   ├── k_imgname1.npy
    └── types.csv

Each type csv should have the following header:

img,type                            # Header
foldnum_imgname0.png,SetTypeHeare   # Each row is one patch with tissue type

The labels are numpy masks with the following structure: TBD

Add a new dataset

add to dataset coordnator.

All settings of the dataset must be performed in the correspondinng yaml file, under the data section

dataset name is not case sensitive!