mirror of
https://github.com/botastic/SoftGroup.git
synced 2025-10-16 11:45:42 +00:00
2.3 KiB
2.3 KiB
Tips for custom dataset.
Data preparation.
- Step 1: split your data to
train/valfolder - Step 2: for each scene construct a
.pthfile that contains:point XYZ coordinates: shape of (N, 3)colors RGB: shape of (N, 3)semantic labels: shape of (N, )insance labels: shape(N, )
Noted that colors should be normalized in range [-1, 1], see here.
Config
The following configs may be modified for custom dataset.
semantic_classes: the number of class for semantic segmentationinstance_classes: the number of semantic classes considered for instance segmentation. For example, in ScanNet dataset config,wallandflooris not considered for instance segmentation. So thatinstance_classes = semantic_classes - 2.sem2ins_classes: use this when you directly use semantic segmentation results as instance segmentation results for specified classes. For example, in S3DIS dataset, classfloorandceil(index [0, 1]) are specified since most of the cases, each scene has only one floor and one ceil.class_numpoint_mean: the number of points for each instance per class. shape of(semantic_classes, )scale: the point coordinates are scaled up for voxelization. Fromscale, we can infervoxel_size = 1 / scale. Indoor datasets often use scale = 50 (voxel_size = 0.02m). In outdoor datasets, the voxelize should be larger due to higher spasity. For example, in STPLS3D dataset,scaleis set to 3 (voxel_size = 0.33m). Ablation may be needed to figure out whichscaleis most suitable to your dataset.grouping_cfg.radius: The radius for grouping. This value is related to voxel_size. When the voxelize is higher, the radius should be also higher.grouping_cfg.ignore_classes: the semantic class indices that are not considered for grouping.
For further information, you can compare the configs of STPLS3D and ScanNet.