Pre-Training Hyperspectral Image Encoder via Synthetic Data
2025-01-0442
09/16/2025
- Content
- Computer vision is being revolutionized by the use of transformer-based machine learning architectures. However, these models need large datasets to enable pre-training through self-supervised learning. However, there is a lack of open-source datasets of the same magnitude as standard RGB color images. This work analyzes the effect of using randomly generated fractal-based hyperspectral images versus real data to understand the effect of pre-training dataset on a Swin image encoder model performance, during supervised-training of a semantic segmentation hyperspectral dataset. Two real data datasets are used for comparison to the synthetic dataset, one RGB-based and another hyperspectral-based to understand how variability in spectral resolution during pre-training effects model performance on semantic segmentation.
- Pages
- 14
- Citation
- Medellin, A., Grabowsky, D., Mikulski, D., and Langari, R., "Pre-Training Hyperspectral Image Encoder via Synthetic Data," SAE Technical Paper 2025-01-0442, 2025, https://doi.org/10.4271/2025-01-0442.