Pre-Training Hyperspectral Image Encoder via Synthetic Data

2025-01-0442

09/16/2025

Authors Abstract
Content
Computer vision is being revolutionized by the use of transformer-based machine learning architectures. However, these models need large datasets to enable pre-training through self-supervised learning. However, there is a lack of open-source datasets of the same magnitude as standard RGB color images. This work analyzes the effect of using randomly generated fractal-based hyperspectral images versus real data to understand the effect of pre-training dataset on a Swin image encoder model performance, during supervised-training of a semantic segmentation hyperspectral dataset. Two real data datasets are used for comparison to the synthetic dataset, one RGB-based and another hyperspectral-based to understand how variability in spectral resolution during pre-training effects model performance on semantic segmentation.
Meta TagsDetails
DOI
https://doi.org/10.4271/2025-01-0442
Pages
14
Citation
Medellin, A., Grabowsky, D., Mikulski, D., and Langari, R., "Pre-Training Hyperspectral Image Encoder via Synthetic Data," SAE Technical Paper 2025-01-0442, 2025, https://doi.org/10.4271/2025-01-0442.
Additional Details
Publisher
Published
Sep 16
Product Code
2025-01-0442
Content Type
Technical Paper
Language
English