Document Type
Article
Keywords
Hazy Image Dataset, Deep Learning, CNN Classification, Fog simulation, Haze Dataset
Abstract
To enhance image dehazing and visual recognition in real-world conditions, we introduce HAZE-IMAGE-DATASET, a large-scale dataset comprising nearly 42,000 images. It is constructed from 1,532 clean images sourced globally and captured using a Samsung smartphone, covering diverse natural and urban scenes. The dataset includes synthetic and real haze variations. Synthetic haze was generated using MATLAB-based atmospheric scattering models with depth maps for 10 fog levels. Colored haze was created using alpha blending (α = 0.4) in six colors: red, green, blue, yellow, white, and black. Low-light conditions were simulated via uniform darkening at 10 levels. Also, 616 real haze images were captured using a steam device to replicate genuine haze characteristics. Benchmarking was performed using MobileNetV2 and GoogLeNet. On a hazy subset, MobileNetV2 improved PSNR from 13.34 dB to 21.77 dB and SSIM from 0.805 to 0.950, Also, GoogleNet achieved a PSNR of 22.38 dB and SSIM of 0.953.Classification experiments using a seven-class subset showed high accuracy: 98.93% with AlexNet and 98.13% with GoogLeNet.The dataset is organized into haze types, levels, and color folders. It serves as a strong foundation for training and evaluating image dehazing and recognition models under different visibility conditions. It is available on GitHub: https://github.com/mustafa1jamal/HAZE-IMAGE-DATASET.
How to Cite This Article
Shahbaz, Mustafa J. and Al-Zuky, Ali A.D.
(2025)
"HAZE-IMAGE-DATASET: A Large-Scale Benchmark for Image Dehazing in Variable Fog and Low-Light Conditions,"
Mesopotamian Journal of Computer Science: Vol. 5:
Iss.
1, Article 12.
DOI: https://doi.org/10.58496/MJCSC/2025/012
Available at:
https://map.researchcommons.org/mjcsc/vol5/iss1/12