Master Thesis

Improved Driver Distraction Detection Using Self-Supervised Learning

This thesis investigates driver distraction detection using the Drive&Act dataset and Vision Transformers (ViTs) trained with supervised and self-supervised learning (SSL). The core challenge addressed was imbalanced data and generalization across views and modalities (RGB vs. IR). To tackle this, I proposed the Clustered Feature Weighting (CFW) algorithm, a label-free sampling strategy that balances training batches using unsupervised clustering (HDBSCAN) and weighted random sampling.

CFW improved dataset balance and boosted cross-modality generalization by up to +7.17% balanced accuracy when adapting RGB-trained models to infrared imagery. SSL-based encoders (DINOv2) consistently outperformed supervised ViTs in generalization, particularly for grayscale and IR modalities, affirming their potential for robust, adaptable distraction detection systems in automotive safety.


Thesis Information

  • Title: Improved Driver Distraction Detection Using Self-Supervised Learning
  • Author: Suraj Bhardwaj
  • Institution: Universität Siegen, Faculty of Electrical Engineering and Computer Science
  • Program: M.Sc. International Graduate Studies in Mechatronics
  • Supervisors: Prof. Dr. Michael Möller, Dr. Jovita Lukasik, David Lerch M.Sc.
  • Submission Date: 15 May 2024

Key Highlights

  • Introduced Clustered Feature Weighting (CFW) to mitigate dataset imbalance in driver distraction detection.
  • Demonstrated cross-view and cross-modality generalization (RGB ↔ IR) using Vision Transformers.
  • Achieved +7.17% improvement in balanced accuracy on cross-modality tests with SSL-based encoders.
  • Collaboration with Fraunhofer IOSB, advancing research in Human–AI Interaction.

Resources