Mathematics-Driven Enhancements in Object Detection: a Hybrid Deep Learning Framework

EasyChair Preprint 15504

6 pages•Date: November 30, 2024

Abstract

This paper explores the mathematical foundation of hybrid object detection models, combining Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). We provide a detailed mathematical formulation for feature extraction, attention mechanisms, and optimization strategies. By integrating advanced regularization techniques and loss functions, we aim to improve accuracy while reducing computational overhead. Key contributions include mathematical derivations for attention-aware convolutional layers and a custom dynamic loss function that balances localization and classification errors.

Keyphrases: Algorithms, CNN, ViT, deep learning

Links:

https://easychair.org/publications/preprint/Xmj5

BibTeX entry

BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:

@booklet{EasyChair:15504,
  author    = {Michael Lornwood},
  title     = {Mathematics-Driven Enhancements in Object Detection: a Hybrid Deep Learning Framework},
  howpublished = {EasyChair Preprint 15504},
  year      = {EasyChair, 2024}}

Download PDF Open PDF in browser