Hasan, M. A., & Dey, K. (2024). Depthwise Separable Convolutions with Deep Residual Convolutions. arXiv preprint arXiv:2411.07544v1.
This paper aims to address the challenge of deploying computationally expensive deep learning models, specifically the Xception architecture, on resource-constrained edge devices. The authors propose an optimized Xception architecture that reduces computational complexity while maintaining accuracy for object detection tasks.
The authors propose a modified Xception architecture that replaces standard convolutional layers with depthwise separable convolutions and incorporates deep residual connections. This architecture is designed to reduce the number of trainable parameters and computational load. The proposed model is evaluated on the CIFAR-10 object detection dataset and compared to the original Xception architecture in terms of training time, memory consumption, and validation accuracy.
The proposed optimized Xception architecture demonstrates a significant reduction in the number of trainable parameters (approximately 3 times fewer) compared to the original Xception model. This reduction leads to faster training times and lower memory consumption. Despite the architectural changes, the optimized model achieves comparable and even surpasses the original Xception's accuracy on the CIFAR-10 dataset.
The study demonstrates the feasibility of deploying optimized deep learning models like Xception on edge devices without significantly compromising accuracy. The proposed architecture, utilizing depthwise separable convolutions and deep residual connections, effectively reduces computational complexity and resource requirements, making it suitable for edge deployments.
This research contributes to the growing field of efficient deep learning for edge computing. The proposed optimized Xception architecture offers a practical solution for deploying complex models on resource-constrained devices, potentially enabling a wider range of AI-powered applications on edge devices.
The study is limited by its evaluation on a single dataset (CIFAR-10). Further evaluation on larger and more diverse datasets is necessary to confirm the generalizability of the findings. Additionally, exploring the performance of the optimized model on various edge devices with different hardware specifications would provide valuable insights for real-world deployment scenarios.
Egy másik nyelvre
a forrásanyagból
arxiv.org
Mélyebb kérdések