Authors
Zhichao Lu, Chuntao Ding, Felix Juefei-Xu, Vishnu Naresh Boddeti, Shangguang Wang, Yun Yang
Publication date
2022/11/17
Journal
IEEE Transactions on Parallel and Distributed Systems
Volume
34
Issue
2
Pages
598-610
Publisher
IEEE
Description
Deploying high-performance vision transformer (ViT) models on ubiquitous Internet of Things (IoT) devices to provide high-quality vision services will revolutionize the way we live, work, and interact with the world. Due to the contradiction between the limited resources of IoT devices and resource-intensive ViT models, the use of cloud servers to assist ViT model training has become mainstream. However, due to the larger number of parameters and floating-point operations (FLOPs) of the existing ViT models, the model parameters transmitted by cloud servers are large and difficult to run on resource-constrained IoT devices. To this end, this article proposes a transmission-friendly ViT model, TFormer, for deployment on resource-constrained IoT devices with the assistance of a cloud server. The high performance and small number of model parameters and FLOPs of TFormer are attributed to the proposed hybrid …
Total citations
Scholar articles
Z Lu, C Ding, F Juefei-Xu, VN Boddeti, S Wang, Y Yang - IEEE Transactions on Parallel and Distributed Systems, 2022