Authors
Manikyala Rao Tankala, Ch Srinivasa Rao
Publication date
2022/5/7
Book
Computational Intelligence in Data Mining: Proceedings of ICCIDM 2021
Pages
667-680
Publisher
Springer Nature Singapore
Description
Image classification and image recognition are significantly impacted by the creation of Vision Transformers. It is no wonder that CNN architectures utilize multiple layers and a considerable number of hyper-parameters are necessary for training, as they are much more complex. While training and implementing, CNN uses a considerable number of resources. Alternately, Vision Transformers are the innovative architecture of neural networks that allows it to take in an image and break it into little pieces, then use an attention mechanism to search for correlations in the pieces. For localization of the attention mechanism, the transformers are well trained on small visual patches. With the introduction of image manipulation tools, much of the picture data available on the Internet today are designed to fool consumers into thinking the image is genuine. To authenticate photographs, we must utilize many different neural …
Scholar articles
MR Tankala, C Srinivasa Rao - … Intelligence in Data Mining: Proceedings of ICCIDM …, 2022