294 0

Single Image Super-Resolution Based on Transformer Bridge

Title
Single Image Super-Resolution Based on Transformer Bridge
Author
Shuting WANG
Alternative Author(s)
왕서정
Advisor(s)
정제창
Issue Date
2022. 8
Publisher
한양대학교
Degree
Master
Abstract
With the improvement of computer hardware computing power and the continuous development of artificial intelligence technology, deep learning has achieved outstanding performance in the field of computer vision. At this stage, super-resolution (SR) based on deep learning mainly uses convolutional networks to recover texture details. However, the receptive field of such neural networks depends on the size of the filter and the number of convolutional layers. Increasing the values of these hyper parameters can increase the complexity of the model or even make it impossible to train. The long-distance dependencies of the convolutional feature extraction process tracking images can make the model quite complex. So convolutional feature extraction has the disadvantage of not being able to obtain long-range dependencies. The transformer, on the other hand, can model longer distance dependencies by using the self-attention mechanism. In this paper, we carry out the research of image SR reconstruction method based on deep learning, and the main research contents and contributions are as follows: (1) In order to better utilize the high-frequency information of the image, an attention-based image SR reconstruction method is proposed. The method introduces attention in the network. The method introduces the attention mechanism in the network, explores a lightweight transformer structure, and uses the self-attentive capability of the transformer to establish a long-range dependency for the network. (2) Proposed a bridging feature extraction structure based on convolution and transformer. The transformer structure is combined with CNNs to establish distal dependencies for the transformer by convolving the input fine local texture details, while the feature representation obtained by the transformer through global similarity modeling can provide distal information for the convolution. Thus, it assists the image SR network to establish distal dependencies and complete the extraction and recovery of high-frequency information from images. In this paper, we use Flickr2k as the training set and extract a part of it as the test set. Experimental results show that by comparing PSNR and SSIM with other SR algorithms, the transformer bridge-based SR n method proposed in this paper produces better SR images compared with the state-of-the-art methods, and our model consistently produces more distinct edges and shapes, effectively improving the visualization of the images.
URI
http://hanyang.dcollection.net/common/orgView/200000628137https://repository.hanyang.ac.kr/handle/20.500.11754/174614
Appears in Collections:
GRADUATE SCHOOL[S](대학원) > DEPARTMENT OF ELECTRONIC ENGINEERING(융합전자공학과) > Theses (Master)
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE