208 0

The influence of stop words on the sentence classifier based on BERT

Title
The influence of stop words on the sentence classifier based on BERT
Author
박도영
Alternative Author(s)
Doyeong Park
Advisor(s)
김영훈
Issue Date
2023. 2
Publisher
한양대학교
Degree
Master
Abstract
Machine learning algorithms advance rapidly and show good perfor- mance. However, interpreting the results calculated by algorithms or train- ing is unclear. BERT, which exhibits excellent performance in natural lan- guage processing, is pre-trained with large-scale data and then used to im- prove performance through fine-tuning according to the user’s intention. BERT is also difficult to analyze how the output is calculated. In this paper, we try to find out which word BERT pays attention to and outputs the re- sults when classifying a given sentence. To interpret the results of BERT’s output, we analyzed the influence of masked words in pre-trained and fine- tuned BERT. Additionally, through the LIME technique, we mainly exam- ined which words have a great influence on the embedding vector results of sentences. Finally, we found that the high-frequency words in a sentence significantly affect the results, but not all the high-frequency words in the sentence impact the result.
URI
http://hanyang.dcollection.net/common/orgView/200000651456https://repository.hanyang.ac.kr/handle/20.500.11754/179438
Appears in Collections:
GRADUATE SCHOOL[S](대학원) > COMPUTER SCIENCE & ENGINEERING(컴퓨터공학과) > Theses (Master)
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE