The influence of stop words on the sentence classifier based on BERT
- Title
- The influence of stop words on the sentence classifier based on BERT
- Author
- 박도영
- Alternative Author(s)
- Doyeong Park
- Advisor(s)
- 김영훈
- Issue Date
- 2023. 2
- Publisher
- 한양대학교
- Degree
- Master
- Abstract
- Machine learning algorithms advance rapidly and show good perfor- mance. However, interpreting the results calculated by algorithms or train- ing is unclear. BERT, which exhibits excellent performance in natural lan- guage processing, is pre-trained with large-scale data and then used to im- prove performance through fine-tuning according to the user’s intention. BERT is also difficult to analyze how the output is calculated. In this paper, we try to find out which word BERT pays attention to and outputs the re- sults when classifying a given sentence. To interpret the results of BERT’s output, we analyzed the influence of masked words in pre-trained and fine- tuned BERT. Additionally, through the LIME technique, we mainly exam- ined which words have a great influence on the embedding vector results of sentences. Finally, we found that the high-frequency words in a sentence significantly affect the results, but not all the high-frequency words in the sentence impact the result.
- URI
- http://hanyang.dcollection.net/common/orgView/200000651456https://repository.hanyang.ac.kr/handle/20.500.11754/179438
- Appears in Collections:
- GRADUATE SCHOOL[S](대학원) > COMPUTER SCIENCE & ENGINEERING(컴퓨터공학과) > Theses (Master)
- Files in This Item:
There are no files associated with this item.
- Export
- RIS (EndNote)
- XLS (Excel)
- XML