9. Effectiveness of Data Augmentation in Pointer-Generator Model

Tomohito Ouchi, Masayoshi Tabuse
Graduate School of Life and Environmental Sciences, Kyoto Prefectural University, 1-5 Shimogamohangi-cho, Sakyo-ku, Kyoto 606-8522, Japan
pp.96-100
ABSTRACT
We propose a new data augmentation method in automatic summarization system, especially the Pointer-Generator model. A large corpus is required to create an automatic summarization system using deep learning. However, in the field of natural language processing, especially in the field of automatic summarization, there are not many data sets that are sufficient to train automatic summarization system. Therefore, we propose a new method of data augmentation. We use the Pointer-Generator model. First, we determine the importance of each sentence in an article using topic model. In order to augment the data, we remove the least important sentence from an input article and use it as a new article. We examine the effectiveness of our proposed data augmentation method in automatic summarization system.

ARTICLE INFO
Article History
Received 13 November 2019
Accepted 21 July 2020

Keywords
Automatic summarization
Data augmentation
Pointer-Generator Model

JAALR1209

Download article(PDF)