Paper
8 December 2023 Multi-document reading comprehension model based on electra and document sliding window
Zheyu Yang, Benkui Zhang, Nan Jiang
Author Affiliations +
Proceedings Volume 12943, International Workshop on Signal Processing and Machine Learning (WSPML 2023); 129430X (2023) https://doi.org/10.1117/12.3014619
Event: International Workshop on Signal Processing and Machine Learning (WSPML 2023), 2023, Hangzhou, ZJ, China
Abstract
Multi-document reading comprehension is an important and difficult task in natural language processing. To address the issue that ELECTRA pre-training model has length limitation and cannot be directly adapt to multi-document reading comprehension task, this paper proposes a novel model based on ELECTRA and document sliding windows. In the model multiple documents are split and merged through document sliding windows, new segmentation embedding is introduced, answer position in documents is modelled as a learning target, and ELECTRA is used for joint training in each window. After obtaining all prediction outcomes of each window, the results are comprehensively sorted to achieve the optimal answer. The experiments show that Rouge-L of this model reaches 51.28% on the multi-document reading comprehension dataset MS-MARCO, ranking the current best result.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zheyu Yang, Benkui Zhang, and Nan Jiang "Multi-document reading comprehension model based on electra and document sliding window", Proc. SPIE 12943, International Workshop on Signal Processing and Machine Learning (WSPML 2023), 129430X (8 December 2023); https://doi.org/10.1117/12.3014619
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
Back to Top