Semantic Coherence Analysis of English Texts Based on Sentence Semantic Graphs
DOI:
https://doi.org/10.4108/eetsis.3312Keywords:
english text, semantic coherence theory, sentence semantic graph, VF2 subgraph matching algorithm, frequent subgraphAbstract
With the reform of China's education industry, more and more universities are using computers to conduct examinations. For the automatic correction of essays as subjective questions, existing automatic English text scoring systems suffer from insufficient extraction of coherence information and low accuracy when analysing text coherence. Therefore, this paper proposes an unsupervised semantic coherence analysis model for English texts based on sentence semantic graphs, taking Chinese students' English compositions as the research context. Guided by the semantic coherence theory, the English text is represented as a sentence semantic graph, and an improved VF2 subgraph matching algorithm is used to mine the frequently occurring subgraph patterns in the sentence semantic graph. After that, the set of frequent subgraphs is generated by filtering the subgraph patterns according to their frequencies, and the subgraph frequency of each frequent subgraph is calculated separately. Finally, the distribution characteristics of frequent subgraphs and the semantic values of subgraphs in the sentence semantic graphs are extracted to quantify the overall coherence quality of English texts. The experimental results show that the model proposed in this paper has higher accuracy and practical value compared with the current methods of coherence analysis.
References
Liu S, Zeng S, Li S. Evaluating text coherence at sentence and paragraph levels[J]// Proceedings of the 12th Conference on Language Resources and Evaluation, 2020, 1695-1703.
Chen R, Wang J, Yu L C, et al. Learning to Memorize Entailment and Discourse Relations for Persona-Consistent Dialogues[J]// Proceedings of the AAAI conference on artificial intelligence, 2023.
Mesgar M, Bücker S, Gurevych I. Dialogue coherence assessment without explicit dialogue act labels[J]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020, 1439-1450.
Hsiao M, Hung M. Construction of an Artificial Intelligence Writing Model for English Based on Fusion Neural Network Model[J]. Computational Intelligence and Neuroscience, 2022: 2022.
Guinaudeau C, Strube M. Graph-based local coherence modeling[C]// Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2013, 93-103.
Goyal T, Li J J, Durrett G. Snac: Coherence error detection for narrative summarization[J]. arXiv preprint arXiv:2205.09641, 2022.
Ghazarian S, Wen N, Galstyan A, et al. DEAM: Dialogue Coherence Evaluation using AMR-based Semantic Manipulations[J]. arXiv preprint arXiv:2203.09711, 2022.
Papalampidi P, Cao K, Kocisky T. Towards coherent and consistent use of entities in narrative generation[C]// International Conference on Machine Learning. PMLR, 2022, 17278-17294.
Zhao Q, Niu J, Liu X, et al. Generation of Coherent Multi-Sentence Texts with a Coherence Mechanism[J]. Computer Speech & Language, 2023, 78: 101457.
Akula A R, Zhu S C. Discourse Analysis for Evaluating Coherence in Video Paragraph Captions[J]. arXiv e-prints, 2022.
Zhao W, Strube M, Eger S. Discoscore: Evaluating text generation with bert and discourse coherence[J]. arXiv preprint arXiv:2201.11176, 2022.
Pu L, Heng R, Xu B. Language Development for English-Medium Instruction: A Longitudinal Perspective on the Use of Cohesive Devices by Chinese English Majors in Argumentative Writing[J]. Sustainability, 2022, 15(1): 17.
Vrana S R, Bono R S, Konig A, et al. Assessing the coherence of narratives of traumatic events with latent semantic analysis[J]. Psychological Trauma: Theory, Research, Practice, and Policy, 2019, 11(5): 521.
Gotama P J W , Tokunaga T . Evaluating text coherence based on semantic similarity graph[C]. 2017 Workshop on Graph Based Methods in Natural Language Processing, Annual Meeting of Association for Computational Linguistics (ACL), 2017.
Huang G, Tang H, Wang J, et al. A Coherence Analysis Model for English Essay Based on Sentence Semantic Graph[C]// Journal of Physics: Conference Series. IOP Publishing, 2020, 1693(1): 012077.
Hung M, Hsiao M. Application of Adaptive Neural Network Algorithm Model in English Text Analysis[J]. Computational Intelligence and Neuroscience, 2022, 2022.
Srinivas V, Santhirani C. Optimization-based support vector neural network for speaker recognition[J]. The Computer Journal, 2020, 63(1): 151-167
Toutanova K, Klein D, Manning C D, et al. Feature-rich part-of-speech tagging with a cyclic dependency network[C]// Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003, 252-259.
Raghunathan K, Lee H, Rangarajan S, et al. A multi-pass sieve for coreference resolution[C]// Proceedings of the 2010 conference on empirical methods in natural language processing, 2010, 492-501.
Lee H, Peirsman Y, Chang A, et al. Stanford’s multi-pass sieve coreference resolution system at the conll-2011 shared task[C]// Proceedings of the fifteenth conference on computational natural language learning: Shared task, 2011, 28-34.
Oberoi K S, Del Mondo G, Gaüzère B, et al. Detecting dynamic patterns in dynamic graphs using subgraph isomorphism[J]. Pattern Analysis and Applications, 2023: 1-17.
Born L , Mesgar M , Strube M . Using a Graph-based Coherence Model in Document-Level Machine Translation[C]// Proceedings of the Third Workshop on Discourse in Machine Translation. 2017.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Nanxiao Deng, Yabing Wang, Guimin Huang, Ya Zhou, Yiqun Li
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.
Funding data
-
National Natural Science Foundation of China
Grant numbers 62066009