Author(s): Rasim Alguliev | Ramiz Aliguliyev | Makrufa Hajirahimova
Journal: Intelligent Control and Automation
ISSN 2153-0653
Volume: 01;
Issue: 02;
Start page: 105;
Date: 2010;
Original page
Keywords: Multi-Document Summarization | Content Coverage | Less Redundancy | Integer Linear Programming
ABSTRACT
This paper proposes an extractive generic text summarization model that generates summaries by selecting sentences according to their scores. Sentence scores are calculated using their extensive coverage of the main content of the text, and summaries are created by extracting the highest scored sentences from the original document. The model formalized as a multiobjective integer programming problem. An advantage of this model is that it can cover the main content of source (s) and provide less redundancy in the generated sum- maries. To extract sentences which form a summary with an extensive coverage of the main content of the text and less redundancy, have been used the similarity of sentences to the original document and the similarity between sentences. Performance evaluation is conducted by comparing summarization outputs with manual summaries of DUC2004 dataset. Experiments showed that the proposed approach outperforms the related methods.
Journal: Intelligent Control and Automation
ISSN 2153-0653
Volume: 01;
Issue: 02;
Start page: 105;
Date: 2010;
Original page
Keywords: Multi-Document Summarization | Content Coverage | Less Redundancy | Integer Linear Programming
ABSTRACT
This paper proposes an extractive generic text summarization model that generates summaries by selecting sentences according to their scores. Sentence scores are calculated using their extensive coverage of the main content of the text, and summaries are created by extracting the highest scored sentences from the original document. The model formalized as a multiobjective integer programming problem. An advantage of this model is that it can cover the main content of source (s) and provide less redundancy in the generated sum- maries. To extract sentences which form a summary with an extensive coverage of the main content of the text and less redundancy, have been used the similarity of sentences to the original document and the similarity between sentences. Performance evaluation is conducted by comparing summarization outputs with manual summaries of DUC2004 dataset. Experiments showed that the proposed approach outperforms the related methods.