Open Access Open Access  Restricted Access Subscription or Fee Access

Context Based Topical Document Summarization

Pratik Kamble, S.C. Dharmadhikari

Abstract


A condition of information available on the web is getting increased day by day as a result leading to information overload. To find important and useful information is becoming difficult. This growth has created a huge demand for automatic methods and tools for text summarization. In the process of text summarization, text is reduced to meaningful small size. Sentences are extracted to build summary. summary will be effective when there will be more topical terms in it. So to summarize the text or in general for proper information retrieval term weighting schemes is very important. In this paper a review of various term weighting schemes and different summarization techniques is presented. Proposed context score based text summarization model is presented.


Keywords


Term Weighting, Summarization, Context Score, Information Retrieval.

Full Text:

PDF

References


Ying Liu, Han Tong Loh, Aixin Sun “Imbalanced text classification: A term weighting approach,” Expert Systems with Applications 36 (2009) 690–701

Xiaojun Quan, Wenyin Liu, Senior Member, IEEE, and Bite Qiu, “Term Weighting Schemes for Question Categorization,” IEEE Transactions On Pattern Analysis And Machine Intelligence, VOL. 33, NO. 5, May 2011

R. Radev et al. "Centroid-based summarization of multiple documents," J.nformation Processing and Management: an International, Journal Volume 40 Issue 6 pp919-938, November 2004

G. Erkan and D.R. Radev, “LexRank: Graph-Based Lexical Centrality as Salience in Text Summarization,” J. Artificial Intelligence Research, vol. 22, pp. 457-479, http://portal.acm.org/ citation.cfm?id=1622487.1622501, Dec. 2004.

P. Goyal, L. Behera and T. M. McGinnity "A Context-Based Word Indexing Model for Document Summarization," IEEE Trans. Knowl. Data Eng., VOL. 25, NO. 8 pp 1693-1705, AUGUST 2013.

K. Wong et al. , "Extractive summarization using supervised and semi-supervised learning," Proc. International Conference on Computational Linguistics, pp.985-992, August 18-22, 2008, Manchester, United Kingdom

Iyad Batal , Milos Hauskrecht, "Boosting KNN text classification accuracy by using supervised term weighting schemes," Proc. ACM conference on Information and knowledge management, November 02-06, 2009, Hong Kong, China [doi>10.1145/1645953.1646296]

C.-Y. Lin and E. Hovy, “The Automated Acquisition of Topic Signatures for Text Summarization,” Proc. 18th Conf. Computational Linguistics, pp. 495-501, http://dx.doi.org/10.3115/990820.9 90892, 2000.

Chris Buckley, "The importance of proper weighting methods," Proc. Human Language Technology, March 21-24, 1993, Princeton, New Jersey [doi>10.3115/1075671.1075753]

Youngjoong Ko , Jungyun Seo, "An effective sentence-extraction technique using contextual information and statistical approaches for text summarization," Pattern Recognition Letters, v.29 n.9, p.1366-1371, July, 2008 [doi>10.1016/j.patrec.2008.02.008]

Regina Barzilay , Michael Elhadad, "Using Lexical Chains for Text Summarization," Proc. ACL Workshop on Intelligent Scalable Text Summarization pp.10-17, 1997

H.P. Luhn, “The Automatic Creation of Literature Abstracts,” IBM J. Research and Development, vol. 2, pp. 159-165, http://dx.doi.org/ 10.1147/rd.22.0159, Apr. 1958.

0R. Mihalcea, "Graph-based ranking algorithms for sentence extraction, applied to text summarization," Proc.ACL Interactive poster and demonstration sessions, pp.20-es, July 21-26, 2004, Barcelona, Spain [doi>10.3115/1219044.1219064]

M. Liu, " Extractive summarization based on event term clustering," Proc. Proceedings Interactive Poster and Demonstration Sessions pp. 185-188

G. Carenini, R. T. Ng and X. Zhou,"Summarizing Emails with Conversational Cohesion and Subjectivity," Proc. of ACL-08 HLT, pp. 353–361, Columbus, Ohio, USA, June 2008

S. Wan and C. Paris, "In-browser summarisation: Generating elaborative summaries biased towards the reading context,"in Human Language Technologies, 2008, pp.129-132

Y. Ouyang et al., "An Integrated Multi-document Summarization Approach based on

Word Hierarchical Representation," proc. ACL-IJCNLP 2009 Conference Short Papers, pp. 113-116, Suntec, Singapore, 4 August 2009.

Man Lan, Chew Lim Tan, Senior Member, IEEE, Jian Su, and Yue Lu, Member, IEEE, “Supervised and Traditional Term Weighting Methods for Automatic Text Categorization,” IEEE Transactions On Pattern Analysis And Machine Intelligence, VOL. 31, NO. 4, April 2009

X. Wan and J. Xiao, “Exploiting Neighborhood Knowledge for Single Document Summarization and Keyphrase Extraction,” ACM Trans. Information Systems, vol. 28, pp. 8:1-8:34, http://doi. acm.org/10.1145/1740592.1740596, June 2010.

V. Murthy.G et al.," A comparative study on term weighting methods for automated telugu text categorization with effective classifiers," J. Data mining and knowledge management process, vol. 3, pp. 95-105, 2013.

K. Hashimoto and T. Yukawa,"Term Weighting Classification System Using the Chi-square Statistic for theClassification Subtask at NTCIR-6 Patent Retrieval Task," Proc. NTCIR-6 Workshop Meeting, pp. 385-389, 2007.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.