Controversy Trend Detection in Social Media
Web Mining is the retrieval of useful information from web. The information exchanged in social network posts includes not only text but also images, URLs, and videos. This system focuses on mentions of users – links between users that are generated through replies, mentions, and re-tweets. The probability model is proposed to detect the emergence of a new topic from the anomalies measured through the model. The technique demonstrated on dataset which are gathered from twitter. Aggregating anomaly scores from hundreds of users, shows that detecting emerging topics only based on the reply/mention relationships in social network posts. The experiments show that the proposed mention-anomaly-based approaches can detect new topics at least as early as text-anomaly-based approaches.
Toshimitsu Takahashi, Ryota Tomioka, and Kenji Yamanishi, Jan (2014) “Discovering Emerging Topics in Social Streams via Link-Anomaly Detection”, IEEE Transactions On Knowledge And Data Engineering, Vol. 26, No. 1.
J. Allan et al., “Topic Detection and Tracking Pilot Study: Final Report,” Proc. DARPA Broadcast News Transcription and Understanding Workshop, 1998.
J. Kleinberg, “Bursty and Hierarchical Structure in Streams,” Data Mining Knowledge Discovery, vol. 7, no. 4, pp. 373-397, 2003.
Y. Urabe, K. Yamanishi, R. Tomioka, and H. Iwai, “Real-Time Change-Point Detection Using Sequentially Discounting Normalized Maximum Likelihood Coding,” Proc. 15th Pacific-Asia Conf. Advances in Knowledge Discovery and Data Mining (PAKDD’ 11), 2011.
S. Morinaga and K. Yamanishi, “Tracking Dynamics of Topic Trends Using a Finite Mixture Model,” Proc. 10th ACM SIGKDD Int’l Conf. Knowledge Discovery and Data Mining, pp. 811-816, 2004.
Q. Mei and C. Zhai, “Discovering Evolutionary Theme Patterns from Text: An Exploration of Temporal Text Mining,” Proc. 11th ACM SIGKDD Int’l Conf. Knowledge Discovery in Data Mining, pp. 198-207, 2005.
Krause, J. Leskovec, and C. Guestrin, “Data Association for Topic Intensity Tracking,” Proc. 23rd Int’l Conf. Machine Learning (ICML’ 06), pp. 497-504, 2006.
D. He and D.S. Parker, “Topic Dynamics: An Alternative Model of Bursts in Streams of Topics,” Proc. 16th ACM SIGKDD Int’l Conf. Knowledge Discovery and Data Mining, pp. 443-452, 2010.
H. Small, “Visualizing Science by Citation Mapping,” J. Am. Soc. Information Science, vol. 50, no. 9, pp. 799-813, 1999.
D. Aldous, “Exchangeability and Related Topics,” _ Ecoled’ _ Ete´de Probabilite´s de Saint-Flour XIII—1983, pp. 1-198, Springer, 1985.
Y. Teh, M. Jordan, M. Beal, and D. Blei, “Hierarchical Dirichlet Processes,” J. Am. Statistical Assoc., vol. 101, no. 476, pp. 1566-1581, 2006.
D. Lewis, “Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval,” Proc. 10th European Conf. Machine Learning (ECML’ 98), pp. 4-15, 1998.
K. Yamanishi and J. Takeuchi, “A Unifying Framework for Detecting Outliers and Change Points from Non-Stationary Time Series Data,” Proc. Eighth ACM SIGKDD Int’l Conf. Knowledge Discovery and Data Mining, 2002.
J. Takeuchi and K. Yamanishi, “A Unifying Framework for Detecting Outliers and Change Points from Time Series,” IEEE Trans. Knowledge Data Eng., vol. 18, no. 4, pp. 482-492, Apr. 2006.
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.