2009 CombiningLinkandContentforCommu

From GM-RKB
Jump to navigation Jump to search

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Discriminative Model, EM Algorithm, Link analysis, Two-Stage Optimization

Abstract

In this paper, we consider the problem of combining link and content analysis for community detection from networked data, such as paper citation networks and World Wide Web. Most existing approaches combine link and content information by a generative model that generates both links and contents via a shared set of community memberships. These generative models have some shortcomings in that they failed to consider additional factors that could affect the community memberships and isolate the contents that are irrelevant to community memberships. To explicitly address these shortcomings, we propose a discriminative model for combining the link and content analysis for community detection. First, we propose a conditional model for link analysis and in the model, we introduce hidden variables to explicitly model the popularity of nodes. Second, to alleviate the impact of irrelevant content attributes, we develop a discriminative model for content analysis. These two models are unified seamlessly via the community memberships. We present efficient algorithms to solve the related optimization problems based on bound optimization and alternating projection. Extensive experiments with benchmark data sets show that the proposed framework significantly outperforms the state-of-the-art approaches for combining link and content analysis for community detection.



References

,

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2009 CombiningLinkandContentforCommuTianbao Yang
Rong Jin
Yun Chi
Shenghuo Zhu
Combining Link and Content for Community Detection: A Discriminative ApproachKDD-2009 Proceedings10.1145/1557019.15571202009