张中军, 董仕. 基于聚类融合的邮件社交网络社区划分方法[J]. 云南大学学报(自然科学版), 2017, 39(2): 178-184. doi: 10.7540/j.ynu.20160469
引用本文: 张中军, 董仕. 基于聚类融合的邮件社交网络社区划分方法[J]. 云南大学学报(自然科学版), 2017, 39(2): 178-184. doi: 10.7540/j.ynu.20160469
ZHANG Zhong-jun, DONG Shi. Mail social network community partitioning method based on clustering ensemble[J]. Journal of Yunnan University: Natural Sciences Edition, 2017, 39(2): 178-184. DOI: 10.7540/j.ynu.20160469
Citation: ZHANG Zhong-jun, DONG Shi. Mail social network community partitioning method based on clustering ensemble[J]. Journal of Yunnan University: Natural Sciences Edition, 2017, 39(2): 178-184. DOI: 10.7540/j.ynu.20160469

基于聚类融合的邮件社交网络社区划分方法

Mail social network community partitioning method based on clustering ensemble

  • 摘要: 为解决邮件社区挖掘中涉及内容隐私及社区形态单一问题,提出了一种基于聚类融合的邮件社区划分算法.该方法中首先利用邮件社交网络特征及邮件属性衡量节点间距离,避免对邮件内容的分析导致涉及用户隐私,其次使用K-Means算法产生若干初始聚类结果,同时引入共协矩阵记录初始聚类时节点的归属,最后根据共协矩阵中邮箱节点间的相似程度,使用融合算法合并初始聚类结果得到最终社区结构.实验表明,该算法未使用邮件内容,得到的社区结构质量较高,并能发现多形态社区.

     

    Abstract: In order to solve the problems related to user’s privacies and single forms in detecting mail communities,the paper proposes an effective mail community partition method based on clustering ensemble.In a social network,the proposed method first utilizes the network structure features and mail attributes to measure the distances between nodes,instead of analyzing mail content that might be involved in user’s privacies.Then,the method generates initial clusters by the K-Means algorithm and adopts the co-association matrix to decide which cluster each node belongs to.Finally,according to the similarity between nodes in the co-association matrix,initial clusters are effectively combined to produce final communities.Experimental results show that the communities obtained by this algorithm are of high quality,although it does not use E-mail content,also it can find communities with various forms.

     

/

返回文章
返回