Community Detection

Definitions and Preliminaries

image-20240929160509328

  • Definition 1 (Network)

    G\cal{G} =(V,E)=(V, E), A basic network

    V={v1,...,vn}V=\{v_1, ..., v_n\}, the node set

    E={eij}i,j=1nE=\{e_{ij}\}^n_{i,j=1}, the edge set

    N(vi)={uV(vi,u)E}N(v_i)=\{u\in V|(v_i,u)\in E\}, the neighborhood of a node viv_i

    A=[aij]A=[a_{ij}] denotes an n×nn\times n dimensional adjacency matrix, where aij=1a_{ij}=1 if eijEe_{ij} \in E; otherwise, aij=0a_{ij}=0.

    If aijajia_{ij}\neq a_{ji}, G\cal{G} is a directed network and an undirected network otherwise.

    If aija_{ij} is weighted by wijWw_{ij}\in W, G\cal{G} =(V,E,W)=(V,E,W) is a weighted network and unweighted network otherwise.

    If aija_{ij}'s value differs in +1 (positive) and -1 (negative), G\cal{G} is a signed network[1].

    If node viVv_i \in V is attributed by xiXRn×dx_i \in X \subseteq \mathbb{R}^{n\times d}, G\cal{G}=(V,E,X)=(V,E,X) is an attributed network; otherwise, it is an unattributed network[2].

  • Definition 2 (Community)

    C\cal{C}={C1,C2,...,CK}=\{C_1, C_2, ..., C_K\}, a set of communities,

    Each community CkC_k is a partition(划分) of G\cal{G} that has a regional structure(区域结构) and some cluster properties(聚类性质).

    A node viv_i clustered into community CkC_k should satisfy the condition that the internal node degree[3] inside the community exceeds its external degree.

    Suppose CkCk=,(k,k,kk)C_k \cap C_{k'}=\emptyset,(\forall k,k',k\ne k'), C\cal{C} denotes disjoint communities; otherwise, overlapping communities.

  • Community Detection Input

    the network topology and network attributes

  • Community Detection Output

    a set of communities that can be either disjoint or overlapping

Convolutional Network-Based Community Detection

  • CNN-Based Community Detection

  • GCN-Based Community Detection

    GCNs were designed for graph-structured data. The layer-wise propagation rule for a GCN is

    H(l+1)=σ(D~12A~D~12H(l)W(l))H^{(l+1)}=\sigma (\tilde{D}^{-\frac{1}{2}} \tilde{A} \tilde{D}^{-\frac{1}{2}} H^{(l)} W^{(l)})

    where the latent representations of the ll th layer are preserved in the matrix H(l)(H(0))=XH^{(l)} (H^{(0)})=X through an activation function σ()\sigma (\cdot) with a layer-specific trainable weight matrix W(l)W^{(l)}. Here, A~=A+In\tilde{A}=A+I_n, where InI_n denotes the identity matrix, and D~ii=ja~ijA~\tilde{D}_{ii}=\sum_j\tilde{a}_{ij} \in \tilde{A}.
    <<<<<<< HEAD

    • Community clustering with unsupervised network representation.
      =======

      • Community clustering with unsupervised network representation.

Hierarchical Networks

d458109d25b68b76821b68eb3978bd202b084e46


  1. 符号网络(Signed Network)是一种特殊类型的网络,其中不仅包含传统的无向或有向的边来表示实体(如人、组织、国家等)之间的关系,而且这些边带有正负符号,用以表示关系的类型是积极的(如友谊、合作)还是消极的(如敌意、竞争)。符号网络能够更准确地反映现实世界中的复杂关系,因为现实世界中的许多关系不仅仅是连接的存在与否,还包括连接的性质。 ↩︎

  2. 属性网络(Attributed Network)是一种网络数据结构,其中节点(顶点)和/或边(连接线)不仅由其在网络中的位置定义,还由关联的属性(特征)定义。这些属性可以是各种类型的信息,如文本、数字、类别或任何可以描述节点或边的实值数据。 ↩︎

  3. 节点的度(Degree)是指与该节点直接相连的边的数量。它是描述节点连接水平的一个基本指标。 ↩︎