Social Structure-Based Methodology for User Identity Matching Across Social Networks

Authors

  • Guanlong Zhu School of Computer Science and Technology, Qingdao University, Qingdao, China Author

DOI:

https://doi.org/10.63313/CS.8002

Keywords:

Social Networks, User Matching, Network Embedding, Transformer

Abstract

Cross-social-network user matching technology can identify the same individual across different social networks, enabling cross-network data integration. This allows for precise user profiling, which facilitates applications such as user recommendation and influence maximization. In such tasks, the social structure of users plays a critical role. However, due to the vast and complex nature of social network data, issues such as data sparsity and heterogeneity across net-works arise. Additionally, as the depth of network layers increases, extracted features tend to degrade and become homogeneous. To address these challenges, this paper proposes a Social Network Structure-based User Matching (SSUM) method. The approach introduces a subgraph structure extraction module, which, combined with the original network topology, is fed into a Transformer layer to further extract global features. This mitigates over-smoothing and over-squeezing caused by excessive layer depth, thereby obtaining vector rep-resentations of user topological structures. Subsequently, word-level vectors of user information and text-level vectors of user-generated content are extracted as user attribute features. These are concatenated to form the final user embed-ding representation. User matching is then transformed into a ranking problem by calculating Manhattan distances. Furthermore, to enhance model perfor-mance, user relationships in the dataset are pre-completed. Experimental results on real-world social network datasets demonstrate that the proposed method effectively extracts user features, improves matching accuracy, and outperforms comparative models.

References

[1] Gao, Ming, Lim, Ee Peng, Lo, David, et al. CNL: Collective Network Linkage Across Hetero-geneous Social Platforms[C]. In: IEEE International Conference on Data Min-ing.2015.757-762.https://doi.org/10.1109/ICDM.2015.34

[2] Li, Chaozhuo, Wang, Senzhang, Yu, Philip S, et al. Distribution distance minimization for unsupervised user identity linkage[C]. In: Proceedings of the 27th ACM International Con-ference on Information and Knowledge Management.2018.447-456.

https://doi.org/10.1145/3269206.3271675

[3] Tao Zhou,Ee-Peng Lim,Roy Ka-Wei Lee,Feida Zhu,Jiuxin Cao.Retrofitting Embeddings for Unsupervised User Identity Linkage[C].In Proceeding ofthe 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining(PAKDD 2020).Singapore.May 2020.https://doi.org/10.1007/978-3-030-47426-3_30

[4] HU Z, WANG J, CHEN S, et al. A semi-supervised framework with efficient feature extrac-tion and network alignment for user identity linkage[C]//International Conference on Da-tabase Systems for Advanced Applications. Cham, Springer, 2021: 675-691.

https://doi.org/10.1007/978-3-030-73197-7_46

[5] Baiyang C ,Xiaoliang C .MAUIL: Multilevel attribute embedding for semisupervised user identity linkage[J].Information Sciences,2022,593527-545.

https://doi.org/10.1016/j.ins.2022.02.023

[6] Li C , Wang S , Liu Z ,et al.Semi-Supervised Variational User Identity Linkage via Noise-Aware Self-Learning[J]. 2021. https://doi.org/10.48550/arXiv.2112.07373

[7] Liu, L.; Cheung, W.K.; Li, X.; Liao, L. Aligning users across social networks using network embedding. In Proceedings of the 25th International Joint Conference on Artificial Intelli-gence, New York, NY, USA, 9-15 July 2016; pp. 1774-1780. https://dl.acm.org/doi/abs/10.5555/3060832.3060869

[8] Liu, L.; Zhang, Y.M.; Fu, S.; Zhong, F.J.; Hu, J.; Zhang, P. ABNE: An Attention-Based Network Embedding for User Alignment Across Social Networks. IEEE Access 2019, 7, 23595-23605.

https://doi.org/10.1109/ACCESS.2019.2900095.

[9] Zhang J , Chen B , Wang X ,et al.MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks[J].ACM, 2018.DOI:10.1145/3269206.3271705.

https://doi.org/10.1145/3269206.3271705.

[10] Huang S , Xiang H , Leng C ,et al.Cross-Social-Network User Identification Based on Bidirec-tional GCN and MNF-UI Models[J].Electronics (2079-9292), 2024, 13(12).DOI:10.3390/electronics13122351.

https://doi.org/10.3390/electronics13122351.

[11] Mikolov T , Chen K , Corrado G ,et al.Efficient Estimation of Word Representations in Vec-tor Space[J].Computer Science, 2013.DOI:10.48550/arXiv.1301.3781. https://doi.org/10.48550/arXiv.1301.3781

[12] Sanh V , Debut L , Chaumond J ,et al.DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter[J]. 2019.DOI:10.48550/arXiv.1910.01108. https://doi.org/10.48550/arXiv.1910.01108

[13] Xu K , Hu W , Leskovec J ,et al.How Powerful are Graph Neural Networks?[J]. 2018.DOI:10.48550/arXiv.1810.00826. https://doi.org/10.48550/arXiv.1810.00826

[14] Vaswani A , Shazeer N , Parmar N ,et al.Attention Is All You Need[J].arXiv, 2017.DOI:10.48550/arXiv.1706.03762. https://doi.org/10.48550/arXiv.2407.15516

[15] Min Y ,Baiyang C ,Xiaoliang C .JARUA: Joint Embedding of Attributes and Relations for User Alignment across Social Networks[J].Applied Sciences,2022,12(24):12709-12709.

https://doi.org/10.3390/app122412709.

[16] Lei T ,Ji L ,Wang G , et al.Transformer-Based User Alignment Model across Social Net-works[J].Electronics,2023,12(7).

https://doi.org/10.3390/electronics12071686.

Downloads

Published

2025-03-25

How to Cite

Social Structure-Based Methodology for User Identity Matching Across Social Networks. (2025). 计算机科学辑要, 1(1), 23-32. https://doi.org/10.63313/CS.8002