MarkTechPost@AI 2024年11月10日
From Edges to Nodes: SEGMN’s Comprehensive Approach to Graph Similarity
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了一种名为SEGMN的图相似性计算框架,旨在解决现有方法在图表示和匹配方面的局限性。SEGMN通过双重嵌入学习模块和结构感知匹配模块,增强了节点和边的表示,并考虑了图结构信息,从而提高了图相似性计算的准确性。研究人员在多个真实数据集上进行了评估,结果表明SEGMN在多种指标上均优于其他基线模型,为图相似性计算提供了更鲁棒的解决方案。

🤔 **SEGMN框架旨在解决现有图相似性计算方法在图表示和匹配方面的局限性。** 许多现有方法仅使用简单的节点嵌入,而忽略了边缘信息,导致结构匹配不准确。

💡 **SEGMN采用了双重嵌入学习模块,分别学习节点和边的嵌入。** 其中,边缘增强GCN用于学习边缘嵌入,残差连接GCN用于学习节点嵌入,最终每个节点聚合其连接的边缘嵌入生成最终的双重嵌入。

🚀 **SEGMN引入结构感知匹配模块,进一步提高了匹配准确性。** 该模块通过考虑节点对之间的结构关系,改进节点对相似性得分,最终通过卷积和自注意力机制细化相似性矩阵,从而获得更准确的相似性分数。

📊 **SEGMN在AIDS、LINUX和IMDB等真实数据集上进行了评估,结果表明其性能优于其他基线模型。** 在MSE、Spearman秩相关系数、Kendall Tau系数和p@10等指标上,SEGMN都取得了更好的结果,其中结构感知匹配模块提升了高达25%的性能。

🏆 **SEGMN为图相似性计算提供了更鲁棒的解决方案,为该领域未来的研究奠定了基础。**

In today’s world, Graph similarity computation (GSC) plays an important role in various applications such as code detection, molecular graph similarity, image matching, etc., by evaluating the similarity between two graphs, and it is based on Graph similarity learning. Graph Edit Distance (GED) and Maximum Common Subgraph (MCS) are widely used to measure graph similarity. GED refers to the minimum number of operations that convert one graph to the other, and MCS is the largest subgraph that is simultaneously isomorphic(structurally identical) to both graphs. However, calculating GED and MCS is difficult because of problems that belong to a category called NP-complete, meaning they are extremely hard to solve efficiently, especially as the graphs get larger. Traditional algorithms like Hungar and A* can compute GED accurately but at the cost of high computational complexity. Computing graph similarity scores accurately is important, as this can significantly impact its applications. 

The existing methods to compute graph similarity have two major drawbacks:

1. Representation limitation: A comprehensive representation enables matching from multiple perspectives. Most methods only use simple node embeddings without highlighting the edge representation, which is crucial for accurately comparing structures.

2. Matching inadequacy:  Some recent GSC methods use Graph neural networks (GNNs) to take advantage of intra-graph structures in message passing. However, most of them need to fully utilize the information presented by edges to boost the representation of the connected nodes. Moreover, previous cross-graph node embedding matching needs to include the bigger picture of the overall structure of the graph pair because the node representations from GNNs only focus on the intra-graph structure, causing an unreasonable similarity score.

To overcome the drawbacks of previous works, researchers from Nanjing University of Posts and Telecommunications proposed a Structure Enhanced Graph Matching Network (SEGMN) framework. Equipped with a dual embedding learning module and a structure perception matching module, SEGMN achieves structure enhancement in both embedding learning and cross-graph matching. This proposed framework consists of four modules: Dual embedding learning, Cross-graph interaction, Structure perception matching, and Similarity matrix learning.

The dual embedding learning module can be divided into three steps. An edge-enhanced GCN is applied to the line graph for edge embedding. Secondly, a GCN with residual connections is applied to the node graph for embedding. Thirdly, each node aggregates the connected edge embeddings to generate the ultimate dual embedding. This dual embedding is used for cross-graph matching on the node level, where node-graph attention is adopted to calculate each node pair’s similarity score. The structure perception matching improves these scores by considering the structural relationships between node pairs across graphs. Finally, the similarity matrix is refined using convolution and self-attention to produce accurate similarity scores, with a loss function based on Mean Squared Error to optimize the model’s predictions.

Researchers evaluated SEGMN using three benchmark real-world datasets: AIDS, LINUX, and IMDB, and compared it to other groups of baselines such as GCN, GIN, GAT, SimGNN, and GraphSim. Upon evaluation, the proposed method outperformed other models in terms of Mean Square Error (MSE), Spearman’s Rank Correlation (ρ), Kendall’s Tau (τ), and precision at top 10 (p@10), with the structure perception matching module further improving performance by up to 25%.

In conclusion, the proposed framework is based on a dual embedding learning module and a structure-matching perception module, which perform better than traditional methods and mitigate their issues. It also offers a robust solution for graph similarity computation by providing a strong way to calculate it more accurately. This research marks an essential step toward understanding graph similarity computation and can serve as a baseline for future research in this domain!


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 55k+ ML SubReddit.

[AI Magazine/Report] Read Our Latest Report on ‘SMALL LANGUAGE MODELS

The post From Edges to Nodes: SEGMN’s Comprehensive Approach to Graph Similarity appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

图相似性计算 SEGMN 图神经网络 图嵌入 结构感知匹配
相关文章