Is Complex Query Answering Really Complex?

cs.AI updates on arXiv.org 07月04日 12:08

Is Complex Query Answering Really Complex?

文章指出现有CQA基准可能低估了实际复杂度，提出更具挑战性的基准，实证研究表明现有方法存在不足。

arXiv:2410.12537v3 Announce Type: replace-cross Abstract: Complex query answering (CQA) on knowledge graphs (KGs) is gaining momentum as a challenging reasoning task. In this paper, we show that the current benchmarks for CQA might not be as complex as we think, as the way they are built distorts our perception of progress in this field. For example, we find that in these benchmarks, most queries (up to 98% for some query types) can be reduced to simpler problems, e.g., link prediction, where only one link needs to be predicted. The performance of state-of-the-art CQA models decreases significantly when such models are evaluated on queries that cannot be reduced to easier types. Thus, we propose a set of more challenging benchmarks composed of queries that require models to reason over multiple hops and better reflect the construction of real-world KGs. In a systematic empirical investigation, the new benchmarks show that current methods leave much to be desired from current CQA methods.

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

复杂查询解答知识图谱基准测试模型评估实证研究

相关文章

Organizing Knowledge With Knowledge Graphs: Industry Trends

Graphs and Language

GraphRAG: Knowledge Graphs for AI Applications with Kirk Marple - #681

Cross-Device AI Acceleration, Compilation & Execution with Jeff Gehlhaar - #500

Building an Autonomous Knowledge Graph with Mike Tung - #319

Knowledge Graphs and Expert Augmentation with Marisa Boston - TWiML Talk #204

英國釋出AI模型安全評估平臺Inspect

TIGER-Lab Introduces MMLU-Pro Dataset for Comprehensive Benchmarking of Large Language Models’ Capabilities and Performance

Researchers at the University of Freiburg and Bosch AI Propose HW-GPT-Bench: A Hardware-Aware Language Model Surrogate Benchmark

Fujitsu Chosen For GENIAC Project To Enhance Reliability Of GenAI in Business Applications