MarkTechPost@AI 前天 04:57
This AI Paper Introduces TableRAG: A Hybrid SQL and Text Retrieval Framework for Multi-Hop Question Answering over Heterogeneous Documents
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

处理包含文本和表格的混合格式文档已成为构建更智能AI系统的关键任务。当前语言模型在处理表格时往往难以准确理解,因为表格被扁平化为纯文本会丢失行列关系,影响答案准确性。为解决此问题,华为云BU提出TableRAG框架,采用混合系统设计,交替进行文本数据检索和基于SQL的结构化执行,同时保留表格布局。该系统通过离线阶段将异构文档解析为结构化数据库,在线阶段采用四步迭代处理用户问题:查询分解、文本检索、SQL编程执行和中间答案生成。实验表明,TableRAG在多个基准测试中显著优于NaiveRAG、React和TableGPT2等基线方法,特别是在处理复杂跨文档推理任务时表现出色。

📊 TableRAG采用混合系统设计,交替进行文本数据检索和基于SQL的结构化执行,同时保留表格布局,有效解决了现有语言模型在处理表格时丢失行列关系的问题。

🔍 系统通过离线阶段将异构文档解析为结构化数据库,将表格存储在关系数据库中,文本内容存储在分块的知识库中,在线阶段采用四步迭代处理用户问题:查询分解、文本检索、SQL编程执行和中间答案生成。

📈 TableRAG在多个基准测试中显著优于基线方法,包括HybridQA、WikiTableQuestions和HeteQA。HeteQA包含9个不同领域的304个复杂问题,涵盖136个表格和5300个实体,验证了系统在过滤、聚合、分组、计算和排序等任务中的强大能力。

🔧 系统使用SQL进行精确的符号执行,特别擅长数值和逻辑计算,通过至多5次迭代进行文档级推理,并使用Claude-3.5-Sonnet和Qwen-2.5-72B等大型模型验证结果,确保了答案的准确性和可解释性。

Handling questions that involve both natural language and structured tables has become an essential task in building more intelligent and useful AI systems. These systems are often expected to process content that includes diverse data types, such as text mixed with numerical tables, which are commonly found in business documents, research papers, and public reports. Understanding such documents requires the AI to perform reasoning that spans both textual explanations and table-based details—a process that is inherently more complicated than traditional text-based question answering.

One of the major problems in this area is that current language models often fail to interpret documents accurately when tables are involved. Models tend to lose the relationships between rows and columns when the tables are flattened into plain text. This distorts the underlying structure of the data and reduces the accuracy of answers, especially when the task involves computations, aggregations, or reasoning that connects multiple facts across the document. Such limitations make it challenging to utilize standard systems for practical multi-hop question-answering tasks that require insights from both text and tables.

To solve these problems, previous methods have attempted to apply Retrieval-Augmented Generation (RAG) techniques. These involve retrieving text segments and feeding them into a language model for answer generation. However, these techniques are insufficient for tasks that require compositional or global reasoning across large tabular datasets. Tools like NaiveRAG and TableGPT2 try to simulate this process by converting tables into Markdown format or generating code-based execution in Python. Yet, these methods still struggle with tasks where maintaining the table’s original structure is necessary for correct interpretation.

Researchers from Huawei Cloud BU proposed a method named TableRAG that directly addresses these limitations. Research introduced TableRAG as a hybrid system that alternates between textual data retrieval and structured SQL-based execution. This approach preserves the tabular layout and treats table-based queries as a unified reasoning unit. This new system not only preserves the table structure but also executes queries in a manner that respects the relational nature of data, organized in rows and columns. The researchers also created a dataset called HeteQA to benchmark the performance of their method across different domains and multi-step reasoning tasks.

TableRAG functions in two main stages. The offline stage involves parsing heterogeneous documents into structured databases by extracting tables and textual content separately. These are stored in parallel corpora—a relational database for tables and a chunked knowledge base for text. The online phase handles user questions through an iterative four-step process: query decomposition, text retrieval, SQL programming and execution, and intermediate answer generation. When a question is received, the system identifies whether it requires tabular or textual reasoning, dynamically chooses the appropriate strategy, and combines the outputs. SQL is used for precise symbolic execution, enabling better performance in numerical and logical computations.

During experiments, TableRAG was tested on several benchmarks, including HybridQA, WikiTableQuestions, and the newly constructed HeteQA. HeteQA consists of 304 complex questions across nine diverse domains and includes 136 unique tables, as well as over 5,300 Wikipedia-derived entities. The dataset challenges models with tasks like filtering, aggregation, grouping, calculation, and sorting. TableRAG outperformed all baseline methods, including NaiveRAG, React, and TableGPT2. It achieved consistently higher accuracy, with document-level reasoning powered by up to 5 iterative steps, and utilized models such as Claude-3.5-Sonnet and Qwen-2.5-72B to verify the results.

The work presented a strong and well-structured solution to the challenge of reasoning over mixed-format documents. By maintaining structural integrity and adopting SQL for structured data operations, the researchers demonstrated an effective alternative to existing retrieval-based systems. TableRAG represents a significant step forward in question-answering systems that handle documents containing both tables and text, offering a viable method for more accurate, scalable, and interpretable document understanding.


Check out the Paper and GitHub Page. All credit for this research goes to the researchers of this project. Ready to connect with 1 Million+ AI Devs/Engineers/Researchers? See how NVIDIA, LG AI Research, and top AI companies leverage MarkTechPost to reach their target audience [Learn More]

The post This AI Paper Introduces TableRAG: A Hybrid SQL and Text Retrieval Framework for Multi-Hop Question Answering over Heterogeneous Documents appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

TableRAG 自然语言处理 混合文档推理 SQL检索 多跳问答
相关文章