Making it easier to verify an AI model’s responses

MIT News - Machine learning 2024年10月21日

../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

大型语言模型（LLM）在生成文本方面表现出色，但它们也存在“幻觉”问题，即生成不正确或无事实依据的信息。为了解决这个问题，MIT研究人员开发了SymGen，这是一个用户友好的系统，可以帮助人们快速验证LLM的响应。SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。

🤔 **SymGen：快速验证LLM响应的工具** SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。例如，如果LLM生成了一篇关于篮球比赛的总结，并且用户想要验证“波特兰开拓者队”这个短语是否准确，他们可以将鼠标悬停在该短语上，查看SymGen是否将该短语链接到数据源中的正确单元格。这种链接可以帮助用户快速确定模型是否从正确的来源获取了信息。

💡 **SymGen的工作原理** SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。

🚀 **SymGen的应用前景** SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。

🤔 **SymGen的局限性** SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。

🚀 **SymGen的未来发展方向** SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。 SymGen通过将LLM的响应与数据源的特定引用链接起来，使验证过程更加高效。用户只需将鼠标悬停在响应文本的特定部分，就可以查看模型用于生成该部分的原始数据，从而快速识别潜在的错误。

Despite their impressive capabilities, large language models are far from perfect. These artificial intelligence models sometimes “hallucinate” by generating incorrect or unsupported information in response to a query.

Due to this hallucination problem, an LLM’s responses are often verified by human fact-checkers, especially if a model is deployed in a high-stakes setting like health care or finance. However, validation processes typically require people to read through long documents cited by the model, a task so onerous and error-prone it may prevent some users from deploying generative AI models in the first place.

To help human validators, MIT researchers created a user-friendly system that enables people to verify an LLM’s responses much more quickly. With this tool, called SymGen, an LLM generates responses with citations that point directly to the place in a source document, such as a given cell in a database.

Users hover over highlighted portions of its text response to see data the model used to generate that specific word or phrase. At the same time, the unhighlighted portions show users which phrases need additional attention to check and verify.

“We give people the ability to selectively focus on parts of the text they need to be more worried about. In the end, SymGen can give people higher confidence in a model’s responses because they can easily take a closer look to ensure that the information is verified,” says Shannon Shen, an electrical engineering and computer science graduate student and co-lead author of a paper on SymGen.

Through a user study, Shen and his collaborators found that SymGen sped up verification time by about 20 percent, compared to manual procedures. By making it faster and easier for humans to validate model outputs, SymGen could help people identify errors in LLMs deployed in a variety of real-world situations, from generating clinical notes to summarizing financial market reports.

Shen is joined on the paper by co-lead author and fellow EECS graduate student Lucas Torroba Hennigen; EECS graduate student Aniruddha “Ani” Nrusimha; Bernhard Gapp, president of the Good Data Initiative; and senior authors David Sontag, a professor of EECS, a member of the MIT Jameel Clinic, and the leader of the Clinical Machine Learning Group of the Computer Science and Artificial Intelligence Laboratory (CSAIL); and Yoon Kim, an assistant professor of EECS and a member of CSAIL. The research was recently presented at the Conference on Language Modeling.

Symbolic references

To aid in validation, many LLMs are designed to generate citations, which point to external documents, along with their language-based responses so users can check them. However, these verification systems are usually designed as an afterthought, without considering the effort it takes for people to sift through numerous citations, Shen says.

“Generative AI is intended to reduce the user’s time to complete a task. If you need to spend hours reading through all these documents to verify the model is saying something reasonable, then it’s less helpful to have the generations in practice,” Shen says.

The researchers approached the validation problem from the perspective of the humans who will do the work.

A SymGen user first provides the LLM with data it can reference in its response, such as a table that contains statistics from a basketball game. Then, rather than immediately asking the model to complete a task, like generating a game summary from those data, the researchers perform an intermediate step. They prompt the model to generate its response in a symbolic form.

With this prompt, every time the model wants to cite words in its response, it must write the specific cell from the data table that contains the information it is referencing. For instance, if the model wants to cite the phrase “Portland Trailblazers” in its response, it would replace that text with the cell name in the data table that contains those words.

“Because we have this intermediate step that has the text in a symbolic format, we are able to have really fine-grained references. We can say, for every single span of text in the output, this is exactly where in the data it corresponds to,” Torroba Hennigen says.

SymGen then resolves each reference using a rule-based tool that copies the corresponding text from the data table into the model’s response.

“This way, we know it is a verbatim copy, so we know there will not be any errors in the part of the text that corresponds to the actual data variable,” Shen adds.

Streamlining validation

The model can create symbolic responses because of how it is trained. Large language models are fed reams of data from the internet, and some data are recorded in “placeholder format” where codes replace actual values.

When SymGen prompts the model to generate a symbolic response, it uses a similar structure.

“We design the prompt in a specific way to draw on the LLM’s capabilities,” Shen adds.

During a user study, the majority of participants said SymGen made it easier to verify LLM-generated text. They could validate the model’s responses about 20 percent faster than if they used standard methods.

However, SymGen is limited by the quality of the source data. The LLM could cite an incorrect variable, and a human verifier may be none-the-wiser.

In addition, the user must have source data in a structured format, like a table, to feed into SymGen. Right now, the system only works with tabular data.

Moving forward, the researchers are enhancing SymGen so it can handle arbitrary text and other forms of data. With that capability, it could help validate portions of AI-generated legal document summaries, for instance. They also plan to test SymGen with physicians to study how it could identify errors in AI-generated clinical summaries.

This work is funded, in part, by Liberty Mutual and the MIT Quest for Intelligence Initiative.

Fish AI Reader

FishAI

联系邮箱 441953276@qq.com

相关标签