MarkTechPost@AI 2024年10月18日
Jina AI Released g.jina.ai: A Powerful API for Strengthening Human Written Content with Grounded, Fact-Based Information from Real-Time Searches
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Jina AI 发布了最新的产品 g.jina.ai,旨在解决生成式 AI 模型中日益严重的信息误导和幻觉问题。该创新工具是其更广泛的应用程序套件的一部分,旨在提高 AI 生成和人工编写内容的真实性和可靠性。g.jina.ai 专注于大型语言模型(LLM),集成实时网络搜索结果以确保陈述以经过验证的事实信息为基础。

😄 **实时网络搜索接地:** g.jina.ai 使用实时网络搜索来查找与陈述相关的相关信息。结果包括 URL 和支持或反驳原始陈述的关键引文。

🤔 **真实性评分:** 在分析陈述后,系统会提供 0 到 1 之间的真实性评分,该评分根据收集到的参考资料估计陈述的准确性。

📚 **详细参考资料:** API 为每个陈述返回多达 30 个参考资料,大多数情况下至少有 10 个。这些参考资料包括 URL 和支持或反驳陈述的直接引文。

💰 **成本和可访问性:** Jina AI 提供其 API 的免费试用版,包含 100 万个令牌,使开发人员和组织能够测试该工具。每个接地请求的成本约为 0.006 美元,对于大规模事实核查来说是一种经济高效的解决方案。

🚀 **性能基准:** Jina AI 对 g.jina.ai 进行了性能基准测试,将其与 Gemini Pro 和 GPT-4 等其他接地模型进行了比较。结果令人印象深刻,g.jina.ai 的 F1 得分为 0.92,优于竞争对手。该基准测试涉及对 100 个具有已知真实值的陈述测试 API,证明了其在事实核查中的准确性和可靠性。

⚠️ **g.jina.ai 的局限性:** 尽管功能强大,但 g.jina.ai 也并非没有局限性。

⚠️ **高延迟和令牌消耗:** 每个接地请求可能需要长达 30 秒才能完成,并且会消耗大量令牌。这可能会限制其在没有仔细资源管理的情况下在高需求环境中的使用。

⚠️ **适用性约束:** 并非所有陈述都适合接地。无法有效地对个人意见、未来事件或假设情况进行事实核查。

⚠️ **对网络数据质量的依赖性:** 接地过程的准确性取决于在网络搜索过程中检索到的来源的质量。低质量或有偏见的来源会对结果产生负面影响。

Jina AI announced the release of their latest product, g.jina.ai, designed to tackle the growing problem of misinformation and hallucination in generative AI models. This innovative tool is part of their larger suite of applications to improve factual accuracy and grounding in AI-generated and human-written content. Focusing on Large Language Models (LLMs), g.jina.ai integrates real-time web search results to ensure that statements are grounded in verified, factual information.

The Importance of Grounding in AI

Grounding ensures that an AI model’s statements generated or assessed are based on factual and accurate data. This is especially critical for LLMs, which are often trained on massive datasets but may need access to the most recent or domain-specific information. Without grounding, LLMs can be prone to what is known as “hallucination,” a phenomenon where the model generates convincing but incorrect or fabricated information.

For instance, the training data for many models may have a knowledge cutoff, meaning that they need to be made aware of events or information that came after their training period. In this scenario, grounding becomes essential. Tools like g.jina.ai help bridge this gap by introducing real-time web searches that validate the information provided by AI models or even human-written content.

g.jina.ai by Jina AI

The g.jina.ai API was developed to provide a robust fact-checking and grounding mechanism by employing real-time web searches. It takes a given statement, grounds it using search results from reliable sources, and provides a factuality score and the exact references supporting or challenging the statement. This approach ensures that the results are transparent, & users can verify the source of the information themselves. The g.jina.ai API returned multiple references to validate the statement, each sourced from trusted platforms like Arxiv and Hugging Face, complete with supporting quotes.

Key Features of g.jina.ai

    Real-time Web Search Grounding: The tool uses real-time web search to find relevant information related to the statement. The results include URLs and key quotes supporting or contradicting the original statement.Factuality Score: After analyzing the statement, the system provides a factuality score between 0 and 1, which estimates how accurate the statement is based on the references collected.Detailed References: The API returns up to 30 references for each statement, with a minimum of 10 in most cases. These references include URLs and direct quotes that are either supportive or contradictory.Cost and Accessibility: Jina AI offers free trials of their API with 1 million tokens, making it accessible for developers and organizations to test the tool. Each grounding request costs approximately $0.006, making it a cost-effective solution for large-scale fact-checking.

Step-by-Step Explanation of g.jina.ai

To understand how g.jina.ai functions, here is a detailed step-by-step process of how it grounds statements:

    Input Statement: The user provides a statement that needs to be fact-checked, such as “The latest model released by Jina AI is jina-embeddings-v3.” No additional fact-checking instructions are necessary at this stage.Generate Search Queries: The system uses an LLM to generate relevant search queries. These queries cover all aspects of the input statement to ensure a thorough search.Call s.jina.ai for Web Search: For each query, g.jina.ai initiates a web search using s.jina.ai, which gathers relevant documents and web pages. The tool also uses r.jina.ai to extract content from these sources.Extract Key References: Once the search results are collected, an LLM extracts the key references from each document. Each reference includes:
      URL: The web address of the source.Key Quote: A direct quote from the document that supports or contradicts the statement.Supportive Status: A Boolean indicator that shows whether the reference supports or refutes the statement.
    Aggregate and Trim References: All collected references are aggregated into a single list. If there are more than 30 references, the system trims them down to a manageable size by selecting 30 random references.Evaluate the Statement: The system evaluates the statement using the gathered references. This evaluation includes the factuality score, a Boolean result indicating whether the statement is true or false, and detailed reasoning that cites supporting or contradicting references.Output the Result: Finally, the system outputs the results, including the factuality score, detailed reasoning, and the list of references. This output allows users to see exactly how the statement was evaluated and to verify the sources themselves.

Performance Benchmark

Jina AI conducted a performance benchmark of g.jina.ai, comparing it against other grounding models such as Gemini Pro and GPT-4. The results were impressive, with g.jina.ai achieving an F1 score of 0.92, outperforming competitors. This benchmark involved testing the API against 100 statements with known truth values, demonstrating its accuracy and reliability in fact-checking.

Limitations of g.jina.ai

Despite its impressive capabilities, g.jina.ai is not without limitations:

Conclusion

The release of g.jina.ai, offering a real-time, transparent fact-checking tool, provides a valuable resource for developers, researchers, and organizations looking to ensure the accuracy and credibility of their content. Despite some limitations, the tool’s overall utility and performance make it a promising addition to the AI toolkit. Also, Jina AI plans to expand the capabilities of g.jina.ai, integrating private data sources and enhancing multi-hop reasoning for deeper evaluations.


Check out the Details here. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 50k+ ML SubReddit.

[Upcoming Live Webinar- Oct 29, 2024] The Best Platform for Serving Fine-Tuned Models: Predibase Inference Engine (Promoted)

The post Jina AI Released g.jina.ai: A Powerful API for Strengthening Human Written Content with Grounded, Fact-Based Information from Real-Time Searches appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Jina AI g.jina.ai 事实核查 AI 模型 大型语言模型
相关文章