MarkTechPost@AI 2024年07月06日
Qdrant Unveils BM42: A Cutting-Edge Pure Vector-Based Hybrid Search Algorithm Optimizing RAG and AI Applications
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Qdrant发布了BM42,这是一种旨在彻底改变混合搜索的新算法。BM42结合了BM25的优势和现代Transformer模型,为搜索应用程序提供显著升级,旨在弥合传统搜索算法与现代搜索技术之间的差距。

🤔 BM25长期以来一直是搜索引擎的标准算法,例如谷歌和雅虎,但随着向量搜索和RAG系统的出现,需要更先进的解决方案。BM25在传统的网页搜索环境中表现出色,但在RAG系统中,BM25基于文档统计的优势变得不那么有效。

🚀 BM42通过将BM25的核心原理与Transformer模型的能力相结合,解决了这些挑战。BM42的关键创新是使用Transformer的注意力矩阵来确定术语在文档中的重要性。Transformer生成各种输出,包括嵌入和注意力矩阵,突出了每个标记在输入序列中的重要性。通过利用对应于特殊[CLS]标记的注意力行,BM42可以准确地衡量每个标记在文档中的重要性,即使对于RAG应用程序中常见的较短文本也是如此。

💡 BM42与传统的BM25和SPLADE相比,具有显著优势。BM42保留了BM25的可解释性和简单性,同时克服了SPLADE的局限性,包括对大量计算资源的需求以及标记化和域依赖性问题。BM42可以快速执行文档和查询推断,使其适合实时应用程序。它还具有较小的内存占用,确保它可以在不显著增加资源需求的情况下处理大型数据集。BM42支持多种语言和领域,只要有合适的Transformer模型可用,使其非常通用。

💻 BM42可以无缝集成到Qdrant的向量搜索引擎中。该实现涉及为混合搜索设置一个包含BM42的集合,并使用来自jina.ai等模型的密集嵌入。这种组合允许采用平衡的方法,其中稀疏和密集嵌入相互补充以提高检索精度。Qdrant进行的基准测试表明,BM42在涉及短文本的场景中优于BM25,这在现代搜索应用程序中是一个常见的用例。

🤝 Qdrant发布BM42不仅引入了一种新的算法,还促进了社区参与和创新。该公司邀请开发人员和研究人员尝试使用BM42,分享他们的项目,并为其持续发展做出贡献。通过提供这个强大的工具,Qdrant旨在赋予其社区力量,推动搜索技术的可能性。

Qdrant, a leading provider of vector search technology, has introduced BM42, a new algorithm designed to revolutionize hybrid search. For the past four decades, BM25 has been the standard algorithm used by search engines, from Google to Yahoo. However, the advent of vector search and the introduction of Retrieval-Augmented Generation (RAG) have highlighted the need for a more advanced solution. BM42 aims to bridge this gap by combining the strengths of BM25 with modern transformer models, offering a significant upgrade for search applications.

The Legacy of BM25

BM25 has remained relevant for a long time due to its simple yet effective formula, which calculates the relevance of documents based on term frequency and inverse document frequency (IDF). This method excels in traditional web search environments where document length and query structures are consistent. However, the landscape of text retrieval has shifted dramatically with the rise of RAG systems, which require handling shorter, more varied documents and queries. BM25’s reliance on document statistics, such as term frequency and document length, becomes less effective in these scenarios.

The Introduction of BM42

BM42 addresses these challenges by integrating the core principles of BM25 with the capabilities of transformer models. The key innovation in BM42 is using attention matrices from transformers to determine the importance of the term within documents. Transformers generate a range of outputs, including embeddings and attention matrices, highlighting each token’s significance in the input sequence. By leveraging the attention row corresponding to the special [CLS] token, BM42 can accurately gauge the importance of each token in a document, even for shorter texts typical in RAG applications.

Advantages of BM42

BM42 offers several advantages over BM25 and SPLADE, another modern alternative that uses transformers to create sparse embeddings. While SPLADE has shown superior performance in academic benchmarks, it needs to improve its performance, including the need for extensive computational resources and issues with tokenization and domain dependency. BM42, on the other hand, retains the interpretability and simplicity of BM25 while overcoming SPLADE’s limitations.

One of BM42’s primary benefits is its efficiency. The algorithm can perform document and query inferences quickly, making it suitable for real-time applications. It also has a low memory footprint, ensuring it can handle large datasets without significant resource demands. BM42 supports multiple languages and domains, provided a suitable transformer model is available, making it highly versatile.

Practical Implementation

BM42 can be seamlessly integrated into Qdrant’s vector search engine. The implementation involves setting up a collection for hybrid search with BM42 and using dense embeddings from models like jina.ai. This combination allows for a balanced approach, where sparse and dense embeddings complement each other to enhance retrieval accuracy. Benchmarks conducted by Qdrant demonstrate that BM42 outperforms BM25 in scenarios involving short texts, a common use case in modern search applications.

Encouraging Community Engagement

Qdrant’s release of BM42 introduces a new algorithm and fosters community engagement and innovation. The company invites developers and researchers to experiment with BM42, share their projects, and contribute to its ongoing development. By providing this powerful tool, Qdrant aims to empower its community to push the boundaries of what is possible in search technology.

Conclusion

The release of BM42 by Qdrant marks a significant milestone in the evolution of search algorithms. By combining the robustness of BM25 with the intelligence of transformers, BM42 sets a new standard for hybrid search. It addresses the limitations of earlier methods and modern alternatives, offering a versatile, efficient, and highly accurate solution for today’s search applications.

The post Qdrant Unveils BM42: A Cutting-Edge Pure Vector-Based Hybrid Search Algorithm Optimizing RAG and AI Applications appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

BM42 向量搜索 混合搜索 RAG Qdrant
相关文章