MarkTechPost@AI 01月02日
Meta AI Proposes LIGER: A Novel AI Method that Synergistically Combines the Strengths of Dense and Generative Retrieval to Significantly Enhance the Performance of Generative Retrieval
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

LIGER是一种新型混合检索模型,它巧妙地融合了密集检索和生成检索的优势,旨在提升推荐系统的性能。传统密集检索方法虽然效果好,但计算和存储成本高昂,而生成检索虽然存储需求低,但在处理冷启动项目时性能较差。LIGER通过先使用生成模型快速生成候选集,再利用密集检索技术进行精确筛选,从而在效率和准确性之间取得平衡。该模型利用语义ID和文本属性表示项目,有效降低了计算和存储开销,并显著改善了冷启动项目的推荐效果。LIGER的创新架构为现代推荐系统提供了一个更高效、更具扩展性的解决方案。

💡LIGER模型巧妙结合了密集检索和生成检索的优点,通过混合方式提升推荐系统的整体性能。

🔍 LIGER模型使用双向Transformer编码器和生成解码器,其中密集检索部分融合了项目文本表示、语义ID和位置嵌入,并通过余弦相似性损失进行优化;生成检索部分则利用波束搜索预测用户交互历史后的项目语义ID。

🚀LIGER模型在多个基准数据集上,如Amazon Beauty, Sports, Toys和Steam,都超越了现有模型,尤其是在冷启动项目的推荐效果上表现突出,如在Amazon Beauty数据集上,冷启动项目的Recall@10得分达到了0.1008,远超其他模型。

📝LIGER模型通过结合文本表示,在处理未见过的项目时也表现良好,从而解决了传统生成模型的一个关键限制。这种设计使得LIGER在各种推荐场景中都具有很强的适应性和效率。

Recommendation systems are essential for connecting users with relevant content, products, or services. Dense retrieval methods have been a mainstay in this field, utilizing sequence modeling to compute item and user representations. However, these methods demand substantial computational resources and storage, as they require embeddings for every item. As datasets grow, these requirements become increasingly burdensome, limiting their scalability. Generative retrieval, an emerging alternative, reduces storage needs by predicting item indices through generative models. Despite its potential, it struggles with performance issues, especially in handling cold-start items—new items with limited user interactions. The absence of a unified framework combining the strengths of these approaches highlights a gap in addressing trade-offs between computation, storage, and recommendation quality.

Researchers from the University of Wisconsin, Madison, ELLIS Unit, LIT AI Lab, Institute for Machine Learning, JKU Linz, Austria, and Meta AI have introduced LIGER (LeveragIng dense retrieval for GEnerative Retrieval), a hybrid retrieval model that blends the computational efficiency of generative retrieval with the precision of dense retrieval. LIGER refines a candidate set generated by generative retrieval through dense retrieval techniques, achieving a balance between efficiency and accuracy. The model leverages item representations derived from semantic IDs and text-based attributes, combining the strengths of both paradigms. By doing so, LIGER reduces storage and computational overhead while addressing performance gaps, particularly in scenarios involving cold-start items.

Technical Details and Benefits

LIGER employs a bidirectional Transformer encoder alongside a generative decoder. The dense retrieval component integrates item text representations, semantic IDs, and positional embeddings, optimized using a cosine similarity loss. The generative component uses beam search to predict semantic IDs of subsequent items based on user interaction history. This combination allows LIGER to retain generative retrieval’s efficiency while addressing its limitations with cold-start items. The model’s hybrid inference process, which first retrieves a candidate set via generative retrieval and then refines it through dense retrieval, effectively reduces computational demands while maintaining recommendation quality. Additionally, by incorporating textual representations, LIGER generalizes well to unseen items, addressing a key limitation of prior generative models.

Results and Insights

Evaluations of LIGER across benchmark datasets, including Amazon Beauty, Sports, Toys, and Steam, show consistent improvements over state-of-the-art models like TIGER and UniSRec. For example, LIGER achieved a Recall@10 score of 0.1008 for cold-start items on the Amazon Beauty dataset, compared to TIGER’s 0.0. On the Steam dataset, LIGER’s Recall@10 for cold-start items reached 0.0147, again outperforming TIGER’s 0.0. These findings demonstrate LIGER’s ability to merge generative and dense retrieval techniques effectively. Moreover, as the number of candidates retrieved by generative methods increases, LIGER narrows the performance gap with dense retrieval. This adaptability and efficiency make it suitable for diverse recommendation scenarios.

Conclusion

LIGER offers a thoughtful integration of dense and generative retrieval, addressing challenges in efficiency, scalability, and handling cold-start items. Its hybrid architecture balances computational efficiency with high-quality recommendations, making it a viable solution for modern recommendation systems. By bridging gaps in existing approaches, LIGER lays the groundwork for further exploration into hybrid retrieval models, fostering innovation in recommendation systems.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 60k+ ML SubReddit.

FREE UPCOMING AI WEBINAR (JAN 15, 2025): Boost LLM Accuracy with Synthetic Data and Evaluation IntelligenceJoin this webinar to gain actionable insights into boosting LLM model performance and accuracy while safeguarding data privacy.

The post Meta AI Proposes LIGER: A Novel AI Method that Synergistically Combines the Strengths of Dense and Generative Retrieval to Significantly Enhance the Performance of Generative Retrieval appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

LIGER 推荐系统 混合检索 密集检索 生成检索
相关文章