MarkTechPost@AI 2024年10月11日
Apple Researchers Propose BayesCNS: A Unified Bayesian Approach Tackling Cold Start and Non-Stationarity in Large-Scale Search Systems
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

苹果的研究人员提出了一种名为BayesCNS的统一贝叶斯方法,旨在解决大规模搜索系统中的冷启动和非平稳性问题。该方法将问题转化为一个贝叶斯在线学习问题,利用经验贝叶斯框架,基于上下文特征学习用户-物品交互的表达先验分布。该方法与排序模型相结合,提供排序模型引导的在线学习,根据上下文信息有效地探索相关物品。在全面离线和在线实验,包括A/B测试中,BayesCNS的有效性得到了证明,与基线相比,整体新物品交互提高了10.60%,整体成功率提高了1.05%。

🍎 **BayesCNS的原理**:BayesCNS将冷启动和非平稳性问题转化为一个贝叶斯在线学习问题,通过学习用户-物品交互的表达先验分布来解决冷启动问题。该方法利用经验贝叶斯框架,基于上下文特征学习先验分布,并通过排序模型引导的在线学习,根据上下文信息有效地探索相关物品。

🍎 **BayesCNS的优势**:与传统方法相比,BayesCNS能够有效地解决冷启动和非平稳性问题,并具有以下优势:1. 能够根据上下文特征学习用户-物品交互的表达先验分布,从而有效地解决冷启动问题;2. 能够利用排序模型引导的在线学习,根据上下文信息有效地探索相关物品,从而提高推荐系统的效率和准确性。

🍎 **BayesCNS的评估**:研究人员对BayesCNS进行了全面离线和在线实验,包括A/B测试,结果表明BayesCNS在解决冷启动和非平稳性问题方面取得了显著效果。与基线相比,整体新物品交互提高了10.60%,整体成功率提高了1.05%。

Information Retrieval (IR) systems for search and recommendations often utilize Learning-to-Rank (LTR) solutions to prioritize relevant items for user queries. These models heavily depend on user interaction features, such as clicks and engagement data, which are highly effective for ranking. However, this reliance presents significant challenges. User Interaction data can be noisy and sparse, especially for newer or less popular items, resulting in cold start problems where these items are ranked poorly and receive no attention. Exploring item recommendations may address cold start issues, but negatively impacts key business metrics and user trust.

Existing methods to address cold start in recommendation systems depend on heuristics to boost item rankings or use additional information to compensate for the lack of interaction data. Next, non-stationary distribution shifts are managed through periodic model retraining, which is costly and unstable due to varying data quality. Last is the Bayesian modeling that offers a principled approach to handle the dynamic nature of user interaction features, allowing for real-time updates as new data is observed. However, Bayesian methods are computationally intensive, as exact estimation of the posterior distribution is intractable. Also, recent advancements in variational inference using neural networks to simultaneously address cold start and non-stationarity in recommendation systems at scale remain unexplored.

To this end, researchers from Apple have proposed BayesCNS, a unified Bayesian approach that holistically addresses cold start and non-stationarity challenges in search systems at scale. The method is formulated as a Bayesian online learning problem, utilizing an empirical Bayesian framework to learn expressive prior distributions of user-item interactions based on contextual features. The approach interfaces with a ranker model, providing ranker-guided online learning to explore relevant items based on contextual information efficiently. The efficacy of BayesCNS on comprehensive offline and online experiments, including an A/B test shows a 10.60% improvement in overall new item interactions and a 1.05% increase in overall success rate compared to the baseline.

BayesCNS utilizes a Thompson sampling algorithm for online learning under non-stationarity, allowing continuous updates of previous estimates and learning from new data to maximize cumulative reward. BayesCNS is evaluated on three diverse benchmark datasets addressing cold start in recommender systems: CiteULike, LastFM, and XING. These datasets cover user preferences for scientific articles, music artists, and job recommendations, respectively. For comparison, five state-of-the-art cold start recommendation algorithms are KNN, LinMap, NLinMap, DropoutNet, and Heater. These algorithms use different techniques such as nearest neighbor algorithms, linear transformations, deep neural networks, dropout methods, and a mixture of experts to generate recommendations and solve cold-start issues.

The performance of BayesCNS is evaluated using metrics such as Recall@k, Precision@k, and NDCG@k for k values of 20, 50, and 100. Results show that BayesCNS performed competitively compared to other state-of-the-art methods across all datasets. An online A/B test introduces millions of new items, comprising 22.81% of the original item index size. The test ran for one month, comparing BayesCNS with a baseline that introduced new items without considering cold start and non-stationary effects. BayesCNS consistently outperformed the baseline, showing statistically significant improvements in success rate and new item surface rate across most cohorts.

In conclusion, researchers from Apple have introduced BayesCNS, a Bayesian online learning approach, that effectively addresses cold start and non-stationarity challenges in large-scale search systems. This method predicts prior user-item interaction distributions using contextual item features, utilizing a novel deep neural network parameterization to learn expressive priors while enabling efficient posterior updates. The efficacy of BayesCNS has been demonstrated through comprehensive evaluation showing significant improvements in critical metrics such as click-through rates, new item impression rates, and overall user success metrics. These findings use the potential of BayesCNS to enhance the performance of search and recommendation systems in dynamic, real-world environments.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 50k+ ML SubReddit

[Upcoming Event- Oct 17 202] RetrieveX – The GenAI Data Retrieval Conference (Promoted)

The post Apple Researchers Propose BayesCNS: A Unified Bayesian Approach Tackling Cold Start and Non-Stationarity in Large-Scale Search Systems appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

BayesCNS 冷启动 非平稳性 搜索系统 贝叶斯在线学习
相关文章