Learning in Strategic Queuing Systems with Small Buffers

cs.AI updates on arXiv.org 07月22日 12:34

../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

本文探讨了具有回合间结转效应的游戏中的学习成果，特别关注了网络路由器如何通过简单的学习算法来优化数据包传输。与现有模型不同，本文引入了服务器的小缓冲区，并消除了对时间戳和优先级的依赖，使得模型更贴近实际应用。通过理论分析和模拟，研究发现，即使服务器容量仅比集中协调情况下的需求略有增加，也能有效保证系统的稳定性，即使在服务器随机选择同时到达的数据包的情况下也是如此。这项研究对于理解和优化大规模排队系统具有重要意义。

📦 **实际应用中的路由器学习模型：** 文章研究了网络路由器如何通过学习算法优化数据包传输，并考虑了回合间结转效应，即当前回合的输出会影响未来游戏进程。这对于理解和设计大规模排队系统至关重要。

🚫 **改进现有模型：** 研究提出了对现有路由器学习模型的重要改进，引入了服务器的小缓冲区，允许服务器暂存单个数据包，并取消了对时间戳和旧数据包优先级的强制要求，使得模型更加现实。

📈 **系统稳定性与容量提升：** 通过理论分析和模拟，研究表明，与集中协调情况相比，即使服务器容量仅有小幅度的恒定因子提升，也足以在学习队列的路由器系统中保持稳定，即使服务器在同时到达的数据包中随机选择。

💡 **研究的意义：** 该研究为理解和优化具有学习机制和结转效应的系统提供了新的视角，尤其是在网络路由领域，通过更贴近实际的建模，为提高系统效率和稳定性提供了理论依据。

arXiv:2502.08898v2 Announce Type: replace-cross Abstract: We consider learning outcomes in games with carryover effects between rounds: when outcomes in the present round affect the game in the future. An important example of such systems is routers in networking, as they use simple learning algorithms to find the best way to deliver packets to their desired destination. This simple, myopic, and distributed decision process makes large queuing systems easy to operate, but at the same time, the system needs more capacity than would be required if all traffic were centrally coordinated. Gaitonde and Tardos (EC 2020 and JACM 2023) initiated the study of such systems, modeling them as an infinitely repeated game in which routers compete for servers and the system maintains a state (the number of packets held at each queue) that results from outcomes of previous rounds. However, their model assumes that servers have no buffers at all, so routers have to resend all packets that were not served successfully, which makes their system model unrealistic. They show that in their model, even with hugely increased server capacity relative to what is needed in the centrally coordinated case, ensuring that the system is stable requires the use of timestamps and priority for older packets. We consider a system with two important changes, which make the model more realistic and allow for much higher traffic rates: first, we add a very small buffer to each server, allowing the server to hold on to a single packet to be served later (if it fails to serve it immediately), and second, we do not require timestamps or priority to older packets. Using theoretical analysis and simulations, we show that when queues are learning, a small constant-factor increase in server capacity, compared to what would be needed if centrally coordinating, suffices to keep the system stable, even if servers select randomly among packets arriving simultaneously.

Fish AI Reader

FishAI

联系邮箱 441953276@qq.com

相关标签