The GitHub Blog, July 17, 05:08
GitHub Availability Report: June 2025

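GitHub experienced three incidents in June 2025 that degraded service performance. On June 5, the Actions service was degraded, causing run start delays and job failures and affecting Copilot and Pages. On June 12, GitHub Copilot was degraded because of an outage at a model provider, which affected chat completions. On June 17, an internal routing policy deployment caused network reachability issues that affected access to the UI and API. GitHub mitigated the incidents by correcting the service configuration, disabling the affected provider endpoints, and rolling back the deployment, and plans to improve its monitoring and validation processes.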

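🗓️ June 5 incident: The Actions service was degraded, causing run start delays and intermittent job failures. 47.2% of runs had delayed starts, averaging 14 minutes, and 21.0% of runs failed. 60% of Copilot Coding Agent sessions were cancelled, and Pages sites using branch-based builds failed to deploy.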

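💻 June 12 incident: The GitHub Copilot service was degraded. Gemini models were unavailable and Claude models had reduced availability. Users in VS Code, JetBrains IDEs, and GitHub Copilot Chat saw elevated error rates, slow responses, timeouts, and chat interruptions.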

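🌐 June 17 incident: An internal routing policy deployment caused network reachability issues. Authenticated users of the github.com UI saw 3-4% error rates, and authenticated callers of the API saw 40% error rates. In Actions, 2.5% of runs were delayed by an average of 8 minutes and 3% of runs failed.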

In June, we experienced three incidents that resulted in degraded performance across GitHub services.

June 5 17:47 UTC (lasting 1 hour and 33 minutes)

On June 5, 2025, between 17:47 UTC and 19:20 UTC, the Actions service was degraded, leading to run start delays and intermittent job failures. During this period, 47.2% of runs had delayed starts of 14 minutes on average, and 21.0% of runs failed. The impact extended beyond Actions itself: 60% of Copilot Coding Agent sessions were cancelled, and all Pages sites using branch-based builds failed to deploy (though Pages serving remained unaffected). A spike in load between internal Actions services exposed a misconfiguration that throttled requests in the critical path of run starts. We mitigated the incident by correcting the service configuration to prevent throttling and have updated our deployment process to ensure the correct configuration is preserved moving forward.
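The report does not say which throttling mechanism was involved. As a purely illustrative sketch, assuming a token-bucket limiter (the `TokenBucket` class, its parameters, and the request rates below are hypothetical), the following shows how a limit that is harmless under normal load can reject most requests once load spikes:

```python
from dataclasses import dataclass


@dataclass
class TokenBucket:
    """Hypothetical rate limiter; not GitHub's actual mechanism."""
    rate_per_sec: float  # tokens refilled per second
    capacity: float      # maximum burst size
    tokens: float = 0.0  # current token count
    last: float = 0.0    # timestamp of the previous request

    def __post_init__(self):
        # Start with a full bucket.
        self.tokens = self.capacity

    def allow(self, now: float) -> bool:
        # Refill based on elapsed time, capped at capacity.
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate_per_sec)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False  # request is throttled


def throttled_fraction(bucket: TokenBucket, requests_per_sec: int, seconds: int) -> float:
    """Simulate evenly spaced requests and return the fraction rejected."""
    total = requests_per_sec * seconds
    rejected = 0
    for i in range(total):
        if not bucket.allow(i / requests_per_sec):
            rejected += 1
    return rejected / total


# Assumed numbers: a limit of 100 requests/s, normal load 50/s, spike 400/s.
normal = throttled_fraction(TokenBucket(100, 100), requests_per_sec=50, seconds=10)
spike = throttled_fraction(TokenBucket(100, 100), requests_per_sec=400, seconds=10)
print(f"normal load rejected: {normal:.1%}, spike rejected: {spike:.1%}")
```

Under the assumed normal load nothing is rejected, so a limit set too low can go unnoticed until a spike arrives.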

June 12 17:55 UTC (lasting 3 hours and 12 minutes)

On June 12, 2025, between 17:55 UTC and 21:07 UTC, the GitHub Copilot service was degraded: Gemini models were unavailable and Claude models had reduced availability. Users experienced significantly elevated error rates for chat completions, slow response times, timeouts, and chat functionality interruptions across VS Code, JetBrains IDEs, and GitHub Copilot Chat. This was due to an outage affecting one of our model providers.

We mitigated the incident by temporarily disabling the affected provider endpoints to reduce user impact.
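The report does not describe how provider endpoints are disabled. A minimal sketch of the general idea, assuming a simple in-memory routing table (the `ModelRouter` class and the provider and model names are hypothetical), might look like this:

```python
class ModelRouter:
    """Hypothetical router that skips disabled provider endpoints."""

    def __init__(self, endpoints):
        # endpoints maps a model name to its provider endpoints in priority order.
        self._endpoints = {model: list(eps) for model, eps in endpoints.items()}
        self._disabled = set()

    def disable_provider(self, provider):
        self._disabled.add(provider)

    def enable_provider(self, provider):
        self._disabled.discard(provider)

    def route(self, model):
        """Return the first enabled endpoint for a model, or None if none is left."""
        for provider in self._endpoints.get(model, []):
            if provider not in self._disabled:
                return provider
        return None

    def available_models(self):
        """Models that can still be offered to users."""
        return [m for m in self._endpoints if self.route(m) is not None]


router = ModelRouter({
    "model-x": ["provider-a"],                # served only by the affected provider
    "model-y": ["provider-a", "provider-b"],  # has a second provider to fall back on
})
router.disable_provider("provider-a")
print(router.available_models())  # model-x drops out; model-y falls back to provider-b
```

In this sketch, a model with no remaining endpoint is hidden up front instead of being left to time out on each request, which is one way disabling endpoints could reduce user impact.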

We are working to update our incident response playbooks for infrastructure provider outages and improve our monitoring and alerting systems to reduce our time to detection and mitigation of issues like this one in the future.

June 17 19:32 UTC (lasting 31 minutes)

On June 17, 2025, between 19:32 UTC and 20:03 UTC, an internal routing policy deployment to a subset of network devices caused reachability issues for certain network address blocks within our datacenters. Authenticated users of the github.com UI experienced 3-4% error rates for the duration of the incident, authenticated callers of the API experienced 40% error rates, and unauthenticated requests to the UI and API experienced nearly 100% error rates. In Actions, 2.5% of runs were delayed by an average of 8 minutes and 3% of runs failed. Large File Storage (LFS) requests experienced 1% errors. At 19:54 UTC, the deployment was rolled back, and network availability for the affected systems was restored. At 20:03 UTC, we fully restored normal operations. To prevent similar issues, we are expanding our validation process for routing policy changes.
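The stated durations can be checked against the start and end times given for each incident. A small sketch using only the timestamps in this report (the helper names `incident_duration` and `describe` are illustrative):

```python
from datetime import datetime, timedelta

# Start and end times (UTC) as stated in the report.
INCIDENTS = {
    "June 5": ("2025-06-05 17:47", "2025-06-05 19:20"),
    "June 12": ("2025-06-12 17:55", "2025-06-12 21:07"),
    "June 17": ("2025-06-17 19:32", "2025-06-17 20:03"),
}


def incident_duration(start: str, end: str) -> timedelta:
    """Elapsed time between two 'YYYY-MM-DD HH:MM' timestamps."""
    fmt = "%Y-%m-%d %H:%M"
    return datetime.strptime(end, fmt) - datetime.strptime(start, fmt)


def describe(delta: timedelta) -> str:
    """Format a duration in the report's style, e.g. '1 hour and 33 minutes'."""
    hours, minutes = divmod(int(delta.total_seconds()) // 60, 60)
    parts = []
    if hours:
        parts.append(f"{hours} hour" + ("" if hours == 1 else "s"))
    if minutes:
        parts.append(f"{minutes} minute" + ("" if minutes == 1 else "s"))
    return " and ".join(parts) or "0 minutes"


for name, (start, end) in INCIDENTS.items():
    print(f"{name}: {describe(incident_duration(start, end))}")
```

The same helper gives the time from the start of the June 17 incident to the rollback at 19:54 UTC: `describe(incident_duration("2025-06-17 19:32", "2025-06-17 19:54"))`.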


Please follow our status page for real-time updates on status changes and post-incident recaps. To learn more about what we’re working on, check out the GitHub Engineering Blog.

The post GitHub Availability Report: June 2025 appeared first on The GitHub Blog.
