Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] 多次连接错误后应该先标记节点不可用再进行健康检查 #1483

Open
2 tasks done
ayasechan opened this issue Sep 1, 2024 · 0 comments
Open
2 tasks done
Labels
enhancement New feature or request

Comments

@ayasechan
Copy link

ayasechan commented Sep 1, 2024

8### Verify steps

Description

目前的实现会在连接错误达到 max-failed-times 后直接进行健康检查
而没有将问题节点标记为不可用
如果问题节点一开始的延迟很低
会导致在健康检查结束之前新的连接依旧会被分配到问题节点
尤其是在节点比较多的时候
这个问题尤为明显
因为节点多全部健康检查花的时间也长
故障后要等很久才能恢复网络

log.Warnln("because %s failed multiple times, active health check", gb.Name())
gb.healthCheck()

Possible Solution

No response

@ayasechan ayasechan added the enhancement New feature or request label Sep 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant