You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
SkyServe uses Uvicorn at the moment for the controller / load balancer servers. It has not shown up as bottlenecks as of yet due to the compute-heavy nature of GenAI models. Do you foresee for your use cases this may become a bottleneck?
We'd love to get community's help on this! Encrypting the requests/responses will be great to see in SkyServe.
Yes, it should be possible. There are two prototypes of this based on Nginx, from @cblmemo:
[Serve][Example] Authentication example #3251 user-side: user simply sets up the auth in YAML; during curl, pass curl --location-trusted .. -H "Authorization: Bearer static_secret_token" ..
Quick update: for 2, we have an ongoing PR #3380 and we plan to merge it soon ;) for 1, recently we are exploring the possibility to adopy envoy proxy as our load balancer. Stay tuned!
I've combed thru the docs to try and find an answer, without any luck.
If a rogue actor gets hold of the IP:Port of a running instance, the costs could go up significantly if there's no auth
I do not expect the project team to solve these, but of there are some directions given, I could take a shot at it.
The text was updated successfully, but these errors were encountered: