This repository has been archived by the owner on Oct 11, 2023. It is now read-only.

Dev Spaces Service Health Dashboard #342

Closed
moewill opened this issue May 19, 2020 · 22 comments

@moewill

moewill commented May 19, 2020

Describe the bug
Running the command `azds space list` fails with the error message: The operation was canceled.

To Reproduce
az aks get-credentials -g <rg-name> --name <cluster-name>
azds controller list
azds controller select -g <rg-name> -n <controller-name>
azds space list -v

Expected behavior
A list of the existing Dev Spaces is returned with the appropriate column headers.

Logs
2020-05-19T16:15:22.2951820Z | CLI | TRACE | Event: Command.Start <json>{"properties":{"arguments":"space list -v"},"metrics":null}</json>\nOperation context: <json>{"clientRequestId":null,"correlationRequestId":null,"requestId":null,"userSubscriptionId":"<subscription-id>","startTime":"2020-05-19T16:15:21.9678980Z","userAgent":"CLI/1.0.20200507.16","requestHttpMethod":null,"requestUri":null,"apiVersion":"3.2","version":"1.0.20200507.16","requestHeaders":{},"loggingProperties":{"applicationName":"CLI","deviceOperatingSystem":"Darwin 19.4.0 Darwin Kernel Version 19.4.0: Wed Mar 4 22:28:40 PST 2020; root:xnu-6153.101.6~15/RELEASE_X86_64","framework":".NET Core 4.6.26515.07","macAddressHash":"<mac-hash>","commandId":"64a842ba-5f71-4140-af80-32ea9b2ecef8","processId":10055,"targetEnvironment":"Production"}}</json>

Environment Details
Client used: CLI (`1.0.20200507.16`, `API v3.2`)

Client's version: `API v3.2`

Operating System: `Mac OS`

I was also informed that the product team is reviewing the issue. Insight into when the service is down, or into active tickets, would be good to have so that development teams are informed and can adjust if needed.

@amsoedal
Collaborator

Hi @moewill, does this reproduce consistently? Usually "the operation was cancelled" indicates a transient issue, but I'm curious to know if you're seeing it every time.

@moewill
Author

moewill commented May 19, 2020

Yes @amsoedal, unfortunately, I am seeing it every time...

@amsoedal
Collaborator

@moewill would you be able to attach more of your logs? A request id or a subscription id would be helpful for us to investigate on our side. Thank you!

@moewill
Author

moewill commented May 19, 2020

Thank you @amsoedal

The request IDs are all null:

2020-05-19T16:15:22.2951820Z | CLI | TRACE | Event: Command.Start <json>{"properties":{"arguments":"space list -v"},"metrics":null}</json>\nOperation context: <json>{"clientRequestId":null,"correlationRequestId":null,"requestId":null,

Can I PM you the subscription ID?
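
For anyone who wants to check a longer log for the same symptom, here is a minimal Python sketch that pulls the `<json>` payloads out of a trace line and reports which ID fields are null. The sample is a shortened copy of the trace line quoted above, closed off so that it is valid JSON. The helper names (`extract_json_payloads`, `missing_ids`) are made up for illustration, and treating the last payload as the operation context is an assumption based on this one line.

```python
import json
import re

# Shortened copy of the trace line from this thread; the operation context is
# cut down to the ID fields and closed off so that it is valid JSON.
TRACE_LINE = (
    '2020-05-19T16:15:22.2951820Z | CLI | TRACE | Event: Command.Start '
    '<json>{"properties":{"arguments":"space list -v"},"metrics":null}</json>'
    '\\nOperation context: '
    '<json>{"clientRequestId":null,"correlationRequestId":null,"requestId":null}</json>'
)

ID_FIELDS = ("clientRequestId", "correlationRequestId", "requestId")


def extract_json_payloads(line):
    """Return every <json>...</json> payload in a trace line as a parsed object."""
    return [json.loads(raw) for raw in re.findall(r"<json>(.*?)</json>", line)]


def missing_ids(line):
    """List the ID fields that are null or absent in the operation context."""
    payloads = extract_json_payloads(line)
    # Assumption: the operation context is the last payload on the line.
    context = payloads[-1] if payloads else {}
    return [field for field in ID_FIELDS if context.get(field) is None]


if __name__ == "__main__":
    print(missing_ids(TRACE_LINE))
```

If every ID comes back as missing, as it does here, the request probably never reached the service, so a subscription ID is the more useful thing to share.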

@amsoedal
Collaborator

Yes! That would be great. You can email me at ansoedal@microsoft.com

@moewill
Author

moewill commented May 19, 2020

Awesome - just sent 👍

@amsoedal
Collaborator

Thanks @moewill, investigating now. Will follow up on email.

@moewill
Author

moewill commented May 19, 2020 via email

@keikumata

@moewill This seems to be an issue pertaining to the API server authorized IP ranges feature. Please take a look at https://github.com/Azure/dev-spaces/tree/master/public-ips to allow additional ranges based on your region when using AKS with Dev Spaces.

This is documented here: https://docs.microsoft.com/en-us/azure/aks/api-server-authorized-ip-ranges#create-an-aks-cluster-with-api-server-authorized-ip-ranges-enabled

@moewill
Author

moewill commented May 20, 2020 via email

@amsoedal
Collaborator

Hi @moewill, after investigating some more, my colleague figured out that this is connected to an ongoing infrastructure issue we're facing related to our whitelisted IPs. We are currently working on a release to get this fixed. We will keep this thread updated, but the fix should be out by EOD today or tomorrow, after which your cluster should be in a better state again.

@moewill
Author

moewill commented May 20, 2020 via email

@amsoedal
Collaborator

Hi @moewill, unfortunately we don't currently have a way of notifying users about outages at this level of granularity. We're moving towards a local experience which won't require controllers and will thus have less potential for outages (you can read more here: #341). This new feature will provide similar capabilities to azds (debugging your code, routing). Hope this helps.

@moewill
Author

moewill commented May 21, 2020

Thank you @amsoedal

With that new feature, what will the future of the current Dev Spaces be? Will Dev Spaces and the azds tooling be deprecated in favor of the new workflow, or will both be available side by side?

This would be helpful to know, because adopting Dev Spaces and developing workflows and automation now will prove painful if we later need to migrate to a completely new solution/implementation.

@amsoedal
Collaborator

Our future investments currently revolve around the new workflow, but we'd love to hear your feedback on your current usage of Dev Spaces and what capabilities are most useful for you! Would you be open to scheduling a short (30 min) talk with us?

@moewill
Author

moewill commented May 28, 2020

Thanks for the offer @amsoedal - we will likely plan to do that.

I also wanted to check on the status of the whitelisting issue?

@amsoedal
Collaborator

Hi again @moewill! The whitelisting issue should be fixed now, but I believe our on-call is currently dealing with a separate issue. Are you running into problems?

@moewill
Author

moewill commented May 28, 2020 via email

@amsoedal
Collaborator

amsoedal commented Jun 1, 2020

Hi @moewill, the issue is resolved. Please let us know if you encounter any issues!

@moewill
Author

moewill commented Jun 1, 2020

Thank you @amsoedal.

Can you point me to the correct list of IP addresses that should be whitelisted for both the bare AKS service and also for the Dev Spaces feature?

@amsoedal
Collaborator

amsoedal commented Jun 1, 2020

For Dev Spaces:
[
"13.69.71.144/28",
"13.70.78.176/28",
"13.71.175.112/28",
"13.71.199.96/28",
"13.73.244.128/28",
"13.74.111.128/28",
"13.78.111.144/28",
"13.86.221.224/28",
"20.37.157.64/28",
"20.37.195.80/28",
"20.38.85.128/28",
"20.39.11.64/28",
"20.41.5.128/28",
"20.42.6.32/27",
"20.42.6.128/28",
"20.42.64.64/26",
"20.42.131.192/27",
"20.42.230.64/28",
"20.43.65.208/28",
"20.43.130.240/28",
"20.189.108.32/28",
"40.69.110.176/28",
"40.70.151.80/28",
"40.74.30.144/28",
"40.75.35.224/28",
"40.78.239.0/28",
"40.78.251.224/27",
"40.82.253.112/28",
"40.89.17.192/28",
"40.119.9.240/28",
"51.104.25.208/28",
"51.105.77.64/28",
"52.150.139.144/28",
"52.182.141.128/28",
"52.228.81.224/28",
"104.214.161.48/28"
]
(downloaded from https://www.microsoft.com/en-us/download/details.aspx?id=56519)

For AKS, see: https://docs.microsoft.com/en-us/azure/aks/limit-egress-traffic
Note: it's important to pass the client machine IP while creating the AKS cluster

By EOD (we're releasing this feature), you should be able to whitelist us using service tags as described here: https://github.com/Azure/dev-spaces/blob/master/public-ips/README.md
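
As a rough illustration of combining the Dev Spaces ranges with the client machine IP, the Python sketch below validates a few of the CIDR ranges listed above, appends the client IP as a `/32` if no range already covers it, and joins everything into the comma-separated form that the `--api-server-authorized-ip-ranges` option in the linked AKS documentation takes. The function names are hypothetical, only the first three ranges are included for brevity, and `203.0.113.10` is a documentation-only placeholder for the client IP.

```python
import ipaddress

# First three Dev Spaces ranges from the list above; extend with the rest as needed.
DEV_SPACES_RANGES = ["13.69.71.144/28", "13.70.78.176/28", "13.71.175.112/28"]

# Placeholder client machine IP (TEST-NET-3 documentation range), not a real address.
CLIENT_IP = "203.0.113.10"


def is_covered(ip, ranges):
    """Return True if the IP address falls inside any of the CIDR ranges."""
    address = ipaddress.ip_address(ip)
    return any(address in ipaddress.ip_network(r) for r in ranges)


def build_authorized_ranges(service_ranges, client_ip):
    """Validate the ranges, add the client IP as a /32 if needed, join with commas."""
    # ip_network raises ValueError for malformed ranges or ranges with host bits set.
    networks = [str(ipaddress.ip_network(r)) for r in service_ranges]
    if not is_covered(client_ip, service_ranges):
        networks.append(f"{ipaddress.ip_address(client_ip)}/32")
    return ",".join(networks)


if __name__ == "__main__":
    print(build_authorized_ranges(DEV_SPACES_RANGES, CLIENT_IP))
```

The resulting string is only a convenience for assembling the value. Whether it is passed at cluster creation or in a later update is up to whatever the AKS documentation above recommends for your setup.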

@rakeshvanga
Contributor

rakeshvanga commented Jun 1, 2020

@moewill You can go over Azure Dev Spaces documentation for this specific scenario here: https://docs.microsoft.com/en-us/azure/dev-spaces/configure-networking#using-api-server-authorized-ip-ranges
As amsoedal mentioned, it will be updated to reflect the service tag changes.

Also, the JSON list of IPs downloaded from https://www.microsoft.com/en-us/download/details.aspx?id=56519 contains the IPs used by Dev Spaces per region as well, in case you want to whitelist a smaller set of IP ranges.
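
To narrow the downloaded file down to the Dev Spaces entries, something along these lines may help. It is only a sketch: the file layout (a top-level `values` list whose entries carry `name` and `properties.addressPrefixes`), the `AzureDevSpaces` tag name, and the `Service.Region` naming are assumptions to verify against the actual download. `ExampleRegion`, `SomeOtherService`, and the tag-to-prefix mapping in the sample are invented for illustration.

```python
import json

# Hypothetical, trimmed sample. The layout is an assumption about the downloaded
# file; "ExampleRegion" and the tag-to-prefix mapping are made up.
SAMPLE_SERVICE_TAGS = """
{
  "values": [
    {"name": "AzureDevSpaces",
     "properties": {"addressPrefixes": ["13.69.71.144/28", "13.70.78.176/28"]}},
    {"name": "AzureDevSpaces.ExampleRegion",
     "properties": {"addressPrefixes": ["13.69.71.144/28"]}},
    {"name": "SomeOtherService",
     "properties": {"addressPrefixes": ["203.0.113.0/24"]}}
  ]
}
"""


def prefixes_for(service_tags, service, region=None):
    """Return the address prefixes for a service tag, optionally for one region."""
    # Assumption: regional tags are named "<service>.<region>".
    wanted = f"{service}.{region}" if region else service
    for entry in service_tags.get("values", []):
        if entry.get("name") == wanted:
            return list(entry.get("properties", {}).get("addressPrefixes", []))
    return []  # no matching tag found


if __name__ == "__main__":
    tags = json.loads(SAMPLE_SERVICE_TAGS)
    print(prefixes_for(tags, "AzureDevSpaces"))
    print(prefixes_for(tags, "AzureDevSpaces", region="ExampleRegion"))
```

With the real file, `json.load` on the downloaded JSON would replace the inline sample.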
