
[Terraform] Add rate limiting #1084

Open · wants to merge 2 commits into base: main

Conversation

phalbert (Contributor)

Description

What - This PR implements rate limiting logic in the Terraform Cloud integration to address frequent 429 Too Many Requests warnings in the logs.

Why - We were experiencing excessive 429 Too Many Requests warnings, indicating that our integration was exceeding Terraform Cloud's API rate limits. This was cluttering the logs and potentially impacting performance.

How - Implemented a more robust rate limiting mechanism in the TerraformClient class:

  1. Added a wait_for_rate_limit method to enforce rate limits.
  2. Implemented a token bucket algorithm to manage API request rates.
  3. Added asyncio locks to ensure thread-safe rate limiting in asynchronous operations.
  4. Updated the send_api_request method to use the new rate limiting logic.
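
For illustration, a minimal sketch of what points 1–4 can look like together (hypothetical names; assumes an httpx-style async client and mirrors the 30 req/s and 10-concurrent values used in this PR's diff — it is not the exact implementation):

import asyncio
import time

import httpx  # assumed HTTP client, used here only for illustration

RATE_LIMIT_PER_SECOND = 30
MAX_CONCURRENT_REQUESTS = 10


class RateLimitedClient:
    """Hypothetical sketch: a token bucket guarded by an asyncio.Lock,
    plus a Semaphore to cap the number of concurrent requests."""

    def __init__(self) -> None:
        self._tokens = float(RATE_LIMIT_PER_SECOND)
        self._last_refill = time.monotonic()
        self._lock = asyncio.Lock()
        self._semaphore = asyncio.Semaphore(MAX_CONCURRENT_REQUESTS)

    async def _acquire_token(self) -> None:
        async with self._lock:
            now = time.monotonic()
            # Refill the bucket in proportion to elapsed time, capped at the bucket size.
            self._tokens = min(
                RATE_LIMIT_PER_SECOND,
                self._tokens + (now - self._last_refill) * RATE_LIMIT_PER_SECOND,
            )
            self._last_refill = now
            if self._tokens < 1:
                # Not enough budget: sleep until one token has accumulated.
                await asyncio.sleep((1 - self._tokens) / RATE_LIMIT_PER_SECOND)
                self._tokens = 1.0
                self._last_refill = time.monotonic()
            self._tokens -= 1

    async def send_api_request(self, client: httpx.AsyncClient, url: str) -> dict:
        # Cap in-flight requests, then pace the request rate before calling the API.
        async with self._semaphore:
            await self._acquire_token()
            response = await client.get(url)
            response.raise_for_status()
            return response.json()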

Type of change

  • Bug fix (non-breaking change which fixes an issue)

All tests should be run against the Port production environment (using a testing org).

Core testing checklist

  • Integration able to create all default resources from scratch
  • Resync finishes successfully
  • Resync able to create entities
  • Resync able to update entities
  • Resync able to detect and delete entities
  • Scheduled resync able to abort existing resync and start a new one
  • Tested with at least 2 integrations from scratch
  • Tested with Kafka and Polling event listeners
  • Tested deletion of entities that don't pass the selector

Integration testing checklist

  • Integration able to create all default resources from scratch
  • Resync able to create entities
  • Resync able to update entities
  • Resync able to detect and delete entities
  • Resync finishes successfully
  • If new resource kind is added or updated in the integration, add example raw data, mapping and expected result to the examples folder in the integration directory.
  • If resource kind is updated, run the integration with the example data and check if the expected result is achieved
  • If new resource kind is added or updated, validate that live-events for that resource are working as expected
  • Docs PR link here

Preflight checklist

  • Handled rate limiting
  • Handled pagination
  • Implemented the code in async
  • Support Multi account

Screenshots

[Include screenshots of logs showing reduced 429 warnings, if available]

API Documentation

phalbert self-assigned this on Oct 18, 2024
phalbert requested a review from a team as a code owner on Oct 18, 2024, 17:03
mk-armah (Member) left a comment:

Great work 👏🏽, left a few comments. Also, I was wondering if we could use aiolimiter to control requests more easily, since the Terraform rate limit is time-bound.
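
For reference, the aiolimiter suggestion could look roughly like the sketch below (illustrative only, not this PR's implementation; assumes an httpx-style client and applies the 30-per-second limit minus the 5-request buffer from the constants shown below):

import httpx  # assumed HTTP client, used here only for illustration
from aiolimiter import AsyncLimiter

# 30 requests/second minus a safety buffer of 5, per the constants in the diff.
limiter = AsyncLimiter(max_rate=25, time_period=1)

async def send_api_request(client: httpx.AsyncClient, url: str) -> dict:
    # AsyncLimiter blocks here until the current one-second window has capacity,
    # removing the need to track request timestamps manually.
    async with limiter:
        response = await client.get(url)
        response.raise_for_status()
        return response.json()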

# https://developer.hashicorp.com/terraform/cloud-docs/api-docs#rate-limiting
RATE_LIMIT_PER_SECOND = 30
RATE_LIMIT_BUFFER = 5 # Buffer to avoid hitting the exact limit
MAX_CONCURRENT_REQUESTS = 10
mk-armah (Member):

Why 10?

json=json_data,
)
response.raise_for_status()
async with self.semaphore:
mk-armah (Member):

Is there a concurrency constraint on the Terraform Cloud API as well?

            workspaces[i : i + CHUNK_SIZE]
            for i in range(0, len(workspaces), CHUNK_SIZE)
        ]:
            chunk_results = await gather(*[fetch_runs_for_workspace(w) for w in chunk])
mk-armah (Member):

Is there a specific reason for replacing as_completed with gather? Waiting for all tasks within a chunk to complete before moving on to the next chunk appears to impede performance in this context. Please correct me.
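
For context, the as_completed pattern the reviewer refers to processes each workspace's result as soon as its task finishes, rather than waiting for a whole chunk. A rough sketch (hypothetical names, not the integration's actual code); concurrency would still be bounded by the client-level semaphore or rate limiter:

import asyncio

async def stream_runs(workspaces, fetch_runs_for_workspace):
    # asyncio.as_completed yields results in completion order, so one slow
    # workspace does not hold back the rest of its chunk.
    tasks = [fetch_runs_for_workspace(w) for w in workspaces]
    for finished in asyncio.as_completed(tasks):
        yield await finished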

Comment on lines +43 to +66

        self.rate_limit = RATE_LIMIT_PER_SECOND
        self.rate_limit_remaining = RATE_LIMIT_PER_SECOND
        self.rate_limit_reset: float = 0.0
        self.last_request_time = time.time()
        self.request_times: list[float] = []
        self.semaphore = asyncio.Semaphore(MAX_CONCURRENT_REQUESTS)
        self.rate_limit_lock = asyncio.Lock()

    async def wait_for_rate_limit(self) -> None:
        async with self.rate_limit_lock:
            current_time = time.time()
            self.request_times = [t for t in self.request_times if current_time - t < 1]

            if len(self.request_times) >= RATE_LIMIT_PER_SECOND:
                wait_time = 1 - (current_time - self.request_times[0])
                if wait_time > 0:
                    logger.info(
                        f"Rate limit reached, waiting for {wait_time:.2f} seconds"
                    )
                    await asyncio.sleep(wait_time)
                self.request_times = self.request_times[1:]

            self.request_times.append(current_time)
