Calling MetricsLogger.flush() causes RuntimeWarnings #52

Dunedan · 2020-08-27T15:03:05Z

Calling MetricsLogger().flush() as documented in the README causes RuntimeWarnings.

Here is a minimal example:

#!/usr/bin/env python3

import os
os.environ["AWS_LAMBDA_FUNCTION_NAME"] = "dummy-function-name"

from aws_embedded_metrics import metric_scope

@metric_scope
def foo(metrics):
    metrics.put_metric("foo", 1, "Count")
    metrics.flush()

foo()

Output when running the script:

./poc.py:11: RuntimeWarning: coroutine 'MetricsLogger.flush' was never awaited
  metrics.flush()
RuntimeWarning: Enable tracemalloc to get the object allocation traceback
{"LogGroup": "dummy-function-name", "ServiceName": "dummy-function-name", "ServiceType": "AWS::Lambda::Function", "executionEnvironment": "", "memorySize": "", "functionVersion": "", "logStreamId": "", "_aws": {"Timestamp": 1598540408195, "CloudWatchMetrics": [{"Dimensions": [["LogGroup", "ServiceName", "ServiceType"]], "Metrics": [{"Name": "foo", "Unit": "Count"}], "Namespace": "aws-embedded-metrics"}]}, "foo": 1}


jaredcnance · 2020-08-27T15:45:55Z

This is related to #21 where we will likely make flush() synchronous. Today, it is an async call and should be awaited. We can update the documentation in the meantime to make this more clear.
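To illustrate why the warning appears, here is a minimal sketch that uses only the standard library. `StandInLogger` is a hypothetical stand-in, not the library's MetricsLogger; the only thing it shares with it is that `flush()` is declared with `async def`. Calling such a method without awaiting it only creates a coroutine object, and the method body never runs.

```python
import asyncio
import gc
import warnings


class StandInLogger:
    """Hypothetical stand-in for a logger whose flush() is a coroutine."""

    def __init__(self):
        self.flushed = False

    async def flush(self):
        self.flushed = True


def call_without_await():
    """Call flush() like a normal method and record any warnings raised."""
    logger = StandInLogger()
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        logger.flush()  # creates a coroutine object and immediately discards it
        gc.collect()    # make sure the discarded coroutine has been finalized
    return logger.flushed, [str(w.message) for w in caught]


def call_with_await():
    """Drive the coroutine to completion from synchronous code."""
    logger = StandInLogger()
    asyncio.run(logger.flush())
    return logger.flushed


flushed_without_await, messages = call_without_await()
flushed_with_await = call_with_await()
```

In the un-awaited case the stand-in's `flushed` flag stays `False`, which is what the "was never awaited" warning is pointing at.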

ryandeivert · 2021-11-12T23:21:44Z

what's the status on this issue? this is super noisy in lambda CWL output

SamStephens · 2021-11-12T23:52:03Z

@ryandeivert status is unchanged. The method is still asynchronous, and you need to be awaiting it. In my code, I'm doing this:

import asyncio

from aws_embedded_metrics import metric_scope


@metric_scope
def handler(event, context, metrics):
    try:
        ...  # Actual handler logic
    finally:
        # flush() is a coroutine, so it has to be run to completion on an event loop
        loop = asyncio.get_event_loop()
        loop.run_until_complete(metrics.flush())

ryandeivert · 2021-11-13T00:20:35Z

yes I understand - but for larger lambda codebases, where the bulk of the logic does not live within the scope of a single function, using this library as a decorator isn't ideal. effectively, I'd have to wrap every function that logs metrics with it, instead of just creating a logger object to be used directly. unless I'm misunderstanding the API, in which case I'd love to hear about alternatives.

edit: can you also clarify why certain properties are injected into the output, with no ability to override them (see here)?

I'm currently using the below to work around this:

from aws_embedded_metrics import MetricsLogger as _MetricsLogger
from aws_embedded_metrics.environment.lambda_environment import LambdaEnvironment


class MetricsLogger(_MetricsLogger):
    def __init__(self):
        super().__init__(None, None)
        self.environment = LambdaEnvironment()

    def flush(self) -> None:
        """Override the default async MetricsLogger.flush method, flushing to stdout immediately"""
        sink = self.environment.get_sink()
        sink.accept(self.context)
        self.context = self.context.create_copy_with_context()

    def with_dimensions(self, *dimensions):
        return self.set_dimensions(*dimensions)


def main():

    new_logger = MetricsLogger()
    new_logger.put_metric('metric_name', 10).with_dimensions({'dim01': 'value01', 'dim02': 'value02'})
    new_logger.flush()

heldersepu · 2023-01-13T16:46:09Z

@ryandeivert status is unchanged. The method is still asynchronous, and you need to be awaiting it. In my code, I'm doing this:
@metric_scope
def handler(event, context, metrics):
    try:
        ...  # Actual handler logic
    finally:
        # Need to call flush like this because it's a coroutine/asynchronous
        loop = asyncio.get_event_loop()
        loop.run_until_complete(metrics.flush())

@SamStephens Why would anyone need to do that?
That is already done here:
https://github.com/awslabs/aws-embedded-metrics-python/blob/v3.0.0/aws_embedded_metrics/metric_scope/__init__.py#L48-L50
Unless I'm missing something, there is no need to call flush ourselves.

SamStephens · 2023-09-14T06:02:35Z

@ryandeivert thanks for your workaround, it's saved my bacon with Flask, Gunicorn and Gevent where for reasons I don't fully understand I cannot use Flask's async support.

However, I also don't understand why you need your workaround with Lambda. If you're actually calling other functions, surely your main function really looks like

def main():

    new_logger = MetricsLogger()
    do_some_work(new_logger)
    do_something_else(new_logger)
    new_logger.flush()

If so, I don't actually see how this is different to

@metric_scope
def main(new_logger):
    do_some_work(new_logger)
    do_something_else(new_logger)

SamStephens · 2023-09-14T23:48:53Z

what's the status on this issue? this is super noisy in lambda CWL output

@ryandeivert it's worth noting this isn't just noise. The warning means it's possible for the Lambda function to be shut down before the flush actually completes, because you're not waiting for it to complete. This means there's a chance of metrics being lost.

lukepafford · 2024-09-11T18:37:34Z

Yeah this is pretty annoying. I basically want to emit a metric in our lambda that can possibly process multiple failing hostnames:

from aws_embedded_metrics.logger.metrics_logger import MetricsLogger

async def emit_hostname_failure(logger: MetricsLogger, hostname: str) -> None:
    logger.set_namespace("MyNamespace")
    logger.put_metric("HostnameFailure", 1, "Count")

    # Hostname is a high cardinality value. Do NOT use put_dimensions, but instead use set_property
    # where results will be queried through CloudWatch Insights.
    logger.set_property("Hostname", hostname)

    # A property can only be tied to a single metric, so call flush
    # for each device data point.
    await logger.flush() # ERROR! Result of async function call is not used; use "await" or assign result to variable

Unfortunately I now need to either figure out how to make the lambda work async, or subclass the logger and override flush() like @ryandeivert did. Either way this is a headache.

Looks like wrapping the call in asyncio.run should work to run the function synchronously:

import asyncio
from aws_embedded_metrics.logger.metrics_logger_factory import create_metrics_logger

def handler(event, context):
    logger = create_metrics_logger()
    ...
    asyncio.run(emit_hostname_failure(logger, "hostname"))
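
A slightly more defensive variant of the same idea, as a sketch only: `run_sync` is a hypothetical helper and `StandInLogger` is a hypothetical stand-in with an async `flush()`, so nothing here is the library's API. The helper uses `asyncio.run` when no event loop is running in the current thread and refuses to run otherwise, since `asyncio.run` cannot be called from inside a running loop.

```python
import asyncio


class StandInLogger:
    """Hypothetical stand-in; not the aws_embedded_metrics MetricsLogger."""

    def __init__(self):
        self.properties = {}
        self.emitted = []

    def set_property(self, key, value):
        self.properties[key] = value
        return self

    async def flush(self):
        # Emit one record per flush, then start from a clean set of properties.
        self.emitted.append(dict(self.properties))
        self.properties = {}


def run_sync(coro):
    """Run a coroutine to completion from synchronous code."""
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        # No loop is running in this thread, so asyncio.run is safe to use.
        return asyncio.run(coro)
    # Close the coroutine so it does not trigger a "never awaited" warning.
    coro.close()
    raise RuntimeError("run_sync() called from a running event loop; use 'await' instead")


def handler(hostnames):
    logger = StandInLogger()
    for hostname in hostnames:
        logger.set_property("Hostname", hostname)
        run_sync(logger.flush())  # one flush per hostname, as in the comment above
    return logger.emitted


events = handler(["a.example", "b.example"])
```

Each `asyncio.run` call creates and tears down its own event loop, which is fine for a handful of flushes per invocation but may be worth reconsidering if a handler flushes very frequently.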

jaredcnance added the documentation label Aug 27, 2020

SamStephens mentioned this issue Sep 14, 2023

Better support for non-async scenarios #14

Open
