Overhead of Event Hub outputs with Azure Function Apps
In a world where you're billed for what you use, it pays to really understand what exactly it is you are using. The pricing model of the Azure Function Apps consumption plan sounds pretty simple (pay based on execution count and execution time) - though as always the devil is in the detail.
In a recent project where we've migrated a workload from a dedicated (on-prem) server to a function app, someone asked what sounded like a fairly simple question:
- Do we pay (on the function app side) for the output to an Event Hub, and if so how much does it cost?
This post explores the answer to that question for a trivial C# function, and provides a few pointers to help get your head around consumption billing.
If you're wondering why we wanted to deconstruct the function's cost when an execution unit costs a mere 16 picodollars (16x10^-12), consider what we saw after our first week at full load:
5 trillion execution units... Interesting!
Execution units vs. what you see on the pricing page is covered later in the post.
Consumption billing
Functions hosted on the consumption plan charge you for:
- Every function execution ($0.20 per million execution)
- Every gigabyte-second used ($0.000016 per GBs)
If the function runs for one second and uses 1GB of memory, your function has used 1GBs. The smallest quantum is 128MB for memory, and 1 millisecond for duration. Billing rounds up, so a function using 1MB for 10us is billed at 0.128GB x 1ms, a function using 129MB for 1001us is billed at 0.256GB for 2ms, etc.
Execution time is measured from the start of the function to the end of the function - and is not CPU time. If your function does any calls to external systems (like an Event Hub) that is billed. This includes the input/output bindings.
This is one reason why your functions shouldn't call other functions - you'll get billed for the execution time of both! If you need to use this pattern then Durable Functions might be a better fit, though prepare for a whole host of new billing complexities to wrap your head around.
Billing information is provided at the function-app level, and not per-function level.
Azure Monitor doesn't expose the GBs billing metric directly, and instead exposes a metric called Function Execution Units. One of these is equal to 1 megabyte millisecond. Converting from GBs to MBms can be done by dividing through by 1,024,000 (GB -> MB is 1024, s -> ms is 1000).
All of these together tell us that:
- Any optimisation that doesn't help us cross a boundary of 128MB won't impact cost
- Any optimisation that doesn't help us cross a boundary of 1ms execution time wont' impact cost
- Adding an Event Hub output is only 'free' if it takes <1ms and has a memory impact that won't take us over a 128MB memory boundary
For more details on the consumption plan check out the Consumption FAQ. Precise details on how the memory usage is billed is pretty hard to pin down - there are a handful of outstanding issues that may need to be resolved before it's straightforward to examine (see here, here, and here).
Test harness
Our test harness is going to take a stock function (return
) and compare it to a function which outputs a message to an Event Hub.
The functions below were deployed as separate function apps (to enable parallel testing and still get the granular cost metrics needed). You can view the sample project on GitHub. The deployment was a straightforward 'Publish' from Visual Studio, with each function getting it's own storage account/app service plan (on the consumption tier), as well as an application insights resource. An Event Hub was created to act as the output binding for the second test.
No-Op Test
[FunctionName("NoOp")]
public static async Task<IActionResult> Run(
[HttpTrigger(AuthorizationLevel.Anonymous, "post", Route = null)] HttpRequest req,
ILogger log)
{
return new OkResult();
}
Event Hub Test
[FunctionName("ToEventHub")]
public static async Task<IActionResult> Run(
[HttpTrigger(AuthorizationLevel.Anonymous, "post", Route = null)] HttpRequest req,
ILogger log,
[EventHub(eventHubName: "fatesteventhub", Connection = "eh-connection")] ICollector<string> outMessages)
{
outMessages.Add("Hello from the function app");
return new OkResult();
}
Running the Test
This PowerShell script uses runspaces to execute the tests in parallel. We're interested in executing the function a set number of times and examining cost (as opposed to load testing), and so any delay or 'unfairness' due to the client-driver being slow isn't an issue.
Once the code has finished executing we can then go to Azure Monitor and pull the stats on execution count for both function apps and see what overhead there is.
$BASELINE_URI = "https://fa-eh-test-appinsights.azurewebsites.net/api/noop"
$EH_OUTPUT_URI = "https://fa-eh-test-output.azurewebsites.net/api/toeventhub"
$REPEATS = 10000
$tests = @(
@{ Name = "Baseline"; Uri = $BASELINE_URI },
@{ Name = "EH Output"; Uri = $EH_OUTPUT_URI }
)
$scriptblock = {
Param (
$test,
$repeats
)
Write-Output "Testing $($test.Name) for $repeats repeats at $([DateTime]::UtcNow)"
$timings = @()
for($i = 0; $i -lt $repeats; $i++) {
$timings += (Measure-Command {
Invoke-WebRequest -Uri $test.Uri -Method Post | Out-Null
}).TotalMilliseconds
}
$summary = $timings | Measure-Object -Average -Maximum
Write-Output "Finished $($test.Name) - Average $($summary.Average) - Max $($summary.Maximum) at $([DateTime]::UtcNow)"
}
## Thank you https://blog.netnerds.net/2016/12/runspaces-simplified/ !
$pool = [RunspaceFactory]::CreateRunspacePool(1,2)
$pool.Open()
$runspaces = @()
$tests | ForEach-Object {
$runspace = [PowerShell]::Create()
$null = $runspace.AddScript($scriptblock)
$null = $runspace.AddArgument($_)
$null = $runspace.AddArgument($repeats)
$runspace.RunspacePool = $pool
$runspaces += [PSCustomObject]@{ Pipe = $runspace; Status = $runspace.BeginInvoke() }
}
while ($runspaces.Status -ne $null)
{
$completed = $runspaces | Where-Object { $_.Status.IsCompleted -eq $true }
foreach ($runspace in $completed)
{
$runspace.Pipe.EndInvoke($runspace.Status)
$runspace.Status = $null
}
}
$pool.Close()
$pool.Dispose()
Results
After running the test multiple times the numbers were fairly consistent - the function with an Event Hub output binding cost about 18% more to run than a stock function.
After crunching the raw numbers (which you can export from Azure Monitor to a CSV) we get to:
Function | Execution Units |
---|---|
Stock | 17,371 |
Event Hub Output | 20,500 |
The addition of an Event Hub output was costing 3,129 execution units, or about $0.000000049 per execution (from the function app side - don't forget there's a whole set of billing concerns associated with the hub itself!).
To put that into context here is what that translates to per-week in additional cost for the Event Hub output binding for various requests per second:
RPS | Additional Weekly Cost |
---|---|
100 | $2.96 |
500 | $14.78 |
1000 | $29.57 |
It's worth emphasising that this is additional cost, and that at high RPS the execution count component of cost starts to dominate. For a function being executed at 1000 RPS you'd pay:
- $164.15 in GBs for the stock function
- $29.58 extra in GBs for the Event Hub output binding
- $120.96 for the execution count (1000 RPS is 605 million executions per week)
- $314.69 in total
It's safe to say that when looking into cost that the presence of a binding itself is not the thing to worry about. Over time as the hosting environment and functions runtime improve I would expect the cost associated with a stock function to go down. For very fast functions execution count will dominate cost.
Closing Thoughts
Cost-control in the cloud can be very... interesting? Knowing exactly how much you're spending is easy, but knowing what we're spending it on (and if that's the most efficient way to solve the problem)...another matter entirely.
This post has scratched the surface of this topic - probably leaving you with more questions than answers. I haven't even got into considerations such as dedicated vs. consumption (some excellent coverage on the Azure from the trenches blog).
For our 5 trillion execution unit app we came to realise pretty quickly that the real issue is the architecture of our system - changing the client to reduce the number of messages sent (via batching) will deliver fair more value than than chasing down micro-optimisations in the function app. Some napkin math reveals we can probably get a 5-10x reduction in call count, and as we know executions dominate our cost that optimisation is falling straight to the bottom line.
Understanding how the cloud services are built and how you're charged for using them is key in building the mental models required to architect a cost-effective deployment - the savings at scale can be quite significant.