To effectively manage scaling decisions for Microsoft Fabric Real-Time Intelligence workloads, particularly for Eventstream and Event Source, you should monitor the following metrics:
Key Metrics for Scaling Decisions:
- Input Events: Count the number of event data items pulled from sources. A sudden increase may indicate the need for scaling.
- Output Events: Monitor the number of events sent to destinations. A decrease in output relative to input may signal a bottleneck.
- Backlogged Input Events: This metric indicates how many events are waiting to be processed. A high number suggests that the system is becoming constrained.
- Runtime Errors: Track the total number of errors related to event processing. An increase in errors can indicate performance issues that may require scaling.
- Watermark Delay: Monitor the maximum watermark delay across all partitions. A significant delay can indicate that the system is struggling to keep up with incoming events.
- Incoming and Outgoing Bytes: Measure the amount of data being processed. High throughput may necessitate scaling up.
Recommended Thresholds:
- Capacity Constraints: If backlogged input events consistently exceed a certain threshold (e.g., 10% of total input events), consider scaling up.
- Scaling Up: If input events are consistently high and output events are low, or if runtime errors increase, it may be time to scale up.
- Scaling Down: If backlogged input events are consistently low (e.g., below 5% of total input events) and performance metrics are stable, scaling down may be safe.
CU Usage and Throttling:
- Capacity Units (CUs): Monitor CU usage closely. If usage approaches the limit (e.g., 80-90% of allocated CUs), consider scaling up.
- Throttling: Throttling may occur when the system is overwhelmed, typically when CU usage exceeds the allocated capacity. Keep an eye on system alerts for throttling events.
Microsoft Architectural Guidance:
For best practices and architectural guidance, refer to the Microsoft documentation on optimizing capacity and monitoring workloads. This includes strategies for effective scaling and resource management.
References: