Thread pool starvation triage checklist
A small incident package: a triage checklist plus a dependency-call logging schema for proving queueing and stopping repeat timeouts in legacy ASP.NET services.
FreeJan 21, 2026
DownloadFrom this article
Browse allUse this when CPU looks fine but the whole service starts timing out.
The download contains the real checklist and the logging schema. Start with thread-pool-starvation-triage-checklist.md.
What you get
thread-pool-starvation-triage-checklist.md(the incident checklist)dependency-call-logging-schema.md(the log fields and examples that make queueing obvious)
When to use this
Use it when you see this cluster:
- latency rises across many endpoints at the same time
- throughput drops
- CPU is not pegged
- downstream timeouts appear (HTTP, SQL)
- recycle or restart appears to fix it temporarily
How to use it
- On call: run the checklist top to bottom, then capture one short artifact while it is slow.
- Tech lead: ship the smallest fixes that remove thread capture and add budgets (timeouts, concurrency caps, bounded retries).
- CTO: use the log schema to make risk measurable (timeouts by dependency, backlog shape, and repeat incidents).
Newsletter
Get the .NET production reliability newsletter
Weekly runbooks, failure patterns, and practical fixes.
No spam. Unsubscribe anytime.
Need help implementing this?
I can help you apply this to your systems without the drama.
Work with meSimilar resources
More resources to help you succeed
Canonical: https://matrixtrak.com/resources/thread-pool-starvation-triage-checklist