Thread pool starvation triage checklist

A small incident package: a triage checklist plus a dependency-call logging schema for proving queueing and stopping repeat timeouts in legacy ASP.NET services.

Free · Jan 21, 2026

Use this when CPU looks fine but the whole service starts timing out.

The download contains the full checklist and the logging schema. Start with thread-pool-starvation-triage-checklist.md.

What you get

  • thread-pool-starvation-triage-checklist.md (the incident checklist)
  • dependency-call-logging-schema.md (the log fields and examples that make queueing obvious)
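To illustrate why per-call log fields can make queueing obvious, here is a minimal sketch in Python. It is not the schema from the download: every field name below (`requested_ms`, `started_ms`, `finished_ms`, and so on) is a hypothetical stand-in. The idea it shows is to record when a call was requested separately from when it started running, so that time spent waiting for a thread can be told apart from time spent inside the dependency.

```python
from dataclasses import dataclass


@dataclass
class DependencyCall:
    # All field names are hypothetical, not the downloadable schema.
    dependency: str       # e.g. "sql:orders" or "http:billing"
    requested_ms: float   # when the caller asked to make the call
    started_ms: float     # when the call actually began executing
    finished_ms: float    # when the call returned or timed out
    timed_out: bool

    @property
    def queue_wait_ms(self) -> float:
        # Time spent waiting before any work on the call started.
        return self.started_ms - self.requested_ms

    @property
    def call_ms(self) -> float:
        # Time spent inside the dependency call itself.
        return self.finished_ms - self.started_ms


def looks_like_queueing(calls, ratio=1.0):
    """True when total wait-before-start exceeds ratio * total time in calls."""
    wait = sum(c.queue_wait_ms for c in calls)
    work = sum(c.call_ms for c in calls)
    return wait > ratio * work
```

If most of the elapsed time sits in `queue_wait_ms` while `call_ms` stays flat, the dependency is probably not the slow part; the caller is waiting for a thread. The `ratio` threshold is an arbitrary illustration, not a recommended value.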

When to use this

Use it when you see this cluster:

  • latency rises across many endpoints at the same time
  • throughput drops
  • CPU is not pegged
  • downstream timeouts appear (HTTP, SQL)
  • recycle or restart appears to fix it temporarily

How to use it

  • On call: run the checklist top to bottom, then capture one short artifact while it is slow.
  • Tech lead: ship the smallest fixes that remove thread capture and add budgets (timeouts, concurrency caps, bounded retries).
  • CTO: use the log schema to make risk measurable (timeouts by dependency, backlog shape, and repeat incidents).
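The three budgets named for the tech lead (timeouts, concurrency caps, bounded retries) can be sketched together in a few lines. This is a language-neutral illustration written in Python with asyncio, not code from the download; the class name `BudgetedCaller` and its default values are hypothetical. In a legacy ASP.NET service the analogous pieces would likely be a `SemaphoreSlim`, a per-call timeout, and a small retry loop.

```python
import asyncio


class BudgetedCaller:
    """Hypothetical wrapper: one dependency, three budgets."""

    def __init__(self, max_concurrent=4, timeout_s=2.0, max_attempts=2):
        self._slots = asyncio.Semaphore(max_concurrent)  # concurrency cap
        self._timeout_s = timeout_s                      # per-attempt timeout
        self._max_attempts = max_attempts                # bounded retries (>= 1)

    async def _guarded(self, operation):
        # Hold a slot only while the operation is running.
        async with self._slots:
            return await operation()

    async def call(self, operation):
        # The timeout covers both waiting for a slot and the call itself,
        # so a saturated dependency fails fast instead of piling up waiters.
        for attempt in range(1, self._max_attempts + 1):
            try:
                return await asyncio.wait_for(
                    self._guarded(operation), self._timeout_s
                )
            except asyncio.TimeoutError:
                if attempt == self._max_attempts:
                    raise  # budget exhausted: surface the timeout
```

The design choice worth noting is that the timeout wraps the wait for a slot as well as the call. Without that, the cap alone would only move the queue from the dependency to the semaphore.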

Canonical: https://matrixtrak.com/resources/thread-pool-starvation-triage-checklist