
Principles over hacks. Systems over shortcuts.

128K tokens doesn't mean you should use 128K tokens
Mar 9, 2026 • 14 min read


Your LLM application will hit rate limits. Here's how to handle it gracefully with exponential backoff, token buckets, and priority queuing.
Feb 23, 2026 • 6 min read
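The first of the techniques the teaser names, exponential backoff, can be sketched in a few lines. This is a minimal illustration, not the post's actual code: `call_with_backoff`, the `RateLimitError` stand-in, and the parameter defaults are all assumptions for the example.

```python
import random
import time


class RateLimitError(Exception):
    """Stand-in for whatever 429-style error your LLM client raises."""


def call_with_backoff(fn, max_retries=5, base_delay=1.0, max_delay=30.0):
    """Call fn, retrying on RateLimitError with exponential backoff and full jitter."""
    for attempt in range(max_retries):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the error to the caller
            # Delay doubles each attempt, capped at max_delay.
            delay = min(max_delay, base_delay * (2 ** attempt))
            # Full jitter: sleep a random fraction of the delay so many
            # clients retrying at once don't stampede the API together.
            time.sleep(random.uniform(0, delay))
```

The full-jitter sleep matters in practice: without it, every client that hit the limit at the same moment retries at the same moment, re-triggering the limit.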
