Building systems, breaking illusions, and sharing learnings.
How models learn semantic relationships in language
Feb 9, 2026
The survival of your AI app depends on your token economics
Feb 2, 2026
The Invisible Tax of AI Tokenization: Why Non-English Speakers Pay More
Jan 26, 2026
How your prompt becomes vectors, attention, and text in milliseconds
Jan 19, 2026
How words find their place in semantic space through push, pull, and careful trade-offs
Jan 12, 2026
How a simple observation about language created modern AI's understanding of meaning
Jan 5, 2026
The 35-year engineering journey that turned a linguistic insight into AI's foundation
Dec 29, 2025
How I stopped seeing vectors as magic and started seeing them as the geometry of meaning itself.
Dec 22, 2025
Should you build a custom tokenizer? A production decision framework
Dec 15, 2025
Why Swahili speakers pay 1.8x more than English speakers—and how to fix it
Dec 8, 2025
From GPT to BERT to T5 - the algorithms that learned to speak 100 languages
Dec 1, 2025
How the invisible step before training shapes model intelligence, costs, and fairness.
Nov 24, 2025
Uncover the sources of cache inconsistency, its impacts, and hands-on strategies to measure and mitigate it for reliable systems.
Nov 17, 2025
Nov 10, 2025
Keeping Your Cache Lean and Mean
Nov 3, 2025
Caching Fundamentals and Core Patterns
Oct 27, 2025
Exporting B-Tree Wisdom to Caches, Queues, and Distributed Architectures for Real-World Wins
Oct 20, 2025
Databases don’t just store data in B-Trees - they update them concurrently at massive scale.
Oct 13, 2025
Every byte you leave empty, every split you delay, and every page-size decision trades space, latency, and bandwidth - here’s how to choose.
Oct 6, 2025
The Hidden Design Principle That Makes B-Trees Endure
Sep 29, 2025