LSM Trees: Why Your Database Writes Are Fast and Your Reads Are Lying to You

LSM Trees, Database Performance, Write Latency

The Problem That Made Me Actually Care About Storage Engines

The thing that broke my comfortable ignorance about storage engines was a pipeline ingesting sensor telemetry: about 50,000 inserts per second into a PostgreSQL 15 cluster. The hardware wasn’t cheap: NVMe drives, 32 cores, 128GB RAM. Didn’t matter. Around 40k inserts/sec, write latency would …

How I Tuned Adaptive Compression for Inverted Indexes and Stopped Wasting 40% of My Disk

Adaptive Compression for Inverted Indexes

The Problem Nobody Warns You About

The thing that caught me off guard wasn’t the query latency; it was the storage invoice. We had a working Elasticsearch cluster, decent relevance tuning, p95 query times under 200ms. Then we crossed 100M documents and the disk bill tripled within two billing cycles. Not doubled. Tripled. …

Ubuntu vs Fedora for Home Server: I Ran Both for 6 Months and Here’s What Actually Matters

Ubuntu vs Fedora for Home Server

I Needed a Home Server OS and Couldn’t Stop Second-Guessing Myself

The machine I was setting up wasn’t impressive: a decommissioned Dell OptiPlex 7050 with 16GB RAM, a 500GB NVMe I had lying around, and an old spinning disk I repurposed for media. The workload: Plex with hardware transcoding, Nextcloud for file sync, Pi-hole …

Docker Is Not the Only Option: I Tested Podman, containerd, and nerdctl So You Don’t Have To

Podman, containerd, nerdctl, Docker Alternatives

Why I Started Looking Past Docker

The licensing change hit us on a Thursday afternoon: Docker Inc. quietly updated their terms and suddenly Docker Desktop required a paid subscription for companies with more than 250 employees or $10M in revenue. We were squarely in that bracket. The per-seat cost wasn’t catastrophic, but it was …

Adaptive Compression in Inverted Indexes: What Actually Happens Inside Lucene, Elasticsearch, and Tantivy

Adaptive Compression in Inverted Indexes

The Problem I Kept Running Into: Index Bloat at Scale

The thing that broke my mental model first wasn’t slow queries; it was watching disk I/O climb to 95% utilization on NVMe drives while average query latency jumped from 12ms to 340ms on a corpus I’d carefully tuned for months. We were running Elasticsearch …