currybab's devlog

2508W2 Recently Read

06 Aug, 2025

Scaling vLLM for Embeddings: 16x Throughput and Cost Reduction
What is the N+1 Query Problem and How to Solve it?
Developers, Reinvented
- good article for the new generation of developers working with AI.
- I think I’m at stage 2 (AI Explorer). I want to reach stage 3 (AI Strategist).
DSPy SIMBA explained
How Instacart Built a Modern Search Infrastructure on Postgres
- interesting article about search infrastructure.
- moving text search infra from Elasticsearch to PostgreSQL.
- moving vector search infra from FAISS to PostgreSQL (pgvector).
Milvus: Building a large-scale vector DB for LINE VOOM’s real-time recommendation system
- know-how from LINE VOOM’s vector database selection.
Go’s race detector has a mutex blind spot
Inside Go Channels: A deep dive into the Runtime’s concurrency magic
- good articles about Go concurrency and channels (this one and the race detector article above).
From GPT-2 to gpt-oss: Analyzing the Architectural Advances