claude-mem: The AI That Takes Notes While You Code
One problem that grinds down every Claude Code session: you spend the first few minutes re-explaining what you were doing yesterday. What files matter, what decision you made last week and why, what that weird bug was that you half-fixed.… [… read more · 466 words →]
The Idea File: Why LLM Agents Change How We Share Work(x.com)
Karpathy's follow-up to a viral tweet. The argument: now that agents can write the code, what you actually want to share is the idea, not the implementation. He calls it an "idea file": a short spec of what to build, nothing more. Worth thinking about for anyone who shares research tools or scripts with collaborators.
Customize Your Claude Code Status Line
Claude Code has a status bar at the bottom of the terminal. By default it's minimal — but you can make it show anything you want. [… read more · 243 words →]
The Shorthand Guide to Everything Claude Code(x.com)
Affaan Mustafa's complete Claude Code setup after 10 months of daily use, covering skills, hooks, subagents, MCPs, plugins, context window management, and editor integration. Dense with practical patterns: hook examples for auto-formatting and tmux reminders, rules structure, subagent scoping, and the one rule that matters most. Keep your active MCPs under 10 or your 200k context window quietly becomes 70k.
Everything We Learned About LLMs in 2025: Simon Willison's Annual Roundup(simonwillison.net)
Simon Willison's annual LLM recap, this time 26 sections long. Covers reasoning models, multimodal, tool use, agents, fine-tuning, inference efficiency, safety, and open weights. He's been doing this for three years so there's a lot of accumulated context. Don't read it straight through. Pick a section from the table of contents and start there.
What We Know About AI Agents: A 264-Page Survey from Meta, DeepMind, Stanford(arxiv.org)
A 264-page survey on AI agents from researchers at Meta, Yale, Stanford, Google DeepMind, and Microsoft. Covers memory, planning, tool use, multi-agent coordination, and evaluation. If you only read one chapter, make it the one on evaluation. That's where most agent benchmark claims quietly stop making sense.