-
Let the LLM try first
-
Reflections on prototyping a sysadmin benchmark
-
Gemini 3 Flash's Claudiness
-
Benchmarks I'm watching now
-
Dia browser
-
Coding agent with Simon Willison's llm
-
Pareto frontier LLMs, Aider edition
-
Pareto frontier LLMs, Kagi edition
-
Model Context Protocol, simply
-
o3 is the research assistant I wanted
-
ChatGPT's improved search
-
Steam engine time
-
Perplexity is still better at quick searches