samek.fyi
  • Let the LLM try first (March 11, 2026)
  • Reflections on prototyping a sysadmin benchmark (February 13, 2026)
  • Gemini 3 Flash's Claudiness (December 19, 2025)
  • Benchmarks I'm watching now (October 20, 2025)
  • Dia browser (June 13, 2025)
  • Coding agent with Simon Willison's llm (May 28, 2025)
  • Pareto frontier LLMs, Aider edition (May 8, 2025)
  • Pareto frontier LLMs, Kagi edition (May 3, 2025)
  • Model Context Protocol, simply (May 1, 2025)
  • o3 is the research assistant I wanted (April 30, 2025)
  • ChatGPT's improved search (April 28, 2025)
  • Steam engine time (April 27, 2025)
  • Perplexity is still better at quick searches (April 25, 2025)