Tag: ai
Re-running my RTX 5090 LLM benchmark on Ollama 0.30
3 min read
Ollama 0.30 delivered dramatically better performance on my RTX 5090, especially for Qwen 3.6 35B-A3B.
Benchmarking LLM context scaling on a local RTX 5090 setup
5 min read
Measured real-world context scaling performance of Qwen 3.6 27B and Qwen 3.6 35B-A3B on a local RTX 5090 using Ollama.
Practical Ways to Reduce Token Usage in AI Agents
3 min read
Practical, real-world techniques for reducing token usage in AI agents by controlling context, input, and output.
How I Turned Codex CLI into a Structured Engineering Assistant
3 min read
Why I built reusable Codex CLI skills to add structure, discipline, and safer workflows to AI-assisted development.