Tag: ai

Re-running my RTX 5090 LLM benchmark on Ollama 0.30

Ollama 0.30 delivered dramatically better performance on my RTX 5090, especially for Qwen 3.6 35B-A3B.

Measured real-world context scaling performance of Qwen 3.6 27B and Qwen 3.6 35B-A3B on a local RTX 5090 using Ollama.

Practical, real-world techniques for reducing token usage in AI agents by controlling context, input, and output.

Why I built reusable Codex CLI skills to add structure, discipline, and safer workflows to AI-assisted development.