The Blog of Daniel J Hand

Configuring OpenClaw for Hybrid Search with a Privately Hosted Embedding Model

Unless you are using OpenClaw with OpenAI, your current configuration may not be taking advantage of its inbuilt support for RAG. Without access to an embedding service, OpenClaw falls back to full-text search only. In this article, I show the key steps required to take advantage of hybrid search by …

more ...

Ornith-1.0: A New Kind of Leaderboard Contender—or Just Smarter Reward Hacking?

Ornith-1.0, released by DeepReinforce.ai in late June, reported strong benchmark results: the 397B variant achieved 77.5 on Terminal-Bench 2.1 and 82.4 on SWE-Bench Verified, outperforming Claude Opus 4.7 and comparable open models of similar size. Is Ornith just another leaderboard contender, or does it …

more ...

GLM-5.2 and the New Economics of Open Frontier Models

GLM-5.2 has generated significant interest since its release earlier this month. It combines strong published benchmark results with a 1M-token context window and open-weight availability under the MIT license. This has enabled hosted access and local quantisation options for high-memory systems.

Z.ai's published benchmark card for GLM-5.2 …

more ...

AI and the Future of Creative Work

People often argue that AI, like many of the technologies that preceded it, will simply augment the workforce—and while some jobs will be replaced, many new jobs will be created.

I agree with that broad view, but I think it misses something important: AI is no longer just a …

more ...

The AI Scaling Bottleneck: Moving from Human-In-The-Loop to Human-On-The-Loop

Treating AI as a synchronous, human-gated process kills the very efficiency it promises. If organisations are going to actually scale these workflows, they need to transition from Human-In-The-Loop (HITL) to Human-On-The-Loop (HOTL).

Most of us are familiar with the concept of a Human-In-The-Loop (HITL). A predefined process workflow has one …

more ...

Yak Shaving for Token Speed: Chasing MTP and TurboQuant on Linux

Inspired by the performance gains and resource efficiency of Multi-Token Prediction (MTP) and KV cache quantisation, I started what would become another yak-shaving endeavour.

I was an early adopter of Ollama to host LLMs on my MacBook and home Ubuntu server. Between the easy access to the latest models, reliable …

more ...

Why Outsourcing Your Thinking Impacts Your Understanding

You can outsource your thinking, but you can't outsource your understanding.

Andrej Karpathy shared this thought at AI Ascent 2026, and it struck a chord.

Today, it's never been easier to outsource cognitive tasks. We use AI to research, summarise, and draft. It drives incredible productivity. But can we achieve …

more ...

Leading Tech Teams in the Age of AI - Why the What and Why Now Matter More Than Ever

Leading technical teams across large geographies like APAC requires a tricky balancing act: thinking strategically while staying tactically close to the technology. Nowhere is this dichotomy more apparent right now than in Data and AI. It feels like a month in AI is comparable to a year in other disciplines …

more ...

Rethinking SLMs: Separating Reasoning from Knowledge

How small can Small Language Models (SLMs) get before they stop being useful?

This is a critical question for edge AI. Currently, even small models waste valuable space memorising an extensive, if lossy, "offline Wikipedia" of facts. Is this embedded knowledge an asset, or just bloat?

I believe we must …

more ...

The Invisible Hand

It’s not quite 6:00 am. It's still dark outside, but my attention has been occupied for the last couple of hours. I find I'm able to get most of my chores done while the family is still asleep.

My focus since late last night has been the upcoming …

more ...