Devs Are Making Claude Talk Like a Caveman to Cut Costs—And It Works


In brief

A developer discovered that forcing Claude to talk like a caveman slashes output tokens, and so costs, by up to 75%.
The internet instantly turned it into a GitHub skill.
With Anthropic charging so much per output token, grunt mode is less of a joke and more of a budget strategy.

Somewhere between prompt engineering and performance art, a developer posted a discovery on Reddit that made the AI community laugh before paying attention: teach Claude to speak like a prehistoric human and watch your token bill shrink by up to 75%.

The post hit r/ClaudeAI last week and has since racked up over 400 comments and 10K votes, a rare combination of genuine technical insight and absurdist comedy that the internet tends to reward.

The mechanic is simple. Instead of letting Claude warm up with pleasantries, narrate each step it takes, and close with an offer to help further, the developer constrains the model to short, stripped-down sentences. Tool first, result first, no explanation. A normal web search task that would run about 180 output tokens dropped to about 45. The original poster claims up to a 75% reduction in output, achieved by making the model sound like it just discovered fire.
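The 75% figure follows directly from the two token counts quoted above. A minimal sketch of the arithmetic, where the function name `output_reduction` is purely illustrative:

```python
def output_reduction(before_tokens: int, after_tokens: int) -> float:
    """Return the fractional drop in output tokens (0.75 means 75% fewer)."""
    if before_tokens <= 0:
        raise ValueError("before_tokens must be positive")
    return (before_tokens - after_tokens) / before_tokens


# Figures quoted for the web search task: about 180 tokens before, about 45 after.
print(f"{output_reduction(180, 45):.0%}")  # 75%
```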

In caveman terms, as one Redditor said: “Why waste time say lot word when few word do trick?”

What this technique does not touch is the input context: the full conversation history, attached files, and system instructions that the model re-reads on every single turn. That input typically dwarfs the output, especially in longer coding sessions. Counting all this input, real-world sessions see savings of around 25%, not 75%. Still meaningful, just not the headline number.
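To see how a 75% cut in output can shrink to roughly 25% of the total bill, here is a rough sketch with made-up numbers. The 1,800-token input size and the 1:5 input-to-output price ratio are assumptions chosen for illustration, not measurements from the Reddit thread or quoted prices, and `session_cost` is a hypothetical helper.

```python
def session_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 1.0, output_price: float = 5.0) -> float:
    """Cost of one turn in relative units; the 1:5 price ratio is an assumption."""
    return input_tokens * input_price + output_tokens * output_price


# Hypothetical turn: 1,800 input tokens re-read either way,
# 180 output tokens in normal mode versus 45 in caveman mode.
before = session_cost(1800, 180)
after = session_cost(1800, 45)
saving = (before - after) / before

print(f"{saving:.0%}")  # 25%
```

The input cost is the same in both cases, so the larger the context grows relative to the reply, the smaller the overall saving becomes.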

It’s also a good idea to feed the model normal instructions. Don’t give it the “caveman” talk yourself, as it could spiral down into a “garbage in, garbage out” situation.

There is also the question of quality degradation. A handful of researchers in the thread argued that forcing an AI to inhabit a less sophisticated persona could actively hurt its reasoning quality, and that the verbal constraints might bleed into cognitive ones. The concern has not been definitively settled, but it is worth considering when evaluating results.

Skill good, skill go viral

Despite the caveats, the technique found a second life on GitHub almost immediately.

Developer Shawnchee packaged the rules into a standalone caveman-skill compatible with Claude Code, Cursor, Windsurf, Copilot, and over 40 other agents. The skill distills the approach into 10 rules, including: no filler phrases, execute before explaining, no meta-commentary, no preamble, no postamble, no tool announcements, explain only when needed, let code speak for itself, and treat errors as things to fix rather than narrate.
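As a rough illustration of how rules like these could be folded into a system prompt, the sketch below numbers the rules named above and joins them into one string. `RULES`, `build_system_prompt`, and the header sentence are hypothetical and are not taken from the caveman-skill repo, whose actual wording may differ.

```python
# The rules named in the article. The prompt wording around them is an
# illustration, not the contents of the actual caveman-skill repo.
RULES = [
    "No filler phrases.",
    "Execute before explaining.",
    "No meta-commentary.",
    "No preamble.",
    "No postamble.",
    "No tool announcements.",
    "Explain only when needed.",
    "Let code speak for itself.",
    "Treat errors as things to fix, not narrate.",
]


def build_system_prompt(rules: list[str]) -> str:
    """Join the rules into a numbered block suitable for a system prompt."""
    numbered = "\n".join(f"{i}. {rule}" for i, rule in enumerate(rules, start=1))
    return "Follow these response rules:\n" + numbered


prompt = build_system_prompt(RULES)
print(prompt)
```

The resulting string would be passed as the system instruction; the user's own messages stay in normal English.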

Benchmarks in the repo, verified with tiktoken, show output token reductions of 68% on web search tasks, 50% on code edits, and 72% on question-and-answer exchanges, for an average output reduction of 61% across four standard tasks.

A parallel repo by developer Julius Brussee took a slightly different approach, framing the same idea as a SKILL.md file with 562 stars on GitHub. The spec: respond like a smart caveman; cut articles, filler, and pleasantries; keep all technical substance. Code blocks remain unchanged. Error messages are quoted exactly. Technical terms stay intact. Caveman only speaks the English wrapper around the facts.

This one even comes with different modes to control how much you want to strip, switching between Normal, Lite, and Ultra. The models do the exact same work but provide a much shorter answer, which adds up to a big saving over time.

The broader cost context gives the joke a sharper edge. Anthropic's models are among the most expensive in terms of price per token. For developers running agentic workflows with dozens of turns per session, output verbosity is not a stylistic complaint. It is a line item. If a caveman grunt can replace a five-sentence summary of what the model just did, those saved tokens add up across thousands of API calls.

The caveman skill is installable in one command via skills.sh and works globally across projects. Whether or not it makes Claude marginally less articulate, it has already made a lot of developers significantly less annoyed.