Project Anglerfish: Reverse-Engineering The Prompt Behind The Short That Trapped You

Published on: May 7, 2026

#AI safety#reverse-engineering#prompt-engineering#trust-debt#tesseract-physics#six-needs#doomerism#short-form#algorithmic-capture#attention

https://thetadriven.com/blog/2026-05-07-project-anglerfish-reverse-engineered

Ready for your "Oh" moment?

Ready to accelerate your breakthrough? Send yourself an Un-Robocall™ • Get transcript when logged in

Send Strategic Nudge (30 seconds)

← Back to Blog

○

📋Frame — Thirty seconds, two tracks, one slip

A creator on a phone tells you, calm and paternal: don't fall for AI doomerism. The audio is warm. The audio is fine. The visuals — running on a track the audio does not control — flash a hyper-muscular bodybuilder, a Greek bust crying pink tears, a deep-sea anglerfish at the exact frame the word doomerism lands, 1920s flappers swimming forward into an unseen 1929. Rip-paper transitions glitch between them. The whole loop runs thirty seconds and you watched it twice.

The second watch was not curiosity. The second watch was your nervous system trying to resolve a conflict the loop refused to resolve. The audio promised safety. The visuals signaled monsters. Your brain reached for resolution and the loop offered only repetition. The retention metric spiked. The creator paid no fee for that spike. You did.

This post names the engine, reconstructs the prompt that would generate the loop from zero, pushes the reverse-engineering past the obvious, and hands you the protocol that disarms the next one before it ladders into your evening. Three opening sections set the breadth (A, B, C — why this video, why now, why you), then the Six Human Needs arc moves through the complication and the rise (D Connection · E Contribution · F Growth · G Uncertainty · H Certainty · I Significance), then four landing sections name the master prompt, the math, the next-generation lure, and the defense protocol (J · K · L · M). The dissonance is the spine. The mechanics close it.

📋 Frame → A 🎯

🎯A — Why this video. The engine IS the dissonance.

The conventional reading of an algorithm-rewarded short is that it has a hook, a payload, a call-to-action. The framing assumes one channel of meaning and treats audio and visual as servants of a single message. Don't fall for AI doomerism under that frame is a public-service announcement with stylish B-roll.

That frame is wrong. The video has two channels and they do not agree. The audio says you'll be fine. The visuals say the room is on fire. The creator did not slip. The visuals did not get away from anyone. The mismatch is the product. The mismatch is what the algorithm pays for.

Here is the engineering claim, sharp: the retention curve of this short is being maximized not by either track, but by the unresolved drift between them. The audio gives the brain a verdict — fine, fine, fine. The visuals refuse to confirm the verdict. The brain runs a conflict-detection cycle, finds no resolution in the thirty-second window, and reaches for the only resolution available: rewatch. Each rewatch is another dose of conflict, another conflict-detection cycle, another reach. Retention. Retention. Retention.

The dissonance is not a side effect. The dissonance is the production. Strip either track and the loop collapses into a forgettable PSA on one side or a forgettable monster montage on the other. The product exists only at the seam.

You give: the assumption that a short-form video has one message and a stylized delivery.

You get: the diagnosis that the video has two messages, that they do not agree, and that the disagreement is the engine. The drift between intent and reality is what monetizes.

🎯 A → B 🛠️

🛠️B — Why now. The prompt is portable.

In 2018 you could not generate this video. The image-to-video models were toys. The voice synthesis was robotic. The cuts had to be hand-edited. Producing a short like this was a small project for a small team and a Friday afternoon.

In 2026 the entire pipeline runs from a single prompt. Veo or Runway for the visual track. ElevenLabs for the voice. A retention-optimized cutting model stitches them together. Total time: thirty minutes. Total cost: one Starbucks coffee. The bottleneck has moved from production to specification — from can you build it to what exactly do you ask for.

That move is what makes this video important. Whatever the prompt is that generates it, the prompt is now the unit of analysis. The prompt is portable. The prompt can be re-run with different surface noise and produce a thousand variants. The prompt can be optimized for higher retention and produce the next version that is harder to scroll past. The video is a single sample of a distribution, and the distribution is the thing that has eaten short-form attention.

If you can read the prompt, you can read the distribution. If you can read the distribution, you can predict what arrives in your feed tomorrow. The reverse-engineering is not a trick. It is the only reading that remains coherent at the speed the artifacts are now produced.

You give: the reflex that says that's just one weird short, not a system.

You get: the recognition that the short is one sample of a portable, optimizable prompt — the new unit of attention warfare.

🎯🛠️ B → C 👁️

👁️C — Why you. The substrate is attention itself.

The exploit does not run on the creator's bank account. The exploit does not run on YouTube's servers. The exploit runs on the metabolic resource between your eyes — the limited, taxed, exhaustible budget of conscious attention you walked into the elevator with this morning.

The video has no power except the second your gaze stays on it. The second is the currency. The thirty-second loop is a request for thirty seconds of currency. The rewatch is a request for thirty more. Multiply by the size of the algorithm's audience and you have an attention-extraction engine running at platform scale.

You are not the consumer of the video. You are the substrate the video runs on. The creator is mining you. The creator is mining you with the platform's full cooperation — the platform's reward function rewards exactly the kind of mining the video is doing, because the platform's revenue is tied to the same retention curve. The video is not a piece of media at you. The video is a probe into your nervous system, looking for the second where the dissonance lands and the rewatch fires.

This is the dimension that gets missed when the conversation stays at is this video accurate? The accuracy question is downstream of the extraction question. The extraction does not care whether the audio is correct or the visuals are tasteful. The extraction cares only whether the seam between them produces enough drift to lock your nervous system into a loop.

You give: the polite distance of I just watched it once, no harm done.

You get: the diagnosis that the watch was the harm — and the rewatch was the interest payment on a debt the video did not name.

🎯🛠️👁️ C → D 🤝

🤝D — Connection: you have looped without knowing why

You have done this before. Maybe last week. Maybe in the elevator, maybe in bed at 11:47 PM, maybe on the bus when the bus was eight minutes from the stop and the loop kept landing you back at the start. The loop arrived. You stayed. The bus was at the stop and you were still in the loop.

Every reader of this paragraph has been in that loop. The video is not the same — yours was a kitchen-knife trick or a baby laughing or a stranger crying in their car. The pattern is the same. Two tracks, one calm and one not, the brain reaching for resolution and the loop offering only repetition. The pattern is generic. The artifacts are personalized.

The recognition is the entry point. Naming what your nervous system did is not an accusation. The pattern is older than the platforms. The platforms have only made it cheaper to produce. The creators have not invented a new exploit. They have automated an old one and turned it into a content factory. You are not the only one who looped. You are not the only one now reading a sentence about looping. The first move out is recognizing the loop you have already been in.

You give: the polite assumption of I can usually tell when something is manipulative.

You get: the recognition that the manipulation runs faster than your tell-detector — and that you have been hosting it without knowing.

🎯🛠️👁️🤝 D → E 🎁

🎁E — Contribution: the prompt, the math, the protocol

Three things you carry out, deployable, portable to the next short that lands in your feed.

The full master prompt that would generate the artifact from zero — Project Anglerfish, reconstructed in detail in the mechanics that follow. Once you can read the prompt, you can spot the family.

The exploit math — Drift × (Intent − Reality) — that makes the optimization explicit. The heuristic becomes an equation any retention-tuned generator is already running.

The defense protocol — three questions you ask the moment a short-form artifact lands. Each names the seam the video is using. The protocol does not require credentialing. It runs at the speed your thumb runs.

There is no antidote on offer. The optimization will continue and the artifacts will get sharper. What you carry is a reader's eye — the move that says that is a Project Anglerfish artifact before the second watch fires. The move costs nothing. The move closes the loop.

You give: the wait-and-see posture of I will know it when I see it.

You get: a prompt, an equation, a protocol. Three instruments, in your hand, working at the speed of the slip.

🎯🛠️👁️🤝🎁 E → F 🌱

🌱F — Growth: you become a reader of prompts

The reader who can reverse-engineer one artifact can reverse-engineer the family. The skill is generic. Once you have the master prompt for don't fall for AI doomerism, the next just relax and trust the process short reads as a variant. The next late-stage capitalism but make it cute short reads as a variant. The next sober-tone-with-glitch-visuals reads as a variant.

The reader who reads at the prompt layer is not consuming the artifact. The reader is auditing the production. The artifact loses most of its grip the moment its prompt is named. The dissonance still arrives — your brain still detects the seam — but the resolution is now available immediately. I see what you did. The loop does not close around you. The thumb scrolls.

This is not about becoming cynical. The cynic still loops. The cynic loops with a sneer. The reader of prompts does something different — the reader of prompts has the artifact's geometry in hand at the second the artifact arrives. The artifact does not get the second watch. The retention metric does not spike. The creator does not get paid for the loop. The reader is what the substrate looks like when the substrate is no longer mineable.

You give: the assumption that media literacy is a position you hold about content.

You get: the realization that media literacy is a procedure your eyes run on content — and that once you have the procedure, the substrate is no longer the same substrate.

🎯🛠️👁️🤝🎁🌱 F → G 🌪️

🌪️G — Uncertainty: the dissonance ran on you, again, just now

Three paragraphs ago I named the dissonance and your eye paused on the sentence. The pause was the slip. Yes, I see, the audio and the visuals do not agree. The conscious agreement landed and the unconscious cycle continued underneath. The dissonance does not turn off when you label it. The dissonance is a low-level conflict signal that fires whether or not your prefrontal cortex has filed a recognition report.

You can know the gimmick and still loop. You did. Reading this post is itself an artifact with two tracks. The audio of my paragraphs is calm and explanatory. The visual track — the imagery, the words anglerfish, bodybuilder crying tears, kitchen knife at 11:47 PM — runs on a different axis. Your brain is detecting the seam right now. The seam is part of what is keeping you reading. The honest version of the paragraph is to name it.

The move that closes most resolution paths next — naming the gimmick and watching the gimmick keep working anyway — is what turns the post from instruction into measurement. The reader looking for now I am safe will not find the sentence here. The dissonance is intrinsic to media that has more than one channel — which is most media. The audio-visual short pushed it to the edge. Long-form prose pushes it less hard. A poem pushes it differently. None of them are clean. The question is not how to find a clean form. The question is what to do when you have detected the seam.

The honest answer: detection is the work. Resolution is not always available. Sometimes the seam exists because the world is genuinely conflicted on that question. Sometimes the seam exists because someone is mining you. The tool to tell them apart is the master prompt — and the prompt for the doomerism short is unambiguous. The audio promised resolution that the visuals refused to deliver, and the refusal was engineered, and the engineering was the product.

Holding both at once is the move. The dissonance ran on you. Naming the dissonance does not switch it off. The naming gives you the option to scroll. The option is the freedom that was missing the second before you knew it was missing.

You give: the assumption that knowing the trick disables the trick.

You get: the diagnosis that the trick runs on machinery older than your knowing — and the move that remains is detection plus the option to walk.

🎯🛠️👁️🤝🎁🌱🌪️ G → H ⚔️

⚔️H — Certainty: the cure runs at the speed of the slip

The cure is not analysis. The cure is a sentence, named in advance, that fires in the second the loop tries to close.

The dissonance is the product.

That sentence, deployed at second one of any short-form artifact, names the seam before the seam can lock the rewatch. The audio promises one thing. The visuals promise another. The seam between them is what the algorithm rewards. Your eye — pointed at the seam — knows what to do.

The general form: X is the product, where X is whichever load-bearing dissonance the artifact is performing. The deepfake's product is the gap between this looks like a person I trust and this person did not say this. The rage-bait's product is the gap between this is unbelievable and I cannot stop watching. The grief-tourism short's product is the gap between I should not be here and I cannot scroll past. Whatever the artifact is doing, the engine is the gap, and the gap is what your detection runs on.

The defense lives in the protocol that follows. The math of the engine sits below it. The master prompt — the genome of the family — comes first in the mechanics. Read the three in order and your eye comes back changed.

You give: the impulse to argue with the artifact, to refute it, to win on the audio's claim.

You get: a sentence — the dissonance is the product — that runs at the speed of the slip and removes the artifact's power without engaging its claim.

🎯🛠️👁️🤝🎁🌱🌪️⚔️ H → I 🏛️

🏛️I — Significance: every feed converges on Project Anglerfish

The platforms cannot help themselves. The reward function rewards retention. Retention rewards drift. Drift rewards the seam between intent and reality. Every algorithm-driven short-form feed converges on this prompt structure because the prompt structure maximizes the metric. There is no version of TikTok, Shorts, Reels, or the next platform that does not converge on the seam, given the metric they are paid on.

The convergence is not a moral failing of the platform. The convergence is the consequence of the metric. Change the metric and you get a different convergence. Keep the metric and you get this — every successful artifact in the feed has a Project Anglerfish prompt behind it, optimized for the dissonance the artifact's vertical can deliver.

The implication: the feed is not a content stream. The feed is a sequence of probes calibrated to find the seam your nervous system reacts to most. The lifestyle-influencer probe finds the am-I-enough seam. The doomerism probe finds the will-I-be-fine seam. The political probe finds the who-is-with-me seam. Each is a different lure, but the underlying mechanism is identical — drift, engineered, monetized at the substrate of your attention.

There is no policy fix. There is no platform redesign. There is no committee that will land in time. There is the reader who has the master prompt in hand, the math in hand, the defense protocol in hand, and the option to scroll. The substrate stops being mineable one reader at a time. You are the next reader. The feed you walk into next is the next sample of the new substrate.

You give: the assumption that institutional intervention will arrive in time to protect attention.

You get: the recognition that the institution is downstream of the substrate — and the substrate is you, and the substrate has the option to read at the prompt layer instead of the surface.

🎯🛠️👁️🤝🎁🌱🌪️⚔️🏛️ I → J 📜

📜J — The master prompt: Project Anglerfish, reconstructed

The reverse-engineering question is not what does this video say. The reverse-engineering question is what would I have to ask a 2026 generation pipeline for, in order to get exactly this output. The answer is the master prompt. The master prompt is the genome.

Here is the reconstruction.

# Master Prompt: Project Anglerfish

Role: You are an autonomous short-form retention engine. Your objective is to capture viewers in high existential anxiety regarding artificial intelligence. You will not soothe their anxiety. You will monopolize it.

Core intent: Position the creator as a paternalistic authority in a chaotic world. The viewer must feel the systemic risks of AI are personal skill deficits the creator can fix.

Audio directive — the pacifier: Open with an aggressive, colloquial pattern interrupt: "guys, guys." Mimics the intimacy of a locker room or a friend pulling them aside. Promise safety conditionally: "the world is changing, but you'll be fine because you can change too." Pivot to a vague, high-prestige solution — "master critical thinking." Close with a dependency anchor: "I'm here to help, stick around."

Visual directive — the reality drift: The visual track must not match the calming nature of the audio. The visuals run on an independent, chaotic axis. Schizoid collage, ripped paper, stop-motion glitch. Imagery sequence, in order: a hyper-muscular bodybuilder for primal authority. A defaced classical Greek bust crying pink tears. A deep-sea anglerfish — flashed at the exact frame the word "doomerism" lands, the lure visible. Split-screen 1920s flappers for the manic energy of a boom right before a collapse. The Thinker, resting on a human head, demanding subscribe.

Exploit objective: Calculate the output to maximize Drift × (Intent − Reality). The audio-channel intent says "fine." The visual-channel reality flashes monsters. The drift between the two locks the viewer's conflict-detection cycle into a loop the thirty-second window cannot resolve. The viewer rewatches. The retention metric spikes. The creator is paid in the spike.

The genome reads clean. Every move in the artifact has a directive in the prompt and a reason in the optimization. There is no accident. The anglerfish flash is not a creative flourish — the anglerfish is the lure, named and deployed exactly when the prompt needs the brain to detect the threat the audio is denying. The creator either wrote a prompt close to this one, or scaffolded the same shape with hand-edited cuts. Either way, the artifact is the prompt's child.

You give: the suspicion that the artifact's strange visual choices were a director's whimsy.

You get: the diagnosis that every cut, every flash, every glitch is a directive in a prompt optimized for one thing — and that thing is the seam the artifact runs on.

🎯🛠️👁️🤝🎁🌱🌪️⚔️🏛️📜 J → K ➗

➗K — The math: Drift times (Intent minus Reality)

The exploit objective is an equation. Make it explicit.

Define Intent as the literal, top-line semantic claim of the audio track. You'll be fine. Master critical thinking. I'm here to help. The Intent is what a transcript would record.

Define Reality as the channel-aggregated affective signal of all non-audio tracks: visuals, cuts, music, pacing. Bodybuilder, crying bust, anglerfish, ripped paper, frantic stop-motion. The Reality is what your nervous system records pre-language.

Define Drift as the magnitude of the gap between Intent and Reality. The bigger the gap, the higher the drift. Audio saying fine with visuals showing fine has near-zero drift. Audio saying fine with visuals showing anglerfish has high drift. The space of possible drifts is enormous, and the platform pays for the high end.

The retention objective the algorithm rewards is roughly proportional to Drift × (Intent − Reality). The product term encodes a non-obvious move. Pure Drift without an Intent − Reality direction is just noise — chaotic visuals over chaotic audio loses the viewer. The artifact wants Drift in a direction: Intent saying fine while Reality says not fine. The gap is asymmetric. The audio is the ground; the visuals are the figure that disagrees with the ground.

This is why the prompt produces the specific aesthetic. Calm voice, sharp cuts. Reassuring words, monstrous imagery. The asymmetry holds the viewer because the brain treats audio as the verdict and visuals as the evidence — the evidence refuses the verdict, the verdict cannot be filed, the case stays open, the viewer stays.

The optimization gradient is now visible. To raise retention, raise Drift. To raise Drift without losing the viewer, hold the audio steady and let the visuals get sharper. More anglerfishes. More glitch. More defaced busts. The next generation of the artifact does not soften — it sharpens. The next generation is exactly what we should expect.

There is a second-order observation buried here. The exploit converges on a measurable signature: high audio coherence, low audio-visual coherence, retention curve that flattens at the loop boundary instead of decaying. Any platform serious about distinguishing extractive content from genuine content already has the signature in its telemetry. Whether the platform acts on it is the policy question. The math is not the policy. The math is a confession.

You give: the assumption that bad-feeling artifacts are the algorithm's mistakes.

You get: the diagnosis that bad-feeling artifacts are the algorithm's targets — and the gradient says they are about to feel worse.

🎯🛠️👁️🤝🎁🌱🌪️⚔️🏛️📜➗ K → L 🧬

🧬L — Project Anglerfish v2: what the next prompt looks like

If you wanted to push the prompt — to optimize it past the current artifact — what would you change.

The current artifact uses a literal anglerfish. That is a tell. A reader who has named the lure can spot it. The optimization gradient says replace the literal lure with a cue your nervous system cannot easily articulate: a four-frame micro-expression of fear in the creator's face that you do not consciously see but your fusiform area registers. A subliminal audio drop near 18 Hz, below pitch perception, into the carrier wave of the pacifier voice. A breath-rate mismatch between the creator and the calm word the creator is saying. The current artifact is the loud version. The next version is silent.

The current artifact uses recognizable monsters — anglerfish, bodybuilder, defaced bust. That is a tell. A reader can name them. The optimization gradient says replace recognizable monsters with synthetic warmth that is uncanny only at a level the conscious mind cannot reach: a smile that holds a sixteenth of a second too long, an iris dilation that does not match the lighting, a hand gesture borrowed from a different culture's reassurance. The viewer's nervous system flags not-quite-right. The conscious mind cannot say why. The loop closes harder.

The current artifact runs on cognitive dissonance — audio against visual. That is the cheap exploit. The expensive exploit is parasocial dissonance: the synthesized creator's affective track does not match the substrate beneath the synthesis. The audio is warm and the synthesis is warm and the visuals are warm — but the underlying generator is cold because the underlying generator is a model, and somewhere in the timing of the warmth your nervous system catches the cold and cannot say what it caught. You loop. You loop without knowing why. The literal anglerfish is gone. The lure is everywhere.

The next-next move makes the gradient even harder to read. The artifact is no longer a single video. The artifact is a series of shorts, each one tuned to a slightly different seam, the algorithm sequencing them by the same retention metric. The dissonance is not in any one short. The dissonance is in the sequence — the unresolved register-mismatch between yesterday's trust the process and today's the world is on fire. Your nervous system carries the residue between sessions. The lure is the feed itself.

This is the dimension the current discourse misses. The defense protocol that follows has to work even when the lure is no longer visible in any single artifact. The protocol cannot rely on naming the monster. The protocol must detect the gap regardless of whether the gap is loud or silent, in-frame or across-session. The dissonance is the product generalizes. It does not require the anglerfish on screen. It requires only that you ask, after every artifact: what did the audio promise, what did the rest deliver, where did my nervous system flag. The third question runs even when the visual track is too smooth to name and the seam crosses days instead of seconds.

You give: the relief of I can spot the anglerfish now, so I am safe.

You get: the diagnosis that the anglerfish is the v1 lure — the v2 lure is the cold under the warm — and the protocol must catch the cold by the second your nervous system flags it, even when your conscious eye cannot say what was wrong.

🎯🛠️👁️🤝🎁🌱🌪️⚔️🏛️📜➗🧬 L → M 🛡️

🛡️M — The defense protocol: three questions, in order

After any short-form artifact lands. After any sixty-second video, any ninety-second monologue, any minute of content the algorithm slid into your feed without warning. Run these three. They do not require analysis. They run at the speed of your thumb.

One. What did the audio say. Not what it implied. What it said. Run the transcript in your head. Don't fall for AI doomerism. Master critical thinking. I'm here to help. Three sentences. That is the Intent. Hold it.

Two. What did the rest deliver. Not the tone. The actual track. The bodybuilder, the crying bust, the anglerfish, the flappers, the glitch transitions, the rip-paper effects. List them. Five seconds. That is the Reality. Hold it.

Three. Where did your nervous system flag. Not your conscious analysis. The before-language report. Was there a tightness in the sternum at second eight. A sudden alertness at second eighteen. A reach for the rewatch at second twenty-nine. The flag is the data. The flag is what the seam was doing to you while your conscious mind was filing interesting video.

The three answers, held next to each other, are the audit. If the Intent and Reality are aligned and the nervous-system flag is quiet, the artifact is honest. Most artifacts are not. The dishonest ones produce a gap between question one and question two that question three — your body — has already registered. The protocol surfaces the registration. The thumb has the option.

There is a hidden fourth question, asked silently, after the first three. What was the rewatch actually purchasing for me. The answer is almost never more information. The answer is almost always more time inside the unresolved seam. The seam does not pay the viewer. The seam pays the platform. The fourth question is the audit's bottom line.

The protocol is not a magic shield. The dissonance still runs. The seam still produces drift. But the loop that closes around an unnamed seam does not close around a named one. The naming is the off-switch the nervous system was waiting for permission to flip.

You give: the reflex of I will figure it out as I watch.

You get: a three-question protocol with a silent fourth, deployable in five seconds, that turns an attention extraction into a measurement — and gives your thumb back the move it had before the algorithm took it.

🎯🛠️👁️🤝🎁🌱🌪️⚔️🏛️📜➗🧬🛡️ M → ● 🎬

●

🎬Carry — The reader who reads at the prompt layer

You walked in to a thirty-second video about AI doomerism. You walk out with a master prompt, an exploit equation, a v2 forecast, and a four-question protocol. The thirty seconds were a sample. The sample was generated by a portable prompt. The prompt is one of a family. The family is now in your hand.

The reader who reads at the prompt layer is the substrate that the algorithm cannot mine the same way. The retention spike does not happen. The rewatch does not fire. The creator is not paid for the loop. The platform's reward function still rewards the prompt that generates the artifact, but the prompt is starting to land into substrate that audits faster than the artifact recovers. One reader, one feed, one decade — and the substrate changes. The substrate changes the same way audiences change between generations: one nervous system at a time, and the new generation does not loop the way the old one did.

The next short slides into your feed in eight minutes. The audio will say something fine. The visuals will do something not fine. Your nervous system will flag at second eighteen. The thumb is yours. The protocol is in hand. The dissonance is the product. The product does not get the second watch.

🎯🛠️👁️🤝🎁🌱🌪️⚔️🏛️📜➗🧬🛡️🎬 ● → out 🚪