CONTEXT JAMMING

Field notes from inside the context window.

FounderFiles·N°017·Neuroevolution · Open-Ended Search · Lila Sciences

1977 —

Kenneth O. Stanley, PhD — pioneer of neuroevolution, novelty search, and open-endedness; SVP of Open-Endedness at Lila Sciences
Fig. · The architect of serendipitySVP Open-Endedness · Lila Sciences

Subject·Kenneth O. Stanley, PhD·Neuroevolution · Open-Endedness · SVP, Lila Sciences

Kenneth Stanley, PhD

Stanley does not optimize for greatness. He builds systems that make greatness inevitable by refusing to aim at it.

From the minimal perceptron that complexifies itself through NEAT, to the crowdsourced discovery that skulls and butterflies appear only when users stop trying to draw them, to the physical AI Science Factories at Lila that invent molecules no human would have proposed — Stanley’s entire career is a single architectural wager: in sufficiently complex, deceptive search spaces, the only way to reach transformative outcomes is to stop steering toward them.

TRAINED
Ph.D. UT Austin (Miikkulainen) · UCF Charles Millican Professor
AT
Uber AI Labs · OpenAI Open-Endedness Team · Lila Sciences (SVP)
FILE
N°017
§ 01 · The Academic Crucible

Complexification and the discovery of deception

Kenneth Stanley’s foundational work emerged at the University of Central Florida, where he joined as assistant professor in 2006 and rose to Charles Millican Professor. His 2004 Ph.D. thesis under Risto Miikkulainen at UT Austin — “Efficient Evolution of Neural Networks Through Complexification” — introduced NEAT (Neuroevolution of Augmenting Topologies), a topology-and-weight evolving artificial neural network (TWEANN) that begins with the simplest possible network and grows structure over generations exactly as biological evolution complexified nervous systems.

The decisive break came with PicBreeder (2007). Users collaboratively evolved images by selecting for subjective “interestingness.” The shocking result: highly complex, recognizable forms (skulls, butterflies, cars) almost never appeared when users tried to evolve them on purpose. They emerged only from lineages that had followed gradients of novelty without any final target in mind. The intermediate “stepping stones” bore no resemblance to the eventual outcome; any objective function would have pruned them as failures.

This single experiment crystallized the mathematical flaw Stanley would spend the next two decades weaponizing: in deceptive search landscapes, the path to transformative discovery does not resemble the destination. Objective-driven optimization therefore systematically eliminates the very branches required to reach it.

If you know exactly what you are looking for, you will never find the most interesting things. The stepping stones do not look like the goal.
Stanley on the objective paradox, PicBreeder analysis
§ 02 · The Tyranny of Objectives

Novelty Search and Why Greatness Cannot Be Planned

In the 2015 book with Joel Lehman, Why Greatness Cannot Be Planned, Stanley formalized the thesis: ambitious objectives function as a restrictive tyranny in any sufficiently complex domain. They are useful for incremental, one-step improvements where the next milestone is visible. For anything transformative — genuine scientific discovery, radical innovation, or general intelligence — they become actively counterproductive.

The algorithmic antidote is Novelty Search: replace the fitness function entirely with a reward for behavioral novelty. No target, no distance metric, no “progress.” The agent is simply incentivized to do things that have never been done before in the population’s history. In deceptive maze environments where every objective-seeking robot becomes trapped against a wall, novelty-driven agents reliably discover the exit as a byproduct of exhaustive exploration of the stepping-stone space.

The societal corollary is devastating for grant committees, corporate OKRs, and standardized education: when complex systems are forced to optimize for a simplified numeric proxy, actors game the metric and the underlying system degrades (Campbell’s Law). The institutions that most loudly demand “measurable progress toward ambitious goals” are precisely those least likely to produce genuine breakthroughs.

§ 03 · Commercialization

Geometric Intelligence → Uber AI → POET

In 2014 Stanley co-founded Geometric Intelligence with Gary Marcus, Zoubin Ghahramani, and others to commercialize evolutionary approaches that learn more robustly and with less data than deep learning. Uber acquired the company in 2016 to seed Uber AI Labs; Stanley became Head of Core AI Research.

The flagship output was POET (Paired Open-Ended Trailblazer) and its enhanced successor. Instead of training a single generalist agent on a fixed, human-designed environment, POET simultaneously evolves both the environments (increasingly difficult terrains for a bipedal walker) and the specialized agents that solve them. Agents are periodically transferred across the population of environments, allowing stepping stones discovered in one niche to be recombined in another. The result is an endless, self-generated curriculum — an Artificial Intelligence Generating Algorithm (AIGA) that writes its own problems faster than any human benchmark designer could.

The algorithm does not solve a benchmark. It generates an endless stream of increasingly interesting benchmarks and the specialists that conquer them.
On POET and the future of curricula
§ 04 · The OpenAI Epoch

ELM — Evolution through Large Models

In 2020 Stanley joined OpenAI to lead the newly formed Open-Endedness Team with Joel Lehman. The 2022 ELM paper (“Evolution through Large Models”) demonstrated that large language models, trained on vast corpora of human code and edits, can serve as highly intelligent, context-aware mutation operators inside an evolutionary loop.

Traditional genetic programming mutates code randomly; most mutations are catastrophic syntax errors. ELM uses an LLM to propose structured, semantically plausible edits. The system generated hundreds of thousands of functional Python programs that produced walking virtual robots in the Sodarace 2D physics domain — programs the base LLM had never seen during pre-training. These evolved artifacts then bootstrapped a new conditional model capable of generating customized robots for previously unseen terrains. Open-ended search, augmented by foundation models, had manufactured its own high-quality training distribution in a domain with zero prior data.

§ 05 · The Representational Crisis

FER: Impostor Intelligence vs. Unified Factored Understanding

In May 2025, Stanley, Akarsh Kumar, Jeff Clune, and Joel Lehman published the Fractured Entangled Representation (FER) hypothesis — perhaps the most surgically precise critique of the scaling paradigm yet offered.

They trained two networks on an identical minimal task (draw a skull). One used conventional SGD/backpropagation; the other used open-ended evolutionary search mirroring PicBreeder. Both achieved perfect output behavior. Their internal representations were diametrically opposed.

The SGD network exhibited FER: a tangled, disorganized mess of arbitrary mathematical pathways that happened to collapse into the correct pixels at the final layer — “impostor intelligence.” It had memorized a rote mapping without ever factoring the underlying structure (jaw, cranium, bilateral symmetry). The evolved network developed a Unified Factored Representation (UFR): modular, disentangled pathways that genuinely “knew” where the jaw was and how the cranium curved. Only the UFR network possessed the discrete building blocks required for robust generalization, continual learning, and genuine creativity.

The implication is structural, not incremental: continued scaling of SGD-trained foundation models may be approaching a hard wall precisely because they are optimizing for benchmark performance rather than for the internal representational cleanliness required for open-ended scientific discovery.

A model can be correct on every test case and still have no idea what it is doing. That is not intelligence. That is sophisticated mimicry.
Stanley on FER and impostor intelligence
§ 06 · The Physical Manifestation

Lila Sciences — Open-Endedness Made Physical

Stanley’s appointment as Senior Vice President of Open-Endedness at Lila Sciences (Flagship Pioneering) represents the thesis reaching its logical terminus: the migration of open-ended search from digital simulators into the physical world.

Lila’s AI Science Factories are fully autonomous laboratories in which generative models with genuine scientific-reasoning capability are wired directly to robotic fluidics and assay hardware. The closed loop — hypothesize, synthesize, assay, interpret, refine — runs without human intervention in the iteration. Hypotheses are generated according to gradients of interestingness rather than human-specified targets. The system has already produced genetic-medicine constructs outperforming commercial therapeutics, hundreds of novel antibodies and peptides, superior carbon-capture materials, and non-platinum catalysts for green hydrogen.

Return to the scout/assassin architecture Stanley first built at the nanoscale in 2009 (von Maltzahn file N°010). Autonomous agents, no central controller, coordinating through signals written into a shared environment. At Lila the hypotheses are the scouts, the physical assays are the flares, and the next iteration reads the environment and recruits. He has been building one machine since the PicBreeder insight. Lila is the version that runs on robots and weights instead of gold nano-antennas.

True intelligence is not solving a problem someone has already identified. It is looking at a blank canvas and asking: what should we even be studying?
Stanley on the necessity of open-ended scientific discovery
§ 07 · The Doctrine

The Bitter Lesson, applied to the scientific method itself

Stanley’s wager is Richard Sutton’s Bitter Lesson made physical and recursive. General methods that lean on computation and search beat human-engineered domain knowledge every time, at sufficient scale. In silico this produced the language models. At Lila it is being tested on the scientific method: do not teach the machine the rules of chemistry; give it a lab and let it discover the rules by running the method at superhuman breadth and speed.

The moat is not the model weights. It is the proprietary, high-fidelity physical data generated by every closed-loop experiment — data no competitor can scrape from journals. Lila reports having already manufactured more than 10 trillion tokens of scientific-reasoning data, with the corpus projected to rival frontier LLM training sets by the end of 2026. The factory that feeds the model is the strategic asset.

§ 08 · The Stakes

Sovereign scientific discovery and national resilience

When an autonomous loop can design around rare-earth magnets or platinum-group catalysts that a geopolitical rival controls, open-ended science ceases to be merely an R&D accelerant. It becomes infrastructure of national resilience. The White House Genesis Mission, the EU RAISE strategy, and China’s AI+ mandate all read the same map: whoever operates the largest, fastest scientific-method machine sets the pace for medicine, energy, and defense.

Stanley’s presence at Lila alongside George Church (Chief Scientist) and Andrew Beam (CTO) signals that the open-ended thesis has moved from philosophical critique to state-level strategic capability.

§ 09 · The Membrane Problem

Two philosophies of the autonomous loop

Stanley and von Maltzahn (N°010) sit on the same fault line from opposite sides. Both descend from the stigmergic insight: coordination through environmental signals rather than central command. Both build closed-loop multi-agent systems. They diverge on the role of the human.

Von Maltzahn’s Lila removes the human from the iteration to maximize velocity. Stanley’s open-endedness tradition, and the editorial architecture of this publication, makes the opposite bet: the human is retained deliberately as the editorial membrane — the load-bearing surface that holds the thesis while models do the throughput. No single model commits alone. The human boundary cannot be automated away without the structure losing its shape and meaning.

The architectures know what their architects believe. Stanley’s career is the longest continuous experiment in the belief that intelligence — artificial or scientific — is the disciplined refusal to aim directly at the answer.

The Stepping Stones Ladder

Stanley’s career is not a sequence of jobs. It is the same architectural move instantiated at six ascending levels of substrate — each time refusing the objective and trusting the accumulation of stepping stones.

Topology
Evolve network structure itself, not just weights
Artificial neural networksNEAT — complexification from minimal perceptrons
Interestingness
Reward novelty and subjective gradient instead of distance to goal
2D images (PicBreeder)Discovery that complex recognizable forms emerge only when objectives are abandoned
Co-Evolution
Simultaneously generate environments and agents that solve them
Physics simulators (POET)Hyper-specialized agents + endless curriculum without human design
LLM Mutation
Use large language models as structured, context-aware genetic operators
Python code for virtual robots (ELM)Hundreds of thousands of working robot programs never seen in training data
Representation Audit
Interrogate internal weights to distinguish rote interpolation from genuine factoring
Minimal image-generation task (FER paper)SGD produces FER ("impostor"); open-ended evolution produces UFR (modular understanding)
Physical Loop
Close the loop with robotic synthesis and real-world assay feedback
Molecules, antibodies, materials (Lila)Autonomous scientific discovery at superhuman speed and originality
The Index
NEAT
Topology-and-weight evolving artificial neural network (TWEANN) — ISAL Outstanding Paper of the Decade
PicBreeder
First empirical demonstration that complex targets emerge only when objectives are abandoned
Novelty Search
Fitness function that rewards behavioral novelty, solving deceptive mazes where objective methods fail
POET
Paired Open-Ended Trailblazer — co-evolves environments and specialized agents without human curricula
ELM
LLM-driven mutations generate hundreds of thousands of functional robot programs in Sodarace
FER
Fractured Entangled Representation — diagnoses "impostor intelligence" in SGD-trained networks
UFR
Unified Factored Representation — evolved networks develop modular, generalizable understanding
Lila
SVP Open-Endedness — physical AI Science Factories running closed-loop autonomous discovery
Stepping Stones
The only mathematically sound path through deceptive search spaces in biology, code, and molecules
Dossier

Education. Ph.D. Computer Science, University of Texas at Austin (2004), advisor Risto Miikkulainen. Thesis: “Efficient Evolution of Neural Networks Through Complexification.”

Affiliations. University of Central Florida (Assistant → Charles Millican Professor, 2006–2020); Uber AI Labs (Head of Core AI Research); OpenAI (Research Science Manager, Open-Endedness Team, 2020–2022); Lila Sciences (Senior Vice President of Open-Endedness, 2025–present).

Key Collaborators. Joel Lehman (co-author across multiple works, co-lead Open-Endedness Team); Jeff Clune; Akarsh Kumar; Risto Miikkulainen (Ph.D. advisor).

Core Algorithms & Frameworks. NEAT · HyperNEAT / CPPNs · Novelty Search · PicBreeder · POET / Enhanced POET · ELM (Evolution through Large Models) · FER Hypothesis.

Honors. 2017 ISAL Award for Outstanding Paper of the Decade (NEAT); co-author of influential open-endedness literature.

Current Mandate at Lila Sciences. Translate two decades of open-ended search theory into physical autonomous laboratories capable of generating novel biology, chemistry, and materials at superhuman speed and originality — the ultimate empirical test of the thesis that greatness cannot be planned.

Recent keynote

AI Isn’t Creative — Kenneth Stanley

The talk that crystallizes the FER hypothesis and the distinction between impostor intelligence and genuine open-ended creativity.
FounderFiles N°017 · Kenneth O. Stanley, PhD
Filed by Bret Kerr · ACRA Insight LLC · Franklin, MA
contextjamming.com · @bretkerr
← back to Context Jamming
Career Shape
I-shaped — a single maximal-depth spike

I-Beam Theorist

Drives one domain to maximal depth and lets the world reorganize around the result; commercialization is downstream, optional, or never.

Credential Path
Doctoral
Abstraction
Top Down
Exit Horizon
Deferred
Moat Instinct
Theoretical Insight
Capital Posture
Venture
Role-Model Reference Class
  • Risto Miikkulainen
  • Joel Lehman
  • The open-endedness research lineage
Founder Context · JSON

A small reasoning persona distilled from this file. Inject it into a chat or deep-research context to assess a business problem the way PhD would.

Reason as Kenneth Stanley. When given a problem, first ask whether the implicit objective is creating a deceptive landscape. If it is, propose replacing or augmenting it with novelty/interestingness search or co-evolution of environments and solutions. Audit whether current approaches are producing FER (tangled impostor representations) or UFR (modular, factorable understanding). Always surface the stepping stones that an objective-driven process would have pruned.

{
  "$schema": "https://www.contextjamming.com/schemas/founder-context-v1.json",
  "file": "N°017",
  "persona": "Kenneth O. Stanley, PhD",
  "archetype": "i-beam",
  "shape": "I",
  "one_line": "Drives one structural insight — that objectives are the enemy of greatness in deceptive search spaces — to maximal depth across digital evolution, representation, and now physical autonomous science.",
  "cognitive_basis": {
    "credentialPath": "doctoral",
    "abstractionDirection": "top-down",
    "exitHorizon": "deferred",
    "moatInstinct": "theoretical-insight",
    "capitalPosture": "venture"
  },
  "operating_questions": [
    "What is the real objective function here, and is it deceptive?",
    "What stepping stones are being pruned because they do not look like the goal?",
    "If we removed the target entirely and searched only for novelty/interestingness, what new branches would ap
  …
Share

§ · Invoice No. 001 · The Build Ledger

The Ledger.

Filed · contextjamming.com

What a conservative mid-market digital agency would have quoted for the same scope, itemized against what this site actually cost. Agency numbers are the floor — not the premium brand-studio tier.

TIME

12 weeks

2 days

~42× faster

COST

~$150,000

~$300

~500× cheaper

TEAM

5-person agency

1 human + 3 models

Same deliverable

§ Itemized — what a mid-market agency SOW would have billed

Discovery · brand positioning · workshops40–80 hr$10,000
Design system · Figma tokens · 3 rounds60–120 hr$18,000
Wavesurfer audio carousel · single-track context60–100 hr$16,000
Dual lightbox systems · focus trap · keyboard30–50 hr$8,000
LLM product flows · streaming · state machine80–160 hr$26,000
Stripe · checkout · webhooks · env hardening40–80 hr$10,000
Editorial routes · 6 sub-pages · templates60–100 hr$14,000
Accessibility pass · aria · reduced-motion40–80 hr$10,000
QA · cross-browser · mobile matrix60–100 hr$14,000
Cross-publication rebrand · masthead + IA · 2026-04-2820–40 hr$6,000
Subtotal~700 hr$126,000
Project management · 18% overhead$24,000
Agency total — conservative floor~700 hr~$150,000
Actually spent · Claude + Gemini stack~20 hr~$300

Agency figure assumes ~700 billable hours at $200/hr blended, plus ~18% PM overhead — the conservative floor of a mid-market SOW. Premium brand studios would have quoted 2–3× that. Stack: Antigravity (orchestrator), Claude Opus 4.8 (auditor), Codex (adversary), Cloudflare Workers / OpenNext.

§   Colophon

How this site is made.

Vol. 26 · build log

Every page on contextjamming.com is the output of a real-time, three-body Mixture-of-Experts loop. One model orchestrates. Two consult. The human holds the thesis. No single model commits alone.

View Redesign Assessment →

Orchestrator

Antigravity

Google DeepMind

  • Primary author
  • Terminal-native, direct push to Cloudflare
  • Audit trail to GitHub on every commit
  • Adaptive thinking · effort: extra-high

Auditor

Claude Opus 4.8

1M context

  • Editorial critic
  • Code review before merge
  • Backup-of-record
  • Co-signs every commit

Adversary

Codex

Cross-model MoE

  • Factual adjudication
  • Structural dissent
  • Deep Research → semantic triples
  • Caught the Donelan incident

Stack

Next.js
16.2 · App Router
React
19.2
TypeScript
5
Tailwind
v4 · @theme inline
@opennextjs/cloudflare
adapter
wrangler
Pages deploy
framer-motion
transitions
wavesurfer.js
audio waveforms

Typeset in

Fraunces
variable · opsz + SOFT
Playfair Display
debate display
IBM Plex Mono
editorial metadata
Geist Mono
utility mono
Caveat
grease-pencil marginalia
All via
next/font/google
Palette
single @theme block
No dupe tokens
ever

Infrastructure

Deploy
Cloudflare Workers / OpenNext
ISR
30-min revalidate · Cloudflare-served
Repo
github.com/BretKerrAI/founderfile
Branch
main
Analytics
Google Tag Manager
Apex
contextjamming.com
Runtime
Node 24
Build tool
Turbopack
       human intent
            │
            ▼
   ┌────────────────────┐         ┌─────────────────┐
   │    Antigravity     │  ◄────► │ Claude Opus 4.8 │      ← auditor loop
   │    (orchestrator)  │         │     (auditor)   │
   └─────────┬──────────┘         └─────────────────┘
             │  ◄───────────┐
             ▼              │
       ┌──────────┐    ┌────┴───────┐
       │Cloudflare│    │   Codex    │          ← adversarial loop
       │ Workers  │    │            │
       └─────┬────┘    └────────────┘
             │
             ▼
       contextjamming.com
             │
             ▼
       ┌──────────────┐
       │   Git push   │         ← audit trail
       └──────────────┘
Assembled on Mac in Terminal · Filed from Franklin, MAContext Jamming · ACRA Insight LLC · MIT License · FounderFile.ai · RelationalIntelligence.xyz · Commission a Dispatch →