RESEARCH DIRECTION

World Models &
Latent Imagination

An agent that must experience every consequence firsthand learns slowly and dangerously. We build agents that learn predictive world models — compressed spatio-temporal representations of how their environment evolves — and use them as internal simulators: rehearsing candidate futures in latent space, selecting actions by imagined rollout, and reserving real-world interaction for what imagination cannot settle.

Compressed spatio-temporal representation

Raw observation streams are too high-dimensional to plan over. We study how generative models can be trained self-supervised to distill perception into compact latent states that preserve exactly the structure needed for prediction and control — and discard the rest. Compression is not a convenience here; the quality of the latent code bounds the quality of every downstream decision.

Planning inside the model

Once dynamics live in a learned latent space, planning becomes rollout: propose an action sequence, unroll the model, score the imagined trajectory, repeat. Policies can be trained largely inside this learned simulator and transferred back to the real environment — a sample-efficiency lever measured in orders of magnitude. The hard science is in the transfer: a policy that exploits its own model's imperfections has learned to cheat a dream.

Model error as steering signal

The divergence between imagined and observed outcomes is the most informative signal an agent receives. We treat prediction error as the compass for both learning and exploration: it tells the model where to improve, the planner where to distrust itself, and the curiosity system where the frontier of competence lies.

WORKING PRINCIPLES

How we hold this work to account.

Predict before acting

Imagined rollouts are cheaper than real mistakes.

Compress what matters

The latent code should carry decision-relevant structure, nothing else.

Transfer must survive reality

A policy is graded in the environment, never in the dream.

CONTINUE EXPLORING

More research directions.

Self-Supervised Representation Learning

Joint-embedding predictive architectures that learn hierarchical abstractions from raw observation — without labels.

Neurosymbolic Reasoning

Hybrid architectures that combine learned representations with explicit symbol manipulation and verifiable inference.

Spatial & Embodied Intelligence

Grounding intelligence beyond language: geometric scene understanding, simulation, and perception-action loops.

Intrinsic Motivation & Open-Ended Learning

Curiosity as compression progress: agents that generate their own curricula and allocate compute to their frontier.

Grounding & Calibration

Closed-loop evaluation against reality: held-out prediction, proper scoring, and confidence that means something.

Systems & Cognitive Architecture

Modular architectures — perception, world model, memory, critic, actor — engineered as dependable, measurable systems.

Alignment & Safety

Bounded agency, calibrated honesty, and oversight designed into the architecture — not appended to it.

ALL RESEARCH