RESEARCH DIRECTION

Alignment &
Safety

Capability without alignment is a liability. We treat safety as an architectural discipline: agents whose scope of action is explicit and bounded, whose uncertainty is calibrated and surfaced, whose reasoning is inspectable, and whose deployment is gated on demonstrated understanding rather than demonstrated performance.

Honesty as a design objective

A system that overstates its confidence, hides its uncertainty, or optimizes appearances over substance is unsafe long before it is powerful. We build anti-deception measures into evaluation itself — proper scoring, adversarial probes, and audits that make confident nonsense costly — and apply the same standard to our own research claims.

Bounded by construction

We prefer agents whose action space, resource envelope, and escalation paths are architectural facts rather than behavioral hopes. Permissions, audit trails, and fail-safes are designed in from the first commit; oversight is real, and a human can always reconstruct what the system did and why.

Deferral under uncertainty

A well-calibrated system knows where its competence ends — and the safe response at that boundary is deferral, not improvisation. We study uncertainty-aware decision policies that trade autonomy for oversight exactly when the model's own error bars say they should.

WORKING PRINCIPLES

How we hold this work to account.

No confident nonsense

Systems should say less when they know less.

Bounded by construction

Safety properties are architectural, not behavioral.

Earn each step

New capability deploys only when it is understood.

CONTINUE EXPLORING

More research directions.

World Models & Latent Imagination

Learning compressed generative models of environment dynamics — and planning inside them before acting in the world.

Self-Supervised Representation Learning

Joint-embedding predictive architectures that learn hierarchical abstractions from raw observation — without labels.

Neurosymbolic Reasoning

Hybrid architectures that combine learned representations with explicit symbol manipulation and verifiable inference.

Spatial & Embodied Intelligence

Grounding intelligence beyond language: geometric scene understanding, simulation, and perception-action loops.

Intrinsic Motivation & Open-Ended Learning

Curiosity as compression progress: agents that generate their own curricula and allocate compute to their frontier.

Grounding & Calibration

Closed-loop evaluation against reality: held-out prediction, proper scoring, and confidence that means something.

Systems & Cognitive Architecture

Modular architectures — perception, world model, memory, critic, actor — engineered as dependable, measurable systems.

ALL RESEARCH

Alignment &Safety

Honesty as a design objective

Bounded by construction

Deferral under uncertainty

How we hold this work to account.

No confident nonsense

Bounded by construction

Earn each step

More research directions.

World Models & Latent Imagination

Self-Supervised Representation Learning

Neurosymbolic Reasoning

Spatial & Embodied Intelligence

Intrinsic Motivation & Open-Ended Learning

Grounding & Calibration

Systems & Cognitive Architecture

Alignment &
Safety