vid2scene

Phone video → metric 3D reconstruction → a room a robot can explore.

A benchmarked scene (Replica room0): original video (left) and our reconstruction (right), along the same path. Sub-centimetre accuracy against the ground-truth mesh.

Interactive viewers (live)

4 methods + consensus

PGSR · DN-Splatter · MonoSDF · the consensus fusion, all aligned in one coordinate frame. Toggle to compare.

Open viewer →

Reconstruction vs ground truth

Toggle between our reconstruction and the Replica ground-truth mesh (Chamfer distance 0.62 cm).

Open viewer →

Reference Gaussian splat

The reference Gaussian splat, at PSNR 32.3 dB (within 0.7 dB of SOTA on this scene).

Open viewer →

Quality vs ground truth (Replica, visibility-culled; distances in cm, F-score at a 5 cm threshold)

Method            Accuracy (cm) ↓  Completion (cm) ↓  Chamfer-L1 (cm) ↓  F-score@5cm ↑
PGSR              1.13             7.60               4.37               0.898
DN-Splatter       0.57             6.14               3.36               0.936
Consensus (ours)  1.07             6.47               3.77               0.913

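All values are 5-scene averages. Accuracy is sub-centimetre for DN-Splatter (0.57 cm) and close to 1 cm for PGSR (1.13 cm) and the consensus (1.07 cm). The full protocol and the per-scene table are in the repo's docs/BENCHMARK.md.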
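For readers unfamiliar with the column names, here is a minimal sketch of how such metrics are commonly computed from two point sets. It assumes the usual definitions from the surface-reconstruction literature, which may differ in detail from the protocol in docs/BENCHMARK.md. Accuracy is the mean distance from reconstructed points to the ground truth, completion is the mean distance in the other direction, Chamfer-L1 is the mean of the two, and the F-score combines the fractions of points within the threshold. The function names are hypothetical, the brute-force nearest-neighbour search is only suitable for small clouds, and visibility culling is not reproduced.

```python
import numpy as np


def nearest_distances(src: np.ndarray, dst: np.ndarray) -> np.ndarray:
    """Distance from each point in src to its nearest neighbour in dst."""
    # Brute force: (N, 1, 3) - (1, M, 3) -> (N, M) pairwise distances.
    diff = src[:, None, :] - dst[None, :, :]
    return np.linalg.norm(diff, axis=2).min(axis=1)


def reconstruction_metrics(pred_m, gt_m, threshold_cm=5.0):
    """Illustrative metrics for (N, 3) point arrays given in metres."""
    pred_m = np.asarray(pred_m, dtype=float)
    gt_m = np.asarray(gt_m, dtype=float)

    pred_to_gt_cm = nearest_distances(pred_m, gt_m) * 100.0
    gt_to_pred_cm = nearest_distances(gt_m, pred_m) * 100.0

    accuracy = pred_to_gt_cm.mean()      # how close the reconstruction is to GT
    completion = gt_to_pred_cm.mean()    # how much of GT is covered
    chamfer_l1 = 0.5 * (accuracy + completion)

    precision = (pred_to_gt_cm < threshold_cm).mean()
    recall = (gt_to_pred_cm < threshold_cm).mean()
    denom = precision + recall
    f_score = 2.0 * precision * recall / denom if denom > 0 else 0.0

    return {
        "accuracy_cm": float(accuracy),
        "completion_cm": float(completion),
        "chamfer_l1_cm": float(chamfer_l1),
        "f_score": float(f_score),
    }
```

The averaged rows in the table are consistent with this Chamfer-L1 definition: for the consensus row, (1.07 + 6.47) / 2 = 3.77.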

Interior fly-throughs of three reconstructed benchmark scenes (living room, bedroom, study).

An agent explores it

The metric mesh is loaded as a Habitat navmesh. Top-down view of a recorded agent trajectory over the reconstructed room. The Genesis physics upgrade is described in docs/PHASE2_GENESIS.md.
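As a rough illustration of how a top-down trajectory view like this can be produced, the sketch below projects recorded 3D agent positions onto the ground plane and measures the path length. It does not use Habitat's API. It assumes a Y-up world frame (so the ground plane is X/Z), and the function names, `bounds_min`, and `meters_per_pixel` are hypothetical choices for illustration.

```python
import numpy as np


def to_topdown_pixels(positions, bounds_min, meters_per_pixel=0.05):
    """Map (N, 3) world positions (Y up, assumed) to (N, 2) (row, col) pixels."""
    positions = np.asarray(positions, dtype=float)
    # X maps to image columns, Z maps to image rows; height (Y) is dropped.
    cols = np.floor((positions[:, 0] - bounds_min[0]) / meters_per_pixel)
    rows = np.floor((positions[:, 2] - bounds_min[2]) / meters_per_pixel)
    return np.stack([rows, cols], axis=1).astype(int)


def path_length(positions):
    """Total length in metres of the polyline through the positions."""
    positions = np.asarray(positions, dtype=float)
    steps = np.diff(positions, axis=0)
    return float(np.linalg.norm(steps, axis=1).sum())
```

Because the mesh is metric, the path length comes out directly in metres and the pixel scale is a free choice of the visualisation.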

Code

The repo and full write-up are on GitHub: see docs/ARCHITECTURE.md and docs/BENCHMARK.md.