Layer 1: Human-like perception (Perceptual Taxonomy)

Problem: Models recognize objects but fail to reason over hierarchy, attributes, and affordances.

Key idea: A cognitively grounded taxonomy and evaluation of scene reasoning.

Outcome: Diagnostic signals that reveal where perception breaks and how structure helps.

Layer 2: Structured 3D semantics & parts (Name That Part)

Problem: Parts are segmented without semantics or named without structure.

Key idea: Joint part segmentation and naming as set alignment with scalable annotation.

Outcome: Reusable, consistent part semantics that scale under constrained data.

Layer 3: Analysis by synthesis & world reconstruction

Problem: Perception lacks grounded 3D world models for reasoning and synthesis.

Key idea: Analysis by synthesis pipelines that reconstruct structured 3D worlds.

Outcome: World models that support part-level reasoning and compositional inference.

Layer 4: Efficient & strict continual learning (UWSH, EigenLoRAx, Share)

Problem: Continual adaptation is compute-heavy and prone to forgetting.

Key idea: Shared low-dimensional update geometry for reuse, merging, and strict continual learning.

Outcome: Sustainable adaptation that scales beyond privileged compute and data.

Ideas I've Introduced (Selected)

  1. Universal weight geometry / shared subspaces. Showed that neural networks end up converging to the same subspace.
  2. Named 3D parts as set alignment. Introduced first large-scale method for named 3D part segmentation.
  3. Hierarchical perceptual diagnostics. Introduced/advocated the framing of diagnostic evaluations for reasoning.
  4. Catastrophic remembering + strict continual learning. Introduced/advocated the framing of remembering and strict continual learning.
  5. Source-free, image-only UDA for 3D pose (3DUDA). Only method for source-free, image-only unsupervised 3D object pose adaptation.
  6. Adapter recycling / principal subspaces. Introduced/advocated the framing of recycling adapters into principal subspaces.
  7. Shared subspace continual finetuning (Share). Introduced the framing of shared-subspace continual finetuning.