Mirror Metric v6

Mirror Metric v6 is a research-backed quality score with five dimensions, each mapped through a sigmoid function for smooth, non-linear scoring. Mouth quality and perceptual fidelity are weighted highest because the dental region and photorealism are what patients examine most closely in the mirror.

f(x) = 100 / (1 + exp(-k * (x - mid)))

face_psnr = 0.10 * f(face_psnr, k=0.25, mid=28) // sigmoid — full-face baseline [AUTO]

ms_ssim = 0.15 * f(ms_ssim, k=20, mid=0.88) // sigmoid — structural fidelity [AUTO]

lpips = 0.30 * f(lpips, k=-20, mid=0.15) // sigmoid inverted — perceptual [AUTO]

csim = 0.25 * f(csim, k=10, mid=0.65) // sigmoid — identity preservation [AUTO]

mouth = 0.20 * f(mouth_psnr, k=0.30, mid=20) // sigmoid — dental region quality [AUTO]

MM = round(face_psnr + ms_ssim + lpips + csim + mouth)

[AUTO] All five dimensions are computed automatically from validation_log.csv (validation runs every 2.5K steps) with fallback to training_log.csv estimates. Sigmoid mapping provides smooth scoring without floor/ceiling clamp artifacts.

LPIPS (30%) — Learned perceptual image patch similarity. Lower values mean more photorealistic renders. The sigmoid is inverted (k=-20) so that lower LPIPS scores map to higher MM contributions. Midpoint at 0.15.

CSIM (25%) — ArcFace cosine similarity. A patient looking in a mirror must see themselves. 0.85+ = strong identity preservation. Sigmoid k=10, midpoint 0.65.

Mouth PSNR (20%) — The dental region is the product. Mouth-region PSNR is computed on a masked crop centered on the oral cavity. This is the hardest region to reconstruct (teeth, tongue, mucosa) and the most important for patient acceptance. Sigmoid k=0.30, midpoint 20 dB.

MS-SSIM (15%) — Multi-scale structural similarity captures structural fidelity across multiple resolutions, complementing the pixel-level PSNR and perceptual LPIPS metrics. Sigmoid k=20, midpoint 0.88.

Face PSNR (10%) — Full-face masked PSNR provides a stable baseline for overall reconstruction quality. Not a primary quality gate (Blau & Michaeli 2018), but useful for tracking convergence and regression. Sigmoid k=0.25, midpoint 28 dB.

v5 → v6 changes: Switched from 4-dimension linear clamping to 5-dimension sigmoid mapping. Added MS-SSIM (15%). Replaced DISTS with LPIPS (30%). Reweighted: LPIPS 30%, CSIM 25%, Mouth 20%, MS-SSIM 15%, Face PSNR 10%. Sigmoid curves provide smoother scoring without hard floor/ceiling artifacts.
Sources: NeRSemble benchmark (CSIM), Arc2Avatar, Cafca SIGGRAPH Asia 2024, Apple 3DGS Optimization 2025.

Path to 100

0-20

Proof of Concept

Avatar renders in real-time. Mouth region barely recognizable. Not photorealistic.

20-40

RecognizableYOU ARE HERE

Patient can identify themselves. Dental region shows correct geometry. Still synthetic-looking.

40-60

Convincing

Photorealistic face. Mouth region renders teeth and gingiva accurately. Identity strong.

60-80

Mirror-Like

Full dental region indistinguishable. Expressions perfect. Only experts notice artifacts.

80-100

Indistinguishable

Patient cannot tell the difference from a real mirror. Production ready.