ViewSplat: View-Adaptive 3D Gaussian Splatting for Feed-Forward Synthesis

Moonyeon Jeong¹, Seunggi Min¹, Suhyeon Lee^2,*, Hongje Seong^1,*

¹University of Seoul, ²Korea Electronics Technology Institute

^*Co-corresponding Authors

ECCV 2026

Abstract

We present ViewSplat, a view-adaptive 3D Gaussian splatting network for novel view synthesis from unposed images. While recent feed-forward 3D Gaussian splatting has significantly accelerated 3D scene reconstruction by bypassing per-scene optimization, a fundamental fidelity gap remains. We attribute this gap to the limited capacity of single-step feed-forward networks to regress static Gaussian primitives that satisfy all viewpoints.

To address this limitation, we shift the paradigm from static primitive regression to view-adaptive splatting. Instead of a rigid Gaussian representation, our pipeline learns a view-adaptive latent representation. Specifically, ViewSplat initially predicts base Gaussian primitives alongside the weights of scene-conditioned View MLPs. During rendering, these MLPs take target-view coordinates as input and predict view-dependent residual updates for each Gaussian attribute (i.e., 3D position, scale, rotation, opacity, and color). This mechanism, which we term view-adaptive splatting, allows each primitive to rectify initial estimation errors, effectively capturing high-fidelity appearances.

Extensive experiments demonstrate that ViewSplat achieves state-of-the-art fidelity while maintaining fast inference and real-time rendering; our large backbone variant runs at 15 FPS during inference and 90 FPS during rendering.

Method Overview

Overview of ViewSplat. While static 3D Gaussians (a) often result in blurred renderings, our ViewSplat (b) refines Gaussian attributes based on the target-view pose on the fly. This allows for superior reconstruction of fine-grained details like sharp edges and specularities compared to existing methods.

Architecture of ViewSplat. Our framework uses a shared Geometry Transformer backbone to jointly estimate camera poses and canonical 3D Gaussians from unposed images. A specialized view-dependent head generates per-pixel View MLPs that predict pose-specific residual offsets, refining Gaussian attributes during rendering to capture complex view-dependent effects.

Qualitative Results

Main qualitative results of novel view synthesis on RE10K.

More qualitative results on RE10K with large image overlap.

More qualitative results on RE10K with medium image overlap.

More qualitative results on RE10K with small image overlap.

@article{Jeong2026viewsplat, title={ViewSplat: View-Adaptive 3D Gaussian Splatting for Feed-Forward Synthesis}, author={Jeong, Moonyeon and Min, Seunggi and Lee, Suhyeon and Seong, Hongje}, journal={arXiv preprint arXiv: 2603.25265}, year={2026} }

ViewSplat: View-Adaptive 3D Gaussian Splatting for Feed-Forward Synthesis

Abstract

Method Overview

Qualitative Results

BibTeX