Advancing Multi-View Scene Interpretation: Leveraging Deep Learning for Optimized Input Image Analysis

Mallick, Arijit

Publikationsdienste
→
TOBIAS-lib - Publikationen und Dissertationen
→
7 Mathematisch-Naturwissenschaftliche Fakultät
→
Dokumentanzeige

« zurück

Advancing Multi-View Scene Interpretation: Leveraging Deep Learning for Optimized Input Image Analysis

Mallick, Arijit

Dateien:	Arijit_Mallick_dissertation.pdf 17.2 MB PDF Beschreibung: Main article

Zitierfähiger Link (URI):	http://hdl.handle.net/10900/170931 http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-1709319 http://dx.doi.org/10.15496/publikation-112258
Dokumentart:	Dissertation
Erscheinungsdatum:	2025-10-13
Sprache:	Englisch
Fakultät:	7 Mathematisch-Naturwissenschaftliche Fakultät
Fachbereich:	Informatik
Gutachter:	Lensch, Hendrik P. A. (Prof. Dr.)
Tag der mündl. Prüfung:	2025-07-11
DDC-Klassifikation:	004 - Informatik
Schlagworte:	Deep Learning , Machine learning , Computer graphics , Machine vision , MVS , Image processing
Lizenz:	http://tobias-lib.uni-tuebingen.de/doku/lic_ohne_pod.php?la=de http://tobias-lib.uni-tuebingen.de/doku/lic_ohne_pod.php?la=en
Zur Langanzeige

Abstract:

This research provides a comprehensive analysis of multi-view scene interpretation, leveraging deep learning models to enhance input image quality. We delve into tasks ranging from low-level view interpolation to high-level 3D reconstruction and burst image denoising. Our approach leverages deep learning techniques and innovative methodologies to overcome limitations in existing classical and learning methods. We introduce a novel view interpolation technique that generates intermediate frames accurately without requiring additional geometric input. This method lays the foundation for our subsequent work on multi-view 3D reconstruction. To address the lack of ground truth depth information in 3D reconstruction, we propose a meta-learning and unsupervised approach to tackle the classic problem of multi-view stereo. We also tackle the issue of low-resolution depth maps by introducing a depth enhancing transformer-CNN hybrid module. Finally, we explore burst image denoising, proposing a model that utilizes multiple image alignment and feature volume merging to achieve state-of-the-art performance. Our research contributes significantly to the field of computer vision and has potential applications in various domains.

Das Dokument erscheint in:

7 Mathematisch-Naturwissenschaftliche Fakultät [5099]

Veröffentlichen

Stöbern

Gesamter Bestand
Diese Sammlung

Mein Benutzerkonto

Einloggen

Advancing Multi-View Scene Interpretation: Leveraging Deep Learning for Optimized Input Image Analysis

DSpace Repositorium (Manakin basiert)

Advancing Multi-View Scene Interpretation: Leveraging Deep Learning for Optimized Input Image Analysis

Abstract:

Das Dokument erscheint in:

Stöbern

Gesamter Bestand

Diese Sammlung

Mein Benutzerkonto