Ongoing Theses

Evaluation of Inverse Rendering using Multi-View RGB-D data

Description

The core idea is to detect illumination in a pre-defined scene (Digital Twin) and adapt the moving objects in the simulation.

In this work, the student would have to:
- Create a sensor setup
- Perform inverse rendering
- Show lighting changes in the room
- Estimate novel views

Prerequisites

Preferable

Experience with Git
Python (PyTorch)
Nvidia Omniverse

Contact

driton.salihu@tum.de

Supervisor:

Driton Salihu

Multiband evaluation of passive signal for human activity recognition

Keywords:
CSI; HAR; AI

Short Description:
To obtain samples

Description

The student must use an RF system to collect samples for different activities.
The student must then implement classification algorithms that determine the different activities from channel state information (CSI) or from time-frequency (T-F) transforms.
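
As a rough illustration of the T-F route, the sketch below computes a short-time Fourier transform magnitude over a one-dimensional amplitude stream using only the Python standard library. The function name `stft_magnitude`, the window length, the hop size, and the synthetic input are assumptions made for illustration; none of them comes from the thesis setup.

```python
import cmath
import math


def stft_magnitude(signal, window=16, hop=8):
    """Return a list of frames; each frame holds DFT magnitudes for bins 0..window//2."""
    # A Hann window reduces spectral leakage at the frame edges.
    hann = [0.5 - 0.5 * math.cos(2 * math.pi * n / (window - 1)) for n in range(window)]
    frames = []
    for start in range(0, len(signal) - window + 1, hop):
        chunk = [signal[start + n] * hann[n] for n in range(window)]
        spectrum = []
        for k in range(window // 2 + 1):
            acc = sum(chunk[n] * cmath.exp(-2j * math.pi * k * n / window)
                      for n in range(window))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# Synthetic stand-in for one CSI amplitude stream: a tone centered on bin 4
# of a 16-sample window.
samples = [math.sin(2 * math.pi * 4 * n / 16) for n in range(64)]
spectrogram = stft_magnitude(samples)
# Index of the strongest frequency bin in every frame.
dominant_bins = [max(range(len(f)), key=f.__getitem__) for f in spectrogram]
```

The resulting frames (time on one axis, frequency bins on the other) are the kind of feature matrix a classifier could consume; which classifier is appropriate is left open here.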

Prerequisites

Contact

fabian.seguel@tum.de

of. 2940

Supervisor:

Fabian Esteban Seguel Gonzalez

Vital Sign Monitoring Using Multi-resolution Analysis and Machine Learning

Keywords:
CSI; HAR; AI

Short Description:
To obtain samples

Description

The student must use a radar system to obtain the vital signs of a patient.
The system must be embedded in a hospital bed.
Vital signs such as breathing rate and heart rate (HR) will be targeted; other applications are to be discussed.
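
To make the breathing-rate target concrete, here is a minimal sketch that picks the dominant frequency of a displacement signal inside a plausible breathing band. The band limits (0.1 to 0.6 Hz), the 10 Hz sampling rate, the function name, and the synthetic signal are all assumptions; a real radar pipeline would first have to extract chest displacement from the radar returns, which is not shown.

```python
import cmath
import math


def dominant_rate_per_minute(samples, sample_rate_hz, min_hz, max_hz):
    """Return the strongest frequency within [min_hz, max_hz], in cycles per minute."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [s - mean for s in samples]  # remove the DC offset
    best_freq, best_mag = 0.0, -1.0
    for k in range(1, n // 2):
        freq = k * sample_rate_hz / n
        if not (min_hz <= freq <= max_hz):
            continue
        # Magnitude of the k-th DFT coefficient.
        mag = abs(sum(centered[t] * cmath.exp(-2j * math.pi * k * t / n)
                      for t in range(n)))
        if mag > best_mag:
            best_freq, best_mag = freq, mag
    return best_freq * 60.0


# Synthetic chest displacement: 0.25 Hz breathing (15 breaths per minute),
# sampled at 10 Hz for 40 s.
fs = 10.0
displacement = [math.sin(2 * math.pi * 0.25 * t / fs) for t in range(400)]
breaths_per_minute = dominant_rate_per_minute(displacement, fs, 0.1, 0.6)
```

The same function could be pointed at a heart-rate band by changing `min_hz` and `max_hz`, although separating the weaker heartbeat component from breathing harmonics usually needs more than a single DFT peak search.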

Prerequisites

Contact

fabian.seguel@tum.de

of. 2940

Supervisor:

Fabian Esteban Seguel Gonzalez

Learning-based human-robot shared autonomy

Keywords:
robot learning, shared control

Description

In shared control teleoperation, robot intelligence and human input can be blended to improve task performance and reduce the human workload. In this topic, we would like to investigate how human input and robot intelligence can be combined effectively to ultimately achieve full robot autonomy. We will employ robot learning from demonstration approaches, where we provide task demonstrations using teleoperation.

We aim to test the developed algorithms in simulation and on a Franka Emika robot arm.
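
One simple way to picture the blending of human input and robot intelligence is a linear arbitration between two velocity commands. The sketch below is only an assumed baseline (the function name, the `autonomy` parameter, and the numbers are hypothetical); the thesis may use a different, learned arbitration scheme.

```python
def blend_commands(human_cmd, robot_cmd, autonomy):
    """Linearly blend two velocity commands.

    autonomy = 0.0 is pure teleoperation, autonomy = 1.0 is full robot autonomy.
    """
    if not 0.0 <= autonomy <= 1.0:
        raise ValueError("autonomy must lie in [0, 1]")
    if len(human_cmd) != len(robot_cmd):
        raise ValueError("commands must have the same dimension")
    return [(1.0 - autonomy) * h + autonomy * r
            for h, r in zip(human_cmd, robot_cmd)]


human = [0.10, 0.00, -0.05]   # operator's Cartesian velocity command (m/s)
policy = [0.06, 0.02, -0.05]  # command proposed by a learned policy (made-up values)
blended = blend_commands(human, policy, autonomy=0.5)
```

Raising `autonomy` gradually from 0 to 1 as the learned policy becomes more reliable is one possible path from teleoperation toward full autonomy.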

Requirements:

Basic experience in C/C++

ROS is a plus

High motivation to learn and conduct research

Supervisor:

Basak Gülecyüz

Maximizing Success on the Streaming Platform YouTube

Description

...

Supervisor:

Eckehard Steinbach

Optimization of Saliency Map Creation

Keywords:
Saliency maps, deep learning, computer vision

Description

Saliency maps can be interpreted as probability maps that assess a scene's attractiveness and highlight the regions the user is likely to look at. The objective of this thesis is to help create a novel dataset that records the head motions and gaze directions of participants watching 360° videos with varying scene dynamics. This dataset is then to be used to improve state-of-the-art saliency map creation algorithms and make them soft real-time capable. Deep learning has proved to be a robust technique for creating saliency maps. The student is expected either to use pruning techniques to boost the performance of state-of-the-art methods or to develop their own approach that offers a good trade-off between accuracy and computational complexity.
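
To illustrate the "probability map" reading of a saliency map, the following sketch turns a handful of gaze samples into a normalized map on a coarse equirectangular grid. The grid size, the Gaussian width `sigma`, the function name, and the gaze samples are assumptions; the sketch ignores the latitude-dependent distortion of equirectangular frames and is not the dataset format of the thesis.

```python
import math


def saliency_from_fixations(fixations, width, height, sigma=1.5):
    """Accumulate one Gaussian per fixation and normalize the grid to sum to 1."""
    grid = [[0.0] * width for _ in range(height)]
    for fx, fy in fixations:
        for y in range(height):
            for x in range(width):
                dx = abs(x - fx)
                dx = min(dx, width - dx)  # 360° video: columns wrap around horizontally
                dy = y - fy
                grid[y][x] += math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))
    total = sum(sum(row) for row in grid)
    return [[v / total for v in row] for row in grid]


# Made-up gaze samples as (column, row) on a coarse 16x8 grid.
gaze = [(1, 4), (15, 4), (0, 3), (8, 2)]
saliency = saliency_from_fixations(gaze, width=16, height=8)
# Grid cell with the highest probability mass.
peak = max(((x, y) for y in range(8) for x in range(16)),
           key=lambda p: saliency[p[1]][p[0]])
```

Because of the horizontal wrap-around, the samples at columns 15, 0, and 1 reinforce each other, which a planar image model would miss.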

Prerequisites

Computer Vision, Machine Learning, C++, Python

Supervisor:

Simulation and Optimization for 6G Network Planning using Digital Twins

Description

The Chair of Media Technology at the Technical University of Munich (TUM) is offering an exciting Master’s Thesis opportunity in the context of future 6G networks and digital twin-based optimization. The thesis will contribute to a cutting-edge research project focused on the planning and simulation of 6G wireless systems in complex environments.

The topic is split into the following steps.

Selection and Evaluation of Simulation software

As 6G networks are still under development, simulation is essential for validating novel methods. The first part of this thesis involves identifying and evaluating suitable 6G simulation platforms that can replicate key network functions and ray-tracing-based propagation models. The selected simulator should ideally support:

3D point clouds or triangular mesh input
CAD data processing
Modifiable object classes
Material-aware modeling

Candidate simulators:

DeepMIMO, https://www.deepmimo.net/, last accessed February 2025
NIST, https://github.com/usnistgov/qd-realization, last accessed February 2025
Hoydis, Jakob, et al. "Sionna: An open-source library for next-generation physical layer research." arXiv preprint arXiv:2203.11854 (2022).

Offline Optimization for 6G Network Deployment

Building upon the simulator selected above, the second part focuses on offline optimization of access point placement and beamforming strategies using a digital twin of the environment. This involves:

Differentiating between static and dynamic objects (e.g., fixed machinery or columns)
Incorporating material-specific signal propagation effects
Developing methods to maximize signal coverage by optimizing AP locations in a virtual replica of the environment
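
The coverage-maximization step can be pictured with a deliberately simple stand-in: a log-distance path-loss model and a greedy choice of access point sites. All parameter values (transmit power, reference loss, path-loss exponent, threshold), the function names, and the room layout below are assumptions; a ray-tracing simulator with material-aware propagation would replace `received_power_dbm` in practice.

```python
import math


def received_power_dbm(ap, point, tx_power_dbm=20.0, pl0_db=40.0, exponent=3.0):
    """Log-distance path loss with a 1 m reference distance (assumed parameters)."""
    dist = max(math.dist(ap, point), 1.0)
    return tx_power_dbm - (pl0_db + 10.0 * exponent * math.log10(dist))


def greedy_ap_placement(candidates, points, num_aps, threshold_dbm=-50.0):
    """Pick up to num_aps candidate sites that greedily maximize covered points."""
    chosen, covered = [], set()
    for _ in range(num_aps):
        best_site, best_gain = None, set()
        for site in candidates:
            if site in chosen:
                continue
            # Points this site would newly cover.
            gain = {i for i, p in enumerate(points)
                    if i not in covered
                    and received_power_dbm(site, p) >= threshold_dbm}
            if len(gain) > len(best_gain):
                best_site, best_gain = site, gain
        if best_site is None:
            break  # no remaining site adds coverage
        chosen.append(best_site)
        covered |= best_gain
    return chosen, len(covered) / len(points)


# A 30 m x 10 m hall sampled every 5 m, with three candidate AP sites.
points = [(x, y) for x in range(0, 31, 5) for y in range(0, 11, 5)]
sites = [(5, 5), (15, 5), (25, 5)]
two_aps, coverage_two = greedy_ap_placement(sites, points, num_aps=2)
three_aps, coverage_three = greedy_ap_placement(sites, points, num_aps=3)
```

Greedy selection is used here only because it is short and easy to follow; it does not guarantee an optimal placement, and beamforming is not modeled at all.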

Prerequisites

Background in electrical engineering, computer science, or a related field
Interest in wireless networks, digital twins, and simulation
Programming experience (e.g., Python, MATLAB)
Motivation to work independently on a technically challenging and innovative topic

Supervisor:

Rahul Chaudhari

Real-time Multi-View Visual SLAM

Description

How can a SLAM system use a multi-camera rig efficiently, robustly, and fast?

Supervisor:

Sebastian Eger

Hand Pose Estimation Using Multi-View RGB-D Sequences

Keywords:
Hand Object Interaction, Pose Estimation, Deep Learning

Description

In this project, the task is to fit a parametric hand mesh model and a set of rigid objects to sequences recorded by multi-view RGB-D cameras. Existing models for hand keypoint detection and 6DoF pose estimation of rigid objects have evolved significantly in recent years. Our goal is to use such models to estimate the hand and object poses.

Related Work

https://dex-ycb.github.io/
https://www.tugraz.at/institute/icg/research/team-lepetit/research-projects/hand-object-3d-pose-annotation/
https://github.com/hassony2/obman
https://github.com/ylabbe/cosypose

Prerequisites

Knowledge of computer vision.
Experience with segmentation models (e.g., Detectron2).
Experience with deep learning frameworks such as PyTorch or TensorFlow (2.x).
Experience with PyTorch3D is a plus.

Contact

marsil.zakour@tum.de

Supervisor:

Marsil Zakour

Attentive observation using intensity-assisted segmentation for SLAM in a dynamic environment

Keywords:
SLAM, ROS, Deep Learning, Segmentation

Description

Attentive observation using intensity-assisted segmentation for SLAM in a dynamic environment.

Supervisor:

Mojtaba Karimi

Illumination of Augmented Reality Content using a Digital Environment Twin

Description

...

Supervisor:

Eckehard Steinbach

Classification of Wafer Patterns Using Machine Learning Methods to Automate Pattern Recognition

Description

...

Supervisor:

Eckehard Steinbach

Solid-State LiDAR and Stereo-Camera based SLAM for unstructured planetary-like environments

Keywords:
Solid-State LiDAR; Stereo-Camera; SLAM

Description

New developments in solid-state LiDAR technology open the possibility of integrating range sensors into potentially space-qualifiable perception setups, thanks to mechanical designs with fewer moving parts. The development of a hybrid stereo-camera/LiDAR sensor setup might thereby overcome the disadvantages of each technology, such as the limited range of stereo camera setups or the minimum range that LiDARs require. This thesis investigates the possibilities of such a new solid-state LiDAR by incorporating it, along with a stereo camera setup and an IMU, into a SLAM system. Foreseen activities might include, but are not limited to, the design and construction of a portable/handheld sensor setup for recording and testing in planetary-like environments, extrinsic calibration of the sensors, integration into a software pipeline, development of a ROS interface, and preliminary mapping tests.

Supervisor:

Mojtaba Karimi - (German Space Agency (DLR))

Deep Predictive Attention Controller for LiDAR-Inertial localization and mapping

Keywords:
SLAM, Sensor Fusion, Deep Learning

Description

Multidimensional sensory data is computationally expensive to process for localization algorithms in autonomous drone navigation. Research shows that not all sensory data are equally important throughout the SLAM process for producing a reliable output. An attention control scheme is one effective way to filter out the most valuable sensory data for such a system. A predictive attention model, for instance, can help improve the results of sensor fusion algorithms by concentrating on the most valuable sensory data based on the dynamics of the vehicle motion or a semantic understanding of the environment. The aim of this work is to investigate state-of-the-art attention control models that can be adapted to a multidimensional sensory data acquisition system and to compare them across different modalities.

Prerequisites

- Strong background in Python and C++ programming

- Solid background in robot control theory

- Be familiar with deep learning frameworks (TensorFlow)

- Be familiar with the robot operating system (ROS)

Contact

leox.karimi@tum.de

Supervisor:

Mojtaba Karimi

Model based Collision Identification for Real-Time Jaco2 Robot Manipulation

Keywords:
ROS, Haptics, Teleoperation, Jaco2

Description

With the advancement of robotics and communication networks such as 5G, telemedicine has become a critical application for remote diagnosis and treatment.

In this project, we want to perform robotic teleoperation using a Sigma 7 haptic master and a Jaco 2 robotic manipulator.

Tasks:

State of the art review and mathematical modeling
Jaco2 haptic controller implementation
Fault-tolerant (delay, network disconnect) controller design
System evaluation with external force-torque sensor

Prerequisites

Strong background in C++ programming
Solid background in control theory
Be familiar with robot dynamics and kinematics
Be familiar with the robot operating system (ROS) and ROS Control (Optional)

Contact

edwin.babaians@tum.de

Supervisor:

Edwin Babaians

Extension of an Open-source Autonomous Driving Simulation for German Autobahn Scenarios

Description

This work can be done in German or English in a team of 2-4 members.
Self-driving cars need to be safe in their interaction with other road users such as motorists, cyclists, and pedestrians. But how can car manufacturers ensure that their self-driving cars are safe around us humans? The only realistic and economical way to test this is to use simulation.
cogniBIT is a Munich-based startup founded by alumni of TUM and LMU that provides realistic models of all kinds of road users. These models are based on state-of-the-art neurocognitive and sensorimotor research and reproduce human perception, cognition, and action with all their limitations.
In this project the objective is to extend the open-source simulator CARLA (www.carla.org) such that German Autobahn-like scenarios can be simulated.

Tasks:
•    Design an Autobahn scenario using the road description format OpenDRIVE.
•    Adapt the CARLA OpenDRIVE standalone mode (requires C++ knowledge).
•    Design an environment for the scenario using the Unreal Engine 4 Editor.
•    Perform a simulation-based experiment using the German Autobahn scenario and the cogniBIT driver model.

Prerequisites

•    C++ knowledge
•    experience with Python is helpful
•    experience with the UE4 editor is helpful
•    interest in autonomous driving and cognitive models

Supervisor:

Markus Hofbauer - Lukas Brostek (cogniBIT)

Exploring Effective Pooling Layers for Vector Neuron Networks

Description

In the field of point cloud analysis, most feature extraction models are sensitive to rotation, which means a slight rotation of the input point cloud may lead to a completely different latent representation, posing an issue for real-world applications. The Vector Neuron (VN) Network addresses this challenge by replacing the conventional scalar neuron in a neural network with a 3-dimensional vector neuron. This representation allows the rotation of the input to naturally propagate to hidden layers, thus producing a consistent latent representation for an input under arbitrary rotations. For more details, see the reference below.

However, the VN framework lacks an important component in deep neural networks: the max-pooling layer. In computer vision models, max-pooling is employed to highlight the most significant feature in a local region, and it mostly performs better than average pooling. But the original VN-MaxPooling faces a critical logical issue such that it cannot be trained properly. Therefore, most VN-based models can only use AvgPooling, which may limit the representative power.

In this student thesis, we will explore different possibilities to implement a rotation-equivariant MaxPooling layer. We will test their performance with open-source models on common datasets, in tasks such as point cloud classification, segmentation, and/or completion.
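
As one example of what a rotation-equivariant pooling candidate could look like, the sketch below selects, per channel, the vector feature with the largest Euclidean norm. Since rotations preserve norms, the selection does not change when the input is rotated, and the selected vector rotates along with it. This norm-based rule is an assumption chosen for illustration; it is neither the original VN-MaxPooling nor a proposed solution of the thesis, and it says nothing about trainability. The feature values are made up.

```python
import math


def norm_max_pool(features):
    """features[n][c] is a 3D vector; return, per channel, the vector with the largest norm."""
    num_channels = len(features[0])
    pooled = []
    for c in range(num_channels):
        best = max((point[c] for point in features),
                   key=lambda v: v[0] ** 2 + v[1] ** 2 + v[2] ** 2)
        pooled.append(best)
    return pooled


def rotate_z(v, angle):
    """Rotate a 3D vector about the z-axis by angle (radians)."""
    c, s = math.cos(angle), math.sin(angle)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1], v[2])


# Two channels of vector features for three points (made-up values).
feats = [
    [(1.0, 0.0, 0.0), (0.0, 0.5, 0.0)],
    [(0.0, 2.0, 1.0), (0.3, 0.0, 0.1)],
    [(0.5, 0.5, 0.5), (0.0, 0.0, 3.0)],
]
theta = 0.7
# Equivariance check: pooling then rotating should match rotating then pooling.
pool_then_rotate = [rotate_z(v, theta) for v in norm_max_pool(feats)]
rotate_then_pool = norm_max_pool([[rotate_z(v, theta) for v in point] for point in feats])
```

The check only uses rotations about one axis and features with clearly separated norms; near-ties between norms would make the selection sensitive to floating-point rounding.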

Reference:

Deng et al. "Vector Neurons: A General Framework for SO(3)-Equivariant Networks." ICCV 2021.

Assaad et al. "VN-Transformer: Rotation-Equivariant Attention for Vector Neurons." TMLR 2023.

Prerequisites

Basic knowledge of deep learning (neural network components, back-propagation, ...) is required.

Basic programming knowledge is required, preferably in Python.

Experience with PyTorch is a plus.

Interest in 3D Computer Vision.

Contact

zhifan.ni@tum.de

Supervisor:

Zhifan Ni

Collaborative Robotic Grasping during Teleoperation Tasks

Description

This topic focuses on improving robotic grasping networks and advancing embodied intelligence. Grasping is a fundamental capability in robotic manipulation and often plays a decisive role in the overall success of a task. Despite significant progress in learning-based grasping, current models still struggle with generalization and robustness in unstructured environments. Our goal is to enhance the success rate of existing grasping models and deploy them in real-world scenarios, where they can provide intelligent assistance during teleoperation tasks. By leveraging pre-trained grasping networks, we aim to reduce the human operator's workload, increase autonomy, and improve manipulation efficiency in complex and dynamic settings. This work offers a unique opportunity to work at the intersection of perception, control, and learning—pushing the boundaries of what robots can achieve through smarter, more adaptive grasping.

Prerequisites

Good Programming Skills (Python, C++)
Knowledge about Ubuntu/Linux/ROS
Motivation to learn and conduct research

Contact

dong.yang@tum.de

(Please attach your CV and transcript)

Supervisor:

Dong Yang

Scene Graph-based Real-time Scene Understanding for Assistive Robot Manipulation Task

Description

With the rapid development of embodied intelligent robots, real-time and accurate scene understanding is crucial for robots to complete tasks efficiently and effectively. Scene graphs represent objects and their relations in a scene via a graph structure. Previous studies have generated scene graphs from images or 3D scenes, also with the assistance of large language models (LLMs).

In this work, we investigate the application of scene graphs to assist the human operator during teleoperated manipulation tasks. Leveraging scene graphs generated in real time, the robot system can obtain a comprehensive understanding of the scene and reason about the best way to complete the manipulation task based on the current robot state.

Prerequisites

Good Programming Skills (Python, C++)
Knowledge about Ubuntu/Linux/ROS
Motivation to learn and conduct research

Contact

dong.yang@tum.de

(Please attach your CV and transcript)

Supervisor:

Dong Yang

DT-based Human-robot Teleoperation with Haptic Codecs Standard

Keywords:
Digital Twin, Teleoperation, Haptic Codecs Standard

Short Description:
Our project aims to build a DT-based human-robot teleoperation system that uses the Haptic Codecs Standard and multiple sensors and runs on Linux.

Description

For the system, the main achievements should be:

1. A complete human-in-the-loop haptic teleoperation system: You should port the teleoperation system, which currently interacts with Unity on Windows, to Linux using the Robot Operating System (ROS). You can use Gazebo to create the remote environment, which should contain a robotic arm (the follower device) and an operational platform to simulate a real remote environment. You will use a Phantom device as the leader device to manipulate the virtual robot arm and gather information during the interaction in order to detect environment updates, such as a newly added object, and thus build a Digital Twin (DT) of the environment on the leader side.

2. Multiple sensors for data collection on the Follower side: You should use visual and haptic devices to collect environment-update data and complete the environment restoration. Visual information is usually captured using 2D and depth cameras, and haptic information is expressed by the remote position and force feedback.

3. Haptic codecs for data transmission: The transmission of velocity, position, visual, and haptic information needs to follow the Haptic Codecs Standard.

4. Optional function: Plug-and-Play: When a haptic device is temporarily disconnected and then reconnected, the teleoperation system should automatically restore normal operation and resume synchronization between both sides. Both the leader side and the follower side need mechanisms for detecting a disconnection and for resuming operation after reconnection.
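
For intuition about why a haptic codec can reduce the packet rate of position and force data, the sketch below shows deadband-style data reduction: a sample is transmitted only when it differs from the last transmitted value by more than a relative threshold, and the receiver holds the last received value in between. This is an illustrative simplification with an assumed threshold and made-up force samples; it is not the normative algorithm of the Haptic Codecs Standard, and the function names are hypothetical.

```python
def deadband_encode(samples, threshold=0.1):
    """Keep a sample only if it deviates from the last sent value by more than threshold * |last|."""
    sent = []  # (index, value) pairs that would be transmitted
    last = None
    for i, value in enumerate(samples):
        if last is None or abs(value - last) > threshold * abs(last):
            sent.append((i, value))
            last = value
    return sent


def deadband_decode(sent, length):
    """Reconstruct the signal by holding the most recently received value.

    Assumes at least one packet was sent (true for any non-empty input above).
    """
    out, pos, current = [], 0, sent[0][1]
    for i in range(length):
        if pos < len(sent) and sent[pos][0] == i:
            current = sent[pos][1]
            pos += 1
        out.append(current)
    return out


force = [1.00, 1.02, 1.05, 1.20, 1.21, 1.25, 1.40, 1.41]  # made-up force samples (N)
packets = deadband_encode(force)
reconstructed = deadband_decode(packets, len(force))
```

In this toy run only three of the eight samples are transmitted, at the cost of a bounded reconstruction error between updates.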

Prerequisites

Our requirements (preferred):

Familiarity with teleoperation systems, Linux systems, and visual and haptic sensors.

A good understanding of ROS (Robot Operating System).

Supervisor:

Siwen Liu

Refining 3D Hand-Object Reconstruction via Elastomer Model

Description

To model the interaction of hand and object, not only is a separate estimation of hand and object required, but the contact between them must also be taken into account. Significant progress has been made in modeling isolated hands and objects from RGB images. However, modeling the contact between a human hand and an object from a single image is difficult because of occlusions. In this paper, we propose a method for the reconstruction of hands and objects in 3D based on elastomer models. This method simulates the hand-object (HO) contact based on the elastic energy of the elastomer model. At the same time, it imitates the deformation of soft hand tissue using the concept of the elastic modulus, so that a more physically plausible grasp can be formed. In addition, an optimizer is applied to improve the HO interaction under the supervision of ground truth. The whole framework is constructed in an end-to-end manner. Evaluations on several commonly used benchmarks show that the method leads to better reconstruction results and produces more physically plausible hand and object estimates.

Supervisor:

Xinguo He

Feature enhancement based human-object detection

Short Description:
Human-object interaction, Feature pre-processing, VAE

Description

Human-object interaction (HOI) detection is currently a popular research topic. It requires spatially localizing human-object interactions in images. However, the feature extraction stage of current methods can be further optimized. This thesis will explore how to improve the performance of HOI detection, starting with feature extraction and optimization.

Prerequisites

- Computer vision

- Human-object interaction prediction

- Deep Learning

- Transformer

Contact

yuankai.wu@tum.de

Supervisor:

Yuankai Wu

Monocular RGB-based Digital Twin

Description

Using monocular RGB data to reconstruct a 3D interior environment with CAD-based reconstruction.

Prerequisites

Git, Python, PyTorch

Contact

driton.salihu@tum.de

Supervisor:

Driton Salihu

Human-robot interaction using vision-based human-object interaction prediction

Short Description:
Human-object interaction, human-robot interaction

Description

We use a vision-based solution to locate the target object for the robot, which then hands the desired object to the operator, completing the human-robot interaction loop.

Prerequisites

- Panda arm

- Computer vision

- Human-object interaction prediction

- Grasping

Supervisor:

Yuankai Wu

GAN-based subjective haptic signal quality assessment database augmentation and enlargement methods

Description

In this project, the student will research and implement a novel GAN-based approach for augmenting and enlarging subjective haptic signal quality assessment databases. Subjective experiments will also be conducted to evaluate the results of the automatic data expansion.

Supervisor:

Zican Wang

Inverse Rendering in a Digital Twin for Augmented Reality

Keywords:
Digital Twin, Illumination, HDR

Description

The task is to build an end-to-end pipeline for illumination estimation inside a digital twin.

Optionally, an AR application can be created on top of it.

Possible References

[1] https://arxiv.org/pdf/1905.02722.pdf

[2] https://arxiv.org/pdf/1906.07370.pdf

[3] https://arxiv.org/pdf/2011.10687.pdf

Prerequisites

Python (PyTorch)
Experience with Git

Contact

driton.salihu@tum.de

Supervisor:

Driton Salihu

Network Aware Shared Control

Keywords:
Teleoperation, Learning from Demonstration

Description

In this thesis, we would like to make the best of demonstrations of varying quality. We will test the developed approach with shared control.

Prerequisites

Requirements:

Experience in C/C++

ROS is a plus

High motivation to learn and conduct research

Supervisor:

Basak Gülecyüz

Optimization of 3D Object Detection Procedures for Indoor Environments

Keywords:
3D Object Detection, 3D Point Clouds, Digital Twin, Optimization

Description

3D object detection has been a major task in point cloud-based 3D reconstruction of indoor environments. Current research has focused on low inference time for 3D object detection. While this is desirable, many use cases do not benefit from it. In particular, when a pre-defined static Digital Twin is used for AR and robotics applications, there is little incentive to trade accuracy for low inference time.

This thesis will therefore follow the approach of [1] (in this work based only on point cloud data): generate proposals for the layout and objects in a scene, for example with [2] or [3], and use an optimization algorithm (e.g., reinforcement learning or a genetic algorithm) to converge on the correct solution.

Further, to obtain geometrically more plausible results, a relationship graph neural network, as in [4], will be applied in the pipeline.

References

[1] Hampali, Shreyas, et al. “Monte Carlo Scene Search for 3D Scene Understanding.” 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021): 13799-13808. https://arxiv.org/abs/2103.07969

[2] Chen, Xiaoxue, Hao Zhao, Guyue Zhou, and Ya-Qin Zhang. “PQ-Transformer: Jointly Parsing 3D Objects and Layouts From Point Clouds.” IEEE Robotics and Automation Letters 7 (2022): 2519-2526. https://arxiv.org/abs/2109.05566

[3] Qi, C., Or Litany, Kaiming He and Leonidas J. Guibas. “Deep Hough Voting for 3D Object Detection in Point Clouds.” 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (2019): 9276-9285. https://arxiv.org/abs/1904.09664

[4] Avetisyan, Armen, Tatiana Khanova, Christopher Bongsoo Choy, Denver Dash, Angela Dai and Matthias Nießner. “SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans.” ArXiv abs/2003.12622 (2020): n. pag. https://arxiv.org/abs/2003.12622

Prerequisites

Python (PyTorch)
Experience with Git
Knowledge in working with 3D Point Clouds (preferable)
Knowledge about optimization methods (preferable)

Contact

driton.salihu@tum.de

Supervisor:

Driton Salihu

Learning Temporal Knowledge Graphs with Neural Ordinary Differential Equations

Description

...

Contact

zhen.han@campus.lmu.de

Supervisor:

Eckehard Steinbach - Zhen Han (LMU)

Sim-to-Real Gap in Liquid Pouring

Keywords:
sim-to-real

Description

We want to investigate which simulation bottlenecks hinder learning the pouring task and how they can be tackled. This project involves mostly paper reading; the field of research is skill refinement and domain adaptation. In addition, we will try to implement one of the state-of-the-art methods for teaching by demonstration in order to adapt the skill learned in simulation to the real-world scenario.

Prerequisites

Creativity

Motivation

Strong C++ Background

Strong Python Background

Contact

edwin.babaians@tum.de

Supervisor:

Edwin Babaians

"Pouring Liquids" dataset development

Keywords:
Nvidia Flex, Unity3D, Nvidia PhysX 4.0

Short Description:
Using Unity3D and Nvidia Flex plugin, develop a learning environment and model different fluids for teaching pouring tasks to robots.

Description

The student will model different liquid characteristics using Nvidia Flex and add different containers and a particle collision checking system. In addition, the student will build a ground truth system to be used later for robot teaching.

Reference:

https://developer.nvidia.com/flex

https://developer.nvidia.com/physx-sdk

Prerequisites

Strong Unity3D background

Familiarity with the Nvidia PhysX and Nvidia Flex libraries.

Contact

edwin.babaians@tum.de

Supervisor:

Edwin Babaians

Analysis and evaluation of DynaSLAM for dynamic object detection

Description

Investigation of DynaSLAM in terms of real-time capabilities and dynamic object detection.

Supervisor:

Sebastian Eger

Comparison of Driver Situation Awareness with an Eye Tracking based Decision Anticipation Model

Keywords:
Situation Awareness, Autonomous Driving, Region of Interest Prediction, Eye Tracking

Description

This work can be done in German or English.

The transmission of control to the human driver in autonomous driving requires the observation of the human driver. The vehicle has to guarantee that the human driver is aware of the current driving situation. One input source for observing the human driver is based on the driver's gaze.

The objective of this project is to compare two existing approaches for driver observation [1,2]. While [1] measures the driver's situation awareness (SA), [2] anticipates the driver's decision. As part of a user study, [2] published a gaze dataset. An interesting cross-validation would be to compare the SA score generated by [1] with the predicted decision correctness of [2].

Tasks

Generate ROI predictions [3] from the dataset of [2]
Estimate the driver SA with the model of [1]
Compare [1] and [2]
(Optional) Extend driving experiments
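
One basic way to carry out the comparison step is to correlate, per trial, the SA score with the predicted decision correctness. The sketch below computes a Pearson correlation coefficient on placeholder numbers; the variable names and values are assumptions and do not come from [1] or [2], and other agreement measures may turn out to be more appropriate.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


sa_scores = [0.9, 0.4, 0.7, 0.2, 0.8]       # placeholder per-trial SA scores
pred_correct = [0.85, 0.5, 0.6, 0.3, 0.9]   # placeholder predicted decision correctness
r = pearson(sa_scores, pred_correct)
```

A coefficient close to 1 would suggest that both approaches rank the trials similarly; with real data, the sample size and significance would also need to be reported.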

References

[1] Markus Hofbauer, Christopher Kuhn, Lukas Puettner, Goran Petrovic, and Eckehard Steinbach. Measuring driver situation awareness using region-of-interest prediction and eye tracking. In 22nd IEEE International Symposium on Multimedia (ISM), Naples, Italy, Dec 2020.
[2] Pierluigi Vito Amadori, Tobias Fischer, Ruohan Wang, and Yiannis Demiris. Decision Anticipation for Driving Assistance Systems. June 2020.
[3] Markus Hofbauer, Christopher Kuhn, Jiaming Meng, Goran Petrovic, and Eckehard Steinbach. Multi-view region of interest prediction for autonomous driving using semisupervised labeling. In IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece, Sep 2020.

Prerequisites

Experience with ROS and Python
Basic knowledge of Linux

Supervisor:

Markus Hofbauer

3D object model reconstruction from RGB-D scenes

Description

Robots should be able to explore their environments and learn new objects in order to become part of daily human life. Detecting or recognizing objects in unstructured environments such as households remains challenging. For robotic grasping and manipulation, knowing the 3D models of objects is beneficial; hence the robot needs to infer the 3D shape of an object upon observation. In this project, we will investigate methods that can infer or produce 3D models of novel objects by observing RGB-D scenes. We will analyze methods for reconstructing 3D information with different arrangements of an RGB-D camera.

Prerequisites

Basic knowledge of digital signal processing / computer vision
Experience with ROS, C++, Python.
Experience with Artificial Neural Network libraries or motivation to learn them
Motivation to produce successful work

Contact

furkan.kaynar@tum.de

Supervisor:

Hasan Furkan Kaynar

Research on the implementation of an (automated) solution for the analysis of surface impurities on endoscope tubes

Description

...

Supervisor:

Eckehard Steinbach

AI-Enhanced Tool for Desk Research – Smart Analytical Engine

Description

...

Supervisor:

Eckehard Steinbach

Algorithm evaluation for robot grasping with compliant jaws

Keywords:
python, ROS, robot grasping

Short Description:
Apply a state-of-the-art contact model for robot grasp planning with a customized physical setup that includes a KUKA robot arm and a parallel-jaw gripper with compliant materials.

Description

Model-based grasp planning algorithms depend on friction analysis, since friction between objects and gripper jaws strongly affects grasp robustness. A state-of-the-art friction analysis algorithm for grasp planning has been evaluated with plastic robot fingers and achieved promising results. But will it still work if the grippers are fitted with compliant materials such as rubber and silicone, and how does it compare to more advanced contact models?

The task of this work is to create a new dataset and retrain an existing deep network by applying a state-of-the-art contact model for grasp planning.
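To make the role of friction concrete, here is a minimal sketch of the classical antipodal grasp test under a simple Coulomb point-contact model. This is not the state-of-the-art contact model referred to above; the function names, the contact geometry, and the friction coefficients are hypothetical and chosen only for illustration.

```python
import math
from typing import Sequence


def _angle(a: Sequence[float], b: Sequence[float]) -> float:
    """Angle in radians between two non-zero vectors."""
    norm_a = math.sqrt(sum(c * c for c in a))
    norm_b = math.sqrt(sum(c * c for c in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("zero-length vector")
    cos_angle = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    return math.acos(max(-1.0, min(1.0, cos_angle)))


def is_antipodal(p1, n1, p2, n2, mu: float) -> bool:
    """Check whether two contacts form an antipodal grasp.

    p1, p2: contact points; n1, n2: inward-pointing surface normals.
    mu: Coulomb friction coefficient (assumed equal at both contacts).
    The line connecting the contacts must lie inside both friction cones,
    whose half-angle is atan(mu).
    """
    half_angle = math.atan(mu)
    line_12 = [b - a for a, b in zip(p1, p2)]
    line_21 = [-c for c in line_12]
    return (_angle(line_12, n1) <= half_angle
            and _angle(line_21, n2) <= half_angle)


# Hypothetical contact pair: the first surface normal is tilted
# relative to the line connecting the two contacts.
p1, n1 = (0.0, 0.0, 0.0), (1.0, 0.5, 0.0)
p2, n2 = (1.0, 0.0, 0.0), (-1.0, 0.0, 0.0)

low_friction = is_antipodal(p1, n1, p2, n2, mu=0.3)   # narrower friction cone
high_friction = is_antipodal(p1, n1, p2, n2, mu=0.8)  # wider friction cone
```

With these illustrative values, the same contact pair fails the test at the lower friction coefficient and passes at the higher one, which is why the jaw material matters for grasp planning.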

Supervisor:

Jingyi Xu

Adaptive LiDAR data update rate control based on motion estimation

Keywords:
SLAM, Sensor Fusion, ROS

Description

...

Supervisor:

Mojtaba Karimi

UWB localization by Kalman filter and particle filter

Description

...

Supervisor:

Alexandra Zayets

Investigating the Potential of Machine Learning to Map Changes in Forest based on Earth Observation

Description

...

Supervisor:

Eckehard Steinbach

Recording of Robotic Grasping Failures

Description

The aim of this project is to collect data through robotic grasping experiments and to create a large-scale labeled dataset. We will conduct experiments while attempting to grasp known or unknown objects autonomously. The complete pipeline includes:

- Estimating grasp poses via computer vision

- Robotic motion planning

- Executing the grasp physically

- Recording necessary data

- Organizing the recorded data into a well-structured dataset

Most of the data collection pipeline has already been developed; additions and modifications may be needed.

Prerequisites

Useful background:

- Digital signal processing

- Computer vision

- Dataset handling

Requirements:

- Experience with Python and ROS

- Motivation to achieve a good outcome

Contact

furkan.kaynar@tum.de

(Please provide your CV and transcript in your application)

Supervisor:

Hasan Furkan Kaynar

Student Assistant Software Engineering Lab

Keywords:
Software Engineering, Unit Testing, TDD, C++

Description

We are looking for a student teaching assistant for our new Software Engineering Lab. In this course, we explain basic principles of software engineering such as unit testing, test-driven development, and how to collaborate in teams [1].

You will act as a teaching assistant and supervise students during the lab sessions while they work on their practical homework. The homework tasks are generally C++ coding exercises in which the students contribute to a common codebase. You should therefore have good experience with C++, unit testing, and git, as these are an essential part of the homework.

References

[1] Winters, Titus, Tom Manshreck, and Hyrum Wright, eds. Software Engineering at Google: Lessons Learned from Programming Over Time. O'Reilly Media, Incorporated, 2020

Prerequisites

Very good knowledge of C++
Experience with unit testing
Good understanding of git and collaborative software development

Supervisor:

Markus Hofbauer

MATLAB tutor for Digital Signal Processing lecture in summer semester 2022

Description

Tasks:

Help students with the basics of MATLAB (e.g. matrix operations, filtering, image processing, runtime errors)
Correct some of the homework problems
Understand the DSP coursework material

We offer:

Payment according to the working hours and academic qualification
The workload is approximately 6 hours per week from May 2022 to August 2022
Technische Universität München especially welcomes applications from female applicants

Application:

Please send your application with a CV and transcript by e-mail to basak.guelecyuez@tum.de
Students who have taken the DSP course are preferred.

Supervisor:

Basak Gülecyüz

Ongoing Thesis

Bachelor's Theses

Evaluation of Inverse Rendering using Multi-View RGB-D data

Multiband evaluation of passive signal for human activity recognition

Vital Sign Monitoring Using Multi-resolution Analysis and Machine Learning

Learning-based human-robot shared autonomy

Maximizing Success on the Streaming Platform YouTube

Optimization of Saliency Map Creation

Master's Theses

Simulation and Optimization for 6G Network Planning using Digital Twins

Real-time Multi-View Visual SLAM

Hand Pose Estimation Using Multi-View RGB-D Sequences

Attentive observation using intensity-assisted segmentation for SLAM in a dynamic environment

Illumination of Augmented Reality Content using a Digital Environment Twin

Classification of Wafer Patterns Using Machine Learning Methods to Automate Pattern Recognition

Solid-State LiDAR and Stereo-Camera based SLAM for unstructured planetary-like environments

Deep Predictive Attention Controller for LiDAR-Inertial localization and mapping

Model based Collision Identification for Real-Time Jaco2 Robot Manipulation

Interdisciplinary Projects

Extension of an Open-source Autonomous Driving Simulation for German Autobahn Scenarios