Abschlussarbeit Detailansicht

B.Sc./M.Sc. Thesis Offer:  „Visual Voice Activity Detection“ in the field of social perception 

Motivation:

Robots are becoming more and more integrated into our everyday lives. In this sense, it is important to make human-machine interaction as natural as possible.

Visual Voice Activity Detection (VVAD) is an important cognitive function that allows people to see if the interaction partner is speaking. Robotic systems repeatedly show deficits in this aspect, which means that interaction with the robotic system is unnatural and not very fluid. In the thesis, a VVAD system is to be developed on the basis of the VVAD-LRS3 data set. 

Goal:

    Development of VVAD system for a robotic system

Prior Knowledge:

    Python

    Deep-Learning

Related Work:

    Adrian Lubitz, Matias Valdenegro-Toro, Frank Kirchner (2021) 
The VVAD-LRS3 Dataset for Visual Voice Activity Detection,                    https://doi.org/10.48550/arXiv.2109.13789


We look forward to receiving your complete informative application documents including current transcript of grades.
For further information and application, please contact:  Adrian.Lubitz@dfki.de.

© DFKI GmbH
zuletzt geändert am 31.03.2023
nach oben