Multimodal Audio-Visual Emergency Recognition
it is a system that takes image and audio as an input that classify them as a danger or not