The document describes an intuitive interaction system using speech recognition for fire safety. The system is part of a Cooperative Fire Security System (CFS2H) that uses robots, sensors, and a human interface to detect and respond to fires in high-rise buildings. The interaction system allows human operators to communicate with robots, sensors, and other systems using natural language to perform tasks like surveillance, remote control of rescue robots, and survivor detection. It utilizes speech recognition and face detection modules along with a messaging system called HARMS to facilitate communication between all components of the CFS2H system. The overall goal is to provide an effective way for human operators to coordinate the response to fires in high-rise buildings through an intuitive speech-