The document presents a video-based crowd density estimation system using the Accumulated Mosaic Image Difference (AMID) method. The system extracts crowded areas in monocular video sequences and estimates the number of people and their velocity. Using information from multiple cameras, it can predict crowd density levels several minutes in advance. Experiments in real public spaces like stations, parks and plazas demonstrate the effectiveness of the proposed AMID method for crowd analysis and prediction.