CAPTION-GUIDED INTERPRETABLE VIDEO ANOMALY DETECTION BASED ON MEMORY SIMILARITY