Details

Optimizing Moment Retrieval for Educational Videos

Project Image

Year: 2026

Term: Winter

Student Name: Julie Wechsler

Supervisor: Olga Baysal / Sriram Subramanian

Abstract: This thesis designs a moment retrieval model that is designed specifically for educational videos. Moment retrieval or temporal grounding refers to the task of retrieving the relevant segment in a video given a natural language query. There are currently no temporal grounding datasets with specifically educational videos, and because of this existing methods are tailored for short action based videos instead of more educational content. Throughout this thesis I create an educational dataset by selecting only educational videos from a much larger dataset, then create a temporal grounding method tailored specifically to educational videos. This approach first segments the video into topic segments, then uses the topic boundaries as candidates for a BiLSTM based temporal grounding model.