Engineering
FrameSkip: Training Vision-Language Models with Less Data, More Signal
FrameSkip offers a novel approach to efficiently train Vision-Language Models (VLMs) by strategically sampling fewer frames, demanding a deeper understanding of when and why this technique is applicable, and what trade-offs are involved compared to traditional frame-by-frame training.



















