
Arctic-Extract: Compact, Efficient and State-of-the-Art Vision-Language Processing

Extracting structured information from business documents, such as invoices, contracts and scanned records, has long been a challenging problem in applied AI. Traditionally, this has required complex multistage pipelines: most document AI systems first parse text with OCR, then pass the result to a language model.

In a previous engineering blog post, the Snowflake AI Research team introduced the initial version of Arctic-Extract, a vision-language model (VLM) for high-fidelity document extraction.

Here, we present the next evolution of that work. This new version of Arctic-Extract has been refined into a compact 6.6 GiB VLM that performs document understanding in a single step, jointly reasoning over visual and textual information without relying on external OCR. We benchmarked Arctic-Extract against leading proprietary and open source models, and despite its small size, it achieved state-of-the-art accuracy at scale, processing up to 125 A4 pages on a single 24 GB A10 GPU.

This end-to-end design powers the AI_EXTRACT feature in Snowflake, enabling users to turn unstructured documents into structured data that can be queried directly through Cortex AI Functions. While earlier models like Arctic-TILT established the groundwork for state-of-the-art document understanding at Snowflake, Arctic-Extract now represents a major step forward in accuracy, efficiency and capability.

In this post, we explore the research and engineering behind Arctic-Extract, including its architecture, optimization techniques, training data and benchmark results that show how efficiency and top-tier performance can coexist in large-scale document AI. Read the full technical paper on arXiv.

Architecture: Built for scale and high-fidelity understanding

Arctic-Extract is built upon the Qwen 2.5-VL architecture and optimized for the structure and complexity of real business documents. 

A core innovation lies in its efficient token compression, which merges every four vision tokens into one. This technique allows a standard A4 page to be represented by approximately 1,000 tokens, fully leveraging the model's vast 128,000-token context window for robust multipage reasoning.
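A quick back-of-envelope sketch makes the token budget concrete. The figures below come from this post (4-to-1 compression, ~1,000 tokens per page, a 128,000-token context); the 4,000-token raw page size is an assumption implied by those numbers, not a published constant:

```python
# Back-of-envelope token budget for Arctic-Extract's 4-to-1 vision-token
# compression, using the figures quoted in this post.

CONTEXT_WINDOW = 128_000     # model context length, in tokens
TOKENS_PER_PAGE_RAW = 4_000  # assumed raw vision tokens per A4 page (4 x ~1,000)
COMPRESSION_RATIO = 4        # 4 visual tokens merged into 1

tokens_per_page = TOKENS_PER_PAGE_RAW // COMPRESSION_RATIO  # ~1,000 per page
max_pages_by_context = CONTEXT_WINDOW // tokens_per_page    # pages that fit

print(tokens_per_page)       # 1000
print(max_pages_by_context)  # 128
```

The context window alone would admit roughly 128 pages; the 125-page figure quoted for a single 24 GB A10 reflects the practical memory-bound limit.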

| Aspect | Details |
| --- | --- |
| Base architecture | Qwen 2.5-VL |
| Compression | 4 visual tokens → 1 token |
| Context length | 128,000 tokens |
| Number of parameters | 7 billion |
| Hardware | 8 × NVIDIA H200 (141 GB each) |
| Optimizer / Precision | AdamW / bf16 |
| Quantization | 4-bit AWQ |
| Final model size | 6.6 GiB |

To push efficiency even further, the model incorporates two pivotal optimization techniques:

  1. LoRA fine-tuning enables rapid adaptation to domain-specific document tasks without modifying the full model weights. It reduces compute and memory overhead while maintaining accuracy across a wide range of document types.

  2. 4-bit AWQ quantization reduces memory usage while preserving high fidelity.
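A rough weight-only estimate illustrates why 4-bit quantization matters. This is a naive calculation, not the actual packaging math: real AWQ checkpoints add per-group scales and zero points, and the reported 6.6 GiB artifact presumably also includes components kept at higher precision:

```python
# Rough weight-only memory estimate: bf16 vs. 4-bit quantized weights
# for a 7B-parameter model. Real AWQ checkpoints add quantization
# metadata and may keep some layers (e.g. the vision encoder) in bf16,
# which is why the shipped model is larger than the naive 4-bit figure.

PARAMS = 7_000_000_000
GIB = 1024 ** 3

bf16_gib = PARAMS * 2 / GIB    # 2 bytes per parameter
int4_gib = PARAMS * 0.5 / GIB  # 4 bits = 0.5 bytes per parameter

print(f"bf16 weights:  ~{bf16_gib:.1f} GiB")   # ~13.0 GiB
print(f"4-bit weights: ~{int4_gib:.1f} GiB")   # ~3.3 GiB
```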

Training data: Focus on business extraction

Arctic-Extract was trained on a rich, diverse corpus of 372,544 data points spanning 35 data sets, specifically curated to cover question answering (QA), table extraction and multilingual understanding.

| Task category | Data sets | Samples |
| --- | --- | --- |
| Table extraction (TE) | 13 | 33,916 |
| Question answering (QA) | 13 | 112,901 |
| Multilingual understanding | 9 | 225,735 |
| Total | 35 | 372,544 |

The Table Extraction (TE) data sets are a key novelty of this work. We created new data sets that map unstructured business documents to normalized tabular formats. Their creation required iterative manual annotation, continuous feedback loops and synthetic augmentation to ensure coverage of the complex and rare table structures found in real-world business scenarios.

Together, these architectural and data decisions form the foundation of a highly efficient VLM. To assess how well this efficiency translates into real-world performance, we benchmarked Arctic-Extract across a comprehensive suite of document-understanding tasks.

Benchmark results across four essential document tasks

Arctic-Extract was rigorously benchmarked against leading models, including large proprietary systems like GPT-5 and Claude 4 Sonnet, as well as other open source models like Qwen 2.5-VL and Llama 3.1 (405B). All evaluations used ANLS* as the primary metric and covered four core dimensions of document understanding: visual reasoning, multilingual question answering, table extraction and English-text comprehension.

Across these benchmarks, Arctic-Extract delivers performance competitive with and often exceeding significantly larger multimodal systems.
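For readers unfamiliar with the metric: ANLS (average normalized Levenshtein similarity) scores a predicted answer by its string similarity to the reference, zeroing out anything below a 0.5 threshold; the ANLS* variant used here extends this to lists and structured outputs. Below is a minimal sketch of the base metric written for this post, not the evaluation code used in the paper:

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance via the classic dynamic-programming recurrence."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]

def anls(prediction: str, gold: str, threshold: float = 0.5) -> float:
    """Normalized Levenshtein similarity, zeroed below the threshold."""
    p, g = prediction.strip().lower(), gold.strip().lower()
    if not p and not g:
        return 1.0
    nls = 1.0 - levenshtein(p, g) / max(len(p), len(g))
    return nls if nls >= threshold else 0.0

print(anls("Snowflake Inc.", "Snowflake Inc"))  # ~0.93: edit distance 1 over length 14
print(anls("abc", "xyz"))                       # 0.0: below the 0.5 threshold
```

The threshold makes the metric forgiving of minor OCR-style noise while still rejecting answers that are substantively wrong.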

Visual understanding

Developing next-generation solutions for complex document understanding is challenging. It encompasses tasks like visual question answering (VQA) and information extraction from documents with highly complex and varied layouts, and it demands systems that can accurately interpret both the textual and visual information within a document to derive comprehensive, structured data.

Figure 1: Arctic-Extract achieved the highest visual understanding score among all models evaluated, outperforming GPT-5, Claude 4 Sonnet, Qwen 2.5 VL, Arctic-TILT and Pixtral 12B.

As shown in Figure 1, Arctic-Extract demonstrated superior capability across a diverse range of document-understanding tasks on our internal visual data sets, achieving the highest average score of any model evaluated. Its lead was most pronounced on the most complex extraction and comprehension challenges inherent in real-world document processing, where it outperformed competing models in both accuracy and operational efficiency.

This performance reflects the effectiveness of its end-to-end architecture and its robust ability to interpret both the structure and content of real-world enterprise documents.

Multilingual understanding

Multilingual QA presents a significant challenge for natural-language processing models, requiring them to accurately understand and process information across a wide array of languages with distinct grammatical structures and linguistic nuances. Effectively addressing this complexity is crucial for global applications.

Figure 2: Arctic-Extract achieved the highest multilingual accuracy among all evaluated models, outperforming larger systems such as LLaMA 3.1 405B, GPT-5, Claude 4 Sonnet and Qwen 2.5 VL.

As shown in Figure 2, Arctic-Extract demonstrates superior multilingual performance across a diverse range of languages, achieving the highest average score among all models tested.

The model exhibited particularly strong capabilities in major European languages, including French, German and Italian QA tasks. Arctic-Extract also showed robust competency in Asian languages, leading in Korean and proving highly competitive in Japanese. The model consistently secured the top rank across the majority of languages tested, including English, Spanish, Chinese, Greek and Romanian, highlighting its consistent global strength in QA.

Table extraction

Transforming unstructured document content into structured tables is a difficult task that requires understanding spatial relationships, contextual cues and the organizational patterns found in business documents such as invoices, contracts and reports.

Earlier models like Arctic-TILT established an initial baseline for this capability, and Arctic-Extract builds on that foundation. Arctic-Extract delivers TE performance that closely matches Arctic-TILT while surpassing significantly larger multimodal models.

Figure 3: Arctic-Extract achieves near-best-in-class performance on table extraction, closely matching Arctic-TILT and outperforming larger models, including Claude 4 Sonnet, GPT-5, Qwen 2.5 VL and Pixtral 12B.

As shown in Figure 3, Arctic-Extract performs near the top on table extraction benchmarks, accurately identifying and converting complex layouts, including multipage and nested tables, into clean and structured data. This capability simplifies downstream processing, supports advanced analytics workflows, and enables reliable extraction from a wide range of enterprise documents.

English-text understanding

The SQuAD2.0 data set serves as a widely used benchmark for measuring how effectively large language models can understand and answer questions. It builds upon the original SQuAD data set by including both answerable questions and intentionally unanswerable ones, challenging models not only to provide accurate responses when possible but also to recognize when no valid answer exists. This makes SQuAD2.0 a rigorous tool for assessing a model’s comprehension, reasoning ability and robustness in real-world QA scenarios.

Figure 4: Arctic-Extract achieves English text performance that is highly competitive with much larger models while operating with greater resource efficiency.

As Figure 4 shows, Arctic-Extract achieves ANLS* scores highly competitive with those of significantly larger models while maintaining superior resource efficiency, a notable result given its compact size.

Simplicity of use

The example below shows how simple Arctic-Extract is to use through the AI_EXTRACT function in Snowflake Cortex AI.

In a single command, a user passes a document file and a list of natural-language questions, and the model returns structured JSON output with the extracted key-value pairs. For more details, see the documentation.

Extract information from an input string

SELECT AI_EXTRACT(
    text => 'John Smith lives in San Francisco and works for Snowflake',
    responseFormat => {
        'name': 'What is the first name of the employee?',
        'city': 'Where does the employee live?'
    }
);

Extract information from a file

AI_EXTRACT(<file>, <responseFormat>)

SELECT AI_EXTRACT(
    file => TO_FILE('@db.schema.files', 'document.pdf'),
    responseFormat => [
        ['name', 'What is the first name of the employee?'],
        ['city', 'Where does the employee live?']
    ]
);

A new standard for efficient document AI

The Arctic-Extract technical paper demonstrates that strategic architectural design and targeted optimization can achieve state-of-the-art document understanding without the massive computational overhead of typical large multimodal models.

By seamlessly merging visual and textual reasoning into a highly compact and efficient design, Arctic-Extract sets a new standard for performance in multilingual, tabular and visual document tasks. This work lays crucial groundwork for the next generation of efficient, scalable document AI research and deployment at Snowflake.
