This paper introduces InternVL 1.5, an open-source multimodal large language model that aims to close the capability gap with proprietary commercial models. It does so through three key improvements: a strong vision encoder refined through continuous learning and reusable across LLMs, dynamic high-resolution input that splits images into 448×448 tiles, and a high-quality bilingual (English–Chinese) dataset. Evaluated on 18 multimodal benchmarks, it achieves state-of-the-art results on 8 of them, showing that the gap with proprietary models has narrowed.
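The dynamic high-resolution idea can be sketched as follows: pick a grid of fixed-size tiles whose aspect ratio best matches the input image, subject to a cap on the tile count, then resize and cut the image along that grid. This is a minimal illustration, not the paper's exact implementation; the `MAX_TILES` cap and the helper names are assumptions for the sketch.

```python
# Minimal sketch of dynamic high-resolution tiling (assumed details,
# not InternVL 1.5's exact code): choose a (cols, rows) grid of
# 448x448 tiles whose aspect ratio is closest to the image's.

TILE = 448       # tile side length, as in InternVL 1.5
MAX_TILES = 12   # assumed cap on tiles per image for this sketch

def best_grid(width, height, max_tiles=MAX_TILES):
    """Return (cols, rows) whose aspect ratio best matches the image."""
    target = width / height
    candidates = [(c, r)
                  for c in range(1, max_tiles + 1)
                  for r in range(1, max_tiles + 1)
                  if c * r <= max_tiles]
    return min(candidates, key=lambda cr: abs(cr[0] / cr[1] - target))

def tile_boxes(width, height):
    """Pixel boxes to crop after resizing to (cols*TILE, rows*TILE)."""
    cols, rows = best_grid(width, height)
    return [(c * TILE, r * TILE, (c + 1) * TILE, (r + 1) * TILE)
            for r in range(rows) for c in range(cols)]
```

For example, a square image maps to a single tile, while a wide image is cut into a row of tiles; each tile is then encoded separately by the vision encoder.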