Publications

Find more on my Google Scholar.

Multimodal Large-Scale Pre-Training

Apple Intelligence and Foundation Model

Video Foundation Model

2024

2023

2022

2021

2020

Abstract & Poster