Mengliu Zhao
Sep 13, 2024

--

It's a multi-modal model, which is supposed to learn and predic images and texts simultaneously. Multi-modal models can eventually be used for text-image translation and vice versa.

--

--

Mengliu Zhao
Mengliu Zhao

Written by Mengliu Zhao

Wish I can pay and get more humbleness

No responses yet