New top story on Hacker News: Vid2Seq: A pretrained visual language model for describing multi-event videos

Vid2Seq: A pretrained visual language model for describing multi-event videos
16 by og_kalu | 3 comments on Hacker News.


Comments

Popular posts from this blog

Northeastern US braces for foot of snow during first days of December

North Korea test fires two missiles month before deadline for US to respond on talks

Democratic debate winners and losers: Elizabeth Warren triumphs while Beto O'Rourke flounders