New top story on Hacker News: Vid2Seq: A pretrained visual language model for describing multi-event videos

Vid2Seq: A pretrained visual language model for describing multi-event videos
16 by og_kalu | 3 comments on Hacker News.


Comments

Popular posts from this blog

North Korea test fires two missiles month before deadline for US to respond on talks

Democratic debate winners and losers: Elizabeth Warren triumphs while Beto O'Rourke flounders

Hong Kong Train Disruptions Show Protests Becoming Daily Affair