
No Priors Ep. 24 | With Devi Parikh from Meta

Published 2023-07-20 10:00:49

Summary

Video dominates modern media consumption, but video creation is still expensive and difficult. AI-generated and edited video is a holy grail of democratized creative expression. This week on No Priors, Sarah Guo and Elad Gil sit down with Devi Parikh. She is a Research Director in Generative AI at Meta and an Associate Professor in the School of Interactive Computing at Georgia Tech. Her work focuses on multimodality and AI for images, audio and video. Recently, she worked on Make a Video 3D, also called MAV3D, which creates animations from text prompts. She is also a talented AI-generated and analog artist herself. Elad, Sarah and Devi talk about what’s exciting in computer vision, what’s blocking researchers from fully immersive Generative 4-D, and AI controllability.

00:06 - Democratizing Creative Expression With AI-Generated Video
08:31 - Challenges in Video Generation Research
15:57 - Challenges and Implications of Video Processing
20:43 - Control and Multi-Modal Inputs in Video
25:50 - Audio's Role in Visual Content
39:00 - Don't Self-Select & Devi’s tips for young researchers


Chinese and English Transcript