VideoPoet

VideoPoet
VideoPoet
	"A dog eating popcorn at the cinema" "A teddy bear with a cap, sunglasses, and leather jacket playing drums" Example videos generated by the model from texts
Developer	Google
Initial release	February 8, 2024;20 months ago
Type	Large language model

Last updated October 22, 2025

VideoPoet is a large language model developed by Google Research in 2023 for video making.^[1]^[2]^[3]^[4] It can be asked to animate still images.^[5] The model accepts text, images, and videos as inputs, with a program to add feature for any input to any format generated content.^[4] VideoPoet was publicly announced on December 19, 2023.^[1] It uses an autoregressive language model.

References

1 2 Krithika, K. L. (December 20, 2023). "Google Unveils VideoPoet, a New LLM for Video Generation". Analytics India Magazine. Retrieved April 29, 2024.
↑ Kondratyuk, Dan; Yu, Lijun; Gu, Xiuye; Lezama, José; Huang, Jonathan; Hornung, Rachel; Adam, Hartwig; Akbari, Hassan; Alon, Yair; Birodkar, Vighnesh; Cheng, Yong; Chiu, Ming-Chang; Dillon, Josh; Essa, Irfan; Gupta, Agrim; Hahn, Meera; Hauth, Anja; Hendon, David; Martinez, Alonso; Minnen, David; Ross, David; Schindler, Grant; Sirotenko, Mikhail; Sohn, Kihyuk; Somandepalli, Krishna; Wang, Huisheng; Yan, Jimmy; Yang, Ming-Hsuan; Yang, Xuan; Seybold, Bryan; Jiang, Lu (December 21, 2023). "VideoPoet: A Large Language Model for Zero-Shot Video Generation". arXiv: 2312.14125 [cs.CV].
↑ "Google has introduced VideoPOET breaking new ground in coherent video generation". Gizmochina. December 21, 2023.
1 2 "VideoPoet". Google Research. Retrieved April 29, 2024.
↑ Franzen, Carl (December 20, 2023). "Google's new multimodal AI video generator VideoPoet looks incredible". VentureBeat.

External links

Media related to VideoPoet at Wikimedia Commons

This Google-related article is a stub. You can help Wikipedia by expanding it.

This large language model-related article is a stub. You can help Wikipedia by expanding it.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:1-1] 1 2 Krithika, K. L. (December 20, 2023). "Google Unveils VideoPoet, a New LLM for Video Generation". Analytics India Magazine. Retrieved April 29, 2024.

[2] Kondratyuk, Dan; Yu, Lijun; Gu, Xiuye; Lezama, José; Huang, Jonathan; Hornung, Rachel; Adam, Hartwig; Akbari, Hassan; Alon, Yair; Birodkar, Vighnesh; Cheng, Yong; Chiu, Ming-Chang; Dillon, Josh; Essa, Irfan; Gupta, Agrim; Hahn, Meera; Hauth, Anja; Hendon, David; Martinez, Alonso; Minnen, David; Ross, David; Schindler, Grant; Sirotenko, Mikhail; Sohn, Kihyuk; Somandepalli, Krishna; Wang, Huisheng; Yan, Jimmy; Yang, Ming-Hsuan; Yang, Xuan; Seybold, Bryan; Jiang, Lu (December 21, 2023). "VideoPoet: A Large Language Model for Zero-Shot Video Generation". arXiv: 2312.14125 [cs.CV].

[3] "Google has introduced VideoPOET breaking new ground in coherent video generation". Gizmochina. December 21, 2023.

[:0-4] 1 2 "VideoPoet". Google Research. Retrieved April 29, 2024.

[5] Franzen, Carl (December 20, 2023). "Google's new multimodal AI video generator VideoPoet looks incredible". VentureBeat.

[1]

[2]

[3]

[4]

[5]

VideoPoet
"A dog eating popcorn at the cinema" "A teddy bear with a cap, sunglasses, and leather jacket playing drums" Example videos generated by the model from texts
Developer	Google
Initial release	February 8, 2024;20 months ago (2024-02-08)
Type	Large language model