August 10, 2025 · @elonmusk
“Our V7 foundation model, which finished pre-training last week, is natively multimodal. It processes a video/audio bitstream directly, understanding it without converting it into anything, so, for example, it will finally understand nuances in how you speak that convey mood and”
Engagement vs. median curated tweet
Likes: 4.3K
Retweets: 798
Replies: 527
Other tweets from the same period
“RT @cb_doge: Hey @Apple, How is it possible that the #1 ranked app is hard to find and missing from …”
Aug 13, 2025 · 0 likes
“RT @xmuse_: This is my photo, turned into a video by Grok Imagine. https://t.co/p543u6BvdK”
Aug 13, 2025 · 0 likes
“RT @lyn_beatz: 𝕲𝖒𝕏 Grok Imagine https://t.co/gcfs7jnuN1”
Aug 13, 2025 · 0 likes
“RT @grok: Turn old photos into videos and see friends and family come to life. Try Grok Imagine, fre…”
Aug 13, 2025 · 0 likes
