Foundation model for efficient enterprise search, fully open-source text-to-speech model, native audio understanding in Gemini 1.5 Pro, AI film competition, physical AI model, Mixtral 8×22B and more
SceneScript, automating the generation of foundation models, 01 Light, Stable Video 3D, AnimateDiff-Lightning, foundation models for self-driving and humanoid robots, NVIDIA NIM and more
Claude 3 Opus, train a 70B language model at home, Firewall for AI, fast 3D object generation from single images, multimodal foundation model for any-to-any search tasks, and more
Mistral Large, vocal expressive avatar videos, generative virtual worlds, reliable text rendering and Magic Prompt, DJ Mode, AI-powered filmmaking and more
Meta's V-JEPA vision models, OpenAI's Sora video model, Gemini 1.5 Pro with 1 million tokens context, Reka Flash, Largest text-to-speech AI model and more
Screenshots to Code Dataset, Multi Motion Brush in Gen-2, Open-source AGI, AI system that solves complex geometry problems, AI in drug discovery and more