Sitemap - 2025 - AI Brews

Eleven v3, OpenAudio S1, Higgsfield Speak, Self-improving coding agent, FLUX.1 Kontext, Modify Video, Runner H, Perplexity Labs, Mistral Code, Chatterbox and more

Claude Opus 4 & Claude Sonnet 4, Gemini Diffusion, Veo3, Imagen 4, Jules, NLWeb, BAGEL, Devstral, safe vibe coding, Matrix-Game, Lyria RealTime API and more

AlphaEvolve, Psyche, Windsurf SWE-1, HunyuanCustom, GenSpark's Download Agent, Step1X-3D, Meta 3D AssetGen 2.0, HealthBench, ElevenLab's Soundboard, Maunus Image Generation, Higgsfield Ads and more

Claude Integrations, Qwen3, Chai by Langbase, agentic commerce,Phi-4 Reasoning, LlamaFirewall, Kimi-Audio, Gen-4 References, DeepWiki by Cognition, F Lite, Dia, Suno v4.5 ,Xiaomi MiMo-7B and more

o3 & o4-mini, Bytedance's Seaweed & UI-TARS-1.5, GPT‑4.1, Gemini 2.5 Flash, first open-source native 1-bit LLM, DataDecide, Convex Chef, Grok Studio, Kling 2.0 Master & Kolors 2.0, Codex CLI and more

Llama 4, Nova Sonic and Nova Reel 1.1, Cogito v1, HiDream-I1, Agent2Agent (A2A) Protocol, Deep Research for arXiv, fully Open-Source 14B Coder at o1 Level, AutoRAG, MCP security issues, and more

Gemini 2.5 Pro, Qwen2.5-Omni, GPT-4o with native image generation, Reve Image 1.0, Anthropic's AI microscope, first real-time speech-to-speech VSM, Ideogram 3.0 and more

All-in-one model for video creation & editing, DeepWork in Proxy, Gemini Robotics, Gemma 3, Native image generation in Gemini 2.0 Flash, Reka Flash 3, Command A, Figma to Bolt, AgentExchange and more

Manus AI, Grounded language model, Tavus' Conversational Video Interface, Jamba 1.6, QwQ 32B, Mistral OCR, Character-3, audio-to-video model, Sesame's voice model, Aya Vision, Browser Operator & more

Diffusion large language model, GPT‑4.5, 3.7 Sonnet, Wan2.1 open-source video model, Phi-4-multimodal, Proxy Lite, Omni-capable text and voice engine, Poe Apps and App Creator, FastRTC,Scribe and more

Multi-robot collaboration,Grok 3 , smallest video language model, Generative AI Model for Gameplay, AI co-scientist, Mistral Saba, Fiverr Go, Step-Video-T2V and Step-Audio, Pikaswaps & more

New unified reasoning and intuitive language model, Video Ads Foundation Models, Agent Leaderboard, 1.6B open-source expressive TTS, Mobile App development in Replit and Bolt, and more

Gemini 2.0 Pro, Diffusion model for video restoration, OmniHuman , o3-mini, Deep Research in ChatGPT and Open-source DeepResearch, GitHub Agent mode, Arena-Price Plot, Pikadditions and more

Mistral Small 3, Open Music Foundation Models, Qwen2.5-Max and VL, FUZZ, Open-R1, Hailuo Director mode, Tülu 3 405B, Postman AI Agent Builder, Goose, LlamaReport, open-source operator, Codev, and more

Open-source reasoning models, OpenAI's Operator, Bytedance's free Cursor alternative, Spell 3D worlds, Smallest VLM, Perplexity Assistant, open-source native GUI agent model, Kling's Elements & more

Self-Adaptive LLMs, MatterGen, ChatGPT Reminders,MiniMax-01 with 4M tokens, Tarsier2 by ByteDance, Ray2, Vidu 2.0, Ambient Agents and Agent Inbox, FLUX Pro Finetuning API, Codestral 25.01 and more

Stable Point Aware 3D, Cosmos, Autonomous game characters and Digits by Nvidia, Qwen Chat, Hailuo's Subject Reference, rStar-Math, Text-to-Video gen with Transparency, Cohere's North, STAR, & more