Perplexity Comet Voice Mode 2 — powered by GPT Realtime 1.5
Comet upgraded Voice Mode to GPT Realtime 1.5 with 25%+ better reliability, screen awareness and voice browser navigation. Rolling out on desktop and Android.
14 noviniek
Comet upgraded Voice Mode to GPT Realtime 1.5 with 25%+ better reliability, screen awareness and voice browser navigation. Rolling out on desktop and Android.
Alongside its eve framework Vercel shipped infra for long-running agents. Sandbox now runs up to 24 hours, Passport is in public beta, Connect gives agents secure access to external services, and AI Gateway added GLM 5.2.
Google added speech streaming for `gemini-3.1-flash-tts-preview` and announced shutdown dates for legacy media models. Veo retires June 30, Imagen 4 and earlier Gemini image models on August 17, 2026.
Codex on macOS records a workflow you demonstrate once and converts it into a reusable skill it can execute autonomously on demand.
Enterprise customers can now plug their own model keys into Copilot CLI. Copilot-authored pull requests now appear in author searches, and auto-generated release notes credit them to the human user.
OpenAI launched OpenAI for Healthcare — a product suite for healthcare organizations with HIPAA support — and improved health intelligence in ChatGPT, which now sees 230M+ weekly health-related queries.
Microsoft's small coding model MAI-Code-1-Flash is rolling out across eight Copilot surfaces: CLI, Copilot app, Chat on GitHub, Visual Studio, GitHub Mobile, JetBrains, Eclipse, and Xcode.
The Copilot usage metrics API now exposes an 'ai_credits_used' field with total AI credit consumption per user in both daily and 28-day reports at the enterprise and org levels.
GitHub announced the deprecation of the Opus 4.6 (fast) model in Copilot; users are advised to migrate to current Opus generations or to Microsoft's MAI-Code-1-Flash for fast tasks.
OpenAI made GPT-5.5, GPT-5.4 and Codex available via Amazon Bedrock — ending Azure's hyperscaler exclusivity for OpenAI APIs.
The native desktop home for agent-driven development hit GA on all three OSes. Voice conversations via on-device speech-to-text, integrated with session management and github.com.
Eve is a TypeScript-native open-source framework where every agent = a directory of files (instructions.md + tools/ + skills/). Built in: durable execution, per-agent sandbox, human approvals, evals, OpenTelemetry. Plus the Sandbox runtime now goes up to 24h.
Codex app v26.616 captures a workflow from a single demonstration and generates a natural-language skill description that generalizes to new inputs. Requires Computer Use.
OpenAI made its frontier models and Codex available on AWS — ending Azure's de facto exclusivity for OpenAI API workloads.