Alibaba releases Qwen-AgentWorld — Language World Models for general agents
Qwen-AgentWorld is the first model trained to predict the next environment state (not the next action) across seven agent domains. Two MoE sizes, 256K context, Apache 2.0.
xAI added /goal to Grok Build — a long-running autonomous mode that plans, executes and verifies multi-step coding tasks with status, pause, resume and clear controls.
OpenAI detailed how it preserved private network boundaries while supporting MCP streaming, authentication and an inspectable client, so enterprises don't have to expose internal MCP servers to the public internet.
OpenAI publishes a guide positioning Codex Remote as an 'engineering control plane': the phone acts as a controller for Codex sessions running on Macs, Windows machines or devboxes. The guide covers hosts, worktrees, goals, side chats and queued prompts.
xAI becomes the third independent lab on Bedrock after Anthropic and OpenAI. Grok 4.3 comes with a 1M-token context window, configurable reasoning effort, and pricing of $1.25 input / $2.50 output per million tokens.
Free Grok add-in for Word, Excel and PowerPoint with live web research, inline source citations, X data lookups, Mermaid diagrams, and connectors for email, SharePoint and Google Drive.
Grok models are now natively available in Databricks Agent Bricks. Agents can reason directly over Lakehouse data without exfiltration, with governance through Unity Catalog.
OpenAI added credit usage analytics — broken down by user, product, and model — plus monthly spend limits configurable per workspace, group, or individual for ChatGPT Enterprise and Edu. The goal: give CFOs and IT clearer visibility into who actually burns the AI budget.
xAI launched a free Grok add-in for Microsoft Word and made Grok 4.3 available on Amazon Bedrock. xAI claims the model has the lowest hallucination rate among frontier models, along with a 1M-token context window and configurable reasoning effort.
Demo a task once on a Mac and Codex bundles it into a parameterized skill — no scripting, no RPA, no low-code, just a demo.
The first xAI model on Bedrock — Grok 4.3 with a 1M context window, configurable reasoning effort, $1.25 in / $2.50 out per MTok. Cheapest US-lab frontier reasoner on Bedrock.
OpenAI launched OpenAI for Healthcare — a product suite for healthcare organizations with HIPAA support — and improved health intelligence in ChatGPT, which now sees 230M+ weekly health-related queries.
OpenAI made GPT-5.5, GPT-5.4 and Codex available via Amazon Bedrock — ending Azure's hyperscaler exclusivity for OpenAI APIs.
ChatGPT now auto-converts pastes over 10k characters into attachments for Free and Go users to save context. Enterprise/Edu plans gained Slack connector actions that let ChatGPT join channels, create reminders, upload files, and update Slack profiles.