Skip to content

Latest Updates

The AI development landscape changes weekly. New models drop, pricing shifts, and features land across all three tools. This page tracks the updates that matter so you can stay current without watching every changelog.

The newest Claude model and the new default recommendation for all complex coding tasks:

Top SWE-Bench Scores

Best performance on SWE-Bench Verified among all available models. Superior agentic behavior and sustained reasoning over long tasks.

Enhanced Agentic Performance

Improved tool use across hundreds of tools. Better prompt injection resistance. More reliable multi-step task execution.

200K Context Window

200K token context with 64K output limit. Effort parameter for adjustable reasoning depth. Memory improvements for complex tasks.

Available Everywhere

Available in Claude Code, Cursor (via model picker), and via Anthropic API. Recommended with Max/Ultra subscription plans for full access.

Latest Claude Code improvements:

  • PDF page ranges: Specify exact pages when reading PDFs (e.g., @file.pdf:1-5)
  • Token metrics: New /usage command with detailed input/output token breakdown
  • OAuth improvements: Better error handling and refresh token management
  • MCP health checks: Automatic restart for failing MCP servers

Major agent experience update:

  • Subagents: Specialized subagents for testing, docs, refactoring. Define custom subagents in .cursor/agents/
  • Skills (SKILL.md): Reusable skill definitions with SKILL.md manifests. Share across projects and teams
  • 40x faster hooks: Near-instant hook execution
  • Cursor Blame: AI-powered git blame with context
  • Agent clarification: Agents ask clarifying questions before complex tasks
  • CLI Plan/Ask modes: cursor --plan and cursor --ask for offline usage
  • Cloud handoff: Use & suffix to hand tasks to cloud agents

The latest model powering all Codex surfaces:

  • Improved coding performance across all languages
  • Better long-context handling with automatic compaction
  • Enhanced tool use for MCP integrations
  • Available in Codex App, CLI, IDE extensions, and Cloud
  • SKILL.md support: Define custom skills in .claude/skills/
  • Session forking: Create branches from any conversation point
  • --from-pr flag: Start with context from a GitHub pull request
  • Arrow key history: Navigate prompt history with up/down arrows
  • Cloud handoff: Use & suffix to hand off tasks to cloud agents
  • Layout customization: Drag-and-drop panel arrangement
  • Multi-monitor support: Layout sync across displays
  • 25% memory reduction: Better handling of large files
  • Improved crash recovery: Enhanced stability for long sessions

Codex automations moved from beta to GA:

  • GitHub issue assignment to @codex
  • Slack integration for team channels
  • Linear ticket automation
  • Scheduled cron-based tasks
  • All automations run as sandboxed cloud agents

Cursor v2.2 — Debug Mode and Visual Editing

Section titled “Cursor v2.2 — Debug Mode and Visual Editing”

Debug Mode

Runtime log instrumentation for automatic root cause analysis. Works across multiple tech stacks and languages.

Visual Style Editor

Real-time visual design in Cursor Browser. Modify elements and colors directly in a live preview.

Multi-Agent Judging

Run parallel agents on the same task, then automatic evaluation picks the best solution.

Pinned Chats

Pin important conversations in the agent sidebar for quick reference.

  • Background agents: Run tasks while you continue working
  • Named sessions: /rename to name, /resume <name> to resume
  • .claude/rules/ directory: Support for rules alongside CLAUDE.md
  • Prompt suggestions: Claude suggests follow-up prompts (Tab to accept)
  • Model switching (Alt+P): Switch models while composing a prompt

OpenAI launched Codex Cloud — background agents running on OpenAI infrastructure:

  • Git worktree isolation for every task
  • Parallel task execution (up to 10 concurrent)
  • GitHub PR creation on completion
  • Slack and Linear notification integrations
  • First model to break 80% on SWE-Bench Verified
  • Claude Code Desktop app launched (native experience with all CLI features)
  • Pro users gained access to Opus 4.5
  • Enhanced prompt injection resistance and agent capabilities
  • 8 parallel agents: Run up to 8 agents simultaneously
  • Cursor Composer 1: 4x faster model (250 tokens/sec)
  • Sandboxed terminals: Secure execution environments (macOS)
  • Voice mode: Speech-to-text agent control
  • Replaced GPT-5.1-Codex-Max
  • Specialized for bug fixing and UI generation
  • 24+ hour task endurance via compaction
  • Available in Cursor and GitHub Copilot
  • In-editor bug detection examining your changes
  • Instant Grep (beta) for all agent grep commands
  • Improved Plan Mode with interactive clarifying questions

OpenAI open-sourced the Codex CLI:

  • Full source available on GitHub
  • Community contributions accepted
  • Plugin architecture for extensions
  • Cross-platform support (macOS, Linux, Windows)
ModelProviderContextBest ForPricing (per 1M tokens)
Claude Opus 4.6Anthropic200KDefault for all complex tasks$5 / $25
Claude Sonnet 4.5Anthropic1MBudget-conscious, large context$3 / $15
GPT-5.3-CodexOpenAI200K+All Codex surfacesSubscription-based
GPT-5.2OpenAI200K+Bug fixing, UI generation (Cursor)$1.25 / $10
Gemini 3 ProGoogle1MMultimodal, extreme context$2 / $12
Cursor Composer 1CursorTBDSpeed-critical work in CursorSubscription-based
  • GPT-4, GPT-4.1, GPT-4o (replaced by GPT-5.x series)
  • GPT-5.1-Codex-Max (replaced by GPT-5.2 and GPT-5.3-Codex)
  • Claude 3.x series (fully replaced by 4.5+ series)
  • o1-preview, o1-mini (obsolete)
  1. Atlassian MCP — Jira and Confluence integration
  2. Cloudflare MCP — Workers, R2, and D1 management
  3. Vercel MCP — Deployment automation
  4. Supabase MCP — Database and auth operations
  5. Linear MCP — Issue tracking integration
  6. Stripe MCP — Payment processing automation

The Agent Skills ecosystem has expanded significantly:

  • Skills.sh marketplace now hosts 200+ skills
  • Universal CLI: npx skills add <owner/repo> works across 35+ agents
  • Popular skills: Cloudflare deployment, code review, Atlassian integration, API documentation generation

SOC 2 Type II

Cursor and Claude Code Enterprise maintain SOC 2 Type II certification. Codex Enterprise in progress.

Enhanced Privacy

All three tools guarantee no code training at paid tiers. Enterprise plans add data residency options.

Audit Logging

Comprehensive audit trails for all AI interactions. Available on enterprise plans for all three tools.

GDPR Compliance

Full GDPR compliance with EU data residency for Cursor and Claude Code Enterprise.

Settings > Update > Auto-update. Choose “Stable” or “Beta” channel.

  1. Model updates: Claude Opus 4.6 is the new default — update your model preferences
  2. Cursor v2.4: Subagents and Skills require updating to v2.4+
  3. Claude Code v2.1+: SKILL.md support requires v2.1.17+
  4. GPT-5.3-Codex: Replaces GPT-5.2 Codex as the default for Codex surfaces
  5. API key rotation: Rotate keys older than 6 months as a security best practice
ItemDeprecatedReplacementEnd of Support
Cursor v1.xFebruary 2026v2.4+February 2026
Claude Code WSL-onlyAugust 2025Native WindowsAugust 2025
MCP v1 protocolOctober 2025MCP v2.1October 2025
GPT-5.1-Codex-MaxNovember 2025GPT-5.2 / GPT-5.3-CodexNovember 2025

Last updated: February 8, 2026