The short version

On December 2, 2025 at AWS re:Invent in Las Vegas, Amazon unveiled the Nova 2 model family and a new build service called Nova Forge. Nova 2 arrives in tiers—Lite (GA), Pro (preview), Omni (preview) and Sonic (GA)—with long-context reasoning, built‑in tools, and multimodal options. Nova Forge lets enterprises “open up” the training pipeline by starting from early Nova checkpoints to create domain‑expert models. Nova 2, Nova 2 Omni, Nova 2 Sonic, and Nova Forge are live or in preview now.

In parallel, AWS deepened its NVIDIA partnership: it will adopt NVLink Fusion—a chiplet‑based, rack‑scale interconnect platform—for future Trainium4 accelerators (and even Graviton CPUs and the Nitro System), tying AWS custom silicon into NVIDIA’s NVLink 6 fabric and MGX rack architecture. The move aims to shorten time‑to‑market for cloud‑scale AI infrastructure.

Stage-style collage of AWS Nova 2 models, Nova Forge pipeline, and NVIDIA NVLink Fusion fabric connecting racks

Nova 2, at a glance

Amazon’s second‑generation models focus on controllable reasoning, longer context, and built‑in “do‑more” tools.

  • Extended thinking with selectable budgets (low/medium/high) for step‑by‑step reasoning and task decomposition.
  • Built‑in tools such as a code interpreter and web grounding to fetch and validate current information.
  • A one‑million‑token context window across Nova 2 variants, with multimodal inputs varying by model.
1,000,000 tokens
Nova 2 context windowSource: aws-whats-new-nova2-omni-2025-12-02

Nova 2 models overview

ModelInputsOutputsHighlightsContextStatusTypical use
Nova 2 LiteText, images, video, documentsTextFast, cost‑efficient reasoning; code interpreter and web groundingUp to 1M tokensGA in BedrockHigh‑volume chat, document Q&A, process automation
Nova 2 ProText, images, video, documentsTextHighest intelligence for multi‑step agentic tasksUp to 1M tokensPreview (via Nova Forge early access)Complex analysis, software migrations
Nova 2 OmniText, images, video, speechText + imagesMultimodal reasoning plus image generation/editing; 200+ text languages; speech understandingUp to 1M tokensPreviewRich content creation, video analysis, voice + vision workflows
Nova 2 SonicSpeech (plus cross‑modal voice↔text)SpeechReal‑time, low‑latency conversational AI with multilingual “polyglot” voicesUp to 1M tokensGA (regions vary)Contact centers, live voice agents

Sources: AWS What’s New posts and Nova docs.

Nova Forge: from fine‑tuning to “open training”

Nova Forge is a new path for enterprises that want expert, domain‑tuned models without starting from scratch. Builders can begin from pre‑, mid‑, or post‑training Nova checkpoints, blend proprietary data with Amazon‑curated corpora, run reinforcement fine‑tuning with their own reward functions, and apply responsible‑AI guardrails—all integrated with SageMaker AI. Early access to new Nova models (including Nova 2 Pro and Omni) is part of the package; GA launches first in US East (N. Virginia), with more regions to follow.

Industry reception points to a shift in priorities. As Amazon’s Rohit Prasad put it, today’s public leaderboards are “noisy” and less meaningful than real‑world utility—a stance consistent with Forge’s focus on bespoke capability. Reddit has already used Forge to train a platform‑expert safety model from decades of moderation data.

NVIDIA NVLink Fusion + AWS: the hardware story behind the software

NVIDIA’s NVLink Fusion is a chiplet and rack‑scale platform that lets custom ASICs snap into the NVLink scale‑up fabric and NVLink Switch—now in its sixth generation—alongside NVIDIA’s MGX rack architecture. For hyperscalers, it offers native memory semantics, in‑network reductions (SHARP), and a huge all‑to‑all topology—up to 72 custom ASICs at roughly 3.6 TB/s per‑ASIC, for 260 TB/s of scale‑up bandwidth. AWS says it will use NVLink Fusion with next‑gen Trainium4, and even connect Graviton CPUs and the Nitro System, to accelerate time‑to‑market for AI “factories.”

Why care? Because software velocity depends on infrastructure. The tighter scale‑up links inside a node (NVLink) and the standardization of rack‑level components (MGX) can reduce the time it takes AWS to stand up new, larger clusters—good news if you plan to train or serve Nova‑based agents at scale.

Conceptual NVLink Fusion rack: Trainium4 and NVIDIA GPU sleds interconnected via NVLink Switch in an MGX-style rack

What this means for builders and ops teams

  • If you’re building agents today, start small on Nova 2 Lite, then evaluate Pro for multi‑step tasks and Omni when you need image generation or video reasoning. Nova Sonic is the obvious pick for live voice.
  • If you need a domain‑expert model (legal, safety, trading, claims): pilot Nova Forge using a narrow, high‑signal dataset; define reward functions and quality gates before you scale.
  • Align infra plans with AWS’s roadmap. Trainium3 UltraServers are available now and target 4.4x more compute and 4x better energy efficiency versus the prior gen; NVLink Fusion will underpin the next Trainium era. Consider latency, memory, and parallelism choices early.
4.4x
Trn3 UltraServers compute upliftSource: about-amazon-reinvent-2025

Competitive context—without the leaderboard drama

Coverage from WIRED and others notes Amazon’s claim that Nova 2 can match or exceed top‑tier models on select benchmarks—but the more interesting bet is that customization (via Forge) beats leaderboard climbing for enterprise outcomes. As The Verge summarizes, Amazon wants us to judge models by “real‑world utility,” not just eval sheets.


Sources