AccueilEnglishNvidia’s Nemotron is built for AI “agents”, and it’s a direct shot...

Nvidia’s Nemotron is built for AI “agents”, and it’s a direct shot at OpenAI and Google

Nvidia showed up at CES 2025 with a message that boils down to: we’re not just selling the shovels anymore, we’re digging the hole, too.

The company unveiledNemotron, a new family of large language models designed specifically to powerautonomous AI agents, systems that don’t just chat, but actuallydothings: click through websites, call APIs, run multi-step workflows, and carry tasks across sessions without needing a human babysitter every five seconds.

Nvidia’s bet is bluntly financial. The company is aiming straight at what it pegs as a$50 billion agentic-AI market by 2028. And yes, that’s a not-so-subtle challenge to the current kings of AI mindshare,OpenAIandAnthropic, who’ve owned the “talking assistant” era.

Nemotron isn’t trying to be your chatbot. It’s trying to be your employee.

Most mainstream LLMs are optimized to generate text that sounds right. Nemotron is being pitched as optimized tofinish a job, especially jobs that require multiple steps, sequencing, and interacting with external tools.

Nvidia says Nemotron was trained on a specialized dataset that includes2.5 terabytesof “agent-environment interaction” data, basically, examples of an agent taking actions in digital spaces and learning what works. That’s a different flavor than the internet-text soup that trained many general-purpose models.

Nemotron also bakes in something Nvidia callsepisodic memory: the ability for an agent to retain and reuse information from prior sessions. Translation for normal people: the agent can remember what happened last time, so you don’t have to re-explain your whole mess every morning like it’s Groundhog Day.

Nvidia claims its internal benchmarks showabout 30% better performancethan GPT-4-style models adapted for agent tasks, especially incomplex web navigationandbusiness process automation. Internal benchmarks are marketing until proven otherwise, but the direction of travel is clear: Nvidia wants “agents” to be the next platform shift, and it wants to own the engine.

The lineup ranges from a smaller8-billion-parametermodel up to an enterprise-scale70-billion-parameterversion, an obvious play to cover everyone from scrappy startups to Fortune 50 IT departments with compliance teams and a fear of God.

Nvidia’s real move: stop being “just hardware” before AMD and Intel catch up

Nvidia built its modern empire on chips like theH100andA100. But the company isn’t blind: competitors likeAMDandIntelare pushing harder into AI accelerators, and big customers are itching to avoid being trapped in a single-vendor world where the prices feel… aspirational.

So Nvidia is widening the moat. Nemotron is part of a broader shift toward a full-stack strategy, hardwareplussoftwareplusmodels, so that even if rivals narrow the silicon gap, Nvidia still controls the ecosystem developers actually build on.

And that ecosystem is the quiet killer feature:CUDA(Nvidia’s programming platform) and tools likeTensorRT. If Nemotron runs best on Nvidia infrastructure, and Nvidia makes it easy and fast there, companies will “choose” Nvidia the way people “choose” the only gas station for 50 miles.

Meanwhile, the rest of Big Tech is already planting flags.Microsofthas been pushing enterprise automation throughCopilot Studio.Googlehas been talking up agent-like integrations (the article references “Bard Actions,” part of Google’s broader effort to wire assistants into its ecosystem). Nvidia wants to be the underlying supplier, the arms dealer for everyone building agentic systems.

Agentic AI is hot. It’s also messy, error-prone, and legally awkward.

The timing isn’t random. Companies are finally moving from “chatbots that write emails” to “systems that execute workflows.” AMcKinseysurvey published inDecember 2024found73%of companies surveyed said they plan to deploy autonomous AI agents by the end of2026, mainly forcustomer supportanddocument management.

But here’s the part vendors tend to mumble: autonomous agents screw up. AForrester Researchanalysis cited in the article pegs the average error rate at15%on complex tasks. In a demo, that’s a shrug. In a real business, where an agent can email the wrong customer, change the wrong record, or trigger the wrong workflow, that’s a lawsuit waiting for a calendar invite.

Nvidia says Nemotron includesvalidationandrollbackmechanisms, guardrails that let the system simulate an action before executing it and unwind mistakes when things go sideways. That’s the right instinct, because “move fast and break things” hits different when the thing you broke is payroll.

And Nvidia isn’t alone. Startups likeAdeptandRabbitare building agent-first products with lighter models tuned for specific use cases. That’s a problem for Nvidia: the market is fragmenting, and plenty of buyers will choose “good enough and cheaper” over “enterprise-grade and expensive,” especially after a couple years of corporate chatbot disappointments.

Nvidia’s challenge now is simple to describe and hard to pull off: prove Nemotron delivers real ROI, not just cooler demos, while convincing companies that locking into Nvidia’s stack is a feature, not a trap.

Un marché de l'agentique encore en structuration

[[EMBED_PLACEHOLDER_0]]
Un marché de l'agentique encore en structuration
Baptiste Laforge
Baptiste Laforge
"Soyez vous-même. Par-dessus tout, laissez qui vous êtes, ce que vous êtes, ce que vous croyez, briller à travers chaque phrase que vous écrivez, chaque pièce que vous terminez." - John Jakes. Ces lignes m'ont émue, je me retrouve dans l'écriture car c'est l'une des plus grandes joies pour moi. Si vous aimez lire mes articles et si vous avez des traces à modifier, alors n'hésitez pas à les partager

News

Coups de cœur