An AI agent is just as correct, related and well timed as the information that powers it.
Now usually obtainable, NVIDIA NeMo microservices are serving to enterprise IT rapidly construct AI teammates that faucet into information flywheels to scale worker productiveness. The microservices present an end-to-end developer platform for creating state-of-the-art agentic AI techniques and frequently optimizing them with information flywheels knowledgeable by inference and enterprise information, in addition to consumer preferences.
With a knowledge flywheel, enterprise IT can onboard AI brokers as digital teammates. These brokers can faucet into consumer interactions and information generated throughout AI inference to repeatedly enhance mannequin efficiency — turning utilization into perception and perception into motion.
Constructing Highly effective Information Flywheels for Agentic AI
With no fixed stream of high-quality inputs — from databases, consumer interactions or real-world indicators — an agent’s understanding can weaken, making responses much less dependable and brokers much less productive.
Sustaining and enhancing the fashions that energy AI brokers in manufacturing requires three sorts of information: inference information to collect insights and adapt to evolving information patterns, up-to-date enterprise information to offer intelligence, and consumer suggestions information to advise if the mannequin and software are performing as anticipated. NeMo microservices assist builders faucet into these three information sorts.
NeMo microservices pace AI agent improvement with end-to-end instruments for curating, customizing, evaluating and guardrailing the fashions that drive their brokers.
NVIDIA NeMo microservices — together with NeMo Customizer, NeMo Evaluator and NeMo Guardrails — can be utilized alongside NeMo Retriever and NeMo Curator to ease enterprises’ experiences constructing, optimizing and scaling AI brokers by means of customized enterprise information flywheels. For instance:
- NeMo Customizer accelerates giant language mannequin fine-tuning, delivering as much as 1.8x increased coaching throughput. This high-performance, scalable microservice makes use of common post-training methods together with supervised fine-tuning and low-rank adaptation.
- NeMo Evaluator simplifies the analysis of AI fashions and workflows on customized and trade benchmarks with simply 5 software programming interface (API) calls.
- NeMo Guardrails improves compliance safety by as much as 1.4x with solely half a second of further latency, serving to organizations implement sturdy security and safety measures that align with organizational insurance policies and tips.
With NeMo microservices, builders can construct information flywheels that enhance AI agent accuracy and effectivity. Deployed by means of the NVIDIA AI Enterprise software program platform, NeMo microservices are straightforward to function and might run on any accelerated computing infrastructure, on premises or within the cloud, with enterprise-grade safety, stability and help.
The microservices have grow to be usually obtainable at a time when enterprises are constructing large-scale multi-agent techniques, the place tons of of specialised brokers — with distinct targets and workflows — collaborate to sort out advanced duties as digital teammates, working alongside staff to help, increase and speed up work throughout capabilities.
This enterprise-wide influence positions AI brokers as a trillion-dollar alternative — with purposes spanning automated fraud detection, shopping assistants, predictive machine upkeep and doc evaluation — and underscores the vital function information flywheels play in remodeling enterprise information into actionable insights.

Trade Pioneers Increase AI Agent Accuracy With NeMo Microservices
NVIDIA companions and trade pioneers are utilizing NeMo microservices to construct responsive AI agent platforms in order that digital teammates might help get extra executed.
Working with Arize and Quantiphi, AT&T has constructed a sophisticated AI-powered agent utilizing NVIDIA NeMo, designed to course of a information base of practically 10,000 paperwork, refreshed weekly. The scalable, high-performance AI agent is fine-tuned for 3 key enterprise priorities: pace, price effectivity and accuracy — all more and more vital as adoption scales.
AT&T boosted AI agent accuracy by as much as 40% utilizing NeMo Customizer and Evaluator by fine-tuning a Mistral 7B mannequin to assist ship customized companies, forestall fraud and optimize community efficiency.
BlackRock is working with NeMo microservices for agentic AI capabilities in its Aladdin tech platform, which unifies the funding administration course of by means of a standard information language.
Teaming with Galileo, Cisco’s Outshift workforce is utilizing NVIDIA NeMo microservices to energy a coding assistant that delivers 40% fewer device choice errors and achieves as much as 10x sooner response occasions.
Nasdaq is accelerating its Nasdaq Gen AI Platform with NeMo Retriever microservices and NVIDIA NIM microservices. NeMo Retriever enhanced the platform’s search capabilities, resulting in as much as 30% improved accuracy and response occasions, along with price financial savings.
Broad Mannequin and Companion Ecosystem Help for NeMo Microservices
NeMo microservices help a broad vary of common open fashions, together with Llama, the Microsoft Phi household of small language fashions, Google Gemma, Mistral and Llama Nemotron Extremely, at the moment the highest open mannequin on scientific reasoning, coding and sophisticated math benchmarks.
Meta has tapped NVIDIA NeMo microservices by means of new connectors for Meta Llamastack. Customers can entry the identical capabilities — together with Customizer, Evaluator and Guardrails — through APIs, enabling them to run the complete suite of agent-building workflows inside their atmosphere.
“With Llamastack integration, agent builders can implement information flywheels powered by NeMo microservices,” mentioned Raghotham Murthy, software program engineer, GenAI, at Meta. “This enables them to repeatedly optimize fashions to enhance accuracy, enhance effectivity and scale back complete price of possession.”
Main AI software program suppliers comparable to Cloudera, Datadog, Dataiku, DataRobot, DataStax, SuperAnnotate, Weights & Biases and extra have built-in NeMo microservices into their platforms. Builders can use NeMo microservices in common AI frameworks together with CrewAI, Haystack by deepset, LangChain, LlamaIndex and Llamastack.
Enterprises can construct information flywheels with NeMo Retriever microservices utilizing NVIDIA AI Information Platform choices from NVIDIA-Licensed Storage companions together with DDN, Dell Applied sciences, Hewlett Packard Enterprise, Hitachi Vantara, IBM, NetApp, Nutanix, Pure Storage, VAST Information and WEKA.
Main enterprise platforms together with Amdocs, Cadence, Cohesity, SAP, ServiceNow and Synopsys are utilizing NeMo Retriever microservices of their AI agent options.
Enterprises can run AI brokers on NVIDIA-accelerated infrastructure, networking and software program from main system suppliers together with Cisco, Dell, Hewlett Packard Enterprise and Lenovo.
Consulting giants together with Accenture, Deloitte and EY are constructing AI agent platforms for enterprises utilizing NeMo microservices.
Builders can obtain NeMo microservices from the NVIDIA NGC catalog. The microservices might be deployed as a part of NVIDIA AI Enterprise with extended-life software program branches for API stability, proactive safety remediation and enterprise-grade help.