Nvidia tackles agentic AI safety and security with new NeMo Guardrails NIMs

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


As the use of agentic AI continues to grow, so too does the need for safety and security.

Today, Nvidia announced a series of updates to its NeMo Guardrails technology designed specifically to address the needs of agentic AI. The basic idea behind guardrails is to provide some form of policy and control for large language models (LLMs) to help prevent unauthorized and unintended outputs. The guardrails concept has been broadly embraced in recent years by multiple vendors, including AWS.

The new NeMo Guardrails updates from Nvidia are designed to make it easier for organizations to deploy and provide more granular types of controls. NeMo Guardrails are now available as a NIM (Nvidia Inference Microservices), which are optimized for Nvidia’s GPUs. Additionally, there are three new specific NIM services that enterprises can deploy for content safety, topic control and jailbreak detection. The guardrails have been optimized for agentic AI deployments, rather than just singular LLMs.

“It’s not just about guard-railing a model anymore,” Kari Briski, VP for enterprise AI models, software and services at Nvidia, said in a press briefing. “It’s about guard railing and a total system.”

What the new NeMo Guardrails bring to enterprise Agentic AI

Agentic AI use is expected to be a dominant trend in 2025. 

While agentic AI has plenty of benefits, it also brings new challenges, particularly around security, data privacy and governance requirements, which can create significant barriers to deployment.

The three new NeMo Guardrails NIMs are intended to help solve some of those challenges. They include:

  • Content Safety NIM: Trained on Nvidia’s Aegis content safety dataset with 35,000 human-annotated samples, this service blocks harmful, toxic and unethical content.
  • Topic Control NIM: Helps ensure that AI interactions remain within predefined topical boundaries, preventing conversation drift and unauthorized information disclosure.
  • Jailbreak Detection NIM: Helps prevent security bypasses through clever hacks, leveraging training data from 17,000 known successful jailbreaks.

Complexity of safeguarding agentic AI systems

The complexity of safeguarding agentic AI systems is significant, as they can involve multiple interconnected agents and models. 

Briski provided an example of a retail customer service agent scenario. Consider a person interacting with at least three agents, a reasoning LLM, a retrieval-augmented generation (RAG) agent and a customer service assistant agent. All are required to enable the live agent. 

“Depending on the user interaction, many different LLMs or interactions can be made, and you have to guardrail each one of them,” said Briski.

While there is complexity, she noted that a key goal with NeMo Guardrails NIMs is to make it easier for enterprises. As part of today’s rollout, Nvidia is also providing blueprints to demonstrate how the different guardrail NIMs can be deployed for varying scenarios, including customer service and retail.

How Nvidia guardrails impact agentic AI performance

Another primary concern for enterprises deploying agentic AI is performance. 

Briski said that as enterprises deploy agentic AI, there can be concern about introducing latency by adding guardrails. 

“I think as people were initially trying to add guardrails in the past, they were applying larger LLMs to try and guardrail,” she explained. 

The latest NeMo Guardrail NIMs have been fine-tuned and optimized to address latency concerns. Nvidia’s early testing shows that organizations can get 50% better protection with guardrails, which only add approximately a half second of latency.

“This is really important when deploying agents, because as we know, it’s not just one agent, there are multiple agents that could be within an agentic system,” said Briski.

Nvidia NeMo Guardrails NIMs for agentic AI are available under the Nvidia AI enterprise license, which currently costs $4,500 per GPU per year. Developers can try them out for free under an open source license, as well as on build.nvidia.com.



Source link