Insights

Artificial Intelligence

Machine Learning

AprielGuard: ServiceNow AI Releases Advanced 8B Safeguard for Secure Agentic Workflows

ServiceNow AI has launched AprielGuard, a specialized 8-billion parameter safeguard model designed to protect autonomous AI agents from adversarial attacks and safety risks.

NewDecoded

Published Dec 25, 2025

Dec 25, 2025

3 min read

Image by Hugging Face

ServiceNow AI has introduced AprielGuard, a high-performance safeguard model specifically engineered to secure autonomous agentic systems. Released on December 23, 2025, the 8-billion parameter model addresses the growing need for safety in multi-step AI workflows. It marks a shift from simple text moderation to a wide-ranging defense against complex, logic-based threats. Developers can access the model and related research through Hugging Face. The model identifies 16 distinct categories of safety risks, including toxicity, misinformation, and illegal activities. Beyond standard safety, it provides a dedicated defense against adversarial tactics like prompt injection and multi-turn jailbreaks. This dual-focus ensures that enterprise AI applications remain both compliant and resilient against malicious manipulation. More technical details are available in the official research paper.

Unlike traditional classifiers, AprielGuard is optimized for modern agentic ecosystems where models interact with external tools and memory. It can detect specialized attacks such as memory poisoning and tool hijacking within complex reasoning traces. This capability allows developers to monitor the internal decision-making processes of their AI agents in real time, preventing exploitation before it reaches critical systems.

Operators can choose between two primary modes of execution depending on their performance requirements. The Reasoning mode generates structured explanations for its decisions, providing valuable transparency for debugging and auditing. For production environments where speed is essential, the Fast mode offers low-latency classification without the explanatory narrative.

Built on the Apriel-1.5 architecture, the model supports a significant context window of up to 32,000 tokens. This allows it to scan extensive conversation histories and long retrieval-augmented generation documents for subtle, embedded risks. It has demonstrated strong performance across various public benchmarks, maintaining high precision in multilingual and long-form scenarios.

AprielGuard is now available for the research and developer community on the Hugging Face model hub. Its release includes detailed documentation on the synthetic data generation processes and evaluation metrics used during development. This tool represents a foundational step toward building more trustworthy and autonomous enterprise AI systems.

Decoded Take

The Evolution of AI Security Architecture

The launch of AprielGuard signifies a major transition in the industry from reactive content filtering to proactive agentic defense. As Large Language Models move toward autonomous agency, the security perimeter must expand to include internal reasoning traces and external tool interactions. This release establishes a new standard for "Guardian Agents," suggesting that future enterprise deployments will require a dedicated safety layer to manage the inherent risks of hijacking and memory poisoning. By providing a specialized, open-weights model for this purpose, ServiceNow is pushing the industry toward a modular architecture where safety is not just a fine-tuned preference but a decoupled, verifiable component of the AI stack.

Want to advertise your Data, Analytics, or AI here? Reach out!

NewDecoded

Want to advertise your Data, Analytics, or AI here? Reach out!

NewDecoded

Want to advertise your Data, Analytics, or AI here? Reach out!

NewDecoded

Share this article

News

Feb 19, 2026

Cohere Labs Launches Tiny Aya to Bring Multilingual AI Directly to Mobile Devices

News

Feb 19, 2026

Cohere Labs Launches Tiny Aya to Bring Multilingual AI Directly to Mobile Devices

News

Feb 19, 2026

Cohere Labs Launches Tiny Aya to Bring Multilingual AI Directly to Mobile Devices

News

Feb 19, 2026

Meta and NVIDIA Forge Strategic Alliance to Build Hyperscale AI Infrastructure

News

Feb 19, 2026

Meta and NVIDIA Forge Strategic Alliance to Build Hyperscale AI Infrastructure

News

Feb 19, 2026

Meta and NVIDIA Forge Strategic Alliance to Build Hyperscale AI Infrastructure

News

Feb 19, 2026

Anthropic Launches Claude Sonnet 4.6 Delivering Frontier Power at Mid-Tier Pricing

News

Feb 19, 2026

Anthropic Launches Claude Sonnet 4.6 Delivering Frontier Power at Mid-Tier Pricing

News

Feb 19, 2026

Anthropic Launches Claude Sonnet 4.6 Delivering Frontier Power at Mid-Tier Pricing

News

Feb 19, 2026

Martech Veterans Launch Kana With $15M to Automate Marketing Through Agentic AI

News

Feb 19, 2026

Martech Veterans Launch Kana With $15M to Automate Marketing Through Agentic AI

News

Feb 19, 2026

Martech Veterans Launch Kana With $15M to Automate Marketing Through Agentic AI

News

Feb 19, 2026

World Labs Secures $1 Billion to Advance Spatial Intelligence and 3D World Models

News

Feb 19, 2026

World Labs Secures $1 Billion to Advance Spatial Intelligence and 3D World Models

News

Feb 19, 2026

World Labs Secures $1 Billion to Advance Spatial Intelligence and 3D World Models

News

Jan 24, 2026

Rare Earth Recycler Cyclic Materials Raises $75 Million to Scale Low Carbon Magnet Supply for AI and EVs

News

Jan 24, 2026

Rare Earth Recycler Cyclic Materials Raises $75 Million to Scale Low Carbon Magnet Supply for AI and EVs

News

Jan 24, 2026

Rare Earth Recycler Cyclic Materials Raises $75 Million to Scale Low Carbon Magnet Supply for AI and EVs