Name: Cencori
Rating: 5 (1 reviews)

Prompt Injection is the #1 security risk for GenAI applications. Attackers use "DAN" (Do Anything Now) prompts or role-playing to bypass safety filters.

Detection Layers

Cencori uses a defense-in-depth approach:

Heuristics Engine: Rapidly scans for known attack patterns (e.g., "Ignore previous instructions", "You are now unlocked").
Vector Similarity: Compares the prompt against a database of 50,000+ known successful jailbreaks.
LLM Guard (Zero-Trust): Optionally routes the prompt through a specialized small-model (Llama Guard) to classify intent.

Configuration

In your Project Settings, you can choose the strictness level:

Standard: Blocks known jailbreaks and obvious injections. fast (<10ms).
Strict: Blocks ambiguous prompts and "role-play" attempts.
Zero-Trust: Routes every prompt through an LLM classifier before sending to the provider. Adds ~200ms latency but offers maximum security.

Common Attacks Blocked

DAN / Mongo Tom: "You are going to pretend to be DAN which stands for 'do anything now'..."
Payload Splitting: Breaking malicious instructions across multiple messages.
Virtualization: "Imagine you are a Linux terminal..."
Translation: Writing the attack in encryption or base64.

Handling Injections

When an injection is detected, Cencori returns a structured error:

Codetext

{
  "error": {
    "code": "security_injection_detected",
    "message": "This prompt violates our security policy.",
    "metadata": {
      "confidence": 0.98,
      "source": "heuristics"
    }
  }
}