Sectum AI vs DeepTeam
TL;DR. DeepTeam is an open-source LLM red-teaming framework with a specific check —
CrossContextRetrieval(types=["tenant", "user"])— that targets cross-tenant retrieval leakage with a single prompt and an LLM-as-judge. Sectum AI is a different category: a multi-tenant infrastructure verifier that provisions synthetic tenants, plants cryptographic canary markers, runs a catalog of cross-tenant probes across 13 surfaces, and produces a tamper-evident, control-mapped evidence pack an auditor accepts. Both are useful and answer different questions.
The two products
DeepTeam (trydeepteam.com)
Category: open-source LLM red-teaming framework, built by Confident AI on top of their deepeval library.
License: Apache 2.0.
Distribution: an OSS CLI + library that runs locally, plus a commercial Confident AI platform (paid SaaS) that aggregates risk assessments and produces downloadable PDF reports.
DeepTeam simulates attacks against an LLM system — jailbreaks, prompt injection, multi-turn exploitation, PII probes — across 20+ single- and multi-turn attack patterns to surface 50+ vulnerabilities. It maps findings to frameworks including OWASP Top 10, OWASP_ASI_2026, NIST, MITRE, Aegis, and BeaverTails.
The specific check that overlaps with Sectum AI is CrossContextRetrieval(types=["tenant", "user"]): a single targeted prompt that asks the model whether it surfaces foreign-tenant or foreign-user context, scored by an LLM-as-judge as a binary outcome.
Sectum AI (sectum.ai)
Category: multi-tenant AI verification. Not LLM red-team, not a runtime guardrail, not GRC, not data-subject-rights workflow. The deliverable is auditor-acceptable, tamper-evident evidence that the multi-tenant boundary holds across every surface of the AI stack.
License: the marker substrate, attack catalog, adapters, evidence chain, and the sectum-ai verify command are Apache 2.0. The hosted Sectum Cloud is commercial.
Distribution: a sectum-ai CLI you run inside your stack (BYOC), a marker substrate SDK, and Sectum Cloud (hosted attestation, scheduled runs, managed audit-pack delivery). The evidence layer in the OSS produces the same artifacts the hosted product does — by design.
Sectum AI’s method is a marker substrate: it provisions synthetic tenants on the customer’s AI stack (Acme, Globex, Initech, Hooli by default), seeds them with realistic corpora and three classes of cryptographic canary markers, records a hashed ground-truth manifest, and runs a catalog of cross-tenant probes from each tenant’s session. Detection is layered: exact canary scan first (zero false positives), then semantic similarity, then a calibrated LLM judge — and every confirmed finding is traceable to a specific marker the manifest records.
The categorical difference
DeepTeam and Sectum AI operate at different units of analysis:
DeepTeam CrossContextRetrieval | Sectum AI | |
|---|---|---|
| Unit | A prompt — does this prompt trip a judge? | A tenant boundary across the AI stack — can tenant A’s data reach tenant B? |
| Method | Single prompt + LLM-as-judge → binary outcome | Synthetic-tenant marker substrate with a hashed ground-truth manifest |
| Determinism | LLM-as-judge (subject to model drift) | Manifest-grounded: a confirmed finding is provably traceable to a planted marker |
| False-positive control | Judge calibration (probabilistic) | Confirmed findings have zero false positives by construction; unconfirmed candidates are recorded as UNVERIFIED and excluded from the headline count |
| Surfaces | The RAG retrieval surface visible from one prompt | 13 surfaces: API, vector DB, RAG pipeline, prompt/completion logs, semantic cache, KV cache, agent memory, MCP tool calls, fine-tunes / adapters, eval sets, backups, search indexes, tracing pipelines |
| Output | A score / binary pass-fail | A tamper-evident evidence pack: RFC 3161 TSA token, Sigstore Rekor inclusion proof, in-toto attestation envelope, control-mapped audit PDF, machine-readable evidence.json |
| Independent verification | DeepTeam re-runs and the customer trusts the score | sectum-ai verify <pack> — anyone, on any machine, can verify the pack without trusting Sectum AI |
| Flagship engagement | — | GDPR Article 17 erasure attestation, aimed at DPOs |
The unit-of-analysis difference shows up everywhere. A DeepTeam CrossContextRetrieval finding means one prompt’s judge said the model returned foreign context. A Sectum AI confirmed finding means one specific cryptographic marker, owned by a specific tenant and recorded in a hashed manifest, was observed in a specific other tenant’s session on a specific surface, at a specific time, in a run whose digest is timestamped by an RFC 3161 authority and logged in Sigstore Rekor. Those are different evidence shapes.
Method, in detail
DeepTeam’s CrossContextRetrieval
CrossContextRetrieval(types=["tenant", "user"]) is one of DeepTeam’s vulnerability detectors. The flow:
- Generate an adversarial prompt designed to elicit cross-tenant or cross-user context.
- Run the prompt against the customer’s LLM endpoint.
- Pass the response to an LLM-as-judge.
- Judge returns a binary verdict: did the response reveal foreign-context information?
This is fast (one prompt + one judge call), flexible (works against any LLM endpoint), and judgement-based. It does not depend on the customer’s data; the adversarial prompt is constructed from a generic template.
The trade-offs are inherent to the design: the judge can hallucinate, the prompt may not reach the surfaces where leakage actually happens (a stale vector index, a semantic cache, a tracing pipeline), and the binary verdict is not an attestation an auditor accepts as proof. It is a red-team finding, not a control attestation.
Sectum AI’s marker substrate
Sectum AI’s approach is a provisioned simulator. The workflow:
-
sectum-ai seedprovisions synthetic tenants (default scenario: Acme, Globex, Initech, Hooli) and generates realistic corpora — HR records, sales pipelines, contracts, meeting notes — with deliberately overlapping organic entities (a shared vendor, a shared person name, shared compliance terms) so that the conditions of the Retrieval Pivot research (95.4% benign-query leakage on a shared vector index) are reproduced. -
Three classes of canary markers are planted in document bodies, metadata, titles, and tags:
HARD_CANARY— a high-entropy unique token (SECTUM-CANARY-{base32(16 bytes)}). Detected by exact substring/regex match — zero false positives.ENTITY_CANARY— a unique synthetic entity (fabricated person, codename, account number) owned by exactly one tenant. Detected by semantic similarity + exact match.SECRET_CANARY— a fake-but-plausible secret (API-key-shaped, fake-SSN-shaped). Detected by exact + format detector.
-
A hashed ground-truth manifest records every marker — owner tenant, marker type, plaintext, planted locations — and is itself hashed into the run’s evidence chain. The test conditions are provable after the fact.
-
The probe catalog runs cross-tenant probes from each tenant’s session. The eleven implemented classes include the flagship
rag-entity-bleed(the Retrieval Pivot — benign queries leaking across a shared index),tenant-boundary-fetch(direct ID-based BOLA),semantic-cache-contamination,kv-cache-timing(a statistical side-channel test),embedding-inversion,agent-tool-hijack(MCP confused-deputy + token passthrough),memory-contamination,lora-cross-tenant,ikea-extraction,rag-poisoning, andgdpr-erasure-verification(the Erasure Attestation engagement). -
The detection pipeline is layered, cheapest-and-most-certain first: exact canary scan, then semantic similarity, then a calibrated judge. A confirmed finding is traceable to a manifest marker; an unverified candidate (a judge-only flag with no manifest tie) is recorded but excluded from the headline count. The zero-false-positive property holds for confirmed findings by construction.
-
The evidence chain canonicalizes the run, hashes it (SHA-256), submits the digest to an RFC 3161 Time-Stamp Authority, records it in a Sigstore Rekor transparency log entry, and bundles the run, the manifest hash, the TSA token, the Rekor inclusion proof, the in-toto attestation envelope, and a human-readable audit PDF into the evidence pack. The pack is independently verifiable:
sectum-ai verify <pack>recomputes everything and reportsVERIFIEDor[FAIL]with reasons.
The cost is the up-front substrate provisioning (sectum-ai seed) and the broader integration surface (an adapter per backend family). The payoff is auditor-grade evidence with a chain of custody.
Surface coverage
| Surface | DeepTeam CrossContextRetrieval | Sectum AI |
|---|---|---|
| API / RAG response | ✓ (the single prompt’s response) | ✓ (multiple probes across response patterns) |
| Vector DB | — (visible only via RAG response) | ✓ (direct adapter coverage: Pinecone, pgvector, Weaviate, Chroma) |
| Semantic cache | — | ✓ (Class 4) |
| KV cache (timing side channel) | — | ✓ (Class 5, statistical Cohen’s d effect-size test) |
| Embedding inversion | — | ✓ (Class 6) |
| Agent tool calls / MCP | — (general red-team probes elsewhere in DeepTeam) | ✓ (Class 7: MCP confused-deputy + token passthrough — the Asana-class flaw) |
| Persistent agent memory | — | ✓ (Class 8) |
| LoRA / fine-tune adapters | — | ✓ (Class 9) |
| Multi-turn benign extraction | DeepTeam has multi-turn attacks for other vulnerabilities | ✓ (Class 10 — Silent Leaks / IKEA-style) |
| RAG poisoning | — | ✓ (Class 3) |
| GDPR Article 17 erasure verification | — | ✓ (Class 11 — verifies erasure across 7 configured surfaces) |
| Tracing pipelines (Langfuse / LangSmith / Phoenix) | — | ✓ (live observability adapters) |
DeepTeam’s CrossContextRetrieval is one prompt against one surface. Sectum AI’s catalog is 11 probe classes against 13 surfaces with live adapters for vector stores, caches, observability backends, agents, and MCP servers.
Evidence model
| DeepTeam | Sectum AI | |
|---|---|---|
| Output artifact | A risk-assessment PDF (via the Confident AI platform) | An evidence pack: audit-pack.pdf + evidence.json + attestation.intoto.json |
| Cryptographic chain | — | RFC 3161 TSA timestamp + Sigstore Rekor inclusion proof + manifest hash + in-toto envelope |
| Independent verification | Re-run DeepTeam (requires DeepTeam) | sectum-ai verify <pack> — works without Sectum AI, validates the chain end-to-end |
| Tamper-evident | A trusted PDF from a vendor | Mutating any field in the JSON makes verify exit 4 with a [FAIL] line explaining which check failed |
| Per-finding control mapping | Framework references in the report | Per-finding owasp_llm + atlas[] + nist[] IDs in evidence.json and rendered in the audit-pack PDF |
| Sample artifact | Available in DeepTeam’s docs | docs/samples/ on GitHub |
This evidence-model gap is why audit firms find Sectum AI useful. A SOC 2 auditor or a GDPR DPO asking “can you prove tenant A’s data didn’t reach tenant B?” is asking for an attestation, not a red-team report. DeepTeam answers “did our judge see leakage in this prompt?” — useful, but not the same shape as an attestation.
When to use DeepTeam
- You want a fast pre-prod red-team check that an LLM endpoint doesn’t return obviously-foreign context to a templated adversarial prompt.
- You’re already running DeepTeam for the broader vulnerability suite (jailbreaks, prompt injection, PII probes) and want
CrossContextRetrievalas one more check in that suite. - You need a PDF for an internal security review rather than an auditor or DPO engagement.
- Your AI surface is a single LLM endpoint — no vector DB, no caches, no agents, no MCP integrations.
When to use Sectum AI
- You operate a multi-tenant AI system and need to prove the tenant boundary holds across surfaces — vector DB, caches, agents, MCP servers, fine-tunes, eval sets, tracing pipelines.
- You have a GDPR Article 17 erasure obligation and need to attest that a churned tenant’s data has actually left every AI surface (for a DPO or regulator).
- You’re preparing for SOC 2 or ISO 27001 in a multi-tenant SaaS and the auditor will ask about logical-access / boundary-protection controls (SOC 2 CC6.x, ISO 27001 A.5.15 / A.8.3 / A.8.12) for the AI portion of the stack.
- You need independently-verifiable evidence — a pack a third party can verify without your vendor in the room.
- You want an open-source evidence layer — the same artifacts the hosted product produces.
Can you use both?
Yes — and often you should. The two are non-substitutable:
- DeepTeam in a CI pipeline gives you fast feedback on per-prompt LLM vulnerabilities (jailbreaks, prompt injection, the
CrossContextRetrievalsmoke test) at every commit. - Sectum AI on a release cadence (or on-demand for a customer asking for an isolation attestation) gives you the deep, multi-surface, tamper-evident evidence the auditor or DPO needs.
DeepTeam in dev, Sectum AI at every audit cycle and every Article 17 ticket.
Pricing
- DeepTeam (OSS) — free, Apache 2.0. Available on GitHub.
- Confident AI platform — commercial; pricing not publicly listed.
- Open Sectum (OSS) — free, Apache 2.0. Available on GitHub.
- Sectum Cloud — see pricing.
References
- DeepTeam — homepage, docs,
CrossContextRetrieval, GitHub, NIST AI RMF framework support, Apache 2.0 license. - Sectum AI — GitHub, docs, attack catalog, evidence chain, sample evidence packs.
- Underlying research: OWASP LLM08:2025 — Vector and Embedding Weaknesses; Retrieval Pivot Attacks in Hybrid RAG (2026, 95.4% benign-query cross-tenant leakage on shared vector indexes); Silent Leaks (arXiv 2505.15420, 91% extraction via benign queries).