Toward a Unified Application Data Model for Agentic AppSec

As security researchers and engineers, we’ve long understood that the behavior of modern applications cannot be fully understood through static code alone. Yet nearly every security system we’ve built over the last two decades has been predicated on that static assumption.

We scan source code, run containers through signature checks, and layer runtime agents to observe production behavior, but none of these systems talk to each other in a way that gives us a coherent understanding of the application itself. What’s missing is a unified application data model: a foundational representation that spans the full lifecycle of the app, across development, testing, and production, and across the different layers in which that app is expressed, code, container, and operating system.

Such a model is not only foundational for reasoning about real-world security posture, but essential for building the next generation of autonomous security agents: systems that don’t just scan for bugs, but reason about behavior, prioritize based on impact, and take intelligent action. These agents need more than logs. They need understanding. And understanding requires context.

written by

Pavel Furman

published on

April 21, 2025

topic

Application Security

Contents

Example H2

Example H3

Share this blog

The Case for a Unified Context Model

Imagine trying to secure a system based only on its source code, with no knowledge of how it behaves under load, what data flows through it, or what third-party dependencies it dynamically pulls in. Or conversely, trying to triage a production vulnerability with no access to the underlying function or logic responsible.

Both of these are the norm today.

What we lack is a data model that binds:

Lifecycle stage — insights from development, testing, staging, and production
System layers — observability and traceability across code, container, and OS layers
Execution semantics — real-world runtime context that distinguishes theoretical vulnerabilities from exploit paths

Without a model that ties these dimensions together, any security insight is necessarily partial. But if we build one, we unlock the ability to ask and answer a fundamentally richer set of questions:

Is this vulnerability reachable from an external input path in production?
Does this exploit path require environment-specific conditions to succeed?
What memory behavior accompanies this API usage pattern under real-world load?

These are the questions that matter for both security triage and autonomous decision-making.

Design Goals of the Model

To be useful, a unified application data model must satisfy three key properties:

Precision
The model must capture fine-grained detail at each layer—code-level ASTs, container metadata, syscall behavior, heap and memory access patterns—without losing fidelity across stages.
Correlation
It must correlate behaviors across layers and lifecycle stages. For example, it should relate a log line in staging to a function in a specific microservice that allocates memory in a container with known capabilities.
Temporal Binding
It should support time-based joins, making it possible to track how behaviors emerge or change over time, and how those shifts map to code changes or deployment events.

The Architecture

Here's how we at Kodem are thinking about architecting such a model to power intelligent security agents:

1. Sensor Fabric

We begin by deploying sensors at multiple points:

Static analyzers during CI for ASTs, dependency graphs, and taint flows
eBPF-based runtime sensors in Kubernetes for syscall and memory introspection
Sidecar proxies and tracing hooks for API paths and data flows
Container introspectors for file system and process behavior

Each sensor emits structured telemetry that is tagged with:

Code identity (repo, commit hash, function name)
Deployment identity (container ID, pod UID, node info)
Event context (timestamp, origin, trace ID)

2. Telemetry Ingestion + Normalization

Sensor outputs are ingested into a stream processing system that applies:

Type normalization (AST nodes, memory ops, syscall categories)
Entity linking (functions ↔ containers ↔ runtime events)
Privacy filtering (PII tagging and scrubbing)

3. Application Graph Construction

The normalized events are used to build a graph model of the application:

Nodes represent functions, files, containers, processes, endpoints
Edges represent observed execution paths, data flows, control flow transitions
Weights encode runtime frequency, timing, or risk

This graph is versioned and updated incrementally across lifecycle stages.

4. Agent Interface

Security agents (autonomous or human-in-the-loop) query this graph to:

Trace attack paths
Simulate reachability
Confirm exploitability
Recommend remediations
Run counterfactuals (e.g., “What if this component were removed?”)

The model allows reasoning about “what’s happening” in the app—at every layer, in every stage—at once.

Why This Matters for Agents

The most powerful AI agents in security won’t be the ones trained on generic code snippets or curated CVE datasets. They’ll be the ones trained on representations of how real applications behave under real workloads, in real environments.

These agents will learn that a buffer overflow in a library that’s never loaded into memory isn’t a priority. That an unpatched CVE that’s dynamically blocked by configuration is low risk. And that a low-severity bug, when chained with a permission escalation vector and an exposed endpoint, is actually a critical exploit path.

That kind of judgment only comes from context. And context only comes from a model that sees the whole picture.

‍

Blog written by

Pavel Furman

CTO & Co-Founder

More blogs

View all

Circumventing Security in Claude Code: Misconfiguration and Denial-of-Service

Kodem recently identified two security issues in Claude Code: a misconfiguration allowing circumvention of user approval and a subsequent Denial-of-Service (DoS) condition.

July 30, 2025

Kodem just built the world’s only Dev to Prod Agentic Taskforce in Cyber

At RSAC 2025, we launched Kai, the first AI-native application security engineer. Today, we’re expanding it into a fully agentic task force that truly performs AppSec tasks from start to finish.

July 28, 2025

FedRAMP RFC-0012

The Federal Risk and Authorization Management Program (FedRAMP) recently released RFC-0012, marking a notable shift towards more stringent standards for continuous vulnerability management. Cloud providers and security teams must adjust quickly to stay compliant and secure (FedRAMP, 2024).

July 25, 2025

A Primer on Runtime Intelligence

See how Kodem's cutting-edge sensor technology revolutionizes application monitoring at the kernel level.

5.1k

Applications covered

1.1m

False positives eliminated

4.8k

Triage hours reduced

Platform Overview Video

Watch our short platform overview video to see how Kodem discovers real security risks in your code at runtime.

5.1k

Applications covered

1.1m

False positives eliminated

4.8k

Triage hours reduced

The State of the Application Security Workflow

This report aims to equip readers with actionable insights that can help future-proof their security programs. Kodem, the publisher of this report, purpose built a platform that bridges these gaps by unifying shift-left strategies with runtime monitoring and protection.

Get the report

Get real-time insights across the full stack…code, containers, OS, and memory

Watch how Kodem’s runtime security platform detects and blocks attacks before they cause damage. No guesswork. Just precise, automated protection.

Watch a short demo

Stay up-to-date on Audit Nexus

A curated resource for the many updates to cybersecurity and AI risk regulations, frameworks, and standards.

Visit Audit Nexus