| Internet-Draft | AACP | June 2026 |
| Mackay | Expires 8 December 2026 | [Page] |
This document defines the Agent Action Compression Protocol (AACP), a typed coordination format for agent-to-agent communication in multi-agent large language model (LLM) systems. AACP transforms natural language coordination instructions into deterministic, machine-parseable packets that can be validated before transmission, logged as structured audit records, and replayed consistently across workflow runs.¶
AACP addresses a coordination content layer that existing agent protocols do not cover. The Model Context Protocol (MCP) and Agent-to-Agent Protocol (A2A) operate at the tool access and routing layers respectively. Neither specifies what agents say to each other inside coordination messages. AACP fills this gap with a shared, typed vocabulary for agent coordination intent.¶
For known workflow types, a rule-based encoder produces AACP packets deterministically at zero LLM cost. A four-tier fallback extends this to novel instructions: community registry lookup at zero cost; local cache lookup at zero cost; pattern matching at zero cost; LLM encoding for genuinely novel instructions, logged to registry for permanent reuse. An amortisation benchmark across 240 encoding operations demonstrated 91.6 percent cost saving versus per-call LLM encoding, with 6 LLM calls required across the full run.¶
As a secondary benefit, AACP reduces coordination token usage by approximately 23 percent versus equivalent natural language instructions. Framework integration benchmarks demonstrate 18 percent total workflow cost reduction in LangChain (59 coordination hops) and 30 percent in CrewAI (59 coordination hops), with all coordination LLM calls eliminated in both cases.¶
This document updates draft-mackay-aacp-01 with: an expanded encoder library (8 workflow encoders across 6 domains); a community rule library (241 validated rules at registry.aacp.dev); framework integration results for LangChain and CrewAI; amortisation benchmark data; and a five-workflow multi-model department day validation.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 8 December 2026.¶
Copyright (c) 2026 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document.¶
Multi-agent LLM systems coordinate by exchanging instructions between autonomous agents. In current implementations, these coordination messages are typically written in natural language -- verbose, ambiguous, and non-deterministic. The same intent expressed differently across runs makes coordination messages difficult to validate, audit, or replay reliably.¶
AACP replaces natural language coordination with a compact, typed, pipe-delimited packet format that is deterministic, schema-validated, and machine-parseable. The primary value is not token reduction -- it is auditability, determinism, and the elimination of LLM calls from the coordination layer for known workflow types.¶
Consider a payroll workflow where an orchestrator instructs an HR agent to retrieve salary records. A typical English instruction:¶
"Please retrieve the employee salary records for the period ending March 2026. I need all active employees, their departments, cost centres, base salary, any changes made this month, and pension contribution rates. Return as JSON array."¶
This instruction varies in wording across runs, provides no machine-readable structure, and cannot be reliably audited without natural language processing. The equivalent AACP packet:¶
FETCH|HR|return:HR-Agent|p:1|aacp:1.1|res:emp_salary| period:2026-03|filter:status=active|fmt:json¶
is produced identically on every run by the rule-based encoder, validates against the v1.1 schema before transmission, and constitutes a complete machine-readable audit record. The rule-based encoder produces this packet at zero LLM cost.¶
This document specifies AACP version 1.4 and updates draft-mackay-aacp-01 with expanded encoder coverage, community rule library, framework integration results, amortisation benchmark, and department day validation.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶
Natural language coordination in multi-agent systems creates three distinct operational problems: verbosity, non-determinism, and unstructured audit trails.¶
Verbosity has been identified independently by multiple research groups. Mou et al. (2025) [ECOLANG] achieved greater than 20 percent token reduction through evolved compression language for social simulation. AACP addresses the same problem with a typed, portable packet schema targeting business workflow coordination.¶
Non-determinism is the more operationally significant problem. When coordination messages are expressed in free-form natural language, the same intent produces different messages across runs. This makes it impossible to validate messages before transmission, correlate messages across runs for compliance purposes, or replay workflows reliably.¶
The specific technical gap AACP fills: neither MCP [MCP] nor A2A [A2A] address the semantic content of coordination messages. Both protocols operate at the transport and routing layers. AACP operates at the content layer, defining what agents say to each other rather than how messages are delivered.¶
Framework integration benchmarks confirm the problem is real and widespread. In standard LangChain and CrewAI implementations, every agent-to-agent coordination hop involves an LLM call to generate the instruction. In a five-workflow department day scenario with 59 coordination hops, this results in 59 LLM calls solely for coordination -- before any task work begins. AACP eliminates all 59 coordination calls for known workflows.¶
An AACP packet is a sequence of pipe-delimited fields. The pipe character (U+007C, "|") is the field separator.¶
packet = task "|" dom "|" named-fields¶
TASK and DOM are positional (fields 0 and 1 respectively). All other fields MUST be named key:value pairs. Empty positional slots MUST NOT appear in v1.1 packets.¶
After the two positional fields, all fields are named using the format key:value. The following named fields MUST appear in every packet:¶
The following named fields are RECOMMENDED where applicable:¶
Additional named fields MAY be appended after core fields. Defined extended keys include: src, src_prev, rules, validate, tmpl, data_ptr, amt, ccy, sup, match, terms, type, party, clause, issue, risk, block, flags, req, highlight, status, to, subj, att, flag_msg, tone, sentiment, actor, chain, prog, ltv, loyalty, urgency.¶
Unknown extended keys MUST generate advisory warnings in validators, not errors. This permits forward compatibility and organisation-private extensions.¶
Fetch active employee salary records:¶
FETCH|HR|return:HR-Agent|p:1|aacp:1.1|res:emp_salary| period:2026-03|filter:status=active|fmt:json¶
Merge datasets and calculate payroll:¶
MERGE|HR|return:HR-Agent|p:1|aacp:1.1|rules:payroll_v2| validate:budget_cc¶
Flag a legal clause for review:¶
FLAG|LEGAL|return:LEG-Agent|p:1|aacp:1.1|type:NDA| party:Acme-Ltd|clause:s7|issue:ip_rights_restriction| risk:high|block:signature¶
Score a sales lead using BANT:¶
CALC|SALES|return:SALES-Agent|p:1|aacp:1.1|res:lead_score| filter:id=L-001|rules:bant¶
Resolve a customer complaint with goodwill consideration:¶
RESOLVE|CS|return:CS-Agent|p:1|aacp:1.1|res:complaint| filter:id=T-001|sentiment:negative|tone:empathetic| ltv:8500|ccy:GBP|req:resolve,goodwill_consider¶
Acknowledge receipt and confirm SLA clock started:¶
ACK|IT|return:IT-Agent|p:1|aacp:1.1|res:incident_ticket| status:acknowledged¶
The following TASK values are defined in AACP v1.1: FETCH, PROC, FLAG, RESOLVE, LOG, SEND, BUILD, MERGE, CALC, REPORT, ACK, SYNC.¶
All 12 TASK values are represented in the community rule library [AACP-RULES] with validated usage examples across 7 business domains.¶
Implementations MUST warn on unrecognised TASK values. Implementations MUST NOT reject packets with unrecognised TASK values, to permit forward compatibility.¶
The following DOM values are defined in AACP v1.1: HR, FIN, SALES, LEGAL, IT, CS, MKT.¶
Implementors MAY define additional domain values. Unrecognised DOM values SHOULD generate advisory warnings.¶
The p: field accepts: 1 (critical), 2 (medium, default), 3 (low).¶
For known, repetitive workflow types, a rule-based encoder deterministically produces AACP packets from structured input without an LLM API call. This approach has zero API cost and produces identical output for identical input on every run.¶
The reference implementation [AACP-SDK] provides eight workflow encoders across six domains, each grounded in a documented real-world platform:¶
A community rule library [AACP-RULES] of 241 pre-validated AACP coordination rules is available covering all 12 TASK types across all 7 DOM values. The library is MIT-licensed and hosted at registry.aacp.dev as a queryable REST API.¶
Example query:¶
GET https://registry.aacp.dev/rules?domain=HR&task=FETCH
{
"total": 10,
"rules": [
{
"id": "hr-fetch-active-employee-salaries",
"packet": "FETCH|HR|return:HR-Agent|p:1|aacp:1.1|
res:emp_salary|filter:status=active|fmt:json",
"validated": true
},
...
]
}
¶
The RuleRegistry class in the SDK provides programmatic access to the community library:¶
from aacp import RuleRegistry
registry = RuleRegistry.from_community()
rule = registry.find("fetch active employee salaries", domain="HR")
print(rule.packet)
# FETCH|HR|return:HR-Agent|p:1|aacp:1.1|res:emp_salary|
# filter:status=active|fmt:json
print(rule.cost) # 0.0
¶
For novel instructions outside known workflow patterns, an LLM-assisted encoder produces AACP packets by submitting the English instruction to a language model with the AACP specification as a system prompt.¶
The FallbackEncoder routes encoding requests through four tiers in priority order:¶
Tier 0 -- Community registry: The instruction is matched against the community rule library. A match returns the pre-validated packet at zero cost.¶
Tier 1 -- Hash cache lookup: The SHA-256 hash of the normalised instruction is looked up in the local registry. An exact match returns the cached packet at zero cost.¶
Tier 2 -- Pattern match: The instruction is matched against keyword patterns from the registry and built-in pattern set. A match above threshold returns the cached or rule-based packet at zero cost.¶
Tier 3 -- LLM fallback: Novel instructions with no match are encoded via LLM and logged to the local registry, enabling Tier 1 routing on all subsequent occurrences.¶
This architecture ensures each novel instruction pattern incurs at most one LLM encoding call. An amortisation benchmark across 240 encoding operations (6 novel instructions, each repeated 30 times, plus rule-based comparisons) demonstrated 91.6 percent cost saving versus per-call LLM encoding, with 6 LLM calls required across the full benchmark run.¶
AACP validators MUST check:¶
AACP validators SHOULD warn on:¶
Validation errors MUST be reported before transmission. Warnings SHOULD be logged but MUST NOT prevent transmission.¶
Coordination token counts were measured using live API usage_metadata. Each message was submitted as a bare user message with no system prompt and max_tokens=1 to isolate coordination token counts. Benchmark date: May 2026.¶
Four-hop payroll workflow. Tokens from live API.¶
+---------------------+-------+------+--------+--------+ | Hop |English| AACP |Claude% |GPT-4o% | +---------------------+-------+------+--------+--------+ | fetch employees | 56 | 52 | -7.1% | -12.7% | | fetch budgets | 57 | 47 | -17.5% | -16.0% | | merge and calculate | 65 | 43 | -33.8% | -31.6% | | generate report | 62 | 43 | -30.6% | -33.3% | +---------------------+-------+------+--------+--------+ | TOTAL | 240 | 185 | -22.9% | -23.7% | +---------------------+-------+------+--------+--------+¶
A three-scenario amortisation benchmark validated the "one LLM call per novel pattern" property across 240 encoding operations. June 2026.¶
+---------------------------+--------+----------+----------+ | Scenario | Calls | LLM Calls| Saving | +---------------------------+--------+----------+----------+ | A: 1 instruction x 30 | 30 | 1| 96.7% | | B: 5 instructions x 30 | 150 | 4| 91.0% | | C: Rule-based vs novel | 60 | 1| 94.4% | +---------------------------+--------+----------+----------+ | TOTAL | 240 | 6| 91.6% | +---------------------------+--------+----------+----------+¶
In Scenario B, the first instruction had been encoded in Scenario A and was served from cache at zero cost across all 30 repetitions. Four new instructions each required one LLM call and zero cost on all subsequent calls.¶
A five-workflow department day lab [AACP-LAB] validated AACP packet interpretation across four LLM models. The lab runs payroll, month-end close, sales qualification, customer service resolution, and JML onboarding workflows using real CSV data. June 2026.¶
+-------------------+---------+-----------+---------+---------+ | Model | Cost | Tokens in | Latency | Success | +-------------------+---------+-----------+---------+---------+ | gpt-4.1-mini | $0.0190 | 40,585 | 160s | 5/5 | | gpt-4.1 | $0.0984 | 41,005 | 124s | 5/5 | | claude-sonnet-4-5 | $0.2237 | 53,127 | 423s | 5/5 | | gpt-4o | $0.2408 | 40,459 | 101s | 5/5 | +-------------------+---------+-----------+---------+---------+¶
All four models correctly interpreted AACP v1.1 packets across all five workflows and 76 coordination hops per model run without model-specific prompt tuning. Every packet validated against the v1.1 schema. The deterministic audit agent wrote structured records at zero LLM cost on every run.¶
AACP was integrated into two leading agent frameworks and benchmarked against standard natural language coordination on an equivalent five-workflow, 59-hop department day scenario. June 2026.¶
+------------+------+----------------+----------+--------------+ | Framework | Hops | Coord. Calls | Saving | Determinism | +------------+------+----------------+----------+--------------+ | LangChain | 59 | 59 -> 0 | 18% | YES | | CrewAI | 59 | 59 -> 0 | 30% | YES | +------------+------+----------------+----------+--------------+¶
In both cases, all 59 coordination LLM calls were eliminated. The higher saving in CrewAI reflects its more verbose default natural language task descriptions. In both frameworks, coordination messages became deterministic, schema-validated, and machine-readable audit records. Reference implementations are available as [AACP-LANGCHAIN] and [AACP-CREWAI].¶
MCP [MCP] defines how agents access external tools and resources. AACP operates inside MCP message payloads, compressing the coordination instructions that describe what to retrieve or action. The two protocols are complementary.¶
A2A [A2A] defines agent discovery and task routing between agents. AACP compresses the content of messages that A2A routes. The two protocols are complementary.¶
AACP fills a distinct layer between routing and task execution: deterministic, typed encoding of coordination message content. No existing published protocol addresses this specific layer.¶
A reference implementation is available as a Python package [AACP-SDK]:¶
pip install aacp # Python SDK v1.4.0 npm install aacp-ts # TypeScript SDK v1.4.0 pip install aacp-langchain # LangChain integration v0.1.0 pip install aacp-crewai # CrewAI integration v0.1.0¶
Source repositories, the working lab, community rule library, VS Code extension, and registry API are listed in the Informative References section.¶
Unknown named fields generate advisory warnings, not errors. Implementors MAY define organisation-private fields using a namespacing convention (e.g. org_fieldname:) to avoid collision with future AACP fields.¶
Implementors MAY define domain values beyond the seven defined in Section 5.2. The community registry at registry.aacp.dev provides a shared rule library for common workflow patterns. A commercial registry tier supporting private organisation rules is planned.¶
The aacp: field MUST be included in every packet and MUST specify the protocol version. Breaking changes increment the major version. Additive changes increment the minor version. Implementations SHOULD warn on version mismatch but MUST NOT reject packets on version mismatch alone.¶
Packet injection: AACP packets MUST be treated as untrusted input. Validators MUST be applied before processing. Implementations MUST NOT execute actions based on unvalidated packets.¶
Prompt injection: malicious content in AACP field values could be interpreted as instructions by a receiving LLM agent. Implementations SHOULD sanitise field values before including them in LLM prompts.¶
Replay attacks: AACP v1.1 does not include sequence numbers or timestamps. Security-sensitive environments SHOULD add replay protection at the transport layer.¶
Sensitive data: PII, financial records, and credentials MUST NOT be embedded in AACP field values. Use authenticated data access layers referenced by pointer instead.¶
Registry security: the local registry SHOULD be access-controlled and SHOULD NOT be committed to version control when it contains sensitive workflow instructions.¶
This document has no IANA actions.¶