Internet-Draft | DKIM Access Control and Differential Cha | February 2025 |
Nurpmeso | Expires 7 August 2025 | [Page] |
This document specifies a bundle of DKIM (RFC 6376) extensions and adjustments. They do not hinder the currently distributed processing environment that includes DKIM, ARC, DMARC and SPF, and are as such backward compatible. Their aim is however to ultimately slim down the email environment that needs to be administrated and maintained, by establishing mutual agreements in between sender and receiver(s), verifiable through public-key cryptography, and let the SMTP protocol handle decisions solely based upon that.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 7 August 2025.¶
Copyright (c) 2025 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.¶
Public-key cryptography is used for secure transactions on many levels, and in many protocols. For example, transport layer security TLS[RFC9325] provides encrypted data exchange. It is omnipresent, desired where optional, even enforced by standard means: newer IETF transports, like QUIC[RFC9369], may even exist only in conjunction with it. The usual public-key cryptography mode of operation is, that if no trust can be established, the operation is cancelled. It simply does not happen.¶
DKIM[RFC6376], on the other hand, defines as one of its core details that "signature verification failure does not force rejection". Yet there is such a pressing need of email operators to be able to enforce policy, that a plethora of extensive accompanying standards surrounding SMTP[RFC5321] and DKIM were developed, among which are ARC, DMARC and SPF. Reality is that the complexity of email setup, of administrative effort, has massively increased in the last decade plus, so much that many small commercial and private operators have ceased to exist, or have turned away from providing their own service. Reality is also that large parts of those which still exist do not follow-suit "so-called" IETF progress out of belief of improving the situation, but instead they wait until interoperability problems arise, especially with the giant email players, before minimally invasive solutions are searched for. These are usually found by searching the internet, often by doing copy and paste of shared configuration snippets.¶
Some of the mentioned standards even introduce massive complications of decade old habits and usage patterns. For example, many universities and other "groupings" offer stable member email addresses, and then forward email to current, "real addresses". This is made impossible by SPF[RFC7208] if taken by the word (RECOMMENDET), which it often, but dependent upon a software implementation or configuration, is. Non-standardized solutions, like "Sender Rewriting Scheme" for the given example, are then developed, and implemented, by the sheer necessity to keep a grown infrastructure in a usable state. Often these solutions are imperfect. In any case they try to circumvent a defect of an IETF standard, in an onion-alike environment of standards that has no other desire, if one lets aside all those masses of "reporting" capabilities that IETF standards developed, than to provide reliable and trustworthy verification of the sender / receiver relationship and the communicated data.¶
What this specification tries to achieve is to provide a path to lesser complexity, to easier maintenance and administration efforts, on the one hand. And on the other hand it tries to solve the issues which still exist, regardless of the sheer number of IETF standards invented to improve the situation.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶
The DKIM[RFC6376] extension Access Control and Differential Changes:¶
RCPT TO
addressees,
as well as to ensure that the
SMTP[RFC5321]
MAIL FROM
mailbox is identifieable;
the latter avoids that receivers can be fooled by man-in-the-middle
to send backscatter bounces to random addresses.¶
The DKIM[RFC6376] extension Access Control and Differential Changes is announced by adding an acdc= tag to the DKIM-Signature. (For efficiency reasons it SHOULD be placed early, before tags like h=, bh= and b=, for example.) The tag starts with "sequence", a decimal number starting at 1, or incremented by 1 from the highest DKIMACDC sequence number encountered in the message; the maximum value is 999: if incrementing would result in overflow, the message MUST to be rejected; sequence holes MUST also cause rejection (but see below); in both cases SMTP[RFC5321] reply code 550 is to be used; with enhanced SMTP status codes[RFC3463] 5.5.4 MUST be used.¶
Informative remark: 999 is both a constraint and a very high limit, dependent upon which type of processing is actually involved. In todays' DKIM use several signatures per actual hop are not uncommon, also in the sense that per-hop processing pipelines involve several processing steps that each create DKIM signatures. Since DKIMACDC is meant as a transparent upgrade path it seems unwise to introduce a limit too low thus. On the other hand a high limit creates a D(enial) O(f) S(ervice) attack surface. DKIMACDC allows for rather cheap and easy detection (and testing) of the highest numbered signature, which can be sufficient for intermediate hops given the DKIM paradigm that "a single successful verification is sufficient for validation". (For example, no From header parsing might be necessary.) With DKIMACDC certain detectable conditions allow for quick rejection in a broken chain of trust. DKIMACDC allows for pretty certain collection of statistics of organizational trust ([RFC5863], section 2.5), in turn improving the mentioned "detectable conditions".¶
Flag description is normative.
(Note the missing FWS
separators around =
.)
ABNF[RFC5234]:¶
acdc = %x61 %x63 %x64 %x63 = sequence ":" 1*(flag) ":" [id] ":" sequence = 1*3DIGIT; DIGIT from RFC 5234 flag = "A" / "a" / "D" / "O" / "P" / "R" / "V" / "v" / "X" / "x" / "Z" / "z" id = *20(ALPHA / DIGIT / "+" / "-"); optional bounce ID¶
MAIL FROM
is adjusted to refer to the domain that claims
ownership, etc.
Any formerly present DKIM-Access-Control header was removed.
Access control headers are only generated for messages with
the O flag set.¶
RCPT
receiver,
and if no more than two RCPT
receivers exist in
total,
then the P flag has to be set.
Once set, all future DKIMACDC signatures must copy it.¶
Unknown flags MUST be ignored. Invalid flag combinations and flag misuse MUST result in rejection with SMTP reply code 550; if enhanced status codes[RFC3463] are used, 5.5.4 MUST be used. (This includes the P flag upon incorrect use.)¶
The DKIM-Store header has no meaning in the email system. The sole purpose of mentioning it is to announce that it MUST be removed when messages enter and leave the email system. It could for example be temporarily created and used by non-integrated mail filter (milter) software to pass informational data in between the "ingress" and the "egress" processing side. To aid in software bugs and possible configuration errors this specification makes it a MUST to remove all occurrences. It is suggested to encrypt data passed around in this temporary header with a key internal to the "local" email processing system in order to achieve locality.¶
DKIM replay attacks have been reported,
where messages with valid DKIM signatures were repeatedly sent
to receivers not initially addressed by the sender.
That is: because the sent
IMF[RFC5322]
message does not include Bcc headers,
and, to be exact, because the actual
SMTP[RFC5321]
RCPT
receivers are not included at all,
DKIM does not cover the real set of message receivers:
effectively any malicious party can use the validatable message
with any possible
SMTP[RFC5321]
RCPT
.
Whereas DKIM x= signature validity expiration tags can be used,
and their use is hereby encouraged as a SHOULD,
the stamina and forgiveness of SMTP,
owed to the necessity to deliver messages to receivers in
various conditions,
requires an expiration timestamp that leaves plenty of time for
malicious players to misuse messages with valid signatures.¶
In addition the actual
SMTP[RFC5321]
MAIL FROM
sender is not covered by DKIM:
any intermediate hop can (use the validatable message and) cause
bounces to any possible MAIL FROM
(backscatter bounce).¶
Access control addresses replay and backscatter bounces.
When signing as an originator (O flag set),
all distinct domain-names found within the list of intended
SMTP RCPT
addressees are collected.
Thereafter the DKIMACDC state of all found domains is queried,
by looking up their _dkimacdc DNS entry, as below.
For any domain that announces DKIMACDC support
the completely prepared message,
including the readily prepared DKIM-Signature(s), is forged,
the A flag is set,
(a) dedicated DKIM-Access-Control header(s) is/are created and
prepended,
and the resulting domain-specific message is sent to the logical
receiver subset.¶
Informative remark: Dedicated DKIM-Signatures are necessary: if the message is also sent to a domain which does not support DKIMACDC, but which forwards the message to a domain which does, that destination would otherwise falsely assume the presence of access control; To simplify per-receiver-domain message creation the DKIM-Signature header(s) can be readily prepared except for toggling the single flag byte a to A, and, of course, creation of the cryptographic signature itself.¶
A DKIMACDC-enabled and -announcing receiver domain that receives a DKIMACDC message MUST reject messages which do not contain (a) DKIM-Access-Control header(s) dedicated to itself with SMTP reply code 550; if enhanced status codes[RFC3463] are used, 5.5.4 MUST be used. It MUST also reject messages which fail the signature verification of such a header with SMTP reply code 550; the enhanced status code MUST be 5.7.7. Senders MAY use Delivery Status Notifications[RFC3461] to fine-tune the resulting behaviour.¶
The presence of this header empowers the receiving domain to
cryptographically verify that it is indeed the correct
destination domain,
and that any given
SMTP[RFC5321]
RCPT TO
was indeed addressed by the message sender,
which indeed is the one mentioned in
MAIL FROM
;
if the header included and the SMTP list do not match,
the message MUST be rejected with SMTP reply
code 550; if
enhanced status codes[RFC3463]
are used, 5.5.4 MUST be used;
5.7.7 instead if signature verification failed.¶
This header is to be sent only as part of exclusive and dedicated message instances, as documented above, it MUST be removed by the destination domain as soon as possible; it MUST NOT be delivered by local delivery agents as part of the message, and it MUST NOT be part of a rejected message. Any instance of such a header that is not targeted to the destination domain indicates an error and MUST result in message rejection with SMTP reply code 550; if enhanced status codes[RFC3463] are used, 5.5.4 MUST be used.¶
The syntax of this header is a semicolon separated list.
It starts with the sequence number of the DKIM-Signature
to which it links,
which necessarily MUST have the O flag set,
followed by the selector value of the s= tag of the according
DKIM-Signature;
the actual algorithm can be deduced from there.
(As there may be multiple signatures,
multiple DKIM-Access-Control headers may be generated,
unless the _dkimacdc DNS entry hints a supported algorithm.)
Thereafter follows the
SMTP[RFC5321]
MAIL FROM
of the covered message,
the receiver domain name which is addressed,
followed by all SMTP
RCPT TO
local-parts of the the receiver domain.
The list is concluded with the cryptographic signature which
has been generated on the DKIM "relaxed" normalized content
of the DKIM-Access-Control header up to, and including, the
semicolon that precedes the signature.
Warning:
SMTP[RFC5321]
address local-parts permit quoted-strings.¶
The format of this DNS resource record mirrors the syntax of
DKIM[RFC6376]
section 3.5 on the DKIM-Signature header field,
with the exception that FWS
separation is not allowed;
supported are the tags v= and a=,
however, v= is optional,
and none to multiple a= tags MAY exist.
The a= tag indicates that DKIM-Access-Control headers
SHOULD only be generated with that algorithm,
and that an according DKIM-Signature is the most desirable for
this domain.
It is only a hint to reduce processing cost of senders,
it has no meaning beside this.
Senders MUST be capable to follow DNS CNAME chains
when looking up this DNS RR.¶
DKIM signatures never were designed to work with the existing mailing-list infrastructure, which often tags message subjects and/or appends footers (headers are supposed to be more of a theoretical issue). With the advent of some supplementary standard which worked around the DKIM "signature verification failure does not force rejection" paradigm, the resulting DKIM signature verification failures started to cause non-deliveries. Mailing-list software adopted in that they started to rewrite the From header in order to avoid breakage of the sender's signature. Further standards were developed that tried to bring back trust that was lost by those modifications initiated to avoid that the forced signature breakage caused message delivery breakage.¶
This specification adds the creation of differential changes, which can be applied in reverse order of creation, and therefore be used to cryptographically verify all intermediate changes back to the original version as sent by the sender. Whenever a DKIMACDC enabled domain breaks a message signature, for example if a mailing-list tags the subject and adds a message footer, an according DKIM-Diff header has to be created, and the D indicator flag has to be added to the acdc= tag. All existing DKIM-Diff headers MUST be included in DKIMACDC enabled DKIM-Signatures.¶
Informative remark: It follows that the "changes cause a new message" paradigm of today's DKIM/DMARC usage stays intact. It is deemed correct behaviour: Note that a message sent to a mailing list is addressed to a mailing list. It is not addressed to the 'final' recipients. That additional addressing is done by the mailing list, not the original author. This is a rather stark demonstration that the intermediary has taken delivery and then re-posted the message. However, DKIMACDC allows for cryptographically verifying the original message, and therefore can overcome the trust problem incurred by those "correct" changes, which of course break the DKIM signature of the original message.¶
Informative remark: Today many mailing-list instances re-encode message data for policy reasons, needlessly: for example from some 7-bit clean content-transfer-encoding to 8-bit, or anything into base64 (as below). This policy usually causes enlargening of the differential changes on at least the first level (which for one is most often the only one involved, and second it depends on the content of the original message). This negative impact can thus easily vanish, upon policy change.¶
The DKIM-Diff header field consists of a sequence number that links it with a DKIMACDC enabled DKIM-Signature header, followed by a semicolon, and the result of the BSDiff differential algorithm, as below. The input to this algorithm is the DKIM[RFC6376] "relaxed" normalized header and body content, separated by an empty (normalized) line, alongside the equally normalized version present before modifications took place. For non-integrated systems like mail filters the DKIM-Store header can for example be used to pass around the necessary data in between the ingress side that sees the original message, and the egress side which will dispatch the modified variant.¶
All headers covered by the DKIM-Signature MUST be included, as MUST be all MIME[RFC2045] related headers, regardless of their normal inclusion in the DKIM-Signature. (MIME related headers SHOULD be regulary included in DKIM signatures to avoid the otherwise existing attack surface against the MIME structure through maliciously injected headers and body content, but as a domain policy this is not in scope of this document.) All DKIMACDC-enabled DKIM-Signature headers MUST be included, as MUST be all DKIM-Diff ones. The headers MUST be sorted byte-wise alphabetically by name, and the formed subgroups MUST be sorted byte-wise alphabetically by content. Other than that the advice of DKIM[RFC6376], section 5.4.1., on recommended signature content, still applies, but is hereby extended with the Author Header Field[RFC9057].¶
Informative remark: Since DKIMACDC is meant to (effectively) incur the most minimal changes on the software side it does not change the way how existing DKIM software verifies or creates signatures in general. To integrate this extension into the existing infrastructure it seems best to accept a small overhead in the highly compressible BSDiff control data, instead of introducing expensive prefiltering processing costs, for example, by grouping "old" and "new" headers. Here also to note that in mail filters the name and the content of headers fly by as distinct data arrays, for example, so that the necessary control structures for the sorting algorithm as above can be implemented more efficiently than it sounds at first, and alongside the normal processing.¶
Differences are generated with the BSDiff algorithm of Colin Percival, which has excellent characteristics. No reimplementation of the algorithm was necessary due to the Open Source licenses used in all its different parts, instead it was taken from the FreeBSD operating system source code, and slightly rearranged: it was decoupled from the fixed I/O and compression machinery, the memory allocator is hookable, and the integer type width is (again) a build time option; in addition the sliding window is run-time configurable. There is a freely usable (BSD 2-clause/ISC and MIT licenses) plug-and-play ISO C99 and perl implementation available (https://github.com/sdaoden/s-bsdipa), which includes further references on the algorithm. DKIMACDC uses the 32-bit variant sufficient for email, which almost halves memory requirements compared to 64-bit, and also produces smaller difference control data. The resulting binary difference is then ZLIB[RFC1950] compressed and encoded with BASE64[RFC4648] for inclusion in the DKIM-Diff header.¶
Differences are included to allow DKIM verifiers to restore previous message content for cryptographical verification purposes. Whereas user interfaces may (and should) use them to offer differential visualization (after signature verification, and with the usual precautions necessary for displaying content), empowering users to make decisions on the trustworthiness of those intermediate stations which actually incurred message modifications, the restored message data is not meant to result in a usable message by itself. For example some embedded OpenPGP signature and text couple would likely fail to verify because of DKIM normalization (dependent upon the original MIME transfer encoding). This was deemed acceptable because of the purpose of including differential changes, and because a visualization of the DKIM covered message should still be sufficient to allow users making responsible decisions. Finally, the given example will likely verify as part of the complete received message, unless altered along the SMTP path: DKIMACDC can ideally say where.¶
User interfaces could for example use traffic light semantics that unfold on click to traffic light semantics of all stations that a message passed, which would visualize differences on a further click. They could build complex reputation statistics based upon DKIMACDC verification and perceived user hints. This could be used to restrict DKIMACDC verification, to reduce complete-chain-verification to random samples. Further possibilities could arise shall SMTP/DKIM/DKIMACDC remain as the only solution to email verification in the future.¶
This memo includes no request to IANA.¶
Public-key cryptography is the safest approach to identification of counterparts and verification of data. This specification aims in making use of these attributes for the combined pair of SMTP and DKIM. It opens a door to reduction of email server maintenance and administration efforts, and to restoration of some email core aspects which got lost, or became a nuisance to use, over the last decade(s), like email forwarding and mailing-list usage. It may reduce implementation burden and complexity of the entire email infrastructure. It allows for building of organizational trust ([RFC5863], section 2.5) that aids in decision making, to increase processing performance and decrease energy consumption. If superfluous protocols vanish this effect potentiates.¶
FWS
in ag-spec.
Second its use was never encountered by the author.
But first of all
MIME[RFC2045]
introduced parameters in ABNF as
parameter := attribute "=" value
without FWS
,
and its presence complicates parsers and hinders parser code reuse.
The acdc= tag ("parameter") is defined without FWS
support.¶
This document contains a citation of Dave Crocker. Thanks to, in the order of appearance, Jesse Thompson, Richard Clayton, and Douglas Foster. A big fat acknowledgment is due to Murray S. Kucherawy. Special thanks to Klaus Schulze, Manuel Goettsching, both also as Ash Ra Tempel, Laeuten der Seele, Laurent Garnier, as well as the Sleeping Environmental Bot broadcast.¶