Sean Whalen 7ef31f8083 Classify reverse DNS map: next 1000 unmapped MMDB ASN domains (#749)
Continued the MMDB ASN-domain coverage walk into the 14k-10k IPv4-weight
band. Added 883 new map entries and 117 new known-unknown entries from
the top 1,000 unmapped candidates.

ASN-domain coverage by IPv4 weight: 96.5% -> 96.8%.
ASN-domain coverage by domain count: 11.0% -> 12.4%.

Composition: ~50 globally-known brands (Vanguard, AIG, Aon, Equifax,
Mercedes-Benz USA, BP, BHP, Bechtel, Tetra Pak, Anheuser-Busch, Air
Canada, Maersk, NFL, NHL, MGM Resorts, Wolfram, Red Hat, Palo Alto
Networks, New Relic, Travelport, Epicor, IQVIA, Dassault Systemes,
Disney+, Valve, Seagate, Analog Devices, Renesas, Dow Jones, Lee
Enterprises, IGN, Mondadori, AtkinsRealis, Eiffage, Ogilvy, Interpublic,
Equifax, Ooredoo Maldives, MTN Zambia, Movistar Costa Rica, Telekom
Romania Mobile, Sparkle, Vodafone Ireland, etc.); ~30 universities
and government / state agencies (City of San Jose, City of Phoenix,
Bulgarian gov, Region Uppsala, Weld County, Long Beach Unified, Escambia
School District, Region 4 ESC, Merced COE, Santa Cruz COE, Politechnika
Warszawska, Bogazici, KAIST-affiliated Korean universities, Ural Federal
University, etc.); the long tail of regional ISPs / hosters / MSPs /
data-center operators classified via MMDB as_name + homepage / WHOIS
corroboration.

117 added to known-unknown where the two-corroborating-sources bar
wasn't met (Cloudflare-blocked sites with privacy-redacted WHOIS,
generic-token AS-names with empty homepages, parked domains, etc.).
Files remain disjoint per the workflow guardrail.

sortlists.py validates clean (types, sort, dedupe). CRLF preserved.

Co-authored-by: Sean Whalen <seanthegeek@users.noreply.github.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-05 21:08:53 -04:00
2026-05-03 12:36:06 -04:00
2026-04-19 21:20:41 -04:00
2025-12-12 15:56:52 -05:00
2026-03-09 18:16:47 -04:00
2026-03-23 17:08:26 -04:00
2018-02-05 20:23:07 -05:00
2022-10-04 18:45:57 -04:00
2026-03-09 18:24:16 -04:00

parsedmarc

Build
Status Code
Coverage PyPI
Package PyPI - Downloads

A screenshot of DMARC summary charts in Kibana

parsedmarc is a Python module and CLI utility for parsing DMARC reports. When used with Elasticsearch and Kibana (or Splunk), it works as a self-hosted open-source alternative to commercial DMARC report processing services such as Agari Brand Protection, Dmarcian, OnDMARC, ProofPoint Email Fraud Defense, and Valimail.

Note

Domain-based Message Authentication, Reporting, and Conformance (DMARC) is an email authentication protocol.

Sponsors

This is a project is maintained by one developer. Please consider sponsoring my work if you or your organization benefit from it.

Features

  • Parses draft and 1.0 standard aggregate/rua DMARC reports
  • Parses forensic/failure/ruf DMARC reports
  • Parses reports from SMTP TLS Reporting
  • Can parse reports from an inbox over IMAP, Microsoft Graph, or Gmail API
  • Transparently handles gzip or zip compressed reports
  • Consistent data structures
  • Simple JSON and/or CSV output
  • Optionally email the results
  • Optionally send the results to Elasticsearch, Opensearch, and/or Splunk, for use with premade dashboards
  • Optionally send reports to Apache Kafka

Python Compatibility

This project supports the following Python versions, which are either actively maintained or are the default versions for RHEL or Debian.

Version Supported Reason
< 3.6 End of Life (EOL)
3.6 Used in RHEL 8, but not supported by project dependencies
3.7 End of Life (EOL)
3.8 End of Life (EOL)
3.9 Used in Debian 11 and RHEL 9, but not supported by project dependencies
3.10 Actively maintained
3.11 Actively maintained; supported until June 2028 (Debian 12)
3.12 Actively maintained; supported until May 2035 (RHEL 10)
3.13 Actively maintained; supported until June 2030 (Debian 13)
3.14 Supported (requires imapclient>=3.1.0)
S
Description
No description provided
Readme Apache-2.0 160 MiB
Languages
Python 98.2%
Shell 1.7%