Sean Whalen c25bf28c1c Classify reverse DNS map: final cleanup batch (~2,650 unmapped MMDB ASN domains) (#762)
Final cleanup pass to clear the remaining MMDB AS-domain queue. Applied an
expanded multilingual classifier covering all 44 README industry types
plus an Energy concept (mapped to Utilities pending a README addition).
Per-detector keyword lists now include Spanish, Portuguese, French,
Italian, German, Dutch, Russian, Polish, Czech, Turkish, Greek, Chinese
(simplified and traditional), Japanese, Korean, Arabic, Hebrew, Hindi,
Vietnamese, Indonesian, and Thai where the concept has a recognizable
local-language equivalent.

- 980 added to base_reverse_dns_map.csv (ISP 193, Education 193, Finance
  155, Government 109, Healthcare 93, Web Host 37, MSP 31, Manufacturing
  22, Logistics 17, Real Estate 12, Travel 11, Consulting 10, Tech 9,
  Nonprofit 9, Legal 9, Food 9, Retail 8, Religion 8, Utilities 7, plus
  smaller volumes across 14 more types).
- 1,669 added to known_unknown_base_reverse_dns.txt — the residual
  unfetchable / parked / Cloudflare-challenged / non-recognized-content
  rows.

ASN-domain coverage of the bundled IPinfo Lite MMDB after this batch:
  - by domain count:  29,083 / 63,993  (45.45%)
  - by IPv4 weight:   98.36%

Total since batch 5: ~16,400 map rows + ~17,400 known-unknown rows added
across 9 batches. Remaining unmapped pool size: 0 — every MMDB AS-domain
has now been processed (either classified or recorded in known-unknown).

Co-authored-by: Sean Whalen <seanthegeek@users.noreply.github.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-07 16:48:56 -04:00
2026-05-03 12:36:06 -04:00
2026-04-19 21:20:41 -04:00
2025-12-12 15:56:52 -05:00
2026-03-09 18:16:47 -04:00
2026-03-23 17:08:26 -04:00
2018-02-05 20:23:07 -05:00
2022-10-04 18:45:57 -04:00
2026-03-09 18:24:16 -04:00

parsedmarc

Build
Status Code
Coverage PyPI
Package PyPI - Downloads

A screenshot of DMARC summary charts in Kibana

parsedmarc is a Python module and CLI utility for parsing DMARC reports. When used with Elasticsearch and Kibana (or Splunk), it works as a self-hosted open-source alternative to commercial DMARC report processing services such as Agari Brand Protection, Dmarcian, OnDMARC, ProofPoint Email Fraud Defense, and Valimail.

Note

Domain-based Message Authentication, Reporting, and Conformance (DMARC) is an email authentication protocol.

Sponsors

This is a project is maintained by one developer. Please consider sponsoring my work if you or your organization benefit from it.

Features

  • Parses draft and 1.0 standard aggregate/rua DMARC reports
  • Parses forensic/failure/ruf DMARC reports
  • Parses reports from SMTP TLS Reporting
  • Can parse reports from an inbox over IMAP, Microsoft Graph, or Gmail API
  • Transparently handles gzip or zip compressed reports
  • Consistent data structures
  • Simple JSON and/or CSV output
  • Optionally email the results
  • Optionally send the results to Elasticsearch, Opensearch, and/or Splunk, for use with premade dashboards
  • Optionally send reports to Apache Kafka

Python Compatibility

This project supports the following Python versions, which are either actively maintained or are the default versions for RHEL or Debian.

Version Supported Reason
< 3.6 End of Life (EOL)
3.6 Used in RHEL 8, but not supported by project dependencies
3.7 End of Life (EOL)
3.8 End of Life (EOL)
3.9 Used in Debian 11 and RHEL 9, but not supported by project dependencies
3.10 Actively maintained
3.11 Actively maintained; supported until June 2028 (Debian 12)
3.12 Actively maintained; supported until May 2035 (RHEL 10)
3.13 Actively maintained; supported until June 2030 (Debian 13)
3.14 Supported (requires imapclient>=3.1.0)
S
Description
No description provided
Readme Apache-2.0 160 MiB
Languages
Python 98.2%
Shell 1.7%